Skip to content

mariamashraf731/Multi-Omics-Cancer-Survival-Analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧬 Multi-Omics Cancer Survival Analysis

Python Bioinformatics Analysis Data

📌 Project Overview

This repository hosts a comprehensive computational framework for analyzing multi-omics data from Kidney Renal Clear Cell Carcinoma (KIRC) and Lung Squamous Cell Carcinoma (LUSC).

The project integrates Clinical data, Gene Expression (GE), Copy Number Variation (CNV), and Protein levels (RPPA) to build predictive survival models and identify key genomic biomarkers using statistical and machine learning techniques.

🔬 Key Components

1. Survival Analysis (KIRC)

  • Objective: Predict patient survival outcomes based on integrated multi-omics features.
  • Methods:
    • Preprocessing: Imputation and normalization of clinical and genomic data.
    • Feature Selection: Using LASSO (L1 Regularization) to identify the most relevant prognostic markers.
    • Modeling: Building Cox Proportional Hazards models and Random Survival Forests.
  • Metric: Evaluated using the Concordance Index (C-Index).

2. Differential Expression Analysis (Volcano Plots)

  • Objective: Identify significantly up-regulated and down-regulated genes between tumor and normal tissues.
  • Visualization: Generated Volcano Plots to visualize statistical significance ($-\log_{10} P$) vs. magnitude of change ($\log_2 Fold Change$).
  • Top Hits: Extracted top 5 genes based on Fold Change and P-value for biological interpretation.

3. Regression & Correlation

  • Objective: Explore linear relationships between specific gene pairs and clinical variables.
  • Analysis: Simple and Multiple Linear Regression to understand gene-gene interactions.

🚀 How to Run

  1. Clone the repository:
    git clone [https://github.com/mariamashraf731/Multi-Omics-Cancer-Survival.git](https://github.com/mariamashraf731/Multi-Omics-Cancer-Survival.git)
  2. Install Requirements:
    pip install -r requirements.txt
  3. Run Survival Pipeline:
    python src/survival/train_model.py
  4. Generate Plots:
    python src/plotting/volcano_plot.py

📄 Documentation

For detailed methodology and biological interpretation, refer to the Final Project Report.

About

A multi-omics analysis framework for cancer survival prediction (KIRC/LUSC). Utilizes LASSO feature selection, Cox Regression, and Volcano plots to identify genomic biomarkers and predict patient outcomes from TCGA data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors