Reproducible Mortality Prediction

An open-source, reproducible implementation of deep learning for in-hospital mortality prediction: a simplified replication of Rajkomar et al. (2018) using an LSTM with Focal Loss and post-training calibration.

📄 Papers

This Work:

Original Paper: Rajkomar et al. (2018), "Scalable and accurate deep learning with electronic health records," npj Digital Medicine 1(1):18. doi:10.1038/s41746-018-0029-1

🎯 Objective

Implement and validate a Deep Learning model for ICU mortality prediction, incorporating:

  • Architecture: LSTM with strong regularization
  • Loss Function: Focal Loss (handles class imbalance)
  • Calibration: Post-training Isotonic Regression
  • Threshold Learning: Optimized by F1-Score
  • Dataset: Synthetic MIMIC-III (24,327 episodes)
  • Baseline: Logistic Regression

πŸ† Main Results

| Metric   | Deep Learning | Baseline (LR) | Improvement |
|----------|---------------|---------------|-------------|
| AUROC    | 0.8638        | 0.7042        | +22.7%      |
| AUPRC    | 0.6396        | 0.4564        | +40.1%      |
| Recall   | 96.81%        | 33.78%        | +186.6%     |
| F1-Score | 0.5114        | 0.4025        | +27.1%      |

✅ High Sensitivity: detects 96.8% of deaths (critical in medicine)
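As a point of reference, the metrics above can be computed from calibrated probabilities and the learned threshold with scikit-learn. The sketch below uses made-up toy arrays, not the project's outputs:

```python
import numpy as np
from sklearn.metrics import (average_precision_score, f1_score,
                             recall_score, roc_auc_score)

# Toy ground-truth labels and calibrated probabilities (illustrative only)
y_true = np.array([0, 0, 1, 0, 1, 1, 0, 0, 1, 0])
y_prob = np.array([0.05, 0.12, 0.80, 0.30, 0.55, 0.20,
                   0.10, 0.08, 0.90, 0.40])

auroc = roc_auc_score(y_true, y_prob)            # ranking quality
auprc = average_precision_score(y_true, y_prob)  # robust under class imbalance

threshold = 0.170                                # learned operating point
y_pred = (y_prob >= threshold).astype(int)
recall = recall_score(y_true, y_pred)            # sensitivity
f1 = f1_score(y_true, y_pred)
```

Note that AUROC and AUPRC are threshold-free ranking metrics, while Recall and F1 depend on the 0.170 operating point.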

πŸ“ Project Structure

Systematic_Review/
├── README.md                      # This file
├── requirements.txt               # Python dependencies
│
├── data/
│   └── in-hospital-mortality/     # Synthetic MIMIC-III dataset (24.3K episodes)
│       ├── train/                 # 16,972 episodes (69.8%)
│       ├── val/                   # 3,740 episodes (15.4%)
│       └── test/                  # 3,615 episodes (14.9%)
│
├── src/                           # Main source code
│   ├── train_dl.py                # DL training (LSTM + Focal Loss + Calibration)
│   ├── train_baseline.py          # Baseline training (Logistic Regression)
│   ├── generate_plots.py          # Generation of 14 visualizations
│   ├── generate_report.py         # Technical report generation
│   ├── calibration_utils.py       # Focal Loss + calibration + threshold learning
│   ├── data_loader.py             # Data loading
│   └── validate_kfold.py          # K-fold validation
│
├── scripts/                             # Generation and execution scripts
│   ├── generate_synthetic_mimic3.py     # Synthetic data generator
│   ├── process_synthetic_data.py        # Data processor
│   ├── regenerate_data.sh               # Complete regeneration
│   ├── run_plots_and_report.sh          # Generate plots + report
│   └── run_validation_and_report.sh     # K-fold validation
│
├── config/
│   └── constants.py               # Centralized configuration parameters
│
├── models/                        # Trained models
│   ├── best_model_calibrated.keras     # Deep Learning model
│   ├── calibrator.pkl                  # Isotonic Regression calibrator
│   ├── baseline_model.pkl              # Baseline Logistic Regression
│   └── optimal_threshold.txt           # Optimal threshold (0.170)
│
├── results/                       # Results and visualizations
│   ├── plots/                     # 14 high-quality plots
│   ├── baseline/                  # Baseline metrics
│   └── TECHNICAL_REPORT.md        # Automatic technical report
│
└── docs/                          # Technical documentation
    ├── SYNTHETIC_DATA_GENERATOR.md     # Generator documentation
    ├── CONFIGURATION_PARAMETERS.md     # Parameters reference
    └── DEEP_LEARNING_MODEL.md          # DL model documentation

🚀 Initial Setup

1. Create virtual environment

python3 -m venv venv
source venv/bin/activate  # macOS/Linux (on Windows: venv\Scripts\activate)

2. Install dependencies

pip install -r requirements.txt

3. Generate synthetic data

# Generate complete dataset (24,327 episodes)
python scripts/generate_synthetic_mimic3.py

# Process data
python scripts/process_synthetic_data.py

4. Verify data

ls -lh data/in-hospital-mortality/train/
head data/in-hospital-mortality/train/listfile.csv

📊 Dataset

Synthetic MIMIC-III (Current Version)

  • Total: 24,327 ICU episodes
  • Features: 15 clinical variables (vital signs + labs)
  • Time Window: 48 hours of observation
  • Mortality Rate: 20.8% (imbalanced, realistic)
  • Splits:
    • Train: 16,972 episodes (69.8%)
    • Validation: 3,740 episodes (15.4%)
    • Test: 3,615 episodes (14.9%)

Generator Features:

✅ 13 implemented features, including:

  • Circadian patterns (24h)
  • Sleep and meal simulation
  • Temporal variability (noise, jitter, dropout, artifacts)
  • Realistic missingness (MCAR, MAR, MNAR)
  • Multivariate correlations
  • Documentation quality by shift
  • Individual variability (anti-overfitting)
  • Multivariate mortality model

📚 Complete documentation: docs/SYNTHETIC_DATA_GENERATOR.md
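To make two of the listed ideas concrete, here is a hypothetical NumPy sketch of a circadian (24 h) vital-sign pattern with MCAR missingness. The actual generator in scripts/generate_synthetic_mimic3.py implements all 13 features; the specific constants below (80 bpm baseline, 20% dropout) are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)
hours = np.arange(48)                              # 48 h observation window

# Heart rate: per-patient baseline + 24 h circadian cycle + measurement noise
baseline = rng.normal(80, 10)                      # individual variability
circadian = 5.0 * np.sin(2 * np.pi * hours / 24)   # 24 h sinusoidal pattern
heart_rate = baseline + circadian + rng.normal(0, 2, size=48)

# MCAR missingness: each sample dropped independently with probability 0.2
mcar_mask = rng.random(48) < 0.2
heart_rate[mcar_mask] = np.nan
```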

🧠 Model Architecture

Input (48 timesteps, 15 features)
    ↓
Masking Layer (ignores padding)
    ↓
LSTM (64 units, dropout=0.5, recurrent_dropout=0.3)
    ↓
Dense (32 units, ReLU, L2=0.01)
    ↓
Dropout (0.5)
    ↓
Dense (16 units, ReLU, L2=0.01)
    ↓
Dropout (0.5)
    ↓
Output (1 unit, Sigmoid)
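The diagram above maps to Keras roughly as follows. Unit counts, dropout rates, and L2 values come from this README; details such as the mask value are assumptions, not necessarily the repository's exact code (see src/train_dl.py):

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

def build_model(timesteps=48, n_features=15):
    """LSTM classifier matching the layer diagram in this README."""
    return tf.keras.Sequential([
        layers.Input(shape=(timesteps, n_features)),
        layers.Masking(mask_value=0.0),                       # assumed mask value
        layers.LSTM(64, dropout=0.5, recurrent_dropout=0.3),
        layers.Dense(32, activation="relu",
                     kernel_regularizer=regularizers.l2(0.01)),
        layers.Dropout(0.5),
        layers.Dense(16, activation="relu",
                     kernel_regularizer=regularizers.l2(0.01)),
        layers.Dropout(0.5),
        layers.Dense(1, activation="sigmoid"),                # mortality probability
    ])
```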

Techniques Employed:

  • ✅ Focal Loss (gamma=2.0, alpha=0.25) - Handles class imbalance
  • ✅ Calibration - Post-training Isotonic Regression
  • ✅ Threshold Learning - Optimized by F1-Score (0.170)
  • ✅ Strong Regularization - Dropout 50%, Recurrent Dropout 30%, L2 0.01

📚 Complete documentation: docs/DEEP_LEARNING_MODEL.md
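For orientation, here are minimal sketches of the three techniques. The project's real implementations live in src/calibration_utils.py and may differ in detail:

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression
from sklearn.metrics import f1_score

def focal_loss(y_true, p, gamma=2.0, alpha=0.25, eps=1e-7):
    """Binary focal loss (Lin et al., 2017): down-weights easy examples."""
    p = np.clip(p, eps, 1 - eps)
    pos = -alpha * (1 - p) ** gamma * y_true * np.log(p)
    neg = -(1 - alpha) * p ** gamma * (1 - y_true) * np.log(1 - p)
    return float(np.mean(pos + neg))

def fit_calibrator(val_probs, val_labels):
    """Isotonic regression: monotone map from raw scores to probabilities."""
    return IsotonicRegression(out_of_bounds="clip").fit(val_probs, val_labels)

def best_threshold(probs, labels, grid=np.linspace(0.01, 0.99, 99)):
    """Pick the decision threshold that maximizes F1 on held-out data."""
    return max(grid, key=lambda t: f1_score(labels, (probs >= t).astype(int)))
```

In practice the calibrator is fit on validation-set scores, and the threshold search runs on calibrated probabilities.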

📈 Training

Option 1: Complete Pipeline (RECOMMENDED)

# Regenerate data + train baseline + train DL + generate visualizations
./scripts/regenerate_data.sh

Option 2: Individual Training

# 1. Train Baseline (Logistic Regression)
python src/train_baseline.py

# 2. Train Deep Learning (LSTM + Focal Loss + Calibration)
python src/train_dl.py --epochs 50 --batch-size 64

# 3. Generate visualizations and report
./scripts/run_plots_and_report.sh

Option 3: K-Fold Validation

# 5-fold cross-validation
./scripts/run_validation_and_report.sh
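The shell script wraps src/validate_kfold.py. A generic sketch of stratified 5-fold validation (shown here with a logistic-regression stand-in on synthetic arrays, not the project's data or model) looks like:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 15))                            # stand-in for 15 features
y = (X[:, 0] + rng.normal(0, 1, 500) > 1.2).astype(int)   # imbalanced labels (~20%)

aurocs = []
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)  # keeps class ratio per fold
for train_idx, test_idx in skf.split(X, y):
    clf = LogisticRegression(max_iter=1000).fit(X[train_idx], y[train_idx])
    aurocs.append(roc_auc_score(y[test_idx], clf.predict_proba(X[test_idx])[:, 1]))

mean_auroc = float(np.mean(aurocs))
```

Stratification matters here because, at a ~21% mortality rate, unstratified folds can end up with noticeably different class ratios.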

📊 Visualizations

The project automatically generates 14 high-quality visualizations:

  1. ROC Curve
  2. Precision-Recall Curve
  3. Calibration Curve
  4. Confusion Matrix
  5. Probability Distribution
  6. Threshold vs Metrics
  7. Summary Metrics
  8. Learning Curves (Loss, AUROC, AUPRC, Overfitting)
  9. ROC Comparison (DL vs Baseline)
  10. PR Comparison (DL vs Baseline)
  11. Metrics Comparison (bars)
  12. Improvement Chart (%)
  13. Comparison Table (visual table)
  14. metrics_summary.json

Location: results/plots/

🔄 Reproducibility

This project follows open-source and reproducibility best practices:

  • ✅ Complete source code - All scripts included
  • ✅ Synthetic data generator - No proprietary data needed
  • ✅ Pre-trained models - Available in releases (optional)
  • ✅ Reference results - Metrics and visualizations included
  • ✅ Detailed documentation - Step-by-step reproduction guide

📚 Full reproduction guide: REPRODUCIBILITY.md

📚 Documentation

Main Documents:

  • README.md - This file (overview and quick start)
  • Technical Paper (PDF) - Complete methodology and results
  • REPRODUCIBILITY.md - Complete reproduction guide
  • results/TECHNICAL_REPORT.md - Automatic technical report

Technical Documentation (docs/):

  • SYNTHETIC_DATA_GENERATOR.md - Generator's 13 features and implementation
  • DEEP_LEARNING_MODEL.md - Architecture, techniques, and usage
  • CONFIGURATION_PARAMETERS.md - Complete parameters reference
  • VISUALIZATION_SYSTEM.md - Visualization and plotting system

🤝 Contributing

Contributions are welcome! Please read our Contributing Guidelines and Code of Conduct before submitting pull requests.

Ways to Contribute:

  • πŸ› Report bugs or issues
  • πŸ’‘ Suggest new features or improvements
  • πŸ“ Improve documentation
  • πŸ§ͺ Add tests
  • πŸ”§ Submit bug fixes or enhancements

πŸ“ Citation

If you use this code in your research, please cite:

This repository:

@software{lehdermann2025mortality,
  title={Reproducible Mortality Prediction: LSTM with Focal Loss and Calibration},
  author={Lehdermann Silveira, André},
  year={2025},
  url={https://github.com/lehdermann/reproducible-mortality-prediction},
  note={Open-source replication study}
}

Original paper:

@article{rajkomar2018scalable,
  title={Scalable and accurate deep learning with electronic health records},
  author={Rajkomar, Alvin and Oren, Eyal and Chen, Kai and others},
  journal={npj Digital Medicine},
  volume={1},
  number={1},
  pages={18},
  year={2018},
  publisher={Nature Publishing Group},
  doi={10.1038/s41746-018-0029-1}
}

See CITATION.cff for machine-readable citation metadata.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Important: If using real MIMIC-III data, you must comply with PhysioNet's Data Use Agreement.

📜 Changelog

See CHANGELOG.md for a detailed history of changes and version information.

👤 Author

André Lehdermann Silveira
Master's Student in Applied Computing
Universidade do Vale do Rio dos Sinos (Unisinos)
📧 Contact: GitHub

⚠️ Disclaimer

This is an academic replication project for educational purposes. Synthetic data should not be used for real clinical decisions. For clinical use, only use properly approved real MIMIC-III data.