This repository provides the code for the paper Deep Generative Models of Evolution: SNP-level Population Adaptation by Genomic Linkage Incorporation, submitted to 24th International Workshop on Data Mining in Bioinformatics (BioKDD25). The code is designed for creating artificial haplotypes, running simulations, preprocessing data, and training a VAE model for evolutionary data analysis.
- Python, Java, R
- Slurm for batch processing
- Mimicree2 (mim2-v206.jar)
- PoolSeq R library
- Python dependencies can be installed via:
pip install -r requirements.txtpython create_mimicree_files.pysbatch ./slurm_scripts/1_batch_slurm_simulate.shsbatch ./slurm_scripts/2_slurm_single_write_pathssbatch ./slurm_scripts/3_batch_slurm_preprocess.shsbatch ./slurm_scripts/3a_slurm_single_estimateNesbatch ./slurm_scripts/3b_batch_slurm_estimate_s.shsbatch ./slurm_scripts/4_batch_slurm_model_training.shsbatch ./slurm_scripts/5_batch_slurm_model_evaluation.sh- Short Data Analysis Plot for Appendix:
python plot_scripts/main_data_analysis_exp_II.py- Evaluation Plots:
python plot_scripts/plot_tmp_2.py
python plot_scripts/plot_linkage_correlations.pyFor any questions, please contact siekiera@uni-mainz.de.