Computational oral biofilm pipeline developed during internship at NIFE (Niedersächsisches Institut für angewandte Zellgewebezüchtung), Hannover — part of the SIIRI/TRR-298 consortium.
このリポジトリは、患者メタゲノム(ショットガンNGS)の菌叢データから、口腔バイオフィルムの代謝モデル(dFBA/COMETS)までを一気通貫でつなぐ計算パイプラインです。
NGS(shotgun)→ MetaPhlAn 4(菌叢プロファイル)→ init_comp.json(属レベル割合)
→ GEM(AGORAの代謝モデル)→ dFBA → COMETS(時間発展シミュレーション)
- まず試す: COMETS_beginner.ipynb(最低限の動作・可視化)
- パイプライン実行: run_comets_pipeline.py(Step A/B/C)
- 菌叢側(MetaPhlAn): metaphlan_pipeline.sh → metaphlan_feature_table_to_init_comp.py
- 主な出力:
comets/pipeline_results/(図、COMETSの入出力、比較結果)
| 用語 | 意味(このrepoでの使い方) | 例(ファイル/コマンド) |
|---|---|---|
| NGS(shotgun) | メタゲノムのショットガンシーケンス。生データから菌叢組成を推定する入口 | data/ |
| MetaPhlAn 4 | リードから分類学的プロファイル(菌種/属の相対存在量)を推定 | qsub data/metaphlan_pipeline.sh |
| taxonomic profile | サンプルごとの菌叢組成(相対存在量の表) | MetaPhlAn出力 |
| init_comp.json | COMETS側の初期組成。対象7属の割合に正規化したJSON | data/metaphlan_feature_table_to_init_comp.py |
| GEM | Genome-scale metabolic model(ゲノム規模代謝モデル) | comets/agora_gems/ |
| AGORA | ヒト腸内細菌などのGEMコレクション。ここでは口腔細菌GEMを利用 | comets/agora_gems/*.xml |
| dFBA | 動的フラックスバランス解析。代謝(FBA)と環境(基質)の時間変化を結合 | comets/oral_biofilm.py |
| COMETS | 複数菌種の代謝・増殖を(空間あり/なしで)シミュレートする枠組み | comets/run_comets_pipeline.py |
| 0D / 2D | 0Dは空間なし(混合)。2Dは格子上で空間あり(拡散など) | Step A(0D), Step B(2D) |
| cross-feeding | ある菌が作った代謝産物を別の菌が利用する現象 | 乳酸(lactate)など |
| Sobol感度解析 | パラメータの不確実性が結果へ与える寄与(全効果STなど)を推定 | qsub comets/run_sobol.sh |
| qsub / PBS | クラスタ投入(ジョブスケジューラ) | qsub ... |
This repository implements an end-to-end computational pipeline connecting patient metagenomic sequencing data to mechanistic biofilm simulation:
NGS (shotgun) → MetaPhlAn 4 → init_comp.json → GEM (AGORA) → dFBA → COMETS
The pipeline models multi-species oral biofilm on implant surfaces, focusing on the transition between commensal and dysbiotic states relevant to peri-implantitis.
nife_intern/
├── comets/ # COMETS / dFBA simulation
│ ├── agora_gems/ # AGORA GEM reconstructions (5 species)
│ ├── notebooks/ # Jupyter notebooks (beginner + visualization)
│ ├── spatial_dfba.py # 2D spatial Monod dFBA (60×40 grid, 7 species)
│ ├── oral_biofilm.py # 0D community dFBA model
│ ├── run_comets_pipeline.py # End-to-end pipeline runner (Step A/B/C)
│ ├── sweep_comets_0d.py # Parameter sweep (glucose, cross-feeding)
│ ├── run_sobol.sh # Sobol sensitivity analysis (PBS job)
│ ├── make_pipeline_overview.py # Pipeline figure generator
│ ├── pipeline_overview.tex # TikZ pipeline diagram
│ └── pipeline_results/ # Output figures and COMETS run files
└── data/
├── metaphlan_pipeline.sh # MetaPhlAn 4 PBS pipeline script
├── metaphlan_feature_table_to_init_comp.py # Profile → init_comp.json
├── download_prjeb71108_fastq.py # FASTQ download helper (PRJEB71108)
└── PRJEB71108_filereport.tsv # ENA file report
| Code | Genus | Role |
|---|---|---|
| Str | Streptococcus spp. | Early colonizer, glucose→lactate |
| Act | Actinomyces / Schaalia | Scaffolding, early colonizer |
| Vel | Veillonella spp. | Obligate lactate cross-feeder (anaerobe) |
| Hae | Haemophilus parainfluenzae | Aerobic/facultative, NO₃ reducer |
| Rot | Rothia spp. | Health-associated, aerobic |
| Fus | Fusobacterium spp. | Bridge species, anaerobe |
| Por | Porphyromonas spp. | Late pathogen, deep anaerobe |
qsub data/metaphlan_pipeline.shRuns bowtie2 alignment against the AGORA/CHOCOPhlAn database, produces per-sample taxonomic profiles and converts them to init_comp.json (normalized genus fractions for the 7 target genera).
# Step A: 0D parameter sweep
python comets/run_comets_pipeline.py --step A
# Step B: 2D spatial healthy vs diseased comparison
python comets/run_comets_pipeline.py --step B
# Step C: Patient-specific (requires MetaPhlAn output)
python comets/run_comets_pipeline.py --step Cqsub comets/run_sobol.sh # Sobol indices, N=256, 12 paramsKey result: Fn_mu_max (ST=0.49) and Vp_Km_lac dominate dysbiosis — driven by lactate cross-feeding bridge, not Porphyromonas directly.
- Dieckow et al. 2024, npj Biofilms Microbiomes 10:155 — implant biofilm ground truth (volume, viability, composition)
- Dukovski et al. 2021, Nat. Protocols — COMETS framework
- Frings, Mukherjee et al. 2025, Analyst — ATR-FTIR strain-level identification of oral bacteria
- Joshi et al. 2025, npj Biofilms Microbiomes — peri-implantitis submucosal microbiome
SIIRI / SFB TRR-298 — Safety Integrated and Infection Reactive Implants
Group: Prof. Meike Stiesch, MHH Department of Prosthetic Dentistry and Biomedical Materials Science
Experimental collaborators: Dr. Katharina Szafrański, Dr. Rumjhum Mukherjee, Dr. Pallavi Joshi