Skip to content

Add functions for parsing ML results and building Parquet files#24

Open
AbhirupaGhosh wants to merge 3 commits into
mainfrom
Merging_all_ML_results
Open

Add functions for parsing ML results and building Parquet files#24
AbhirupaGhosh wants to merge 3 commits into
mainfrom
Merging_all_ML_results

Conversation

@AbhirupaGhosh
Copy link
Copy Markdown
Contributor

@AbhirupaGhosh AbhirupaGhosh commented May 29, 2026

Description

This file contains functions to parse machine learning result files into structured metadata and to build Parquet files from various ML performance and feature datasets.

Model Performance and Top Features Summary

Model type Performance file Top features file
Single-drug/class model all_performances.parquet all_top_features.parquet
Leave-One-Drug-Out (LODO) LOO_drug_perf.parquet LOO_drug_top_features.parquet
Cross-drug test cross_drug_perf.parquet
Geographically stratified model country_perf.parquet country_top_features.parquet
Leave-One-Geography-Out (LOGO) LOO_country_perf.parquet LOO_country_top_features.parquet
Cross-geography test cross_country_perf.parquet
Time-stratified model year_perf.parquet year_top_features.parquet
Leave-One-Time-Period-Out (LOTO) LOO_year_perf.parquet LOO_year_top_features.parquet
Cross-time test cross_year_perf.parquet
Multiclass MDR model MDR_perf.parquet MDR_top_features.parquet

What kind of change(s) are included?

  • Feature (adds or updates new capabilities)
  • Bug fix (fixes an issue).
  • Enhancement (adds functionality).
  • Breaking change (these changes would cause existing functionality to not work as expected).

Checklist

Please ensure that all boxes are checked before indicating that this pull request is ready for review.

  • I have read and followed the CONTRIBUTING.md guidelines.
  • I have searched for existing content to ensure this is not a duplicate.
  • I have performed a self-review of these additions (including spelling, grammar, and related).
  • I have added comments to my code to help provide understanding.
  • I have added a test which covers the code changes found within this PR.
  • I have deleted all non-relevant text in this pull request template.
  • Reviewer assignment: Tag a relevant team member to review and approve the changes.

AbhirupaGhosh and others added 2 commits May 29, 2026 17:43
This file contains functions to parse machine learning result files into structured metadata and to build Parquet files from various ML performance and feature datasets.
Comment thread R/merge_ml_results.R Outdated
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant