Add SBOL to CSV dataset conversion script for ML pipeline #5

Open
qzMalekuz wants to merge 2 commits into SynBioDex:main from qzMalekuz:feature/sbol-to-dataset

@qzMalekuz

This PR introduces a utility script to convert SBOL files into a CSV dataset format compatible with ML pipelines.

Features:

  • Extracts DNA sequences from SBOL ComponentDefinitions
  • Resolves sequence references via SBOL document structure
  • Outputs a structured CSV with two columns (sequence, label)
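
For reviewers who want a feel for the flow described above, here is a minimal sketch of the same three steps. It is not the script in this PR: it parses SBOL2 RDF/XML directly with the standard library rather than an SBOL library, and the function names (`sbol_to_rows`, `write_csv`) are hypothetical. The PR does not say where the label comes from, so the sketch assumes it is the first `sbol:role` URI on the ComponentDefinition, falling back to "unknown".

```python
import csv
import io
import xml.etree.ElementTree as ET

# Namespaces used by the SBOL2 RDF/XML serialization.
SBOL = "http://sbols.org/v2#"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DNA_ENCODING = "http://www.sbolstandard.org/encoding/iupacDNA"

SAMPLE_SBOL = """
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:sbol="http://sbols.org/v2#">
  <sbol:ComponentDefinition rdf:about="http://example.org/promoter_1">
    <sbol:role rdf:resource="http://identifiers.org/so/SO:0000167"/>
    <sbol:sequence rdf:resource="http://example.org/promoter_1_seq"/>
  </sbol:ComponentDefinition>
  <sbol:Sequence rdf:about="http://example.org/promoter_1_seq">
    <sbol:elements>ATGC</sbol:elements>
    <sbol:encoding rdf:resource="http://www.sbolstandard.org/encoding/iupacDNA"/>
  </sbol:Sequence>
</rdf:RDF>
"""


def sbol_to_rows(xml_text):
    """Return (sequence, label) tuples for every DNA ComponentDefinition."""
    root = ET.fromstring(xml_text)

    # Step 1: index every DNA Sequence object by its URI.
    sequences = {}
    for seq in root.iter(f"{{{SBOL}}}Sequence"):
        encoding = seq.find(f"{{{SBOL}}}encoding")
        if encoding is not None and encoding.get(f"{{{RDF}}}resource") != DNA_ENCODING:
            continue  # skip protein, SMILES, and other non-DNA encodings
        elements = seq.findtext(f"{{{SBOL}}}elements")
        if elements:
            sequences[seq.get(f"{{{RDF}}}about")] = elements.strip().lower()

    # Step 2: resolve each ComponentDefinition's sequence references.
    rows = []
    for cd in root.iter(f"{{{SBOL}}}ComponentDefinition"):
        role = cd.find(f"{{{SBOL}}}role")
        # Assumption: the label is the first role URI, else "unknown".
        label = role.get(f"{{{RDF}}}resource") if role is not None else "unknown"
        for ref in cd.findall(f"{{{SBOL}}}sequence"):
            uri = ref.get(f"{{{RDF}}}resource")
            if uri in sequences:  # unresolved references are skipped
                rows.append((sequences[uri], label))
    return rows


def write_csv(rows, handle):
    """Step 3: write the rows as CSV with a (sequence, label) header."""
    writer = csv.writer(handle)
    writer.writerow(["sequence", "label"])
    writer.writerows(rows)


if __name__ == "__main__":
    buffer = io.StringIO()
    write_csv(sbol_to_rows(SAMPLE_SBOL), buffer)
    print(buffer.getvalue())
```

Building the sequence index first keeps reference resolution to a dictionary lookup, and silently skipping unresolved or non-DNA references is one possible policy; a real pipeline may prefer to log or fail on them.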

This serves as an initial step towards integrating SBOL-based data into the SeqTrainer pipeline.
