This folder contains scripts used to process WormBase WS295 GFF2 gene annotation files for C. elegans.
1_gff2bed.py: Converts WormBase WS295.gff2annotation files to BED format with chromosome mapping and gene feature extraction.2_process_bed.sh:- Filters for long genes (default: >2kb) separated by at least 500 bp
- Generates slopped BED files
- Creates strand-aware 100-bin TSS and TES regions
- Python 3
bedtools- Shell environment with
awk,bash
c_elegans.PRJNA13758.WS295.annotations.gff2- Chromosome size file (
ce11.chrom.sizes)
- Final processed gene BEDs and binned TSS/TES BED files
Cansu Kose, 2025 – UNC Chapel Hill | Sancar Lab