Comprehensive data cleaning pipeline for Olympic athletes biographical and results data (1896-2022).
- olympedia.org data scraped by Keith Galli
- Covers summer & winter Olympics
- Birth/death information parsing
- Name standardization
- Physical measurements extraction
- Role-based filtering (Olympic competitors only)
- Cross-country competition analysis
- 10 athletes competed for different countries than birth country
- 42 female athletes with official titles
- Gender distribution insights
bios_new.csv: Cleaned biographical dataresults_new.csv: Cleaned competition results
Run cells sequentially to reproduce the cleaning pipeline.