Add Python data loading library for IMO Bench datasets with testing by tryh4rd-26 · Pull Request #5 · google-deepmind/superhuman

tryh4rd-26 · 2026-02-06T14:53:17Z

I built a Python library to make working with IMO Bench way easier since right now everyone has to manually parse CSVs which is kind of a pain. The library gives you a clean API where you can just call load_answerbench(category="Algebra" and get back 100 type-safe problem objects instead of messing with csv.DictReader.

I added filtering for all the obvious things (category, difficulty, source, points), proper lazy loading for the big GradingBench file so it doesn't eat all your memory, and validation.

Everything is tested (35 tests all passing with the actual CSV files), uses dataclasses so your IDE knows what fields exist, and auto-finds the data directory so it just works out of the box.

This should unblock people who want to build evaluation pipelines or analyze the datasets without writing their own CSV parsers, and it's ready to publish to PyPI if we want. No breaking changes since it's all new code.

…suites

tryh4rd-26 · 2026-02-11T12:52:16Z

@ychervonyi

Ashutosh0x · 2026-02-12T22:00:57Z

I've tested the library in a Python 3.8 environment and found that the 'list[]' and 'dict[]' type hints cause errors. I've implemented compatibility fixes using 'typing.List' and 'typing.Dict' in my dependent PR #6. Would highly recommend incorporating those fixes into this PR to support older Python versions.

Add Python data loading library for IMO Bench datasets with testing …

2d7ef2e

…suites

Ashutosh0x mentioned this pull request Feb 12, 2026

Add Level column to IMO-AnswerBench #6

Open

Merge branch 'google-deepmind:main' into main

8c0303a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Python data loading library for IMO Bench datasets with testing#5

Add Python data loading library for IMO Bench datasets with testing#5
tryh4rd-26 wants to merge 2 commits intogoogle-deepmind:mainfrom
tryh4rd-26:main

tryh4rd-26 commented Feb 6, 2026

Uh oh!

tryh4rd-26 commented Feb 11, 2026

Uh oh!

Ashutosh0x commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tryh4rd-26 commented Feb 6, 2026

Uh oh!

tryh4rd-26 commented Feb 11, 2026

Uh oh!

Ashutosh0x commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants