docs: Include example dataset files in repo #25613

Draft
wtn wants to merge 1 commit into pola-rs:main from wtn:example

Contributor

wtn commented Dec 4, 2025

Fixes #10186.

Rust examples fail with "No such file or directory" because the dataset files (parquet, ipc, ndjson) are gitignored and only generated on demand by init_files() during test runs.
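To make the failure mode concrete, here is a minimal sketch of a check that reports which dataset files are absent from a directory. It uses only the Rust standard library. The helper name `missing_datasets` is hypothetical, and the `<stem>.<ext>` file naming is an assumption rather than something this PR states.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical helper: returns every expected dataset file that is not
/// present under `dir`. Assumes files are named `<stem>.<ext>`.
fn missing_datasets(dir: &Path, stems: &[&str], exts: &[&str]) -> Vec<PathBuf> {
    let mut missing = Vec::new();
    for stem in stems {
        for ext in exts {
            // Build the candidate path, e.g. `<dir>/foods1.parquet`.
            let path = dir.join(format!("{stem}.{ext}"));
            // `is_file` is false when the path does not exist at all.
            if !path.is_file() {
                missing.push(path);
            }
        }
    }
    missing
}
```

Called from the repository root with `Path::new("examples/datasets")`, the stems `foods1`, `foods2`, `null_nutriscore`, and the extensions `parquet`, `ipc`, `ndjson`, this should return an empty list once the files are committed, and the full list of missing paths on a fresh clone without them.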

Fix

Add pre-generated dataset files directly to the repository so the examples work out of the box.

  • Add foods1/foods2/null_nutriscore in parquet, ipc, ndjson formats
  • Remove examples/datasets/.gitignore that blocked these files
  • Add negation patterns to root .gitignore for examples/datasets/
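As an illustration of the last bullet, negation patterns in a root .gitignore might look like the fragment below. The ignore rules in the first group are hypothetical stand-ins, not the repository's actual rules. The sketch shows only the general mechanism, where a later `!` pattern re-includes files matched by an earlier pattern.

```gitignore
# Hypothetical root-level rules that ignore generated data files
*.parquet
*.ipc
*.ndjson

# Negation patterns that re-include the committed example datasets
!examples/datasets/*.parquet
!examples/datasets/*.ipc
!examples/datasets/*.ndjson
```

Note that Git cannot re-include a file whose parent directory is itself excluded, so this approach only works if examples/datasets/ is not ignored as a directory.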

github-actions bot added the documentation, python, and rust labels on Dec 4, 2025
Contributor

juansolm commented Jan 7, 2026

@wtn could we make this generic so we can add more datasets to the repository?

See this issue with regard to scattered datasets: #23763

I would propose we rename examples to datasets and organize the data into dataset folders inside it.
So your example would be datasets/food/<files/subfolders>.

Contributor Author

wtn commented Jan 7, 2026

@juansolm Would you like to collaborate on this PR?

If so, I can add you to my fork.

Contributor

juansolm commented Jan 7, 2026

I'm unable right now, thanks for the offer.

Co-authored-by: Claude <noreply@anthropic.com>
Development

Successfully merging this pull request may close these issues.

Rust example for parquet is not running
