feat(cli): add reingest command for re-ingesting execution results #610
lewisjared wants to merge 4 commits into main from
Conversation
Add `ref executions reingest` command that re-runs build_execution_result() on existing outputs without re-executing diagnostics. Uses ReingestMode enum for the --mode option, providing tab completion and validated choices (additive, replace, versioned) via Typer's native enum support.
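The enum pattern the description refers to can be sketched like this (a minimal sketch using the mode names from the description; Typer renders a `str`-backed enum as validated `--mode` choices with tab completion):

```python
import enum


class ReingestMode(str, enum.Enum):
    # Typer derives the allowed --mode values from these members.
    additive = "additive"      # keep existing rows, add new ones
    replace = "replace"        # delete existing rows, then re-ingest
    versioned = "versioned"    # create a new execution record


# A str-backed enum round-trips from the raw CLI string:
mode = ReingestMode("replace")
```

Because the enum inherits from `str`, each member compares equal to its string value, which is what lets Typer validate and display the choices in `--help`.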
…d scratch cleanup

Collapse four near-identical metric ingestion functions into two by adding an optional `existing` parameter for additive dedup. Hoist the dimension query into `_ingest_metrics` to avoid duplicate DB queries. Fix the double iteration of the `iter_results()` generator. Add try/finally scratch directory cleanup to prevent disk accumulation during batch reingests. Remove the redundant dirty-flag save/restore in the CLI.
Pull request overview
Adds a new “reingest” workflow to the Climate REF CLI/executor layer to re-run build_execution_result() against existing on-disk outputs and ingest updated metrics/metadata into the DB without re-executing diagnostics.
Changes:
- Introduces `climate_ref.executor.reingest` with `ReingestMode` and reingest/query helpers.
- Adds the `ref executions reingest` CLI command with safety guards (filters required, confirm prompt, dry-run) and mode selection.
- Adds unit tests for reingest modes and filtering, plus a changelog entry.
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 8 comments.
| File | Description |
|---|---|
| packages/climate-ref/src/climate_ref/executor/reingest.py | Implements reingest logic, scratch copying, savepoint-based DB mutation, and execution querying. |
| packages/climate-ref/src/climate_ref/cli/executions.py | Adds ref executions reingest command wiring and UX/safety features. |
| packages/climate-ref/src/climate_ref/executor/\_\_init\_\_.py | Exposes ReingestMode and reingest_execution in executor exports. |
| packages/climate-ref/tests/unit/executor/test_reingest.py | New unit tests covering reingest behavior across modes and filters. |
| changelog/610.feature.md | Documents the new CLI command and modes. |
```python
for key, output_info in outputs.items():
    filename = ensure_relative_path(output_info.filename, results_base)
```
_handle_reingest_outputs() calls ensure_relative_path(output_info.filename, results_base), but the output bundle was produced in the scratch directory during re-extraction. If output_info.filename is an absolute path under scratch (as supported by ensure_relative_path and used in result_handling._handle_outputs), this will raise ValueError and abort reingest. Use the scratch output directory as the root for ensure_relative_path (or pass it in), and still store the resulting relative filename in the DB.
Suggested change:

```python
# During re-extraction, output bundles may be produced under the scratch
# directory. In that case, output_info.filename can be an absolute path
# under scratch, which would cause ensure_relative_path(..., results_base)
# to raise ValueError. Fall back to using the scratch output directory as
# the root for ensure_relative_path while still storing the resulting
# relative path in the database.
scratch_base = config.paths.scratch / execution.output_fragment
for key, output_info in outputs.items():
    try:
        filename = ensure_relative_path(output_info.filename, results_base)
    except ValueError:
        filename = ensure_relative_path(output_info.filename, scratch_base)
```
```python
if result.output_bundle_filename:
    if mode != ReingestMode.additive:
        database.session.execute(
            delete(ExecutionOutput).where(ExecutionOutput.execution_id == target_execution.id)
        )
    _handle_reingest_output_bundle(
        config,
        database,
        target_execution,
        result.to_output_path(result.output_bundle_filename),
    )
```
In additive mode, existing ExecutionOutput rows are not deleted (mode != additive guard), but new outputs are still inserted from the output bundle. Re-running additive reingest will therefore accumulate duplicate outputs, and it also contradicts the CLI help text that says outputs are replaced. Either always clear outputs before inserting, or add dedup/upsert logic for outputs in additive mode.
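A minimal sketch of what the suggested dedup could look like, assuming `(output_type, filename)` as the natural key for an output; the dataclass below stands in for the real `ExecutionOutput` model and the helper name is illustrative:

```python
from dataclasses import dataclass


@dataclass
class Output:
    # Stand-in for ExecutionOutput; the real model is a SQLAlchemy class.
    output_type: str
    filename: str
    short_name: str


def dedup_additive(existing: list[Output], incoming: list[Output]) -> list[Output]:
    """Return only the incoming outputs not already present.

    Keyed on (output_type, filename) — an assumed natural key — so that
    re-running an additive reingest does not insert duplicate rows.
    """
    seen = {(o.output_type, o.filename) for o in existing}
    return [o for o in incoming if (o.output_type, o.filename) not in seen]
```

The alternative the review mentions, always clearing outputs before inserting, avoids the need for a key entirely at the cost of losing any rows the bundle no longer produces.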
```python
if mode == ReingestMode.versioned:
    version_hash = hashlib.sha1(  # noqa: S324
        f"{execution.output_fragment}-reingest-{execution.id}".encode()
    ).hexdigest()[:12]

    target_execution = Execution(
        execution_group=execution_group,
        dataset_hash=execution.dataset_hash,
        output_fragment=f"{execution.output_fragment}_v{version_hash}",
    )
```
Versioned mode derives output_fragment from a deterministic hash of the original fragment and execution.id. Re-running versioned reingest for the same execution will generate the same output_fragment, overwriting the previous versioned results directory and making multiple “versions” impossible. Consider generating a unique fragment per run (e.g., include the new Execution.id after flush, a timestamp, or a UUID).
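One way to make the fragment unique per run, following the reviewer's UUID suggestion (the helper name is illustrative, not the PR's API):

```python
import uuid


def versioned_fragment(output_fragment: str) -> str:
    """Derive a results fragment that is unique per reingest run.

    Unlike the deterministic sha1 of (fragment, execution.id), a random
    UUID suffix changes on every call, so repeated versioned reingests
    land in distinct directories instead of overwriting each other.
    """
    return f"{output_fragment}_v{uuid.uuid4().hex[:12]}"
```

A timestamp or the new `Execution.id` after flush would work equally well; the only requirement is that two runs for the same source execution never collide.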
```python
if mode == ReingestMode.versioned:
    assert result.metric_bundle_filename is not None
```
Only versioned mode calls target_execution.mark_successful(...). In additive/replace, a previously-failed execution (possible via --include-failed) will remain successful=False and path may stay unset/stale even after successfully ingesting new metric bundles/values. Consider updating successful/path on success for all modes (or at least when execution.successful is not already True).
Suggested change:

```python
# Mark the target execution as successful when we have a metric bundle.
# For versioned mode, always update (preserving existing behavior).
# For other modes, update at least when the execution is not already successful
# so that previously failed executions can be corrected on successful reingest.
if result.metric_bundle_filename is not None and (
    mode == ReingestMode.versioned or not getattr(target_execution, "successful", False)
):
```
```python
src = config.paths.results / output_fragment
dst = config.paths.scratch / output_fragment
if dst.exists():
    shutil.rmtree(dst)
shutil.copytree(src, dst)
return dst
```
_copy_results_to_scratch() (and later copy/delete operations) uses output_fragment from the DB to construct paths and then calls rmtree/copytree without verifying the computed paths stay under config.paths.scratch/results. If output_fragment is absolute or contains traversal, Path joining can escape the base directory and lead to deleting/copying arbitrary paths. Add safety checks similar to cli/executions.py (e.g., ensure dst.is_relative_to(config.paths.scratch) and src.is_relative_to(config.paths.results) before rmtree/copytree).
```python
# Copy the results directory to scratch so that build_execution_result()
# can write CMEC bundles without mutating the live results tree.
# If anything fails, the original files remain untouched.
scratch_dir = _copy_results_to_scratch(config, execution.output_fragment)
```
Scratch directory is fixed to config.paths.scratch / output_fragment and is deleted/recreated. This can race if multiple reingests run concurrently, and it can also conflict with a normal execution using the same output_fragment (since standard execution also uses scratch/output_fragment). Using a unique temporary scratch subdirectory per reingest run (and pointing ExecutionDefinition.output_directory at it) would avoid collisions and reduce the need for rmtree().
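A sketch of the per-run scratch directory idea; the names are illustrative, not the PR's API:

```python
import tempfile
from pathlib import Path


def make_reingest_scratch(scratch_root: Path, output_fragment: str) -> Path:
    """Create a unique per-run scratch subdirectory for a reingest.

    mkdtemp guarantees a fresh directory on every call, so concurrent
    reingests cannot collide with each other or with a normal execution
    writing to scratch/<output_fragment>, and no rmtree of a shared
    path is needed up front.
    """
    run_root = Path(tempfile.mkdtemp(prefix="reingest-", dir=scratch_root))
    target = run_root / output_fragment
    target.mkdir(parents=True)
    return target
```

`ExecutionDefinition.output_directory` would then point at the returned path, and the try/finally cleanup removes `run_root` after ingestion.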
```python
console.print("Reingest cancelled.")
return

# Process each execution in a separate session
```
Comment says “separate session” but the code is using the same db.session and just opening a new transaction per execution with session.begin(). Consider rewording to “separate transaction” to avoid confusion about session/connection lifecycle.
Suggested change:

```diff
- # Process each execution in a separate session
+ # Process each execution in a separate transaction
```
```python
ExecutionOutput(
    execution_id=execution.id,
    output_type="plot",
    filename="test.png",
    short_name="test",
)
```
TestDeleteExecutionResults creates ExecutionOutput with output_type="plot" (string). ExecutionOutput.output_type is typed as ResultOutputType, and other tests use ResultOutputType.Plot; using a raw string may fail depending on SQLAlchemy Enum coercion. Prefer constructing with ResultOutputType.Plot for consistency and to avoid backend-dependent behavior.
Codecov Report

❌ Patch coverage is …

Flags with carried forward coverage won't be shown.

... and 1 file with indirect coverage changes
Description
Add a `ref executions reingest` command that re-runs `build_execution_result()` on existing output files and re-ingests the results into the database without re-executing diagnostics. This is useful when new series definitions or metadata extraction logic has been added.

Key features:

- `ReingestMode` enum: `additive` (keep existing, add new), `replace` (delete and re-ingest), `versioned` (new execution record)
- `--mode` option for better UX (tab completion, validated choices shown in help)

Checklist
Please confirm that this pull request has done the following:
changelog/