You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
add a local examples/art-e Python package for an ART-E style email research agent
include deterministic inbox fixtures, search/read tools, LangGraph rollout, ART training entrypoint, and scoring helpers
add provider-free rollout/train smoke tests that monkeypatch LangGraph model creation and verify ART-E env handling
make Weave tracing optional for this example so local smoke tests can run even if Weave's transitive gql import is incompatible in a fresh environment
document offline checks/training and link the local example from the root README
Scope
This is a local deterministic ART-E-style example. The tests mock model calls and require no Gmail account, paid API, or private credentials. Real training uses art_e.train with a configured inference endpoint.
Added provider-free smoke coverage for the ART-E entrypoints: the new test monkeypatches LangGraph chat/model creation, drives the final-answer tool, verifies reward/scoring, and checks build_model reads the ART-E env vars. Validation rerun: uv run --project examples/art-e pytest examples/art-e/tests -q (6 passed), (cd examples/art-e && uv run python main.py), ruff check/format-check, compileall, and git diff --check.
Removed the generated per-example uv.lock from the PR to keep the example diff focused; this cuts 6,619 generated lines without changing the source package.\n\nReran validation after removal:\n\ntext\n$ uv run --project examples/art-e pytest examples/art-e/tests -q\n...... [100%]\n6 passed, 7 warnings in 3.01s\n\n$ (cd examples/art-e && uv run python main.py)\nScenario: Where and when is the team offsite?\nRetrieved: Team offsite logistics\nReward: 0.93\n\n$ uv run --project examples/art-e ruff check .\nAll checks passed!\n\n$ uv run --project examples/art-e ruff format --check .\n311 files already formatted\n\n\nAlso updated the PR body with a scope note: tests are deterministic/provider-free; real training requires a configured inference endpoint.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
examples/art-ePython package for an ART-E style email research agentScope
This is a local deterministic ART-E-style example. The tests mock model calls and require no Gmail account, paid API, or private credentials. Real training uses
art_e.trainwith a configured inference endpoint.Bounty
Validation
uv run --project examples/art-e pytest examples/art-e/tests -q- 6 passed(cd examples/art-e && uv run python main.py)uv run --project examples/art-e ruff check .uv run --project examples/art-e ruff format --check .python -m compileall examples/art-e/art_e examples/art-e/main.py examples/art-e/testsgit diff --check