Track the public roadmap for integrations where AgentV stays the repo-native eval authoring, gating, and artifact surface while external systems provide complementary execution, observability, or import/export workflows.
Scope:
Keep AgentV YAML, grading semantics, run bundles, and CI exit behavior authoritative for AgentV-authored evals.
Add narrow adapters or documented recipes for external observability and evaluation systems when they consume completed AgentV artifacts or emit trace data that AgentV can correlate with its own runs.
Delegate benchmark-grade execution to purpose-built runners where appropriate, then import or gate on their results through AgentV artifacts.
Preserve privacy and portability by making raw content export opt-in and by keeping metadata/provenance explicit.
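The opt-in export rule in the scope above could be sketched roughly as follows. This is a minimal illustration, not AgentV's actual export code: the `prepare_export` function, the `results`/`provenance` layout, and the field names in `RAW_CONTENT_FIELDS` are all hypothetical stand-ins for whatever the real run bundle schema defines.

```python
from copy import deepcopy

# Hypothetical names for fields that carry raw prompt/response content.
# The real AgentV run bundle schema may use different keys.
RAW_CONTENT_FIELDS = ("input", "output", "transcript")


def prepare_export(run: dict, include_raw_content: bool = False) -> dict:
    """Return a copy of a completed run that is safe to send to an external system.

    Raw content is stripped unless the caller explicitly opts in, and the
    decision is recorded in the provenance block so it stays explicit.
    """
    exported = deepcopy(run)  # never mutate the authoritative artifact

    if not include_raw_content:
        for result in exported.get("results", []):
            for field in RAW_CONTENT_FIELDS:
                result.pop(field, None)

    # Keep existing provenance and record whether raw content was exported.
    exported["provenance"] = {
        **exported.get("provenance", {}),
        "raw_content_included": include_raw_content,
    }
    return exported


# Hypothetical completed run, for illustration only.
run = {
    "run_id": "example-run",
    "provenance": {"source": "agentv"},
    "results": [
        {"test_id": "t1", "score": 0.9, "input": "question", "output": "answer"},
    ],
}

safe = prepare_export(run)                            # default: metadata and scores only
full = prepare_export(run, include_raw_content=True)  # explicit opt-in
```

Defaulting `include_raw_content` to `False` means an adapter that forgets to pass the flag exports only scores and metadata, which matches the intent of making raw content export opt-in.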
Initial themes:
Opik export/import recipes over completed AgentV runs and traces.
Harbor-backed benchmark execution and result import boundaries.
Promptfoo interoperability where format conversion helps users migrate or share eval inputs without adding runtime coupling.
Non-goals: