Skip to content

Latest commit

 

History

History
411 lines (288 loc) · 22.4 KB

File metadata and controls

411 lines (288 loc) · 22.4 KB

Agents Plugin Roadmap

Date: 2026-03-23 Current release: v0.29.0

Direction

Run two explicit tracks in parallel:

  • Track A (70%): runtime reliability and operator safety
  • Track B (30%): agency adoption, packaged workflow UX, and agent-facing integration UX

Agents is being shaped for agencies and delivery teams responsible for operating Craft sites over time, especially where automation needs to stay explainable, governable, and safe in front of clients.

Near-term roadmap emphasis:

  • safer bounded automation for approved client surfaces
  • reusable workflow kits and delivery patterns
  • extension into real Craft stacks where teams already work
  • assistant-style operator support only after the trust boundary is stable

Done (v0.5.0)

  • Harden runtime contracts (API/scope/docs parity).
  • Improve operator UX in CP (clearer queue/actions/warnings).
  • Tighten validation and deterministic error behavior.
  • Expand regression coverage for auth/control/webhook/consumer paths.
  • Validate upgrade and migration safety.
  • Define and document three canonical "first agent jobs" with copy/paste examples.
  • Add an integration quickstart path focused on first successful action in under 30 minutes.

Release outcome:

  • Runtime behavior is stable in production, and new integrators can reach first value quickly.

Done (v0.6.0)

  • Ship observability baseline:
    • metrics taxonomy and naming
    • runtime metrics collection
    • metrics export endpoint
    • CP telemetry snapshot
    • runbook and alert guidance
  • Add adoption instrumentation:
    • first-call success funnel
    • time-to-first-success metric
    • credential activation and weekly usage tracking

Release outcome:

  • Operators can triage incidents quickly, and product teams can see where integration adoption drops.
  • v0.6.1 hotfix closed runtime reliability issues in adoption metrics, machine POST CSRF handling, and dual-approval race safety.
  • v0.6.2 corrected release-version metadata/tag alignment for plugin store ingestion.

Done (v0.7.0)

  • Ship one-click diagnostics bundle:
    • contract + redaction policy
    • diagnostics engine
    • CP download flow
    • CLI companion command
  • Improve integrator DX with schema/OpenAPI-based templates.
  • Publish reference automations for the canonical jobs (with tested sample payloads).

Release outcome:

  • Faster support resolution and repeatable onboarding from first call to production patterns.

Done (v0.8.0)

  • Expand Craft-native read coverage across remaining element families (users/assets/categories/tags/global sets/addresses/content blocks).
  • Expand Commerce read coverage with variants, subscriptions, transfers, and donations surfaces.
  • Extend unified incremental changes feed coverage across newly exposed resources.
  • Publish canonical agent-handbook discovery link in llms discovery outputs.

Release outcome:

  • Integrations can access a materially wider runtime surface with consistent sync semantics and discovery hints.

Done (v0.9.0)

  • Shipped schema/OpenAPI-based integration templates for canonical jobs.
  • Shipped three tested reference automations with fixture payloads.
  • Shipped copy/paste agent starter packs (curl, javascript, python) for onboarding.
  • Expanded operator reliability pack with threshold defaults, triage signals, and richer diagnostics bundle snapshots.
  • Shipped lifecycle governance controls: ownership metadata mapping, expiry/rotation reminders, and stale-key warnings.
  • Removed Control CP surface (tab/routes/forms/permissions) from public operator UX; kept internal control-plane internals feature-flagged for future adapter-based execution.

Release outcome:

  • Integrators can move from first call to production patterns faster, and operators get clearer reliability/lifecycle posture without exposing unfinished return workflows.

Done (v0.9.1)

  • Hidden Lifecycle Governance warning surfaces in the Agents CP view (summary panel + card warning strips) while keeping lifecycle APIs/services intact.

Release outcome:

  • Operators get a cleaner Agent card view now, with lifecycle governance still available for future reintroduction without backend rollback.

Done (v0.21.x)

  • Shipped operator notifications with email-first delivery, recipient routing, recent-delivery visibility, and scheduled status-check support.
  • Reintroduced the public operator IA around Status, Approvals, Accounts, and Settings and hardened the governed approval flow.
  • Published the first-worker bootstrap path with a public guide and example worker.
  • Bound approved governed entry-draft requests to exact saved drafts and blocked conflicting saved-draft creation to reduce ambiguous draft apply behavior.

Release outcome:

  • Operators now have materially stronger support surfaces for notifications, account bootstrap, and governed draft approvals.

Proposed Path to 1.0.0

Done (v0.25.5)

  • Reworked the Accounts template shelf around agency-first workflow profiles and split it into a tighter default set plus a secondary More templates section.
  • Added stronger service-shaped templates for SEO/metadata, Commerce catalog review, accessibility review, and launch QA, while removing the old Site Structure Review shelf entry.
  • Hid governed-write and Commerce-specific templates when those capabilities are unavailable so the shelf matches the site’s actual operating surface.

Release outcome:

  • The default account-template shelf is easier for agencies to explain, safer to start from, and better aligned with repeatable client delivery patterns.

Done (v0.25.4)

  • Added configurable approval timing directly to Rules, covering due, escalation, and expiry windows.
  • Reworked the Approvals tables around operator-readable cells, compact symbol actions, CP-user approval links, and clearer dual-control progress.
  • Surfaced past-due approval requests in the Status approvals card.

Release outcome:

  • Governed approvals are easier to scan, explain, and act on, while approval timing is now configured where operators already manage the rule itself.

Done (v0.25.3)

  • Reworked Status into a compact dashboard with a larger verdict anchor, compact operator cards, and modal-hosted detail/probe/sink surfaces.
  • Tightened Status routing so only problematic rows deep-link into specific follow-up views, while footer actions return to the unfiltered operator pages.
  • Switched the Accounts card registry to a four-column desktop grid and removed plugin-level table-header bottom borders for a calmer CP rhythm.

Release outcome:

  • Status now behaves more like an operator dashboard than a diagnostics console, while the surrounding CP surfaces stay visually flatter and easier to scan.

Done (v0.25.1)

  • Polished the Accounts card view around quieter summaries, visible worker .env previews with inline copy/download actions, and the removal of the empty add-form shell below the registry.
  • Fixed card-view pulse simulation so ?simulatePulse= lights the visible card indicators as expected.

Release outcome:

  • The Accounts card view is calmer and more usable for day-to-day operator review, while worker bootstrap details stay one click away inside the details surface.

Done (v0.25.0)

  • Implemented governed write target sets and CP helpers as the main operator-safety slice for bounded client automation.
  • Added reusable Target Sets with dedicated CP management, account assignment, approval summaries, and server-side request/execution enforcement.
  • Kept runnable worker setup account-scoped while moving target-set management into its own governance surface.
  • Continued flattening the public CP IA around shared page shells, cleaner section separators, and more consistent table-first registry views.

Release outcome:

  • Agencies can automate explicitly approved client surfaces with clearer trust boundaries, while operators configure bounded write lanes without overloading managed accounts.

Done (v0.24.0)

  • Implemented the production webhook probe.
  • Added an admin-only Webhook Probe card in Status that sends a synthetic signed delivery against the live runtime webhook target.
  • Added a dedicated probe ledger with recent run history, payload inspection, triggered-by metadata, and cooldown visibility.
  • Kept the production probe separate from the dev-only Webhook Test Sink while reusing the same signing and outbound HTTP transport path.

Release outcome:

  • Operators can validate live webhook transport safely in-place, without saving content or temporarily pointing delivery at a local sink.

Done (v0.23.0)

  • Reworked Accounts around a Craft-style managed-account registry with a default table view, an alternate card view, and modal-hosted details/actions that keep lifecycle operations coherent in both modes.
  • Simplified governed diff review down to Structured and Focus, added an After / Before toggle inside Focus mode, and removed stale warning/toggle copy that no longer helped approval decisions.
  • Published the public Agents vs Element API positioning page and refreshed CP/docs wording around the new Accounts registry model.

Release outcome:

  • Operators can compare and manage machine identities more like a real registry, and approval review now concentrates on the two modes that actually support proofing work.

Done (v0.22.2)

  • Restored async credential rotation in the Accounts details panel so rotate stays inline and can reveal the new one-time token without a full-page reload.
  • Brightened the one-time token overlay actions and switched them to a smaller Craft-native button treatment for clearer contrast and better visual hierarchy.

Release outcome:

  • Credential rotation is back to the intended inline workflow, and the token overlay actions are readable enough to support real operator use.

Done (v0.22.1)

  • Tightened the operator surfaces after approval diff review with cleaner Accounts, Approvals, and Status card framing built around shared muted header strips and more native Craft action treatments.
  • Fixed completed approval diffs so Applied / Completed can still show meaningful changed rows after an approved draft has been applied and the active draft no longer exists.
  • Added an operator-facing stale-status reset action and aligned the top Status verdict with the same final summary logic shown in the proof cards.

Release outcome:

  • The core governed-approval UX is more trustworthy in daily use, and the surrounding CP surfaces now read more consistently as Craft-native operator tooling.

Done (v0.22.0)

  • Implemented the approval content diff review surface.
  • Added a dedicated Diff action next to Review for governed entry-draft approvals.
  • Shipped the first version as changed-only, field-aware, and optimized for fast human judgment, including a text-focused redline view.

Release outcome:

  • Approvers can see what changed in a few seconds instead of inferring content changes from raw payloads or metadata.

Planned (Pre-1.0.0) First-Run Onboarding and Contract Stabilization

  • Implement fresh-install start screens and first-run onboarding as the main new-install adoption slice.
  • Add a branded welcome/start surface for fresh installs with short orientation copy, doc links, and a strong first-account CTA.
  • Add a lightweight bootstrap follow-up state that guides operators from “first account exists” to “first real machine use”.
  • Freeze the main CP IA.
  • Freeze canonical routes and scope naming.
  • Add full multi-site and multi-store support across the public contract:
    • explicit site/store selectors where they affect API behavior
    • predictable defaults when selectors are omitted
    • documentation that makes site/store context unambiguous for operators and integrators
  • Verify Craft Cloud compatibility and document the Cloud setup path:
    • Cloud env variables
    • SMTP mail setup
    • scheduled agents/notifications-check command
    • Cloud-specific operator guidance where wording differs from generic server setups
  • Tighten upgrade notes, deprecation rules, and compatibility discipline.
  • Remove avoidable churn from user-visible contracts.

Release outcome:

  • New installs should reach first value faster, while the public contract becomes materially safer to build against.

Released (v0.26.0) Agency Stack Extension Foundation

  • Delivered the first agency stack extension slice:
    • provider registry + registration event
    • dynamic external-resource routes
    • external read scopes merged into capabilities/OpenAPI/schema
    • Accounts scope assignment grouped by provider and resource
  • Added a minimal standalone Retour reference adapter package as the first end-to-end proof of the contract.
  • Kept the foundation independent of outside collaboration and positioned concrete adapters as optional follow-up proof points, not prerequisites.

Release outcome:

  • Agents can extend safely into plugin ecosystems agencies already standardize on, and the adapter contract is now real enough to build against.

Released (v0.27.0) Agency Workflow Starter Kits and Companion Workers

  • Delivered the first workflow starter-kit slice:
    • public Governed Content Refresh workflow guide
    • companion worker scaffold under examples/workers/governed-content-refresh/
    • direct CP links from Approvals and account worker helpers into the starter-kit path
  • Added manual control-plane approval requests for governed entry.updateDraft so operators can test and use bounded draft-write flows without leaving Craft.
  • Reworked the surrounding Approvals, Accounts, and Target Sets surfaces around calmer copy, relative short dates, clearer worker/bootstrap helpers, and cleaner registry tables.

Release outcome:

  • Agencies now have the first repeatable starter-kit workflow and companion worker path for bounded governed draft refresh work, alongside calmer operator-facing CP surfaces.

Released (v0.28.0) Operator-Managed Workflow Instances

  • Delivered a dedicated Workflows CP surface where operators can create, configure, inspect, duplicate, and hand off workflow instances.
  • Kept workflow creation template-based and read-only-first rather than introducing a generic builder.
  • Kept execution in external workers:
    • Agents stores workflow config, schedule intent, managed-account binding, and handoff visibility
    • the actual schedule runner, fetch/reasoning loop, raw outputs, and execution stay outside Agents
  • Promoted the workflow starter-kit direction into the first managed workflow templates and visible operator handoff path.
  • Reused the existing governed draft approval path instead of introducing a second write-workflow execution lane.
  • Added matching-account handoff guidance, recent-run placeholders, and calmer workflow/status wording so operators can inspect the surface without mistaking Agents for a hosted job runner.

Release outcome:

  • Agencies can turn a reusable starter workflow into a managed, repeatable operating surface inside Craft without turning Agents into a generic orchestration platform or pretending Agents is the runtime itself.

Released (v0.29.0) Read-Only Jobs, Bounded Reads, and Safer Account Matching

  • Added bounded read selectors for read-only Jobs so operators can define focused entry/product/site/section surfaces without broadening the governed-write boundary model.
  • Added explicit job runtime reporting and job-support scopes so recent-run visibility now follows an intentional contract instead of placeholder state.
  • Reworked operator-facing workflow language into Jobs, tightened account capability defaults, and made account selection more readable through prompt-first job setup and filtered account cards.

Release outcome:

  • Operators can define clearer read-only jobs, hand them to external agents with a smaller access surface, and review recent job activity without reasoning from raw workflow scaffolding.

Planned (v0.29.x) Workflow Runtime API and Agent Bootstrap Discovery

  • Add a machine-facing workflow discovery contract so an external agent can enumerate:
    • workflow types
    • configured workflow instances
    • read-boundary summaries
    • latest run state where available
  • Add machine-readable workflow and account handoff manifests so operators can point an external agent at Agents without requiring CP HTML scraping.
  • Keep workflow handoffs non-secret by default and introduce a separate explicit one-time bootstrap retrieval path for account secrets instead of a permanent token-read endpoint.
  • Keep the route shape aligned with the plugin’s existing list / show API style instead of inventing a second API convention for workflows alone.
  • Leave curated account/workflow creation from templates as a later follow-on once discovery, handoff parity, and bootstrap-secret safety have proven stable.
  • Sequencing note:
    • v0.28.x should first stabilize bounded read workflow scope and run reporting
    • v0.29.x should then reuse that same workflow/run shape for discovery and bootstrap instead of defining a second parallel contract
  • Dependency graph for the workflow discovery/bootstrap slice:
    • T1 depends_on: [] Define the machine-facing discovery and bootstrap contract, including response shapes, scope model, and the split between non-secret manifests and one-time secret retrieval.
    • T2 depends_on: [T1] Add workflow-type and workflow-instance discovery endpoints plus explicit auth scopes for those surfaces.
    • T3 depends_on: [T1] Add machine-readable workflow/account handoff manifests that mirror the CP bundle outputs and include output-storage guidance.
    • T4 depends_on: [T1] Add short-lived, audited bootstrap-secret retrieval for account handoff without creating a permanently readable token endpoint.
    • T5 depends_on: [T2, T3, T4] Integrate the discovery/bootstrap model with workflow runtime reporting and shared bundle generation so CP and API handoffs stay in sync.
    • T6 depends_on: [T2, T3, T4, T5] Add docs, QA, and future-ready hooks for later curated create-from-template endpoints.

Planned (v0.30.x) Agency Operator Copilot Foundation

  • Implement a provider-backed orchestration foundation for optional in-product LLM support.
  • Start with env-only site-level provider configuration as the convenient in-product path.
  • Support external assistants through the existing governed API and discovery surfaces, documented and supported in v1 without introducing a heavy first-class external profile system.
  • Support recommendation-first jobs such as scope recommendation, summaries, report drafting, and other agency-facing explanation work before broader in-product assistant behavior.
  • Keep per-account BYOM out of the main path unless real demand proves the extra secret-management complexity is justified.
  • Do not introduce broad autonomous operator control.

Release outcome:

  • Agency teams get a constrained copilot for discovery, recommendations, and client-facing drafts without weakening the existing trust boundary.

Planned (v0.30.x) Agency Fleet Operations Assist and Dependency-Led Extensibility

  • Begin fleet operations assist, phase 1 only.
  • Keep it read-first and recommendation-first:
    • status visibility
    • approval queue visibility
    • account posture visibility
    • guided remediation
  • Do not introduce broad delegated self-administration.
  • Implement only the useful parts of the custom-scope and field-policy layer if they are required by the adapter foundation, governed write boundaries, or later agency-facing governance work.
  • Keep extensibility work dependency-led rather than turning it into a standalone roadmap narrative.

Release outcome:

  • Agencies can operate more client sites with better guided insight, while extensibility grows only where it directly supports real agency workflows.

Planned (v0.31.x) Pre-1.0 Consolidation

  • Focus on bug fixing, upgrade safety, onboarding, and support polish.
  • Avoid major IA churn.
  • Validate that the core product promise and support model hold under real customer use.

Release outcome:

  • The product is ready for a 1.0.0 stability commitment rather than still behaving like a moving target.

1.0.0 Criteria

Before 1.0.0, the following must be stable:

  • top-level CP information architecture
  • canonical CP routes
  • core scope catalog and naming
  • core machine-readable descriptors
  • settings model and config-lock behavior
  • managed-account lifecycle behavior
  • webhook delivery and verification model
  • upgrade and migration expectations

Post-1.0.0 Candidate Commerce Governed Writes

  • Evaluate Commerce mutations through the same trust-boundary model as governed entry drafts, not as unrestricted Commerce admin APIs.
  • Prioritize the work in phases:
    • Phase 1: bounded price updates for existing products/variants
    • Phase 2: discount coupon/code creation with explicit limits
    • Phase 3: product creation only after the first two phases prove the governance model
  • Treat price updates as the strongest first slice:
    • clear agency value
    • bounded review surface
    • easier operator reasoning than full product creation
  • Treat coupon/code creation as valuable but higher risk:
    • requires explicit store scoping
    • expiry / usage-limit constraints
    • code uniqueness rules
    • promotion-policy review
  • Keep full product creation deliberately later:
    • much higher catalog complexity
    • more operator review burden
    • greater risk around variants, slugs, tax, inventory, shipping, and merchandising completeness
  • Reuse the existing target-bound governance patterns, but extend them with Commerce-specific write bounds such as:
    • store and currency selectors
    • max price delta / percentage movement
    • allowed discount archetypes
    • explicit product/variant collections
    • scheduled effective windows where applicable

Release outcome:

  • Agencies can support governed pricing and promotion workflows in Craft Commerce without exposing broad machine-admin power, while full catalog creation stays behind a higher bar.

Parked / Not Before 1.0.0 Unless Reassessed

  • agent commerce via stablecoin spend rail remains intentionally parked and should not shape the near-term core roadmap.

Success Checks

  • Support escalations reduced by at least 30% by end of v0.8.0 cycle.
  • Time-to-triage for integration incidents below 15 minutes.
  • Critical-path regression coverage at or above 90%.
  • Median time-to-first-successful integration action below 30 minutes.
  • At least 3 canonical agent jobs are shipped, documented, and validated end-to-end.
  • Weekly active credentials trend upward for two consecutive releases.

Out of Scope (this horizon)

  • New major action domains beyond current governed return/control model.
  • Large visual redesign unrelated to operator clarity.
  • Non-Craft platform expansion.