Skip to content

Integration Proposal: SDC Agents β€” Deterministic Semantic Data Artifacts from Enterprise DatastoresΒ #97

@twcook

Description

@twcook

πŸ”΄ Required Information

Is your feature request related to a specific problem?

ADK developers working with enterprise data sources (SQL databases, CSV, JSON, MongoDB, BigQuery) lack tooling to produce deterministic, W3C-compliant semantic data artifacts. Current integrations focus on retrieval and generation β€” none produce validated schemas (XSD, SHACL, JSON-LD) or signed XML instances from structured data.

Describe the Solution You'd Like

We maintain SDC Agents (sdc-agents on PyPI), a suite of 9 purpose-scoped ADK agents with 32 tools that transform enterprise data into validated, multi-format semantic artifacts.

What it does:

  • Introspect legacy datastores (SQL, CSV, JSON, MongoDB, BigQuery β€” read-only) and extract structure
  • Discover published schemas from a catalog of 6,400+ components (FHIR, NIEM, NIH CDEs, X12, SUS, CIHI)
  • Map source columns to semantic components with ontology links
  • Generate XML instances from mapped data
  • Validate and sign instances via a validation-as-a-service API
  • Distribute artifact packages to triplestores (Fuseki, Neo4j, GraphDB), REST APIs, and filesystems
  • Assemble new data models from component libraries

Architecture:
Each agent is an LlmAgent with a single BaseToolset. No agent has both datasource access and network access (security isolation by design). All tools are async, audited (JSONL), and cache-aware.

Usage:

from sdc_agents.agents import create_introspection_agent, create_catalog_agent

# Introspect a PostgreSQL database
introspect = create_introspection_agent()

# Search the published catalog of 6,400+ semantic components
catalog = create_catalog_agent()

Already supports:

  • ADK BaseToolset / FunctionTool / LlmAgent patterns
  • MCP export via adk_to_mcp_tool_type()
  • Docker image, CLI, PyPI package (pip install sdc-agents)
  • 184 tests, 82% coverage

We propose contributing:

  1. A thin wrapper module under contributing/samples/ with usage examples
  2. A documentation page for the adk-docs integrations directory

Impact on your work

Enables ADK agents to produce deterministic, standards-compliant data schemas and validated instances from enterprise data sources β€” closing the gap between agentic AI and formal data governance. Targets healthcare, government, financial, and research domains where data provenance and schema validation are mandatory.

Willingness to contribute

Yes β€” we have the integration ready. Corporate CLA for Axius SDC, Inc. is signed (2026-03-11).


🟑 Recommended Information

Additional Context

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions