Creative Agent Testing Principles

Test to the Spec, NOT to the Code

Write tests as if you have zero knowledge of the implementation.

The Problem

Tests that validate code output against code output catch nothing.

Example that missed production bugs:

# ❌ WRONG - Test written by looking at code
def test_list_formats():
    result = list_creative_formats()
    assert "formats" in result  # Passes even with bugs!

The Solution

Read the ADCP spec first. Use generated Pydantic schemas. Validate every response.

# ✅ CORRECT - Test written by reading spec
def test_list_formats():
    from schemas_generated import ListCreativeFormatsResponse

    result_json = list_creative_formats()
    result = json.loads(result_json)  # Catches double-encoding
    ListCreativeFormatsResponse.model_validate(result)  # Validates ALL fields

Process for Every Test

Read spec/schema FIRST - Never look at implementation
Import generated Pydantic model - From schemas_generated/
Call tool as client would - Test public API, get JSON string
Parse JSON once - json.loads(result) catches encoding bugs
Validate with Pydantic - .model_validate() catches all schema violations

What This Catches

Double-encoding: '{"result": "{...}"}' → json.loads() fails or wrong structure
Missing required fields: Pydantic raises ValidationError
Wrong field types: Pydantic raises ValidationError
Extra fields not in spec: Pydantic raises ValidationError (when extra="forbid")
Invalid values: Constraints like ge=0, pattern=... enforced

Example: Bugs Found

Real bugs caught by spec-first testing that code-first tests missed:

Missing preview_id - Required per spec, not returned
Missing renders array - Required per spec, not returned
Extra adcp_version - Not in spec, added by mistake

Old test EXPECTED the bug:

def test_preview():
    result = json.loads(preview_creative(...))
    assert "adcp_version" in result  # Test validates the bug!

New test CAUGHT the bug:

def test_preview():
    result = json.loads(preview_creative(...))
    PreviewCreativeResponse.model_validate(result)  # ValidationError: extra field!

Never

❌ Look at code before writing test
❌ Compare output to output: assert result == expected_from_code
❌ Trust variable names or comments
❌ Test internal types instead of wire format
❌ Mock everything (hides serialization bugs)

Always

✅ Read spec first
✅ Use generated Pydantic schemas
✅ Call public API (tools/endpoints)
✅ Parse JSON explicitly
✅ Validate with .model_validate()
✅ Test error cases per spec

For Protocol Implementations (MCP, ADCP, A2A)

Every response must:

Be valid JSON (single parse, no double-encoding)
Match published schema exactly (Pydantic validates)
Have no extra fields (unless spec allows)
Have all required fields (Pydantic enforces)
Use correct types (Pydantic enforces)

If your test would pass with broken code, it's not a good test.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Creative Agent Testing Principles

Test to the Spec, NOT to the Code

The Problem

The Solution

Process for Every Test

What This Catches

Example: Bugs Found

Never

Always

For Protocol Implementations (MCP, ADCP, A2A)

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

Creative Agent Testing Principles

Test to the Spec, NOT to the Code

The Problem

The Solution

Process for Every Test

What This Catches

Example: Bugs Found

Never

Always

For Protocol Implementations (MCP, ADCP, A2A)