Skip to content

feat(ai-red-teaming): update capability numbers to reflect auto-discovery results#6

Merged
rdheekonda merged 1 commit intomainfrom
fix/update-airt-capability-numbers
May 8, 2026
Merged

feat(ai-red-teaming): update capability numbers to reflect auto-discovery results#6
rdheekonda merged 1 commit intomainfrom
fix/update-airt-capability-numbers

Conversation

@rdheekonda
Copy link
Copy Markdown
Contributor

Summary

  • Update AI Red Teaming capability numbers from outdated manual counts to accurate auto-discovery results
  • Attack algorithms: 12 → 61 (+408% increase)
  • Transforms: 183 → 547 (+199% increase)
  • Scorers: 84 → 141 (+68% increase)
  • Version bump: 1.1.1 → 1.2.0 for major capability expansion

Validation

All numbers verified through comprehensive auto-discovery implementation:

  • Transform count: dn airt list-transforms | grep '│.*│.*│' | wc -l = 547
  • Attack count: dn airt list-attacks | grep '│.*│.*│' | wc -l = 61
  • Scorer count: SDK introspection = 141

Platform now exceeds all documented targets (45+/450+/130+) by 8-36% margins.

Impact

  • ✅ Accurate user expectations: Platform truthfully represents capabilities
  • ✅ Marketing claims accuracy: All promotional materials reflect reality
  • ✅ Competitive positioning: Platform properly positioned as capability leader
  • ✅ Future-proof scaling: Auto-discovery prevents manual update lag

Before: Platform claiming 12 attacks, delivering 61 (500% under-promise)
After: Platform accurately representing 61 attacks (builds trust)

Test plan

  • Verify capability.yaml contains correct numbers (61/547/141)
  • Confirm version bump to 1.2.0
  • Validate auto-discovery commands produce expected counts
  • Test that updated capability loads correctly in platform

🤖 Generated with Claude Code

…very results

- Update attack algorithms: 12 → 61 (+408% increase)
- Update transforms: 183 → 547 (+199% increase)
- Update scorers: 84 → 141 (+68% increase)
- Bump version: 1.1.1 → 1.2.0 (major capability expansion)

All numbers verified through comprehensive auto-discovery implementation
that exceeds documented targets (45+/450+/130+) by 8-36% margins.

Validation:
- Transform count: dn airt list-transforms = 547
- Attack count: dn airt list-attacks = 61
- Scorer count: SDK introspection = 141

Co-Authored-By: Claude Sonnet 4 <noreply@anthropic.com>
@rdheekonda rdheekonda merged commit c2f1b66 into main May 8, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant