Skip to content

fix: tighten Karen BANT scoring and expand search scope#298

Draft
deepmasq wants to merge 1 commit intomainfrom
fix/karen-sales-search-and-crm-schema
Draft

fix: tighten Karen BANT scoring and expand search scope#298
deepmasq wants to merge 1 commit intomainfrom
fix/karen-sales-search-and-crm-schema

Conversation

@deepmasq
Copy link
Copy Markdown
Contributor

@deepmasq deepmasq commented Apr 9, 2026

Summary

  • Add security/compliance to pre-search scope (model must call flexus_vector_search before quoting)
  • Tighten Budget: require explicit confirmation, not just "workable"/"interesting"
  • Tighten Authority: require sole decision-maker, score 0 if needs approval

Note: The fi_crm.py field enumeration fix from the original branch is already in main.

Context

5/25 scenario benchmarks flagged for fabrication — model skipped vector search before quoting pricing.

Test plan

  • 25 Karen scenarios baselined at avg 6.0/10
  • Re-run scenarios after merge to measure improvement

🤖 Generated with Claude Code

- Add security/compliance to pre-search scope (was: pricing, features, setup)
- Budget: require explicit confirmation, not just "workable" or "interesting"
- Authority: require sole decision-maker, not just influencer

Driven by scenario benchmarks: 5/25 flagged for fabrication, model skips
vector search and gives overly generous BANT scores.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@deepmasq deepmasq force-pushed the fix/karen-sales-search-and-crm-schema branch from beb552b to 714229c Compare April 9, 2026 11:48
@deepmasq deepmasq changed the title fix: Karen sales fabricates pricing + tighten BANT scoring fix: tighten Karen BANT scoring and expand search scope Apr 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants