feat(rules): refactor code to rename evaluators to rules by namrataghadi-galileo · Pull Request #245 · agentcontrol/agent-control

namrataghadi-galileo · 2026-06-22T22:02:27Z

Summary

What changed and why.

Scope

User-facing/API changes:
Internal changes:
Out of scope:

Risk and Rollout

Risk level: low / medium / high
Rollback plan:

Testing

Added or updated automated tests
Ran make check (or explained why not)
Manually verified behavior

Checklist

Linked issue/spec (if applicable)
Updated docs/examples for user-facing changes
Included any required follow-up tasks

codecov · 2026-06-22T22:26:53Z

Codecov Report

❌ Patch coverage is 97.87234% with 9 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rules/builtin/src/agent_control_rules/json/rule.py	89.65%	3 Missing ⚠️
rules/builtin/src/agent_control_rules/sql/rule.py	91.66%	3 Missing ⚠️
models/src/agent_control_models/controls.py	96.87%	1 Missing ⚠️
...ules/builtin/src/agent_control_rules/_discovery.py	98.43%	1 Missing ⚠️
...co/src/agent_control_rule_cisco/ai_defense/rule.py	91.66%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

lan17

Why are we doing this?

Evaluator is a good name, no?

namrataghadi-galileo · 2026-06-28T20:42:45Z

@lan17 This is to comply with Cisco/Splunks naming as ACE is offered now as a product within Splunk Cloud Observability. The keyword "evaluators" is reserved for "metrics".. Heres the proposal we are going to go ahead with after our internal discussions. Credits to @abhinav-galileo for coining this proposal.

{
  "scope": {
    "step_types": ["llm"],
    "stages": ["post"]
  },
  "condition": {
    "selector": { "path": "output" },
    "evaluator": {
      "name": "luna.toxicity",
      "config": {
        "operator": "gte",
        "threshold": 0.7
      }
    }
  },
  "action": { "decision": "deny" }
}

This PR is going to change. We will rename evaluators to checks and not rules.
So the terms are:

Evaluator = reusable compute/check primitive, e.g. Toxicity, Context Relevance, Regex, JSON (this complies with Splunk Observability nomenclature)
Provider = Luna, Azure Content Safety, Guardrails AI, Agent Control built-ins, etc.
Condition/check = selected input + evaluator + config/operator/threshold -> matched/not matched
Control = scope/stage + condition/check + action

In Agent Control, controls use evaluators through conditions/checks. The action is applied when the condition matches.

lan17 · 2026-06-28T20:45:58Z

That's same as current design, no?

namrataghadi-galileo · 2026-06-28T20:53:01Z

@lan17 There is subtle difference in how it works today

ControlDefinition = scope/stage + condition + action
condition = selector + evaluator_spec, or a boolean tree of conditions
evaluator_spec = evaluator + config

Example today:
{
  "scope": {
    "step_types": ["llm"],
    "stages": ["post"]
  },
  "condition": {
    "selector": { "path": "output" },
    "evaluator": {
      "name": "galileo.luna",
      "config": {
        "scorer_label": "toxicity",
        "operator": "gte",
        "threshold": 0.7
      }
    }
  },
  "action": { "decision": "deny" }
}

Here, "galileo.luna" is the evaluator, while "toxicity" is hidden inside its config. The proposal is to bring out the metrics like toxicity out as evaluators like below

"evaluator": {
      "name": "luna.toxicity",
      "config": {
        "operator": "gte",
        "threshold": 0.7
      }
    }

I would even go further and change this to

"evaluator": {
      "name": "toxicity",
      "provider": "luna",
      "config": {
        "operator": "gte",
        "threshold": 0.7
      }
    }

lan17 · 2026-06-28T20:54:43Z

I'm still confused since evaluators currently are regex, Luna, etc, no?

namrataghadi-galileo · 2026-06-28T22:06:22Z

“Evaluator” is a good name, but Splunk Observability already uses it for a reusable metric that returns a score, boolean, or findings. In Agent Control, today’s evaluator also applies thresholds or matching logic and decides whether a control triggers, so it is closer to a check.
For example: Luna evaluates toxicity and returns a score; “toxicity ≥ 0.7” is the check. Similarly, regex evaluates whether a pattern exists, while the check decides to trigger on a match; JSON/SQL evaluators return validation findings, while checks decide which findings trigger.
The proposed terminology is: Provider implements Evaluators; Checks turn evaluator outputs into matched/not matched; Conditions combine checks; and Controls add scope and action. This keeps Agent Control consistent with Splunk Observability and avoids “evaluators inside evaluators.”

{
  "scope": {
    "step_types": ["llm"],
    "stages": ["post"]
  },
  "condition": {
    "check": {
      "selector": {"path": "output"},
      "evaluator": {
        "provider": "galileo.luna",
        "name": "toxicity",
        "config": {
          "scorer_id": "..."
        }
      },
      "match": {
        "operator": "gte",
        "value": 0.7
      }
    }
  },
  "action": {
    "decision": "deny"
  }
}

abhinav-galileo · 2026-06-29T07:52:14Z

@namrataghadi-galileo - I am not sure about adding another level of nesting with check..

refactor code to rename evaluators to rules

b3a85a0

namrataghadi-galileo changed the title ~~feat(evaluators): refactor code to rename evaluators to rules~~ feat(rules): refactor code to rename evaluators to rules Jun 22, 2026

fix ci

053c8df

namrataghadi-galileo added 5 commits June 22, 2026 15:39

multiple issues

fbeb75b

test coverage

c88e084

ci

a89f512

prettyfy and also test coverage

9a4c0f0

fix ui pretty

15f1689

lan17 reviewed Jun 28, 2026

View reviewed changes

namrataghadi-galileo marked this pull request as draft June 29, 2026 19:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(rules): refactor code to rename evaluators to rules#245

feat(rules): refactor code to rename evaluators to rules#245
namrataghadi-galileo wants to merge 7 commits into
mainfrom
feature/67290-replace-evaluators-to-rules

namrataghadi-galileo commented Jun 22, 2026

Uh oh!

codecov Bot commented Jun 22, 2026 •

edited

Loading

Uh oh!

lan17 left a comment

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

lan17 commented Jun 28, 2026

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

lan17 commented Jun 28, 2026

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

abhinav-galileo commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

namrataghadi-galileo commented Jun 22, 2026

Summary

Scope

Risk and Rollout

Testing

Checklist

Uh oh!

codecov Bot commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

lan17 left a comment

Choose a reason for hiding this comment

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

lan17 commented Jun 28, 2026

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

lan17 commented Jun 28, 2026

Uh oh!

namrataghadi-galileo commented Jun 28, 2026

Uh oh!

abhinav-galileo commented Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov Bot commented Jun 22, 2026 •

edited

Loading