Gorilla BFCL patch update to automatize root cause detection and divergence in output between servings #4295

Open
dkalinowski wants to merge 1 commit into main from gorilla-patch

Conversation

@dkalinowski

Collaborator

No description provided.

Copilot AI review requested due to automatic review settings June 17, 2026 11:23

Copilot AI left a comment

Contributor

Pull request overview

This PR updates the Gorilla/BFCL patch used by the demos/continuous_batching/accuracy workflow. It optionally captures server-side verbose response details in BFCL result metadata, which helps automate root-cause detection when output diverges between serving modes.

Changes:

  • Adds a new --include-verbose-log CLI flag and threads it through BFCL generation/inference flow.
  • Captures selected fields from a model server’s __verbose response payload into BFCL result metadata for both multi-turn and single-turn inference.
  • Updates BFCL model registry entries for OVMS-backed evaluation (ovms-model, ovms-model-stream).
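
As a rough illustration of how the captured data could support divergence triage, the sketch below compares the `__verbose` entries from two runs of the same test case. It assumes that `prompt` and `content` are plain strings; the names `first_divergence` and `classify_divergence` are hypothetical and are not part of this patch.

```python
from typing import Optional


def first_divergence(a: str, b: str) -> Optional[int]:
    """Return the index of the first differing character, or None if equal."""
    for i, (ca, cb) in enumerate(zip(a, b)):
        if ca != cb:
            return i
    # One string is a strict prefix of the other.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def classify_divergence(verbose_a: dict, verbose_b: dict) -> str:
    """Hypothetical triage of two captured __verbose entries.

    Assumes both entries hold string values under "prompt" and "content".
    """
    prompt_a, prompt_b = verbose_a.get("prompt", ""), verbose_b.get("prompt", "")
    if first_divergence(prompt_a, prompt_b) is not None:
        # The servers saw different prompts, so the cause is upstream of generation.
        return "prompt"
    content_a, content_b = verbose_a.get("content", ""), verbose_b.get("content", "")
    if first_divergence(content_a, content_b) is not None:
        # Same prompt, different output: the generation step itself diverged.
        return "content"
    return "identical"
```

Checking the prompt first separates two failure classes: differing prompts point at request construction or templating, while identical prompts with differing content point at the generation step.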

+ include_verbose_log: bool = typer.Option(
+     False,
+     "--include-verbose-log",
+     help="Include the __verbose field from model server responses (e.g. OVMS) in the result output; useful for debugging generation settings, prompts, and timings.",
Comment on lines +319 to +322
+ if include_verbose_log and hasattr(api_response, "model_extra") and api_response.model_extra:
+     verbose_data = api_response.model_extra.get("__verbose")
+     if verbose_data:
+         metadata["__verbose"] = {k: verbose_data[k] for k in ("prompt", "content") if k in verbose_data}
Comment on lines +337 to +340
+ if include_verbose_log and hasattr(api_response, "model_extra") and api_response.model_extra:
+     verbose_data = api_response.model_extra.get("__verbose")
+     if verbose_data:
+         metadata["__verbose"] = {k: verbose_data[k] for k in ("prompt", "content") if k in verbose_data}
Comment on lines +229 to +233
+ if include_verbose_log and hasattr(api_response, "model_extra") and api_response.model_extra:
+     verbose_data = api_response.model_extra.get("__verbose")
+     if verbose_data:
+         current_turn_verbose_log.append({k: verbose_data[k] for k in ("prompt", "content") if k in verbose_data})
+
Comment on lines +283 to +287
+ if include_verbose_log and hasattr(api_response, "model_extra") and api_response.model_extra:
+     verbose_data = api_response.model_extra.get("__verbose")
+     if verbose_data:
+         current_turn_verbose_log.append({k: verbose_data[k] for k in ("prompt", "content") if k in verbose_data})
+
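
The four hunks above repeat the same extraction logic, so one possible refactor is a shared helper. The sketch below is an assumption rather than code from the patch: `extract_verbose` and `VERBOSE_KEYS` are hypothetical names, and `SimpleNamespace` stands in for a real client response object that exposes `model_extra`.

```python
from types import SimpleNamespace
from typing import Any, Optional

# Fields copied out of the server's __verbose payload (taken from the hunks above).
VERBOSE_KEYS = ("prompt", "content")


def extract_verbose(api_response: Any, keys=VERBOSE_KEYS) -> Optional[dict]:
    """Return the selected __verbose fields, or None if no payload is present."""
    model_extra = getattr(api_response, "model_extra", None)
    if not model_extra:
        return None
    verbose_data = model_extra.get("__verbose")
    if not verbose_data:
        return None
    return {k: verbose_data[k] for k in keys if k in verbose_data}


# Stand-in response; a real one would come from the model server client.
fake_response = SimpleNamespace(
    model_extra={"__verbose": {"prompt": "hi", "content": "hello", "timings": {"ms": 12}}}
)

include_verbose_log = True
metadata = {}
if include_verbose_log:
    captured = extract_verbose(fake_response)
    if captured is not None:
        # Single-turn style: store the filtered payload directly in the metadata.
        metadata["__verbose"] = captured
```

In the multi-turn path the same return value could be appended to `current_turn_verbose_log` instead of being assigned to `metadata["__verbose"]`.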
2 participants