
Conversation

@wingding12 commented Jan 27, 2026

Summary

Fixes #4168

The per-task token tracking had a race condition when using threading-based async_execution (task.async_execution=True with crew.kickoff()). Multiple concurrent tasks from the same agent would get incorrect token attribution because tokens_before was captured when tasks were queued, but tokens_after was captured sequentially when processing futures.

Problem

When multiple tasks with async_execution=True are executed by the same agent:

  1. tokens_before was captured when the task was queued
  2. Tasks run concurrently in threads
  3. All concurrent tasks captured similar tokens_before values
  4. tokens_after was captured sequentially in _process_async_tasks after calling future.result()
  5. Later tasks got credited with tokens from earlier tasks that ran in parallel
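The failure mode in the steps above can be reproduced with a toy model of the shared counter (a minimal sketch, not crewAI code): snapshots taken at queue time and read after future resolution produce inflated deltas, while snapshots taken inside the execution context do not. The threads here run one after another for determinism; the snapshot timing is the point.

```python
import threading

class SharedCounter:
    total = 0  # stands in for the agent's cumulative LLM token count

def run_task(cost: int) -> None:
    SharedCounter.total += cost

# Buggy pattern: every queued task snapshots the counter up front...
before_t1 = SharedCounter.total   # 0
before_t2 = SharedCounter.total   # 0 -- same baseline for the concurrent task
t1 = threading.Thread(target=run_task, args=(100,))
t2 = threading.Thread(target=run_task, args=(300,))
t1.start(); t1.join()
t2.start(); t2.join()
# ...and the "after" readings happen only once futures are processed:
delta_t1 = SharedCounter.total - before_t1   # 400 -- credited with t2's tokens
delta_t2 = SharedCounter.total - before_t2   # 400 -- double-counted

# Fixed pattern: snapshot inside the execution context, around the work itself.
SharedCounter.total = 0
deltas: dict[str, int] = {}
lock = threading.Lock()

def run_task_fixed(name: str, cost: int) -> None:
    with lock:  # serialize snapshot + update so each delta is exact
        start = SharedCounter.total
        SharedCounter.total += cost
        deltas[name] = SharedCounter.total - start

a = threading.Thread(target=run_task_fixed, args=("t1", 100))
b = threading.Thread(target=run_task_fixed, args=("t2", 300))
a.start(); b.start()
a.join(); b.join()
# deltas == {"t1": 100, "t2": 300}
```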

Solution

Capture token usage snapshots within the execution context (thread or async task) rather than after futures are resolved:

  • Add token_usage field to TaskOutput to store per-task metrics
  • Add _get_agent_token_usage() helper to capture token snapshot from agent's LLM
  • Add _calculate_token_delta() helper to compute token differences
  • Modify _execute_task_async() to capture tokens within the thread immediately before and after task execution
  • Update _execute_core() and _aexecute_core() for consistent token tracking across all execution paths

Files Changed

  • lib/crewai/src/crewai/task.py - Token tracking logic in task execution
  • lib/crewai/src/crewai/tasks/task_output.py - Add token_usage field
  • lib/crewai/tests/task/test_task_token_tracking.py - Tests for the fix

Note

Adds accurate per-task token tracking and resolves async race conditions in token attribution.

  • Introduces token_usage on TaskOutput and imports UsageMetrics
  • Adds _get_agent_token_usage() and _calculate_token_delta() helpers
  • Updates _execute_task_async() to snapshot tokens before/after execution within the thread and store the delta on the result
  • Applies the same before/after token snapshotting in _execute_core() and _aexecute_core() so sync and async paths are consistent

Written by Cursor Bugbot for commit 56c3d02.

@wingding12 force-pushed the fix/token-tracking-race-condition-4168 branch from 98efb32 to 56c3d02 on January 27, 2026, 16:06

@cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 3 potential issues.


    agent=agent.role,
    output_format=self._get_output_format(),
    messages=agent.last_messages, # type: ignore[attr-defined]
    token_usage=token_delta,


Token usage lost when guardrails trigger task retry

Medium Severity

The token_usage tracking is incomplete. When _execute_core or _aexecute_core creates a TaskOutput with token_usage=token_delta, this value is lost if a guardrail fails and triggers a retry. The guardrail retry paths (_invoke_guardrail_function and _ainvoke_guardrail_function) create new TaskOutput objects without passing token_usage, resulting in token_usage=None for any task that goes through guardrail retry.

Additional Locations (1)
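One way the retry gap could be closed is sketched below under stated assumptions (`run`, `guardrail`, and the dict-based `counter` are hypothetical stand-ins, not crewAI's API): snapshot once before the first attempt and compute the delta only after the accepted output, so tokens spent on guardrail retries stay attributed to the task instead of being dropped.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class TaskOutputSketch:
    raw: str
    token_usage: Optional[int] = None

def execute_with_guardrail(run, guardrail, counter, max_retries: int = 2) -> TaskOutputSketch:
    before = counter["total"]            # snapshot once, before attempt #1
    output = run()
    retries = 0
    while not guardrail(output) and retries < max_retries:
        output = run()                   # a retry creates a fresh output...
        retries += 1
    output.token_usage = counter["total"] - before   # ...but keeps the full delta
    return output

# Demo with a fake token counter: the first attempt fails the guardrail.
counter = {"total": 0}
attempts = []

def run():
    counter["total"] += 100              # each attempt "spends" 100 tokens
    attempts.append(len(attempts) + 1)
    return TaskOutputSketch(raw=f"attempt {len(attempts)}")

def guardrail(out: TaskOutputSketch) -> bool:
    return out.raw != "attempt 1"        # reject only the first attempt

result = execute_with_guardrail(run, guardrail, counter)
# result.token_usage == 200  (both attempts counted)
```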


    tokens_after = self._get_agent_token_usage(agent or self.agent)

    # Calculate and store the delta in the result
    result.token_usage = self._calculate_token_delta(tokens_before, tokens_after)


Redundant token tracking in async execution path

Low Severity

Token tracking happens twice when _execute_task_async calls _execute_core. Both methods capture tokens_before and tokens_after, and both set token_usage. Since _execute_core returns a TaskOutput with token_usage already set, the subsequent overwrite in _execute_task_async is redundant. Both run in the same thread, so the "within thread" rationale doesn't justify the duplication.

Additional Locations (1)


    # Capture token usage after execution
    tokens_after = self._get_agent_token_usage(agent)
    token_delta = self._calculate_token_delta(tokens_before, tokens_after)



Token capture misses output conversion LLM calls

Medium Severity

The tokens_after snapshot is captured BEFORE _export_output(result) is called, but _export_output can make LLM calls through the Converter class when converting task output to Pydantic/JSON models. When the raw result isn't valid JSON and needs LLM-assisted conversion, those additional LLM calls consume tokens that are not included in the task's token_usage. The token capture needs to happen after output conversion completes.

Additional Locations (1)
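The reordering this comment suggests can be sketched as follows, with illustrative names (the real code would involve `_export_output` and the agent's LLM counter): the "after" snapshot is read only once conversion completes, so LLM-assisted conversion tokens land in the same per-task delta.

```python
def execute_and_convert(run, convert, counter):
    before = counter["total"]
    raw = run()                  # main task execution consumes tokens
    converted = convert(raw)     # conversion may also call the LLM
    after = counter["total"]     # snapshot AFTER conversion completes
    return converted, after - before

counter = {"total": 0}

def run():
    counter["total"] += 150      # simulated task-execution tokens
    return "not-valid-json"

def convert(raw: str):
    counter["total"] += 50       # simulated LLM-assisted parse tokens
    return {"text": raw}

converted, delta = execute_and_convert(run, convert, counter)
# delta == 200, including the 50 conversion tokens
```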


@lorenzejay lorenzejay self-assigned this Jan 27, 2026
@lorenzejay (Collaborator)

Can you add a test for this using the test case you mentioned above? Please include @pytest.mark.vcr() on the test as well.



Development

Successfully merging this pull request may close these issues.

Token tracking race condition in threading-based async_execution
