Fix batch embedding averaging for batch_size > 1 #3837
Closed
Chessing234 wants to merge 1 commit into lm-sys:main
Conversation
token_num was computed as a single scalar summing all tokens across the entire batch, then used to divide each per-sequence embedding. This caused incorrect averaging when batch_size > 1, since every sequence was divided by the batch-wide total instead of its own token count. Change token_num to a per-sequence tensor via attention_mask.sum(dim=1, keepdim=True) so each sequence is divided by its own token count.

Fixes lm-sys#3785

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
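A minimal sketch of the bug and the fix described above; the tensor names (`hidden`, `masked`, `summed`) are illustrative stand-ins, not the exact variables in model_worker.py:

```python
import torch

hidden = torch.randn(2, 5, 8)                       # (batch, seq_len, hidden)
attention_mask = torch.tensor([[1, 1, 1, 0, 0],
                               [1, 1, 1, 1, 1]])    # seq 0: 3 tokens, seq 1: 5

masked = hidden * attention_mask.unsqueeze(-1)      # zero out padding positions
summed = masked.sum(dim=1)                          # (batch, hidden)

# Before: one scalar for the whole batch -- both sequences divided by 8.
token_num_scalar = torch.sum(attention_mask).item()
wrong_mean = summed / token_num_scalar

# After: per-sequence counts, shape (batch, 1), broadcast over the hidden dim.
token_num = attention_mask.sum(dim=1, keepdim=True)
correct_mean = summed / token_num
```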
Author

Closing in favor of #3839 (duplicate)
Summary
- In `model_worker.py`, when `batch_size > 1`, `token_num` was computed as a single scalar (`torch.sum(attention_mask).item()`) summing tokens across the entire batch, causing each sequence's embedding to be divided by the total token count of all sequences instead of its own
- Changed to `attention_mask.sum(dim=1, keepdim=True)` so each sequence is correctly normalized by its own token count

Fixes #3785
Test plan
- `ret["token_num"]` still returns the correct total token count as a scalar

🤖 Generated with Claude Code
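A small sketch of that check, reusing the `attention_mask`/`token_num` names from the fix; `ret` here is a hypothetical stand-in for the worker's response dict:

```python
import torch

# Per-sequence counts normalize the embeddings (the fix), while the
# reported token count stays a batch-wide scalar, matching the old value.
attention_mask = torch.tensor([[1, 1, 1, 0],
                               [1, 1, 1, 1]])
token_num = attention_mask.sum(dim=1, keepdim=True)  # tensor([[3], [4]])

ret = {"token_num": int(token_num.sum().item())}     # 7, a plain Python int
assert ret["token_num"] == torch.sum(attention_mask).item()
```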