Fix default token metadata for high language IDs by EduardF1 · Pull Request #319144 · microsoft/vscode

EduardF1 · 2026-05-30T14:55:39Z

Fixes #319118

Masks the encoded language ID before packing default token metadata so IDs above 255 cannot spill into the token type bits. Adds a regression test that exercises a language ID >= 256 and verifies the default token type stays Other.
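To illustrate the overflow described above, here is a minimal sketch. It assumes the language ID occupies the low 8 bits of the metadata word and the token type sits in the two bits directly above it. The constant and function names are illustrative stand-ins defined locally, not imports from the VS Code source, and the real default metadata packs additional fields that are omitted here.

```typescript
// Assumed bit layout (illustrative, defined locally for this sketch):
// bits 0-7 hold the language ID, bits 8-9 hold the token type.
const SKETCH_LANGUAGEID_MASK = 0xff;
const SKETCH_LANGUAGEID_OFFSET = 0;
const SKETCH_TOKEN_TYPE_MASK = 0x300;
const SKETCH_TOKEN_TYPE_OFFSET = 8;

// Stand-in for the token type values; Other is the expected default.
const SketchTokenType = { Other: 0, Comment: 1, String: 2, RegEx: 3 } as const;

// Packs without masking: bits of a language ID above 255 land in the token type field.
function packUnmasked(languageId: number): number {
	return ((languageId << SKETCH_LANGUAGEID_OFFSET)
		| (SketchTokenType.Other << SKETCH_TOKEN_TYPE_OFFSET)) >>> 0;
}

// Packs with masking: only the low 8 bits of the language ID are kept.
function packMasked(languageId: number): number {
	return (((languageId << SKETCH_LANGUAGEID_OFFSET) & SKETCH_LANGUAGEID_MASK)
		| (SketchTokenType.Other << SKETCH_TOKEN_TYPE_OFFSET)) >>> 0;
}

// Reads the token type field back out of a packed metadata word.
function tokenTypeOf(metadata: number): number {
	return (metadata & SKETCH_TOKEN_TYPE_MASK) >>> SKETCH_TOKEN_TYPE_OFFSET;
}

const highLanguageId = 256; // first ID that no longer fits in 8 bits
console.log(tokenTypeOf(packUnmasked(highLanguageId))); // 1 (no longer Other)
console.log(tokenTypeOf(packMasked(highLanguageId)));   // 0 (Other)
```

Note that in this sketch masking protects the token type bits but also truncates the stored language ID (256 becomes 0), so it prevents corruption of neighbouring fields rather than making IDs above 255 representable.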

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Fixes incorrect default token metadata generation for large language IDs by preventing language ID bits from bleeding into token type bits, and adds a regression test for the scenario.

Changes:

- Mask topLevelLanguageId when composing default token metadata to keep token type bits correct.
- Add regression test covering high language ID values and validating StandardTokenType.Other is preserved.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| src/vs/editor/common/tokens/contiguousTokensStore.ts | Masks language ID bits when computing default metadata to avoid corrupting token type bits. |
| src/vs/editor/test/common/model/tokensStore.test.ts | Adds regression test ensuring default metadata yields `StandardTokenType.Other` for high language IDs. |

EduardF1 · 2026-05-31T10:42:12Z

+		let languageId = '';
+		for (let i = 0; i < 255; i++) {
+			languageId = `language-${i}`;
+			codec.register(languageId);
+		}
+
+		const encodedLanguageId = codec.encodeLanguageId(languageId);
+		assert.ok(encodedLanguageId >= 256);


Updated the test to register languages until the encoded ID exceeds MetadataConsts.LANGUAGEID_MASK, so it no longer depends on a hard-coded registration count or codec starting index. Pushed in 31e4cd0.
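The registration strategy described in this reply might look roughly like the sketch below. `SketchCodec` is a hypothetical stand-in with `register` and `encodeLanguageId` methods shaped like the calls in the diff above; it is not the VS Code codec, and its starting index of 1 is an arbitrary assumption that the loop deliberately does not rely on.

```typescript
// Hypothetical stand-in codec: assigns increasing numeric IDs to language names.
class SketchCodec {
	private readonly ids = new Map<string, number>();
	private nextId = 1; // assumed starting index; the loop below does not depend on it

	register(language: string): void {
		if (!this.ids.has(language)) {
			this.ids.set(language, this.nextId++);
		}
	}

	encodeLanguageId(language: string): number {
		return this.ids.get(language) ?? 0;
	}
}

// Assumed 8-bit language ID mask, defined locally for this sketch.
const SKETCH_ID_MASK = 0xff;

// Registers placeholder languages until the most recent encoded ID exceeds the mask,
// then returns that ID. No registration count is hard-coded.
function registerUntilAboveMask(codec: SketchCodec): number {
	let encoded = 0;
	for (let i = 0; encoded <= SKETCH_ID_MASK; i++) {
		const language = `language-${i}`;
		codec.register(language);
		encoded = codec.encodeLanguageId(language);
	}
	return encoded;
}

const highId = registerUntilAboveMask(new SketchCodec());
console.log(highId > SKETCH_ID_MASK); // true
```

Looping on the encoded value rather than on a fixed count keeps the precondition (an ID above the mask) true even if the codec pre-registers languages or starts numbering elsewhere.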

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

EduardF1 · 2026-06-01T09:52:22Z

@microsoft-github-policy-service agree

Fix default token metadata for high language IDs

e81d6e7

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings May 30, 2026 14:55

Copilot AI reviewed May 30, 2026

vs-code-engineering Bot assigned alexdima May 30, 2026

test: avoid hard-coded language ID threshold

31e4cd0

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

EduardF1 wants to merge 2 commits into microsoft:main from EduardF1:fix-319118-languageid-mask

3 participants
