
change(web): adjust TokenizationCorrector spec 🚂#15955

Draft
jahorton wants to merge 18 commits into change/web/multi-token-prediction-intermediates from change/web/adjust-tokenization-corrector-spec


@jahorton
Contributor

Build-bot: skip build:web
Test-bot: skip

jahorton added 18 commits May 7, 2026 13:06
Rather than copying over part of the existing context just to delete it, we can simplify prediction calls by pre-deleting the current token, then applying any relevant deleteLeft transform component to the resulting predictions afterward.

Build-bot: skip build:web
Test-bot: skip
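
As a rough illustration of the pre-deletion idea described in the commit message above, here is a minimal sketch. It assumes a simplified `Transform` shape (`insert` plus `deleteLeft`) and a prefix-matching word list in place of a real model; `lookup`, `predictWithPreDeletion`, and `applyTransform` are hypothetical names, not the engine's API. The context handed to the lookup already has the current token removed, and each resulting prediction carries the `deleteLeft` needed to erase that token from the real text.

```typescript
// Hypothetical, simplified shapes for illustration only.
interface Transform {
  insert: string;
  deleteLeft: number; // counted in UTF-16 code units here, for simplicity
}

interface Suggestion {
  transform: Transform;
  displayAs: string;
}

// Hypothetical stand-in for a model lookup: completes `prefix` from a word list.
// `_context` is the pre-deleted context; this toy lookup does not use it.
function lookup(_context: string[], prefix: string, lexicon: string[]): string[] {
  return lexicon.filter((word) => word.indexOf(prefix) === 0);
}

// Pre-delete the current (final) token, predict against what remains, then
// attach the deleteLeft that removes the token's text to every prediction.
function predictWithPreDeletion(tokens: string[], lexicon: string[]): Suggestion[] {
  if (tokens.length === 0) {
    return [];
  }
  const current = tokens[tokens.length - 1];
  const context = tokens.slice(0, -1); // context with the current token removed
  const deleteLeft = current.length;

  return lookup(context, current, lexicon).map((word) => ({
    transform: { insert: word, deleteLeft },
    displayAs: word,
  }));
}

// Applies a transform to the text before the caret.
function applyTransform(text: string, t: Transform): string {
  return text.slice(0, text.length - t.deleteLeft) + t.insert;
}
```

Under these assumptions, predicting for the tokens `["the", "qui"]` yields suggestions whose transforms delete the three characters of `qui` before inserting the completed word.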
This method determines which range of tokens, within each context variant, should be eligible for correction when generating predictions and corrections.

Build-bot: skip build:web
Test-bot: skip
To better handle inputs that shift the word-boundary in some custom models and models released before Keyman 14.0, this PR provides generalized re-use of the whitespace-based token-transition algorithm used for our most prominently-supported models.

Build-bot: skip build:web
Test-bot: skip
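
The commit message above does not show the whitespace-based token-transition algorithm itself, so the following is only a sketch of one plausible reading. It assumes a tokenizer that splits text into alternating runs of whitespace and non-whitespace, and it describes a transition as the tokens removed and added after the first point of difference. `tokenize`, `TokenTransition`, and `tokenTransition` are hypothetical names.

```typescript
// Hypothetical whitespace tokenizer: each run of whitespace and each run of
// non-whitespace becomes its own token.
function tokenize(text: string): string[] {
  return text.match(/\s+|\S+/g) || [];
}

// Hypothetical description of how an input changed the tokenization.
interface TokenTransition {
  firstChanged: number; // index of the first token that differs
  removed: string[];    // tokens from the old tokenization that were replaced
  added: string[];      // tokens from the new tokenization that replace them
}

// Compares the tokenization before and after an input and reports the tail
// of tokens that changed. Tokens before `firstChanged` are shared by both.
function tokenTransition(before: string, after: string): TokenTransition {
  const oldTokens = tokenize(before);
  const newTokens = tokenize(after);

  let i = 0;
  while (i < oldTokens.length && i < newTokens.length && oldTokens[i] === newTokens[i]) {
    i++;
  }

  return {
    firstChanged: i,
    removed: oldTokens.slice(i),
    added: newTokens.slice(i),
  };
}
```

With this sketch, an input that turns `hello wor` into `hello world ` shifts the word boundary: the token `wor` is replaced by `world` followed by a new whitespace token.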
…rection

Build-bot: skip build:web
Test-bot: skip
Lack of this string can break auto-correction and casing behaviors - and actually _did_ within engine/main!
…to change/web/rework-traversalless-prediction
It turns out that #15766 did not perfectly address all cases for generation of default "keep" suggestions.  This PR will remedy the situation.

Build-bot: skip build:web
Test-bot: skip
This reorganizes the type formerly known as CorrectionPredictionTuple, preparing it to share similarities with a new incoming type handling an earlier, tokenized intermediate stage that will be needed for some aspects of suggestion generation.

Build-bot: skip build:web
Test-bot: skip
Build-bot: skip build:web
Test-bot: skip
…e correction support

Converts early uses of CompositedPredictionData to TokenizedPredictionData to facilitate important token-based aspects of whitespace correction support, such as case-handling.

Build-bot: skip build:web
Test-bot: skip
Build-bot: skip build:web
Test-bot: skip
@keymanapp-test-bot

keymanapp-test-bot Bot commented May 13, 2026

User Test Results

Test specification and instructions

User tests are not required

Test Artifacts

  • Web
    • KeymanWeb Test Home - build : all tests passed (no artifacts on BuildLevel "build")

@keymanapp-test-bot keymanapp-test-bot Bot changed the title from "change(web): adjust TokenizationCorrector spec" to "change(web): adjust TokenizationCorrector spec 🚂" May 13, 2026
@keymanapp-test-bot keymanapp-test-bot Bot added this to the A19S29 milestone May 13, 2026
@github-actions github-actions Bot added the "change" label (Minor change in functionality, but not new) May 13, 2026
@keyman-server keyman-server removed this from the A19S29 milestone May 23, 2026
@keyman-server keyman-server added this to the A19S30 milestone May 23, 2026
@jahorton jahorton force-pushed the change/web/multi-token-prediction-intermediates branch from b763609 to 71a9162 Compare May 29, 2026 21:40

Labels

change (Minor change in functionality, but not new), epic-autocorrect, web/predictive-text/, web/

Projects

Status: Todo

2 participants