Skip to content

fix: replace delisted x-ai/grok-4-fast default with the live openai/gpt-4.1-mini#246

Open
Fearvox wants to merge 1 commit into
EverMind-AI:mainfrom
Fearvox:fix/llm-model-default-delisted-grok
Open

fix: replace delisted x-ai/grok-4-fast default with the live openai/gpt-4.1-mini#246
Fearvox wants to merge 1 commit into
EverMind-AI:mainfrom
Fearvox:fix/llm-model-default-delisted-grok

Conversation

@Fearvox
Copy link
Copy Markdown
Collaborator

@Fearvox Fearvox commented Jun 3, 2026

What

The out-of-box LLM model default is a delisted OpenRouter model, so a new
user who copies env.template, fills in a real OPENROUTER_API_KEY, and runs
EverCore fails on the first LLM call.

x-ai/grok-4-fast is no longer served on OpenRouter:

  • It is absent from the OpenRouter model catalog (GET https://openrouter.ai/api/v1/models — 343 models; the only grok ids are
    x-ai/grok-build-0.1, x-ai/grok-4.3, x-ai/grok-4.20,
    x-ai/grok-4.20-multi-agent).
  • GET …/models/x-ai/grok-4-fast/endpoints returns 0 serving endpoints, so
    a chat-completions request with it gets no providers and fails.

src/memory_layer/llm/llm_provider.py:45 forwards LLM_MODEL verbatim, so the
template value reaches OpenRouter unchanged.

This PR replaces the dead id with openai/gpt-4.1-mini — which is the code's
own hardcoded DEFAULT_LLM_MODEL (llm_provider.py:9) and has live OpenRouter
endpoints — in the three places the dead id is shipped:

  • methods/EverCore/env.template (the runtime default)
  • methods/EverCore/docs/dev_docs/getting_started.md (setup example)
  • methods/EverCore/docs/usage/CONFIGURATION_GUIDE.md (the cost-effective-model example)

After this change the template default agrees with the code default, so a
fresh key-only setup works on the first call.

Why this id

openai/gpt-4.1-mini is already DEFAULT_LLM_MODEL in the code, is live on
OpenRouter, and is cost-effective — so template, docs, and code all agree.

Scope

Three files, one dead-id → live-id swap each. No code change. Intentionally
not touched: tests/test_llm_switching_e2e.py uses x-ai/grok-4-fast
deliberately as a disallowed model to assert white-list rejection — a delisted
id is still a valid "disallowed" example there, so its meaning is preserved.

Verification

OpenRouter catalog checked live (2026-06-02): x-ai/grok-4-fast not present;
openai/gpt-4.1-mini present. A running EverCore configured with
openai/gpt-4.1-mini serves normally.

Co-Authored-By: Claude Opus 4.8 (1M context) noreply@anthropic.com

🤖 Generated with Claude Code

…pt-4.1-mini

The shipped default LLM model id is delisted on OpenRouter, so a fresh setup
(copy env.template, add a real OPENROUTER_API_KEY, run) fails on the first LLM
call. x-ai/grok-4-fast is absent from the OpenRouter catalog and its endpoints
list is empty (0 serving providers); llm_provider.py forwards LLM_MODEL verbatim,
so the dead default reaches OpenRouter unchanged.

Replace it with openai/gpt-4.1-mini, which is the code's own DEFAULT_LLM_MODEL
(llm_provider.py:9) and is live on OpenRouter, in the three places the dead id is
shipped: env.template (runtime default), docs/dev_docs/getting_started.md (setup
example), and docs/usage/CONFIGURATION_GUIDE.md (cost-effective-model example).
Template default now agrees with the code default.

Leaves tests/test_llm_switching_e2e.py untouched: it uses x-ai/grok-4-fast on
purpose as a disallowed model to assert white-list rejection, and a delisted id
is still a valid "disallowed" example there.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings June 3, 2026 06:58
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants