Skip to content

GLM-4.7-Flash is incoherent in further generations (KCPP 1.107 dev build) #1943

@aoleg

Description

@aoleg

Just reporting that https://github.com/LostRuins/koboldcpp/actions/runs/21299565080 introduced a bug causing garbled thinking/output at least on GLM 4.7 Flash and GLM 4.5 Air models regardless of whether Flash Attention is on or off. Rolling back to https://github.com/LostRuins/koboldcpp/actions/runs/21210595986 fixes the issue, so it was likely something merged in between. Could it be b70d251 ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions