Skip to content

fix(gemma4_unified): conversion script and config bugs#46398

Merged
vasqu merged 2 commits into
huggingface:mainfrom
douglas-reid:fix-minor-unified-bugs
Jun 4, 2026
Merged

fix(gemma4_unified): conversion script and config bugs#46398
vasqu merged 2 commits into
huggingface:mainfrom
douglas-reid:fix-minor-unified-bugs

Conversation

@douglas-reid
Copy link
Copy Markdown
Contributor

Address issues uncovered with weights conversion scripts.

  • I confirm that this is not a pure code agent PR.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @molbap

Copy link
Copy Markdown
Member

@pcuenca pcuenca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@pcuenca pcuenca requested review from ArthurZucker and molbap June 4, 2026 06:55
Copy link
Copy Markdown
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nothing really actionnable (no patch needed as config is mostly on the hub and conversion scripts are not shipped part of the release)

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

head_dim: int = 256
hidden_activation: str = "gelu_pytorch_tanh"
max_position_embeddings: int = 131_072
max_position_embeddings: int = 262_144
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You will need to modify the modular (that's why the CI is failing)

On the hub we still have 131_072, https://huggingface.co/google/gemma-4-12B-it/blob/main/config.json#L109 - should this be updated?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I think it should.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

modular now updated to match.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 4, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma4_unified

@vasqu vasqu enabled auto-merge June 4, 2026 12:16
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 4, 2026

CI Dashboard: View test results in Grafana

@vasqu vasqu added this pull request to the merge queue Jun 4, 2026
Merged via the queue into huggingface:main with commit ece3b9a Jun 4, 2026
22 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants