Skip to content

Bringup Qwen2.5-1.5B#3506

Open
ChingTsai wants to merge 1 commit intomainfrom
jimmytsai/bring-up-qwen2_5-1_5b
Open

Bringup Qwen2.5-1.5B#3506
ChingTsai wants to merge 1 commit intomainfrom
jimmytsai/bring-up-qwen2_5-1_5b

Conversation

@ChingTsai
Copy link
Copy Markdown
Collaborator

@ChingTsai ChingTsai commented Mar 26, 2026

Description

  • Bringup Qwen2.5-1.5B

FIXES: b/495594907

Tests

Maxtext -> HF

python3 -m tests.utils.forward_pass_logit_checker src/maxtext/con
figs/base.yml run_name=forward_pass_test_scanned model_name=qwen2.5-1.5b tokenizer_path=Qwen/Qwen2.5-1.5B-Instruct load_parameters_path=XXXX max_prefill_predict_length=4 max_target_length=8 dataset_type=synthetic scan_layers=true per_device_batch_size=1 skip_jax_distributed_system=True dtype=float32 --max_kl_div=0.015  --run_hf_model=True 
--hf_model_path=Qwen/Qwen2.5-1.5B-Instruct

Scanned
Unscanned

HF -> Maxtext


python3 -m tests.utils.hf_checkpoint_conversion_checker --original_ckpt=hf_cache/hub/models--Qwen--Qwen2.5-1.5B-Instruct/snapshots/989aa7980e4cf806f80c7fef2b1adb7bc71aa306 --converted_ckpt=qwen2.5-1.5b/hf_from_scanned

Log

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

@codecov
Copy link
Copy Markdown

codecov bot commented Mar 26, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@ChingTsai ChingTsai force-pushed the jimmytsai/bring-up-qwen2_5-1_5b branch from b4688d0 to e35f17f Compare March 26, 2026 02:30
@ChingTsai ChingTsai changed the title bringup qwen2.5-1.5B Bringup qwen2.5-1.5B Mar 26, 2026
@github-actions
Copy link
Copy Markdown

🤖 Hi @ChingTsai, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

@github-actions
Copy link
Copy Markdown

🤖 I'm sorry @ChingTsai, but I was unable to process your request. Please see the logs for more details.

@github-actions
Copy link
Copy Markdown

🤖 I'm sorry @ChingTsai, but I was unable to process your request. Please see the logs for more details.

Copy link
Copy Markdown
Collaborator

@RissyRan RissyRan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@ChingTsai ChingTsai force-pushed the jimmytsai/bring-up-qwen2_5-1_5b branch from e35f17f to 387df2d Compare March 27, 2026 08:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants