Skip to content

[AMD][MI355X] update model for gpt-oss #1638

Draft
ukannika wants to merge 2 commits into
SemiAnalysisAI:mainfrom
ukannika:amd/update_gpt_oss_mi355x_config
Draft

[AMD][MI355X] update model for gpt-oss #1638
ukannika wants to merge 2 commits into
SemiAnalysisAI:mainfrom
ukannika:amd/update_gpt_oss_mi355x_config

Conversation

@ukannika
Copy link
Copy Markdown
Collaborator

@ukannika ukannika commented Jun 2, 2026

Switch moe backend to CK to leverage the benefits from this PR.
vllm-project/vllm#42098


Note

Medium Risk
Changes which weights the MI355X GPT-OSS sweep loads and serves; throughput and eval numbers may shift versus the prior AMD checkpoint even though sweep geometry is unchanged.

Overview
Switches the MI355X GPT-OSS FP4 vLLM benchmark (gptoss-fp4-mi355x-vllm) from the AMD MXFP4 checkpoint amd/gpt-oss-120b-w-mxfp4-a-fp8 to the upstream Hugging Face weights openai/gpt-oss-120b, while keeping the same image (vllm/vllm-openai-rocm:v0.22.0), runner, precision label, and fixed-seq-len TP/concurrency sweep.

Documents the change in perf-changelog.yaml for config key gptoss-fp4-mi355x-vllm (PR #1638). The PR description ties this to using the CK MoE backend in vLLM (vllm#42098); that behavior is not altered in this diff—only the configured model id and changelog entry change.

Reviewed by Cursor Bugbot for commit a30921f. Bugbot is set up for automated code reviews on this repo. Configure here.

Copy link
Copy Markdown
Contributor

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@chunfangamd
Copy link
Copy Markdown
Collaborator

/sweep test-config --config-files .github/configs/amd-master.yaml --config-keys gptoss-fp4-mi355x-vllm

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jun 2, 2026

@chunfangamd Kicking off a sweep.

Run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/26845310867
Command: test-config --config-files .github/configs/amd-master.yaml --config-keys gptoss-fp4-mi355x-vllm
Pinned ref: decdda7
Approval: not required (trusted collaborator).

@chunfangamd chunfangamd force-pushed the amd/update_gpt_oss_mi355x_config branch from decdda7 to fd1594b Compare June 5, 2026 02:49
Record the GPT-OSS MI355X vLLM model update
(amd/gpt-oss-120b-w-mxfp4-a-fp8 -> openai/gpt-oss-120b).

Co-authored-by: Cursor <cursoragent@cursor.com>
@chunfangamd chunfangamd marked this pull request as draft June 5, 2026 03:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

2 participants