[AMD][MI355X] update model for gpt-oss by ukannika · Pull Request #1638 · SemiAnalysisAI/InferenceX

ukannika · 2026-06-02T15:36:39Z

Switch moe backend to CK to leverage the benefits from this PR.
vllm-project/vllm#42098

Note

Medium Risk
Changes which weights the MI355X GPT-OSS sweep loads and serves; throughput and eval numbers may shift versus the prior AMD checkpoint even though sweep geometry is unchanged.

Overview
Switches the MI355X GPT-OSS FP4 vLLM benchmark (gptoss-fp4-mi355x-vllm) from the AMD MXFP4 checkpoint amd/gpt-oss-120b-w-mxfp4-a-fp8 to the upstream Hugging Face weights openai/gpt-oss-120b, while keeping the same image (vllm/vllm-openai-rocm:v0.22.0), runner, precision label, and fixed-seq-len TP/concurrency sweep.

Documents the change in perf-changelog.yaml for config key gptoss-fp4-mi355x-vllm (PR #1638). The PR description ties this to using the CK MoE backend in vLLM (vllm#42098); that behavior is not altered in this diff—only the configured model id and changelog entry change.

^{Reviewed by Cursor Bugbot for commit a30921f. Bugbot is set up for automated code reviews on this repo. Configure here.}

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

chunfangamd · 2026-06-02T20:12:50Z

/sweep test-config --config-files .github/configs/amd-master.yaml --config-keys gptoss-fp4-mi355x-vllm

github-actions · 2026-06-02T20:13:07Z

@chunfangamd Kicking off a sweep.

Run: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/26845310867
Command: test-config --config-files .github/configs/amd-master.yaml --config-keys gptoss-fp4-mi355x-vllm
Pinned ref: decdda7
Approval: not required (trusted collaborator).

Record the GPT-OSS MI355X vLLM model update (amd/gpt-oss-120b-w-mxfp4-a-fp8 -> openai/gpt-oss-120b). Co-authored-by: Cursor <cursoragent@cursor.com>

ukannika requested a review from a team June 2, 2026 15:36

ukannika requested review from 1am9trash, billishyahao, chunfangamd, seungrokj and yctseng0211 as code owners June 2, 2026 15:36

github-project-automation Bot added this to InferenceMAX Board Jun 2, 2026

claude Bot reviewed Jun 2, 2026

View reviewed changes

Update amd-master.yaml

fd1594b

chunfangamd force-pushed the amd/update_gpt_oss_mi355x_config branch from decdda7 to fd1594b Compare June 5, 2026 02:49

chunfangamd added the full-sweep-enabled label Jun 5, 2026

Add perf-changelog entry for SemiAnalysisAI#1638

a30921f

Record the GPT-OSS MI355X vLLM model update (amd/gpt-oss-120b-w-mxfp4-a-fp8 -> openai/gpt-oss-120b). Co-authored-by: Cursor <cursoragent@cursor.com>

cursor Bot mentioned this pull request Jun 5, 2026

[AMD][MI355X] update model for gpt-oss #1670

Merged

chunfangamd marked this pull request as draft June 5, 2026 03:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMD][MI355X] update model for gpt-oss #1638

[AMD][MI355X] update model for gpt-oss #1638
ukannika wants to merge 2 commits into
SemiAnalysisAI:mainfrom
ukannika:amd/update_gpt_oss_mi355x_config

ukannika commented Jun 2, 2026 •

edited by cursor Bot

Loading

Uh oh!

claude Bot left a comment

Uh oh!

chunfangamd commented Jun 2, 2026

Uh oh!

github-actions Bot commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ukannika commented Jun 2, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

chunfangamd commented Jun 2, 2026

Uh oh!

github-actions Bot commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ukannika commented Jun 2, 2026 •

edited by cursor Bot

Loading