refactor(cuda & rocm): bump vllm to v0.20.2 by gitlawr · Pull Request #157 · gpustack/runner

gitlawr · 2026-05-21T09:24:37Z

No description provided.

gemini-code-assist

Code Review

This pull request updates the vLLM version from 0.20.1 to 0.20.2 across the project's Dockerfiles and build matrix configuration. It also modifies the Makefile to use the JOB_DOCKERFILE variable for specifying Dockerfile paths, allowing for more flexible build configurations. I have no feedback to provide.

Copilot

Pull request overview

Updates the container packaging configuration to build vLLM images against vLLM v0.20.2 for both AMD ROCm and NVIDIA CUDA, aligning default Dockerfile args and the build matrix; also fixes the packaging build to honor service-specific Dockerfiles.

Changes:

- Bump vLLM base image tags and `VLLM_VERSION` from 0.20.1 to 0.20.2 for ROCm and CUDA packaging.
- Update `pack/matrix.yaml` build args to use the v0.20.2 images across ROCm and CUDA variants.
- Make `make package` use the computed service-specific Dockerfile path (`Dockerfile.<service>`) when present.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| `pack/rocm/Dockerfile.vllm` | Bumps default vLLM base image/version args to v0.20.2 for ROCm builds. |
| `pack/cuda/Dockerfile.vllm` | Bumps default vLLM base image/version args to v0.20.2 for CUDA builds. |
| `pack/matrix.yaml` | Updates matrix build args to vLLM 0.20.2 and corresponding base image tags. |
| `Makefile` | Uses `JOB_DOCKERFILE` so packaging builds can select `Dockerfile.<service>` when available. |

      - "linux/amd64"
    args:
      - "ROCM_VERSION=7.2.1"
-      - "VLLM_VERSION=0.20.1"
-      - "VLLM_BASE_IMAGE=vllm/vllm-openai-rocm:v0.20.1"
+      - "VLLM_VERSION=0.20.2"
+      - "VLLM_BASE_IMAGE=vllm/vllm-openai-rocm:v0.20.2"
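The hunk above changes two args that have to move together: `VLLM_VERSION` and the tag on `VLLM_BASE_IMAGE`. A minimal sketch of a consistency check follows, assuming the tag convention visible in this ROCm entry (`v` followed by the version). The function name `vllm_args_consistent` is hypothetical and not part of the repository, and other matrix entries may use tags with extra suffixes that this check would reject.

```python
def vllm_args_consistent(args: list[str]) -> bool:
    """Return True when the VLLM_BASE_IMAGE tag equals 'v' + VLLM_VERSION.

    `args` mirrors the `args:` list of one matrix entry, e.g.
    ["ROCM_VERSION=7.2.1", "VLLM_VERSION=0.20.2", ...].
    Assumption: the image tag is exactly 'v<version>' with no suffix.
    """
    # Split each "KEY=VALUE" item once, so values may themselves contain "=".
    kv = dict(item.split("=", 1) for item in args)
    version = kv["VLLM_VERSION"]
    # Everything after the last ":" in the image reference is the tag.
    _, _, tag = kv["VLLM_BASE_IMAGE"].rpartition(":")
    return tag == f"v{version}"


new_args = [
    "ROCM_VERSION=7.2.1",
    "VLLM_VERSION=0.20.2",
    "VLLM_BASE_IMAGE=vllm/vllm-openai-rocm:v0.20.2",
]
print(vllm_args_consistent(new_args))
```

A check like this would flag a half-applied bump, where the version arg is updated but the base image tag is left on the previous release.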


gitlawr added 2 commits May 21, 2026 15:19

refactor(cuda/rocm): bump vllm to 0.20.2

478b4b2

Signed-off-by: gitlawr <lawrleegle@gmail.com>

fix(make): honor service-specific Dockerfile in package target

86022f2

Match the workflow behavior in .github/workflows/pack.yml: when pack/<backend>/Dockerfile.<service> exists, use it instead of the backend's main Dockerfile.

Signed-off-by: gitlawr <lawrleegle@gmail.com>
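The selection rule described in this commit message can be sketched as follows. This is an illustration of the rule only, not the Makefile's actual implementation: the function name `select_dockerfile` is hypothetical, and it assumes the backend's main Dockerfile is named `Dockerfile` inside `pack/<backend>/`.

```python
from pathlib import Path


def select_dockerfile(pack_dir: Path, backend: str, service: str) -> Path:
    """Pick the Dockerfile for a (backend, service) pair.

    Prefers pack/<backend>/Dockerfile.<service> when that file exists,
    otherwise falls back to the backend's main Dockerfile.
    Assumption: the main Dockerfile is pack/<backend>/Dockerfile.
    """
    backend_dir = pack_dir / backend
    service_specific = backend_dir / f"Dockerfile.{service}"
    if service_specific.is_file():
        return service_specific
    return backend_dir / "Dockerfile"
```

For example, `select_dockerfile(Path("pack"), "cuda", "vllm")` would return `pack/cuda/Dockerfile.vllm` when that file is present, which matches the files this pull request touches.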

Copilot AI review requested due to automatic review settings May 21, 2026 09:24

Copilot started reviewing on behalf of gitlawr May 21, 2026 09:25

gemini-code-assist Bot reviewed May 21, 2026

Copilot AI reviewed May 21, 2026

Comment thread pack/matrix.yaml

Comment on lines 60 to +64

      - "linux/amd64"
    args:
      - "ROCM_VERSION=7.2.1"
-      - "VLLM_VERSION=0.20.1"
-      - "VLLM_BASE_IMAGE=vllm/vllm-openai-rocm:v0.20.1"
+      - "VLLM_VERSION=0.20.2"
+      - "VLLM_BASE_IMAGE=vllm/vllm-openai-rocm:v0.20.2"

gitlawr wants to merge 2 commits into gpustack:main from gitlawr:vllm20.2