[BugFix][ROCm] Prefer upstream PyTorch DLPack API in torch extension loader (#585)
Conversation
Code Review
This pull request refactors the PyTorch backend detection logic into helper functions within the torch_c_dlpack_ext and tvm_ffi modules to better support ROCm alongside CUDA and CPU. It also adds an exception handler for OSError during extension loading to gracefully handle incompatible prebuilt libraries and introduces new tests for GPU tensor metadata. The review feedback highlights potential AttributeError risks when accessing version attributes directly on PyTorch builds missing specific backend support, recommending the use of getattr for safer detection.
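The fallback chain the summary describes can be sketched as follows. This is a minimal illustration, not the module's real API: `load_torch_c_dlpack_ext`, `load_prebuilt`, and `load_jit` are hypothetical names standing in for the loader logic.

```python
def load_torch_c_dlpack_ext(load_prebuilt, load_jit):
    # Hypothetical sketch: try the prebuilt extension first; a missing
    # wheel (ImportError) or an incompatibly linked wheel (OSError)
    # both fall through to the JIT-compiled path.
    try:
        return load_prebuilt()
    except ImportError:
        pass  # no prebuilt wheel installed; keep trying JIT
    except OSError:
        pass  # prebuilt wheel present but linked against the wrong backend
    return load_jit()
```

Catching `OSError` here is what makes an incompatible prebuilt wheel degrade gracefully instead of aborting the import.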
def _torch_extension_suffix() -> str:
    """Return the backend suffix used by the prebuilt extension library."""
    if torch.cuda.is_available():
        return "rocm" if torch.version.hip is not None else "cuda"
use the same code as
def _torch_extension_device(torch_module: Any) -> str:
    """Return the torch backend name used in the optional extension library name."""
    if torch_module.cuda.is_available():
        if torch_module.version.cuda is not None:
            return "cuda"
        if torch_module.version.hip is not None:
            return "rocm"
        raise ValueError("Cannot determine whether to build with CUDA or ROCm.")
    return "cpu"

    # Keep trying JIT
    pass
except OSError:
    # A prebuilt torch-c-dlpack-ext wheel can be present but linked
assuming we detect correctly, is this still necessary?
Mainly want to know whether this is a temporary measure we can delete after a few version cycles; if so, please add a comment.
Hi @tqchen. The scope of this PR has shifted from fixing ROCm-specific Torch C DLPack extension loading to making tvm-ffi avoid loading the optional out-of-tree torch-c-dlpack-ext when newer PyTorch ROCm builds already expose a working `__dlpack_c_exchange_api__`.
tqchen left a comment
thanks, looking good after fixing some minor nits
def _torch_extension_device(torch_module: Any) -> str:
    """Return the torch backend name used in the optional extension library name."""
    if torch_module.cuda.is_available():
        if getattr(torch_module.version, "hip", None) is not None:
It would be good to use the following pattern, which also detects the CUDA version, to be robust:
if torch_module.cuda.is_available():
    if getattr(torch_module.version, "cuda", None) is not None:
        return "cuda"
    if getattr(torch_module.version, "hip", None) is not None:
        return "rocm"
    return "cuda"
return "cpu"
def _torch_extension_device(torch_module: Any) -> str:
    """Return the torch backend name used in the optional extension library name."""
    if torch_module.cuda.is_available():
        if getattr(torch_module.version, "cuda", None) is not None:
            return "cuda"
        if getattr(torch_module.version, "hip", None) is not None:
            return "rocm"
        return "cuda"
    return "cpu"
Motivation:

- When PyTorch exposes `__dlpack_c_exchange_api__`, tvm-ffi uses it directly on all backends.
- The torch-c-dlpack-ext path is only used as a fallback for older PyTorch builds that do not provide the API.

Tests are added for backend detection, ROCm short-circuit behavior when PyTorch already provides the API, and GPU tensor metadata through DLPack.
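The short-circuit in the first bullet can be sketched as a simple capability probe. Only the attribute name comes from this PR; where it actually hangs (on `torch.Tensor` here) and the `needs_dlpack_ext` helper are assumptions for illustration.

```python
from types import SimpleNamespace

def needs_dlpack_ext(torch_module) -> bool:
    # Skip the optional extension entirely when PyTorch already ships
    # the C DLPack exchange API; only older builds fall back to it.
    return getattr(torch_module.Tensor, "__dlpack_c_exchange_api__", None) is None

# Stub modules mimicking a new and an old PyTorch build:
new_torch = SimpleNamespace(Tensor=SimpleNamespace(__dlpack_c_exchange_api__=object()))
old_torch = SimpleNamespace(Tensor=SimpleNamespace())

assert not needs_dlpack_ext(new_torch)  # API present: use it directly
assert needs_dlpack_ext(old_torch)      # API absent: fall back to the extension
```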
Related PR: tile-ai/tilelang#2179, I have finished the A/B test locally.