Make GraphicsResource inherit from Buffer by rparolin · Pull Request #1701 · NVIDIA/cuda-python

rparolin · 2026-02-27T16:33:41Z

Summary

Refactors GraphicsResource to inherit from Buffer instead of wrapping a separate Buffer object, eliminating the _MappedBufferContext intermediary class
map() now populates the GraphicsResource itself with device pointer/size (since it IS-A Buffer) and returns self
Makes stream a required parameter for map() and unmap() (removes implicit default stream usage)
Adds stream parameter to from_gl_buffer() for a convenient register-and-map-in-one-step pattern
Renames handle property to resource_handle to avoid conflicting with Buffer.handle (which now exposes the mapped device pointer)

Motivation

Review feedback requested that GraphicsResource be a Buffer directly rather than producing a separate Buffer via map(). This simplifies the API surface: a mapped GraphicsResource can be passed directly anywhere a Buffer is accepted (e.g., StridedMemoryView.from_buffer()), without needing a wrapper context manager.

Key changes

GraphicsResource(Buffer) — inherits from Buffer; handle and size are valid while mapped
Removed _MappedBufferContext — no longer needed since GraphicsResource itself is the buffer
__enter__/__exit__ — moved onto GraphicsResource directly; __exit__ auto-unmaps using the stream from map()
stream required — map(stream=) and unmap(stream=) no longer default to stream 0
from_gl_buffer(stream=) — optional stream param to register + map in one call, enabling with GraphicsResource.from_gl_buffer(vbo, stream=s) as buf: pattern
close(stream=None) — accepts stream kwarg for compatibility with Buffer.close()
resource_handle — renamed from handle to avoid shadowing Buffer.handle

🤖 Generated with Claude Code

copy-pr-bot · 2026-02-27T16:33:44Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

copy-pr-bot · 2026-02-27T19:49:19Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

rparolin · 2026-02-27T19:49:25Z

/ok to test

github-actions · 2026-02-27T20:01:36Z

Doc Preview CI
🚀 View preview at https://nvidia.github.io/cuda-python/pr-preview/pr-1701/
https://nvidia.github.io/cuda-python/pr-preview/pr-1701/cuda-core/
https://nvidia.github.io/cuda-python/pr-preview/pr-1701/cuda-bindings/
https://nvidia.github.io/cuda-python/pr-preview/pr-1701/cuda-pathfinder/
Preview will be ready when the GitHub Pages deployment is complete.

rparolin · 2026-03-02T21:42:33Z

/ok to test

cpcloud · 2026-03-03T22:07:47Z

cuda_core/cuda/core/_graphics.pyx

            return
        if self._mapped:
-            # Best-effort unmap before unregister
+            # Best-effort unmap before unregister (use stream 0 as fallback)


Potential stream-ordering bug here: map()/unmap() now require an explicit stream, but close() always unmaps on stream 0 and ignores both _map_stream and the optional close(stream=...) arg.

If a resource was mapped on a non-default stream and then closed while mapped, unmap can be issued on the wrong stream, which may break ordering guarantees with in-flight work. Could we unmap using _map_stream (or the passed stream, with a clear fallback policy) instead of hard-coding stream 0?

cpcloud · 2026-03-03T22:07:53Z

cuda_core/cuda/core/_graphics.pyx

+        self._map_stream = None
+
+    def __enter__(self):
+        return self


__enter__ currently returns self even when the resource is not mapped. That allows patterns like with GraphicsResource.from_gl_buffer(vbo) as buf: (without stream=), where buf.handle/buf.size are not valid.

Could we either (a) raise in __enter__ when not self._mapped, or (b) require/perform mapping for the context-manager path to avoid this silent footgun?

cpcloud

Thanks for the refactor here — the GraphicsResource(Buffer) direction looks good overall.

Requesting changes for two behavior issues called out inline:

close() currently unmaps on stream 0 even when mapping happened on a non-default stream, which can break stream-ordering guarantees.
__enter__ returns self even when unmapped, allowing context-manager usage where handle/size are invalid.

Please address these and I’m happy to re-review.

cpcloud · 2026-03-04T14:45:25Z

ping @rparolin. happy to fix this up if you want.

leofang · 2026-03-04T17:52:38Z

Thanks for the refactor here — the GraphicsResource(Buffer) direction looks good overall.

Yeah and thinking about it more, I suspect this is a must:

If GL buffer registration is context-independent: This PR makes the UX nicer
If GL buffer registration is locked to the current CUDA context: We need to track the underlying device/context, so we either inherit from Buffer (this PR) or add Buffer.from_gl_buffer() constructor (and keep GraphicsResource internal).
- From the viewpoint of staying consistent across cuda.core, the latter might be worth considering too, but one challenge is that Buffer today does not have the map/unmap concept.
- I would be happy for you @rparolin @Andy-Jost @cpcloud to brainstorm and I can review later 🙂

I will ask internally if it's context-independent or not, but it's not a blocker.

Andy-Jost · 2026-03-04T19:00:06Z

Thanks for the discussion here. I'd like to propose an alternative approach that I think addresses both of Phillip's concerns and Leo's observation about Buffer not having map/unmap.

Proposal: Use the resource_handles module instead of modifying Buffer

We already have a C++ handle layer (cuda/core/_cpp/resource_handles.hpp) that provides RAII-based lifetime management with std::shared_ptr custom deleters. This pattern already solves the exact problems raised here:

Stream capture: DevicePtrHandle already captures the allocation stream in its deleter and uses it for cuMemFreeAsync. We can do the same for graphics resources - capture the map stream and use it for unmap.
Dependency tracking: Handles can hold references to other handles, preventing out-of-order destruction. A mapped buffer handle can hold a reference to the GraphicsResourceHandle, ensuring the graphics resource outlives the mapped pointer.
Arbitrary release actions: The shared_ptr deleter can chain actions - e.g., unmap then unregister.

Concrete design:

GraphicsResource.map(stream) returns a Buffer (or just the existing DevicePtrHandle under the hood)
The returned handle's deleter captures:
- The GraphicsResourceHandle (keeps it alive)
- The map StreamHandle (for stream-ordered unmap)
When the mapped buffer handle is released, the deleter calls cuGraphicsUnmapResources on the captured stream
GraphicsResource itself doesn't need to track _mapped state - the existence of mapped buffer handles implies mapped state

Benefits:

Buffer stays unchanged (no map/unmap concept needed)
Stream ordering is correct by construction (cpcloud's concern 1)
No invalid __enter__ state possible (cpcloud's concern 2) - you either have a valid mapped handle or you don't
Dependency tracking prevents use-after-free
Consistent with how we handle DeviceMemoryResource allocations

This is the same pattern we use for memory pool allocations, where the pointer handle captures the pool handle and stream for proper cleanup ordering.

Thoughts?

cpcloud · 2026-03-06T22:38:56Z

+1 to @Andy-Jost's design. Looks solid.

leofang · 2026-03-07T06:11:07Z

FYI

I will ask internally if it's context-independent or not

it is context-dependent.

Proposal: Use the resource_handles module instead of modifying Buffer

Modifying Buffer in what way? Sorry it's not clear to me.

Can we sketch a bit how the user code would look like with this proposal? The changes in this PR were motivated by the sketch in here: #1608 (comment). Rob asked the right question there. Inheritance is probably not necessary, as long as the same UX can be delivered.

rparolin added 2 commits February 27, 2026 07:33

removing the _buffer usage example from docstring

b65554a

fixes

3539bd8

fixes

dbc41f2

rparolin changed the title ~~Rparolin/graphics feedback fixes 1~~ Make GraphicsResource inherit from Buffer Feb 27, 2026

rparolin requested a review from leofang February 27, 2026 19:48

rparolin self-assigned this Feb 27, 2026

rparolin added the enhancement Any code-related improvements label Feb 27, 2026

rparolin added this to the cuda.core v0.7.0 milestone Feb 27, 2026

rparolin marked this pull request as ready for review February 27, 2026 19:49

rparolin closed this Feb 27, 2026

rparolin reopened this Feb 27, 2026

leofang added P0 High priority - Must do! cuda.core Everything related to the cuda.core module labels Feb 28, 2026

Merge branch 'main' into rparolin/graphics_feedback_fixes_1

f0ae028

rparolin enabled auto-merge (squash) March 3, 2026 16:11

cpcloud reviewed Mar 3, 2026

View reviewed changes

cpcloud requested changes Mar 3, 2026

View reviewed changes

leofang mentioned this pull request Mar 7, 2026

Add CUDA-Graphics (OpenGL) interop support to cuda.core #1608

Merged

Conversation

rparolin commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Key changes

Uh oh!

copy-pr-bot bot commented Feb 27, 2026

Uh oh!

copy-pr-bot bot commented Feb 27, 2026

Uh oh!

rparolin commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

rparolin commented Mar 2, 2026

Uh oh!

cpcloud Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cpcloud Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cpcloud left a comment

Choose a reason for hiding this comment

Uh oh!

cpcloud commented Mar 4, 2026

Uh oh!

leofang commented Mar 4, 2026

Uh oh!

Andy-Jost commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cpcloud commented Mar 6, 2026

Uh oh!

leofang commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rparolin commented Feb 27, 2026 •

edited

Loading

Andy-Jost commented Mar 4, 2026 •

edited

Loading

leofang commented Mar 7, 2026 •

edited

Loading