Support for peer-to-peer DMA using the DMA-BUF/P2PDMA framework on non-SoC platforms #1046

nullbytepl · 2026-03-03T15:52:44Z

nullbytepl
Mar 3, 2026

I'm currently working on modernizing a Linux driver for a certain data acquisition device that is basically just an FPGA that transfers data to a host PC using PCIe, after which it's processed using CUDA. Part of my work was to restore long-unused support for GPUDirect RDMA, which led me to find that:
a) peer-to-peer DMA is supposed to be handled using the P2PDMA framework in Linux these days;
b) the old method of doing P2P on Nvidia (linking to exported symbols in nvidia.ko) is basically impossible to consistently do across most of the OS/kernel configs I've tested;
c) Nvidia seems to support DMA-BUF with P2P in the open source drivers!

However, I couldn't get it working out of the box, which led me on a fairly long debugging journey, the conclusion of which was a modified driver, which was enough to get working peer-to-peer DMA on an Quadro RTX 5000 (Turing). All it took was to remove a few checks, which basically locked the feature down to SoCs with C2C interconnects (like Grace Superchip) or no framebuffer (GB10B/Thor).

Is there a reason for this limitation being present in the first place? Could Nvidia unlock it for non-SoC setups too, even if behind a flag for example? Even if it had similar limitations as GPUDirect RDMA, like only working with 1:1/disabled IOMMU, it would be better than a vendor-specific interface.

cc @pjarosik

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for peer-to-peer DMA using the DMA-BUF/P2PDMA framework on non-SoC platforms #1046

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Support for peer-to-peer DMA using the DMA-BUF/P2PDMA framework on non-SoC platforms #1046

Uh oh!

nullbytepl Mar 3, 2026

Replies: 0 comments

nullbytepl
Mar 3, 2026