Questions tagged [gpudirect]

Tag for NVIDIA GPUDirect

Using GPUDirect, multiple GPUs, third-party network adapters, solid-state drives (SSDs), and other devices can directly read and write CUDA host and device memory. This eliminates unnecessary memory copies, lowers CPU overhead, and reduces latency, which significantly improves data transfer times for applications running on NVIDIA Tesla™ and Quadro™ products.

16 questions

votes

3 answers

Does AMD's OpenCL offer something similar to CUDA's GPUDirect?

NVIDIA offers GPUDirect to reduce memory transfer overheads. I'm wondering if there is a similar concept for AMD/ATI? Specifically: 1) Do AMD GPUs avoid the second memory transfer when interfacing with network cards, as described here. In case…

asked Feb 15 '12 at 03:07

arrayfire

1,744
12
19

votes

1 answer

GPUDirect RDMA transfer from GPU to remote host

Scenario: I have two machines, a client and a server, connected with Infiniband. The server machine has an NVIDIA Fermi GPU, but the client machine has no GPU. I have an application running on the GPU machine that uses the GPU for some calculations.…

cuda infiniband rdma gpudirect

asked Aug 14 '12 at 10:47

DaoWen

32,589
6
74
101

votes

1 answer

Does the nVidia RDMA GPUDirect always operate only on physical addresses (in the physical address space of the CPU)?

As we know: http://en.wikipedia.org/wiki/IOMMU#Advantages Peripheral memory paging can be supported by an IOMMU. A peripheral using the PCI-SIG PCIe Address Translation Services (ATS) Page Request Interface (PRI) extension can detect and signal…

cuda gpgpu pci-e memory-mapping gpudirect

asked Nov 07 '13 at 16:50

Alex

12,578
15
99
195

votes

1 answer

How to use GPUDirect RDMA with Infiniband

I have two machines. There are multiple Tesla cards on each machine. There is also an InfiniBand card on each machine. I want to communicate between GPU cards on different machines through InfiniBand. Just point to point unicast would be fine. I…

cuda openmpi infiniband gpudirect

asked Jun 25 '15 at 09:30

Hot.PxL

1,902
1
17
30

votes

1 answer

RDMA Read protection for local memory operations

I have the following scenario: My server allocates a buffer of 1MB, which is periodically updated and written to (about every 50ms). The client is connected to the server via Infiniband and periodically reads that buffer via RDMA Read…

c++ infiniband rdma gpudirect

asked Apr 26 '22 at 15:18

hey0

votes

1 answer

Setting up GPUDirect for infiniband

I am trying to set up GPUDirect to use InfiniBand verbs RDMA calls directly on device memory without the need to use cudaMemcpy. I have 2 machines, each with NVIDIA K80 GPU cards and driver version 367.27. CUDA 8 and Mellanox OFED 3.4 are installed. Also the…

cuda infiniband gpudirect

asked Nov 11 '16 at 10:56

kusterl

votes

1 answer

Can I use in CUDA atomic-operations on remote GPU-RAM over GPUDirect 2.0 P2P?

For example, with CUDA 6.5 I can use CUDA atomic operations such as atomicAdd(ptr, val) and atomicCAS(ptr, old, new) on a GPU's own global memory (GPU-RAM). But can I use these atomic operations on remote global memory over GPUDirect 2.0 P2P?

cuda nvidia gpgpu multi-gpu gpudirect

asked Jan 18 '15 at 18:47

Alex

12,578
15
99
195

votes

1 answer

How does GPUDirect enforce isolation on a shared device

I have been reading about GPUDirect here: https://developer.nvidia.com/gpudirect. In their example there is a network card attached to the PCIe bus together with two GPUs and a CPU. How is isolation enforced between all clients trying to access the…

gpu nvidia pci-e rdma gpudirect

asked Dec 26 '13 at 10:17

yonigo

vote

1 answer

RDMA read and write data placement/visibility semantics

I am trying to get more details on RDMA read and write semantics (especially data placement semantics), and I would like to confirm my understanding with the experts here. RDMA read: Would the data be available/seen in the local buffer, once…

rdma gpudirect

asked Oct 12 '21 at 03:25

user718134

vote

0 answers

glTexSubImage2D() and glGetTextureImageEXT() used instead of GPUDirect

My setup: an NVIDIA Quadro P6000 (used for GPUDirect) and an AJA video card (SDI feed from the camera and SDI output to the TV). I am trying to figure out what the purpose of GPUDirect is (in my specific case) if I can achieve the same amount of copying and the…

opengl gpu nvidia video-card gpudirect

asked Oct 18 '17 at 08:34

Bojan Lovrovic

vote

1 answer

Nvidia GPUDirect and camera capturing to GPU

I have a USB3 camera, and I need the captured images to be loaded into a DirectX texture. Currently I'm just doing it in my code in user mode: grabbing images and uploading them to the GPU, which, of course, means a certain overhead on the CPU and a delay of…

directx gpu nvidia gpudirect

asked Jul 31 '16 at 07:28

Michael Gopshtein

vote

1 answer

Can I use GPUDirect v2 Peer-to-Peer communication between two Quadro K1100M or two GeForce GT 745M?

Can I use GPUDirect v2 Peer-to-Peer communication on a single PCIe bus between two mobile nVidia Quadro K1100M GPUs, or between two mobile nVidia GeForce GT 745M GPUs?

cuda nvidia gpgpu gpudirect

asked Nov 19 '13 at 12:58

Alex

12,578
15
99
195

vote

3 answers

CUDA: GPUDirect on GeForce GTX 690

The GeForce GTX 690 (from vendors like Zotac and EVGA) can be used for CUDA programming, much like a Tesla K10. Question: Does the GeForce GTX 690 support GPUDirect? Specifically: If I were to use two GTX 690 cards, I would have 4 GPUs (two GPUs…

cuda nvidia gpudirect

asked Jan 01 '13 at 17:44

Prateeksha Shenoy

votes

0 answers

RDMA support GPUDirect feature

I want to use the GPUDirect RDMA feature with NVIDIA GPUs. Must I use a hardware RDMA NIC from Mellanox, such as the ConnectX-5, or can I use an RDMA NIC from another company and adapt it to support GPUDirect? If so, what do I need to install? Now, I make some RDMA…

rdma gpudirect

asked Mar 06 '23 at 08:45

xiaobin

votes

0 answers

GPUDirect RDMA out of range pin address by Quadro p620

I want to implement FPGA-GPU RDMA with an NVIDIA Quadro P620. In my custom driver I also use the common PCIe BAR resources (BAR0, BAR1, BAR2) for FPGA registers and other chunk controller handling, which is independent of RDMA. PCIe managements are OK…

gpu fpga rdma gpudirect

asked Dec 18 '22 at 09:19

SkyCyborg
