I have some questions about pinned memory and paged (pageable) memory.
I have seen papers stating that host-to-device memory copies have lower latency when the host buffer is pinned memory than when it is paged memory
(e.g. "Reducing GPU Offload Latency via Fine-Grained CPU-GPU Synchronization" by Daniel Lustig and Margaret Martonosi).
My questions are:
- What makes pinned memory faster than paged memory? (I still don't understand the difference between them.)
- Is pinned memory used only in CUDA environments?