Questions tagged [nvml]

A C-based API for monitoring and managing various states of NVIDIA GPU devices.

34 questions
6
votes
4 answers

Cannot run CUDA code that queries NVML - error regarding libnvidia-ml.so

Recently a colleague needed to use NVML to query device information, so I downloaded the Tesla development kit 3.304.5 and copied the file nvml.h to /usr/include. To test, I compiled the example code in tdk_3.304.5/nvml/example and it worked…
Brian R
  • 785
  • 1
  • 6
  • 13
5
votes
1 answer

NVidia NVML nvmlDeviceGetMemoryInfo() loads and unloads nvapi64.dll immediately

I use some NVIDIA Management Library features to produce metrics in my application. Every second I call nvmlDeviceGetMemoryInfo() in a thread, and after a few minutes, in the Visual Studio output window, I can read hundreds of: 'MyApp.exe' (Win32):…
SamT
  • 528
  • 4
  • 14
5
votes
1 answer

Is there any way or even possible to get the overall utilization of a GPU during a period of time?

I am trying to get information about the overall utilization of a GPU (mine is an NVIDIA Tesla K20, running on Linux) during a period of time. By "overall" I mean something like, how many streaming multiprocessors are scheduled to run, and how…
rsm
  • 103
  • 1
  • 6
4
votes
1 answer

AMD's NVML counterpart (c++)

I would like to know what library AMD offers as a counterpart to NVIDIA's NVML. What I want is to get temperature, power usage, etc. in C++. Best regards!
yannickwe87
  • 105
  • 7
3
votes
0 answers

Slurm not optimally allocating multiple GPUs

We are using Slurm 20.02 with NVML autodetect, and on some 8-GPU nodes with NVLink, 4-GPU jobs get allocated by Slurm in a surprising way that appears sub-optimal. On a system with 8 Nvidia A40 GPUs, 4 NVLink bridges, and two AMD EPYC 7302 CPUs, we…
leilu
  • 367
  • 3
  • 10
3
votes
2 answers

NVML: Driver/library version mismatch

I don't know why nvidia-smi doesn't work or what I need to do to fix it. I think my library and driver versions match, but nvidia-smi doesn't recognize it.
Jonghoe Ku
  • 31
  • 1
  • 2
3
votes
1 answer

How to measure GPU usage per process in Windows using python?

I would like to measure the GPU usage per process as done in Windows taskmgr.exe, but I have encountered several problems when attempting to use the pyNVML library. As a result, I have a few questions. First, is it currently possible to measure the…
h1d3r00t
  • 43
  • 7
3
votes
1 answer

NVML library path

I compiled GROMACS 2016.3 using cmake (3.5.1) with the following flags: cmake .. -DGMX_BUILD_OWN_FFTW=ON -DREGRESSIONTEST_DOWNLOAD=ON -DGMX_MPI=on -DGMX_GPU=on -DGMX_OPENMP=on -DCUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda-8.0…
3
votes
1 answer

NVML Header File Missing

I am trying to execute some CUDA code which happens to have some NVML library functions like nvmlSystemGetDriverVersion. But, when I try to compile the code it says nvml.h not found. How should I install NVML on my system since nvml.h does not…
Vaibhav Sundriyal
  • 567
  • 5
  • 11
  • 18
2
votes
2 answers

GPU MHZ Utilization

I am developing a monitoring agent for GPU cards that is capable of providing real-time telemetry using CUDA and NVML libraries. I want to understand a little more about GPU core operation vs how Intel/AMD CPU cores work. One formula that can be…
2
votes
1 answer

NVIDIA-SMI, NVML, Power usage: [NOT SUPPORTED]

I tried to get the current power usage with the following command in Windows 10 x64: nvidia-smi.exe --format=csv,noheader --query-gpu=power.draw and got the following result: [Not Supported]. I checked it on a GTX 1050 (notebook) video card. Please also…
Ivan Kolesnikov
  • 1,787
  • 1
  • 29
  • 45
2
votes
0 answers

How can I get gpu utilization?

Previously, I tried NVML using the function nvmlDeviceGetUtilizationRates(). I tested it this way: while the collection is running, I execute a DFT (the kernel is organised as <7,32>) on a Tesla C2070, which has 14 SMs. By my reasoning, there were 7…
Loong Draw
  • 19
  • 1
2
votes
1 answer

GPU Utilization

I have been using the NVML library to get the values of graphics and memory utilization for the Rodinia benchmark suite. I observe that with different frequencies, the utilization of the same application shows different values. From the wiki link…
Vaibhav Sundriyal
  • 567
  • 5
  • 11
  • 18
1
vote
0 answers

How to understand the SmUtil returned by nvmlDeviceGetProcessUtilization?

I'm writing a program that monitors how processes use the GPU and I found an API provided by NVML, nvmlDeviceGetProcessUtilization. According to the comment on this API, it reads recent utilization of GPU SM (3D/Compute), framebuffer, video encoder,…
nezhababa
  • 11
  • 1
1
vote
1 answer

Why does nvmlDeviceGetTemperature only work in debug mode?

Using VS2022 the following code snippet works in debug mode but not in release mode: nvmlInit(); nvmlDevice_t devH; auto ret = nvmlDeviceGetHandleByIndex_v2(0, &devH); if (ret != NVML_SUCCESS) DPrint("ERROR!"); u32 tt{}; ret =…
poby
  • 1,572
  • 15
  • 39