0% GPU usage during cluster rendering

Message 1 of 10

Neil_Cross
Mentor

I've set up a standard cluster: a Vred Pro main client with Vred Core remote nodes on a LAN (all 2023.2).

When I run a render, the cluster is clearly working: render times (GPU RT) drop significantly as I add nodes. But when I look at Task Manager, GPU usage on the remote nodes is 0%. Weirdly, the GPU temperature does rise during rendering, so I'm baffled as to why the utilisation is shown as 0%.

Neil_Cross_0-1670365089311.png

 

Not sure if the above image will come through clearly, but the remote desktop on the right is the "RPL" node in the cluster. It's 67% through the render, yet both the iGPU and the A5000 show 0% usage. The A5000's temperature went from 27C to 51C when the render started, so it's doing the work, just not registering as activity?

Message 2 of 10

richardlevene
Collaborator

Don't you have to use a proper GPU profiler to see the GPU activity, rather than viewing it in Windows Task Manager? Or was that only on older Windows versions?

Message 3 of 10

michael_nikelsky
Autodesk
Accepted solution

By default, Task Manager only shows 3D (DirectX/Vulkan/OpenGL) usage, not CUDA usage. On the professional GPUs you can switch it to CUDA manually, but it won't save that setting. There is also a usage monitor in the NVIDIA settings that is much more reliable.

No idea what to do on GeForce GPUs though; maybe there are other tools out there that give you a proper usage reading.



Michael Nikelsky
Sr. Principal Engineer
Message 4 of 10

Neil_Cross
Mentor

What's confusing me, though, is that the same render task shows 100% GPU usage on the primary machine with the Vred Pro client, while the slave running Vred Core is doing the exact same task and shows 0%.

And the V-Ray benchmark, which is an explicit CUDA test, shows as 100% usage in Task Manager:

Neil_Cross_0-1670401670452.png

If you don't think there's anything concerning to follow up on and this is expected behaviour with Vred I'll move on and just check these other profiling tools out 👍

Message 5 of 10

michael_nikelsky
Autodesk

I would consider Task Manager completely useless when it comes to GPU usage with CUDA/OptiX. As long as the NVIDIA usage monitor doesn't show any issue, there is no issue.

But if you can switch Task Manager from 3D to CUDA, you should see some activity there as well. Not sure if this is possible on a GeForce GPU though, because... reasons, I guess.

I am actually more surprised that you see any activity on the main machine.



Michael Nikelsky
Sr. Principal Engineer
Message 6 of 10

Neil_Cross
Mentor

Is this profiler/usage monitor that you would use the one which is part of the CUDA toolkit?

Either way, I managed to get CUDA to show up in Task Manager:

Neil_Cross_0-1670402804621.png

For anyone finding this in the future: you have to disable hardware-accelerated GPU scheduling in Windows graphics settings to get the drop-down option to show CUDA use.
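If you want to check that setting on several render nodes without clicking through the Settings app on each one, something like the following Python sketch might help. It rests on an assumption that is not confirmed anywhere in this thread: that the toggle is stored in the registry value `HwSchMode` under `HKLM\SYSTEM\CurrentControlSet\Control\GraphicsDrivers`, with 2 meaning enabled and 1 meaning disabled, as is commonly reported. The function names are made up for illustration.

```python
# Sketch only. ASSUMPTION (not from this thread): hardware-accelerated GPU
# scheduling is stored as the DWORD "HwSchMode" under the key below,
# where 2 = enabled and 1 = disabled. Verify on your own machine first.
HAGS_KEY = r"SYSTEM\CurrentControlSet\Control\GraphicsDrivers"
HAGS_VALUE = "HwSchMode"


def describe_hags(value):
    """Translate the raw registry value into a human-readable state."""
    if value is None:
        return "not set (OS/driver default applies)"
    if value == 2:
        return "enabled"
    if value == 1:
        return "disabled"
    return f"unknown ({value})"


def read_hwschmode():
    """Read the raw value from the registry; returns None if it is absent."""
    import winreg  # standard library, Windows only, so imported lazily

    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, HAGS_KEY) as key:
            value, _value_type = winreg.QueryValueEx(key, HAGS_VALUE)
            return value
    except FileNotFoundError:
        return None
```

On a Windows node, `print(describe_hags(read_hwschmode()))` would report the state. The sketch only reads the value; it does not change the setting.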

I'll still try and find a better monitor, just don't particularly want to install the full CUDA toolkit just for that tbh unless it's available somewhere separately.

Message 7 of 10

michael_nikelsky
Autodesk

The usage monitor is in the NVIDIA Control Panel under Workstation -> Manage GPU usage.

Interesting find with the GPU scheduling, didn't know that.



Michael Nikelsky
Sr. Principal Engineer
Message 8 of 10

Neil_Cross
Mentor

Got it, thanks. The Workstation menu in NVIDIA Control Panel doesn't appear when you're connected to a client via Remote Desktop.

Message 9 of 10

michael_nikelsky
Autodesk

There is also the non-fancy but very powerful nvidia-smi tool, which you can run from the command line. It can give you tons of information about the GPUs. The command line I commonly use is:

"C:\Windows\System32\nvidia-smi.exe" --query-gpu=index,utilization.gpu,utilization.memory,memory.used,clocks.gr,clocks.mem,power.draw,temperature.gpu --format=csv --loop-ms=1000
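If you would rather collect these readings from a script (for example, to check each cluster node in turn), a minimal Python sketch could look like the one below. It queries the same fields as the command above, but uses `--format=csv,noheader,nounits` so each row is plain numbers. The helper names `parse_row`, `busy_gpus` and `query_gpus`, and the 5% threshold, are made up for illustration and are not part of nvidia-smi or Vred.

```python
import subprocess

# Field order mirrors the --query-gpu list in the command above.
FIELDS = [
    "index", "utilization.gpu", "utilization.memory", "memory.used",
    "clocks.gr", "clocks.mem", "power.draw", "temperature.gpu",
]


def parse_row(line):
    """Turn one CSV row (noheader,nounits) into a dict keyed by field name."""
    cells = [cell.strip() for cell in line.split(",")]
    if len(cells) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(cells)}")
    row = {}
    for name, cell in zip(FIELDS, cells):
        try:
            row[name] = float(cell)
        except ValueError:
            row[name] = None  # non-numeric cell, e.g. "[N/A]" for an unsupported field
    return row


def busy_gpus(rows, threshold=5.0):
    """Indices of GPUs whose utilization.gpu is above threshold (hypothetical cutoff)."""
    return [
        int(row["index"])
        for row in rows
        if row["utilization.gpu"] is not None and row["utilization.gpu"] > threshold
    ]


def query_gpus(exe="nvidia-smi"):
    """Run nvidia-smi once and parse one row per GPU. Needs the NVIDIA driver installed."""
    result = subprocess.run(
        [exe, "--query-gpu=" + ",".join(FIELDS), "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return [parse_row(line) for line in result.stdout.splitlines() if line.strip()]
```

Calling `busy_gpus(query_gpus())` on a node during a render would list the GPUs that report activity. To try the parser without a GPU, feed it a hand-written row such as `parse_row("0, 97, 41, 10240, 1695, 7600, 212.35, 51")`; those numbers are invented for the example, not measurements from this cluster.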

 

 



Michael Nikelsky
Sr. Principal Engineer
Message 10 of 10

richardlevene
Collaborator

GPU-Z is a free, simple option, like CPU-Z.