Updated Arnold and Houdini to:
Houdini Core Version 20.0.653
Arnold Core: 7.3.1.0
HotA: 6.3.1.0
This log shows a "gpu jit compile" taking up some time, but generally IPR rendering is still incredibly slow.
Around 2mins per IPR pass to "update gpu scene data".
00:05:11 15299MB | -----------------------------------------------------------------------------------
00:05:11 15299MB | frame time 5:01.85 machine utilization (0.22%)
00:05:11 15299MB | license checkout time 0:00.03
00:05:11 15299MB | node init 0:00.63
00:05:11 15299MB | gpu update 0:43.94
00:05:11 15299MB | rendering 0:00.86
00:05:11 15299MB | output driver 0:00.07
00:05:11 15299MB | pixel rendering 0:00.79 (8 sample iterations @ min: 0:00.031, avg: 0:00.104, max: 0:00.324)
00:05:11 15299MB | unaccounted 4:16.37
00:05:11 15299MB | -----------------------------------------------------------------------------------
00:05:11 15299MB | top session self-times by category
00:05:11 15299MB | AiMsgDebug 1:04.55 (85.95%)
00:05:11 15299MB | thread blocked 0:03.10 ( 4.14%)
00:05:11 15299MB | GPU::update_scene 0:01.52 ( 2.04%)
00:05:11 15299MB | gpu jit compile 0:01.10 ( 1.47%)
00:05:11 15299MB | subdivision 0:00.67 ( 0.90%)
00:05:11 15299MB | node_update 0:00.67 ( 0.90%)
00:05:11 15299MB | -----------------------------------------------------------------------------------
00:05:11 15299MB | top session self-times by node
00:05:11 15299MB | GPU::update_scene 0:01.52 ( 2.04%)
00:05:11 15299MB | gpu jit compile 0:01.10 ( 1.47%)
00:05:11 15299MB | UpdateNodes 0:00.65 ( 0.87%)
00:05:11 15299MB | InitializeNodes 0:00.53 ( 0.71%)
00:05:11 15299MB | accumulateBucketSamples 0:00.42 ( 0.57%)
00:05:11 15299MB | polymesh:/Set/Name/Scatters/instancer_background_scatters.proto1_bush_geo_1Shape_id2 0:00.40 ( 0.53%)
00:05:11 15299MB | -----------------------------------------------------------------------------------
Here we get another log from the last IPR pass, showing a AOV driver taking some time?
00:08:02 15535MB | -----------------------------------------------------------------------------------
00:08:02 15535MB | top session self-times by category
00:08:02 15510MB | AiMsgDebug 1:04.55 (66.56%)
00:08:02 15510MB | driver_process_bucket 0:09.07 ( 9.36%)
00:08:02 15510MB | /HdArnoldRenderDelegate_00000045CC2DC100/HdArnoldRenderPass_aov_driver_1 0:06.20 ( 6.40%)
00:08:02 15510MB | accumulateBucketSamples 0:05.42 ( 5.59%)
00:08:02 15510MB | processGPUSample 0:04.79 ( 4.95%)
00:08:02 15510MB | thread blocked 0:03.55 ( 3.66%)
00:08:02 15510MB | GPU::update_scene 0:01.87 ( 1.94%)
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | top session self-times by node
00:08:02 15510MB | HdArnoldDriverAOV:/HdArnoldRenderDelegate_00000045CC2DC100/HdArnoldRenderPass_aov_driver_1 (driver_process_bucket) 0:06.20 ( 6.40%)
00:08:02 15510MB | accumulateBucketSamples 0:05.42 ( 5.59%)
00:08:02 15510MB | processGPUSample 0:04.79 ( 4.95%)
00:08:02 15510MB | GPU::update_scene 0:01.87 ( 1.94%)
00:08:02 15510MB | gpu jit compile 0:01.10 ( 1.14%)
00:08:02 15510MB | launch 0:00.94 ( 0.98%)
00:08:02 15510MB | -----------------------------------------------------------------------------------
Other logging, nothing looks wrong:
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | peak GPU memory consumed 9665.51MB
00:08:02 15510MB | CUDA context 159.97MB
00:08:02 15510MB | OptiX context 24.00MB
00:08:02 15510MB | framebuffers 187.02MB
00:08:02 15510MB | node overhead 925.13MB
00:08:02 15510MB | geometry 1998.81MB
00:08:02 15510MB | subdivs 1998.81MB
00:08:02 15510MB | accel structs 3067.58MB
00:08:02 15510MB | skydome importance map 7.80MB
00:08:02 15510MB | texture cache 322.06MB
00:08:02 15510MB | unaccounted 6035.74MB
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | ray counts ( /pixel, /sample) (% total) (avg. hits) (max hits)
00:08:02 15510MB | camera 273470933 ( 0.87, 1.00) ( 12.87%) ( 1.00) ( 1)
00:08:02 15510MB | shadow 1552431299 ( 4.94, 5.68) ( 73.07%) ( 0.70) ( 1)
00:08:02 15510MB | diffuse_reflect 230548798 ( 0.73, 0.84) ( 10.85%) ( 0.75) ( 1)
00:08:02 15510MB | specular_reflect 59092723 ( 0.19, 0.22) ( 2.78%) ( 0.88) ( 1)
00:08:02 15510MB | bssrdf 9173208 ( 0.03, 0.03) ( 0.43%) ( 0.23) ( 1)
00:08:02 15510MB | total 2124716961 ( 6.76, 7.77) (100.00%) ( 0.74) ( 1)
00:08:02 15510MB | by ray depth: 0 1 2 3
00:08:02 15510MB | total 59.2% 36.8% 4.0% 0.0%
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | geometry (% hit ) (instances) ( init mem, final mem)
00:08:02 15510MB | lists 1 ( 0.0%) ( 0) ( 0.00, 0.00)
00:08:02 15510MB | procs 27 ( 0.0%) ( 0) ( 2.17, 2.17)
00:08:02 15510MB | subdivs 1627 (100.0%) ( 284291) ( 569.08, 2682.32)
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | geometric elements ( min) ( avg.) ( max)
00:08:02 15510MB | objects (procs) 284291 ( 9) ( 10529.3) ( 17924)
00:08:02 15510MB | subdiv patches 14230441 ( 40) ( 8746.4) ( 608844)
00:08:02 15510MB | -----------------------------------------------------------------------------------
00:08:02 15510MB | triangle tessellation ( min) ( avg.) ( max) (/ element) (% total)
00:08:02 15510MB | subdivs 104298289 ( 148) ( 64104.7) ( 4730272) ( 7.33) (100.00%)
00:08:02 15510MB | iterations 1 66263776 ( 320) ( 2007993.2) ( 4730272) ( 7.77) ( 63.53%)
00:08:02 15510MB | iterations 2 33472 ( 1728) ( 2789.3) ( 5792) ( 31.82) ( 0.03%)
00:08:02 15510MB | adaptive 38001041 ( 148) ( 24020.9) ( 2862756) ( 6.67) ( 36.43%)
00:08:02 15510MB | unique triangles 104298289
00:08:02 15510MB | visible triangles 327592289
00:08:02 15510MB | CPU memory use 2682.32MB