- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report
Why we did it?
Was having a lot of stability issues with backburner at scale
far more control over resource sharing than the simple backburner groups
allows other software to also use expensive GPU nodes for rendering when idle
ability to have post jobs run after burn, ie make a QT from exr render
How we did it?
using a flame batch hook that runs when a burn job is submitted. It grabs the burn tar file, extracts and submits a deadline job
Backburner is still setup and running as normal. It handles all the background caching. burn jobs still get submitted to it using a burn group with no nodes, every night we clear out the BBM queue.
how AD can make it better in order of importance:
have burn_gpu always complete with a exit code 0 or 1 for success or failure. currently we have to tail the burn shell log file to catch for errors to fail tasks.
Right now we haven't figured out how to catch background caching jobs, or timeline rendering jobs. timeline renders, we do see the same burn tars, and are able to run the burn_gpu with it which looks like it renders frames. But it never updates the flame timeline.
having burn_gpu output it's log to stdout. logs are a bit messy in deadline, they have the the current task logs and each previous task.
Ability to disable backburner completely.
to setup:
Standard Deadline setup, nothing special needed on the service side.
(scripts attached)
flame.py can live on central storage for burn nodes to call
batch_hook.py and deadline_chunks.py lives in flame python folder
on all hosts that will run burn
add ACL to /opt/Autodesk/log/ so calling user can write to it.
on burn manager, edit this script to open up Jobs folder so calling user can pull burn tar's. ACL wasn't enough.
/opt/Autodesk/backburner/scripts/.systemd/adsk_backburner_manager.sh
......
chmod -R 777 /opt/Autodesk/backburner/Network/Jobs
exit 0
Hoping to rally some support to make this function better. With the added ability to easily share render resources w/ other apps, any size company could make use of this.
Solved! Go to Solution.