We’ve got a scenario that we’d like to add to the failure detection.
I was wondering whether this is something I can set up on my end.
Basically, I want a rule that says: if a job is still at 0% progress after X minutes, the job should fail.
We have some long frames on a current job (around 10 hours).
The problem is that if we set the timeout to 11 hours, a frame that hangs on startup stays hung for the full 11 hours, doing nothing, before it times out. It would be nice if it timed out after 20 minutes of not starting.
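To make the rule concrete, this is roughly the logic I have in mind. It is only a sketch in plain Python: `TaskStatus`, `should_fail`, and the parameter names are made up for illustration and are not part of any existing API. The 20-minute and 11-hour values are just the ones from the example above.

```python
from dataclasses import dataclass


@dataclass
class TaskStatus:
    """Hypothetical snapshot of a running frame (not a real API type)."""
    progress_percent: float  # 0.0 to 100.0
    minutes_running: float   # time since the frame was picked up


def should_fail(task: TaskStatus,
                startup_limit_minutes: float = 20.0,
                overall_limit_minutes: float = 11 * 60.0) -> bool:
    """Return True if the frame should be failed."""
    # Hung on startup: still at 0% once the short startup window has passed.
    if task.progress_percent == 0 and task.minutes_running >= startup_limit_minutes:
        return True
    # Otherwise fall back to the normal long timeout.
    return task.minutes_running >= overall_limit_minutes
```

So a frame at 0% after 25 minutes would fail, while a frame at 40% after 5 hours would be left alone until the 11-hour timeout.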
Any hints would be appreciated.
Thanks,
Rhys.