We noticed that quite often, machines don’t stick to their sequential jobs as they are supposed to.
One particular case i looked at had this error between tasks:
2015-08-27 18:19:51: 0: Render time for frame(s): 1.109 m
2015-08-27 18:19:51: 0: Total time for task: 1.144 m
2015-08-27 18:19:53: 0: Saving task log...
2015-08-27 18:19:54: Scheduler Thread - Render Thread 0 completed its task
2015-08-27 18:19:54: Scheduler Thread - Seconds before next job scan: 2
2015-08-27 18:19:56: Scheduler - Performing Job scan on Primary Pools with scheduling order Pool, Weighted, Balanced
2015-08-27 18:19:56: Scheduler - Error occurred while scanning for jobs: An error occurred while trying to Query the Database (deadline01.scanlinevfxla.com:27017,deadline02.scanlinevfxla.com:27017,deadline03.scanlinevfxla.com:27017). It is possible that Deadline failed to Authenticate properly. Please check that the Mongo Username/Password are correct.
2015-08-27 18:19:56: Full error: QueryFailure flag was assertion src/mongo/db/structure/btree/key.cpp:433 (response was { "$err" : "assertion src/mongo/db/structure/btree/key.cpp:433" }). (FranticX.Database.DatabaseConnectionException)
2015-08-27 18:19:56: Scheduler - Performing Job scan on Secondary Pools with scheduling order Pool, Weighted, Balanced
2015-08-27 18:19:57: Scheduler - Using enhanced scheduler balancing
2015-08-27 18:19:57: Scheduler - Job chooser found no jobs.
2015-08-27 18:19:57: 0: Shutdown
2015-08-27 18:19:57: 0: Exited ThreadMain(), cleaning up...
2015-08-27 18:19:57: 0: INFO: End Job called - shutting down 3dsmax plugin
2015-08-27 18:19:57: 0: Shutdown
2015-08-27 18:19:58: 0: Shutdown
2015-08-27 18:19:58: 0: Shutdown
Not sure whats going on, we are not using authentication.