About 20% of our deadline farm. Some are reported as stalled, some just say they are rendering (when they are not)
But some new errors.
Machine reporting stalled:
2013-02-22 11:13:10: Scheduler Thread - Synchronization time for plugin files: 62.500 ms
2013-02-22 11:13:10: Constructor: MayaBatch
2013-02-22 11:13:11: 0: Task timeout is being ignored because this is a Post Job Script Task
2013-02-22 11:13:11: 0: Loaded job: [HEAT] Software Render: HA_260_2510_maya_animation_animation.ma version: v0120 (5127b6beba5e7815f052dd81)
2013-02-22 11:13:11: 0: Plugin executing post job script
2013-02-22 11:13:11: 0: Executing Post Job Script: “//inferno2/exchange/software/managed/pythonScripts/site-packages/scl/farm/postScripts/removeSoftwareDepth.py”
2013-02-22 11:13:12: 0: INFO: Executing plugin script C:\Documents and Settings\ScanlineVFX\Local Settings\Application Data\Thinkbox\Deadline6\slave\LAPRO0241\plugins\MayaBatch.py
2013-02-22 11:13:12: 0: INFO: About: Maya Batch Plugin for Deadline
2013-02-22 11:13:12: 0: INFO: The job’s environment will be merged with the current environment before rendering
2013-02-22 11:13:12: 0: INFO: Executing post job script //inferno2/exchange/software/managed/pythonScripts/site-packages/scl/farm/postScripts/removeSoftwareDepth.py
2013-02-22 11:13:18: Events plugin names could not be collected from the repository because: Could not find a part of the path ‘\inferno2\deadline\repository6\custom\events’. (System.IO.DirectoryNotFoundException)
2013-02-22 11:13:30: Exception on Startup: System.Exception: Python Error: Object reference not set to an instance of an object. (System.NullReferenceException)
2013-02-22 11:13:30: Stack Trace:
2013-02-22 11:13:30: at Python.Runtime.ClassBase.tp_dealloc(IntPtr ob)
2013-02-22 11:13:30: at Python.Runtime.Runtime.PyImport_ExecCodeModule(String name, IntPtr code)
2013-02-22 11:13:30: at Python.Runtime.PythonEngine.ModuleFromString(String name, String code)
2013-02-22 11:13:30: at FranticX.Scripting.PythonNetScriptEngine.ExecuteScript(String scriptName, String script)
2013-02-22 11:13:30: —> System.NullReferenceException: Object reference not set to an instance of an object.
2013-02-22 11:13:30: at Python.Runtime.ClassBase.tp_dealloc(IntPtr ob)
2013-02-22 11:13:30: at Python.Runtime.Runtime.PyImport_ExecCodeModule(String name, IntPtr code)
2013-02-22 11:13:30: at Python.Runtime.PythonEngine.ModuleFromString(String name, String code)
2013-02-22 11:13:30: at FranticX.Scripting.PythonNetScriptEngine.ExecuteScript(String scriptName, String script)
2013-02-22 11:13:30: — End of inner exception stack trace —
2013-02-22 11:13:30: at FranticX.Scripting.PythonNetScriptEngine.a(Exception A_0)
2013-02-22 11:13:30: at FranticX.Scripting.PythonNetScriptEngine.ExecuteScript(String scriptName, String script)
2013-02-22 11:13:30: at a.a(String[] A_0)
2013-02-22 11:13:30: Deadline Slave will now exit.
2013-02-22 11:13:30: Traceback (most recent call last):
2013-02-22 11:13:30: File “…\Libraries\ThinkboxUI\UI\Controls\FilterLogBoxView.py”, line 199, in writeLine
2013-02-22 11:13:30: AttributeError: ‘RepositoryStorage’ object has no attribute ‘DataController’
Machine looking ‘active’, but in fact hanging for 3 days:
Last lines of log (these are regular, expected outputs):
2013-02-22 16:28:22: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : pop widget QWidget centralwidget
2013-02-22 16:28:22: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : new topwidget MainWindow
2013-02-22 16:28:22: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : pop widget QMainWindow MainWindow
2013-02-22 16:28:22: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : new topwidget None
2013-02-22 16:28:22: 0: STDOUT: Fri Feb 22 16:28:21 2013 (+889ms) : debug : Registering Node Class : “MayaRigCacheExporter”
2013-02-22 16:28:22: 0: STDOUT: Fri Feb 22 16:28:21 2013 (+0ms) : debug : Registering Node Class : “MayaRigCacheImporter”
Machine stalled:
2013-02-21 20:16:10: 0: STDOUT: Thu Feb 21 20:16:10 2013 (+0ms) : debug : Images.render3d Turntable not selected.
2013-02-21 20:16:10: 0: STDOUT: Thu Feb 21 20:16:10 2013 (+0ms) : debug : Images.render3d Software render selected.
2013-02-21 20:16:10: 0: STDOUT: Thu Feb 21 20:16:10 2013 (+0ms) : debug : Images.render3d Farm playblast begin
2013-02-21 20:16:10: 0: STDOUT: cached
2013-02-21 20:16:10: 0: STDOUT: Thu Feb 21 20:16:10 2013 (+16ms) : critical error : PUBLISH : *** Unhandled publish exception ***
2013-02-21 20:16:10: 0: STDOUT: Thu Feb 21 20:16:10 2013 (+0ms) : debug : New log file created using scLog v0.10
2013-02-21 20:16:12: 0: STDOUT: Error Traceback: Traceback (most recent call last):
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\publish.py”, line 588, in publish
2013-02-21 20:16:12: 0: STDOUT: if oNode.activate():
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 926, in activate
2013-02-21 20:16:12: 0: STDOUT: if(not self.doFarmPlayBlast()):
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 808, in doFarmPlayBlast
2013-02-21 20:16:12: 0: STDOUT: sDir = self.getFilePath(sPassName)
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 255, in getFilePath
2013-02-21 20:16:12: 0: STDOUT: return (’%s/%s_%s_%s’ %(self.getAttr(‘sExportDir’).replace(’\’,’/’), sPassName, self.getColorTag(), self.getResolutionTag()))
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 241, in getResolutionTag
2013-02-21 20:16:12: 0: STDOUT: return renderAPI.getResolutionTagByName(self.getAttr(‘sResolutionTag’)).sName
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\renderAPI.py”, line 324, in getResolutionTagByName
2013-02-21 20:16:12: 0: STDOUT: return getResolutionTags()[0]
2013-02-21 20:16:12: 0: INFO: Deadline is ignoring error: “IndexError: list index out of range” because plugin setting Strict Error Checking is enabled and this error is not usually fatal.
2013-02-21 20:16:12: 0: STDOUT: IndexError: list index out of range
2013-02-21 20:16:12: 0: INFO: Deadline is ignoring error: “Warning: PUBLISH : *** Unhandled publish exception *** Error Traceback: Traceback (most recent call last):” because plugin setting Strict Error Checking is enabled and this error is not usually fatal.
2013-02-21 20:16:12: 0: STDOUT: Warning: PUBLISH : *** Unhandled publish exception *** Error Traceback: Traceback (most recent call last):
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\publish.py”, line 588, in publish
2013-02-21 20:16:12: 0: STDOUT: if oNode.activate():
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 926, in activate
2013-02-21 20:16:12: 0: STDOUT: if(not self.doFarmPlayBlast()):
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 808, in doFarmPlayBlast
2013-02-21 20:16:12: 0: STDOUT: sDir = self.getFilePath(sPassName)
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 255, in getFilePath
2013-02-21 20:16:12: 0: STDOUT: return (’%s/%s_%s_%s’ %(self.getAttr(‘sExportDir’).replace(’\’,’/’), sPassName, self.getColorTag(), self.getResolutionTag()))
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\scNodes\images.py”, line 241, in getResolutionTag
2013-02-21 20:16:12: 0: STDOUT: return renderAPI.getResolutionTagByName(self.getAttr(‘sResolutionTag’)).sName
2013-02-21 20:16:12: 0: STDOUT: File “\S2\exchange\software\managed\SoftwareConfiguration\mayaShared\env\configs\default\modules\renderAPI.py”, line 324, in getResolutionTagByName
2013-02-21 20:16:12: 0: STDOUT: return getResolutionTags()[0]
2013-02-21 20:16:12: 0: INFO: Deadline is ignoring error: “IndexError: list index out of range” because plugin setting Strict Error Checking is enabled and this error is not usually fatal.
2013-02-21 20:16:12: 0: STDOUT: IndexError: list index out of range
2013-02-21 20:16:12: 0: STDOUT: Thu Feb 21 20:16:12 2013 (+2250ms) : Enabling viewports
2013-02-21 20:16:14: 0: STDOUT: Error: Render failed
2013-02-21 20:16:17: Scheduler Thread - Render Thread 0 threw a major error:
2013-02-21 20:16:17: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
2013-02-21 20:16:17: Exception Details
2013-02-21 20:16:17: RenderPluginException – Error in RenderTasks: Deadline caught error “Error: Render failed”. If this error message is unavoidable but not fatal to your render, please email support@thinkboxsoftware.com with the error message, and disable the Maya job setting Strict Error Checking.
2013-02-21 20:16:17: at Deadline.Plugins.ScriptPlugin.RenderTasks(String taskId, Int32 startFrame, Int32 endFrame, String& outMessage, AbortLevel& abortLevel)
2013-02-21 20:16:17: RenderPluginException.Cause: JobError (2)
2013-02-21 20:16:17: RenderPluginException.Level: Major (1)
2013-02-21 20:16:17: RenderPluginException.HasSlaveLog: True
2013-02-21 20:16:17: Exception.Data: ( )
2013-02-21 20:16:17: Exception.TargetSite: Void RenderTask(System.String, Int32, Int32)
2013-02-21 20:16:17: Exception.Source: deadline
2013-02-21 20:16:17: Exception.StackTrace:
2013-02-21 20:16:17: at Deadline.Plugins.Plugin.RenderTask(String taskId, Int32 startFrame, Int32 endFrame)
2013-02-21 20:16:17: at Deadline.Slaves.SlaveRenderThread.a(TaskLogWriter A_0)
2013-02-21 20:16:17: <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
2013-02-21 20:24:14: Scheduler Thread - Synchronizing job auxiliary files from \inferno2\deadline\repository6\jobs\5126c529de3c960be87af3f5
2013-02-21 20:24:14: Scheduler Thread - Synchronization time for job files: 46.875 ms
2013-02-21 20:24:14: Scheduler Thread - Synchronizing plugin files from \inferno2\deadline\repository6\plugins\MayaBatch
2013-02-21 20:24:14: Scheduler Thread - Synchronization time for plugin files: 468.750 ms
2013-02-21 20:24:15: Constructor: MayaBatch
2013-02-21 20:24:16: 0: Task timeout is being ignored because this is a Post Job Script Task
2013-02-21 20:24:16: 0: Loaded job: [TBOA] Software Render: FB_083_0220_maya_animation_Layout.ma version: v0039 (5126c529de3c960be87af3f5)
2013-02-21 20:24:16: 0: Plugin executing post job script
2013-02-21 20:24:16: 0: Executing Post Job Script: “//inferno2/exchange/software/managed/pythonScripts/site-packages/scl/farm/postScripts/removeSoftwareDepth.py”
2013-02-21 20:24:24: Scheduler Thread - Cancelling task because task “-1_0” could not be found
2013-02-21 20:24:24: Scheduler Thread - The task has either been changed externally (and requeued), or the Job has been deleted.
2013-02-21 20:24:24: 0: INFO: Executing plugin script C:\Documents and Settings\ScanlineVFX\Local Settings\Application Data\Thinkbox\Deadline6\slave\LAPRO0235\plugins\MayaBatch.py
2013-02-21 20:24:24: 0: INFO: About: Maya Batch Plugin for Deadline
2013-02-21 20:24:24: 0: INFO: The job’s environment will be merged with the current environment before rendering
2013-02-21 20:24:24: 0: INFO: Executing post job script //inferno2/exchange/software/managed/pythonScripts/site-packages/scl/farm/postScripts/removeSoftwareDepth.py
Machine reporting stalled:
2013-02-22 12:29:42: 0: INFO: Process Affinity: default
2013-02-22 12:29:42: 0: INFO: Process is now running
2013-02-22 12:29:42: 0: Plugin rendering frame(s): 1331-1334
2013-02-22 12:29:42: 0: INFO: Waiting until maya is ready to go
2013-02-22 12:29:49: Scheduler Thread - Cancelling task because task “22_1331-1334” could not be found
2013-02-22 12:29:49: Scheduler Thread - The task has either been changed externally (and requeued), or the Job has been deleted.
2013-02-22 12:29:50: 0: In the process of canceling current task: ignoring exception thrown by PluginLoader
2013-02-22 12:29:50: Scheduler Thread - In the process of canceling current tasks: ignoring exception thrown by render thread 0
2013-02-22 12:46:16: Traceback (most recent call last):
2013-02-22 12:46:16: File “DeadlineSlave\UI\Forms\MainWindowSlave.py”, line 291, in update
2013-02-22 12:46:16: File “DeadlineSlave\UI\Forms\MainWindowSlave.py”, line 315, in updateJobInfo
2013-02-22 12:46:16: InvalidOperationException: Collection was modified; enumeration operation may not execute.
2013-02-22 12:46:16: at System.Collections.Generic.Dictionary`2.KeyCollection.Enumerator.MoveNext()
2013-02-22 12:46:16: at Deadline.Slaves.Slave.GetLimitGroupStubs()
2013-02-22 13:46:11: Purging old statistics
2013-02-22 13:46:12: Purging slave statistics that are older than Oct 25/12 13:47:40
2013-02-22 13:46:12: Purging repository statistics that are older than Oct 25/12 13:47:40
2013-02-22 13:46:52: Purging obsolete slaves
2013-02-22 13:49:28: Purging obsolete slaves
2013-02-22 13:49:57: Purging repository temp files
2013-02-22 13:54:32: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:54:32: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
2013-02-22 13:54:52: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:54:52: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
2013-02-22 13:55:01: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:55:01: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
2013-02-22 13:55:07: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:55:07: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
2013-02-22 13:55:14: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:55:14: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
2013-02-22 13:55:23: Slave - An error occurred while updating the slave’s info: An unexpected error occurred while interacting with the database (deadline.scanlinevfxla.com:27017):
2013-02-22 13:55:23: Exception of type ‘System.OutOfMemoryException’ was thrown. (FranticX.Database.DocumentException)
Stalled machine, first log (last lines):
2013-02-21 20:03:54: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : push QStatusBar statusbar
2013-02-21 20:03:54: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : pop widget QStatusBar statusbar
2013-02-21 20:03:54: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : new topwidget MainWindow
2013-02-21 20:03:54: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : pop widget QMainWindow MainWindow
2013-02-21 20:03:54: 0: STDOUT: PyQt4.uic.uiparser : DEBUG : new topwidget None
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+781ms) : debug : Registering Node Class : “RetimeExporter”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Node Class : “RetimeImporter”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+15ms) : debug : Evaluating all presets
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Preset : “Export Master camera”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Preset : “Export Track camera”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Preset : “Export Projection camera”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Preset : “Create Rig exporter based on current scene”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Preset : “Create Playblast based on current scene”
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Registering Checks
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : Importing animation
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:53 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: animation
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+141ms) : Importing asset
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: asset
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : Importing geometry
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: geometry
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+16ms) : Importing naming
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: naming
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+14ms) : Importing rig
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: rig
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : Importing scene
2013-02-21 20:03:54: 0: STDOUT: Thu Feb 21 20:03:54 2013 (+0ms) : debug : Import Checks folder: sceneJanitor.checks module: scene
2013-02-21 21:52:23: Scheduler Thread - Cancelling task because task “13_1196-1210” could not be found
Then it had another log from the next day with only these lines:
2013-02-22 16:03:24: BEGIN - LAPRO0252\ScanlineVFX
2013-02-22 16:03:24: 0: Slave timed out – canceling current task…