AWS Thinkbox Discussion Forums

slave crashed, not handled

Slave thinks it's rendering:

But there is no max process running.

Last lines of the log:

2016-05-19 10:51:03:  0: INFO: Preparing ray server...
2016-05-19 10:51:03:  0: INFO: Building static raycast accelerator...
2016-05-19 10:51:03:  0: INFO: Building static raycast accelerator...: done [00:00:00.1]
2016-05-19 10:51:03:  0: INFO: Preparing direct light manager...
2016-05-19 10:51:03:  0: INFO: Preparing global light manager...
2016-05-19 12:11:34:  0: Unhandled Exception: System.IO.IOException: Unable to read data from the transport connection: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. ---> System.Net.Sockets.SocketException: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond
2016-05-19 12:11:34:  0:    at System.Net.Sockets.NetworkStream.EndRead(IAsyncResult asyncResult)
2016-05-19 12:11:34:  0:    --- End of inner exception stack trace ---
2016-05-19 12:11:34:  0:    at System.Net.Sockets.NetworkStream.EndRead(IAsyncResult asyncResult)
2016-05-19 12:11:34:  0:    at Deadline.Net.DeadlineMessageUtils.a.d(IAsyncResult A_0)
2016-05-19 12:11:34:  0:    at System.Net.LazyAsyncResult.Complete(IntPtr userToken)
2016-05-19 12:11:34:  0:    at System.Threading.ExecutionContext.RunInternal(ExecutionContext executionContext, ContextCallback callback, Object state, Boolean preserveSyncCtx)
2016-05-19 12:11:34:  0:    at System.Threading.ExecutionContext.Run(ExecutionContext executionContext, ContextCallback callback, Object state, Boolean preserveSyncCtx)
2016-05-19 12:11:34:  0:    at System.Threading.ExecutionContext.Run(ExecutionContext executionContext, ContextCallback callback, Object state)
2016-05-19 12:11:34:  0:    at System.Net.ContextAwareResult.Complete(IntPtr userToken)
2016-05-19 12:11:34:  0:    at System.Net.Sockets.BaseOverlappedAsyncResult.CompletionPortCallback(UInt32 errorCode, UInt32 numBytes, NativeOverlapped* nativeOverlapped)
2016-05-19 12:11:34:  0:    at System.Threading._IOCompletionCallback.PerformIOCompletionCallback(UInt32 errorCode, UInt32 numBytes, NativeOverlapped* pOVERLAP)
2016-05-19 13:16:10:  Updating outdated script: FontSync
2016-05-19 13:16:10:  Updating outdated script: Scanline
2016-05-19 13:16:10:  Updating outdated script: Salt
2016-05-19 13:16:10:  Updating outdated script: Puppet
2016-05-19 13:16:10:  Updating outdated script: FTrack
2016-05-19 13:16:10:  Updating outdated script: DraftEventPlugin
2016-05-19 13:16:11:  Updating outdated script: NIM
2016-05-19 13:16:11:  Updating outdated script: Shotgun
2016-05-19 13:16:11:  Enabling render stat mail
2016-05-19 13:16:11:  Loading event plugin Scanline (\\inferno2\deadline\repository8\events\Scanline)
2016-05-19 13:16:12:  Loading event plugin DraftEventPlugin (\\inferno2\deadline\repository8\events\DraftEventPlugin)
2016-05-19 16:49:06:  0: Task timed out -- canceling current task...
2016-05-19 18:26:16:  Updating outdated script: Shotgun
2016-05-19 18:26:16:  Updating outdated script: NIM
2016-05-19 18:26:16:  Updating outdated script: DraftEventPlugin
2016-05-19 18:26:29:  Updating outdated script: FTrack
2016-05-19 18:26:29:  Updating outdated script: Puppet
2016-05-19 18:26:29:  Updating outdated script: Salt
2016-05-19 18:26:29:  Updating outdated script: Scanline
2016-05-19 18:26:30:  Updating outdated script: FontSync
2016-05-19 18:26:30:  Enabling render stat mail
2016-05-19 18:26:30:  Loading event plugin Scanline (\\inferno2\deadline\repository8\events\Scanline)
2016-05-19 18:26:31:  Loading event plugin DraftEventPlugin (\\inferno2\deadline\repository8\events\DraftEventPlugin)

Been hanging like that for a day, no timeouts kicked in
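
For anyone triaging a similar hang, here is a rough sketch of how the log above could be scanned for the two warning signs it shows: an "Unhandled Exception" line and long silent stretches between entries. It is not part of Deadline. The function names, the 30-minute threshold, and the assumption that every entry starts with a `YYYY-MM-DD HH:MM:SS:` timestamp are illustrative guesses based only on the excerpt in this post.

```python
import re
from datetime import datetime, timedelta

# Assumed line shape, taken from the excerpt in this thread:
# "2016-05-19 10:51:03:  0: INFO: Preparing ray server..."
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}):\s+(.*)$")


def parse_line(line):
    """Return (timestamp, message), or None if the line has no timestamp prefix."""
    m = LINE_RE.match(line)
    if not m:
        return None
    return datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S"), m.group(2)


def find_unhandled_exceptions(lines):
    """Collect (timestamp, message) for every entry mentioning 'Unhandled Exception'."""
    hits = []
    for line in lines:
        parsed = parse_line(line)
        if parsed and "Unhandled Exception" in parsed[1]:
            hits.append(parsed)
    return hits


def find_gaps(lines, threshold=timedelta(minutes=30)):
    """Collect (previous, next) timestamp pairs where the log was silent longer than threshold."""
    gaps = []
    prev = None
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        ts = parsed[0]
        if prev is not None and ts - prev > threshold:
            gaps.append((prev, ts))
        prev = ts
    return gaps
```

To try it on a saved log, something like `find_gaps(open("slave.log").read().splitlines())` would do, where `slave.log` is a placeholder file name. The threshold would need tuning per farm, since long frames can legitimately go quiet for a while.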

slave version is 8.0.0.67

The machine has a secondary slave that has also been hanging for 23 hours now. Attached are its recent logs.
hanginglog.zip (12 KB)

Yeah, this seems to be another instance of the Sandbox issues you guys are experiencing. From what I understand, Ryan G has been able to reproduce this kind of thing, we just need to find the source of the actual problem and get it fixed.
