AWS Thinkbox Discussion Forums

Slaves not responding to Power Management commands

Hi,

I’ve got two slaves on my render farm that refuse to accept power management commands when on. They will power up fine and accept render tasks but when the slave is running I can’t tell it to cancel tasks or restart.

I did have to flatten these machines a few weeks ago to deal with some install issues so this is perhaps why Pulse or Monitor is unable to talk to it. Is there anything I can do to find out whether commands are getting through?

Deadline 5.1.0.45496 (I will be upgrading to the latest beta today)
Windows 7

Cheers,

Jon

Hi Jon,

Do those slaves have the Deadline Launcher running on them? All remote communication is done through the Launcher, so if it’s not running, that could explain the problem.

Cheers,

  • Ryan

Hi,

Yes, the Launcher is running as I have the nodes automatically running the slave on startup. It’s when the slave is running it won’t accept management commands.

Could Pulse be the issue? These are computers that were on the farm but perhaps not properly removed when I had to rebuild them. I’m guessing here but could it be trying to send commands to the old computer?

Jon

Thanks for checking that. If you use the Remote Control menu from the Monitor Slave List (while in super user mode), do you get an error when you try to send the Cancel Tasks command? The results should show up in the Remote Command window. If there is an error, please send it to us.

If there is no error (ie: the command was “successful” according to the Remote Command window), go to the actual slave machine that you just sent the command to, and use the Launcher to explore the log folder. Find the most recent Launcher log, and post it.

Thanks!

  • Ryan

Hi,

Sending the command doesn’t get an error or a response from the slave. The log files don’t list any command and it’s just continuing rendering as normal. Do you still need it posted?

Jon

Hmm, I get the sense that you may be looking in the wrong spot. In the Monitor, enter Super User Mode, then find one of the problematic slaves in the Slave List, right-click on it, and select Remote Control -> Cancel Current Tasks. A Remote Command window will popup with the results (see attached image). It should show if it was successful or if there was an error.

Cheers,

Hi,

I don’t actually get a Remote Command Window popping up when I try to cancel task. Nothing acknowledges that action.

Actually, just tried it with a normal slave and that doesn’t do a cancel task either. Just realised none of them probably allow me to cancel task and that it’s never worked! I’ve just never needed to do it. Regardless, all of the other slaves do respond to restart slave, etc.

Jon

Weird. Try selecting Tools -> View Remote Command Status in the Monitor. Maybe it’s not set to open automatically (you can see the option for this in the previous screen shot). You may also want to run these tests with Monitor from the same machine that Pulse is running on, since Pulse is the one that sends the power management commands.

Cheers,

  • Ryan

Hi Ryan,

Ah, that’s it. Looking at the information in that panel, it’s stating the following for the problem slaves:

A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond 192.169.25.46:5042

Jon

Hi Jon,

Can you ping 192.169.25.46 from the machine you’re running the Monitor on?

Also, is it possible there is a firewall or antivirus program running on the slaves that is blocking the communication? If so, you will need to add an exception for the Deadline Launcher.

Cheers,

  • Ryan
Privacy | Site terms | Cookie preferences