AWS Thinkbox Discussion Forums

slave not picking up anything

I have only 1 slave (lapro0205) in the whole 7 repo enabled. All others are offline. 5-6 jobs are available for it to work on.

Yet in the logs i get:

[15:32:48] The license file being used will expire in 66 days.
[15:32:48] Scheduler - Performing Job scan on Primary Pools with scheduling order Pool, Weighted, Balanced
[15:32:48] Scheduler - The 54012ff57a3a9e066c3e60dd limit is maxed out.
[15:32:48] Scheduler - The 541c92233db7593610939189 limit is maxed out.
[15:32:48] Scheduler - The 541cd0783db75935406c6716 limit is maxed out.
[15:32:48] Scheduler - The 541c922b3db7592fa896667f limit is maxed out.
[15:32:48] Scheduler - The 541cd0803db75925bc62acdb limit is maxed out.
[15:32:48] Scheduler - The 53fbd3697a3a9e188466e9eb limit is maxed out.
[15:32:48] Scheduler - Job chooser found no jobs.
[15:32:48] Scheduler - Performing Job scan on Secondary Pools with scheduling order Pool, Weighted, Balanced

When querying the db for where these limits are used:

{ "_id" : "54012ff57a3a9e066c3e60dd", "LastWriteTime" : { "$date" : 1411180207407 }, "Props" : { "Limit" : 32, "RelPer" : -1, "Slaves" : [], "White" : false, "SlavesEx" : [] }, "Name" : "54012ff57a3a9e066c3e60dd", "Holds" : [ "lapro0205" ], "Used" : 1, "StubLevel" : 0, "Type" : 1 }
{ "_id" : "541c92233db7593610939189", "LastWriteTime" : { "$date" : 1411180207417 }, "Props" : { "Limit" : 1, "RelPer" : -1, "Slaves" : [], "White" : false, "SlavesEx" : [] }, "Name" : "541c92233db7593610939189", "Holds" : [ "lapro0205" ], "Used" : 1, "StubLevel" : 0, "Type" : 1 }
{ "_id" : "541cd0783db75935406c6716", "LastWriteTime" : { "$date" : 1411178428416 }, "Props" : { "Limit" : 1, "RelPer" : -1, "Slaves" : [], "White" : false, "SlavesEx" : [] }, "Name" : "541cd0783db75935406c6716", "Holds" : [ "lapro0205" ], "Used" : 1, "StubLevel" : 0, "Type" : 1 }

Something is really weird…? The machine thats trying to dequeue the jobs is holding the stub, yet it says its maxed out?

Since pulse is not running for this repo, the lapro0205 machine itself does periodic housecleaning itself

If i enable another slave, it picks up jobs. But lapro0205 doesn’t.

Hey Laszlo,

This definitely seems to be a bug caused by the Slave crashing while checking out the limit; its stub is kinda in limbo (we have a separate stub DB object in 7, which seems to not have been created).

I thought this would be something covered by the Repo Repair/Stalled detection, but it doesn’t seem to be doing it right now. I’ll get that fixed =)

Cheers,
Jon

Privacy | Site terms | Cookie preferences