Deadline 4 - CIFS repository error [SOLVED]

Hi all.

My network setup is very simple, but I really can’t understand what is wrong with my Deadline setup.

  1. My computers:
    Windows XP - (laptop)
    Linux - OpenSUSE 11.2

On Windows XP I installed the Deadline repository to D:/DeadlineRepository and shared it (it appears as //laptop/DeadlineRepository from the Linux box).

On Linux I mounted it at /DeadlineRepository with this command:
mount -t cifs //laptop/DeadlineRepository /DeadlineRepository -o credentials=/storage/.creds,dir_mode=0777,file_mode=0777

Everything mounted OK, and the share is readable and writable (I can read, write, create, and delete files and folders). So we now have a readable and writable Deadline repository folder.
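A manual read/write check like the one above can be made repeatable with a small script. The following is only a sketch: the thread does not say which file operations the Deadline Slave performs on the repository, so the set of operations probed here (create, append, chmod, utime, rename, delete) is an assumption, and the function name and probe file names are made up for illustration. A clean result does not prove Deadline will work on the mount; a failure only points to an operation worth investigating.

```python
import os


def probe_file_operations(directory):
    """Try common file operations in `directory`.

    Returns a dict mapping operation name to None (success) or an error string.
    The list of operations is a guess, not what Deadline is known to do.
    """
    results = {}
    path = os.path.join(directory, "probe_test.tmp")
    renamed = os.path.join(directory, "probe_test_renamed.tmp")

    def attempt(name, action):
        try:
            action()
            results[name] = None
        except OSError as exc:
            results[name] = str(exc)

    def create():
        with open(path, "w") as handle:
            handle.write("probe\n")

    def append():
        with open(path, "a") as handle:
            handle.write("more\n")

    attempt("create", create)
    attempt("append", append)
    attempt("chmod", lambda: os.chmod(path, 0o666))
    attempt("utime", lambda: os.utime(path, None))
    attempt("rename", lambda: os.rename(path, renamed))
    attempt("delete", lambda: os.remove(renamed))

    # Remove anything left behind if an intermediate step failed.
    for leftover in (path, renamed):
        if os.path.exists(leftover):
            try:
                os.remove(leftover)
            except OSError:
                pass
    return results
```

To use it, call probe_file_operations("/DeadlineRepository") as the same user that runs deadlineslave and print each entry; any value that is not None is the error text for that operation.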

After a reboot, I ran deadlineslave from the command line on the Linux box. It started and everything worked OK; the repository was found.
Then I tried to render a simple Nuke scene, and this is what I saw (see the attached images; sorry for that, but I don’t understand why copy-paste of text doesn’t work in GTK apps and works only for Deadline apps, so I can’t grab the text):


All tasks are active at the same time, but nothing happens. No rendering.
Please help me understand what I am doing wrong.
Many thanks for any answers…


I forgot to say:

Why do I call this a CIFS error?

My next attempt:

  1. I installed the Deadline repository on my small FAT32 drive (the second HDD in the Linux box, mounted at /mnt/win_e). The repository is now at /mnt/win_e/DeadlineRepository.

  2. I started deadlinemonitor and pointed the repository to "/mnt/win_e/DeadlineRepository".

  3. I submitted a Nuke job.

  4. Everything works nicely!

So is this a CIFS problem?
Both //laptop/share and /mnt/win_e/DeadlineRepository are visible, readable, and writable.

What is the difference?

That’s really strange. First, can you enable Slave Verbose Logging? You can do this in the Monitor while in super user mode by selecting Tools -> Configure Repository Options. You’ll find the Slave Verbose Logging option under the Logging section. After enabling it, close the Slave application on the Linux box and requeue all the tasks for that job. Now, start up the Slave again and let it run until the job ends up in this state again.

Once you have reproduced this state, go to the Slave application and select Help -> Explore Log Folder. Then close the slave. Finally, find the most recent slave log in the log folder and attach it to your next post. Hopefully it will contain some information that explains the problem.

I guess one last thing to check would be the permissions for the Deadline Repository folder on your laptop. Just make sure that Everyone has full read/write access. This covers the Sharing tab:
software.primefocusworld.com/sof … On_Windows

And this covers the Security tab:
software.primefocusworld.com/sof … On_Windows

Cheers,

  • Ryan

We use CIFS here for shares without any problems, and we’ve used it to connect to repositories on Windows and Linux machines. I don’t see anything that immediately tells me there is a problem with your CIFS setup…

Thanks for the superfast answer!
Give me 10 minutes, please. I really can’t understand what may be wrong, or what the difference between the two shares is… One works nicely and the other does not…
10 minutes.

The logs are ready…
deadlinemonitor(Livestation)-2010-02-01-0000.log

2010-02-01 21:54:58:  BEGIN - livestation\sst
2010-02-01 21:54:58:  Start-up
2010-02-01 21:54:58:  Deadline Monitor 4.0 [v4.0.0.39717 R]
2010-02-01 21:54:58:  2010-02-01 21:54:57
2010-02-01 21:55:00:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:00:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:00:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:00:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:06:  Enqueing: Submit Nuke Job To Deadline
2010-02-01 21:55:06:  Dequeued: Submit Nuke Job To Deadline
2010-02-01 21:55:17:  Could not send new job notification to Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:22:  Enqueing: &Refresh
2010-02-01 21:55:22:  Dequeued: &Refresh
2010-02-01 21:55:22:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:22:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:22:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:22:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:25:  Enqueing: &Refresh
2010-02-01 21:55:25:  Dequeued: &Refresh
2010-02-01 21:55:25:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:25:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:25:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:25:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:26:  Enqueing: &Refresh
2010-02-01 21:55:26:  Dequeued: &Refresh
2010-02-01 21:55:26:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:26:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:26:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:26:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:27:  Enqueing: &Refresh
2010-02-01 21:55:27:  Dequeued: &Refresh
2010-02-01 21:55:27:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:27:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:28:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:28:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:29:  Enqueing: &Refresh
2010-02-01 21:55:29:  Dequeued: &Refresh
2010-02-01 21:55:29:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:29:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:29:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:29:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:30:  Enqueing: &Refresh
2010-02-01 21:55:30:  Dequeued: &Refresh
2010-02-01 21:55:30:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:30:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:30:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:30:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:31:  Enqueing: &Refresh
2010-02-01 21:55:31:  Dequeued: &Refresh
2010-02-01 21:55:31:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:31:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:31:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:31:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:49:  Enqueing: &Refresh
2010-02-01 21:55:49:  Dequeued: &Refresh
2010-02-01 21:55:49:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:49:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:49:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:49:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:49:  Enqueing: &Refresh
2010-02-01 21:55:49:  Dequeued: &Refresh
2010-02-01 21:55:49:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:49:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:55:50:  Attempting to contact Deadline Pulse (192.168.0.10)...
2010-02-01 21:55:50:  Could not update from Pulse because 'Connection refused (System.Net.Sockets.SocketException)
2010-02-01 21:56:40:  Enqueing: &Delete &Job
2010-02-01 21:56:40:  Dequeued: &Delete &Job
2010-02-01 21:56:41:  DeadlineForms.Controls.JobListView.RemoveJobs {
2010-02-01 21:56:41:  }  [ 9.351 ms  12 KB ]
2010-02-01 21:56:44:  Main window closing
2010-02-01 21:56:44:   Listener Thread - OnConnect: Listener Socket has been closed.
2010-02-01 21:56:44:  Main window closed

deadlineslave(Livestation)-2010-02-01-0000.log

2010-02-01 21:54:39:  BEGIN - livestation\sst
2010-02-01 21:54:39:  Start-up
2010-02-01 21:54:39:  2010-02-01 21:54:38
2010-02-01 21:54:39:  Deadline Slave 4.0 [v4.0.0.39717 R]
2010-02-01 21:54:39:  slave initialization beginning.
2010-02-01 21:54:39:  Slave 'Livestation' has stalled because it has not updated its state in 29.52 m. Performing house cleaning...
2010-02-01 21:54:40:  No stalled slave notification address specified in Repository Options - cannot send notification
2010-02-01 21:54:40:   Info Thread - Created.
2010-02-01 21:54:41:  Could not get license on first attempt because:
2010-02-01 21:54:41:  Cannot find license file.
 The license files (or license server system network addresses) attempted are 
listed below.  Use LM_LICENSE_FILE to use a different license file,
 or contact your software provider for a license file.
Feature:       deadline
Filename:      /usr/local/flexlm/licenses/license.dat
License path:  /usr/local/flexlm/licenses/license.dat:
FLEXnet Licensing error:-1,359.  System Error: 2 "No such file or directory"
For further information, refer to the FLEXnet Licensing documentation,
available at "www.acresso.com".
2010-02-01 21:54:41:  Could not get license on second attempt because:
2010-02-01 21:54:41:  Cannot find license file.
 The license files (or license server system network addresses) attempted are 
listed below.  Use LM_LICENSE_FILE to use a different license file,
 or contact your software provider for a license file.
Feature:       deadline
Filename:      /usr/local/flexlm/licenses/license.dat
License path:  /usr/local/flexlm/licenses/license.dat:
FLEXnet Licensing error:-1,359.  System Error: 2 "No such file or directory"
For further information, refer to the FLEXnet Licensing documentation,
available at "www.acresso.com".
2010-02-01 21:54:45:  starting between task wait - seconds: 2
2010-02-01 21:54:46:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:54:47:   Scheduler Thread - slave initialization complete.
2010-02-01 21:54:47:  Scanning for dropped jobs...
2010-02-01 21:54:47:  Done.
2010-02-01 21:54:47:  Performing Job Repository Scan...
2010-02-01 21:54:47:  Done.
2010-02-01 21:54:47:   Scheduler Thread - performing house cleaning...
2010-02-01 21:54:47:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:54:47:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:54:47:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:54:47:   Scheduler - Job chooser found no jobs.
2010-02-01 21:54:48:  starting between task wait - seconds: 20
2010-02-01 21:55:08:  Scanning for dropped jobs...
2010-02-01 21:55:08:  Done.
2010-02-01 21:55:08:  Performing Job Repository Scan...
2010-02-01 21:55:08:  Done.
2010-02-01 21:55:08:   Scheduler Thread - performing house cleaning...
2010-02-01 21:55:08:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:55:08:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:55:08:   Scheduler - Job chooser found no jobs.
2010-02-01 21:55:09:  starting between task wait - seconds: 20
2010-02-01 21:55:27:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:55:29:  Scanning for dropped jobs...
2010-02-01 21:55:29:  Done.
2010-02-01 21:55:29:  Performing Job Repository Scan...
2010-02-01 21:55:29:  Done.
2010-02-01 21:55:29:   Scheduler Thread - performing house cleaning...
2010-02-01 21:55:29:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:55:29:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:55:30:   Scheduler - Trying to dequeue task: /DeadlineRepository/jobs/999_050_999_5abd8805/Queued/999_050_999_5abd8805_00000_1-4.task
2010-02-01 21:55:30:   Scheduler - Obtaining limit group stubs.
2010-02-01 21:55:30:   Scheduler -   obtaining 999_050_999_5abd8805
2010-02-01 21:55:30:   Scheduler - minor error while trying to dequeue: Invalid parameter (System.IO.IOException)
2010-02-01 21:55:30:   Scheduler - Trying to dequeue task: /DeadlineRepository/jobs/999_050_999_5abd8805/Queued/999_050_999_5abd8805_00001_5-8.task
2010-02-01 21:55:30:   Scheduler - Obtaining limit group stubs.
2010-02-01 21:55:30:   Scheduler -   skipping 999_050_999_5abd8805 because we already have it
2010-02-01 21:55:30:   Scheduler - minor error while trying to dequeue: Invalid parameter (System.IO.IOException)
2010-02-01 21:55:30:   Scheduler - Trying to dequeue task: /DeadlineRepository/jobs/999_050_999_5abd8805/Queued/999_050_999_5abd8805_00002_9-12.task
2010-02-01 21:55:30:   Scheduler - Obtaining limit group stubs.
2010-02-01 21:55:30:   Scheduler -   skipping 999_050_999_5abd8805 because we already have it
2010-02-01 21:55:30:   Scheduler - minor error while trying to dequeue: Invalid parameter (System.IO.IOException)
2010-02-01 21:55:30:   Scheduler - Trying to dequeue task: /DeadlineRepository/jobs/999_050_999_5abd8805/Queued/999_050_999_5abd8805_00003_13-16.task
2010-02-01 21:55:30:   Scheduler - Obtaining limit group stubs.
2010-02-01 21:55:30:   Scheduler -   skipping 999_050_999_5abd8805 because we already have it
2010-02-01 21:55:30:   Scheduler - minor error while trying to dequeue: Invalid parameter (System.IO.IOException)
2010-02-01 21:55:30:   Scheduler - Trying to dequeue task: /DeadlineRepository/jobs/999_050_999_5abd8805/Queued/999_050_999_5abd8805_00004_17-20.task
2010-02-01 21:55:30:   Scheduler - Obtaining limit group stubs.
2010-02-01 21:55:30:   Scheduler -   skipping 999_050_999_5abd8805 because we already have it
2010-02-01 21:55:30:   Scheduler - minor error while trying to dequeue: Invalid parameter (System.IO.IOException)
2010-02-01 21:55:30:   Scheduler - Job chooser found no jobs.
2010-02-01 21:55:30:   Scheduler - Returning limit group stubs not in use.
2010-02-01 21:55:30:   Scheduler -   returning 999_050_999_5abd8805
2010-02-01 21:55:30:  starting between task wait - seconds: 20
2010-02-01 21:55:51:  Scanning for dropped jobs...
2010-02-01 21:55:51:  Done.
2010-02-01 21:55:51:  Performing Job Repository Scan...
2010-02-01 21:55:51:  Done.
2010-02-01 21:55:51:   Scheduler Thread - performing house cleaning...
2010-02-01 21:55:51:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:55:51:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:55:51:   Scheduler - Job chooser found no jobs.
2010-02-01 21:55:52:  starting between task wait - seconds: 20
2010-02-01 21:55:56:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:56:12:  Scanning for dropped jobs...
2010-02-01 21:56:12:  Done.
2010-02-01 21:56:12:  Performing Job Repository Scan...
2010-02-01 21:56:12:  Done.
2010-02-01 21:56:12:   Scheduler Thread - performing house cleaning...
2010-02-01 21:56:12:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:56:12:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:56:12:   Scheduler - Job chooser found no jobs.
2010-02-01 21:56:13:  starting between task wait - seconds: 20
2010-02-01 21:56:25:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:56:33:  Scanning for dropped jobs...
2010-02-01 21:56:33:  Done.
2010-02-01 21:56:33:  Performing Job Repository Scan...
2010-02-01 21:56:33:  Done.
2010-02-01 21:56:33:   Scheduler Thread - performing house cleaning...
2010-02-01 21:56:33:   Scheduler - Contacting Deadline Pulse running on "192.168.0.10", port 5046.
2010-02-01 21:56:33:   Scheduler - Could not get a task from Deadline Pulse because it is not running.
2010-02-01 21:56:33:   Scheduler - Job chooser found no jobs.
2010-02-01 21:56:34:  starting between task wait - seconds: 20
2010-02-01 21:56:48:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:56:52:  OnFormClosing
2010-02-01 21:56:52:  Slave - slave shutdown: normal
2010-02-01 21:56:52:   Info Thread - requesting slave info thread quit.
2010-02-01 21:56:52:   Listener Thread - OnConnect: Listener Socket has been closed.
2010-02-01 21:56:52:   Scheduler Thread - performing house cleaning...
2010-02-01 21:56:52:  Main window closing
2010-02-01 21:56:52:   Scheduler Thread - shutdown complete
2010-02-01 21:56:52:  Waiting for threads to quit.  29 seconds until forced shutdown.
2010-02-01 21:56:53:  Waiting for threads to quit.  28 seconds until forced shutdown.
2010-02-01 21:56:53:  LICENSE-FREE MODE: Repository has two slaves or less - no license required.
2010-02-01 21:56:53:   Info Thread - shutdown complete
2010-02-01 21:56:54:  Waiting for threads to quit.  27 seconds until forced shutdown.
2010-02-01 21:56:56:  OnFormClosing
2010-02-01 21:56:56:  Main window closing
2010-02-01 21:56:56:  Main window closed
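
The interesting lines in the slave log above are easy to miss among the Pulse and license messages. As a rough sketch (not a Deadline tool), the Python below pairs each "Trying to dequeue task" line with the "minor error while trying to dequeue" line that follows it. The regular expressions are based only on the log lines shown in this thread, so other Deadline versions may word these messages differently; the function name is made up for illustration.

```python
import re

# Patterns taken from the log lines quoted in this thread.
TASK_RE = re.compile(r"Trying to dequeue task: (?P<path>\S+)")
ERROR_RE = re.compile(r"minor error while trying to dequeue: (?P<message>.+)$")


def find_dequeue_errors(lines):
    """Return (task_path, error_message) pairs for each failed dequeue attempt."""
    failures = []
    current_task = None
    for line in lines:
        task_match = TASK_RE.search(line)
        if task_match:
            # Remember the task until we see whether it fails.
            current_task = task_match.group("path")
            continue
        error_match = ERROR_RE.search(line.rstrip())
        if error_match and current_task is not None:
            failures.append((current_task, error_match.group("message")))
            current_task = None
    return failures
```

To use it, open the slave log file and pass the file object to find_dequeue_errors. In the log above, every one of the five queued tasks fails with "Invalid parameter (System.IO.IOException)", which points at a file operation on the repository rather than at Pulse or licensing.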

Ryan, I think something is wrong with Windows (as always, however), because

  1. all access is granted, but I see the error anyway.

In other words, tomorrow I must try on another computer.
I am doing some experiments with Deadline because our studio plans to buy a render farm, and I need to show some results.
Thank you for the very fast feedback :)

Strange results:

  1. I moved the repository to an NTFS USB drive and got the same error. I was wrong… This is not a network CIFS error.
  2. I changed the mount options for my USB drive in fstab to umask=0, and everything works nicely. The render process starts.
  3. Now I need to do something about my experimental, stupid, magic Windows and the mount options for it.
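
For FAT and NTFS mounts, the umask mount option is a bitmask of the permission bits that are removed from every file, so umask=0 leaves all of them in place. The Python sketch below just does that arithmetic. The thread does not say what umask the USB drive had before the change, so the 022 value in the usage note is a hypothetical example; the function names are made up for illustration.

```python
import stat


def effective_mode(umask, base=0o777):
    """Permission bits left after a mount-level umask is applied to `base`."""
    return base & ~umask


def describe(umask):
    """Render the resulting file permissions in ls -l style."""
    return stat.filemode(stat.S_IFREG | effective_mode(umask))
```

For example, describe(0) gives "-rwxrwxrwx", while describe(0o022) gives "-rwxr-xr-x". In the second case any user other than the owner of the mount cannot write, which would fit the file access error here if the Slave runs as a different user (an assumption, since the thread does not say).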

Result: this is not a CIFS error, it is a file access error.

Great :)

//laptop/DeadlineRepository /DeadlineRepository/ cifs  credentials=/.cred,file_mode=0777,dir_mode=0777,noperm 0 0

noperm – client does not perform permission checks. Needed if uid and gid are not the same on client and server.
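
If several render nodes need the same fix, it can help to check their fstab entries automatically. The following Python is a minimal sketch, assuming the usual whitespace-separated fstab layout (device, mount point, type, options, dump, pass); the function names are made up for illustration, and the check only reports whether noperm, file_mode, and dir_mode are present on a cifs entry. It does not validate anything else.

```python
def parse_fstab_line(line):
    """Split one fstab line into its fields; return None for comments and blanks."""
    fields = line.split()
    if len(fields) < 4 or fields[0].startswith("#"):
        return None
    options = {}
    for item in fields[3].split(","):
        key, _, value = item.partition("=")
        # Flags without a value (such as noperm) are stored as True.
        options[key] = value if value else True
    return {
        "device": fields[0],
        "mount_point": fields[1],
        "fs_type": fields[2],
        "options": options,
    }


def cifs_permission_report(line):
    """Report the permission-related options of a cifs entry, or None otherwise."""
    entry = parse_fstab_line(line)
    if entry is None or entry["fs_type"] != "cifs":
        return None
    opts = entry["options"]
    return {
        "noperm": "noperm" in opts,
        "file_mode": opts.get("file_mode"),
        "dir_mode": opts.get("dir_mode"),
    }
```

Passing the fstab entry above to cifs_permission_report returns noperm as True with both modes set to "0777"; the same entry without noperm, matching the options of the original mount command, returns noperm as False.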

Problem is solved.

Thanks, Ryan! :)

Cool, this fixes the same issue on Ubuntu 9.10 using SMB shares and Deadline 4.0.