Any views expressed within media held on this service are those of the contributors, should not be taken as approved or endorsed by the University, and do not necessarily reflect the views of the University in respect of any particular issue.

Computing Systems

Computing Systems

Informatics Computing Staff jottings

Long Running Jobs and AFS

As most of you will be aware, access to the School’s AFS file system requires that the user be in possession of a valid Kerberos ticket. Most of the time, this is handled behind the scenes and doesn’t cause any problems. Default Kerberos ticket is only valid for 18 hours though and this can cause problems when users attempt to run jobs for longer than 18 hours which require access to AFS space. Once the 18 hours is up, the Kerberos ticket associated with the job expires and the job loses access to the file system. This is probably not what you want.

Fortunately, there are ways around this. The tickets issued to Informatics users can be renewed for up to 28 days using a program called krenew. For jobs which need to run for even longer than this, the k5start program can use information held in a local file on a given host to obtain Kerberos tickets indefinitely. None of this is straightforward to do however and it is all too easy to make a minor error on the command line which leads to a job failing 18 hours later. Waiting 18 hours to see if something works makes for an awfully long run/debug/fix loop.

To simplify the lives of our users, we have written a wrapper script called longjob, now available on all DICE hosts, which takes care of much of the minutia of setting up long running jobs. Given an indication of how long a job is expected to last, the script will check whether suitable Kerberos tickets are in place, prompting the user for their Kerberos password if necessary to obtain new tickets, and then start the job. There is a man page which prospective users are encouraged to study and User Support will of course be happy to answer any questions about this script and indeed about long-running jobs in general.

css.php

Report this page

To report inappropriate content on this page, please use the form below. Upon receiving your report, we will be in touch as per the Take Down Policy of the service.

Please note that personal data collected through this form is used and stored for the purposes of processing this report and communication with you.

If you are unable to report a concern about content via this form please contact the Service Owner.

Please enter an email address you wish to be contacted on. Please describe the unacceptable content in sufficient detail to allow us to locate it, and why you consider it to be unacceptable.
By submitting this report, you accept that it is accurate and that fraudulent or nuisance complaints may result in action by the University.

  Cancel