Published on : 2017-05-01 06:29:36
apt-get install torque-server torque-client torque-mom torque-pam

Installing the packages also sets up torque with a default setup that is in no way helpful.

edu:
                                                                         Req'd  Req'd   Elap
Job ID               Username Queue    Jobname          SessID NDS   TSK Memory Time  S Time
-------------------- -------- -------- ---------------- ------ ----- --- ------ ----- - -----
31.

Systems Engineer WeatherData Service Inc An Accuweather Company

Naveed Near-Ansari wrote: Hi, I would like to make all nodes in my cluster submission hosts.

pbs OR qstat -a immediately after submitting your job, you should get something like the following:

$> qstat -a raven-srv.

To help myself if I ever need to do this again, and to help anyone else in the same situation, I’ll detail below what I did.

DOMAIN > /var/spool/torque/server_priv/acl_svr/acl_hosts
echo [email protected]

04 LTS [Update Nov 2016: I have since confirmed that this method works without change on 16.

I’ll write about that process once it’s done.

Installing Torque/PBS job scheduler on Ubuntu 14.

You can find information on this in the psub section of the Raven Cluster Tutorial page.

Anyway, the problem at hand was the installation of the Torque/PBS job scheduler on an Ubuntu 14

ruserok failed validating.

DOMAIN’ with your box’s fully-qualified domain name [Note: see just below if your machine doesn’t have an official FQDN].

00,ncpus=1,physmem=2057576kb,availmem=3640588kb,totmem=3741024kb, idletime=62424,nusers=1,nsessions=1,sessions=7572,uname=Linux raven1 2.

In fact, the most-visited posts on this blog tend to be just those.

Otherwise, the next step is to start the scheduler.

Eventually, job submission would be extended to other machines, adding them also as compute nodes on additional queues.

Note that you’ll need authorised SSH keys set up for this user to allow password-less ssh.

DOMAIN > /etc/torque/server_name
echo SERVER.

This will not work if you do that, as the comparison is done after truncating the name of the submitting host.
You see, every time I encounter a problem with a non-obvious solution, I like to write a blog post about it.

A cluster is nothing without some compute nodes, so next we tell the server process that the box itself is a compute node (with 4 cores, below; change this to suit your requirements).

Your /home/hosts file should never contain entries for more than the 32 raven compute nodes.

The following also sets up the server process to allow user ‘root’ to change configurations in the database.

edu

By default, job submission is allowed only on the TORQUE server host (the host on which pbs_server is running).

04 LTS box [Note: this has been confirmed to work without change on 16.

These emails will list nodes to temporarily remove from your hosts file, etc.

I prefer to use FQDNs so that it’s easier later to add other compute nodes, job submission nodes, etc.

DOMAIN

The FQDN itself can be anything you want, but ideally choose something that cannot exist in reality, so something with a non-existent top-level domain.

d/torque-server stop
pbs_server -t create

You’ll need to answer ‘yes’ here to overwrite the existing database.
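The file-writing steps mentioned above (setting the server name, the allowed hosts, and declaring the box itself as a 4-core compute node) could be sketched roughly as follows. This is only an illustration under stated assumptions: it writes into a scratch directory (the hypothetical `STAGE` variable) rather than the live system, `SERVER.DOMAIN` is the placeholder FQDN from the text, and the `server_priv/nodes` path with its `np=` syntax is my assumption about where Torque reads its node list, since the surviving text does not show it.

```shell
#!/bin/sh
# Sketch only: stage the Torque config files in a scratch directory so they
# can be reviewed before being copied into place as root.
# FQDN, NP and STAGE are hypothetical variables introduced for illustration.
FQDN="${FQDN:-SERVER.DOMAIN}"      # placeholder; replace with your box's FQDN
NP="${NP:-4}"                      # number of cores this node offers
STAGE="${STAGE:-$(mktemp -d)}"     # scratch root standing in for "/"

mkdir -p "$STAGE/etc/torque" \
         "$STAGE/var/spool/torque/server_priv/acl_svr"

# Paths below (minus the $STAGE prefix) are the ones named in the text.
echo "$FQDN" > "$STAGE/etc/torque/server_name"
echo "$FQDN" > "$STAGE/var/spool/torque/server_priv/acl_svr/acl_hosts"

# Assumption: the node list lives in server_priv/nodes, one "host np=N" per line.
echo "$FQDN np=$NP" > "$STAGE/var/spool/torque/server_priv/nodes"

echo "Staged Torque config under $STAGE"
```

Staging the files first makes it easy to inspect them before touching the real `/etc/torque` and `/var/spool/torque` trees, which need root access and a stopped server.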

