DISC Tutorial B - The Makeflow Workflow System

Prerequisites:
  • Familiarity with basic Unix/Linux commands.
  • Ability to use a text editor to create and modify text files.
  • Completed Lecture 4 in the DISC online course.
  • To complete Tutorial B, you will need access to a Linux cluster equipped with a batch system such as HTCondor, PBS, Torque, SGE, or another system supported by Makeflow. If you don't have a cluster, you can run the simple examples on a single server or desktop running Linux.

    Setup for Notre Dame

    1. If using the wireless network, make sure that you are using the eduroam network and not the ND-Guest network.
    2. Connect to a CRC front end node. If you are using Linux or Mac, just open up a terminal and use ssh:
      ssh USERNAME@crcfe01.crc.nd.edu
      
      If you are using a Windows machine, download and install PuTTY and use that to connect to the host condorfe.crc.nd.edu.
    3. Once logged in, you will need to install the Makeflow software in your home directory. The simplest way is to check out the source code and build it, which should only take a minute:
      git clone https://github.com/cooperative-computing-lab/cctools cctools-src
      cd cctools-src
      ./configure
      make
      make install
      
      The software is now installed in $HOME/cctools. To use it directly, you will need to add it to your path. Use the first command if your shell is bash-compatible (bash, sh, zsh), or the second if it is csh/tcsh: (if one fails, just try the other)
      export PATH=$HOME/cctools/bin:$PATH
      setenv PATH $HOME/cctools/bin:$PATH
      
    4. Now, check that makeflow is in your path before proceeding:
      makeflow -v
      
    5. Finally, add the SGE commands to your path. Again, use whichever command works:
      export PATH=/opt/sge/bin/lx-amd64:$PATH
      setenv PATH /opt/sge/bin/lx-amd64:$PATH
      
      And check that you can run qstat:
      qstat
      

    Simple Example

    Let's begin by using Makeflow to run a handful of simulation codes. First, make and enter a clean directory to work in:
    cd $HOME
    mkdir tutorial
    cd tutorial
    
    Now, download this program, which performs a highly "sophisticated" simulation of black holes colliding together:
    wget http://www.nd.edu/~dthain/courses/disc/tutorialB/simulation.py
    
    Try running it once, just to see what it does:
    chmod 755 simulation.py
    ./simulation.py 5
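
    The tutorial's simulation.py is only reachable from the course page. If the download is unavailable, a stand-in with the same command-line shape (one integer run number, text on stdin, results on stdout) will let you follow along. The sketch below is a guess at that interface, not the actual course script:

```python
#!/usr/bin/env python3
# sim_standin.py - hypothetical stand-in for the course's simulation.py.
# Takes one integer argument (the run number) and prints a few lines of
# fake "simulation" output. It ignores stdin, so the makeflow redirection
# from input.txt still works.
import sys

def simulate(run):
    lines = []
    for step in range(3):
        # fake trajectory: the black holes close in as the steps advance
        lines.append("run %d step %d: black holes at distance %d"
                     % (run, step, 100 - 40 * step))
    lines.append("run %d: merger complete" % run)
    return lines

if __name__ == "__main__":
    run = int(sys.argv[1]) if len(sys.argv) > 1 else 0
    for line in simulate(run):
        print(line)
```

    If you use it, chmod 755 the file and substitute its name wherever the examples below name simulation.py.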
    
    Now, let's use Makeflow to run several simulations. Create a file called example.makeflow and paste the following text into it. Each rule names a target file, then a colon, then the files it depends on, followed by a tab-indented command that produces the target:
    input.txt:
    	LOCAL /bin/echo "Simulate Black Holes" > input.txt
    
    output.1: simulation.py input.txt
    	./simulation.py 1 < input.txt > output.1
    
    output.2: simulation.py input.txt
    	./simulation.py 2 < input.txt > output.2
    
    output.3: simulation.py input.txt
    	./simulation.py 3 < input.txt > output.3
    
    output.4: simulation.py input.txt
    	./simulation.py 4 < input.txt > output.4
    
    To run it on your local machine, one job at a time, do this:
    makeflow -j 1 example.makeflow
    
    Note that if you run it a second time, nothing will happen, because all of the files are built:
    makeflow example.makeflow
    makeflow: nothing left to do
    
    Use the -c option to clean everything up before trying it again:
    makeflow -c example.makeflow
    
    Of course, you are running on a machine with multiple cores. If you leave out the -j option, then makeflow will run as many jobs as you have cores:
    makeflow example.makeflow
    
    If the jobs are expected to be long-running, then you can dispatch them to a local batch system such as SGE, Condor, or Torque by using the appropriate command:
    makeflow -T sge example.makeflow
    makeflow -T condor example.makeflow
    makeflow -T torque example.makeflow
    ...
    
    After that completes, examine the output files (output.1 and so on), and you will notice that each job ran on a different machine in the cluster.
    Answer these questions using what you have learned so far:
    1. How long did the workflow take when using a single core? Why?
    2. How much faster did it run when using all cores? Why?
    3. Did it run faster or slower when using the cluster? Why?

    Running Makeflow with Work Queue

    Sometimes, submitting jobs individually to a batch system is not convenient: each job can wait a long time in the queue before receiving service, or you may not have a batch system set up at all. Instead, you can use the Work Queue system to run the jobs. To do this, first start makeflow in Work Queue (wq) mode, using -p 0 to have it listen on any available port:
    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -p 0
    listening for workers on port XXXX.
    ...
    
    You are going to need to have two terminals open at once for the next step, so open up another terminal (or PuTTY session) and line it up next to your first one. (You may have to set your PATH again as noted above.) Then, in the new terminal, start a worker using the same port number:
    work_queue_worker localhost XXXX
    
    Go back to your first shell and observe that the makeflow has finished. Your worker process will stay there for a few minutes until it is sure that Makeflow has finished. Use Control-C to forcibly kill the worker, if you have to.
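
    To see why a single worker is enough to drain the whole queue, it helps to picture the master/worker pattern underneath: the makeflow process listens on a port, and each worker that connects pulls tasks until none remain. The toy below is a conceptual sketch of that handshake only; it is not the real Work Queue protocol, which also ships input and output files between master and worker and manages many workers at once:

```python
#!/usr/bin/env python3
# Toy master/worker exchange over a socket, illustrating the idea behind
# Work Queue. The master listens, a worker connects, and tasks flow one at
# a time until the master says "done".
import socket
import threading

def run_worker(port):
    # connect to the master and process tasks until told to stop
    sock = socket.create_connection(("127.0.0.1", port))
    lines = sock.makefile("r")
    for line in lines:
        cmd = line.strip()
        if cmd == "done":
            break
        # "run" the task: here we just echo a fake result back
        sock.sendall(("finished: " + cmd + "\n").encode())
    sock.close()

def run_master(tasks):
    srv = socket.socket()
    srv.bind(("127.0.0.1", 0))   # port 0: let the OS choose, like makeflow -p 0
    srv.listen(1)
    port = srv.getsockname()[1]
    print("listening for workers on port %d" % port)
    w = threading.Thread(target=run_worker, args=(port,))
    w.start()
    conn, _ = srv.accept()
    rfile = conn.makefile("r")
    results = []
    for t in tasks:
        conn.sendall((t + "\n").encode())      # dispatch one task
        results.append(rfile.readline().strip())  # wait for its result
    conn.sendall(b"done\n")                    # tell the worker to exit
    w.join()
    conn.close()
    srv.close()
    return results

if __name__ == "__main__":
    print(run_master(["simulate 1", "simulate 2", "simulate 3"]))
```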

    Of course, remembering port numbers all the time gets old fast, so try the same thing again, but use the -N option to give makeflow and the worker a matching project name. (Replace MYPROJECT with a name of your choice.)

    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -N MYPROJECT
    listening for workers on port XXXX
    ...
    
    Now open up another shell and run your worker with a project name:
    work_queue_worker -N MYPROJECT
    
    When using a project name, your workflow is advertised to the catalog server, and can be viewed using work_queue_status:
    work_queue_status
    
    Answer these questions using what you have learned so far:
    1. What is the largest/smallest Work Queue application currently running?
    2. Did your workflow run faster or slower on the cluster or via the worker? Why?

    Running Workers on the Cluster via SGE

    Of course, we don't really want to run workers on the head node, which would quickly become overloaded with running jobs. Let's instead start five workers on the cluster using SGE:
    sge_submit_workers -N MYPROJECT 5
    Creating worker submit scripts in dthain-workers...
    Your job 18728 ("worker.sh") has been submitted
    Your job 18729 ("worker.sh") has been submitted
    Your job 18730 ("worker.sh") has been submitted
    Your job 18731 ("worker.sh") has been submitted
    Your job 18732 ("worker.sh") has been submitted
    
    Use the qstat command to observe that they are submitted (and possibly running):
    qstat -u $USER
    job-ID     prior   name       user         state submit/start at     queue                      
    ------------------------------------------------------------------------------------------------
         18728 100.49976 worker.sh  dthain       r     06/02/2016 12:04:45 long@d6copt172.crc.nd.edu
         18729 100.49976 worker.sh  dthain       r     06/02/2016 12:04:47 long@d6copt184.crc.nd.edu
         18730 100.49976 worker.sh  dthain       r     06/02/2016 12:04:47 long@d6copt025.crc.nd.edu
         18731 100.49976 worker.sh  dthain       r     06/02/2016 12:04:48 long@d6copt025.crc.nd.edu
         18732 100.49976 worker.sh  dthain       r     06/02/2016 12:04:48 long@dqcneh084.crc.nd.edu
    
    Now, restart your Makeflow and it will use the workers already running in SGE:
    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -N MYPROJECT
    listening for workers on port XXXX.
    ...
    
    You can leave the workers running there, if you want to start another Makeflow. (Try cleaning up and running again right now.) They will remain until they have been idle for fifteen minutes, then will stop automatically.

    If you add the -d all option to Makeflow, it will display debugging information that shows where each task was sent, when it was returned, and so forth:

    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -N MYPROJECT -d all
    listening for workers on port XXXX.
    

    (Alternate) Running Workers on the Cluster via Condor

    Of course, we don't really want to run workers on the head node, so let's instead start five workers using Condor:
    condor_submit_workers -N MYPROJECT 5
    Creating worker submit scripts in dthain-workers...
    Submitting job(s).....
    5 job(s) submitted to cluster 258192.
    
    Use the condor_q command to observe that they are submitted to Condor:
    condor_q
     ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD       
    258192.0   dthain          5/31 16:03   0+00:00:12 R  0   0.7  work_queue_worker 
    258192.1   dthain          5/31 16:03   0+00:00:12 R  0   0.7  work_queue_worker 
    258192.2   dthain          5/31 16:03   0+00:00:12 R  0   0.7  work_queue_worker 
    258192.3   dthain          5/31 16:03   0+00:00:12 R  0   0.7  work_queue_worker 
    258192.4   dthain          5/31 16:03   0+00:00:11 R  0   0.7  work_queue_worker
    
    Now, restart your Makeflow and it will use the workers already running in Condor:
    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -N MYPROJECT
    listening for workers on port XXXX.
    ...
    
    You can leave the workers running there, if you want to start another Makeflow. They will remain until they have been idle for fifteen minutes, then will stop automatically.

    If you add the -d all option to Makeflow, it will display debugging information that shows where each task was sent, when it was returned, and so forth:

    makeflow -c example.makeflow
    makeflow -T wq example.makeflow -N MYPROJECT -d all
    listening for workers on port XXXX.
    

    Homework Assignment

    Check out the Makeflow Examples Repository and look closely at the BWA example workflow. Set up a "Medium" sized run of BWA, then compare the performance of running it locally and on your cluster. How much faster does the workflow run on your cluster? Why?

    For More Information

    See the Makeflow Web Page for a complete user's manual, man pages, example workflows, and research papers.