Usage

The following describes the basic usage of HPCpy.

Getting a client object

As the package aspires to be "scheduler agnostic" from the outset, the recommended way to get a client object suitable for the HPC you are on is to use the get_client() method as follows:

from hpcpy import get_client
client = get_client()

This will return the most-likely client object based on the submission commands available on the system, raising a NoClientException if no host scheduler is detected. In the case of the factory being unable to return an appropriate client object (or if you need to be explicit), you may import the client explicitly for your system:

from hpcpy import PBSClient, SLURMClient
client_pbs = PBSClient()
client_slurm = SLURMClient()

Note

When using this approach you are bypassing any auto-detection of the host scheduler.

Submit

The simplest way to submit a pre-written job script is via the submit() command, which executes the appropriate command for the scheduler. Submitting a job will return a Job object from which you can interact with the scheduler.

HPCpy (Python)PBSSLURM

job = client.submit("/path/to/script.sh")

qsub /path/to/script.sh

sbatch /path/to/script.sh

Environment Variables

HPCpy (Python)PBSSLURM

job = client.submit(
    "/path/to/script.sh",
    variables=dict(a=1, b="test")
)

qsub -v a=1,b=test /path/to/script.sh

# Variables exported to the environment
sbatch /path/to/script.sh

Note

All environment variables are passed to the job as strings WITHOUT treatment of commas.

Script templates

Script templates can be used to generalise a single template script for use in multiple scenarios (i.e. different scheduling systems).

template.sh

#!/bin/bash
echo "{{message}}"

submit.py

job = client.submit(
    "/path/to/template.sh",
    render=True, # Note, this is False by default

    # Additional key/value pairs are added to rendering context
    message="Hello World."
)

This will do two things:

The template will be loaded into memory, rendered, and written to a temporary file at $HOME/.hpcpy/job_scripts (these are periodically cleared by hpcpy).
The rendered jobscript will be submitted to the scheduler.

Job script rendering has full access to the Jinja2 template rendering system and may be as simple or as complex as needed.

If you want to check the output of a rendered template prior to actually submitting the job, you may access the private method to write the rendered job script without submitting it.

For example:

job_script_filepath = client._render_job_script(
    "template.sh",
    message="Hello World!"
)

Status

To access the status of a job on the scheduler, simply call the status() method on the Job object:

HPCpy (Python)PBSSLURM

status = job.status()

qstat -f -F json $JOB_ID

squeue -j $JOB_ID

You may also access the status directly through the client, if you have the job_id readily at hand via the following command:

HPCpy (Python)

status = client.status(job_id)

The status will be a character code as listed in constants/__init__.py, however, certain shortcut methods are available for the most common queries.

# Check if the job is queued, using the client
client.is_queued(job_id)

# Check if the job is running, using the client
client.is_running(job_id)

More shorthand methods will be made available as required.

Note

All status related commands will poll the underlying scheduler; please be mindful of overloading the scheduling system with repeated, frequent calls.

Delete

Deleting a job is accomplished by calling the delete method on either the client or the job object:

HPCpy (Python)PBSSLURM

# For when you have the ID
client.delete(job_id)

# For when you are using the job object
job.delete()

qdel $JOB_ID

scancel $JOB_ID

Task dependence

HPCpy implements a simple task-dependence strategy at the scheduler level, whereby, we can use scheduler directives to make one job dependent on another.

HPCpy (Python)PBS

job1 = client.submit("job1.sh")
job2 = client.submit("job2.sh")

# depends_on accepts a Job ID `str`, `Job()` object, or a list containing either.
job3 = client.submit("job3.sh", depends_on=[job1.id, job2])

JOB1=$(qsub job1.sh)
JOB2=$(qsub -W depend=afterok:$JOB1 job2.sh)

Note

The depends_on accepts a Job ID str, Job() object, or a list containing either to maximise utility.

Consider the following snippet:

from hpcpy import get_client
client = get_client()

# Submit the first job
first_job = client.submit("job.sh")

# Submit some interim jobs all requiring the first to finish
jobs = list()
for x in range(3):
    jobx = client.submit("job.sh", depends_on=first_job)
    job_ids.append(jobx)

# Submit a final job that requires everything to have finished.
job_last = client.submit("job.sh", depends_on=jobs)

This will create 5 jobs:

1 x starting job
3 x middle jobs (which depend on the first)
1 x finishing job (which depends on the middle jobs to complete)

Essentially demonstrating a "fork and join" example.

More advanced graphs can be assembled as needed, the complexity of which is determined by your scheduler.