Job Arrays vs Launcher#

Which is better? Which is faster? Let’s compare them!

Short version: on Stampede3, “sarray” = Slurm job arrays (sbatch --array), and Launcher/PyLauncher = TACC’s parametric job launcher. Neither is globally “better”; each wins in different regimes.

Below is a quick opinionated comparison, then some rules of thumb.


Mental picture#

  • Slurm job array (“sarray”): One Slurm submission, but many independent jobs underneath, each with an index (SLURM_ARRAY_TASK_ID). Stampede3 explicitly supports them. (TACC Documentation)

  • Launcher / PyLauncher job: One Slurm job allocation, inside which TACC’s launcher (or PyLauncher) runs a workpile (command list), feeding tasks to the allocated cores until the list is exhausted. (Texas Advanced Computing Center)

Historically TACC discouraged arrays on some systems and pushed Launcher/PyLauncher instead, but on Stampede3 both arrays and PyLauncher are available. (Scribd)
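
To make the two pictures concrete, the submission side looks roughly like this (job.slurm and launcher.slurm are placeholder names, not TACC conventions):

    # Job array: one sbatch call, 100 independently scheduled jobs
    sbatch --array=1-100 job.slurm

    # Launcher: one sbatch call, one allocation that works through a command list
    sbatch launcher.slurm    # points at a command file; see the fuller sketch below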


Side-by-side comparison#

1. Queue behavior & time-to-solution

Job arrays

  • Each array element is its own schedulable job (shares an ArrayJobId, but has its own JobId). (Slurm Documentation)

  • Slurm can start some tasks early via backfill, others later; they don’t all need to run at once.

  • Overhead: scheduler handles N jobs. If you have thousands of short tasks (say 10–60 s), scheduler/launch overhead and file-system churn become noticeable.

Launcher / PyLauncher

  • You wait in the queue once for a chunk of nodes; once you get them, Launcher or PyLauncher keeps them busy with many tasks until the paramlist is done. (Texas Advanced Computing Center)

  • For tiny tasks (seconds), this usually gives better overall time-to-solution because you:

    • amortize scheduler overhead across many tasks;

    • avoid flooding Slurm with thousands of jobs;

    • can keep nodes saturated even as tasks finish at different times.

  • For long tasks (tens of minutes+), the advantage shrinks; queue behavior dominates.

Rule of thumb

  • Tasks ≥ 10–15 min each → arrays or Launcher both fine.

  • Tasks ≤ a few minutes, thousands of them → Launcher/PyLauncher is usually faster in practice.
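
A back-of-envelope example of why (assuming Stampede3’s 112-core SPR nodes): 5,000 tasks of 60 s each is about 83 core-hours of work. Bundled into one launcher job on two SPR nodes (224 cores), that is roughly 22 minutes of wall time after a single wait in the queue; submitted as an array, the same workload asks Slurm to schedule, launch, and account for 5,000 separate jobs.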

2. Scheduler & policy friendliness

TACC is big on job bundling and minimizing scheduler load; their job-bundling paper explicitly calls out Launcher/PyLauncher and friends for this use case. (ACM Digital Library)

  • Arrays are more scheduler-intensive when you push to large N (Slurm has a MaxArraySize and sites may set limits). (Slurm Documentation)

  • A single Launcher/PyLauncher job with a big command list often makes TACC staff happier for high-throughput/ensemble work.

So: if you’re doing Tapis-style thousands of short OpenSees runs / param sweeps, Launcher/PyLauncher is more aligned with TACC’s “job bundling” philosophy.

3. Implementation complexity

Job arrays

  • Script is simple: one Slurm script, one executable; vary inputs using $SLURM_ARRAY_TASK_ID. (RCC Users)

  • Good when each job looks like: “run this one command with a slightly different input file”.
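
A minimal sketch of that pattern, mapping the array index to one line of a hypothetical inputs.txt (queue name is an assumption):

    #!/bin/bash
    #SBATCH -J sweep
    #SBATCH -p skx             # assumption: Stampede3 SKX queue
    #SBATCH -N 1
    #SBATCH -n 1
    #SBATCH -t 00:30:00
    #SBATCH --array=1-100

    # Array element k runs line k of inputs.txt
    INPUT=$(sed -n "${SLURM_ARRAY_TASK_ID}p" inputs.txt)
    python script.py "$INPUT"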

Launcher / PyLauncher

  • Requires two pieces: a Slurm batch script (which requests the allocation) and a plain-text command file with one task per line.

  • Slightly more moving parts, but:

    • one line per task can be arbitrarily complex (different executables, options, working dirs);

    • PyLauncher has modes for threaded, MPI, GPU tasks, etc. (TACC Documentation)

If your workload is already “one line per run” (e.g., your OpenSees param list), it maps very naturally to Launcher/PyLauncher.
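
Here is a sketch of those two pieces using the classic Launcher module (LAUNCHER_JOB_FILE and paramrun follow the launcher’s documented usage; queue, sizes, and file names are illustrative):

    # commands.txt: one task per line, as heterogeneous as you like
    python script.py case001.in
    python script.py case002.in
    ./other_tool --fast case003.in

    # launcher.slurm
    #!/bin/bash
    #SBATCH -J workpile
    #SBATCH -p skx             # assumption: Stampede3 SKX queue
    #SBATCH -N 2
    #SBATCH -n 96              # 48 tasks per 48-core SKX node
    #SBATCH -t 02:00:00

    module load launcher
    export LAUNCHER_JOB_FILE=commands.txt
    ${LAUNCHER_DIR}/paramrun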

4. Fault tolerance & restart

Job arrays

  • Each array element is an independent job:

    • if element 172 fails, you can just re-submit those indices with --array=172 or --array=172-200. (RCC Users)

  • Great when inputs are noisy and you expect some failures.
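
For example (job ID and script name hypothetical):

    # See which elements of array job 123456 failed
    sacct -j 123456 --state=FAILED --format=JobID,State,ExitCode

    # Re-run only those indices
    sbatch --array=172,184,191-200 job.slurm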

Launcher / PyLauncher

  • Node failure generally kills the entire Slurm job, though the launcher may have completed many tasks already.

  • PyLauncher has restart/replay support via restart files, but you have to set that up. (TACC Documentation)

So: for messy workflows with lots of expected per-task failures, arrays can be simpler to reason about; for clean parameter sweeps, launcher is fine.

5. Resource usage & packing

Job arrays

  • Each element requests its own resources (nodes, tasks, memory).

  • Slurm may pack array elements onto nodes efficiently, but from the user’s POV you don’t directly control “how many array tasks per node” beyond the resource requests.

Launcher / PyLauncher

  • You explicitly pick nodes × tasks, and Launcher multiplexes your commands onto those cores.

  • Easier to:

    • run several light jobs per core or per node (when they’re I/O-bound or low CPU);

    • tune concurrency (e.g., 8 tasks/node on SKX vs 28 on SPR) to match your app’s profile. (TACC Documentation)
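
A sketch of that control knob (LAUNCHER_PPN is the launcher’s per-node concurrency override; the numbers are illustrative, not tuned values):

    #SBATCH -N 4
    #SBATCH -n 32              # allocation shape: 8 tasks per node

    module load launcher
    export LAUNCHER_JOB_FILE=commands.txt
    export LAUNCHER_PPN=8      # assumption: cap the launcher at 8 concurrent tasks per node
    ${LAUNCHER_DIR}/paramrun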


So which should you use on Stampede3?#

Use Slurm job arrays (“sarray”) when:#

  • Each task is moderately heavy (≥ 10–15 min);

  • Every task looks essentially the same: ibrun ./my_mpi_app … or python script.py param;

  • You want:

    • native Slurm features (dependencies, array-task-limited runs with --array=1-1000%32, etc.);

    • easy per-task monitoring, accounting, and failure handling.
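
For instance (job IDs and script names hypothetical):

    # Throttle: 1000 elements, at most 32 running at once
    sbatch --array=1-1000%32 run_case.slurm

    # Dependency: postprocess only after every array element succeeds
    sbatch --dependency=afterok:123456 postprocess.slurm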

Use Launcher / PyLauncher when:#

  • You have lots of small or medium tasks, especially:

    • 10^2–10^5 runs;

    • runtimes from a few seconds to a few minutes;

  • You want to bundle them into one Slurm job to:

    • reduce scheduler and filesystem stress;

    • improve aggregate throughput;

    • keep a node allocation saturated with work;

  • You need flexibility:

    • heterogeneous commands (different apps/args per line);

    • multi-threaded or MPI jobs in the same workpile (PyLauncher). (TACC Documentation)
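
A workpile mixing task types might look like this (all names hypothetical; serial and threaded lines suit the classic launcher, while the MPI modes live in PyLauncher):

    ./preprocess.sh case01
    python analyze.py --case 01
    OMP_NUM_THREADS=4 ./threaded_solver case02.in
    python analyze.py --case 02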


Direct answers to three questions#

  • Which is “better”?

    Policy-wise on Stampede3: for classic HTC/param sweeps, TACC leans toward Launcher/PyLauncher as the recommended tool; arrays are fine but easier to abuse at scale. (ACM Digital Library)

  • Which is “faster”?

    • For short/high-count tasks → Launcher/PyLauncher generally gives better time-to-solution.

    • For long, heavy tasks → performance difference is negligible; pick whichever scripting model is nicer for you.

  • Main differences/advantages in one sentence:

    • Job arrays: many independent Slurm jobs, simple $SLURM_ARRAY_TASK_ID logic, great for medium/long embarrassingly parallel runs with clean per-job accounting.

    • Launcher/PyLauncher: one Slurm job acting as your own mini-scheduler, ideal for bundling huge ensembles of small/medium jobs while being friendly to the scheduler and file system.