Job Workflow

SLURM Job Submission Workflow

  1. Job Submission

    You submit a job with a job script that specifies:

    • Number of nodes and cores (e.g., 32 cores on 4 nodes: --ntasks=32 --nodes=4)

    • Maximum runtime (e.g., 4 hours: --time=04:00:00)

    • Memory requirements (e.g., --mem=16G)

    • Partition (queue) to submit to, which selects the node type (e.g., --partition=skx)

    • Allocation (project) to charge the job to (--account)

    • Submit with: sbatch job_script.sh (a sample script is sketched below)
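
    A minimal job script combining these options might look like the sketch
    below. The partition, account, and application names are placeholders;
    substitute the values used on your system.

      #!/bin/bash
      #SBATCH --job-name=my_job      # Name shown in the queue
      #SBATCH --nodes=4              # Number of nodes
      #SBATCH --ntasks=32            # Total number of tasks (e.g., MPI ranks)
      #SBATCH --time=04:00:00        # Maximum runtime (hh:mm:ss)
      #SBATCH --mem=16G              # Memory per node
      #SBATCH --partition=skx        # Partition (queue); site-specific
      #SBATCH --account=my_project   # Allocation to charge; site-specific

      # Commands below run on the first allocated compute node;
      # srun launches the tasks across all allocated nodes.
      srun ./my_application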

  2. Job Enters the Queue

    • Your job enters the SLURM queue and is assigned a priority.

    • You can monitor its status with SLURM commands such as squeue and sacct (examples below).
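
    For example (<jobID> is a placeholder for the ID printed by sbatch):

      squeue -u $USER      # All of your pending and running jobs
      squeue -j <jobID>    # Status of one specific job
      sacct -j <jobID>     # Accounting record, including finished jobs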

  3. SLURM Schedules the Job

    SLURM decides when and where to run your job based on:

    • Requested resources: Number of nodes, memory, and runtime.

    • Queue priority: System policies may prioritize shorter/smaller jobs.

    • Current system load: Jobs may wait until the required nodes become free (ways to inspect a pending job are shown below).
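
    If your job is pending, SLURM can report the factors behind its priority
    and an estimated start time; the estimate is advisory and changes as the
    queue evolves (<jobID> is a placeholder):

      sprio -j <jobID>             # Priority components for a pending job
      squeue -j <jobID> --start    # Scheduler's estimated start time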

  4. Job Execution

    • Once sufficient resources are available, SLURM starts the job.

    • Your job runs on the assigned compute nodes for the allocated time (you can monitor it while it runs, as shown below).
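
    While the job is running, sstat reports live resource usage; for a batch
    job you typically query the batch step (<jobID> is a placeholder):

      sstat -j <jobID>.batch --format=JobID,AveCPU,AveRSS,MaxRSS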

  5. Job Completion

    When the job finishes:

    • Output and error logs (e.g., slurm-<jobID>.out) are generated.

    • You can check the results and review the job's resource usage (see the commands below).
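
    To review what the job actually used, query the accounting database with
    sacct; seff, a summary script shipped with many SLURM installations, gives
    a quick efficiency report where available (<jobID> is a placeholder):

      sacct -j <jobID> --format=JobID,JobName,Elapsed,State,MaxRSS,ExitCode
      seff <jobID>    # CPU and memory efficiency summary (if installed)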