Job Scheduling

Job Scheduling#

TACC Scheduling & Policy Considerations

TACC uses a fair-share priority system to schedule jobs across all its users. This system aims to maximize both fairness and overall system efficiency, ensuring that compute resources are available to as many researchers as possible.

Recommendations for designing your workloads#

Is it better to submit many small jobs or a few large jobs?

It depends on your specific workload and goals:

When to prefer many small jobs	When to prefer fewer large jobs
- Parameter sweeps / independent simulations	- Large tightly coupled parallel jobs (MPI)
- Monte Carlo or uncertainty quantification	- When memory or compute needs exceed small job limits
- Tasks that don’t need to talk to each other

Advantages of many small jobs:
- Often start sooner, fit into open slots on the scheduler.
- Lower risk of long queue times if cluster is busy.
- Easier to rerun if one fails.
Advantages of fewer large jobs:
- Necessary for problems that require all processors working together (large finite element models, CFD, etc.).
- Potentially more efficient due to reduced I/O overhead between jobs.

Best practices before submitting your jobs#

Following these practices helps your jobs run faster, use fewer resources, and keep the system fair for everyone:

Summary#

TACC’s fair-share system means your priority adjusts over time based on your usage and the needs of others.
Smaller, shorter jobs generally get scheduled faster because they help fill in gaps in cluster availability.
Always design your computational approach with flexibility in mind — break large studies into multiple jobs when possible, but use large jobs only when the problem’s architecture demands it.

Job Scheduling

Contents

Job Scheduling#

How the fair-share scheduler works#

Recommendations for designing your workloads#

Best practices before submitting your jobs#

Summary#