File Storage

File Storage#

Understanding Storage on DesignSafe

In high-performance computing (HPC), every job either starts, ends, or both starts and ends with data. Whether you’re running a small script or a large-scale simulation, your workflow involves reading input files and writing output results.

Understanding where that data lives — and how quickly it can be accessed or saved — is critical to optimizing both your job performance and your research productivity. Choosing the right storage location for each phase of your workflow helps avoid unnecessary slowdowns, data loss, or needless file transfers.

Why File Storage Matters in HPC#

When running jobs on HPC systems like Stampede3, data input/output (I/O) — reading and writing files — is often the slowest part of the workflow. Even with powerful CPUs and GPUs, your job can be bottlenecked if it is waiting on file access.

Optimizing where and when your data is accessed can significantly improve both compute performance and researcher efficiency. Selecting the correct storage system — whether it’s Corral, a compute system directory, or node-local scratch — ensures your simulations and analyses run smoothly.

Storage Systems Overview#

DesignSafe provides several storage areas, each serving a distinct purpose:

Storage Type	Description
MyData	Personal storage; your private “home” space.
MyProjects	Project-specific collaborative storage shared among team members.
Work	High-performance project workspace for HPC jobs (fast, but not backed up).
CommunityData	Public datasets and shared examples (read-only).
Published	Published datasets (NHERI, NEES), curated and citable (read-only).

Note: CommunityData and Published are read-only and cannot be used for saving working files.

Additional system-level directories (e.g., Home and Scratch on Stampede3) exist and are covered in the Compute Systems section.

DesignSafe Storage Path Examples#

JupyterHub

Mounted paths, accessible directly in the notebook file browser. In JupyterHub, all storage systems have the same base path, making it very practical.

Type	Example Path
MyData	/home/jupyter/MyData/
Work	/home/jupyter/Work/stampede3/
Community	/home/jupyter/CommunityData/
MyProjects	/home/jupyter/MyProjects/PRJ-…
NHERI Published	/home/jupyter/NHERI-Published/PRJ-…
NEES Published	/home/jupyter/NEES/

Stampede3

Traditional HPC with absolute UNIX paths. These are the paths you’ll use when:
- SSH’ing into Stampede3
- Writing batch scripts or Tapis job submissions
Type

Example Path

Home

/home1/yourgroupid/jdoe/

Work

/work2/yourgroupid/jdoe/stampede3/

Scratch

/scratch/yourgroupid/jdoe/

To confirm your actual paths on the system:
```
cd $HOME && pwd       # → /home1/05072/silvia
cd $WORK && pwd       # → /work2/05072/silvia/stampede3
cd $SCRATCH && pwd    # → /scratch/05072/silvia
```

Type	Example Path
Home	/home1/yourgroupid/jdoe/
Work	/work2/yourgroupid/jdoe/stampede3/
Scratch	/scratch/yourgroupid/jdoe/

File Storage

Contents

File Storage#

Why File Storage Matters in HPC#

Storage Systems Overview#

DesignSafe Storage Path Examples#