Temporary/Scratch Storage on Compute nodes#
Each HPC compute node contains a limited amount of dedicated storage and its capacity and performance is different for each type of node. As this storage is local to the compute node, it can offer more predictable performance than writing to the high performance filesystem, /nobackup
, which is shared between all compute nodes
On our previous clusters, such storage is available for use by jobs – mostly under /scratch
and via the directory name contained in the $TMPDIR
environment variable, but also a small amount under /tmp
. However, jobs could not reserve the local space they need and so could fail unexpectedly if another job running on the same node was also writing to local disk.
On ARC3 and ARC4, if a job wishes to use the compute node disk, this should be explicitly requested by the job.
Usage#
Summary:
Use of
/tmp
should be avoided, as it is relatively small and space usage is uncontrolledUse of
/scratch
should be avoided, as this is now the same storage as/tmp
$TMPDIR
within a job refers to a per-job directory which has been assigned dedicated storage for the job
ARC3 local disk specifications#
Note
ssd - refers to a solid state disk, which has excellent performance characteristics for reading and writing
hdd - refers to a mechanical hard disk with a spinning platter, which has good performance characteristics for large sequential reads or writes only
Node Type |
Number of nodes |
Memory |
Local Disk Capacity |
Local Disk Type |
---|---|---|---|---|
Standard |
165 |
128GB |
100GB |
ssd |
High Memory |
2 |
768GB |
800GB |
hdd |
GPGPU K80 |
2 – each with 2 NVIDIA K80 GPUs |
128GB |
800GB |
hdd |
GPGPU P100 |
6 – each with 4 NVIDIA P100 GPUs |
256GB |
800GB |
hdd |
ARC4 local disk specifications#
Guide to the Nodes on ARC4 Node Type |
Number of nodes |
Memory |
Local Disk Capacity |
Local Disk Type |
---|---|---|---|---|
Standard |
149 |
192GB |
128GB |
ssd |
High Memory |
2 |
768GB |
800GB |
ssd |
GPGPU |
3 – each with 4 NVIDIA V100 GPUs |
192GB |
128GB |
ssd |
Requesting local disk#
qsub
options for requesting local disk on ARC3 and ARC4:
Option |
Description |
Default |
---|---|---|
|
Sets the limit of the local disk available under |
1G per core |
|
Specifies the type of local disk that files under
Reading and writing will be much quicker to a solid state drive and this option should be used if you have lots of read and write operations. |
* |
|
Specifies the directory to which the contents of The files will be copied to directory Please note that, for distributed parallel jobs, only the |
No copy by default. |
|
When used in conjunction with the |
false |