Category: Instructional
-
Portable Distributed Scripts (PoDS)
Run many independent serial jobs concurrently
Some users may want to run a large set of sequential jobs, such as post-processing or data archiving jobs, on Discover. Portable Distributed Scripts (PoDS) is a set of scripts created by SSSO that enables users to execute a series of independent, sequential jobs concurrently on Discover’s multi-core nodes.…
-
Discover SCU Hardware Specifications
The following page contains hardware specifications for each of the Scalable Units in the Discover cluster:
Scalable Unit 16 – CPU-only Nodes
Scalable Unit 16 – CPU & GPU Nodes
Scalable Unit 17 – CPU-only Nodes
Scalable Unit 18 – CPU-only Nodes
-
SSH Fingerprints
Bastion Service (login.nccs.nasa.gov) Fingerprint
MD5 [RSA] f1:12:60:c2:90:50:9e:5a:18:5b:54:f0:4f:25:6b:60
SHA256 [RSA] AXiAaDwklxb1qoBFoEMPZ/+EE/IB9j2mp6TY44W1CF0
SHA1 [RSA] mjFi13dUiHFs8Wly2hr9U3AKm4g
MD5 [ED25519] 4d:28:91:64:25:eb:ff:f8:7d:44:f1:9c:a1:f7:cb:6c
SHA256 [ED25519] j+kLZsJOuPV3WVY5PzyO/rfpdUw4bLOBqUMCa3wdsuE
SHA1 [ED25519] vthL2pN7Lir+dy6Z/ucdsJgCgLM
MD5 [ECDSA] fe:1a:d1:0e:3e:4e:02:c3:f1:27:66:95:05:18:f2:28
SHA256 [ECDSA] k1iG4MEErAzVClt9iDq9pFFpWfDcNvpQb7PiqdxMLwg
SHA1 [ECDSA] 1Jx6Qm9m5FjKW1P2mpC2ox6RerY
Discover login for the NASTRAN Project (discover-nastran.nccs.nasa.gov)
MD5 [ED25519] 94:1a:33:31:34:2a:a0:22:08:58:e1:0e:e3:ef:60:21
SHA256 [ED25519] agk8tUw+vDzSALdA8tySnBwAwl164yNg+7rEBf7uca4
MD5 [ECDSA] 6e:d0:c7:68:47:ae:3f:e3:0b:ac:f9:26:b8:ed:fe:89
SHA256 [ECDSA] mhgWThLC/tXwBMp7KbRvY8/xuc8i9aKlKWk4Is3Y+3M
MD5 [RSA] 05:85:dd:d2:f7:b8:b7:38:14:f1:92:7f:24:9d:05:4c
SHA256 [RSA]…
-
Show Quota Discover
The HOME and NOBACKUP file systems are controlled by quotas. To determine your resource usage and how it compares to your quota, run the showquota command. Use “showquota -h” to display your Discover quota with units (GB, MB, etc.). The -f option will also show the fileset name, and the -g option will show group ownership.…
-
Discover Quality of Service Details
Slurm’s Quality of Service (QoS) feature controls resource limits for every job in the Discover job queue. Available QoSs in the table below apply only to jobs submitted to the Slurm default partition. (It is important for maximum adaptability of your job scripts that you not specify any partition if you wish to use the…
-
Killing Jobs Using Slurm
Cancel a pending or running job
To delete a job, use “scancel” followed by the job ID.
Cancel all of your pending and running jobs
To delete all of your jobs across all partitions at once, for instance if they were submitted by mistake, use “scancel” with the --user option, which terminates all of your jobs, both pending and running.
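The two cancellations described above can be sketched as a small dry-run script. This is only an illustration: the helper names (run, cancel_job, cancel_all_my_jobs), the DRY_RUN switch, and the job ID 12345678 are hypothetical and not part of Slurm or of the NCCS documentation. Only the scancel command and its --user option come from Slurm itself.

```shell
# Dry-run sketch: with DRY_RUN=1 (the default here) the commands are only
# printed. Set DRY_RUN=0 on a system with Slurm to actually run scancel.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Cancel one pending or running job, given its job ID.
cancel_job() {
    run scancel "$1"
}

# Cancel all jobs owned by the calling user, both pending and running.
cancel_all_my_jobs() {
    run scancel --user="$(id -un)"
}

cancel_job 12345678      # 12345678 is a placeholder job ID
cancel_all_my_jobs
```

In dry-run mode the script prints the two scancel command lines it would execute, which makes it easy to check them before cancelling anything for real.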
-
Sbatch Options
The following table can be used as a reference for the basic flags available to the sbatch, salloc, and a few other commands. To get a better understanding of the commands and their flags, please use the “man” command while logged into Discover. For more information on sbatch, please refer to the man pages. Use the…
-
Slurm Example Scripts
Serial Job Script
By default, Slurm executes your job from the directory where you submitted it. You can change the working directory by using “cd” in the script, or by specifying the --chdir option (called --workdir in older Slurm releases) for SBATCH.
OPENMP Job Script
Note: The option “--cpus-per-task=n” advises the Slurm controller that ensuing job steps will require “n”…
-
Srun Environment Variables
The following information is largely replicated from SchedMD’s srun man page, and is the subset that is likely most relevant to most NCCS users. The srun command honors the following environment variables, when present (these override any inline directives within your batch script, but will be overridden by those also specified on the srun command…
-
Salloc Environment Variables
The following information is largely replicated from SchedMD’s salloc man page, and is the subset that is likely most relevant to most NCCS users. The salloc command honors the following environment variables, when present (these override any inline directives within your batch script, but will be overridden by those also specified on the salloc command…

