In this manual step, examine the output from the "test teca_cf_restripe" step to (1) make sure it worked as expected, (2) check the size of the files, and (3) check the run time.
You can use information from (2) to determine whether you have adequate scratch space for a run on the full dataset, and information from (3) to choose appropriate SLURM settings. Since teca_cf_restripe tends to scale quite well, I use the following formula to figure out how many cores and nodes to request:
| Variable | Description |
|---|---|
| | Runtime of test run |
| | Number of steps in test run |
| | Number of nodes in test run |
| p | Cores-per-node (in test and production) |
| s | Number of steps in full run (see output from teca_metadata_probe) |
| t | Desired runtime of production run |
| n | Approx. number of cores to use in production (the number for srun -n) |
| N | Number of nodes in production run |
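
As a rough illustration of how the quantities in the table can be combined, here is a minimal Python sketch. It is not necessarily the exact formula used here: it assumes ideal linear scaling, so that the total core-time scales with the number of steps, which gives n ≈ (test cores) × (full steps / test steps) × (test runtime / desired runtime) and N = ceil(n / p). The function name and argument names are hypothetical; only p, s, t, n, and N come from the table.

```python
import math


def estimate_production_resources(test_runtime, test_steps, test_nodes,
                                  cores_per_node, full_steps, desired_runtime):
    """Estimate (n, N) for a production run from a small test run.

    Assumes ideal linear scaling (an assumption, not a measured property).
    test_runtime and desired_runtime must use the same time unit.
    """
    values = (test_runtime, test_steps, test_nodes,
              cores_per_node, full_steps, desired_runtime)
    if any(v <= 0 for v in values):
        raise ValueError("all inputs must be positive")

    # Cores used in the test run (p cores on each test node).
    test_cores = test_nodes * cores_per_node

    # Scale the test cores by the ratio of work (steps) and by the
    # ratio of test runtime to the desired production runtime.
    n_exact = (test_cores * full_steps * test_runtime) / (test_steps * desired_runtime)

    n = max(1, math.ceil(n_exact))         # value for srun -n
    N = math.ceil(n / cores_per_node)      # whole nodes needed to hold n cores
    return n, N


if __name__ == "__main__":
    # Illustrative numbers only, not taken from a real run.
    n, N = estimate_production_resources(
        test_runtime=10, test_steps=100, test_nodes=1,
        cores_per_node=32, full_steps=10000, desired_runtime=60,
    )
    print(f"request about {n} cores on {N} nodes")
```

Because real runs rarely scale perfectly, it is probably safer to treat the result as a lower bound and pad the requested wall time.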
In my test, I had teca_metadata_probe
gives me
It can be worth running the restripe test a few times and for a few different values of