How can I get the total CPU usage of a SLURM job?

Question (votes: 6, answers: 1)

I am trying to get the total amount of CPU time used by each job. I have found a few promising sacct fields, but which one should I use?

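According to the documentation (https://computing.llnl.gov/linux/slurm/sacct.html), TotalCPU is the sum of SystemCPU and UserCPU, but it does not include child processes. I want a total that does include child processes...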

TotalCPU
    The sum of the SystemCPU and UserCPU time used by the job or job step. The total CPU time of the job may exceed the job's elapsed time for jobs that include multiple job steps. The format of the output is identical to that of the elapsed field.

NOTE: TotalCPU provides a measure of the task's parent process and does not include CPU time of child processes.

The other candidate, cputimeraw, is not documented in the same level of detail:

cputime
    Formatted number of cpu seconds a process was allocated.

cputimeraw
    How much cpu time process was allocated in second format, not formatted like above. 

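I am inclined to use cputimeraw instead of TotalCPU, but I want to make sure it includes the total for all child processes spawned by the job. The documentation does not say anything about child processes one way or the other.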
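One way to compare the two fields is to request both from sacct for a finished job and convert them to seconds. The following is a minimal Python sketch, not a definitive tool. The helper names (`parse_slurm_time`, `parse_sacct_rows`, `query_sacct`) are made up for illustration, and the parser assumes durations look like `DD-HH:MM:SS`, `HH:MM:SS`, or `MM:SS.mmm`. Note that the quoted documentation describes cputime as CPU time the process was *allocated*, so it may be closer to elapsed time multiplied by the allocated CPU count than to measured usage.

```python
import subprocess


def parse_slurm_time(text):
    """Convert a Slurm duration string to seconds.

    Assumed formats: 'DD-HH:MM:SS', 'HH:MM:SS', or 'MM:SS.mmm'.
    An empty field is treated as zero.
    """
    if not text:
        return 0.0
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    parts = [float(p) for p in text.split(":")]
    # Pad missing leading fields (e.g. 'MM:SS' has no hours).
    while len(parts) < 3:
        parts.insert(0, 0.0)
    hours, minutes, seconds = parts
    return days * 86400 + hours * 3600 + minutes * 60 + seconds


def parse_sacct_rows(output):
    """Parse pipe-separated sacct output with the four fields requested below."""
    rows = []
    for line in output.strip().splitlines():
        job_id, total_cpu, cputime_raw, elapsed = line.split("|")
        rows.append({
            "job_id": job_id,
            "total_cpu_s": parse_slurm_time(total_cpu),              # measured user + system time
            "allocated_cpu_s": int(cputime_raw) if cputime_raw else 0,  # allocated CPU seconds
            "elapsed_s": parse_slurm_time(elapsed),
        })
    return rows


def query_sacct(job_id):
    """Run sacct for one job; requires a cluster with Slurm accounting enabled."""
    cmd = [
        "sacct", "-j", str(job_id),
        "--format=JobID,TotalCPU,CPUTimeRAW,Elapsed",
        "--parsable2", "--noheader",
    ]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return parse_sacct_rows(result.stdout)
```

Calling `query_sacct(jobid)` on a login node returns one dictionary per job step, which makes it easy to see whether `total_cpu_s` and `allocated_cpu_s` diverge for a given job.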

Does anyone have any suggestions?

Thanks,

Robert

slurm
1 answer
0
votes

The following command gives a good summary:

seff jobid

Output:

Job ID: jobid
Cluster: cluster
User/Group: doe/clusterusers
State: TIMEOUT (exit code 0)
Nodes: 6
Cores per node: 28
CPU Utilized: 32-01:15:44
CPU Efficiency: 9.54% of 336-00:44:48 core-walltime
Job Wall-clock time: 2-00:00:16
Memory Utilized: 58.76 GB
Memory Efficiency: 8.74% of 672.00 GB
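The figures in this output can be cross-checked by hand. The short Python sketch below uses only the numbers printed above and assumes the `DD-HH:MM:SS` duration format. The helper name `to_seconds` is made up for illustration. It shows that the core-walltime equals nodes × cores per node × wall-clock time, and that the efficiency is CPU Utilized divided by that product.

```python
def to_seconds(text):
    """Convert 'DD-HH:MM:SS' or 'HH:MM:SS' to whole seconds."""
    days, _, clock = text.rpartition("-")
    hours, minutes, seconds = (int(x) for x in clock.split(":"))
    return int(days or 0) * 86400 + hours * 3600 + minutes * 60 + seconds


# Values copied from the seff output above.
nodes, cores_per_node = 6, 28
cpu_utilized = to_seconds("32-01:15:44")
wall_clock = to_seconds("2-00:00:16")

# Core-walltime: every allocated core counted for the full wall-clock time.
core_walltime = nodes * cores_per_node * wall_clock
efficiency = 100 * cpu_utilized / core_walltime

print(core_walltime == to_seconds("336-00:44:48"))  # True
print(f"{efficiency:.2f}%")                         # 9.54%
```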