How to get gld_throughput and gst_throughput with nv-nsight-cu-cli


I can't get this to work, and the documentation is hard to follow. I tried the command below, but both metrics are reported as n/a.

root@teja:~/Projs/CUDA/05-Profiling# nv-nsight-cu-cli --device 0 --metrics gst_throughput,gld_throughput ./run 0
==PROF== Connected to process 28170 (/root/Projs/CUDA/05-Profiling/run)
==PROF== Profiling "Init" - 1: 0%....50%....100% - 1 pass
==PROF== Profiling "Transpose_rowRead_colWrite" - 2: 0%....50%....100% - 1 pass
==PROF== Disconnected from process 28170
[28170] run@127.0.0.1
  Init(mat<int>,mat<int>), 2020-May-01 14:35:43, Context 1, Stream 7
    Section: Command line profiler metrics
    ---------------------------------------------------------------------- --------------- ------------------------------
    gld_throughput                                                                                                (!) n/a
    gst_throughput                                                                                                (!) n/a
    ---------------------------------------------------------------------- --------------- ------------------------------

  Transpose_rowRead_colWrite(mat<int>,mat<int>), 2020-May-01 14:35:43, Context 1, Stream 7
    Section: Command line profiler metrics
    ---------------------------------------------------------------------- --------------- ------------------------------
    gld_throughput                                                                                                (!) n/a
    gst_throughput                                                                                                (!) n/a
    ---------------------------------------------------------------------- --------------- ------------------------------
1 Answer

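The metric names changed in the Nsight tools, so the old nvprof names are not recognized. This table maps the old names to the new ones: https://docs.nvidia.com/nsight-compute/2019.5/NsightComputeCli/index.html#nvprof-metric-comparison

The following command worked for me: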

nv-nsight-cu-cli --metrics l1tex__t_bytes_pipe_lsu_mem_global_op_ld.sum.per_second,l1tex__t_sectors_pipe_lsu_mem_global_op_ld.sum,l1tex__t_bytes_pipe_lsu_mem_global_op_st.sum.per_second,l1tex__t_sectors_pipe_lsu_mem_global_op_st.sum ./<program>
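
If you still have scripts that use the old nvprof names, a small wrapper can do the renaming for you. The following is only a sketch: `nvprof_to_nsight` and `build_metrics` are hypothetical helper names (not part of Nsight Compute), and the mapping covers just the two throughput metrics from the question, taken from the comparison table linked above. It prints the command instead of running it, so you can check the result first.

```shell
#!/bin/sh
# Hypothetical helper: translate an nvprof metric name to its Nsight Compute name.
# Only the two metrics from the question are mapped; extend it from the
# nvprof metric comparison table as needed.
nvprof_to_nsight() {
    case "$1" in
        gld_throughput) echo "l1tex__t_bytes_pipe_lsu_mem_global_op_ld.sum.per_second" ;;
        gst_throughput) echo "l1tex__t_bytes_pipe_lsu_mem_global_op_st.sum.per_second" ;;
        *) echo "unknown nvprof metric: $1" >&2; return 1 ;;
    esac
}

# Hypothetical helper: join the translated names with commas, with no trailing comma.
build_metrics() {
    out=""
    for m in "$@"; do
        new=$(nvprof_to_nsight "$m") || return 1
        out="${out:+$out,}$new"
    done
    echo "$out"
}

# Print the resulting command line rather than executing it.
echo "nv-nsight-cu-cli --metrics $(build_metrics gld_throughput gst_throughput) ./<program>"
```

Replace `./<program>` with your own executable before running the printed command.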