我有一个在 kubernetes 上运行的 torchserve 服务,我已经能够在端口 8082 上使用它跟踪指标。我的问题是,从 kubernetes pod 我可以看到它记录硬件指标,例如:
[INFO ] pool-3-thread-2 TS_METRICS - CPUUtilization.Percent
[INFO ] pool-3-thread-2 TS_METRICS - DiskAvailable.Gigabytes
[INFO ] pool-3-thread-2 TS_METRICS - GPUMemoryUtilization.Percent
虽然,如果我检查我目前正在抓取的指标,我只能看到:
# TYPE ts_inference_requests_total counter
ts_inference_requests_total 144.0
ts_inference_requests_total 20.0
# HELP ts_inference_latency_microseconds Cumulative inference duration in microseconds
# TYPE ts_inference_latency_microseconds counter
ts_inference_latency_microseconds 6.051944813839998E8
ts_inference_latency_microseconds 4.7464253726E7
# HELP ts_queue_latency_microseconds Cumulative queue duration in microseconds
# TYPE ts_queue_latency_microseconds counter
ts_queue_latency_microseconds 2633867.5069999998
ts_queue_latency_microseconds 1080.43
是否也可以抓取记录在 kubernetes 上的指标? 感谢您的帮助!