使用 kubernetes 的普罗米修斯 Torchserve 指标

问题描述 投票:0回答:0

我有一个在 kubernetes 上运行的 torchserve 服务,我已经能够在端口 8082 上使用它跟踪指标。我的问题是,从 kubernetes pod 我可以看到它记录硬件指标,例如:

[INFO ] pool-3-thread-2 TS_METRICS - CPUUtilization.Percent
[INFO ] pool-3-thread-2 TS_METRICS - DiskAvailable.Gigabytes
[INFO ] pool-3-thread-2 TS_METRICS - GPUMemoryUtilization.Percent

虽然,如果我检查我目前正在抓取的指标,我只能看到:

# TYPE ts_inference_requests_total counter
ts_inference_requests_total 144.0
ts_inference_requests_total 20.0
# HELP ts_inference_latency_microseconds Cumulative inference duration in microseconds
# TYPE ts_inference_latency_microseconds counter
ts_inference_latency_microseconds 6.051944813839998E8
ts_inference_latency_microseconds 4.7464253726E7
# HELP ts_queue_latency_microseconds Cumulative queue duration in microseconds
# TYPE ts_queue_latency_microseconds counter
ts_queue_latency_microseconds 2633867.5069999998
ts_queue_latency_microseconds 1080.43

是否也可以抓取记录在 kubernetes 上的指标? 感谢您的帮助!

kubernetes gpu prometheus metrics torchserve
© www.soinside.com 2019 - 2024. All rights reserved.