How to create a gcloud alerting policy with multiple conditions from the command line

Question (votes: 0, answers: 2)

I am trying to use gcloud to create a single alerting policy with multiple conditions for the Cloud SQL instance_state metric.

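The alert should fire if the instance stays in the "RUNNABLE" or "FAILED" state for more than 5 minutes. I was able to create this policy in the console (screenshot omitted).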

Now I am trying to do the same thing from the command line with this gcloud command:

    gcloud alpha monitoring policies create \
         --display-name='Test Database State Alert ('$PROJECTID')' \
         --condition-display-name='Instance is not running for 5 minutes'\
         --notification-channels="x23234dfdfffffff" \
         --aggregation='{"alignmentPeriod": "60s","perSeriesAligner": "ALIGN_COUNT_TRUE"}' \
         --condition-filter='metric.type="cloudsql.googleapis.com/database/instance_state" AND resource.type="cloudsql_database" AND (metric.labels.state = "RUNNABLE")' OR 'metric.type="cloudsql.googleapis.com/database/instance_state" AND resource.type="cloudsql_database" AND (metric.labels.state = "FAILED")' \
         --duration='300s' \
         --if='> 0.0' \
         --trigger-count=1 \
         --combiner='OR' \
         --documentation='The rule "${condition.display_name}" has generated this alert for the "${metric.display_name}".' \
         --project="$PROJECTID" \
         --enabled

I get the following error for the OR part of the condition:

ERROR: (gcloud.alpha.monitoring.policies.create) unrecognized arguments:
  OR
  metric.type="cloudsql.googleapis.com/database/instance_state" AND resource.type="cloudsql_database" AND (metric.labels.state = "FAILED")

It still fails if I wrap the conditions in ( ), and the || operator fails as well.
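
The error is consistent with how the shell splits the command line before gcloud ever sees it: a closing quote followed by a space ends the argument, so `OR` and the second quoted filter arrive as two extra positional arguments. A minimal sketch of that splitting, using a hypothetical `show_args` function as a stand-in for gcloud and shortened filter strings:

```shell
# Hypothetical stand-in for gcloud: print each received argument on its own line.
show_args() { printf '[%s]\n' "$@"; }

# Same quoting pattern as the failing --condition-filter above (filters shortened).
show_args --condition-filter='state = "RUNNABLE"' OR 'state = "FAILED"'
# [--condition-filter=state = "RUNNABLE"]
# [OR]
# [state = "FAILED"]
```

Only the first quoted string is attached to `--condition-filter`; the other two tokens match the "unrecognized arguments" listed in the error.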

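What is the correct gcloud command? Also, I would like the alerting policy to have the same structure as the one I created in the Cloud Console, described above.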

google-cloud-platform gcloud google-cloud-stackdriver google-cloud-monitoring
2 Answers
1 vote

I was able to use gcloud alpha monitoring policies conditions create to append additional conditions to an existing policy.

gcloud alpha monitoring policies create \
    --notification-channels=projects/qwiklabs-gcp-04-d822dd6cd419/notificationChannels/2510735656842641871 \
    --aggregation='{"alignmentPeriod": "60s","perSeriesAligner": "ALIGN_MEAN"}' \
    --condition-display-name='CPU Utilization >0.95 for 1m'\
    --condition-filter='metric.type="compute.googleapis.com/instance/cpu/utilization" resource.type="gce_instance"' \
    --duration='1m' \
    --if='> 0.95' \
    --display-name=' alert on spikes or consistently high cpu' \
    --combiner='OR'

gcloud alpha monitoring policies list --format='value(name,displayName)'

gcloud alpha monitoring policies conditions create \
    projects/qwiklabs-gcp-04-d822dd6cd419/alertPolicies/1712202834227136574 \
    --aggregation='{"alignmentPeriod": "60s","perSeriesAligner": "ALIGN_MEAN"}' \
    --condition-display-name='CPU Utilization >0.80 for 10m'\
    --condition-filter='metric.type="compute.googleapis.com/instance/cpu/utilization" resource.type="gce_instance"' \
    --duration='10m' \
    --if='> 0.80' 

Repeating the --condition-filter clause did not work for me. YMMV.
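
The policy resource name passed to conditions create above was copied by hand from the list output. A rough sketch of looking it up by display name instead, assuming the `value(name,displayName)` format prints the two fields separated by a tab; `policy_name_for` is a hypothetical helper, not a gcloud command:

```shell
# Hypothetical helper: print the resource name of the policy whose
# display name equals $1, based on the list command used above.
policy_name_for() {
  gcloud alpha monitoring policies list --format='value(name,displayName)' |
    awk -F'\t' -v want="$1" '$2 == want { print $1 }'
}

# Example use (requires gcloud and an existing policy):
#   POLICY_NAME="$(policy_name_for "$DISPLAY_NAME")"
#   gcloud alpha monitoring policies conditions create "$POLICY_NAME" ...
```

If several policies share a display name, the helper prints all of them, so the result is worth checking before appending a condition.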


1 vote

From the documentation for gcloud alpha monitoring policies create, it looks like you can specify repeated (!) occurrences of:

[--aggregation=AGGREGATION --condition-display-name=CONDITION_DISPLAY_NAME --condition-filter=CONDITION_FILTER --duration=DURATION --if=IF_VALUE --trigger-count=TRIGGER_COUNT     | --trigger-percent=TRIGGER_PERCENT]

So I think you need to duplicate the --condition-filter (together with the other condition flags) and keep your --combiner="OR", i.e.

gcloud alpha monitoring policies create \
--display-name='Test Database State Alert ('$PROJECTID')' \
--notification-channels="x23234dfdfffffff" \
--aggregation='{"alignmentPeriod": "60s","perSeriesAligner": "ALIGN_COUNT_TRUE"}' \
--condition-display-name='RUNNABLE'\
--condition-filter='metric.type="cloudsql.googleapis.com/database/instance_state" AND resource.type="cloudsql_database" AND (metric.labels.state = "RUNNABLE")' \
--duration='300s' \
--if='> 0.0' \
--trigger-count=1 \
--aggregation='{"alignmentPeriod": "60s","perSeriesAligner": "ALIGN_COUNT_TRUE"}' \
--condition-display-name='FAILED'\
--condition-filter='metric.type="cloudsql.googleapis.com/database/instance_state" AND resource.type="cloudsql_database" AND (metric.labels.state = "FAILED")' \
--duration='300s' \
--if='> 0.0' \
--trigger-count=1 \
--combiner='OR' \
--documentation='The rule "${condition.display_name}" has generated this alert for the "${metric.display_name}".' \
--project="$PROJECTID" \
--enabled
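
Since the other answer reports that repeated condition flags did not work, another option may be to describe both conditions in a policy file and pass it with the --policy-from-file flag of the same command. The following is a sketch under assumptions, not a verified reproduction of the console policy: the file name `policy.json` and the display names are placeholders, the fields follow the Cloud Monitoring AlertPolicy JSON format, and notification channels are left out (they can be added as a `notificationChannels` list of full channel resource names).

```shell
# Placeholder file name for the policy definition.
POLICY_FILE="policy.json"

# Quoted heredoc delimiter: nothing inside is expanded by the shell.
cat > "$POLICY_FILE" <<'EOF'
{
  "displayName": "Test Database State Alert",
  "combiner": "OR",
  "enabled": true,
  "conditions": [
    {
      "displayName": "RUNNABLE",
      "conditionThreshold": {
        "filter": "metric.type=\"cloudsql.googleapis.com/database/instance_state\" AND resource.type=\"cloudsql_database\" AND metric.labels.state=\"RUNNABLE\"",
        "aggregations": [
          { "alignmentPeriod": "60s", "perSeriesAligner": "ALIGN_COUNT_TRUE" }
        ],
        "comparison": "COMPARISON_GT",
        "thresholdValue": 0,
        "duration": "300s",
        "trigger": { "count": 1 }
      }
    },
    {
      "displayName": "FAILED",
      "conditionThreshold": {
        "filter": "metric.type=\"cloudsql.googleapis.com/database/instance_state\" AND resource.type=\"cloudsql_database\" AND metric.labels.state=\"FAILED\"",
        "aggregations": [
          { "alignmentPeriod": "60s", "perSeriesAligner": "ALIGN_COUNT_TRUE" }
        ],
        "comparison": "COMPARISON_GT",
        "thresholdValue": 0,
        "duration": "300s",
        "trigger": { "count": 1 }
      }
    }
  ]
}
EOF

# Create the policy only when gcloud and PROJECTID are available.
if command -v gcloud >/dev/null 2>&1 && [ -n "${PROJECTID:-}" ]; then
  gcloud alpha monitoring policies create \
    --policy-from-file="$POLICY_FILE" \
    --project="$PROJECTID"
else
  echo "gcloud or PROJECTID not available; wrote $POLICY_FILE only"
fi
```

Keeping both conditions in one file avoids the shell quoting problem from the question, and the file can be compared against the JSON view of the console-created policy.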