From 9d653c4849a25eb272df7602c20a7a4241c12226 Mon Sep 17 00:00:00 2001 From: Ti Chi Robot Date: Tue, 11 Jul 2023 10:11:12 +0800 Subject: [PATCH] update ticdc alert rule (#13812) (#14192) --- ticdc/ticdc-alert-rules.md | 22 ++++------------------ 1 file changed, 4 insertions(+), 18 deletions(-) diff --git a/ticdc/ticdc-alert-rules.md b/ticdc/ticdc-alert-rules.md index 44d586fbce8f0..7e245b53e4c61 100644 --- a/ticdc/ticdc-alert-rules.md +++ b/ticdc/ticdc-alert-rules.md @@ -16,7 +16,7 @@ For critical alerts, you need to pay close attention to abnormal monitoring metr - Alert rule: - (time() - ticdc_processor_checkpoint_ts / 1000) > 600 + (time() - ticdc_owner_checkpoint_ts / 1000) > 600 - Description: @@ -30,7 +30,7 @@ For critical alerts, you need to pay close attention to abnormal monitoring metr - Alert rule: - (time() - ticdc_processor_resolved_ts / 1000) > 300 + (time() - ticdc_owner_resolved_ts / 1000) > 300 - Description: @@ -72,25 +72,11 @@ Warning alerts are a reminder for an issue or error. Collect TiCDC logs to locate the root cause. -### `ticdc_mounter_unmarshal_and_mount_time_more_than_1s` +### `cdc_sink_flush_duration_time_more_than_10s` - Alert rule: -`histogram_quantile(0.9, rate(ticdc_mounter_unmarshal_and_mount_bucket[1m])) * 1000 > 1000` - -- Description: - - It takes a replication task more than 1 second to unmarshal the data changes. - -- Solution: - - Collect TiCDC logs to locate the root cause. - -### `cdc_sink_execute_duration_time_more_than_10s` - -- Alert rule: - - `histogram_quantile(0.9, rate(ticdc_sink_txn_exec_duration_bucket[1m])) > 10` + `histogram_quantile(0.9, rate(ticdc_sink_txn_worker_flush_duration[1m])) > 10` - Description: