General Prometheus Alert Troubleshooting Topics
Alerts for PostgresqlFollowerReplicationLagSMA on sma-postgres-cluster pods with slot_name="permanent_physical_1" can be ignored. This replication slot is disabled and will be removed in a future release.
Alerts for PostgresqlHighRollbackRate on spire-postgres pods can be ignored. The alert is caused by an idle session that requires a timeout. This will be fixed in a future release.
Alerts for PostgresqlInactiveReplicationSlot on sma-postgres-cluster pods with slot_name="permanent_physical_1" can be ignored. This replication slot is disabled and will be removed in a future release.
Alerts for PostgresqlNotEnoughConnections for datname="foo" and datname="bar" can be ignored. These databases are not used and will be removed in a future release.
Alerts for CPUThrottlingHigh on gatekeeper-audit can be ignored. This pod is not utilized in this release.
Alerts for CPUThrottlingHigh on CFS services, such as cfs-batcher and cfs-trust, can be ignored. Because CFS is idle most of the time, these services have low CPU requests, and it is normal for their CPU usage to spike when CFS is in use.
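The rules above can be sketched as a small filter for triaging a list of firing alerts. This is only an illustrative sketch, not part of the product: it assumes each alert is available as a dictionary of labels, and that the labels are named alertname, pod, slot_name, and datname. The helper name is_ignorable and the pod-name patterns are hypothetical; adjust them to match the labels actually present on the system.

```python
import re

# Known ignorable alerts, taken from the list above.
# ASSUMPTION: the label names "pod", "slot_name", and "datname" and the
# pod-name patterns are illustrative; verify them against real alert labels.
IGNORABLE_RULES = [
    {"alertname": "PostgresqlFollowerReplicationLagSMA",
     "labels": {"pod": r"sma-postgres-cluster.*",
                "slot_name": r"permanent_physical_1"}},
    {"alertname": "PostgresqlHighRollbackRate",
     "labels": {"pod": r"spire-postgres.*"}},
    {"alertname": "PostgresqlInactiveReplicationSlot",
     "labels": {"pod": r"sma-postgres-cluster.*",
                "slot_name": r"permanent_physical_1"}},
    {"alertname": "PostgresqlNotEnoughConnections",
     "labels": {"datname": r"foo|bar"}},
    {"alertname": "CPUThrottlingHigh",
     "labels": {"pod": r"gatekeeper-audit.*"}},
    # Only the two CFS services named above are listed here.
    {"alertname": "CPUThrottlingHigh",
     "labels": {"pod": r"cfs-(batcher|trust).*"}},
]


def is_ignorable(alert_labels):
    """Return True if the alert's labels match one of the known ignorable cases."""
    name = alert_labels.get("alertname")
    for rule in IGNORABLE_RULES:
        if rule["alertname"] != name:
            continue
        # Every label pattern in the rule must match the whole label value.
        if all(re.fullmatch(pattern, alert_labels.get(label, ""))
               for label, pattern in rule["labels"].items()):
            return True
    return False
```

For example, is_ignorable({"alertname": "CPUThrottlingHigh", "pod": "cfs-batcher-abc"}) matches the last rule, while the same alert on an unrelated pod does not match any rule and should still be investigated.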