Check for any etcd cluster alarms and clear them as needed. An etcd cluster alarm must be manually cleared.
For example, a cluster’s database NOSPACE
alarm is set when database storage space is no longer available. A subsequent defrag may free up database storage space, but writes to the database will continue to fail while the NOSPACE
alarm is set.
Check for etcd cluster alarms.
An empty list will be returned if no alarms are set.
Check if any etcd alarms are set for etcd clusters in the services namespace.
ncn-mw# for pod in $(kubectl get pods -l etcd_cluster=cray-bos-etcd \
-n services -o jsonpath='{.items[*].metadata.name}')
do
echo "### ${pod} Alarms Set: ###"
kubectl -n services exec ${pod} -c etcd -- /bin/sh -c "ETCDCTL_API=3 etcdctl alarm list"
done
Example output:
### cray-bos-etcd-7cxq6qrhz5 Alarms Set: ###
### cray-bos-etcd-b9m4k5qfrd Alarms Set: ###
### cray-bos-etcd-tnpv8x6cxv Alarms Set: ###
### cray-bss-etcd-q4k54rbbfj Alarms Set: ###
### cray-bss-etcd-r75mlv6ffd Alarms Set: ###
### cray-bss-etcd-xprv5ht5d4 Alarms Set: ###
### cray-cps-etcd-8hpztfkjdp Alarms Set: ###
### cray-cps-etcd-fp4kfsf799 Alarms Set: ###
### cray-cps-etcd-g6gz9vmmdn Alarms Set: ###
### cray-crus-etcd-6z9zskl6cr Alarms Set: ###
### cray-crus-etcd-krp255f97q Alarms Set: ###
### cray-crus-etcd-tpclqfln67 Alarms Set: ###
### cray-externaldns-etcd-2vnb5t4657 Alarms Set: ###
### cray-externaldns-etcd-sc4b88ptg2 Alarms Set: ###
### cray-externaldns-etcd-smhxd9mb8n Alarms Set: ###
### cray-fas-etcd-j9qmtrxnhh Alarms Set: ###
### cray-fas-etcd-w8xl7vbn84 Alarms Set: ###
### cray-fas-etcd-zr2vnvhdwk Alarms Set: ###
### cray-hbtd-etcd-jcxl65xwwd Alarms Set: ###
### cray-hbtd-etcd-rpwx7qdtxb Alarms Set: ###
### cray-hbtd-etcd-vswmwrmhpl Alarms Set: ###
### cray-hmnfd-etcd-2rpvswtpd2 Alarms Set: ###
### cray-hmnfd-etcd-6pm4tm5d6x Alarms Set: ###
### cray-hmnfd-etcd-776b2g5d4l Alarms Set: ###
### cray-reds-etcd-m8wgp24k9p Alarms Set: ###
### cray-reds-etcd-wghvvfbnjp Alarms Set: ###
### cray-reds-etcd-zpzw8mpkfk Alarms Set: ###
### cray-uas-mgr-etcd-4xq5swfsr2 Alarms Set: ###
### cray-uas-mgr-etcd-kfd64zwpbz Alarms Set: ###
### cray-uas-mgr-etcd-nmqkdh8n2d Alarms Set: ###
Check if any etcd alarms are set for a particular etcd cluster in the services namespace.
ncn-mw# for pod in $(kubectl get pods -l etcd_cluster=cray-bos-etcd \
-n services -o jsonpath='{.items[*].metadata.name}')
do
echo "### ${pod} Alarms Set: ###"
kubectl -n services exec ${pod} -c etcd -- /bin/sh -c "ETCDCTL_API=3 etcdctl alarm list"
done
Example output:
### cray-bos-etcd-7cxq6qrhz5 Alarms Set: ###
### cray-bos-etcd-b9m4k5qfrd Alarms Set: ###
### cray-bos-etcd-tnpv8x6cxv Alarms Set: ###
Clear any etcd cluster alarms.
A list of disarmed alarms will be returned. An empty list is returned if no alarms were set.
Clear all etcd alarms set in etcd clusters.
ncn-mw# for pod in $(kubectl get pods -l app=etcd -n services \
-o jsonpath='{.items[*].metadata.name}')
do
echo "### ${pod} Disarmed Alarms: ###"
kubectl -n services exec ${pod} -c etcd -- /bin/sh -c "ETCDCTL_API=3 etcdctl alarm disarm"
done
Example output:
### cray-bos-etcd-7cxq6qrhz5 Disarmed Alarms: ###
### cray-bos-etcd-b9m4k5qfrd Disarmed Alarms: ###
### cray-bos-etcd-tnpv8x6cxv Disarmed Alarms: ###
### cray-bss-etcd-q4k54rbbfj Disarmed Alarms: ###
### cray-bss-etcd-r75mlv6ffd Disarmed Alarms: ###
### cray-bss-etcd-xprv5ht5d4 Disarmed Alarms: ###
### cray-cps-etcd-8hpztfkjdp Disarmed Alarms: ###
### cray-cps-etcd-fp4kfsf799 Disarmed Alarms: ###
### cray-cps-etcd-g6gz9vmmdn Disarmed Alarms: ###
### cray-crus-etcd-6z9zskl6cr Disarmed Alarms: ###
### cray-crus-etcd-krp255f97q Disarmed Alarms: ###
### cray-crus-etcd-tpclqfln67 Disarmed Alarms: ###
### cray-externaldns-etcd-2vnb5t4657 Disarmed Alarms: ###
### cray-externaldns-etcd-sc4b88ptg2 Disarmed Alarms: ###
### cray-externaldns-etcd-smhxd9mb8n Disarmed Alarms: ###
### cray-fas-etcd-j9qmtrxnhh Disarmed Alarms: ###
### cray-fas-etcd-w8xl7vbn84 Disarmed Alarms: ###
### cray-fas-etcd-zr2vnvhdwk Disarmed Alarms: ###
### cray-hbtd-etcd-jcxl65xwwd Disarmed Alarms: ###
### cray-hbtd-etcd-rpwx7qdtxb Disarmed Alarms: ###
### cray-hbtd-etcd-vswmwrmhpl Disarmed Alarms: ###
### cray-hmnfd-etcd-2rpvswtpd2 Disarmed Alarms: ###
### cray-hmnfd-etcd-6pm4tm5d6x Disarmed Alarms: ###
### cray-hmnfd-etcd-776b2g5d4l Disarmed Alarms: ###
### cray-reds-etcd-m8wgp24k9p Disarmed Alarms: ###
### cray-reds-etcd-wghvvfbnjp Disarmed Alarms: ###
### cray-reds-etcd-zpzw8mpkfk Disarmed Alarms: ###
### cray-uas-mgr-etcd-4xq5swfsr2 Disarmed Alarms: ###
### cray-uas-mgr-etcd-kfd64zwpbz Disarmed Alarms: ###
### cray-uas-mgr-etcd-nmqkdh8n2d Disarmed Alarms: ###
Clear all alarms in one particular etcd cluster.
ncn-mw# for pod in $(kubectl get pods -l etcd_cluster=cray-bos-etcd \
-n services -o jsonpath='{.items[*].metadata.name}')
do
echo "### ${pod} Disarmed Alarms: ###"
kubectl -n services exec ${pod} -c etcd -- /bin/sh -c "ETCDCTL_API=3 etcdctl alarm disarm"
done
Example output:
### cray-bos-etcd-7cxq6qrhz5 Disarmed Alarms: ###
memberID:14039380531903955557 alarm:NOSPACE
memberID:10060051157615504224 alarm:NOSPACE
memberID:9418794810465807950 alarm:NOSPACE
### cray-bos-etcd-b9m4k5qfrd Disarmed Alarms: ###
### cray-bos-etcd-tnpv8x6cxv Disarmed Alarms: ###