I have a 3-node Sensu Go HA cluster running as a Docker service.
The cluster is running, but it's very unstable. It frequently goes down with a lot of etcd errors such as:
"error":"internal error: etcdserver: request timed out"
It would be great to know if anyone has managed to get a stable cluster.
Hey!
etcd can be very sensitive to disk latency, so you might need to change your provisioning to increase CPU or IOPS.
We have a section in our troubleshooting documentation that covers etcd tuning and might help:
https://docs.sensu.io/sensu-go/latest/operations/maintain-sensu/troubleshoot/#symptoms-of-poor-performance
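For a rough first check of whether disk latency is the culprit, something like the script below can time fsync calls on the volume that holds the etcd data. This is only a sketch, not a Sensu or etcd tool: the function names, the 2 KiB write size and the sample count are my own assumptions, and the 10 ms threshold is the general etcd guidance for 99th percentile WAL fsync duration. Pass the directory to test as the first argument (on a default install the embedded etcd data usually lives under /var/lib/sensu/sensu-backend/etcd, but check your own state-dir setting).

```python
import math
import os
import sys
import tempfile
import time


def measure_fsync_latencies(directory, writes=200, block_size=2048):
    """Write small blocks to a temp file in `directory`, fsync after each
    write, and return the fsync latencies in seconds, sorted ascending."""
    latencies = []
    payload = b"\0" * block_size
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        for _ in range(writes):
            os.write(fd, payload)
            start = time.perf_counter()
            os.fsync(fd)  # force the write to disk, as etcd does for its WAL
            latencies.append(time.perf_counter() - start)
    finally:
        os.close(fd)
        os.remove(path)
    return sorted(latencies)


def percentile(sorted_values, pct):
    """Nearest-rank percentile of an already sorted, non-empty list."""
    if not sorted_values:
        raise ValueError("no samples")
    rank = max(1, math.ceil(pct / 100 * len(sorted_values)))
    return sorted_values[rank - 1]


if __name__ == "__main__":
    # Assumption: the directory to test is given on the command line;
    # it defaults to the current directory.
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    samples = measure_fsync_latencies(target)
    p99_ms = percentile(samples, 99) * 1000
    print(f"fsync p99 on {target}: {p99_ms:.2f} ms")
    if p99_ms > 10:
        print("p99 is above 10 ms; the disk may be too slow for etcd")
```

Run it on each of the three nodes while the cluster is under normal load. A quiet disk can look fine in isolation, so treat a good result as a hint rather than proof.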
@Rehan_Ch hi there, is this question about the same matter as this GitHub issue?
Any additional details you can share here or in that GitHub issue (perhaps following the guide @jspaleta mentioned above) would be greatly appreciated.
@calebhailey yes, this question is about the same matter as this GitHub issue.