Using a main agent for check running and others as fallback

Eldingsson · March 16, 2021, 1:30pm

Is there any way I can configure a check to be run by one agent every time and fallback to other agents only if the “master” is down? For example, say I’m trying to monitor which pods in a Kubernetes cluster are not running and send the result to Telegram. I have 3 nodes that can do that but I don’t want all 3 of them to run the check and receive 3 messages from 3 different nodes on Telegram. Instead, I would like to have one agent always monitoring these pods and only use one of the remaining agents if the “master” aka node one becomes unavailable.
Thanks in advance!

calebhailey · March 16, 2021, 3:47pm

Hey @Eldingsson - great question! What you’re looking for is called round robin check scheduling.

https://docs.sensu.io/sensu-go/latest/observability-pipeline/observe-schedule/checks/#round-robin-checks

It doesn’t work exactly how you described (it’s not an IFTTT style scheduler), but it does guarantee that only one agent out of pool of agents will execute a check.

I hope this helps!

Eldingsson · March 16, 2021, 5:16pm

Thanks for your reply @calebhailey that is exactly what I am using at the moment but I will still get messages on my Telegram if the problem isn’t solved when the next agent performs the check as it will see that the last time it ran the check the status was OK, which will be allowed on my filter and trigger a message to be sent to Telegram.
So there’s no such thing that makes the check work exactly as I tried to describe?

calebhailey · March 16, 2021, 9:17pm

I’m not sure if I’m following. Round robin checks are best used with the proxy_entity_name attribute, prompting Sensu to associate the check result with a common entity rather than the agent entity which executed the check.

See here for more information:

https://docs.sensu.io/sensu-go/latest/observability-pipeline/observe-schedule/checks/#proxy-entity-name-attribute

Does that help?

Topic		Replies	Views
Round robin check: reporting agent? Sensu Go	2	453	June 29, 2020
Sensu checks are running intermittently , is not running as scheduled Sensu Go	2	165	April 9, 2024
Execute single check across multiple agents Sensu Go	2	238	June 24, 2022
Can Sensu run a check on only one subscriber in a group? Sensu Classic (EOL)	4	481	September 14, 2015
Sensu Go and spotty connection to agents Sensu Go	2	419	May 29, 2020

Using a main agent for check running and others as fallback

Related topics