cannot get rid of keepalive's

Stuart_Cracraft · May 17, 2016, 5:13pm

Hi, I have keepalives configured on all my nodes on one Sensu server.

They used to be pointing at the other Sensu server.

My problem is that even though I’ve deleted the nodes from the old Sensu server through the API, they keep getting rediscovered and reported as keepalive failures.

Has anyone seen this?

Kyle_Anderson · May 18, 2016, 2:14am

Can you confirm there is no shared RabbitMQ or Redis between the new and old servers?
Also, when you did the move, is it possible there is some stale DNS?
(Most processes, Sensu included, don't re-resolve and reconnect on their own; you
have to restart the Sensu clients to get them to point to a new RabbitMQ,
etc.)
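
One way to check the "no shared RabbitMQ or Redis" question is to compare the connection settings in the two servers' Sensu configs. The following is only a rough sketch, not something from this thread: it assumes the usual `rabbitmq` and `redis` sections with `host` and `port` keys, assumes the stock default ports, and the function names are made up for illustration. Loopback hosts are skipped because `127.0.0.1` on two different machines is not the same service.

```python
import json

# Assumed defaults when a section omits host/port (stock RabbitMQ and Redis ports).
DEFAULT_PORTS = {"rabbitmq": 5672, "redis": 6379}
LOOPBACK = {"localhost", "127.0.0.1", "::1"}


def load_config(path):
    """Read one Sensu JSON config file into a dict."""
    with open(path) as fh:
        return json.load(fh)


def endpoints(config):
    """Collect (service, host, port) tuples from a Sensu config dict."""
    found = set()
    for service, default_port in DEFAULT_PORTS.items():
        section = config.get(service)
        if section is None:
            continue
        # A section may be a single object or a list of brokers; handle both.
        entries = section if isinstance(section, list) else [section]
        for entry in entries:
            host = entry.get("host", "127.0.0.1")
            port = entry.get("port", default_port)
            found.add((service, host, port))
    return found


def shared_endpoints(old_config, new_config):
    """Return non-loopback endpoints that appear in both configs."""
    common = endpoints(old_config) & endpoints(new_config)
    return sorted(e for e in common if e[1] not in LOOPBACK)
```

If `shared_endpoints(load_config(old_path), load_config(new_path))` comes back non-empty, the two servers are talking to the same broker or data store, and a client deleted on one side can be re-created by the other.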

Stuart_Cracraft1 · May 18, 2016, 2:19am

No shared RabbitMQ. One in each server.

No shared Redis either. We use RabbitMQ.

Also restarted Sensu client after API node delete.

Bizarre.

Kyle_Anderson · May 18, 2016, 2:24am

Are you really sure you don't use Redis? It is the only supported
data store for Sensu.
https://sensuapp.org/docs/latest/data-store

I think it is very likely these two servers are sharing a Redis instance.

Well, keep in mind that the Sensu client is pretty much always
re-registering itself, so if you restart the client *after* you delete
it from the API, it may re-register during that race.
You should probably restart the client first, allow it to connect to
the new RabbitMQ, and *then* delete the client from the old server.
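
The ordering above (restart first, delete second) can be sketched as a small script against the old server's API. This is an illustration under assumptions, not a tested procedure: it assumes the Sensu API exposes `GET /clients/:name` with a `timestamp` field for the last keepalive and `DELETE /clients/:name`, that the API needs no authentication, and that 180 seconds of silence is long enough to conclude the client has moved. The helper names and the `api_url` value are hypothetical.

```python
import json
import time
import urllib.request


def last_keepalive_age(client, now=None):
    """Seconds since the old server last recorded a keepalive for this client."""
    now = time.time() if now is None else now
    return now - client["timestamp"]


def safe_to_delete(client, min_silence=180, now=None):
    """True once the client has been silent on the old server for min_silence seconds."""
    return last_keepalive_age(client, now) >= min_silence


def get_client(api_url, name):
    """Fetch the client record from the old server's API."""
    with urllib.request.urlopen(f"{api_url}/clients/{name}") as resp:
        return json.load(resp)


def delete_client(api_url, name):
    """Ask the old server to drop the client; returns the HTTP status code."""
    req = urllib.request.Request(f"{api_url}/clients/{name}", method="DELETE")
    with urllib.request.urlopen(req) as resp:
        return resp.status


def retire_client(api_url, name, min_silence=180):
    """Delete the client only if it has already stopped checking in."""
    client = get_client(api_url, name)
    if not safe_to_delete(client, min_silence):
        return False  # still sending keepalives here; deleting now would just race
    delete_client(api_url, name)
    return True
```

Run something like `retire_client("http://old-sensu.example.com:4567", "web01")` only after the client has been restarted against the new RabbitMQ. If it returns `False`, the old server is still receiving keepalives from that node, which points back at a shared transport or data store.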

Stuart_Cracraft1 · May 18, 2016, 3:04am

Okay, thanks, I’ll look into it tomorrow.

Currently involved in, ironically, a completely separate non-Sensu Redis outage.

For now, the Sensu keepalives are filtered and sequestered away from people.
