Do not finish execution when gRPC stream closed #89

sfairat15 · 2023-04-27T11:53:10Z

Motivation

We have Falco installation at out k8s cluster. When we update Falco configs (customRules, for example), then all its DaemonSet pods recreating.

But at the same time all falco-exporters containers start restarting, because of gRPC stream closed reason.

These massive container restarts raise alert "Too many container restarts" at our monitoring.

Feature

May be you can add some retry logic for grpc reconnect instead of application exit? Thanks.

Additional context

Some falco-exporter container setup:

  containers:
  - args:
    - /usr/bin/falco-exporter
    - --client-socket=unix:///run/falco/falco.sock
    - --timeout=5m
    - --listen-address=0.0.0.0:9376
    image: docker.io/falcosecurity/falco-exporter:0.8.2

Falco-exporter log:

> kubectl logs --previous falco-exporter-z64vp
2023/04/12 07:56:43 connecting to gRPC server at unix:///run/falco/falco.sock (timeout 5m0s)
2023/04/12 07:56:43 listening on http://0.0.0.0:9376/metrics
2023/04/12 07:56:46 connected to gRPC server, subscribing events stream
2023/04/12 07:56:46 ready
2023/04/27 11:23:48 gRPC stream closed

The text was updated successfully, but these errors were encountered:

leogr · 2023-05-08T13:25:53Z

Hey @sfairat15

Interesting. We can let falco-exporter try to reconnect by itself (ie. without exiting) using the already implemented connection backoff mechanism. Would it be enough?

PS
I'm not sure if this can create side-effects during normal shutdown operations (likely not, I need to check) 🤔

poiana · 2023-08-06T13:32:41Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2023-08-10T11:16:56Z

/remove-lifecycle stale
/help

poiana · 2023-08-10T11:16:57Z

@leogr:
This request has been marked as needing help from a contributor.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

In response to this:

/remove-lifecycle stale
/help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

poiana · 2023-11-08T15:46:14Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2023-11-08T16:17:29Z

/remove-lifecycle stale

poiana · 2024-02-06T21:48:46Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

leogr · 2024-02-08T15:22:45Z

/remove-lifecycle stale

poiana · 2024-05-08T15:52:40Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

poiana · 2024-06-07T15:53:32Z

Stale issues rot after 30d of inactivity.

Mark the issue as fresh with /remove-lifecycle rotten.

Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle rotten

leogr · 2024-06-11T12:47:20Z

/remove-lifecycle rotten

poiana · 2024-09-09T16:10:35Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

poiana · 2024-10-09T16:11:18Z

Stale issues rot after 30d of inactivity.

Mark the issue as fresh with /remove-lifecycle rotten.

Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle rotten

sfairat15 added the kind/feature New feature or request label Apr 27, 2023

poiana added the lifecycle/stale label Aug 6, 2023

poiana added help wanted Extra attention is needed and removed lifecycle/stale labels Aug 10, 2023

poiana added the lifecycle/stale label Nov 8, 2023

poiana removed the lifecycle/stale label Nov 8, 2023

poiana added the lifecycle/stale label Feb 6, 2024

poiana removed the lifecycle/stale label Feb 8, 2024

poiana added the lifecycle/stale label May 8, 2024

poiana added lifecycle/rotten and removed lifecycle/stale labels Jun 7, 2024

poiana removed the lifecycle/rotten label Jun 11, 2024

poiana added the lifecycle/stale label Sep 9, 2024

poiana added lifecycle/rotten and removed lifecycle/stale labels Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not finish execution when gRPC stream closed #89

Do not finish execution when gRPC stream closed #89

sfairat15 commented Apr 27, 2023

leogr commented May 8, 2023

poiana commented Aug 6, 2023

leogr commented Aug 10, 2023

poiana commented Aug 10, 2023

poiana commented Nov 8, 2023

leogr commented Nov 8, 2023

poiana commented Feb 6, 2024

leogr commented Feb 8, 2024

poiana commented May 8, 2024

poiana commented Jun 7, 2024

leogr commented Jun 11, 2024

poiana commented Sep 9, 2024

poiana commented Oct 9, 2024

Do not finish execution when gRPC stream closed #89

Do not finish execution when gRPC stream closed #89

Comments

sfairat15 commented Apr 27, 2023

leogr commented May 8, 2023

poiana commented Aug 6, 2023

leogr commented Aug 10, 2023

poiana commented Aug 10, 2023

poiana commented Nov 8, 2023

leogr commented Nov 8, 2023

poiana commented Feb 6, 2024

leogr commented Feb 8, 2024

poiana commented May 8, 2024

poiana commented Jun 7, 2024

leogr commented Jun 11, 2024

poiana commented Sep 9, 2024

poiana commented Oct 9, 2024