Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bool_gauge SLO's budget is burning continouosly #1231

Open
mdarii opened this issue Jul 30, 2024 · 0 comments
Open

bool_gauge SLO's budget is burning continouosly #1231

mdarii opened this issue Jul 30, 2024 · 0 comments

Comments

@mdarii
Copy link

mdarii commented Jul 30, 2024

I'm trying to define SLO to monitor the website availability. SLO relays on the prometheus blackbox-exporter metric: probe_success. Here is the SLO definition:

apiVersion: pyrra.dev/v1alpha1
kind: ServiceLevelObjective
metadata:
  annotations:
    pyrra.dev/description: 'TODO: Define the runbook for this SLO'
    pyrra.dev/summary: Website availability is below 99.999% 
  labels:
    pyrra.dev/team: ops
  name: availability
spec:
  alerting:
    absent: true
    burnrates: true
    disabled: true
  description: Website should have 99.999% availability
  indicator:
    bool_gauge:
      grouping:
      - cluster
      metric: probe_success{instance="https://example.com"}
  target: "99.999"
  window: 1w

After creating of the SLO, the error budget is continuously decreasing, but there's no errors(all blackbox checks were succesfull)
image
Could be that there's an error in the logic how the error budget is calculated?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant