Skip to content

Generic latency slo#44

Merged
metalmatze merged 3 commits intometalmatze:masterfrom
brancz:generic-latency-slo
Jan 6, 2021
Merged

Generic latency slo#44
metalmatze merged 3 commits intometalmatze:masterfrom
brancz:generic-latency-slo

Conversation

@brancz
Copy link
Copy Markdown
Contributor

@brancz brancz commented Dec 29, 2020

This PR has 3 changes:

  • Introduce notErrorSelector

The notErrorSelector allows using the latency burn function for metrics
other than those that have errors defined as not 5xx.

  • Template latency rules with named fields …

Reading and understanding the code was very difficult as the parameters
had to be counted, now it's much easier to reason about.

  • Add note about requirement of histogram bucket

@metalmatze

If I'm not mistaken then this should largely solve #15

The notErrorSelector allows using the latency burn function for metrics
other than those that have errors defined as not 5xx.
Reading and understanding the code was very difficult as the parameters
had to be counted, now it's much easier to reason about.
@brancz
Copy link
Copy Markdown
Contributor Author

brancz commented Dec 29, 2020

Regarding the last one, I can't help but wonder that there must be a better way to do this than requiring the exact histogram bucket to be present.

@brancz
Copy link
Copy Markdown
Contributor Author

brancz commented Dec 29, 2020

Somewhat related, is it really correct to compare non-errors with all requests here?

Isn't the point that we exclude any sort of errors?

@brancz brancz mentioned this pull request Dec 30, 2020
Copy link
Copy Markdown
Owner

@metalmatze metalmatze left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Simple and straight forward! 💯
Thanks!

@metalmatze metalmatze merged commit e5c2758 into metalmatze:master Jan 6, 2021
@metalmatze
Copy link
Copy Markdown
Owner

Regarding the last one, I can't help but wonder that there must be a better way to do this than requiring the exact histogram bucket to be present.

That's probably something that should be taken care of outside of this slo-libsonnet? Something generating rules with this library can check for different buckets and then suggest them.

@brancz brancz deleted the generic-latency-slo branch January 6, 2021 12:40
@brancz
Copy link
Copy Markdown
Contributor Author

brancz commented Jan 6, 2021

I think that would help, but if it could be generic to the chosen buckets (and there are investigations of dynamic bucketing) then that would be even better. I'll open an issue to investigate and see if we could find something.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants