-
Notifications
You must be signed in to change notification settings - Fork 8.6k
Regularly benchmarking and stress-testing the alerting framework and rule types #119845
Copy link
Copy link
Open
Labels
Feature:Alerting/RuleTypesIssues related to specific Alerting Rules TypesIssues related to specific Alerting Rules TypesFeature:Alerting/RulesFrameworkIssues related to the Alerting Rules FrameworkIssues related to the Alerting Rules FrameworkMetaTeam:ResponseOpsPlatform ResponseOps team (formerly the Cases and Alerting teams) t//Platform ResponseOps team (formerly the Cases and Alerting teams) t//estimate:needs-researchEstimated as too large and requires research to break down into workable issuesEstimated as too large and requires research to break down into workable issues
Metadata
Metadata
Assignees
Labels
Feature:Alerting/RuleTypesIssues related to specific Alerting Rules TypesIssues related to specific Alerting Rules TypesFeature:Alerting/RulesFrameworkIssues related to the Alerting Rules FrameworkIssues related to the Alerting Rules FrameworkMetaTeam:ResponseOpsPlatform ResponseOps team (formerly the Cases and Alerting teams) t//Platform ResponseOps team (formerly the Cases and Alerting teams) t//estimate:needs-researchEstimated as too large and requires research to break down into workable issuesEstimated as too large and requires research to break down into workable issues
Type
Fields
Give feedbackNo fields configured for issues without a type.
The alerting system must be regularly benchmarked and stress-tested before every production release. Preferably mirroring known complex customer environments. This ensures we do not introduce any regressions by benchmarking and comparing key health metrics.
There are various ongoing performance testing & framework / tool creation efforts that relate to Kibana, some research has been done to ensure the pros/cons and applicability of each so we can invest where we see the best value proposition balanced with quickest roi we can get. As research continues it seems clear we'll plan to extend one or more tools or frameworks into a given solution. So, while we may start with one tool as an incremental first-step or as a starting point, we're developing this to a set of requirements, foremost.
Front-runner for starting-point tool/library: The Kibana Alerting team / ResponseOps kbn-alert-load Alert / Rule testing tool
... see below for options that were declined for now.
Here are some of the WIP Requirements we are evaluating and building out:
Stretch / next goals:
FYI: Frameworks/Tools that have been researched and ruled out for immediate purposes:
Kibana-QA team created an API load testing tool - kibana-load-testing. It was researched by Patrick M in 2020 and Alert/Rules team did not end up collaborating on it, it uses the Kibana HTTP API and so isn't best suited to assess the (background process) Task Manager at the moment
Kibana Working group's coming tool - (including folks like Spencer A / Tyler S / Daniel M / Liza K - they are discussing and working on a performance testing tool and CI integration for Kibana needs.