[Pull-based Ingestion] Add support for dynamically updating ingestion error handling strategy with minor fixes#17565
Conversation
|
❌ Gradle check result for 92b576e: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
92b576e to
9f94093
Compare
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #17565 +/- ##
============================================
+ Coverage 72.27% 72.43% +0.15%
- Complexity 65611 65696 +85
============================================
Files 5311 5311
Lines 304942 304975 +33
Branches 44225 44229 +4
============================================
+ Hits 220407 220912 +505
+ Misses 66448 65918 -530
- Partials 18087 18145 +58 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
Would it be better to keep retrying indefinitely at some fixed rate? It can be hard to determine if errors are transient or not, and in this case if you enter the paused state because of a transient failure you'll just be stuck until something intervenes, right? |
server/src/main/java/org/opensearch/indices/pollingingest/DefaultStreamPoller.java
Outdated
Show resolved
Hide resolved
33fee6a to
e6e34ed
Compare
Good point, as discussed today, updated to indefinitely retry and skip only after user switches to a DROP policy. |
|
❌ Gradle check result for e6e34ed: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
e6e34ed to
5c0788c
Compare
|
❌ Gradle check result for 5c0788c: null Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
5c0788c to
589c235
Compare
|
❌ Gradle check result for 12bbe28: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com>
Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com>
Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com>
12bbe28 to
4399432
Compare
|
❕ Gradle check result for 4399432: UNSTABLE
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure. |
|
LGTM |
… error handling strategy with minor fixes (opensearch-project#17565) * Fix global checkpoint for p2p segrep in ingestion mode Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> * Support updating ingestion error strategy Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> * Handle race condition on calling flush before poller is initialized Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> --------- Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> Signed-off-by: Vinay Krishna Pudyodu <vinkrish.neo@gmail.com>
Description
This PR forms the base on which subsequent PRs will build on for adding pause/resume APIs.
Related Issues
Resolves part of #17442. Subsequent PRs will add pause/resume APIs.
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.