-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add retry policy to wait for job id to persist during rebalancing #13372
Add retry policy to wait for job id to persist during rebalancing #13372
Conversation
@Jackie-Jiang mind approving the workflow here when you have a chance |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM otherwise
...oller/src/main/java/org/apache/pinot/controller/api/resources/PinotTableRestletResource.java
Outdated
Show resolved
Hide resolved
...oller/src/main/java/org/apache/pinot/controller/api/resources/PinotTableRestletResource.java
Outdated
Show resolved
Hide resolved
...oller/src/main/java/org/apache/pinot/controller/api/resources/PinotTableRestletResource.java
Outdated
Show resolved
Hide resolved
...oller/src/main/java/org/apache/pinot/controller/api/resources/PinotTableRestletResource.java
Outdated
Show resolved
Hide resolved
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #13372 +/- ##
============================================
+ Coverage 61.75% 62.11% +0.36%
+ Complexity 207 198 -9
============================================
Files 2436 2548 +112
Lines 133233 139971 +6738
Branches 20636 21728 +1092
============================================
+ Hits 82274 86945 +4671
- Misses 44911 46436 +1525
- Partials 6048 6590 +542
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
…/resources/PinotTableRestletResource.java Co-authored-by: Xiaotian (Jackie) Jiang <[email protected]>
When rebalancing tables with a large number of segments, it takes a while for the jobId to persist in Zookeeper and polling for the jobId results in a ResourceNotFoundException, even though it would become available in a few seconds. This code aims to introduce a specific retry policy in order to counteract this. It forces a retry 3 times in order to check if the jobId has been found. If it still has not then the rebalance will fail.
Label: Bugfix