AhmedSoliman
Contributor

@AhmedSoliman AhmedSoliman commented Aug 1, 2025

This was not correct. is_alive() should return true only if the node is alive and not failing over. This fix improves the cluster controller's failover response time and allows it to fail over leader partitions as soon as it observes that a node shutdown has started.

// intentionally empty

Stack created with Sapling. Best reviewed with ReviewStack.
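
The intended semantics can be sketched as follows. This is a minimal illustration, not the code from this change: the `NodeState` enum, its variants, and the `needs_leader_failover` helper are hypothetical names chosen for the example, and the real types in the codebase may differ.

```rust
/// Hypothetical node states, for illustration only.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum NodeState {
    Alive,
    /// The node has started shutting down and is handing over its work.
    FailingOver,
    Dead,
}

impl NodeState {
    /// Returns true only for a node that is alive and not failing over.
    pub fn is_alive(self) -> bool {
        matches!(self, NodeState::Alive)
    }
}

/// Hypothetical controller-side check: a leader partition on a node that is
/// no longer alive (including one that is failing over) should be moved.
pub fn needs_leader_failover(state: NodeState) -> bool {
    !state.is_alive()
}
```

Under these assumptions, a node in `FailingOver` is treated like a dead node for leadership purposes, so the controller can react as soon as it observes that shutdown has begun rather than waiting for the node to disappear.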


github-actions bot commented Aug 1, 2025

Test Results

  7 files  ±0    7 suites  ±0   3m 37s ⏱️ - 1m 11s
 54 tests ±0   53 ✅ ±0  1 💤 ±0  0 ❌ ±0 
223 runs  ±0  220 ✅ ±0  3 💤 ±0  0 ❌ ±0 

Results for commit 3ba1c26. ± Comparison against base commit 742b77e.

♻️ This comment has been updated with latest results.

Contributor

@pcholakov pcholakov left a comment


I had seen this implementation before and wondered why the FailingOver state is included - thanks for updating this, @AhmedSoliman!

@AhmedSoliman AhmedSoliman merged commit 3ba1c26 into main Aug 4, 2025
27 checks passed
@AhmedSoliman AhmedSoliman deleted the pr3618 branch August 4, 2025 09:41
@github-actions github-actions bot locked and limited conversation to collaborators Aug 4, 2025