@hemanthsavasere
Contributor

Purpose

Linked issue: close #2372

Brief change log

Fix the unstable testListRebalanceProgress test case, which was failing intermittently.

Issue
FlinkProcedureITCase creates 10 tables in rapid succession while server 3 is marked PERMANENT_OFFLINE, so all replicas concentrate on servers 0, 1, and 2. The resulting load can push table initialization past the existing 60-second timeout of waitUntilTableReady(). The test then fails intermittently, with replicas reported in the NoneReplica state instead of the expected OnlineReplica state.

Fix

  1. Make waitUntilTableReady() timeout configurable (allowing tests to specify longer timeouts)
  2. Add 100ms delays between table creations to reduce coordinator lock contention
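
The two changes above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual code in this PR: `ReadinessWaiter`, `waitUntilReady`, and `createTablesWithDelay` are hypothetical names, and the readiness check is a plain `BooleanSupplier` standing in for whatever waitUntilTableReady() really inspects.

```java
import java.time.Duration;
import java.util.concurrent.TimeoutException;
import java.util.function.BooleanSupplier;
import java.util.function.IntConsumer;

/** Hypothetical helper for illustration only; not the project's test utility. */
final class ReadinessWaiter {

    /** Default of 60 seconds, matching the timeout described in the issue. */
    static final Duration DEFAULT_TIMEOUT = Duration.ofSeconds(60);

    private ReadinessWaiter() {}

    /** Keeps the old behavior for callers that do not need a longer timeout. */
    static void waitUntilReady(BooleanSupplier ready)
            throws InterruptedException, TimeoutException {
        waitUntilReady(ready, DEFAULT_TIMEOUT, Duration.ofMillis(100));
    }

    /** Polls the condition until it holds or the caller-supplied timeout expires. */
    static void waitUntilReady(BooleanSupplier ready, Duration timeout, Duration pollInterval)
            throws InterruptedException, TimeoutException {
        long deadline = System.nanoTime() + timeout.toNanos();
        while (!ready.getAsBoolean()) {
            // Compare via subtraction so the check stays correct if nanoTime wraps.
            if (System.nanoTime() - deadline >= 0) {
                throw new TimeoutException("Condition not met within " + timeout);
            }
            Thread.sleep(pollInterval.toMillis());
        }
    }

    /** Calls createTable for indexes 0..count-1, pausing between consecutive calls. */
    static void createTablesWithDelay(int count, Duration delay, IntConsumer createTable)
            throws InterruptedException {
        for (int i = 0; i < count; i++) {
            createTable.accept(i);
            if (i < count - 1) {
                Thread.sleep(delay.toMillis());
            }
        }
    }
}
```

Under these assumptions, a heavily loaded test would pass a longer timeout, for example `ReadinessWaiter.waitUntilReady(check, Duration.ofMinutes(3), Duration.ofMillis(100))`, while other callers keep the 60-second default through the single-argument overload.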

Tests

FlinkProcedureITCase.java has been fixed.

@hemanthsavasere
Contributor Author

Hi @luoyuxia,
Could you please review this PR?
Thanks

Development

Successfully merging this pull request may close these issues.

Unstable testListRebalanceProgress
