Skip to content

[CI] TransportClusterStateActionDisruptionIT testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata failing #127422

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
elasticsearchmachine opened this issue Apr 26, 2025 · 3 comments · Fixed by #127523
Assignees
Labels
:Distributed Coordination/Distributed A catch all label for anything in the Distributed Coordination area. Please avoid if you can. medium-risk An open issue or test failure that is a medium risk to future releases Team:Distributed Coordination Meta label for Distributed Coordination team >test-failure Triaged test failures from CI

Comments

@elasticsearchmachine
Copy link
Collaborator

Build Scans:

Reproduction Line:

./gradlew ":server:internalClusterTest" --tests "org.elasticsearch.action.admin.cluster.state.TransportClusterStateActionDisruptionIT.testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata" -Dtests.seed=21018C38C16F89F4 -Dtests.locale=en-PN -Dtests.timezone=MET -Druntime.java=24

Applicable branches:
main

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.RuntimeException: Can't get master node Cannot invoke "org.elasticsearch.cluster.node.DiscoveryNode.getName()" because the return value of "org.elasticsearch.cluster.node.DiscoveryNodes.getMasterNode()" is null

Issue Reasons:

  • [main] 2 failures in test testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata (0.8% fail rate in 259 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

@elasticsearchmachine elasticsearchmachine added :Distributed Indexing/Distributed A catch all label for anything in the Distributed Indexing Area. Please avoid if you can. >test-failure Triaged test failures from CI labels Apr 26, 2025
elasticsearchmachine added a commit that referenced this issue Apr 26, 2025
…teActionDisruptionIT testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata #127422
@elasticsearchmachine
Copy link
Collaborator Author

This has been muted on branch main

Mute Reasons:

  • [main] 2 failures in test testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata (0.8% fail rate in 259 executions)

Build Scans:

@elasticsearchmachine elasticsearchmachine added needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Distributed Indexing Meta label for Distributed Indexing team labels Apr 26, 2025
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-distributed-indexing (Team:Distributed Indexing)

benchaplin pushed a commit to benchaplin/elasticsearch that referenced this issue Apr 28, 2025
…teActionDisruptionIT testNonLocalRequestAlwaysFindsMasterAndWaitsForMetadata elastic#127422
@kingherc kingherc added Team:Distributed Coordination Meta label for Distributed Coordination team :Distributed Coordination/Distributed A catch all label for anything in the Distributed Coordination area. Please avoid if you can. and removed :Distributed Indexing/Distributed A catch all label for anything in the Distributed Indexing Area. Please avoid if you can. Team:Distributed Indexing Meta label for Distributed Indexing team labels Apr 29, 2025
@elasticsearchmachine
Copy link
Collaborator Author

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

@JeremyDahlgren JeremyDahlgren self-assigned this Apr 29, 2025
@JeremyDahlgren JeremyDahlgren added low-risk An open issue or test failure that is a low risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Apr 29, 2025
JeremyDahlgren added a commit to JeremyDahlgren/elasticsearch that referenced this issue Apr 29, 2025
Replaces the use of InternalTestCluster.getMasterName() with a
ClusterServiceUtils.addTemporaryStateListener() call that waits
for a new master node other than the previous master node.
InternalTestCluster.getMasterName() is not safe to use in
unstable clusters, per PR 127213.

Closes:
elastic#127466
elastic#127443
elastic#127424
elastic#127423
elastic#127422
@JeremyDahlgren JeremyDahlgren added medium-risk An open issue or test failure that is a medium risk to future releases and removed low-risk An open issue or test failure that is a low risk to future releases labels Apr 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Coordination/Distributed A catch all label for anything in the Distributed Coordination area. Please avoid if you can. medium-risk An open issue or test failure that is a medium risk to future releases Team:Distributed Coordination Meta label for Distributed Coordination team >test-failure Triaged test failures from CI
Projects
None yet
3 participants