kube SIGINT system test: fix race in timeout handling #24496

edsantiago · 2024-11-07T18:17:01Z

Up to now this test has been run using:

PODMAN_TIMEOUT=2 run_podman kube play ...

...and this gives podman time to start the pod before getting
the signal.

When run in parallel, under heavy load, the above command seems
to time out before podman has gotten its act together. Weird
things happen, like weird exit status and (most crucially)
zombie containers.

Solution: wait for container to actually start before we kill it.

Signed-off-by: Ed Santiago <santiago@redhat.com>
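
The idea of waiting for readiness before signaling, rather than relying on a fixed timeout, can be sketched as below. This is a minimal illustration, not the code from this PR: `wait_until` and `signal_when_ready` are hypothetical helper names, and the readiness check is whatever command the caller passes in.

```shell
# Sketch only: wait_until and signal_when_ready are hypothetical helpers,
# not functions from the podman system tests.

# Run COMMAND up to TRIES times, one second apart, until it succeeds.
# usage: wait_until TRIES COMMAND [ARGS...]
wait_until() {
    _wu_tries=$1
    shift
    while [ "$_wu_tries" -gt 0 ]; do
        if "$@"; then
            return 0
        fi
        _wu_tries=$((_wu_tries - 1))
        sleep 1
    done
    return 1
}

# Send SIGINT to PID, but only once the readiness check has passed.
# If the check never passes, no signal is sent and the function fails.
# usage: signal_when_ready PID TRIES COMMAND [ARGS...]
signal_when_ready() {
    _sw_pid=$1
    _sw_tries=$2
    shift 2
    if ! wait_until "$_sw_tries" "$@"; then
        echo "process $_sw_pid never became ready; not signaling" >&2
        return 1
    fi
    kill -INT "$_sw_pid"
}
```

In a test like this one, the command under test would be started in the background and its PID passed to `signal_when_ready` along with a check that the container is running (for instance, a `podman container inspect` query on the container state; that choice is an assumption here). The signal is then sent because the container has started, not because a timer happened to expire.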

None

edsantiago · 2024-11-07T18:17:35Z

cherrypicked from #23275 where it has been working flawlessly for weeks

Luap99

LGTM

openshift-ci · 2024-11-07T18:35:33Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: edsantiago, Luap99

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [Luap99,edsantiago]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mheon · 2024-11-07T20:02:51Z

/lgtm

openshift-ci bot added the release-note-none and approved labels Nov 7, 2024

Luap99 approved these changes Nov 7, 2024

openshift-ci bot assigned mheon Nov 7, 2024

openshift-ci bot added the lgtm label Nov 7, 2024

openshift-merge-bot bot merged commit b109a2b into containers:main Nov 7, 2024
51 of 53 checks passed

edsantiago deleted the sigint-flake branch November 11, 2024 13:13

stale-locking-app bot added the "locked - please file new issue/PR" label Feb 10, 2025

stale-locking-app bot locked as resolved and limited conversation to collaborators Feb 10, 2025
