Skip to content

Fix remote trigger event expiry logic #17775

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 20, 2025
Merged

Conversation

bolekk
Copy link
Contributor

@bolekk bolekk commented May 18, 2025

Each workflow node waits for at least F+1 identical copies of each trigger event before trusting it and starting workflow execution.

Event copies need to be clustered within a relatively short time window (default = 1 minute). Anything received after that is rejected. The problem is that we can't trust the first received event to establish the "creationTs" - if one of them is sent too early, all subsequent ones can be incorrectly rejected.

The proposed fix is to remove the check. We already rely on messageCache.Ready() to apply the time window when aggregating events. That time window is applied backwards after each new event is received so it has the desired effect.

@bolekk bolekk force-pushed the bugfix/remote_trigger_expiry branch from 2072e10 to 4bed6af Compare May 18, 2025 23:40
@bolekk bolekk changed the title Unit test for remote trigger event expiry logic Fix remote trigger event expiry logic May 18, 2025
@bolekk bolekk force-pushed the bugfix/remote_trigger_expiry branch from 4bed6af to 960fdf2 Compare May 18, 2025 23:46
@bolekk bolekk marked this pull request as ready for review May 19, 2025 19:17
@bolekk bolekk requested review from a team as code owners May 19, 2025 19:17
Copy link
Contributor

@krehermann krehermann left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please a changeset so the release team can easily track

@bolekk bolekk force-pushed the bugfix/remote_trigger_expiry branch from 960fdf2 to 75df981 Compare May 19, 2025 21:24
krehermann
krehermann previously approved these changes May 19, 2025
justinkaseman
justinkaseman previously approved these changes May 19, 2025
@bolekk bolekk dismissed stale reviews from justinkaseman and krehermann via 55ec76c May 20, 2025 00:28
@bolekk bolekk force-pushed the bugfix/remote_trigger_expiry branch from 75df981 to 55ec76c Compare May 20, 2025 00:28
@bolekk bolekk requested review from a team as code owners May 20, 2025 00:28
@justinkaseman justinkaseman self-requested a review May 20, 2025 00:29
@bolekk bolekk enabled auto-merge May 20, 2025 00:30
@bolekk bolekk added this pull request to the merge queue May 20, 2025
Merged via the queue into develop with commit 5256f32 May 20, 2025
117 of 119 checks passed
@bolekk bolekk deleted the bugfix/remote_trigger_expiry branch May 20, 2025 01:21
bolekk added a commit that referenced this pull request May 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants