
Conversation

@adamreichold

No description provided.

@ashvardanian
Owner

Hi @adamreichold! The proposed variant doesn't cover the synchronization between threads. Other threads need a way to know that the work has stopped.

There is probably a way to reduce the contention on the stop and generation, but it seems like this is not the way to achieve that 🤷

@adamreichold
Author

The proposed variant doesn't cover the synchronization between threads.

The synchronization happens with the same atomics that are used to submit any work item to the workers.

Other threads need a way to know that the work has stopped.

By broadcasting the stop_trampoline, each thread will execute it once and thereby set its local stop variable. As written above, this relies on the existing synchronization used during operation of the pool.
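For concreteness, here is a minimal sketch of the worker side of that idea; the names WorkItem and receive_work are hypothetical stand-ins for the pool's actual queue, not this crate's code. The only "stop" state is a local variable on the worker's stack, and the broadcast shutdown item is the only work item that ever sets it:

    // Hypothetical sketch: `WorkItem` and `receive_work` stand in for the
    // pool's real work queue. No shared atomic flag is read per work item;
    // the stop flag lives on the worker's own stack.
    struct WorkItem {
        run: Box<dyn FnOnce(usize, &mut bool) + Send>,
    }

    fn worker_loop(thread_index: usize, receive_work: impl Fn() -> WorkItem) {
        let mut stop = false;
        while !stop {
            let work = receive_work(); // relies on the pool's existing synchronization
            (work.run)(thread_index, &mut stop); // only the shutdown item sets `stop = true`
        }
    }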

@ashvardanian
Owner

I am not sure what "broadcasting" means? What kind of logic will be executed on the CPU - which instructions?

@ashvardanian changed the title from "Avoid one atomic access per work item by stopping workers from the inside out." to "Reduce stop Contention?" Jun 1, 2025
@ashvardanian changed the base branch from main to main-dev June 1, 2025 13:42
@ashvardanian marked this pull request as draft June 1, 2025 13:42
@adamreichold
Author

adamreichold commented Jun 1, 2025

The same as when any other work item/trampoline is submitted to the thread pool: the atomic instructions used to modify threads_to_sync and generation. I also think it might be best if you try the changes out to convince yourself that this works.

Maybe think of it this way: If you already have a method to run arbitrary closures on a given set of threads, do you really need additional shared state to tell these threads to exit?

Or if a reference to an authority helps: I did not come up with this. I think the first time I read about this method to handle shutting down a thread pool was in an old blog post by Herb Sutter. (Of course, I can't find it now...)

@adamreichold marked this pull request as ready for review June 1, 2025 13:42
@adamreichold changed the title from "Reduce stop Contention?" to "Avoid one atomic access per work item by stopping workers from the inside out." Jun 1, 2025
@adamreichold
Author

adamreichold commented Jun 1, 2025

I am not sure what "broadcasting" means?

It means what the existing method called "broadcast" does: ensure that a given closure is called exactly once by each worker thread in the pool. The only minor technicality is that we are not interested in the result and do not need to run the closure on the current thread, which is why the actual broadcast method isn't called here.

@ashvardanian
Owner

I think an example may help. We are tackling two different use-cases:

  • Having each thread decide its own completion time individually, vs.
  • Shutting down the thread pool.

The stop was used for the latter. And we don't need to pass an additional boolean into the function call, as any arbitrary lambda/closure context is propagated. So you can pass the thread_pool_t & into the lambda, and stop it from the inside. You can also pass std::atomic_flag or std::stop_token or any other primitive by reference - synchronized or not. Standard libraries often have too many pieces of functionality for that, and none of them are explicit about the underlying hardware features they leverage, so using them here is hard to justify 🤷
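In Rust terms, the first use-case already needs no pool-level support: the caller can own the flag and capture it in the closure. A hedged sketch with plain scoped threads standing in for the pool (none of these names are this library's API):

    use std::sync::atomic::{AtomicBool, Ordering};
    use std::thread;

    // Each thread decides its own completion time via a caller-owned flag
    // captured in the closure; no shutdown machinery in the pool is involved.
    fn main() {
        let found = AtomicBool::new(false);
        thread::scope(|scope| {
            let found = &found;
            for thread_index in 0..4 {
                scope.spawn(move || {
                    while !found.load(Ordering::Relaxed) {
                        // ... do a slice of the search on this thread ...
                        if thread_index == 0 {
                            // Placeholder: pretend worker 0 found the answer.
                            found.store(true, Ordering::Relaxed);
                        }
                    }
                });
            }
        });
    }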

@adamreichold
Author

The stop was used for the latter.

And the stop: &mut bool argument is also used for the latter. Only the internal work items submitted by the pool itself see this argument, and they use it to tell the worker that is currently executing them to stop. It affects only the current worker, but since the work item is broadcast, all workers will execute it eventually and exit. So this is a method to shut down the whole pool, just as with the stop atomic flag.

But the "message to stop", so to speak, is packaged into a closure and transferred using the existing work submission mechanism instead of using the out-of-band stop flag.

(This is also why the extra trampoline argument is required. The point of this is to avoid shared global state. It could also be a return value, but I figured this might be more costly if every work item has to produce it, compared to ignoring the argument. It could also be an array with one (non-atomic) flag per worker which is then indexed using the thread index, but that is just much more complicated than the on-stack state for no good reason.)

I think an example may help.

Every test case in this repository runs this code when it shuts down its thread pool. There is nothing extra to see here; this is just a more efficient mechanism to implement the existing shutdown semantics.
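To make that concrete, here is a self-contained Rust sketch of the shutdown mechanism under discussion, with plain std channels standing in for the pool's internal submission machinery (every name here is a stand-in, not this library's API). Ordinary work items ignore the stop argument; shutdown is just one more broadcast work item:

    use std::sync::mpsc;
    use std::thread;

    type Work = Box<dyn FnOnce(usize, &mut bool) + Send>;

    fn main() {
        // One channel per worker stands in for the pool's submission mechanism.
        let (senders, handles): (Vec<_>, Vec<_>) = (0..4usize)
            .map(|thread_index| {
                let (tx, rx) = mpsc::channel::<Work>();
                let handle = thread::spawn(move || {
                    let mut stop = false;
                    while !stop {
                        let work = rx.recv().unwrap();
                        work(thread_index, &mut stop);
                    }
                });
                (tx, handle)
            })
            .unzip();

        // Ordinary work items simply ignore the `stop` argument.
        for tx in &senders {
            tx.send(Box::new(|i: usize, _stop: &mut bool| {
                println!("hello from worker {}", i)
            }))
            .unwrap();
        }

        // "Broadcasting" the stop trampoline: every worker runs it once and exits.
        for tx in &senders {
            tx.send(Box::new(|_: usize, stop: &mut bool| *stop = true)).unwrap();
        }
        for handle in handles {
            handle.join().unwrap();
        }
    }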

@ashvardanian
Owner

Thanks for the explanations! We can have such logic in the lambda, but we also need a way to propagate the signal handlers and other interruptions originating outside of the loop. I'll come back to this in a couple of weeks and will try to incorporate your suggestions 🤗

@ashvardanian self-assigned this Jun 3, 2025
let context = inner.context();
unsafe {
-   trampoline(context, thread_index);
+   trampoline(context, thread_index, &mut stop);


Why use an out pointer instead of a return value?

Author


My initial idea was that the overhead might be lower as those work items which do not want to stop the pool (so basically all but one) can just ignore the additional argument (instead of producing a useless false as a return value). I should pull the initialization out of the loop though so that the worker loop does not pay the price of it for each work item...

I think there is an even nicer way which does not need to touch the trampoline signature at all but instead uses the identity of the stop_trampoline, cf. adamreichold/fork-join-scope@cdb63bf. But transplanting it here is not trivial because of the rules around function items/pointers and their comparison. So personally, I would prefer to first switch to using dyn Fn(usize) + Sync as the work items instead of manually splitting context and trampoline, before implementing that.
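For reference, a hedged sketch of the two signatures being weighed here; Context is a stand-in for the pool's erased context type, not the actual definition used by the crate:

    // Stand-in for the pool's type-erased context; not the actual definition.
    struct Context;

    // Out-pointer variant (as in this change): ordinary work items never touch `stop`.
    type TrampolineWithOutParam = unsafe fn(*const Context, usize, &mut bool);

    // Return-value alternative: every work item has to produce a bool,
    // even though only the shutdown item would ever return `true`.
    type TrampolineWithReturn = unsafe fn(*const Context, usize) -> bool;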

@chengts95

chengts95 commented Oct 15, 2025

I have tested the stop_trampoline solution in the new C++ thread pool. It works as intended. However, the improvement is hard to observe; maybe the benefit is small since the atomic read won't occur when the worker is busy.
By the way, I just wanted to say—I really appreciate the quality of your work. This library brings a refreshing clarity to how parallel computation can be done in Rust. Beautifully designed and very inspiring!

@ashvardanian changed the title from "Avoid one atomic access per work item by stopping workers from the inside out." to "Stoppable Submissions in Rust" Oct 19, 2025
@ashvardanian
Owner

Thanks for the suggestions, @adamreichold! And for the kind words, @chengts95 😊

I'll be adding early-stoppable parallel iterators in the upcoming v3, together with Zig support and other features. Let me know if you have any other ideas worth exploring 🤗

@chengts95

Hi. Maybe we can have an async parallel for and allow manually joining the threads.
