Updating the segmentation to use Cellpose SAM #101

edyoshikun · 2025-06-16T15:34:59Z

This PR updates the segmentation pipeline to use Cellpose-SAM.

ziw-liu · 2025-06-30T18:31:16Z

biahub/segment.py


+        # Reorder channels based on channels_for_segmentation
+        cellpose_czyx = np.zeros(
+            (3, *czyx_data_to_segment.shape[1:]), dtype=czyx_data_to_segment.dtype


Is this the output datatype? If the output is not stored in the same store as the input, it should always be integers whose width is determined automatically by cellpose. For example, storing uint32 in a float32 container is lossy.

The real solution here would be to save integer labels in a separate labels array in the zarr store. For now we save everything in one float32 array, which is not ideal. I'd suggest casting cellpose_czyx to float32 and leaving a note that we should fix that later.

I meant that if the output store is not the same as the input, it should be integers. There is no reason for a store that only has integer arrays to be initialized with the float32 data type.

And since we do a lot of 2D segmentation, they wouldn't be stored with the 3D input.

I can default it to uint32.
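To illustrate the lossiness point raised above, here is a minimal NumPy sketch. The label values are made up for illustration and do not come from this PR. float32 has a 24-bit significand, so integer IDs above 2**24 are not all exactly representable.

```python
import numpy as np

# Hypothetical label IDs chosen around the float32 precision limit.
labels = np.array([1, 2**24, 2**24 + 1, 2**32 - 1], dtype=np.uint32)

# Store the labels in a float32 container, as a shared float32 array would.
as_float = labels.astype(np.float32)

# Widen to uint64 when reading back so that values rounded up to 2**32
# stay visible instead of wrapping around.
round_trip = as_float.astype(np.uint64)

# True where the label survived the round trip unchanged.
lossless = labels.astype(np.uint64) == round_trip
print(lossless)  # [ True  True False False]
```

Small label IDs survive the round trip, but once a field of view has more than about 16.7 million labels, distinct IDs can collapse onto the same float32 value, which is the case for keeping labels in an integer array.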

ieivanov · 2025-07-15T16:51:08Z

@edyoshikun are you happy with this PR? Is it ready to merge? I understand this introduces a breaking change in the segmentation config, which we can coordinate with @tayllatheodoro. I think segmentation is currently not part of the processing pipeline, though it probably should be.

ieivanov · 2025-07-15T16:53:43Z

biahub/segment.py


    # Estimate resources
-    num_cpus, gb_ram_request = estimate_resources(shape=segmentation_shape, ram_multiplier=20)
+    num_cpus, gb_ram_request = estimate_resources(shape=segmentation_shape, ram_multiplier=10)


This seems like a lot of funny math to estimate the resources. Does estimate_resources not do everything you need? For example, it now allows for specifying the max number of CPUs.

I'd suggest leaving reasonable defaults in the code and using an sbatch file to tune these parameters as needed for specific reconstructions or depending on the cluster usage

This is the same as before. We just don't use the CPUs that much.

ieivanov · 2025-07-15T16:54:14Z

biahub/segment.py

                    input_channel_indices=[list(range(C))],
                    output_channel_indices=[list(range(C_segment))],
-                    num_processes=np.min([20, int(num_cpus * 0.8)]),
+                    num_processes=np.min([slurm_array_parallelism, int(num_cpus * 0.8)]),


The number of processes here should be decoupled from the slurm_array_parallelism?

It's tricky because this depends on the input dataset.
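One way to read the decoupling suggestion is to give the worker count its own cap instead of reusing slurm_array_parallelism. The sketch below is an assumption, not code from this PR: `choose_num_processes`, `max_processes`, and `cpu_fraction` are hypothetical names, and the defaults of 20 and 0.8 are taken from the removed line in the diff.

```python
def choose_num_processes(
    num_cpus: int, max_processes: int = 20, cpu_fraction: float = 0.8
) -> int:
    """Pick a worker count from the CPUs allocated to a single job.

    Hypothetical helper: the cap is independent of how many SLURM array
    jobs run concurrently, and at least one worker is always returned.
    """
    usable = int(num_cpus * cpu_fraction)  # leave headroom for the main process
    return max(1, min(max_processes, usable))


print(choose_num_processes(16))  # 12
print(choose_num_processes(64))  # 20
print(choose_num_processes(1))   # 1
```

As noted above, the right cap still depends on the input dataset, so it would probably need to stay configurable rather than hard-coded.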

edyoshikun · 2025-07-30T23:25:36Z

I couldn't immediately figure out why, when we run it locally, it doesn't seem to find the GPU devices and defaults to CPU, which is super slow for cellpose.
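A quick way to narrow this down is to check whether PyTorch can see a CUDA device in the local environment at all, and to fail loudly instead of silently falling back to CPU. This is only a sketch: `pick_device` and `require_gpu` are hypothetical names and are not how biahub or cellpose select devices.

```python
def pick_device(require_gpu: bool = False) -> str:
    """Return "cuda" if PyTorch sees a CUDA device, otherwise "cpu".

    Hypothetical helper: when require_gpu is True, raise instead of
    falling back to the CPU.
    """
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        torch = None

    if torch is not None and torch.cuda.is_available():
        return "cuda"
    if require_gpu:
        raise RuntimeError(
            "No CUDA device visible to PyTorch; check the driver, the installed "
            "PyTorch build, and CUDA_VISIBLE_DEVICES."
        )
    return "cpu"


print(pick_device())
```

If this prints "cpu" on a machine that has a GPU, a CPU-only PyTorch build or a driver mismatch would be the first things to rule out.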

ieivanov · 2025-08-08T01:42:02Z

@edyoshikun let's discuss the resource allocation in the PR and then we can merge it if you're happy with it.

edyoshikun · 2025-08-11T22:46:05Z

There is a caveat right now that this won't work with the Neuromast segmentations because there is no fine-tuned model for the Neuromast with CP4 yet.

edyoshikun added 2 commits June 15, 2025 20:55

upgrade the segmentation to use cellpose sam

4dbf169

update the segmentation config

a234a41

edyoshikun requested review from Soorya19Pradeep, ieivanov and tayllatheodoro June 16, 2025 15:35

ziw-liu reviewed Jun 30, 2025


edyoshikun added 3 commits July 2, 2025 17:42

fix bug when using 2d zslice

6d9245c

bump cellpose that supports fp16

8101134

modify default submitit parameters and optimizing segment_data func

95cb565

ieivanov reviewed Jul 15, 2025


setting to uint32

7443759

edyoshikun and others added 2 commits July 30, 2025 17:06

enforcing gpu in the config as optional

202f943

Merge branch 'main' into segment_cpsam

0f3ed50

mattersoflight added this to the Advanced Analysis milestone Aug 14, 2025

4 participants