Prototype example #1
base: dev-0.1
Conversation
…er passing itself into each callback on each invocation; the trainer is initialized as an internal attribute of each callback during trainer initialization.
… allow for specification of an early-termination metric
…ermination metric is supplied, the trainer runs for the specified number of epochs.
… with prediction to streamline evaluation.
…ted on a subset of the target–prediction pairs.
…lected input and target patches/images from the dataset alongside model predictions. The new plot functions include plot_predictions_grid_from_eval, which can operate downstream of existing inference/evaluation results to avoid redundant forward passes, and plot_predictions_grid_from_model, which internally performs inference and evaluation and visualizes results from a trained model and dataset, enabling visualization without inference and evaluation beforehand.
…upports both PatchDataset and standard ImageDataset
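The split between the two plotting entry points described above can be sketched as follows. The function names come from the PR description, but the signatures and internals here are assumptions for illustration only; the placeholder returns stand in for the real grid rendering:

```python
import torch


def plot_predictions_grid_from_eval(inputs, targets, predictions, n_examples=4):
    """Operate on precomputed inference/evaluation results; no forward pass.

    Hypothetical signature: returns (input, target, prediction) triplets in
    place of the real image-grid rendering.
    """
    return list(zip(inputs, targets, predictions))[:n_examples]


def plot_predictions_grid_from_model(model, dataset, n_examples=4):
    """Run inference internally, then reuse the eval-based plotting path."""
    model.eval()
    inputs, targets, preds = [], [], []
    with torch.no_grad():
        for x, y in list(dataset)[:n_examples]:
            inputs.append(x)
            targets.append(y)
            preds.append(model(x.unsqueeze(0)).squeeze(0))
    return plot_predictions_grid_from_eval(inputs, targets, preds, n_examples)
```

The design point is that the second function is a thin wrapper over the first, so existing evaluation results never trigger a redundant forward pass.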
Hey @wli51, I responded to your responses. Let me know if you want to leave other responses. Otherwise, I will give my closing remarks (and my decision for this PR).
This first iteration LGTM, good job @wli51! I think there are some changes we will want to make in future PRs. I added additional comments to help guide some of these changes. Let me know if you have any questions, and when you want to meet.
self.__dataset = dataset
self.__cache_size = cache_size if cache_size is not None else len(dataset)
Is there only a speedup if every sample can fit into memory? If len(dataset) = N and you can only fit x samples into memory, then couldn't you just remove and then add N - x samples each epoch from memory? I think this would improve substantially if there is enough memory.
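The partial-caching idea above could be sketched as a bounded cache with least-recently-used eviction. This is a hypothetical helper, not code from the PR; the class name, `loader` callable, and `max_items` parameter are all assumptions:

```python
from collections import OrderedDict


class BoundedImageCache:
    """Keep up to `max_items` decoded samples in memory, evicting the least
    recently used one, so datasets larger than memory still get a speedup
    for the samples that do fit."""

    def __init__(self, loader, max_items):
        self._loader = loader        # function: key -> decoded sample
        self._max_items = max_items
        self._cache = OrderedDict()  # insertion order doubles as LRU order

    def get(self, key):
        if key in self._cache:
            self._cache.move_to_end(key)  # mark as most recently used
            return self._cache[key]
        sample = self._loader(key)        # cache miss: load from disk
        self._cache[key] = sample
        if len(self._cache) > self._max_items:
            self._cache.popitem(last=False)  # evict least recently used
        return sample
```

With this shape, `cache_size` would become an upper bound on memory use rather than an all-or-nothing switch.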
raise ValueError("No current index set")

@property
def input_channel_keys(self):
The user should have some idea of how the data is organized. I think option 2 would take too much time (especially going the deep learning route), so we probably want the user to perform this type of data splitting/preprocessing. I think option 1 sounds like a better choice. Also, I'm leaning towards the idea of allowing the user to specify which samples belong in each data split (having one folder per split is one option).
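The one-folder-per-split convention mentioned above might look like the following. This helper is purely illustrative (name, signature, and split names are assumptions, not part of the PR):

```python
from pathlib import Path


def discover_splits(root, split_names=("train", "val", "test")):
    """Map each split name to the files under root/<split>.

    Only splits whose directory actually exists are returned, so the
    user controls the split membership entirely through folder layout.
    """
    root = Path(root)
    return {
        name: sorted(p.name for p in (root / name).iterdir())
        for name in split_names
        if (root / name).is_dir()
    }
```

The appeal of this layout is that no split logic lives in the library at all; the user's directory structure is the specification.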
return torch.from_numpy(input_images).float(), torch.from_numpy(target_images).float()

def _cache_image(self, site_id: str) -> None:
That could be an option. There are probably other ways you could combine them as well. Agreed, probably not a top priority, but something worth considering.
inputs=interpolated,
grad_outputs=torch.ones_like(prob_interpolated),
create_graph=True,
retain_graph=True,
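For context, the diff snippet above is the core of a WGAN-GP-style gradient penalty: `create_graph=True` keeps the gradient computation differentiable so the penalty can itself be backpropagated. A self-contained sketch consistent with those keyword arguments might look like this (the helper name and signature are illustrative, not the PR's API):

```python
import torch


def gradient_penalty(critic, real, fake):
    """Penalize the critic's gradient norm on real/fake interpolations."""
    batch_size = real.size(0)
    # Random per-sample interpolation between real and generated images.
    eps = torch.rand(batch_size, 1, 1, 1, device=real.device)
    interpolated = (eps * real + (1 - eps) * fake).requires_grad_(True)
    prob_interpolated = critic(interpolated)
    gradients = torch.autograd.grad(
        outputs=prob_interpolated,
        inputs=interpolated,
        grad_outputs=torch.ones_like(prob_interpolated),
        create_graph=True,   # penalty must itself be differentiable
        retain_graph=True,
    )[0]
    gradients = gradients.view(batch_size, -1)
    # Penalize deviation of the gradient norm from 1 (the Lipschitz target).
    return ((gradients.norm(2, dim=1) - 1) ** 2).mean()
```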
Interesting. In that case I would just keep it.
self,
model: torch.nn.Module,
optimizer: torch.optim.Optimizer,
backprop_loss: Union[torch.nn.Module, List[torch.nn.Module]],
Some of this could probably also be explained in the README to add clarity.
I do wonder how we would balance that clarity against the redundancy of the dataset-related initializations. Should we have an AbstractTrainer that branches off into AbstractGANTrainer and AbstractConvNetTrainer?
Having two branching abstract trainers may work for our purposes. This software will need to have constraints (e.g. the user won't be able to train every possible virtual staining model); otherwise, the user would need to use a framework, like PyTorch, to create their own models and training procedures. At the same time, we don't want a weak base class in case we want to train new models, or the same models in a different way. Sometimes the code just needs to be good enough for the changes we anticipate making.
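The branching hierarchy discussed above could be sketched like this. Class names follow the comment, but the method split is an assumption about where the shared/specific boundary would fall:

```python
from abc import ABC, abstractmethod


class AbstractTrainer(ABC):
    """Shared dataset/loop setup lives here, so subclasses avoid the
    redundant dataset-related initializations mentioned above."""

    def __init__(self, model, dataset, epochs=1):
        self.model = model
        self.dataset = dataset
        self.epochs = epochs

    def train(self):
        # Common outer loop; only the per-batch step is model-family-specific.
        for _ in range(self.epochs):
            for batch in self.dataset:
                self.train_step(batch)

    @abstractmethod
    def train_step(self, batch):
        """Model-family-specific optimization logic."""


class AbstractConvNetTrainer(AbstractTrainer):
    def train_step(self, batch):
        # Supervised step: forward, loss, backward (details omitted).
        ...


class AbstractGANTrainer(AbstractTrainer):
    def train_step(self, batch):
        # Alternating generator/critic updates (details omitted).
        ...
```

The constraint point then becomes explicit: anything expressible as a `train_step` fits; anything that needs a different outer loop falls outside the software's scope and belongs in raw PyTorch.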
This is a self-contained-ish minimal example of the trainer, callbacks, and models, for the purposes of demoing and collecting suggestions. The actual PRs should probably be smaller.