Fine-tune a Vision Transformer Model with a custom biomedical dataset #85

emre570 · 2024-04-25T17:30:52Z

I'm planning to make a notebook about fine-tuning a vit model with a custom biomedical dataset.
I have a code ready to use, made for my graduation project.
I used HF Datasets for dataset works, HF Transformers and Trainer for fine-tuning.
(Optional) Metrics won't show at training process momentarily, I can add a custom callback function.

If it's okay, i will push my code, then begin the editing for beginners. I am waiting for your opinions and suggestions.

review-notebook-app · 2024-04-25T17:30:57Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

stevhliu · 2024-04-25T22:14:23Z

Yes, looking forward to reviewing your notebook! 🤗

Added and edited the notebook

emre570 · 2024-04-26T11:41:38Z

Hello @stevhliu, made the first commit

I edited and organized all sections, waiting for your opinions and reviews

stevhliu · 2024-04-26T18:17:42Z

notebooks/en/fine_tuning_vit_custom_dataset.ipynb

@@ -0,0 +1,767 @@
+{


In "Dataset Info", it'd be nice to briefly explain what the images are of so users have more context about what they're training the model to do.

Reply via ReviewNB

stevhliu · 2024-04-26T18:17:42Z

notebooks/en/fine_tuning_vit_custom_dataset.ipynb

@@ -0,0 +1,767 @@
+{


Is this dataset available on the Hub? I think it'd be easier for users to follow along if they could also download the dataset or if you provided some more information/details about how a user can create their own dataset with their images.

Reply via ReviewNB

stevhliu · 2024-04-26T18:17:42Z

notebooks/en/fine_tuning_vit_custom_dataset.ipynb

@@ -0,0 +1,767 @@
+{


Maybe say something like the following to avoid confusion with the next sentence where you say we can see the features again.

"We can the image is a PIL.Image with a label associated with it."

Reply via ReviewNB

stevhliu · 2024-04-26T18:18:11Z

Make sure to add your notebook to the toctree!

emre570 · 2024-04-27T13:09:45Z

Hello @stevhliu, I made some changes

I put some images from dataset to "Dataset Info" section, but I have some questions.

The user can find similar datasets from Kaggle, and I can also upload the dataset to Hub. What should I do?
In toctree, which section should I put the notebook?
Last question, you said "We can the image is a PIL.Image with a label associated with it.". Sorry I didn't understand this. Where should I put it?

stevhliu · 2024-04-29T19:20:50Z

The user can find similar datasets from Kaggle, and I can also upload the dataset to Hub. What should I do?

I think it'd be easiest to upload the dataset to the Hub so users can follow along with your notebook without putting in the extra work of finding a similar dataset from Kaggle if they don't want to.

In toctree, which section should I put the notebook?

I think you can create a new "Computer vision" section.

Last question, you said "We can the image is a PIL.Image with a label associated with it.".

Sorry for the typo. You can put that text before you call train_ds[0]. So in other words:

We can see the image is a PIL.Image with a label associated with it.

train_ds[0]

- Added notebook to toctree - Put an image about dataset info - Pushed the dataset to Hub

emre570 · 2024-04-29T20:52:34Z

Hello @stevhliu, I made the changes you asked for.

Put an image to Dataset info so user can see some images from dataset.
Edited the sections you corrected.
Added notebook to toctree
Pushed the dataset to Hub and waiting to be public when notebook releases.

HuggingFaceDocBuilderDev · 2024-04-29T21:48:22Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

review-notebook-app · 2024-04-30T13:32:18Z

View / edit / reply to this conversation on ReviewNB

merveenoyan commented on 2024-04-30T13:32:17Z
----------------------------------------------------------------

maybe you could give link to the base model :)

emre570 commented on 2024-04-30T17:52:20Z
----------------------------------------------------------------

I already did, it should direct you to model's HF page

review-notebook-app · 2024-04-30T13:32:19Z

View / edit / reply to this conversation on ReviewNB

merveenoyan commented on 2024-04-30T13:32:18Z
----------------------------------------------------------------

nit: let's snake case the variable names for consistency with the rest of the recipe

review-notebook-app · 2024-04-30T13:32:20Z

View / edit / reply to this conversation on ReviewNB

merveenoyan commented on 2024-04-30T13:32:19Z
----------------------------------------------------------------

maybe call push_to_hub explicitly as well

review-notebook-app · 2024-04-30T13:32:21Z

View / edit / reply to this conversation on ReviewNB

merveenoyan commented on 2024-04-30T13:32:20Z
----------------------------------------------------------------

nit: scikit-learn

review-notebook-app · 2024-04-30T13:32:21Z

View / edit / reply to this conversation on ReviewNB

merveenoyan commented on 2024-04-30T13:32:21Z
----------------------------------------------------------------

this is nice, but also we could put classification score because in this case we care about recall a lot (we don't want to miss malignant ones that look like benign or malign)

merveenoyan

mostly nits, thanks a lot! we can merge afterwards IMO, very well made

emre570 · 2024-04-30T17:52:22Z

I already did, it should direct you to model's HF page

View entire conversation on ReviewNB

emre570 · 2024-04-30T18:15:20Z

Hello @merveenoyan, I made changes you asked. The notebook had issues. Your request for recall score saved nearly everything 😁.

Some cells had errors and could've ruined all work. I made the notebook from scratch and fixed all code. It is fully working now and ready to use.

review-notebook-app · 2024-05-02T17:22:27Z

View / edit / reply to this conversation on ReviewNB

stevhliu commented on 2024-05-02T17:22:27Z
----------------------------------------------------------------

The link to the image doesn't work. Can you upload it to https://huggingface.co/datasets/huggingface/cookbook-images and then link from there?

stevhliu · 2024-05-02T17:23:03Z

One more comment, then we can merge! 🤗

- Opened PR and added the image.

emre570 · 2024-05-02T18:45:11Z

Hey @stevhliu, I think I did it, opened a PR and uploaded the image, it should work now.

merveenoyan · 2024-05-02T19:22:16Z

@emre570 I just merged your PR to dataset repository

merveenoyan

thanks a lot @emre570 once @stevhliu approves we can merge!

emre570 · 2024-05-02T19:27:06Z

Thanks folks, it was a pleasure ❤️

stevhliu

LGTM, thanks for the contribution! 🤗

emre570 · 2024-05-06T14:57:52Z

HUGE thanks folks, again, it was a pleasure ❤️

Create test-notebook.ipynb

608dd73

Initial commit

247668f

Added and edited the notebook

stevhliu reviewed Apr 26, 2024

View reviewed changes

Made changes

54eaf89

- Added notebook to toctree - Put an image about dataset info - Pushed the dataset to Hub

merveenoyan reviewed Apr 30, 2024

View reviewed changes

Edited Notebook

e0e5bfb

Fixed "Dataset Info" Image

13c1f22

- Opened PR and added the image.

merveenoyan approved these changes May 2, 2024

View reviewed changes

stevhliu approved these changes May 6, 2024

View reviewed changes

stevhliu merged commit 3c0cff3 into huggingface:main May 6, 2024
1 check passed

Fine-tune a Vision Transformer Model with a custom biomedical dataset #85

Fine-tune a Vision Transformer Model with a custom biomedical dataset #85

Uh oh!

Conversation

emre570 commented Apr 25, 2024

Uh oh!

review-notebook-app bot commented Apr 25, 2024

Uh oh!

stevhliu commented Apr 25, 2024

Uh oh!

emre570 commented Apr 26, 2024

Uh oh!

stevhliu Apr 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stevhliu commented Apr 26, 2024

Uh oh!

emre570 commented Apr 27, 2024

Uh oh!

stevhliu commented Apr 29, 2024

Uh oh!

emre570 commented Apr 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 29, 2024

Uh oh!

review-notebook-app bot commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

merveenoyan left a comment

Choose a reason for hiding this comment

Uh oh!

emre570 commented Apr 30, 2024

Uh oh!

emre570 commented Apr 30, 2024

Uh oh!

review-notebook-app bot commented May 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stevhliu commented May 2, 2024

Uh oh!

emre570 commented May 2, 2024

Uh oh!

merveenoyan commented May 2, 2024

Uh oh!

merveenoyan left a comment

Choose a reason for hiding this comment

Uh oh!

emre570 commented May 2, 2024

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

emre570 commented May 6, 2024

Uh oh!

Uh oh!

stevhliu Apr 26, 2024 •

edited

Loading

stevhliu Apr 26, 2024 •

edited

Loading

stevhliu Apr 26, 2024 •

edited

Loading

emre570 commented Apr 29, 2024 •

edited

Loading

review-notebook-app bot commented Apr 30, 2024 •

edited

Loading

review-notebook-app bot commented Apr 30, 2024 •

edited

Loading

review-notebook-app bot commented Apr 30, 2024 •

edited

Loading

review-notebook-app bot commented Apr 30, 2024 •

edited

Loading

review-notebook-app bot commented Apr 30, 2024 •

edited

Loading

review-notebook-app bot commented May 2, 2024 •

edited

Loading