Which model weights should you use when fine-tuning a second time? #164

Answered by mrdbourke
teddcp2 asked this question in Q&A
Hey there,

Your thinking here lines up with how I would do it:

"Should I take the feature-extracted model and increase the number of trainable layers from the bottom, with a lower learning rate? Or should I continue from the last point, i.e. take the same model I just fine-tuned and start from its last epoch with a lower learning rate?"

In essence:

  • Feature extract
  • Unfreeze some layers
  • Fine-tune
  • Unfreeze some more layers
  • Fine-tune
  • Unfreeze some more layers
  • Fine-tune

and so on.

Which of these stages gives the best results is a little bit of guess and check, though.
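
As a rough sketch, the feature-extract / unfreeze / fine-tune loop above could look something like the following. It assumes each stage continues from the weights left by the previous stage (which is how the list reads), and that the learning rate drops 10x per stage, which is only a common rule of thumb. The helper names `unfreeze_last` and `fine_tuning_plan` are made up for illustration, and the `SimpleNamespace` objects are stand-ins for real layers (anything with a `trainable` attribute, such as the entries of a Keras `base_model.layers`, should work the same way).

```python
from types import SimpleNamespace


def unfreeze_last(layers, n_trainable):
    """Freeze every layer except the last `n_trainable` ones (hypothetical helper)."""
    cutoff = len(layers) - n_trainable
    for i, layer in enumerate(layers):
        layer.trainable = i >= cutoff


def fine_tuning_plan(n_layers, unfreeze_step, base_lr, lr_factor=0.1, n_stages=3):
    """Return one (n_trainable, learning_rate) pair per stage (hypothetical helper).

    Stage 0 is feature extraction: the whole backbone stays frozen.
    Each later stage unfreezes `unfreeze_step` more layers and scales the
    learning rate by `lr_factor` (the 10x drop is an assumption, not a rule).
    """
    plan = [(0, base_lr)]
    lr = base_lr
    for stage in range(1, n_stages + 1):
        lr = lr * lr_factor
        plan.append((min(stage * unfreeze_step, n_layers), lr))
    return plan


# Stand-in for the layers of a pretrained backbone.
layers = [SimpleNamespace(name=f"layer_{i}", trainable=True) for i in range(10)]

history = []
for n_trainable, lr in fine_tuning_plan(len(layers), unfreeze_step=3, base_lr=1e-3, n_stages=2):
    unfreeze_last(layers, n_trainable)
    # With a real Keras model you would recompile here with the new learning
    # rate (changes to `trainable` only take effect after compiling again) and
    # then keep training the SAME model object, so the weights carry over.
    history.append((n_trainable, lr, [l.name for l in layers if l.trainable]))

for n_trainable, lr, names in history:
    print(f"trainable={n_trainable} lr={lr:.0e} -> {names}")
```

How many layers to unfreeze per stage and how far to drop the learning rate are exactly the parts you would tune by experiment.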

Jeremy Howard explains this methodology quite well in the ULMFiT paper, https://arxiv.org/abs/1801.06146 (see the ablation studies comparing different fine-tuning methods).

@realnihal's explanation ab…

Answer selected by teddcp2