Now that the QAT + LoRA recipe has landed in #1931, we can support a finetuning flow like the one used to generate the quantized Llama 3.2 1B and 3B checkpoints (see, e.g., the 1B checkpoint here). Unlike traditional LoRA, one path for finetuning with QAT + LoRA involves updating both the LoRA weights and the base model weights (with the fake quantization operation applied to the latter), as referenced in this blog. We should add an option to our QAT + LoRA recipe to make all params trainable, not just the LoRA ones. This can be done by modifying the call to `set_trainable_params` here.
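
A minimal sketch of what this could look like, assuming the option is plumbed through the recipe config. The flag name `train_base_weights` and the helper function below are hypothetical; `get_adapter_params` and `set_trainable_params` are torchtune's existing PEFT utilities:

```python
from torch import nn
from torchtune.modules.peft import get_adapter_params, set_trainable_params


def setup_trainable_params(model: nn.Module, train_base_weights: bool = False) -> None:
    """Configure which parameters receive gradients in the QAT + LoRA recipe.

    ``train_base_weights`` is a hypothetical flag name: when True, the base
    model weights are updated alongside the LoRA weights (gradients flow
    through the fake-quantize ops); when False, keep the standard LoRA
    behavior of training only the adapter parameters.
    """
    if train_base_weights:
        # Make every parameter trainable, not just the LoRA adapters.
        for p in model.parameters():
            p.requires_grad_(True)
    else:
        # Default LoRA behavior: freeze the base model, train adapters only.
        set_trainable_params(model, get_adapter_params(model))
```

In the recipe, this would replace the existing unconditional `set_trainable_params` call during model setup, leaving the default (LoRA-only) behavior unchanged when the flag is off.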