
Qwen3 not stopping generation after lora finetuning #7943


Closed
1 task done
dittops opened this issue May 3, 2025 · 8 comments
Labels: solved (This problem has been already solved)

Comments

@dittops commented May 3, 2025

Reminder

  • I have read the above rules and searched the existing issues.

System Info

LLaMA-Factory.git@6a584b40928fb6d69e22c1403db226eb04358a30#egg=llamafactory
Python 3.11.12
Ubuntu 22.04
NVIDIA A100 80GB

Reproduction

I'm training Qwen/Qwen3-4B-Base with language adapters. With full-parameter SFT, the generation comes out as expected, but when I train the same data and model with LoRA, the generation does not stop. Here is my training config:

### model
model_name_or_path: Qwen/Qwen3-4B-Base
trust_remote_code: true
flash_attn: fa2

### method
stage: sft
do_train: true
finetuning_type: lora
lora_rank: 16
lora_target: all
deepspeed: examples/deepspeed/ds_z2_config.json

### dataset
dataset: alpaca_hindi
template: qwen3
overwrite_cache: true
preprocessing_num_workers: 16
dataloader_num_workers: 4

### output
output_dir: saves/qwen3-4b-base/lora/sft-hindi-v2
logging_steps: 10
plot_loss: true
overwrite_output_dir: true
save_only_model: true
report_to: wandb  # choices: [none, wandb, tensorboard, swanlab, mlflow]

### train
per_device_train_batch_size: 16
gradient_accumulation_steps: 1
learning_rate: 1.0e-4
num_train_epochs: 5.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
ddp_timeout: 180000000
resume_from_checkpoint: null
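
As a quick check (a minimal sketch, not part of the original report; it assumes only the transformers library and the model id above), the stop tokens the base checkpoint defines can be printed like this:

# Sketch: print the eos_token the Qwen3 base checkpoint ships with and the ids
# of the two candidate stop tokens that chat templates commonly use.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Base", trust_remote_code=True)
print("eos_token:", tok.eos_token, "->", tok.eos_token_id)
print("<|im_end|> id:", tok.convert_tokens_to_ids("<|im_end|>"))
print("<|endoftext|> id:", tok.convert_tokens_to_ids("<|endoftext|>"))

If the eos_token here does not match the token that the training template appends to each sample, the model never learns to emit the token that generation stops on.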

Others

No response

dittops added the labels bug (Something isn't working) and pending (This problem is yet to be addressed) on May 3, 2025
dittops changed the title from "Qwen3 not sopping generation after lora finetuning" to "Qwen3 not stopping generation after lora finetuning" on May 3, 2025
@dittops (Author) commented May 3, 2025

Instead of the end token, it generates some other token, and this is not consistent: when I reload, I see a different token.

[image attached]

@hiyouga (Owner) commented May 3, 2025

You can try template: default for base models

hiyouga closed this as completed on May 3, 2025
hiyouga added the label solved (This problem has been already solved) and removed the labels bug and pending on May 3, 2025
@dittops (Author) commented May 3, 2025

I have tried template: default for Qwen3, and it is still the same. But if I change the model to the Llama 3B base model, it works fine.

@hiyouga (Owner) commented May 3, 2025

Maybe the EOS token is wrong in the Qwen3 base model. You can use additional_target: embed_tokens to also fine-tune the token embeddings when using LoRA.

@dittops (Author) commented May 3, 2025

I have tried enabling that and am still getting the same issue:

### model
model_name_or_path: Qwen/Qwen3-4B-Base
trust_remote_code: true
flash_attn: fa2

### method
stage: sft
do_train: true
finetuning_type: lora
lora_rank: 16
lora_target: all
additional_target: embed_tokens
deepspeed: examples/deepspeed/ds_z2_config.json

### dataset
dataset: alpaca_en
template: default
# cutoff_len: 2048
# max_samples: 1000
overwrite_cache: true
preprocessing_num_workers: 16
dataloader_num_workers: 4

### output
output_dir: saves/qwen3-4b-base/lora/sft-test
logging_steps: 10
save_steps: 5000
plot_loss: true
overwrite_output_dir: true
save_only_model: false
report_to: wandb  # choices: [none, wandb, tensorboard, swanlab, mlflow]

### train
per_device_train_batch_size: 16
gradient_accumulation_steps: 1
learning_rate: 1.0e-4
num_train_epochs: 3.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
ddp_timeout: 180000000
resume_from_checkpoint: null
[image attached]

@hixulei commented May 3, 2025

I'm running into the same issue.

@hiyouga (Owner) commented May 9, 2025

Try setting the eos token in the tokenizer config to <|endoftext|> and using the default template if you are fine-tuning the base model with LoRA:
https://huggingface.co/Qwen/Qwen3-4B-Base/blob/main/tokenizer_config.json#L232
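
In code, that change amounts to something like the sketch below (the output directory is a placeholder taken from the configs above):

# Sketch: override the tokenizer's eos_token to <|endoftext|> and save the result
# next to the LoRA adapter so that inference stops on the same token the default
# template used during SFT. The path is a placeholder.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Base", trust_remote_code=True)
tok.eos_token = "<|endoftext|>"
print("eos_token:", tok.eos_token, "->", tok.eos_token_id)
tok.save_pretrained("saves/qwen3-4b-base/lora/sft-test")

Equivalently, the eos_token field can be edited directly in the tokenizer_config.json linked above.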

@SelenoChannel commented

> Try setting the eos token in the tokenizer config to <|endoftext|> and using the default template if you are fine-tuning the base model with LoRA: https://huggingface.co/Qwen/Qwen3-4B-Base/blob/main/tokenizer_config.json#L232

It worked! Thanks!
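
For completeness, a minimal inference sketch under the same assumptions (placeholder adapter path; the prompt format is only a rough approximation of the default template), to confirm that generation now terminates at the corrected EOS token:

# Sketch: load the base model plus the LoRA adapter and generate; with the
# corrected eos_token saved alongside the adapter, generation should stop
# instead of running until max_new_tokens. Paths and the prompt format are
# placeholders/approximations, not taken from the thread.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

adapter_dir = "saves/qwen3-4b-base/lora/sft-test"  # placeholder output dir
tok = AutoTokenizer.from_pretrained(adapter_dir)
base = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-4B-Base", torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, adapter_dir)

prompt = "Human: What is the capital of France?\nAssistant:"
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64, eos_token_id=tok.eos_token_id)
print(tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=False))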

hiyouga closed this as completed on May 11, 2025