[Liger] liger DPO support #2568

Open · wants to merge 25 commits into main

Conversation

kashif (Collaborator) commented Jan 14, 2025

What does this PR do?

Add support for Liger-Kernel losses for the DPO trainer.

Needs: linkedin/Liger-Kernel#521

PEFT support: #3065
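
For orientation, a hedged usage sketch of the feature: the model, dataset, and trainer wiring below are placeholders, and only the `use_liger_loss` flag comes from this PR (see the docs snippet quoted further down in this thread).

```python
# Minimal usage sketch, not taken from this PR: model and dataset are
# placeholders; the only flag the PR adds is `use_liger_loss`.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen2.5-0.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)
train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

training_args = DPOConfig(
    output_dir="dpo-liger",
    use_liger_loss=True,  # route the DPO loss through the Liger-Kernel fused implementation
)
trainer = DPOTrainer(
    model=model,
    args=training_args,
    processing_class=tokenizer,
    train_dataset=train_dataset,
)
trainer.train()
```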

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

3. Loss values are reasonable and finite
4. Training works with both default and custom beta values
"""
beta_values = [0.1, 0.5] # Test multiple beta values

Can you use @parameterized.expand instead?
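
A sketch of that suggestion, assuming the `parameterized` package; the class and test names are illustrative, not the PR's actual test code.

```python
# Illustrative sketch of the reviewer's suggestion: feed the beta values
# through @parameterized.expand instead of looping inside a single test.
import unittest

from parameterized import parameterized


class TestLigerDPOLoss(unittest.TestCase):
    @parameterized.expand([(0.1,), (0.5,)])
    def test_dpo_trainer_with_liger_loss(self, beta):
        # The real test would build DPOConfig(beta=beta, use_liger_loss=True),
        # run a few steps, and assert the loss is finite; elided in this sketch.
        self.assertGreater(beta, 0.0)


if __name__ == "__main__":
    unittest.main()
```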

qgallouedec (Member)

The Liger loss isn't compatible with reference log-prob precomputing, right? If so, we could add a warning or an error.
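
A hedged sketch of what such a guard could look like; the function name, its placement, and the exact message are assumptions, while `precompute_ref_log_probs` is the existing DPOConfig flag and `use_liger_loss` is the flag this PR adds.

```python
# Sketch of the suggested guard; placement and wording are assumptions.
from trl import DPOConfig


def validate_liger_loss_config(args: DPOConfig) -> None:
    if getattr(args, "use_liger_loss", False) and args.precompute_ref_log_probs:
        raise ValueError(
            "`use_liger_loss=True` cannot be combined with `precompute_ref_log_probs=True`; "
            "the fused chunked loss computes reference log-probs on the fly."
        )
```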

Comment on lines 87 to 105

## Liger for reducing peak memory usage

[To complete]

<hfoptions id="liger">
<hfoption id="DPO">

To use Liger for reducing peak memory usage, use the following code snippet:

```python
from trl import DPOConfig

training_args = DPOConfig(..., use_liger_loss=True)
```

</hfoption>
</hfoptions>

@kashif I've added this section in the new guide for reducing memory usage, in case you have some words to fill it in.

VProv mentioned this pull request on Mar 26, 2025.

kashif (Collaborator, Author) commented Mar 26, 2025

@VProv, at the moment I'm having issues getting the same outputs/metrics with and without Liger in the trainer.

VProv commented Mar 26, 2025

> @VProv, at the moment I'm having issues getting the same outputs/metrics with and without Liger in the trainer.

What setup are you using?

vaibhavjindal (Contributor)

Hi, I am working on fixing the output/metrics issue.
I've added a PR in Liger-Kernel: linkedin/Liger-Kernel#676.

vaibhavjindal (Contributor)

@kashif @qgallouedec can you please review the following PR, which fixes the output/metrics issue? Thanks :)
#3346

kashif (Collaborator, Author) commented Apr 23, 2025

Thanks @vaibhavjindal, done. I'll fix the merge conflict and then review this PR.

@hanbyul-kim

Hi, thanks for sharing your work! Can I use your code with DeepSpeed ZeRO-3? I tried running it with that setup, but it doesn't seem to work. Based on my analysis of the error log, I think it's related to parameter partitioning.

[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/dpo_loss.py", line 94, in forward
[rank5]:     return super().forward(
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 241, in forward
[rank5]:     accumulate_chunk(input_chunk, target_chunk, ref_input_chunk, chosen_nll_target_chunk)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 159, in accumulate_chunk
[rank5]:     ) = fused_fwd_bwd(input_chunk, target_chunk, ref_input_chunk, chosen_nll_target_chunk)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 120, in fused_fwd_bwd
[rank5]:     return torch.func.grad_and_value(compute_loss, argnums=(0, 1), has_aux=True)(
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/apis.py", line 440, in wrapper
[rank5]:     return eager_transforms.grad_and_value_impl(
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/vmap.py", line 48, in fn
[rank5]:     return f(*args, **kwargs)
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/eager_transforms.py", line 1409, in grad_and_value_impl
[rank5]:     output = func(*args, **kwargs)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 377, in _compute_loss
[rank5]:     ) = LigerFusedLinearPreferenceBase.chunk_forward(
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 289, in chunk_forward
[rank5]:     logits_chunk = input_chunk @ weight.t()
[rank5]: RuntimeError: size mismatch, got input (322), mat (322x4096), vec (0)

@hanbyul-kim

Continuing my analysis, I can confirm that it's definitely connected to DeepSpeed ZeRO-3. When I switched to stage 2, it ran smoothly without any issues.
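
For context, a minimal sketch of the gather pattern ZeRO-3 usually requires when a partitioned weight is read directly; the stand-in module and any use of the gathered weight are assumptions, only the DeepSpeed context manager shown is a real API.

```python
# Hedged sketch, not a proposed fix: under ZeRO-3 the lm_head weight is
# partitioned, so code that reads the full matrix directly (as the chunked loss
# does in `input_chunk @ weight.t()`) sees a 0-element tensor, which matches the
# "vec (0)" in the trace above.
import deepspeed
import torch.nn as nn

lm_head = nn.Linear(4096, 32000, bias=False)  # stand-in for the model's lm_head

with deepspeed.zero.GatheredParameters([lm_head.weight], modifier_rank=None):
    # Inside this context the weight is fully materialized on every rank, so a
    # fused/chunked loss could safely read it; the actual loss call is omitted.
    full_weight = lm_head.weight
```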

kashif (Collaborator, Author) commented May 5, 2025

Thanks @hanbyul-kim for the report.
