
[Liger] liger DPO support #2568


Merged: 38 commits merged into main on Jun 12, 2025

Conversation

@kashif (Collaborator) commented Jan 14, 2025

What does this PR do?

Add support for the Liger-Kernel DPO loss in the DPO trainer.

Needs: linkedin/Liger-Kernel#521

PEFT support: #3065
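
To make the intent concrete, here is a minimal usage sketch, assuming the feature is exposed through a `use_liger_loss` flag on `DPOConfig`; the flag name, model id, and dataset below are illustrative assumptions, not taken from this PR's diff.

# Minimal sketch: enabling the Liger-Kernel fused DPO loss from the trainer config.
# `use_liger_loss=True` is an assumed flag name; requires `pip install liger-kernel`.
from datasets import load_dataset
from trl import DPOConfig, DPOTrainer

train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

training_args = DPOConfig(
    output_dir="qwen-dpo-liger",
    use_liger_loss=True,  # route the DPO loss through Liger-Kernel's chunked fused-linear kernel
    per_device_train_batch_size=2,
)

trainer = DPOTrainer(
    model="Qwen/Qwen2-0.5B-Instruct",  # illustrative model id; any supported causal LM works
    args=training_args,
    train_dataset=train_dataset,
)
trainer.train()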

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec (Member)
liger loss isn't compatible with ref precomputing right? If so we could add a warning or an error.
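
A hedged sketch of the kind of guard being suggested here; `precompute_ref_log_probs` is the existing DPOConfig option, while `use_liger_loss` is an assumed flag name, and this is not the merged implementation.

# Illustrative guard: the fused Liger loss computes reference log probs inside the kernel,
# so precomputed reference log probs would be ignored; reject the combination early.
def validate_liger_config(args) -> None:
    if getattr(args, "use_liger_loss", False) and args.precompute_ref_log_probs:
        raise ValueError(
            "use_liger_loss=True is incompatible with precompute_ref_log_probs=True; "
            "the Liger fused DPO loss computes reference log probs on the fly."
        )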

@VProv commented Mar 26, 2025

@VProv mentioned this pull request on Mar 26, 2025
@kashif (Collaborator, Author) commented Mar 26, 2025

@VProv, at the moment, I was having issues getting the same outputs/metrics with and without liger in the trainer.

@VProv commented Mar 26, 2025

> @VProv, at the moment, I was having issues getting the same outputs/metrics with and without liger in the trainer.

What setup are you using?

@vaibhavjindal (Contributor)
Hi, I am working on fixing the output/metrics issue.
Added a PR in liger-kernel: linkedin/Liger-Kernel#676

@vaibhavjindal (Contributor)
@kashif @qgallouedec can you please review the following PR which fixes the output/metrics issue? Thanks :)
#3346

@hanbyul-kim

Hi, thanks for sharing your work! Can I use your code with DeepSpeed ZeRO-3? I tried running it with that setup, but it doesn't seem to be working. Based on my analysis of the error log, I think it's related to parameter partitioning.

[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/dpo_loss.py", line 94, in forward
[rank5]:     return super().forward(
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 241, in forward
[rank5]:     accumulate_chunk(input_chunk, target_chunk, ref_input_chunk, chosen_nll_target_chunk)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 159, in accumulate_chunk
[rank5]:     ) = fused_fwd_bwd(input_chunk, target_chunk, ref_input_chunk, chosen_nll_target_chunk)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 120, in fused_fwd_bwd
[rank5]:     return torch.func.grad_and_value(compute_loss, argnums=(0, 1), has_aux=True)(
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/apis.py", line 440, in wrapper
[rank5]:     return eager_transforms.grad_and_value_impl(
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/vmap.py", line 48, in fn
[rank5]:     return f(*args, **kwargs)
[rank5]:   File "/root/.dpo_trainer_venv/lib/python3.10/site-packages/torch/_functorch/eager_transforms.py", line 1409, in grad_and_value_impl
[rank5]:     output = func(*args, **kwargs)
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 377, in _compute_loss
[rank5]:     ) = LigerFusedLinearPreferenceBase.chunk_forward(
[rank5]:   File "/mnt/nappipe/users/hanbyul-kim/RORL/apply_liger_loss/Liger-Kernel/src/liger_kernel/chunked_loss/fused_linear_preference.py", line 289, in chunk_forward
[rank5]:     logits_chunk = input_chunk @ weight.t()
[rank5]: RuntimeError: size mismatch, got input (322), mat (322x4096), vec (0)

@hanbyul-kim

Continuing my analysis, I can confirm that it's definitely connected to DeepSpeed ZeRO-3. When I switched to stage 2, it ran smoothly without any issues.
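
For readers hitting the same traceback: under ZeRO-3 the lm_head weight is partitioned across ranks, so its local shard can have size 0, which matches the `vec (0)` in the size-mismatch error above. A hedged sketch of a common workaround, using DeepSpeed's public `GatheredParameters` context manager to materialize the weight around the fused loss call; this is only an illustration, not the fix that eventually landed, and `model.lm_head` is an assumed attribute name.

# Sketch: temporarily gather the ZeRO-3-partitioned output projection so the chunked
# Liger loss sees the full [vocab_size, hidden_size] weight instead of an empty shard.
import deepspeed

def gathered_lm_head(model):
    # modifier_rank=None gathers the parameters read-only on every rank
    return deepspeed.zero.GatheredParameters(list(model.lm_head.parameters()), modifier_rank=None)

# usage (sketch):
# with gathered_lm_head(model):
#     loss, aux = liger_dpo_loss(model.lm_head.weight, hidden_states, labels, ...)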

@kashif (Collaborator, Author) commented May 5, 2025

Thanks @hanbyul-kim for the report.

@vaibhavjindal (Contributor)
@kashif just wanted to circle back and see if we can merge this now? We wanted to try it out internally at LinkedIn.


if is_wandb_available():
    import wandb


def shift_tokens_right(input_ids: torch.Tensor, pad_token_id: int, decoder_start_token_id: int) -> torch.Tensor:

Review comment (Member):
pad_token_id isn't used?


if is_wandb_available():
    import wandb


def shift_tokens_right(input_ids: torch.Tensor, decoder_start_token_id: int) -> torch.Tensor:
    """Shift input ids one token to the right, and pad with pad_token_id"""

Review comment (Member):
this docstring ain't accurate I think
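
For reference, a minimal sketch of the helper the two comments are about: once `pad_token_id` is dropped from the signature, the function only prepends `decoder_start_token_id`, so the docstring should say that rather than mention padding. This is an illustrative version, not necessarily the exact code merged here.

import torch

def shift_tokens_right(input_ids: torch.Tensor, decoder_start_token_id: int) -> torch.Tensor:
    """Shift input ids one token to the right, prepending `decoder_start_token_id`."""
    shifted_input_ids = input_ids.new_zeros(input_ids.shape)
    shifted_input_ids[:, 1:] = input_ids[:, :-1].clone()
    shifted_input_ids[:, 0] = decoder_start_token_id
    return shifted_input_ids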

@kashif merged commit 53c4a7c into main on Jun 12, 2025
11 checks passed
@kashif deleted the liger-dpo branch on June 12, 2025 at 10:25