[Chat] Rlhf support SimPO #5850

YeAnbang · 2024-06-24T05:13:41Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs
I have installed pre-commit: pip install pre-commit && pre-commit install

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

📝 What does this PR do?

Summarize your work here.
if you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

I have linked my PR to an issue (instruction)
My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
I have performed a self-review of my code
I have added thorough tests.
I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

🌝 Yes, I do.
🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

TongLi3701

Thanks Anbang, I left some comments. Please have a look.

applications/ColossalChat/coati/dataset/loader.py

applications/ColossalChat/examples/README.md

…lhf_SimPO

TongLi3701

Thanks, Anbang. Please remove TODO list in the README.

I left some comments. Please address them and merge.

applications/ColossalChat/README.md

applications/ColossalChat/examples/README.md

applications/ColossalChat/examples/training_scripts/train_dpo.py

…lhf_SimPO

…port lora with gradient checkpoint

…lhf_SimPO

for more information, see https://pre-commit.ci

YeAnbang added 3 commits June 24, 2024 02:12

add SimPO

82aecd6

Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into main

4b59d87

fix dataloader

0b2d627

YeAnbang requested a review from a team as a code owner June 24, 2024 05:13

remove debug code

f3de5a0

TongLi3701 requested changes Jun 25, 2024

View reviewed changes

applications/ColossalChat/coati/dataset/loader.py Outdated Show resolved Hide resolved

applications/ColossalChat/examples/README.md Show resolved Hide resolved

applications/ColossalChat/examples/README.md Outdated Show resolved Hide resolved

YeAnbang added 5 commits June 27, 2024 07:20

add orpo

c8d1b4a

fix style

8aad064

fix colossalai, transformers version

384c640

fix colossalai, transformers version

afa5306

fix colossalai, transformers version

b117274

YeAnbang requested review from TongLi3701 and binmakeswell June 27, 2024 08:32

YeAnbang added 3 commits June 28, 2024 02:50

Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into r…

e752776

…lhf_SimPO

fix torch colossalai version

a8af6cc

update transformers version

ff53520

TongLi3701 approved these changes Jun 30, 2024

View reviewed changes

applications/ColossalChat/README.md Show resolved Hide resolved

applications/ColossalChat/examples/README.md Show resolved Hide resolved

applications/ColossalChat/examples/training_scripts/train_dpo.py Outdated Show resolved Hide resolved

YeAnbang added 4 commits July 10, 2024 02:32

Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into r…

16f3451

…lhf_SimPO

add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Sup…

d888c37

…port lora with gradient checkpoint

fix style

f6ef5c3

Merge branch 'main' of https://github.com/hpcaitech/ColossalAI into r…

33f1520

…lhf_SimPO

YeAnbang force-pushed the rlhf_SimPO branch from 7929b21 to 33f1520 Compare July 10, 2024 10:43

[pre-commit.ci] auto fixes from pre-commit.com hooks

8a9721b

for more information, see https://pre-commit.ci

YeAnbang merged commit dd9e1cd into main Jul 11, 2024
4 checks passed

YeAnbang deleted the rlhf_SimPO branch July 19, 2024 07:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Chat] Rlhf support SimPO #5850

[Chat] Rlhf support SimPO #5850

YeAnbang commented Jun 24, 2024

TongLi3701 left a comment

TongLi3701 left a comment •

edited

Loading

[Chat] Rlhf support SimPO #5850

[Chat] Rlhf support SimPO #5850

Conversation

YeAnbang commented Jun 24, 2024

📌 Checklist before creating the PR

🚨 Issue number

📝 What does this PR do?

💥 Checklist before requesting a review

⭐️ Do you enjoy contributing to Colossal-AI?

TongLi3701 left a comment

Choose a reason for hiding this comment

TongLi3701 left a comment • edited Loading

Choose a reason for hiding this comment

TongLi3701 left a comment •

edited

Loading