FIX: Correctly save a ArgillaTrainer + TRL model with peft_config #3795

tomaarsen · 2023-09-20T11:23:57Z

Description

If you pass peft_config to trainer.update_config, then it will currently still save the full (but broken) model with some LoRA (or other PEFT) parts interspersed that can't be loaded again. This is because the self._transformers_model was saved, but that model was the original model (e.g. Llama 2 7b) and not the PEFT model, which is created internally in e.g. the SFTTrainer.

In short, this now saves the actual model and tokenizer used by the underlying trainer, so that if that trainer make any other changes to the model, those changes will be saved too.

Type of change

Bug fix (non-breaking change which fixes an issue)
Breaking change (fix or feature that would cause existing functionality to not work as expected)

How Has This Been Tested

Created a new test and verified that there's now adapter_model.bin and adapter_config.json files after training.

Checklist

I followed the style guidelines of this project
I did a self-review of my code
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I filled out the contributor form (see text above)
I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

Tom Aarsen

Rather than the model as we store it. This is required for Peft models to be saved correctly.

…hotfix/updated_save

tomaarsen · 2023-09-20T11:38:52Z

This also works in my RLHF tutorial.

codecov · 2023-09-20T12:33:06Z

Codecov Report

Patch has no changes to coverable lines.

📢 Thoughts on this report? Let us know!.

github-actions · 2023-09-20T12:46:56Z

The URL of the deployed environment for this PR is https://argilla-quickstart-pr-3795-ki24f765kq-no.a.run.app

tomaarsen added 3 commits September 20, 2023 13:11

Save the model stored by the internal trainer

e76e694

Rather than the model as we store it. This is required for Peft models to be saved correctly.

Add test showing that adapter saves now

d3cd3be

Merge branch 'develop' of https://github.com/argilla-io/argilla into …

eab5ae1

…hotfix/updated_save

tomaarsen added type: bug Indicates an unexpected problem or unintended behavior type: integration Indicates integrations with third parties area: trainer Indicates that an issue or pull request is related to the Argilla Trainer labels Sep 20, 2023

Add changelog entry

2b948b5

tomaarsen requested review from davidberenstein1957 and alvarobartt September 20, 2023 11:24

Merge branch 'develop' into hotfix/updated_save

635d20a

davidberenstein1957 approved these changes Sep 28, 2023

View reviewed changes

davidberenstein1957 merged commit 4dfa9e2 into develop Sep 28, 2023

davidberenstein1957 deleted the hotfix/updated_save branch September 28, 2023 13:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: Correctly save a ArgillaTrainer + TRL model with peft_config #3795

FIX: Correctly save a ArgillaTrainer + TRL model with peft_config #3795

tomaarsen commented Sep 20, 2023

tomaarsen commented Sep 20, 2023

codecov bot commented Sep 20, 2023

github-actions bot commented Sep 20, 2023

FIX: Correctly save a ArgillaTrainer + TRL model with peft_config #3795

FIX: Correctly save a ArgillaTrainer + TRL model with peft_config #3795

Conversation

tomaarsen commented Sep 20, 2023

Description

tomaarsen commented Sep 20, 2023

codecov bot commented Sep 20, 2023

Codecov Report

github-actions bot commented Sep 20, 2023