Opendelta to peft migration (#434) + memory optimization (#320) #486
Conversation
…CarperAI#434) + Collapse reference+learner hydra heads when using LoRa (CarperAI#320)
else:
    strict = True

self.accelerator.load_state(directory or self.config.train.checkpoint_dir, strict=strict, **kwargs)
For this function I'm not confident that everything I wrote makes sense. I haven't tested it in a truly distributed setting, and I'm not sure how useful it is, because it only sets the internal state of the accelerator without restoring other things like self.model. Also, when not using peft, testing trainer.load(checkpoint_dir) was not working because of an additional "base_model" prefix in the layer names.
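For reference, here is a minimal sketch of the kind of key remapping that could work around such a prefix mismatch (the prefix string, the helper name, and the commented usage lines are illustrative assumptions, not the actual trlX code):

```python
def strip_prefix_from_state_dict(state_dict, prefix="base_model."):
    """Return a copy of `state_dict` with `prefix` removed from matching keys."""
    return {
        (key[len(prefix):] if key.startswith(prefix) else key): value
        for key, value in state_dict.items()
    }

# Hypothetical usage (the path and `model` are placeholders, not trlX internals):
# state_dict = torch.load("checkpoint_dir/pytorch_model.bin", map_location="cpu")
# model.load_state_dict(strip_prefix_from_state_dict(state_dict), strict=False)
```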
Can you elaborate on what you mean by memory optimization beyond just the integration of PEFT?
If you check AutoModelForCausalLMWithHydraValueHead.__init__ and its Seq2Seq equivalent, when peft is used we don't create a frozen head, because we can simply bypass or temporarily deactivate the adapter. This comes from issue #320.
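As a rough illustration of that idea (not the actual hydra-head code; the model name and LoRA settings are arbitrary), peft can temporarily bypass the adapter so the same weights double as the frozen reference model:

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = get_peft_model(
    AutoModelForCausalLM.from_pretrained("gpt2"),
    LoraConfig(task_type="CAUSAL_LM"),
)

inputs = tokenizer("Hello", return_tensors="pt")
with torch.no_grad():
    policy_logits = model(**inputs).logits    # adapter active: learner/policy
    with model.disable_adapter():             # adapter bypassed: frozen reference
        reference_logits = model(**inputs).logits
```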
Hi, @glerzing! Amazing contribution 🙂
I've left some initial requests for changes and pointed out a few issues I encountered while launching some of our examples. We'll need to sort out a few things that break backward compatibility before merging.
I tried to use the model tiny-gpt2, which would be much faster than gpt2 for the tests, but unfortunately it seems to cause numerical instabilities in test_backpropagation_and_disabling, which can fail depending on the random seed. I did, however, replace t5-small with google/t5-efficient-tiny, which works well.
LGTM!
One final change! Per the Discord discussion on the 8-bit model forward args issue, let's add a guard check for 8-bit loading and notify users that it is still an experimental feature.
super().__init__()
self.base_model = base_model
# cache `forward` args for general use (avoids incompatible args across architectures)
self.forward_kwargs = inspect.getfullargspec(self.base_model.forward).args
self.is_loaded_in_8bit = getattr(base_model, "is_loaded_in_8bit", False)
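For context, the cached argument names let the wrapper drop kwargs that a given architecture's forward does not accept; a minimal standalone sketch of that filtering (the helper name is an illustration, not the wrapper's actual method):

```python
import inspect

def filter_forward_kwargs(model, **kwargs):
    """Keep only the kwargs that `model.forward` accepts, so architecture-specific
    arguments (e.g. `position_ids`) don't raise a TypeError."""
    accepted = inspect.getfullargspec(model.forward).args
    return {k: v for k, v in kwargs.items() if k in accepted}
```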
Hardcode is_loaded_in_8bit to False and log something along the lines of "8-bit loading is an experimental feature not yet fully tested; leaving for curious users to explore".
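A minimal sketch of what that guard could look like (assuming a module-level logger; `self` and `base_model` come from the surrounding __init__, and this is illustrative rather than the final implementation):

```python
import logging

logger = logging.getLogger(__name__)

# Inside the wrapper's __init__, after `base_model` is set:
if getattr(base_model, "is_loaded_in_8bit", False):
    logger.warning(
        "8-bit loading is an experimental feature that is not yet fully tested; "
        "leaving it for curious users to explore."
    )
self.is_loaded_in_8bit = False  # hardcoded until 8-bit support is validated
```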
@glerzing Huge contribution and an amazing job! Thank you 🚀
I closed #477 and opened this new PR instead.
Feel free to ask questions or make remarks. If you want the automated tests to run faster, I can reduce the number of models or configs to test. I mostly relied on unit tests for verification. A few things, like CausalILQLOutput, were added to make the methods more generic and the automated tests easier to write.