Collapse reference+learner hydra heads when using LoRa #320
Comments
Haha, I was not aware that Aman proposed the same thing.
When will this feature be available?
I am not sure anyone has started. cc @jon-tow
Not yet. @glerzing is looking into it.
Cool!!! Looking forward to this update. 👍
Sorry to make you wait, but it should take a few weeks to get this done. As a very rough estimate, I would say that I may push a tested solution around the 10th of May. But I'm new here, so I don't know how much time would then pass before a new release version.
* Migrate to peft from opendelta for parameter efficient tuning methods (#434) + Collapse reference+learner hydra heads when using LoRa (#320)
* fix from_config
* Review corrections
* ILQL generate when temperature is 0.
* revert: guard against experimental 8-bit loading support
* format: run `black`
---------
Co-authored-by: jon-tow <[email protected]>
Co-authored-by: maxreciprocate <[email protected]>
🚀 The feature, motivation, and pitch
With additive (delta-style) parameter-efficient tuning methods such as LoRa, we should be able to make a slightly more memory-efficient hydra architecture by using a single block that computes roughly `frozen_head + tunable_weights` for the learner/policy head's forward pass and simply `frozen_head` for the reference, instead of maintaining 2x heads.

CC @LouisCastricato and @cat-state for pointing this out
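
A minimal sketch of the idea, assuming the `peft` library (which the linked commit later migrates to); the model name and LoRA hyperparameters below are illustrative only, not trlx's actual configuration. Because LoRA keeps the pretrained weights frozen and only trains small adapter matrices, the reference logits can be recovered from the same module by disabling the adapter, so no second frozen head has to be held in memory.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "gpt2"  # any causal LM; used here only for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
base = AutoModelForCausalLM.from_pretrained(model_name)

# Wrap the base model with LoRA: base weights stay frozen, only the
# low-rank adapter matrices are trainable (illustrative hyperparameters).
lora_config = LoraConfig(r=8, lora_alpha=16, target_modules=["c_attn"], task_type="CAUSAL_LM")
policy = get_peft_model(base, lora_config)

inputs = tokenizer("Hello, world", return_tensors="pt")

# Learner/policy logits: frozen weights + tunable LoRA deltas.
policy_logits = policy(**inputs).logits

# Reference logits: same module with the adapter switched off, i.e. exactly
# the frozen pretrained model -- no separate reference head is maintained.
with torch.no_grad(), policy.disable_adapter():
    ref_logits = policy(**inputs).logits
```

In the hydra picture, this collapses the two heads into one block: the learner's forward pass sees `frozen_head + tunable_weights`, and the reference pass sees only `frozen_head`, obtained by toggling the adapter off rather than duplicating the layers.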
Alternatives
No response
Additional context
No response