add layerwise learning rate for adamw #35569

zhaoyinglia · 2021-09-08T03:59:53Z

PR types

New features

PR changes

OPs

Describe

Add a layerwise learning rate feature for AdamW.
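To make the feature concrete, here is a minimal scalar sketch of one AdamW step in which a per-parameter ratio scales the base learning rate. It assumes the ratio simply multiplies the learning rate used for both the decoupled weight decay and the Adam update; the function name `adamw_step` and its defaults are illustrative and are not taken from this PR.

```python
import math


def adamw_step(param, grad, m, v, t, lr, lr_ratio=1.0,
               beta1=0.9, beta2=0.999, eps=1e-8, coeff=0.01):
    """One scalar AdamW step (illustrative only, not the Paddle kernel).

    Assumption: lr_ratio multiplies the base learning rate for this parameter.
    """
    scaled_lr = lr * lr_ratio

    # Decoupled weight decay, applied with the scaled learning rate.
    param = param * (1.0 - scaled_lr * coeff)

    # Standard Adam moment estimates with bias correction (t starts at 1).
    m = beta1 * m + (1.0 - beta1) * grad
    v = beta2 * v + (1.0 - beta2) * grad * grad
    m_hat = m / (1.0 - beta1 ** t)
    v_hat = v / (1.0 - beta2 ** t)

    param = param - scaled_lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v
```

With `lr_ratio=0.0` the parameter is left unchanged, and a ratio below 1.0 shrinks the step relative to the base learning rate, which is the behaviour a layerwise schedule relies on.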

paddle-bot-old · 2021-09-08T03:59:56Z

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

JZ-LIANG · 2021-09-09T02:33:08Z

paddle/fluid/operators/optimizers/adam_op.cc

@@ -236,6 +236,10 @@ class AdamWOpMaker : public AdamOpMaker {
 public:
  void Make() {
    AdamOpMaker::Make();
+    AddAttr<float>("lr_ratio",


Why add this argument to Adam? Is it because AdamW and Adam share the same .cc file?

In that case, AdamW should have its own .cc file.

AdamWOpMaker inherits from AdamOpMaker, and both ops use the same InferShape function from AdamOp.
As a result, 'lr_ratio' has no effect on Adam.
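The inheritance argument can be illustrated with a small Python analogy. This is not the Paddle C++ code: the classes below are hypothetical stand-ins, and the attribute names and the default of 1.0 for `lr_ratio` are assumptions made only for the sketch.

```python
class AdamOpMaker:
    """Hypothetical stand-in for the C++ AdamOpMaker."""

    def __init__(self):
        self.attrs = {}

    def add_attr(self, name, default):
        self.attrs[name] = default

    def make(self):
        # Attributes declared for both Adam and AdamW (illustrative names).
        self.add_attr("beta1", 0.9)
        self.add_attr("beta2", 0.999)
        self.add_attr("epsilon", 1e-8)


class AdamWOpMaker(AdamOpMaker):
    """Hypothetical stand-in for the C++ AdamWOpMaker."""

    def make(self):
        # Reuse everything Adam declares, then add the AdamW-only attribute.
        super().make()
        self.add_attr("lr_ratio", 1.0)  # assumed default


def declared_attrs(maker_cls):
    """Return the attribute names a maker class declares."""
    maker = maker_cls()
    maker.make()
    return set(maker.attrs)
```

Because the extra attribute is registered only in the subclass, `declared_attrs(AdamOpMaker)` does not contain `lr_ratio`, even though both makers live in the same file.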

JZ-LIANG · 2021-09-09T02:35:49Z

python/paddle/optimizer/adamw.py

@@ -163,6 +165,9 @@ def __init__(self,
        self._apply_decay_param_fun = apply_decay_param_fun
        self._coeff = coeff
        self._lr_to_coeff = dict()
+        if lr_ratio is not None:
+            assert isinstance(lr_ratio, Callable)
+        self._lr_ratio = lr_ratio


Consider how many kernels will be affected by "lr_ratio".
If you want lr_ratio to affect only the GPU and CPU kernels, you should raise a NotImplementedError for XPU and NPU here.
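One way to follow this suggestion is a guard that runs when the optimizer is constructed. The sketch below is self-contained and does not call any Paddle API: the helper `check_lr_ratio_supported`, the `SUPPORTED_DEVICES` set, and the plain-string device names are all hypothetical.

```python
# Assumption for this sketch: only these device types have an lr_ratio kernel.
SUPPORTED_DEVICES = {"cpu", "gpu"}


def check_lr_ratio_supported(lr_ratio, device):
    """Validate lr_ratio and reject devices without kernel support."""
    if lr_ratio is None:
        # Feature not requested, so every device is fine.
        return
    if not callable(lr_ratio):
        raise TypeError("lr_ratio must be a callable or None")
    if device not in SUPPORTED_DEVICES:
        raise NotImplementedError(
            f"'lr_ratio' is not implemented for device type {device!r}")
```

Failing at construction time surfaces the limitation immediately, instead of letting training proceed with a ratio that the kernel silently ignores.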

JZ-LIANG · 2021-09-09T02:37:56Z

python/paddle/optimizer/adamw.py

@@ -163,6 +165,9 @@ def __init__(self,
        self._apply_decay_param_fun = apply_decay_param_fun
        self._coeff = coeff
        self._lr_to_coeff = dict()
+        if lr_ratio is not None:


An explanation of the new lr_ratio argument should be added, following the explanation of "apply_decay_param_fun".
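An example alongside that explanation could show what such a callable looks like. The sketch below assumes the callable receives a parameter object with a `name` attribute and returns a float multiplier; the factory `make_layerwise_lr_ratio`, the naming pattern `layer_<index>`, and the decay value are hypothetical.

```python
import re
from types import SimpleNamespace


def make_layerwise_lr_ratio(n_layers, decay=0.8):
    """Build a callable that maps a parameter to a learning rate ratio.

    Assumption: parameter names embed the layer index, e.g. "layer_2.weight".
    """

    def lr_ratio(param):
        match = re.search(r"layers?[._](\d+)", param.name)
        if match is None:
            # Parameters outside numbered layers keep the base learning rate.
            return 1.0
        depth = int(match.group(1))
        # The top layer gets ratio 1.0; each lower layer is scaled by `decay`.
        return decay ** (n_layers - 1 - depth)

    return lr_ratio


# Usage with stand-in parameter objects that only carry a name.
ratio_fn = make_layerwise_lr_ratio(n_layers=4, decay=0.5)
top = SimpleNamespace(name="layer_3.weight")
bottom = SimpleNamespace(name="layer_0.weight")
print(ratio_fn(top), ratio_fn(bottom))
```

A callable of this kind passes the `isinstance(lr_ratio, Callable)` check in the diff above.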

JZ-LIANG

LGTM

* add layerwise learning rate for adamw
* fix format
* add unitest
* add NotImplementedError
* add gpu unitest
* update gpuplace

zhaoyingli added 2 commits September 8, 2021 11:51

add layerwise learning rate for adamw

343b816

fix format

3b76854

zhaoyinglia changed the title ~~Adamw layerwise~~ add layerwise learning rate for adamw Sep 8, 2021

add unitest

03f5181

JZ-LIANG reviewed Sep 9, 2021

zhaoyingli added 3 commits September 9, 2021 11:52

add NotImplementedError

000e7b2

add gpu unitest

0114fd8

update gpuplace

442db6f

zhaoyinglia force-pushed the adamw_layerwise branch from 33a7e31 to a18e0d5 Compare September 13, 2021 12:05

PangHua previously approved these changes Sep 14, 2021

hbwx24 dismissed PangHua’s stale review via a18e0d5 September 14, 2021 02:21

zhaoyinglia force-pushed the adamw_layerwise branch from a18e0d5 to 442db6f Compare September 14, 2021 02:44

zhaoyinglia requested a review from PangHua September 14, 2021 02:56

PangHua approved these changes Sep 14, 2021

JZ-LIANG approved these changes Sep 14, 2021

JZ-LIANG merged commit 91cf918 into PaddlePaddle:develop Sep 14, 2021

sljlp mentioned this pull request Oct 16, 2021

Dropout+layerwisedecay #36482

Closed

zhaoyinglia deleted the adamw_layerwise branch August 30, 2023 06:30
