Added a Learning Rate Schedule for the BERT Example #96

Stealth-py · 2022-04-07T21:50:14Z

Fixes #84
This PR is ready for review!

google-cla · 2022-04-07T21:50:20Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

For more information, open the CLA check for this pull request.

mattdangerw

Thanks! Left some comments

.gitignore

examples/bert/run_pretraining.py

Stealth-py · 2022-04-08T07:09:06Z

Thanks for the review @mattdangerw!
I've made the necessary changes and resolved the comments above.

mattdangerw

Looks good, just a few more style nits, then can merge!

examples/bert/run_pretraining.py

Stealth-py · 2022-04-11T20:43:17Z

Thank you! I've resolved the issues and pushed :D

mattdangerw · 2022-04-11T21:17:54Z

@Stealth-py thanks! Can you fix formatting errors? https://github.com/keras-team/keras-nlp/runs/5979856948?check_suite_focus=true

Stealth-py · 2022-04-11T21:24:44Z

Yeah, I'm sorry, I should've run it before pushing into the main directory.
It should be fixed now I think.

mattdangerw · 2022-04-11T21:34:58Z

@Stealth-py thanks! I will merge now with a few last copy edits.

Stealth-py · 2022-04-11T21:38:09Z

Alright!

mattdangerw · 2022-04-11T21:47:01Z

@Stealth-py no need! We generally merge through github which will close. Thanks!

* Added a Schedule Class
* Added/Changed decay and increase factors
* Added/Changed decay and increase factors
* Edited the Scheduler, Changes
* calculates steps_per_epoch now
* uncommented the epochs flag
* final few changes/fixes/cleaning up
* Made the necessary changes
* Forgot to add the newline in gitignore, added now
* Update .gitignore

  Added newline
* resolved reviews 2.0
* ran format.sh and lint.sh
* Copy edits

Co-authored-by: Matt Watson <[email protected]>
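The diff itself is not reproduced in this conversation, so the following is only a rough sketch of what a schedule with an increase (warmup) phase and a decay phase driven by `steps_per_epoch` could look like. The function name `warmup_linear_decay`, its parameters, and all numeric values are hypothetical and are not taken from the PR; the merged code in `examples/bert/run_pretraining.py` may well differ.

```python
def warmup_linear_decay(step, peak_lr, warmup_steps, total_steps):
    """Hypothetical schedule: linear increase to peak_lr, then linear decay to 0."""
    if step < warmup_steps:
        # Increase phase: scale the rate linearly from 0 up to peak_lr.
        return peak_lr * step / warmup_steps
    # Decay phase: scale the rate linearly from peak_lr down to 0.
    remaining = max(total_steps - step, 0)
    return peak_lr * remaining / (total_steps - warmup_steps)


# Illustrative values only; none of these come from the PR.
num_examples, batch_size, epochs = 10_000, 100, 2
steps_per_epoch = num_examples // batch_size  # derived, not hard-coded
total_steps = steps_per_epoch * epochs
warmup_steps = total_steps // 10

for step in (0, warmup_steps // 2, warmup_steps, total_steps):
    print(step, warmup_linear_decay(step, 1e-4, warmup_steps, total_steps))
```

In a Keras training script, the same logic would more likely live in a subclass of `keras.optimizers.schedules.LearningRateSchedule` passed to the optimizer; plain Python is used here only to keep the sketch dependency-free.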

* Add PaliGemma2 arch
* Enable mixed precision check for PaliGemma
* Add conversion script
* Revert ImageConverter and reduce mem usage in the conversion script
* Remove `compute_output_spec`
* Fix `compute_output_shape` issue for keras 3.1
* Add model cards and update conversion script
* update presets

Co-authored-by: divyashreepathihalli <[email protected]>

* Add PaliGemma2 (#96)
* Add PaliGemma2 arch
* Enable mixed precision check for PaliGemma
* Add conversion script
* Revert ImageConverter and reduce mem usage in the conversion script
* Remove `compute_output_spec`
* Fix `compute_output_shape` issue for keras 3.1
* Add model cards and update conversion script
* update presets

Co-authored-by: divyashreepathihalli <[email protected]>

* Update pali_gemma_presets.py - remove mix presets
* Update pali_gemma_presets.py
* Update convert_pali_gemma2_checkpoints.py

Co-authored-by: james77777778 <[email protected]>

* Adding PaliGemma2 to KerasHub (#1998)
* Add PaliGemma2 (#96)
* Add PaliGemma2 arch
* Enable mixed precision check for PaliGemma
* Add conversion script
* Revert ImageConverter and reduce mem usage in the conversion script
* Remove `compute_output_spec`
* Fix `compute_output_shape` issue for keras 3.1
* Add model cards and update conversion script
* update presets

Co-authored-by: divyashreepathihalli <[email protected]>

* Update pali_gemma_presets.py - remove mix presets
* Update pali_gemma_presets.py
* Update convert_pali_gemma2_checkpoints.py

Co-authored-by: james77777778 <[email protected]>

* Version bump to 0.18.0
* Update pali_gemma_presets.py (#2003)
* Update pali_gemma_presets.py
* code reformat

Co-authored-by: james77777778 <[email protected]>

Stealth-py and others added 9 commits April 2, 2022 03:45

Added a Schedule Class

6f748b4

Added/Changed decay and increase factors

c01a700

Added/Changed decay and increase factors

911b274

Edited the Scheduler, Changes

dd6a813

calculates steps_per_epoch now

e59c27d

Merge branch 'keras-team:master' into master

a5f7601

uncommented the epochs flag

fc7acd1

Merge branch 'master' of https://github.com/Stealth-py/keras-nlp

90b7cd8

final few changes/fixes/cleaning up

f0cecef

mattdangerw requested changes Apr 7, 2022

Stealth-py and others added 3 commits April 8, 2022 12:21

Made the necessary changes

b01de5e

Merge branch 'keras-team:master' into master

fead8f1

Forgot to add the newline in gitignore, added now

fd328c4

Update .gitignore

a85e426

Added newline

mattdangerw approved these changes Apr 11, 2022

resolved reviews 2.0

2e8c79b

ran format.sh and lint.sh

4d98332

Copy edits

df0f0ac

mattdangerw merged commit 5cb1a16 into keras-team:master Apr 11, 2022
