Retrained time distributed model with lower learning rate starting at epoch 3 #232

ejm714 · 2022-09-26T22:01:47Z

The time distributed model was retrained since the StepLR did not run as intended, meaning the learning rate was not cut in half at epoch 3 in the prior run. Using a lower learning rate at this point is also useful since this is the epoch at which the backbone is unfrozen with a learning rate of 1/100 of the head.

This yields to a 4 point gain in macro f1 and a slight bump in top 1 accuracy as well (which is smaller given that the improvement comes for less common species, not blanks).

This model will replace the previous time distributed model.

Additional notes:

For this training, we used auto_lr_find (with a workaround, not on master). This yielded 0.001096, essentially the same learning rate as the default (0.001). This is reassuring in that the default seems to generalize well for various batch sizes at least when training from scratch.

netlify · 2022-09-26T22:01:51Z

✅ Deploy Preview for silly-keller-664934 ready!

Name	Link
🔨 Latest commit	`c4d377e`
🔍 Latest deploy log	https://app.netlify.com/sites/silly-keller-664934/deploys/6332214d36928a00088d48d9
😎 Deploy Preview	https://deploy-preview-232--silly-keller-664934.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site settings.

github-actions · 2022-09-26T22:05:34Z

🚀 Deployed on https://deploy-preview-232--silly-keller-664934.netlify.app

codecov-commenter · 2022-09-26T22:21:18Z

Codecov Report

Merging #232 (c4d377e) into master (eba2cec) will not change coverage.
The diff coverage is n/a.

Additional details and impacted files

@@          Coverage Diff           @@
##           master    #232   +/-   ##
======================================
  Coverage    87.2%   87.2%           
======================================
  Files          28      28           
  Lines        1961    1961           
======================================
  Hits         1710    1710           
  Misses        251     251

Impacted Files	Coverage Δ
zamba/models/config.py	`96.9% <ø> (ø)`

new version of time distributed

c4d377e

ejm714 requested a review from pjbull September 26, 2022 22:01

pjbull approved these changes Sep 26, 2022

View reviewed changes

ejm714 merged commit 380ef40 into master Sep 27, 2022

ejm714 deleted the steplr-retrain-fix branch September 27, 2022 00:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Retrained time distributed model with lower learning rate starting at epoch 3 #232

Retrained time distributed model with lower learning rate starting at epoch 3 #232

ejm714 commented Sep 26, 2022 •

edited

Loading

netlify bot commented Sep 26, 2022 •

edited

Loading

github-actions bot commented Sep 26, 2022

codecov-commenter commented Sep 26, 2022 •

edited

Loading

Retrained time distributed model with lower learning rate starting at epoch 3 #232

Retrained time distributed model with lower learning rate starting at epoch 3 #232

Conversation

ejm714 commented Sep 26, 2022 • edited Loading

netlify bot commented Sep 26, 2022 • edited Loading

✅ Deploy Preview for silly-keller-664934 ready!

github-actions bot commented Sep 26, 2022

codecov-commenter commented Sep 26, 2022 • edited Loading

Codecov Report

ejm714 commented Sep 26, 2022 •

edited

Loading

netlify bot commented Sep 26, 2022 •

edited

Loading

codecov-commenter commented Sep 26, 2022 •

edited

Loading