[PyTorch] Add lr scheduler #1305
Conversation
@@ -356,15 +441,14 @@ In some cases initializing the parameters is not sufficient to guarantee a good
A rather simple fix for this dilemma is to use a warmup period during which the learning rate *increases* to its initial maximum, and then to cool the rate down until the end of the optimization process. For simplicity one typically uses a linear increase for this purpose. This leads to a schedule of the form indicated below.
```{.python .input}
-scheduler = lr_scheduler.CosineScheduler(20, warmup_steps=5, base_lr=0.5,
+scheduler = lr_scheduler.CosineScheduler(20, warmup_steps=5, base_lr=0.3,
```
why do we change base_lr?
The PyTorch implementation was somehow not converging reliably when base_lr was 0.5, so I changed it across frameworks for consistency.
That's ok. Just give me a heads up if you change original mx code :)
Sure! I mentioned it here. Will make it more explicit in the future if mxnet code is changed.
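For anyone reading along, here is a rough sketch of the warmup-plus-cosine schedule being configured in the diff above. It is not the code in this PR: the argument names mirror the call in the diff (max_update, base_lr, warmup_steps), while final_lr and warmup_begin_lr are assumed defaults.

```python
import math

class CosineScheduler:
    """Sketch only: linear warmup to base_lr, then cosine decay to final_lr."""
    def __init__(self, max_update, base_lr=0.01, final_lr=0,
                 warmup_steps=0, warmup_begin_lr=0):
        self.base_lr_orig = base_lr          # peak learning rate after warmup
        self.max_update = max_update         # step at which the decay ends
        self.final_lr = final_lr             # floor reached at max_update
        self.warmup_steps = warmup_steps
        self.warmup_begin_lr = warmup_begin_lr
        self.max_steps = max_update - warmup_steps

    def get_warmup_lr(self, epoch):
        # Linear increase from warmup_begin_lr to base_lr over warmup_steps.
        increase = ((self.base_lr_orig - self.warmup_begin_lr)
                    * float(epoch) / float(self.warmup_steps))
        return self.warmup_begin_lr + increase

    def __call__(self, epoch):
        if epoch < self.warmup_steps:
            return self.get_warmup_lr(epoch)
        if epoch <= self.max_update:
            # Cosine decay from base_lr down to final_lr.
            return self.final_lr + (self.base_lr_orig - self.final_lr) * (
                1 + math.cos(math.pi * (epoch - self.warmup_steps)
                             / self.max_steps)) / 2
        return self.final_lr

scheduler = CosineScheduler(20, warmup_steps=5, base_lr=0.3, final_lr=0.01)
```

In the PyTorch tab such a plain callable is usually applied by hand, e.g. `for param_group in trainer.param_groups: param_group['lr'] = scheduler(epoch)` at the start of each epoch, where `trainer` is assumed to be the `torch.optim` optimizer.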
```
-#@tab tensorflow
-scheduler = SquareRootScheduler(1.0)
+#@tab all
+scheduler = SquareRootScheduler(lr=0.1)
```
Here's another example of a modification to the original code. Please make sure that similar results are obtained.
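For context, the square-root scheduler in this hunk decays the rate roughly as lr / sqrt(t). A minimal sketch consistent with the new call (the +1 offset is an assumption about how the first step is handled, not necessarily the file's exact code):

```python
class SquareRootScheduler:
    """Sketch only: learning rate decays proportionally to 1 / sqrt(t)."""
    def __init__(self, lr=0.1):
        self.lr = lr

    def __call__(self, num_update):
        # Offset by 1 so the schedule is well defined at the first update.
        return self.lr * pow(num_update + 1.0, -0.5)

scheduler = SquareRootScheduler(lr=0.1)
```

Under this sketch's definition, dropping the base rate from 1.0 to 0.1 scales the whole curve down by a factor of 10, which is why it is worth confirming that the results stay comparable.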
Should we merge this now?
Description of changes:
By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.