
Conversation

@alphaGem commented on Jul 3, 2022

Related to issue #34.

@a710128 (Contributor) commented on Jul 4, 2022

  1. AdamOffload needs to be updated as well:
     https://github.com/OpenBMB/BMTrain/blob/0.1.7.post1/bmtrain/optim/adam_offload.py#L131-L172
  2. The maximize parameter also needs to be supported for torch.float (a minimal sketch of the flag's semantics follows this list).
  3. Bug: grad is not passed to the adam function.
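
For context, a minimal sketch of what the maximize flag usually means inside an Adam-style step: the gradient is negated before the normal update, so the optimizer ascends the objective instead of descending it. This is illustrative only and is not the BMTrain kernel linked above; the function and tensor names here are assumptions.

```python
import math
import torch

def adam_step(param, grad, exp_avg, exp_avg_sq, step,
              lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8,
              weight_decay=0.0, maximize=False):
    # Hypothetical reference step, not the BMTrain implementation.
    if maximize:
        grad = -grad  # ascend the objective instead of descending it
    if weight_decay != 0.0:
        grad = grad + weight_decay * param
    # Update biased first- and second-moment estimates in place.
    exp_avg.mul_(beta1).add_(grad, alpha=1 - beta1)
    exp_avg_sq.mul_(beta2).addcmul_(grad, grad, value=1 - beta2)
    # Bias correction and parameter update.
    bias_correction1 = 1 - beta1 ** step
    bias_correction2 = 1 - beta2 ** step
    denom = (exp_avg_sq.sqrt() / math.sqrt(bias_correction2)).add_(eps)
    param.addcdiv_(exp_avg, denom, value=-lr / bias_correction1)
```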

@alphaGem (Author) commented on Jul 5, 2022

Update:

  1. AdamOffload has been updated.
  2. The maximize parameter is now supported for torch.float (a short caller-side usage sketch follows below).
  3. grad is now passed to the adam function.
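
For reference, here is a small caller-side sketch of what maximize does, shown with torch.optim.Adam, which exposes the same flag; whether BMTrain's Adam/AdamOffload optimizers accept it with exactly this constructor signature is an assumption to check against the repo.

```python
import torch

x = torch.zeros(4, requires_grad=True)
# maximize=True makes the optimizer ascend the objective passed to backward().
opt = torch.optim.Adam([x], lr=0.1, maximize=True)

for _ in range(200):
    opt.zero_grad()
    objective = -((x - 1.0) ** 2).sum()  # concave objective, maximum at x == 1
    objective.backward()
    opt.step()

print(x)  # entries approach 1.0
```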

@a710128 merged commit e6ba031 into OpenBMB:main on Jul 8, 2022.

Successfully merging this pull request may close these issues.

[Feature] Supporting the maximize parameter of Adam optimizer when dtype is torch.half