add fp32 grad plus fp16 param in adamw #51141
Conversation
Your PR has been submitted successfully. Thank you for your contribution to the open-source project!
dev_ctx.template Alloc<T>(param_out),
master_in_data,
master_out_data,
param.numel());
Is grad_type completely independent of T and MPDType? If not, there might be a better way to gather the two branches into a single templated interface.
Yes, grad_type is independent of both T and MPDType.
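For context, here is a minimal sketch (hypothetical names, not the actual kernel code) of the kind of two-branch dispatch discussed above: because the gradient dtype is independent of the parameter type T and the master type MPDType, the kernel can inspect the grad tensor's dtype at runtime and instantiate the same templated body with either float or T as the gradient type.

```cpp
#include <cstdint>

// Hypothetical templated update body: GradT is a third, independent
// template parameter alongside the param type T and master type MPDType.
template <typename T, typename MPDType, typename GradT>
void AdamWForEachSketch(const GradT* grad, T* param_out,
                        MPDType* master_out, int64_t numel) {
  for (int64_t i = 0; i < numel; ++i) {
    // All arithmetic happens in the master precision.
    MPDType g = static_cast<MPDType>(grad[i]);
    MPDType p = master_out[i] - g;  // lr/moments omitted for brevity
    master_out[i] = p;
    param_out[i] = static_cast<T>(p);
  }
}

// The two branches then collapse into two instantiations of one interface.
template <typename T, typename MPDType>
void DispatchOnGradType(bool grad_is_fp32, const void* grad, T* param_out,
                        MPDType* master_out, int64_t numel) {
  if (grad_is_fp32) {
    AdamWForEachSketch<T, MPDType, float>(
        static_cast<const float*>(grad), param_out, master_out, numel);
  } else {
    AdamWForEachSketch<T, MPDType, T>(
        static_cast<const T*>(grad), param_out, master_out, numel);
  }
}
```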
LGTM
* Cherry-pick the register of bfloat16 for amp_kernel, pull request #45541.
* Cherry-pick the master_grad support of adamw, pull request #51141.
* Add bf16 for some ops in static mode (#51582)
* Add bfloat16 support for some api in static mode.
* Fix codestyle.
* Revert the change of layer_function_generator.py.

Co-authored-by: Shaojie WANG <wsjmessi@163.com>
PR types
Others
PR changes
OPs
Describe
Add support for an FP32 gradient combined with an FP16 parameter in the AdamW optimizer; the optimizer casts datatypes internally during the update.
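As a rough illustration only (hypothetical names, simplified math, bias correction omitted), the casting path can be thought of as: keep the AdamW moments and a master copy of the parameter in FP32, cast the incoming gradient (FP16 or FP32) up to FP32, perform the update there, and write the result back to the FP16 parameter.

```cpp
#include <cmath>
#include <cstdint>

// Simplified single-element AdamW step in FP32 master precision.
// "Half" stands in for the framework's FP16 type; all names are illustrative.
struct AdamWStateSketch {
  float m = 0.f, v = 0.f;  // first/second moments, kept in FP32
};

template <typename Half, typename GradT>
void AdamWStepSketch(GradT grad, Half* param_fp16, float* master_param,
                     AdamWStateSketch* s, float lr, float beta1, float beta2,
                     float eps, float weight_decay) {
  float g = static_cast<float>(grad);  // cast FP16/FP32 grad up to FP32
  s->m = beta1 * s->m + (1.f - beta1) * g;
  s->v = beta2 * s->v + (1.f - beta2) * g * g;
  float p = *master_param;
  p -= lr * (s->m / (std::sqrt(s->v) + eps) + weight_decay * p);
  *master_param = p;                   // update the FP32 master copy
  *param_fp16 = static_cast<Half>(p);  // refresh the FP16 parameter
}
```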