Skip to content

Conversation

shaojiewang
Copy link
Contributor

PR types

Others

PR changes

OPs

Describe

optimizer with datatype casting

@paddle-bot
Copy link

paddle-bot bot commented Mar 3, 2023

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

dev_ctx.template Alloc<T>(param_out),
master_in_data,
master_out_data,
param.numel());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is grad_type completely independent with T or MPDType ?
If not, there might be a better way to gather 2 branches into just one interface with template.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, grad_type is independent of T or MPDType.

Copy link
Contributor

@JamesLim-sy JamesLim-sy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@JamesLim-sy JamesLim-sy merged commit 290aa36 into PaddlePaddle:develop Mar 16, 2023
Xreki added a commit to Xreki/Paddle that referenced this pull request Apr 9, 2023
aoyulong pushed a commit that referenced this pull request Apr 9, 2023
* Cherry-pick the register of bfloat16 for amp_kernel, pull request #45541.

* Cherry-pick the master_grad support of adamw, pull request #51141.

* add bf16 for some ops in static mode (#51582)

* Add bfloat16 support for some api in static mode.

* Fix codestyle.

* Revert the change of layer_function_generator.py.

---------

Co-authored-by: Shaojie WANG <wsjmessi@163.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants