Conversation

@zkh2016 (Contributor) commented Aug 25, 2021

PR types

New features

PR changes

OPs

Describe

Fuses elementwise_add, dropout, elementwise_add, and layer_norm into one operator; only the forward pass is supported.

// before fusion
out1 = elementwise_add(src, bias)
out2 = dropout(out1)
out3 = elementwise_add(residual, out2)
out = layer_norm(out3, other args)

// after fusion
out = fused_layernorm_residual_dropout_bias(src, residual, bias, other args)
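
For reference, the fused forward computation is equivalent to the following single-threaded C++ sketch (illustrative only; the actual operator is a CUDA kernel, and the function name, the precomputed mask, and the factor parameter are assumptions, not the PR's API):

#include <cmath>
#include <cstdint>

// Illustrative sketch of the fused math for one row; not the actual kernel.
// mask[i] is the kept/dropped flag; factor is the dropout scale
// (e.g. 1 / (1 - dropout_prob) for upscale_in_train).
void FusedLnResidualDropoutBiasRow(const float* src, const float* residual,
                                   const float* bias, const uint8_t* mask,
                                   float factor, const float* gamma,
                                   const float* beta, float epsilon, int cols,
                                   float* out) {
  // out3 = residual + dropout(src + bias)
  float mean = 0.0f;
  for (int i = 0; i < cols; ++i) {
    out[i] = residual[i] + (src[i] + bias[i]) * mask[i] * factor;
    mean += out[i];
  }
  mean /= cols;

  float var = 0.0f;
  for (int i = 0; i < cols; ++i) {
    float d = out[i] - mean;
    var += d * d;
  }
  var /= cols;

  // out = layer_norm(out3) with scale gamma and shift beta
  float inv_std = 1.0f / std::sqrt(var + epsilon);
  for (int i = 0; i < cols; ++i) {
    out[i] = (out[i] - mean) * inv_std * gamma[i] + beta[i];
  }
}

Fusing these four steps saves three round trips of the intermediate tensors through global memory, which is the point of the operator.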

@paddle-bot-old commented:

Thanks for your contribution!
Please wait for the result of CI first. See the Paddle CI Manual for details.

@zkh2016 zkh2016 force-pushed the fused_layernorm_residual_dropout_bias branch from 87b2723 to aa27d96 Compare August 31, 2021 06:35
@zkh2016 zkh2016 force-pushed the fused_layernorm_residual_dropout_bias branch from 9c77253 to bdb5f85 Compare September 9, 2021 02:12
@zkh2016 zkh2016 marked this pull request as draft September 9, 2021 06:16
@zkh2016 zkh2016 marked this pull request as ready for review September 15, 2021 03:15
namespace paddle {
namespace operators {

namespace cg = cooperative_groups;
Contributor:

Is this unused?

Contributor Author:

done

if (is_test) {
factor = is_upscale_in_train ? static_cast<T>(1.0f)
: static_cast<T>(1.0f - dropout_prob);
}
Contributor:

You forgot to replace this block with GetFactor.

Contributor Author:

done
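
For context, a GetFactor helper consolidating this scale computation might look like the following (a hypothetical sketch; the actual helper in Paddle may differ in name, location, and signature):

// Hypothetical sketch of a dropout-scale helper; not Paddle's actual code.
// Training, upscale_in_train: kept values scaled by 1 / (1 - p).
// Training, downscale:        no scaling during training.
// Test, upscale_in_train:     identity.
// Test, downscale:            output scaled by (1 - p).
template <typename T>
inline T GetFactor(const float dropout_prob, const bool is_upscale_in_train,
                   const bool is_test) {
  T factor = is_upscale_in_train
                 ? static_cast<T>(1.0f / (1.0f - dropout_prob))
                 : static_cast<T>(1.0f);
  if (is_test) {
    factor = is_upscale_in_train ? static_cast<T>(1.0f)
                                 : static_cast<T>(1.0f - dropout_prob);
  }
  return factor;
}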

@xingfeng01 (Contributor) commented:

Suggestion: in the next PR, add comments to the functions explaining the computation logic.

@xingfeng01 (Contributor) commented:

LGTM

@lanxianghit lanxianghit merged commit 7975dfc into PaddlePaddle:develop Sep 17, 2021
AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this pull request Sep 29, 2021
…35151)

Fused elementwise_add, dropout, elementwise_add, and layer_norm into one operator; only the forward pass is supported.
No Python API changes.
@zkh2016 zkh2016 deleted the fused_layernorm_residual_dropout_bias branch August 19, 2022 04:05