broadcast qkv_op #35780

fengxiaoshuai · 2021-09-15T16:23:22Z

PR types

Others

PR changes

Others

Describe

to support qk_bias is [batch, 1, 1, seq_len]

paddle-bot-old · 2021-09-15T16:23:29Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

shangzhizhou · 2021-09-16T05:45:21Z

paddle/fluid/inference/tensorrt/plugin/qkv_to_context_plugin.cu

@@ -233,6 +233,21 @@ __global__ void apply_scale(T *data, T scale, int n) {
 #endif
 }

+inline int round_up(int seq_len, int multiple = 32) {
+  assert(multiple);


请使用paddle规范的错误判断方式，如PADDLE_ENFORCE_NE

请使用paddle规范的错误判断方式，如PADDLE_ENFORCE_NE

好的，我改一下

shangzhizhou

LGTM

chenwhql · 2021-09-17T07:03:14Z

paddle/fluid/operators/fused/multihead_matmul_op.cu

@@ -132,6 +132,24 @@ void TransQKVWithBias(const int batch, const int seq_len, const int head_size,
  }
 }

+inline int round_up(int seq_len, int multiple = 32) {


为什么不使用驼峰式命名，其他地方也一样

为什么不使用驼峰式命名，其他地方也一样

我看这个文件里面很多地方都用下划线的方式，为了风格统一就延续了这种风格

chenwhql · 2021-09-17T07:05:23Z

paddle/fluid/operators/fused/multihead_matmul_op.cu

+  PADDLE_ENFORCE_GT(
+      multiple, 0,
+      platform::errors::InvalidArgument(
+          "multiple should be a positive number，but it's (%d)", multiple));


这个multiple需要标记一下吗？比如The input argument multiple，这个报错句子直接看语法是错的

这个multiple需要标记一下吗？比如The input argument multiple，这个报错句子直接看语法是错的

这个可以修改一下

先合入，@fengxiaoshuai，develop提个PR改一下，或者下一个PR带一下。

chenwhql · 2021-09-17T07:08:14Z

paddle/fluid/inference/tensorrt/plugin/qkv_to_context_plugin.cu

@@ -233,6 +233,24 @@ __global__ void apply_scale(T *data, T scale, int n) {
 #endif
 }

+inline int round_up(int seq_len, int multiple = 32) {


这两处代码是重复的吗？方便复用吗？

考虑过，不过目前就用这两次，放到公共的头文件中发现这个函数和其他函数类型相比有点不伦不类，二者一个是trt,一个是cuda所以目前不太好放，后续如果常用或者有合适的地方会考虑重构一下

chenwhql · 2021-09-17T07:09:53Z

paddle/fluid/inference/tensorrt/plugin/qkv_to_context_plugin.cu

-    const float *input1_data = static_cast<const float *>(inputs[1]);
+    // fit to [batch, head_num, length, length] + [batch, 1, 1, length]
+    framework::Tensor temp_qk_bias_tensor;
+    float *qk_bias = const_cast<float *>(static_cast<const float *>(inputs[1]));


这里const_cast要使用的理由是什么，需要解释下吗，这个输入为什么需要是const void *const *类型

这里const_cast要使用的理由是什么，需要解释下吗，这个输入为什么需要是const void *const *类型
这个是由于基类设置的接口的原因，目前没办法，trt这边plugin都是这么写的，具体也和秋良沟通过

zhiqiu

LGTM for this PR, maybe consider better design to avoid using const_cast

* broadcast qkv_op * use PADDLE_ENFORCE_GT to replace assert

broadcast qkv_op

dc979d0

shangzhizhou reviewed Sep 16, 2021

View reviewed changes

use PADDLE_ENFORCE_GT to replace assert

8c39b99

shangzhizhou approved these changes Sep 16, 2021

View reviewed changes

chenwhql reviewed Sep 17, 2021

View reviewed changes

zhiqiu approved these changes Sep 17, 2021

View reviewed changes

shangzhizhou merged commit cf9eae4 into PaddlePaddle:develop Sep 17, 2021

AnnaTrainingG pushed a commit to AnnaTrainingG/Paddle that referenced this pull request Sep 29, 2021

broadcast qkv_op (PaddlePaddle#35780)

ba94568

* broadcast qkv_op * use PADDLE_ENFORCE_GT to replace assert

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

broadcast qkv_op #35780

broadcast qkv_op #35780

fengxiaoshuai commented Sep 15, 2021

paddle-bot-old bot commented Sep 15, 2021

shangzhizhou Sep 16, 2021

fengxiaoshuai Sep 16, 2021

shangzhizhou left a comment

chenwhql Sep 17, 2021

fengxiaoshuai Sep 17, 2021

chenwhql Sep 17, 2021

fengxiaoshuai Sep 17, 2021

shangzhizhou Sep 17, 2021

chenwhql Sep 17, 2021

fengxiaoshuai Sep 17, 2021

chenwhql Sep 17, 2021

fengxiaoshuai Sep 17, 2021

zhiqiu left a comment

broadcast qkv_op #35780

broadcast qkv_op #35780

Conversation

fengxiaoshuai commented Sep 15, 2021

PR types

PR changes

Describe

paddle-bot-old bot commented Sep 15, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shangzhizhou left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zhiqiu left a comment

Choose a reason for hiding this comment