Add New OP: gumbel_softmax #35506
Conversation
Thanks for your contribution!
    name='x_int32', shape=[2, 3], dtype='int32')
paddle.nn.functional.gumbel_softmax(x_int32)

self.assertRaises(TypeError, test_dtype)
The unit test here is missing:
if __name__ == '__main__':
    unittest.main()
done
Please add a screenshot of the English documentation preview to the PR description and link the corresponding Chinese documentation PR; see #33278 for reference.
}

 protected:
  framework::OpKernelType GetExpectedKernelType(
GetExpectedKernelType does not need to be written here; with only one input X, the kernel type is determined from X's characteristics by default.
Done.
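For reference, a hedged sketch of the kind of override that can be dropped: with a single input X, the default OperatorWithKernel behavior already resolves the kernel data type from X, so an explicit GetExpectedKernelType along the following lines adds nothing (the exact body in the PR may differ).

```cpp
// Sketch only: an explicit override that is redundant for a single-input op.
framework::OpKernelType GetExpectedKernelType(
    const framework::ExecutionContext& ctx) const override {
  // Derive the kernel data type from input "X", which is effectively what the
  // default implementation does when there is only one input.
  return framework::OpKernelType(
      OperatorWithKernel::IndicateVarDataType(ctx, "X"), ctx.GetPlace());
}
```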
"hard", | ||
"(bool, default false) " | ||
"if True, the returned samples will be discretized as one-hot vectors, " | ||
"but will be differentiated as if it is the soft sample in autograd") |
There should be a '.' after autograd, shouldn't there?
Done.
void InferShape(framework::InferShapeContext* ctx) const override {
  OP_INOUT_CHECK(ctx->HasInput("Out"), "Input", "Out", "gumbel_softmax_grad");
  OP_INOUT_CHECK(ctx->HasInput(framework::GradVarName("Out")), "Input",
                 "Out@grad", "gumbel_softmax_grad");
Out@grad -> Out@GRAD
Done.
T min_, max_;
unsigned int seed_;
unsigned int offset_ = 0;
__host__ __device__ UniformGenerator(T min, T max, unsigned int seed)
__host__ __device__ -> HOSTDEVICE; there is a global macro definition for this.
Done.
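A minimal sketch of the suggested change, assuming the global macro the reviewer refers to is HOSTDEVICE from paddle/fluid/platform/hostdevice.h:

```cpp
#include "paddle/fluid/platform/hostdevice.h"  // defines HOSTDEVICE (__host__ __device__ under CUDA)

template <typename T>
struct UniformGenerator {
  T min_, max_;
  unsigned int seed_;
  unsigned int offset_ = 0;

  // HOSTDEVICE replaces the hand-written __host__ __device__ qualifiers.
  HOSTDEVICE UniformGenerator(T min, T max, unsigned int seed)
      : min_(min), max_(max), seed_(seed) {}
};
```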
#include "paddle/fluid/framework/op_registry.h" | ||
#include "paddle/fluid/framework/operator.h" | ||
#include "paddle/fluid/memory/memcpy.h" | ||
#include "paddle/fluid/operators/gumbel_softmax_op.h" |
Done.
int axis_dim = dX->dims()[axis];
// allocate memory on device.
dX->mutable_data<T>(context.GetPlace());
if (dX->numel() == 0) {
What is the scenario for returning directly here?
The logic here is the same as softmax, so boundary handling for dX was added following the softmax operator's approach.
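In other words, the guard mirrors softmax_grad: a zero-element gradient tensor (an input with a zero-sized dimension) has nothing to compute, so the kernel allocates the empty buffer and returns early. A commented sketch of that path:

```cpp
// dX->numel() == 0 means some dimension is zero-sized (e.g. shape [0, 10]);
// allocate the (empty) output and bail out, as the softmax_grad kernel does.
dX->mutable_data<T>(context.GetPlace());
if (dX->numel() == 0) {
  return;  // nothing to compute for an empty gradient
}
```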
const int size_to_axis = SizeToAxis(axis, dX->dims());
const int size_from_axis = SizeFromAxis(axis, dX->dims());
Tensor dX_2d, Out_2d, dOut_2d;
dX_2d.ShareDataWith(*dX).Resize({size_to_axis, size_from_axis});
Same as above: why must ShareDataWith be used?
Here Out's data needs to be reshaped to 2-D to compute the result; using ShareDataWith reduces the number of copies.
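To make the trade-off concrete, a hedged sketch (names follow the quoted snippet; the copy-based alternative shown for contrast assumes framework::TensorCopy is the closest equivalent):

```cpp
// Zero-copy view: dX_2d shares dX's buffer and only reinterprets the shape as 2-D.
Tensor dX_2d;
dX_2d.ShareDataWith(*dX).Resize({size_to_axis, size_from_axis});

// Copy-based alternative (extra allocation plus a memcpy), which ShareDataWith avoids:
// Tensor dX_2d_copy;
// framework::TensorCopy(*dX, context.GetPlace(), &dX_2d_copy);
// dX_2d_copy.Resize({size_to_axis, size_from_axis});
```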
self.check_output_customized(self.verify_output)

def test_check_grad(self):
    self.check_grad(["X"], "Out", max_relative_error=0.01)
What is the basis for the 0.01 tolerance here? Is there a special reason? The default is 0.005.
Since the gradient computation logic is the same as the softmax operator, the 0.01 tolerance here is taken from the tolerance used by the softmax operator.
self.check_grad(["X"], "Out", max_relative_error=0.01)

class TestGumbelSoftmaxOpGrad(unittest.TestCase):
Why is this separate unit test needed? In theory the backward pass is already tested above?
This tests whether the backward gradients are consistent with hard=True and hard=False, which is slightly different from the backward coverage above.
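A minimal dygraph sketch of that idea (not the PR's exact test; it assumes a Paddle version where Tensor.grad is itself a Tensor and uses a plain sum loss):

```python
import numpy as np
import paddle
import paddle.nn.functional as F

x_np = np.random.uniform(-1.0, 1.0, [2, 3, 4, 5]).astype("float32")
x_soft = paddle.to_tensor(x_np, stop_gradient=False)
x_hard = paddle.to_tensor(x_np, stop_gradient=False)

# Backward through the soft path and through the straight-through (hard) path.
F.gumbel_softmax(x_soft, hard=False).sum().backward()
F.gumbel_softmax(x_hard, hard=True).sum().backward()

# hard=True is differentiated as the soft sample, so the gradients should match.
np.testing.assert_allclose(x_soft.grad.numpy(), x_hard.grad.numpy())
```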
self.x_shape = [2, 3, 4, 5]
self.x = np.random.uniform(-1., 1., self.x_shape).astype(np.float32)
self.count_expected = 24
self.place = paddle.CUDAPlace(0) \
With CUDA available, both CPU and CUDA should be tested? And without CUDA there should still be a CPU test.
The code below handles that: when CUDA is not available, the test runs on CPU.
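The pattern referred to is the usual place-selection fallback, roughly:

```python
import paddle

# Run the test on the GPU when Paddle is built with CUDA, otherwise on the CPU.
place = paddle.CUDAPlace(0) \
    if paddle.is_compiled_with_cuda() else paddle.CPUPlace()
```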
LGTM
LGTM on ShareDataWith
LGTM for API change
LGTM for op benchmark ci
LGTM for ShareDataWith
* Add New Op: gumbel_softmax
* Add New Op: gumbel_softmax
* Add New Op: gumbel_softmax (amend)
* add __main__ function in unit test
* fix bugs when test in windows ci
* update en docs
* delete reletive error in unit test
* delete relative error in unit test
* set hard=True in unit test
PR types
New features
PR changes
OPs
Describe
Add a new op named Gumbel Softmax for Paddle.
This op samples from the Gumbel-Softmax distribution and optionally discretizes the samples.
The performance of the Gumbel softmax operator was verified locally: the input is a random tensor (shape = [200, 30]) and the test is repeated 100 times. Comparing Paddle with PyTorch, the average runtime of Paddle is 0.0114 seconds and the average runtime of PyTorch is 0.0123 seconds, so Paddle's Gumbel softmax operator is about 7% faster than PyTorch's.
API:
paddle.nn.functional.gumbel_softmax()
Usage example:
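A minimal usage sketch (the temperature value is illustrative; the arguments follow the API added in this PR):

```python
import paddle
import paddle.nn.functional as F

paddle.seed(2021)
logits = paddle.randn([4, 6])

# Soft samples from the Gumbel-Softmax distribution.
y_soft = F.gumbel_softmax(logits, temperature=0.5)

# Straight-through samples: one-hot in the forward pass, differentiated as
# the soft samples in the backward pass.
y_hard = F.gumbel_softmax(logits, temperature=0.5, hard=True)

print(y_soft.shape, y_hard.shape)  # [4, 6] [4, 6]
```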
Doc
Cn Doc PR: PaddlePaddle/docs/3857