[MXNET-524] Broadcast like operator #11820

ifeherva · 2018-07-19T13:32:59Z

Description

Operator, which can output a broadcasted array for the given target. This allows easier broadcasting and hybridization.

Added appropriate shape inference

lanking520 · 2018-07-19T22:17:04Z

Thanks for your contribution! @zhanghang1989, @eric-haibin-lin for review

taliesinb · 2018-07-19T23:33:46Z

There is a generalization that would be extremely useful for this operator to have. The generalization is very similar to one that was discussed at https://discuss.mxnet.io/t/reshaping-broadcasting-without-hardcoding-target-dimensions/851/6 (you can skip to the last 4 comments, the thread contains an irrelevant proposal although the motivation is relevant).

In short, the generalization would allow only specific dimensions to be copied from the 'other' tensor. For example:

input.shape = (1, 2, 1, 3)
other.shape = (5, 6, 7, 8)
output = broadcast_like(input, other, input_axes:(0,2), other_axes:(1,3))
output.shape = (6, 2, 8, 3)

In other words, what's happening here is that the you can pick exactly which axes of the other tensor you want to use to "fill in" axes of the input tensor. This is how broadcast_axes works, except instead of providing the values via a size parameter, you are providing them from specific axes in the other tensor.

The reason this is so valuable is that it is common to have another tensor that contains the dimension you want to broadcast amongst a set of irrelevant dimensions. There is simply no other way of "extracting" the relevant dimension from elsewhere in the net, so currently you have to hardcode that dimension into a parameter list, which forces expensive workarounds like bucketing where otherwise cheap reshaping would work to make a net that is compatible with multiple sequence lengths, for example.

The current behavior of broadcast_like in the PR would be consistent with this generalization if the default value of input_axis is the empty tuple, which means "all axes".

zhanghang1989

LGTM

szha · 2018-07-20T01:57:31Z

docs/api/python/symbol/symbol.md

@@ -207,6 +207,7 @@ Composite multiple symbols into a new one by an operator.

    Symbol.broadcast_to
    Symbol.broadcast_axes
+    Symbol.broadcast_like


also need to add an entry at https://github.com/ifeherva/incubator-mxnet/blob/00eeeca61c9f052f3e85d4febe43130ed5669e61/docs/api/python/symbol/symbol.md#expanding-elements-1. Same goes with ndarray.

ifeherva · 2018-07-20T04:51:41Z

@taliesinb Very interesting proposal indeed. The implementation is quite straighforward and I am happy to do it if this is something that is planned to happen. Is there a JIRA ticket open for this? I propose to have it in a separate PR.

taliesinb · 2018-07-20T15:09:28Z

@ifeherva if you're enthusiastic about this proposal that's great! yes, another PR might make sense. i'm not aware of a JIRA ticket, but the design of reshape_like is very similar to this proposal and that design was proposed by @piiswrong with the goal of solving the same kind of problem (my colleague @sbodenstein is going to submit a PR for that reshape_like extension in the next few days).

ifeherva · 2018-07-20T16:02:37Z

@taliesinb Great! Once that one is merged I can adapt broadcast_like as well.

* Registered the broadcast_like operator with GPU and CPU Added appropriate shape inference * Added python interface to ndarray and symbol * Added python api documentation * Fixed backward operation * Added unit tests * Fixed linting issues * Added missing api doc

ifeherva added 5 commits July 18, 2018 11:35

Registered the broadcast_like operator with GPU and CPU

e9a8c06

Added appropriate shape inference

Added python interface to ndarray and symbol

57f7f2d

Added python api documentation

4cddf8a

Fixed backward operation

cc53496

Added unit tests

2c46e34

ifeherva requested review from anirudh2290 and szha as code owners July 19, 2018 13:33

Fixed linting issues

00eeeca

ifeherva mentioned this pull request Jul 19, 2018

[Feature Request] broadcast_like operator #11056

Closed

zhanghang1989 approved these changes Jul 19, 2018

View reviewed changes

szha reviewed Jul 20, 2018

View reviewed changes

Added missing api doc

28b1bf4

szha merged commit b16f875 into apache:master Jul 20, 2018

eric-haibin-lin mentioned this pull request Jul 20, 2018

[WIP] Add broadcast_like backend tensor operator #11443

Closed

6 tasks

sbodenstein mentioned this pull request Jul 24, 2018

Proposal: Generalize broadcast_like #11871

Open

ifeherva deleted the broadcast_like_symbol branch February 10, 2019 04:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MXNET-524] Broadcast like operator #11820

[MXNET-524] Broadcast like operator #11820

ifeherva commented Jul 19, 2018

lanking520 commented Jul 19, 2018

taliesinb commented Jul 19, 2018

zhanghang1989 left a comment

szha Jul 20, 2018

ifeherva Jul 20, 2018

ifeherva commented Jul 20, 2018

taliesinb commented Jul 20, 2018

ifeherva commented Jul 20, 2018

[MXNET-524] Broadcast like operator #11820

[MXNET-524] Broadcast like operator #11820

Conversation

ifeherva commented Jul 19, 2018

Description

lanking520 commented Jul 19, 2018

taliesinb commented Jul 19, 2018

zhanghang1989 left a comment

Choose a reason for hiding this comment

szha Jul 20, 2018

Choose a reason for hiding this comment

ifeherva Jul 20, 2018

Choose a reason for hiding this comment

ifeherva commented Jul 20, 2018

taliesinb commented Jul 20, 2018

ifeherva commented Jul 20, 2018