One hot encoding transform #7

glennmoy · 2021-02-01T15:11:41Z

A one-hot-encoding transform for categorical variables

MWE

x = ["foo", "bar", "baz"]

ohe = OneHotEncoding()

Transform.apply(x, ohe)

# output
3×3 Array{Int64,2}:
 1  0  0
 0  1  0
 0  0  1

rofinn · 2021-02-22T21:20:37Z

FWIW, if we're gonna change the type maybe we should use Bool for space efficiency?

julia> sizeof(true)
1

julia> sizeof(1)
8

That being said, maybe we could parameterize the type such that you state what you want returned? That way if I want to construct a pipeline where I know I'm gonna be merging the output from this type into an flattened array of floats then I'll construct it accordingly?

glennmoy added the new transform New transform request label Feb 1, 2021

nicoleepp self-assigned this Feb 12, 2021

nicoleepp mentioned this issue Feb 17, 2021

Add OneHotEncoding Transform #19

Merged

nicoleepp closed this as completed in #19 Feb 17, 2021

nicoleepp reopened this Feb 24, 2021

nicoleepp mentioned this issue Feb 24, 2021

Use Bool by default for OneHotEncoding Transform #31

Merged

nicoleepp closed this as completed in #31 Feb 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

One hot encoding transform #7

One hot encoding transform #7

glennmoy commented Feb 1, 2021

rofinn commented Feb 22, 2021

One hot encoding transform #7

One hot encoding transform #7

Comments

glennmoy commented Feb 1, 2021

rofinn commented Feb 22, 2021