Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Want MeanStdScaling that applies to multiple columns #56

Open
glennmoy opened this issue Mar 17, 2021 · 3 comments
Open

Want MeanStdScaling that applies to multiple columns #56

glennmoy opened this issue Mar 17, 2021 · 3 comments

Comments

@glennmoy
Copy link
Member

MeanStdScaling computes one set of mean and std params for all the data provided.

There is not a convenient way to compute separate mean and std params for, e.g., a list of columns without looping or creating a separate transform for each.

@nicoleepp
Copy link
Contributor

Maybe a kwarg like :per_col = true but there might be more implication on things under the hood for how this works

@nicoleepp
Copy link
Contributor

Plus one for this, used in GPF on a per slice basis

@molet
Copy link
Member

molet commented Jun 18, 2021

I think computing mean and std per column should be not only an optional feature but the default one. The current implementation is dangerous because people want to compute statistics per feature (per columns) variable and it is extremely rare to compute single statistics over multiple features.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants