-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
MinMaxScaler output datatype #25845
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MinMaxScaler output datatype #25845
Conversation
modifications: __init__ partial_fit transform minmax_scale _parameter_constraints docstring fix tests
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for picking up this issue and working on it. I've left a few comments
@glemaitre @betatim can you help me with the review? I just corrected the merge conflict in the changelog. |
We discuss this PR IRL with @jeremiedbb and @ogrisel to know if we really wanted this feature. In the end, the feature will be useful if we succeed to avoid some memory copy. Otherwise, we could instead have a So here, the fix is not the right one since we actually do a copy when calling Where it becomes more cumbersome is handling the sparse case since the API offered by This PR could be useful to actually evaluate how much cumbersome and additional maintenance effort is required if we want to go this path. |
Closing this PR as we are not sure we want this feature (and would need different implementation) and @ducatte indicated they were not interested in continuing on this but please re-open if desired. I've copied @glemaitre 's useful repy to the issue #18443 |
Reference Issues/PRs
Fixes #18443
What does this implement/fix? Explain your changes.
Modifies MinMaxScaler to give users the option to choose the output_dtype
Any other comments?
We may want to add this feature to other scalers
Still need to modify the changelog. Version 1.3, Enhancement?