Skip to content

Improve near constant feature detection in scalers #19898

@jeremiedbb

Description

@jeremiedbb

To avoid scaling near constant features by a very large value, a heuristic was introduced in #19527. However it was a bit too absolute (e.g. #19726). It was fixed for StandardScaler, but there might be similar issues for the other scalers.

It should be tested case by case, depending on how the scale is defined. To see if the heuristic is appropriate or not, it should be tested against a very wide range of scaling of the data. If it breaks similarly to #19726, we need to find an appropriate bound on the scale to declare the feature constant (as it was done in #19788 for StandardScaler)

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions