Remove "filtering" from the description of Transformer API #27900

Merged
merged 1 commit into from
Dec 4, 2023

Contributor

@jmmcd jmmcd commented Dec 4, 2023

"Filtering" usually means removing rows (e.g. in SQL), but I believe Transformers should not do that. This causes confusion, e.g. here:

https://stackoverflow.com/questions/68307502/sklearn-pipelines-and-filtering-out-records

Some relevant links:

"transformers that change the number of samples are not currently supported": #3855
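To illustrate the point about rows, here is a minimal sketch, assuming scikit-learn and NumPy are installed. The toy data and the `drop_negative_rows` helper are hypothetical and not part of scikit-learn. They only show why removing rows in a transform step is awkward: `transform` receives and returns `X` alone, so `y` would no longer line up with it.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Toy data (hypothetical): 4 samples, 2 features, one label per sample.
X = np.array([[1.0, 10.0], [2.0, 20.0], [-3.0, 30.0], [4.0, 40.0]])
y = np.array([0, 1, 0, 1])

# A regular transformer changes values but keeps one output row per input row.
X_scaled = StandardScaler().fit_transform(X)
print(X_scaled.shape)  # (4, 2)


def drop_negative_rows(X):
    """Hypothetical row "filter"; NOT a scikit-learn API."""
    return X[(X >= 0).all(axis=1)]


# Dropping rows from X alone leaves y with the original length.
X_filtered = drop_negative_rows(X)
print(X_filtered.shape, y.shape)  # (3, 2) (4,)
```

After the hypothetical filter, `X_filtered` has 3 rows while `y` still has 4, which is the kind of mismatch a downstream estimator cannot handle.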

github-actions bot commented Dec 4, 2023

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 197d898. Link to the linter CI: here

Member

@adrinjalali adrinjalali left a comment

LGTM. Wondering what @glemaitre thinks since imbalanced-learn does this.

Member

@glemaitre glemaitre left a comment

One of the reasons why we call those Samplers and why they have another API :)
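As a rough illustration of what "another API" can look like, the sketch below defines a hypothetical `DropNegativeRowsSampler` class using only NumPy. It is loosely modelled on the `fit_resample(X, y)` convention of imbalanced-learn samplers, but it is not imbalanced-learn code and the row-dropping rule is an assumption made up for the example. Because the method takes and returns both `X` and `y`, rows can be removed from both together.

```python
import numpy as np


class DropNegativeRowsSampler:
    """Hypothetical sampler-style object (not an imbalanced-learn class)."""

    def fit_resample(self, X, y):
        X = np.asarray(X)
        y = np.asarray(y)
        # Boolean mask with one entry per sample: keep rows with no negative value.
        keep = (X >= 0).all(axis=1)
        # Apply the same mask to X and y so they stay aligned.
        return X[keep], y[keep]


# Toy data (hypothetical): the third sample contains a negative value.
X = np.array([[1.0, 10.0], [2.0, 20.0], [-3.0, 30.0], [4.0, 40.0]])
y = np.array([0, 1, 0, 1])

X_res, y_res = DropNegativeRowsSampler().fit_resample(X, y)
print(X_res.shape, y_res)  # (3, 2) [0 1 1]
```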

@glemaitre glemaitre merged commit 34c46d4 into scikit-learn:main Dec 4, 2023
@glemaitre
Member

Uhm, actually, filtering could correspond to feature selection, where we remove columns.
@jmmcd Would you mind opening a new PR that adds this information, i.e. that filtering means removing columns?
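For reference, filtering in the column sense might look like the following sketch, assuming scikit-learn and NumPy are installed. It uses `VarianceThreshold` as one example of a feature selector; the toy data is hypothetical. The number of rows is unchanged and only a column is removed.

```python
import numpy as np
from sklearn.feature_selection import VarianceThreshold

# Toy data (hypothetical): the middle column is constant.
X = np.array([[1.0, 5.0, 10.0], [2.0, 5.0, 20.0], [3.0, 5.0, 30.0]])

# With the default threshold of 0.0, zero-variance (constant) features are dropped.
selector = VarianceThreshold()
X_selected = selector.fit_transform(X)

print(X.shape, X_selected.shape)  # (3, 3) (3, 2)
print(selector.get_support())     # [ True False  True]
```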
