Skip to content

[WIP] Fixes #8136: Added support for new labels #8164

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 9 commits into from
Closed

[WIP] Fixes #8136: Added support for new labels #8164

wants to merge 9 commits into from

Conversation

tzano
Copy link
Contributor

@tzano tzano commented Jan 6, 2017

This PR is for the feature discussed in #8136.

@tzano tzano changed the title [WIP] LabelEncoder: Adding flexibility to unknown/new labels [WIP] Fixes #8136: Added support for new labels Jan 7, 2017
@tzano tzano closed this Jan 7, 2017
@pchalasani
Copy link

this is a great functionality, that I've been looking for. Any idea why it wasn't merged?

An important use-case is when training data consists of multiple batches, and certain categorical features have a very high cardinality (in the tens of thousands for example). In one batch, we would not see all possible values of this variable, so we'd want to use an "updating" version of labelencoder that would incrementally update across batches.

@jnothman
Copy link
Member

LabelEncoder is not intended for categorical features. It is intended for classifiable targets. But we have something in development that is. I'm not yet sure if partial_fit will be in the first release. Have your say at #9151

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants