-
-
Notifications
You must be signed in to change notification settings - Fork 26k
FEA Implement categorical feature support to IterativeImputer
#31479
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
…e support Our implementation automatically detects categorical columns (based on dtype), uses a RandomForestClassifier to model and predict missing category labels, and then inverse-transforms those predictions back into the original categories. So, if given a certain X data, where X features are numerical and categorical, respectively, and there are values missing (nan) in the categorical one, _iterative.py, through the Random Forest Classifier, will predict those missing values and replace them in the data. Co-authored-by: Fabioprata23 <fabio.prata@tecnico.ulisboa.pt>
IterativeImputer
@betatim Hi. Could you help me out here please? |
The CI tests many different versions and optional dependencies that you might not have locally. That is probably why it doesn't fail locally. The log output for the failures is often helpful for understanding why something breaks. Or you can see what versions of packages are used in the CI job that fails and see if that reproduces the errors locally. |
Fixes #31219
Our implementation automatically detects categorical columns
(based on dtype), uses a RandomForestClassifier to model and predict missing category labels, and then inverse-transforms those predictions back into the original categories. So, if given a certain X data, where X features are numerical and categorical, respectively, and there are values missing (nan) in the categorical one, _iterative.py, through the Random Forest Classifier, will predict those missing values and replace them in the data.
Co-authored-by: Fabioprata23 fabio.prata@tecnico.ulisboa.pt
Reference Issues/PRs
What does this implement/fix? Explain your changes.
Any other comments?