ENH allow SelectKBest to select all features in a parameter search #1770
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Currently it is difficult to perform a grid-search over k for
SelectKBest
feature selection such that one parameter option is to ignoreSelectKBest
altogether (i.e.k
== infty). Ifk > X.shape[1]
, it throws an error, andX.shape
is not always known when the parameter grid is constructed. This patch allows one to use a parameter grid such as{'k': [5, 10, 20, 40, 80, SelectKBest.SELECT_ALL]}
to include bypassing the selector as an option.An alternative to this is to harness #1769 (together with
param_grid
being a list of dicts) to simply switch off theSelectKBest
component. Perhaps neither solution is intuitive!