MIN_CAT_SUPPORT in HGBT #19008
Labels
help wanted
Moderate
Anything that requires some knowledge of conventions and best practices
module:ensemble
Needs Benchmarks
A tag for the issues and PRs which require some benchmarks
Background
One leftover of categorical support in
HistGradientBoostingRegressor
andHistGradientBoostingClassifier
, merged in #18394, is theMIN_CAT_SUPPORT
in splitting.pyx, see #18394 (comment). It acts as a smoothing parameter to categorical variables and is currently fixed to the value10
.Proposal
I propose to investigate the impact of
MIN_CAT_SUPPORT
. In case of a larger impact it would be desirable to pass this as an option to the user.Additional context
AFAIK, very little reproducible results are known to date. For references, see the comment linked above.
The text was updated successfully, but these errors were encountered: