Skip to content

MIN_CAT_SUPPORT in HGBT #19008

Open
Open
@lorentzenchr

Description

@lorentzenchr

Background

One leftover of categorical support in HistGradientBoostingRegressor and HistGradientBoostingClassifier, merged in #18394, is the MIN_CAT_SUPPORT in splitting.pyx, see #18394 (comment). It acts as a smoothing parameter to categorical variables and is currently fixed to the value 10.

Proposal

I propose to investigate the impact of MIN_CAT_SUPPORT. In case of a larger impact it would be desirable to pass this as an option to the user.

Additional context

AFAIK, very little reproducible results are known to date. For references, see the comment linked above.

Metadata

Metadata

Assignees

No one assigned

    Labels

    ModerateAnything that requires some knowledge of conventions and best practicesNeeds BenchmarksA tag for the issues and PRs which require some benchmarkshelp wantedmodule:ensemble

    Type

    No type

    Projects

    Status

    Discussion

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions