Skip to content

ENH Extends outlier encoding scheme to HDBSCAN.dbscan_clustering #24698

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 10 commits into from
Feb 9, 2023

Conversation

Micky774
Copy link
Contributor

Reference Issues/PRs

Towards #24686

What does this implement/fix? Explain your changes.

Updates the HDBSCAN.dbscan_clustering method to conform to new outlier encoding strategy wherein noise, np.inf and np.nan samples are labelled with distinct negative labels. Updates documentation to partially clarify this encoding as well.

Any other comments?

Copy link
Member

@thomasjpfan thomasjpfan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the PR!

Copy link
Member

@thomasjpfan thomasjpfan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Otherwise LGTM

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
@Micky774 Micky774 added the Waiting for Second Reviewer First reviewer is done, need a second one! label Nov 22, 2022
@Micky774
Copy link
Contributor Author

Micky774 commented Feb 4, 2023

@jjerphan @glemaitre you may be interested in reviewing this

Copy link
Member

@jjerphan jjerphan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This LGTM. Thank you, @Micky774.

I just have a few comments.

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
@jjerphan
Copy link
Member

jjerphan commented Feb 9, 2023

Since @thomasjpfan commented recently, I let it merge if this now LGTH.

@thomasjpfan thomasjpfan merged commit 78b66d5 into scikit-learn:hdbscan Feb 9, 2023
@Micky774 Micky774 deleted the HDBSCAN/dbscan_outliers branch February 9, 2023 21:18
Micky774 added a commit to Micky774/scikit-learn that referenced this pull request May 16, 2023
…cikit-learn#24698)

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants