DOC Linked examples for clustering algorithms in their docstrings (#26927) #30127

SuccessMoses · 2024-10-21T21:12:20Z

Reference Issues/PRs

Adds links to examples/cluster mentioned in #26927

What does this implement/fix? Explain your changes.

Adds links to auto-generated examples for classes AffinityPropagation, SpectralClustering, DBSCAN, HDBSCAN and OPTICS. The example shows comparison between different clustering methods.

Any other comments?

Examples linked:

examples/cluster
plot_cluster_comparison.py

github-actions · 2024-10-21T21:13:42Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 2a34caa. Link to the linter CI: here}

marenwestermann

Hi @SuccessMoses! Thank you for your PR.

Could you move the references to the end of the respective "Examples" sections in the docstrings of the estimators? (We haven't been consistent with placing the references in the past and are trying to be more consistent now. 🙂 )

Could you also reference this example in the same way in the docstrings of
MeanShift, MiniBatchKMeans, AgglomerativeClustering, AgglomerativeClustering, Birch and GaussianMixture?

SuccessMoses · 2024-10-29T15:35:05Z

I am working on it

marenwestermann · 2024-10-29T15:42:44Z

Could you also make a new rubric "Examples" at the end of the section here and link the example there (compare other PRs if needed).

SuccessMoses · 2024-10-29T22:00:25Z

@marenwestermann

marenwestermann

Just a few small comments. Otherwise it looks good.

marenwestermann · 2024-10-30T14:27:16Z

doc/modules/clustering.rst

+.. rubric:: Examples
+
+* :ref:`sphx_glr_auto_examples_cluster_plot_cluster_comparison.py`: Shows the 
+  characteristics of different clustering algorithms of 2D datasets.


Small nitpick: could you replace "of 2D datasets" with "on 2D datasets"?

sklearn/cluster/_affinity_propagation.py

marenwestermann · 2024-10-30T14:43:33Z

sklearn/cluster/_birch.py

@@ -483,6 +483,9 @@ class Birch(
    Birch(n_clusters=None)
    >>> brc.predict(X)
    array([0, 0, 0, 1, 1, 1])
+
+    For a comparison of BIRCH clustering algorithm with other clustering algorithms, see


small nitpick: could you write "of the BIRCH .."

sklearn/cluster/_dbscan.py

marenwestermann · 2024-11-03T10:15:32Z

doc/modules/clustering.rst

@@ -140,6 +140,11 @@ model with equal covariance per component.
 :term:`inductive` clustering methods) are not designed to be applied to new,
 unseen data.

+.. rubric:: Examples


@adrinjalali are you ok with having the example linked here? As noted before it's not obvious that it's possible to click on the images and access examples that way. Also, we haven't been consistent with linking examples in the user guide. There are quite a few places in the user guide where a link to an example is listed under the rubric "Examples" even though you can access the respective example through clicking on the image in that section (such as here: https://github.com/scikit-learn/scikit-learn/blob/main/doc/modules/clustering.rst?plain=1#L306)

That's a fair question. I'm not sure what I think about the question in general, but in this particular case, I think this is kinda repeating the same information right after where it's mentioned, and hence unnecessary.

However, your point on the images not being clearly clickable is valid, and we can probably improve the image caption to mention they can be clicked?

I'm not sure, maybe @Charlie-XIAO or @glemaitre have a better idea of how to improve this.

In this case I think the link is redundant, there's a nice description with a hyperlink above here.

However, it's a good point that it's not clear the the user can click on them. I wonder if @Charlie-XIAO would have an idea of how we could improve that UX.

Hi @SuccessMoses! Could you remove this link including the "Examples" rubric you made? We're then ready to merge. I'll open an issue for discussing how to best improve the UX.

adrinjalali

Otherwise LGTM.

adrinjalali · 2024-11-07T16:18:35Z

doc/modules/clustering.rst

@@ -140,6 +140,11 @@ model with equal covariance per component.
 :term:`inductive` clustering methods) are not designed to be applied to new,
 unseen data.

+.. rubric:: Examples


That's a fair question. I'm not sure what I think about the question in general, but in this particular case, I think this is kinda repeating the same information right after where it's mentioned, and hence unnecessary.

However, your point on the images not being clearly clickable is valid, and we can probably improve the image caption to mention they can be clicked?

I'm not sure, maybe @Charlie-XIAO or @glemaitre have a better idea of how to improve this.

adrinjalali · 2024-11-15T10:04:20Z

doc/modules/clustering.rst

@@ -140,6 +140,11 @@ model with equal covariance per component.
 :term:`inductive` clustering methods) are not designed to be applied to new,
 unseen data.

+.. rubric:: Examples


In this case I think the link is redundant, there's a nice description with a hyperlink above here.

However, it's a good point that it's not clear the the user can click on them. I wonder if @Charlie-XIAO would have an idea of how we could improve that UX.

StefanieSenger · 2025-01-10T13:28:11Z

Hi @SuccessMoses, I have just seen that meanwhile a few PRs have been merged that are touching on what you are doing here:

plot_affinity_propagation.py in DOC added links for plot_affinity_propagation.py #29759
plot_dbscan.py in DOC: added link plot_dbscan.py #29949

As you go forward with this PR, please consider to align this.

marenwestermann · 2025-01-10T14:49:12Z

The best way to align with the other PRs is to pull the changes from upstream main into your feature branch like so:

git fetch upstream
git merge upstream/main

marenwestermann · 2025-02-16T12:47:53Z

I finished this PR because it was blocking another PR from getting merged. Thank you for your contribution @SuccessMoses!

SuccessMoses added 5 commits October 21, 2024 13:36

DOC Linked examples for Affinity Propagation in thier docs

00322d7

DOC Linked examples for Spectral Clustering in thier docs

bb1b72b

DOC Linked examples for DBSCAN in thier docs

ce19dc7

DOC Linked examples for HDBSCAN in docstring

9dca598

DOC Linked examples for OPTICS in docstrings

3f3cfed

github-actions bot added module:cluster Documentation labels Oct 21, 2024

SuccessMoses added 2 commits October 22, 2024 14:50

Reformattted with black

34ec6aa

Merge branch 'main' into linking-clustering-ex

27a5aea

marenwestermann self-requested a review October 23, 2024 07:58

marenwestermann reviewed Oct 29, 2024

View reviewed changes

SuccessMoses added 4 commits October 29, 2024 20:28

Merge branch 'scikit-learn:main' into linking-clustering-ex

3b6f2ad

move examples below example section

e7a7b41

add rubric "examples"

c4a6d66

fix issues with ruff

07eb7f0

marenwestermann reviewed Oct 30, 2024

View reviewed changes

SuccessMoses added 2 commits October 31, 2024 10:57

fix stuff here and there

afcf7b7

fix lint

56a7d6d

SuccessMoses requested a review from marenwestermann October 31, 2024 16:39

virchan mentioned this pull request Oct 31, 2024

DOC add link plot_inductive_clustering #30182

Merged

marenwestermann approved these changes Nov 3, 2024

View reviewed changes

marenwestermann reviewed Nov 3, 2024

View reviewed changes

adrinjalali reviewed Nov 15, 2024

View reviewed changes

marenwestermann mentioned this pull request Jan 6, 2025

Improve user experience in the user guide - make it clear to users that images are clickable #30596

Open

StefanieSenger mentioned this pull request Jan 10, 2025

Add links to examples from the docstrings and user guide #30621

Closed

marenwestermann added 2 commits February 16, 2025 12:43

Merge remote-tracking branch 'upstream/main' into linking-clustering-ex

cfa4682

remove examples rubric

2a34caa

marenwestermann merged commit 7e861bc into scikit-learn:main Feb 16, 2025
33 checks passed

Uh oh!

DOC Linked examples for clustering algorithms in their docstrings (#26927) #30127

DOC Linked examples for clustering algorithms in their docstrings (#26927) #30127

Uh oh!

Conversation

SuccessMoses commented Oct 21, 2024

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

github-actions bot commented Oct 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

marenwestermann left a comment

Choose a reason for hiding this comment

Uh oh!

SuccessMoses commented Oct 29, 2024

Uh oh!

marenwestermann commented Oct 29, 2024

Uh oh!

SuccessMoses commented Oct 29, 2024

Uh oh!

marenwestermann left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

StefanieSenger commented Jan 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marenwestermann commented Jan 10, 2025

Uh oh!

Uh oh!

marenwestermann commented Feb 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Oct 21, 2024 •

edited

Loading

StefanieSenger commented Jan 10, 2025 •

edited

Loading

marenwestermann commented Feb 16, 2025 •

edited

Loading