Skip to content

DOC Rework plot_document_clustering.py example #23528

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 57 commits into from
Jun 20, 2022
Merged
Changes from all commits
Commits
Show all changes
57 commits
Select commit Hold shift + click to select a range
595979d
First step to improve notebook style
May 23, 2022
e73ced2
Checkpoint
May 23, 2022
3b01b5d
Checkpoint
May 25, 2022
8d7e3e4
Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…
Jun 1, 2022
812b3b3
Add plots and improve narrative
Jun 2, 2022
7ae069a
Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…
Jun 2, 2022
f3ddf3a
Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…
Jun 3, 2022
47d5ccb
Merge branch 'main' into doc_clustering
ogrisel Jun 6, 2022
cbca1a1
Empty commit to trigger CI again
ogrisel Jun 6, 2022
35dc7a9
Rename benchmark function
Jun 7, 2022
39c52b2
Merge branch 'doc_clustering' of github.com:ArturoAmorQ/scikit-learn …
Jun 7, 2022
be09f9e
Workaround to avoid kmeans++ not updating initial centroids
Jun 7, 2022
7a73a69
Improve text
Jun 7, 2022
126db98
Improve narrative
Jun 7, 2022
8159abc
Apply suggestions from code review
ArturoAmorQ Jun 8, 2022
4470450
Iter on suggestions from code review
Jun 8, 2022
b0928ef
Fix repited line
Jun 8, 2022
a444286
Modify workaround to avoid silhouette message error
Jun 8, 2022
ce9b7e3
Get rid of cousin
Jun 8, 2022
172db17
Apply suggestions from code review
ArturoAmorQ Jun 8, 2022
03de379
Fix conflicts
Jun 8, 2022
5755395
Apply suggestions from code review
Jun 8, 2022
1babd21
Change sphinx references
Jun 8, 2022
49e38f1
Fix format
Jun 8, 2022
a67a41d
Change notation for instances
Jun 8, 2022
7c7b3fa
Use f-string
Jun 8, 2022
fd92d8b
Simplify plot
ArturoAmorQ Jun 8, 2022
df5f6ab
Merge branch 'doc_clustering' of github.com:ArturoAmorQ/scikit-learn …
Jun 8, 2022
d3f3087
Iter
Jun 8, 2022
ec29646
Update figure size
Jun 8, 2022
62b46ff
Fix reference to user guide
Jun 9, 2022
784515b
Add narrative to KMeans
Jun 9, 2022
8a9407e
Add narrative to explain cluster centers
Jun 9, 2022
3d6f21d
Apply suggestions from code review
ArturoAmorQ Jun 9, 2022
76fc085
Apply suggestions from code review
Jun 9, 2022
3a8e402
Format
Jun 9, 2022
a066e8e
Apply suggestions from code review
Jun 9, 2022
013d6a9
refer to the exact labels on the graph
Jun 9, 2022
7bebb85
Improve notation
Jun 9, 2022
a9accc8
Remove demo on un-scaled hashing
Jun 9, 2022
e93af5c
Apply suggestions from code review
ArturoAmorQ Jun 9, 2022
38b9fe2
Iter
Jun 9, 2022
0f52bbd
Explain use of supervised labels to evaluate unsupervised cluster
Jun 10, 2022
7cc26e3
Improve variables notation
Jun 10, 2022
1ecc76f
Improve output from custom function
Jun 10, 2022
59232d0
Rework notebook to include error bars
Jun 10, 2022
a6b90cc
Apply suggestions from code review
ArturoAmorQ Jun 13, 2022
34bde7d
Improve wording in conclusions
Jun 13, 2022
87e9016
Remove refs to k-means++ init and improve narrative
Jun 16, 2022
cd6b8d4
Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…
Jun 16, 2022
097abcb
Apply suggestions from code review
ArturoAmorQ Jun 18, 2022
64a3560
Update examples/text/plot_document_clustering.py
ArturoAmorQ Jun 20, 2022
66a763a
Final tweaks
Jun 20, 2022
4414849
Merge branch 'doc_clustering' of github.com:ArturoAmorQ/scikit-learn …
Jun 20, 2022
38bf125
Specify sparcity of bag of words approach
Jun 20, 2022
ea3ae4a
Trigger CI
ogrisel Jun 20, 2022
a214de5
Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…
Jun 20, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading