DOC Add links to preprocessing examples in docstrings and userguide #26877

StefanieSenger · 2023-07-21T12:59:37Z

This PR adds links to the examples from the Preprocessing section in the docstrings of the respective classes and functions.

github-actions · 2023-07-21T13:01:22Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: beb8ca8. Link to the linter CI: here

ArturoAmorQ

Thanks for the PR @StefanieSenger. Here is a first batch of comments.

sklearn/preprocessing/_target_encoder.py

sklearn/preprocessing/_discretization.py

doc/modules/preprocessing.rst

sklearn/preprocessing/_data.py

ArturoAmorQ

Apart from a bit of wording, LGTM :) Thanks again @StefanieSenger!

sklearn/preprocessing/_target_encoder.py

Co-authored-by: Arturo Amor <86408019+ArturoAmorQ@users.noreply.github.com>

adrinjalali

Left a few comments. It's hard to review since this PR is rather large and touches many examples; it's easier if PRs have a smaller scope.

examples/preprocessing/plot_all_scaling.py

sklearn/preprocessing/_data.py

adrinjalali · 2023-08-02T14:21:23Z

CI failing: https://dev.azure.com/scikit-learn/scikit-learn/_build/results?buildId=57497&view=logs&j=78a0bf4f-79e5-5387-94ec-13e67d216d6e&t=f1857171-4a53-55c7-3ab5-90acfe091baa&l=1240

StefanieSenger · 2023-08-04T15:14:11Z

Resolved the CI issues, thank you @adrinjalali

doc/modules/preprocessing.rst

sklearn/preprocessing/_data.py

ogrisel

Overall, LGTM, thanks for the PR. In addition to @adrinjalali's remarks here are a few more.

doc/modules/preprocessing.rst

ogrisel · 2023-08-22T15:39:07Z

sklearn/preprocessing/_data.py

@@ -291,6 +290,10 @@ class MinMaxScaler(OneToOneFeatureMixin, TransformerMixin, BaseEstimator):
    This transformation is often used as an alternative to zero mean,
    unit variance scaling.

+    MinMaxScaler doesn't reduce the effect of outliers; it only linearily
+    scales them down. For an example visualization, refer to :ref:`Compare


This statement holds for all scalers (StandardScaler, RobustScaler, MaxAbsScaler and MinMaxScaler). What is different is that the scale value found by RobustScaler is not sensitive to the presence of a few large marginal outliers while it is for StandardScaler and even more so for MinMaxScaler and MaxAbsScaler.
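The sensitivity described in this comment can be checked numerically. The following is a minimal sketch, not part of the PR: the synthetic data, the single outlier value of 1000, and the helper name `fitted_divisor` are all assumptions chosen for illustration. It compares the quantity each scaler divides by when fitted with and without one large outlier.

```python
import numpy as np
from sklearn.preprocessing import (
    MaxAbsScaler,
    MinMaxScaler,
    RobustScaler,
    StandardScaler,
)

# Illustrative data (an assumption): 1000 standard normal samples,
# then the same samples plus one large marginal outlier.
rng = np.random.RandomState(0)
X_clean = rng.normal(loc=0.0, scale=1.0, size=(1000, 1))
X_outlier = np.vstack([X_clean, [[1000.0]]])


def fitted_divisor(scaler, X):
    """Return the value the fitted scaler divides the feature by.

    Hypothetical helper for this sketch. MinMaxScaler divides by
    data_range_ (its scale_ attribute is the inverse multiplier);
    the other three scalers divide by scale_.
    """
    scaler.fit(X)
    if isinstance(scaler, MinMaxScaler):
        return float(scaler.data_range_[0])
    return float(scaler.scale_[0])


# Ratio of the divisor fitted with the outlier to the divisor fitted
# without it. A ratio close to 1 means the scale is robust.
ratios = {}
for cls in (StandardScaler, RobustScaler, MinMaxScaler, MaxAbsScaler):
    ratios[cls.__name__] = fitted_divisor(cls(), X_outlier) / fitted_divisor(
        cls(), X_clean
    )

for name, ratio in ratios.items():
    print(f"{name}: divisor changes by a factor of {ratio:.2f}")
```

Under these assumptions, the `RobustScaler` ratio stays close to 1, while the ratios for `StandardScaler`, `MinMaxScaler` and `MaxAbsScaler` are much larger.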

Yes, I see. To express how the MinMaxScaler differs from the other scalers concerning outliers, I have tried to come up with a new wording:

`MinMaxScaler` doesn't reduce the effect of outliers, but it linearly scales them down into a fixed range, where the largest occurring data point corresponds to the maximum value and the smallest one corresponds to the minimum value.

What do you think?
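The proposed wording can be illustrated with a small sketch. The toy array below is an assumption made for illustration and does not come from the PR; it uses the default `feature_range=(0, 1)`.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Toy feature (an assumption): four inliers and one large outlier.
X = np.array([[1.0], [2.0], [3.0], [4.0], [100.0]])

# With the default feature_range=(0, 1), the smallest value maps to 0
# and the largest value maps to 1.
X_scaled = MinMaxScaler().fit_transform(X)

# The mapping is linear, so the outlier is still far from the inliers,
# and the inliers are squeezed into a small part of the range.
inlier_spread = X_scaled[:4].max() - X_scaled[:4].min()

print(X_scaled.ravel())
print(f"Share of the [0, 1] range covered by the inliers: {inlier_spread:.3f}")
```

The outlier determines the range, so the four inliers end up covering only a small share of the interval.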

sklearn/preprocessing/_data.py

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

…cikit-learn#26877) Co-authored-by: Arturo Amor <86408019+ArturoAmorQ@users.noreply.github.com> Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

links to preprocessing examples added

a3715c4

github-actions bot added the module:preprocessing label Jul 21, 2023

StefanieSenger changed the title ~~links to preprocessing examples added~~ DOC Add links to preprocessing examples in docstrings and userguide Jul 21, 2023

github-actions bot added the Documentation label Jul 21, 2023

ArturoAmorQ reviewed Jul 28, 2023

adrinjalali mentioned this pull request Jul 28, 2023

Add links to examples from the docstrings and user guides #26927

Closed

changes after review

32babd1

ArturoAmorQ approved these changes Jul 31, 2023

sklearn/preprocessing/_target_encoder.py (outdated, resolved)

Update sklearn/preprocessing/_target_encoder.py

6f2ad44

Co-authored-by: Arturo Amor <86408019+ArturoAmorQ@users.noreply.github.com>

adrinjalali reviewed Jul 31, 2023

examples/preprocessing/plot_all_scaling.py (outdated, resolved)

sklearn/preprocessing/_data.py (outdated, resolved)

changed section links according to review

717ea68

satisfied CI

dd92aaf

ArturoAmorQ requested a review from adrinjalali August 18, 2023 09:21

adrinjalali reviewed Aug 18, 2023

doc/modules/preprocessing.rst (outdated, resolved)

sklearn/preprocessing/_data.py (outdated, resolved)

sklearn/preprocessing/_data.py (outdated, resolved)

ogrisel reviewed Aug 22, 2023

StefanieSenger and others added 7 commits August 23, 2023 21:33

Update doc/modules/preprocessing.rst

b86eb6a

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Update doc/modules/preprocessing.rst

f515a2f

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Update sklearn/preprocessing/_data.py

df1b51b

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Update sklearn/preprocessing/_data.py

d061c21

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

changes after review

3e6727d

Merge branch 'link_examples_preprocessing' of github.com:StefanieSeng…

9c428c6

…er/scikit-learn into link_examples_preprocessing

Merge branch 'main' into link_examples_preprocessing

beb8ca8

adrinjalali approved these changes Aug 24, 2023

adrinjalali merged commit 8ccaf0d into scikit-learn:main Aug 24, 2023

marenwestermann mentioned this pull request Dec 10, 2023

Added link to plot_adaboost_multiclass.py example #27913

Merged

StefanieSenger deleted the link_examples_preprocessing branch April 18, 2024 11:00

virchan mentioned this pull request Nov 5, 2024

DOC: Link Examples for SVR, NuSVR, and SVM User Guide #30201

Merged

StefanieSenger mentioned this pull request Jun 16, 2025

DOC Add link to StandardScaler example in docstring #31547

Closed
