DOC Release Highlights for version 1.6 #30392

jeremiedbb · 2024-12-02T17:04:21Z

Candidates for the highlights

FrozenEstimator
Transform metadata in Pipeline
Missing value support in ExtraTreesClassifier/Regressor
fetch_file
@ogrisel do we want to showcase an example for that ? If so what file should we download ?
News on array api support
News on metadata routing support
Free threading support
ping @lesteve, would you mind writing this item ? You'll be more accurate than me :)
Developer API

ping @adrinjalali, who kindly offered to write something for the frozen estimator, metadata in pipeline, and developer API.

cc/ @scikit-learn/communication-team We plan to release 1.6.0 final this week.
cc/ @scikit-learn/core-devs Feel free to correct any inaccuracies I may have introduced or add items that I have missed.

github-actions · 2024-12-02T17:05:38Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: e2208cd. Link to the linter CI: here

lesteve · 2024-12-03T09:12:46Z

> ping @lesteve, would you mind writing this item ? You'll be more accurate than me :)

Sure. I guess #30360 should be merged first, and then the highlights would have an even shorter description with a link to the changelog entry?

adrinjalali · 2024-12-04T09:59:17Z

I'm not sure if the pipeline's transform input is sth we should write about in this release, since there's no real example out there where this is now useful. WDYT?

jeremiedbb · 2024-12-04T10:34:06Z

> I'm not sure if the pipeline's transform input is sth we should write about in this release, since there's no real example out there where this is now useful. WDYT?

Since it's one of the only 2 "major features" of this release I find it sad to not showcase it in the highlights. Can we make a toy example even if we can't benefit from it directly in sklearn but third party libraries might ?

adrinjalali · 2024-12-04T11:52:57Z

Ok, let me know what you think about it then. Added a non-executable piece of code.

lesteve · 2024-12-04T12:36:52Z

I have added free-threaded highlights which I pretty much copied from the changelog entry.

I chose to do this rather than having a shorter description with a link to the changelog entry to save one click. Let me know if you prefer the latter option!

jeremiedbb · 2024-12-04T13:48:09Z

> Ok, let me know what you think about it then. Added a non-executable piece of code.

I think a non-executable snippet is fine, thanks !

jeremiedbb · 2024-12-04T13:48:49Z

> I chose to do this rather than having a shorter description with a link to the changelog entry to save one click

Yeah it's better since the highlights are already linked in the changelog so no need to loop around once more

lorentzenchr · 2024-12-05T10:43:29Z

Could we add the newton-cholesky solver. The one PR of this release is not that big, but it completes a larger journey and we have not advertised it much.

glemaitre · 2024-12-05T10:51:59Z

examples/release_highlights/plot_release_highlights_1_6_0.py

+
+threshold_classifier = FixedThresholdClassifier(
+    estimator=FrozenEstimator(classifier), threshold=0.9
+)


Do we want to call fit? Maybe one way to show that this is a no-op is to show some timing:

import time

from sklearn.datasets import make_classification
from sklearn.frozen import FrozenEstimator
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import FixedThresholdClassifier

X, y = make_classification(n_samples=1_000, random_state=0)

start = time.time()
classifier = SGDClassifier().fit(X, y)
print(f"Fitting the classifier took {(time.time() - start) * 1_000:.2f} milliseconds")

start = time.time()
threshold_classifier = FixedThresholdClassifier(
    estimator=FrozenEstimator(classifier), threshold=0.9
).fit(X, y)
print(
    f"Fitting the threshold classifier took {(time.time() - start) * 1_000:.2f} milliseconds"
)

Fitting the classifier took 2.53 milliseconds
Fitting the threshold classifier took 0.61 milliseconds

and add an extra conclusion line.

glemaitre

Only some comments regarding the grammar and two nitpicks.

jeremiedbb · 2024-12-05T10:57:33Z

> Could we add the newton-cholesky solver. The one PR of this release is not that big, but it completes a larger journey and we have not advertised it much.

I agree that it could have been highlighted back then but I'm not very comfortable putting it in the highlights of 1.6 while it was released in 1.2.

The highlights should be about the new stuff that was not there previously. I think that it's not the best place to communicate more about it. Maybe @koaning would be interested in making a video about that ?

ogrisel

First pass of feedback:

examples/release_highlights/plot_release_highlights_1_6_0.py

ogrisel · 2024-12-05T11:23:43Z

I would be +1 about advertising the newton-cholesky solver, even if this release only adds support for the multinomial/multiclass case in LogisticRegression. This is a non-trivial PR with dramatic performance improvement on real-world application datasets processed with common feature engineering. Maybe we could link the benchmark results from the PR:

#28840 (comment)

That should not prevent anybody else from advertising it even more in blog/social media posts or videos.

koaning · 2024-12-05T12:31:08Z

@jeremiedbb I can for sure make another video for the scikit-learn YouTube channel, but I usually prefer to start work on that once the actual release is live and tested.

jeremiedbb · 2024-12-05T12:37:16Z

> I would be +1 about advertising the newton-cholesky solver, even if this release only adds support for the multinomial/multiclass case in LogisticRegression. This is a non-trivial PR with dramatic performance improvement on real-world application datasets processed with common feature engineering. Maybe we could link the benchmark results from the PR

Alright, would you @ogrisel or @lorentzenchr mind writing this section ? I haven't followed that in detail so you'll be a lot more precise and accurate than me :)

ogrisel · 2024-12-05T13:07:36Z

Let me give it a shot.

ogrisel · 2024-12-05T15:30:15Z

I pushed f0669ce to highlight the work on the new solver. I toyed a bit with generating synthetic multiclass data where it would make a difference in terms of convergence to a better model, but it's not easy to find the regime where it really shines. In the end I just added a paragraph with a link to the benchmark results from the PR.

I checked that I can still reproduce them from the current main.

jeremiedbb · 2024-12-05T17:21:16Z

So I can't approve my own PR, but since I didn't write most of it I give my +1 anyway 😄

Is it good for you as well ? If so, please give your approval so that we can merge it and continue the release process :)

ogrisel

LGTM as well.

jeremiedbb added 2 commits December 2, 2024 17:53

draft release highlights 1.6

c2de0f1

blank line at the end

6355874

github-actions bot added the Documentation label Dec 2, 2024

lint

ff20e38

add frozen estimator and developer API sections

7808052

add pipeline transform

ba403e9

Add free-threaded highlights

0474a66

jeremiedbb commented Dec 4, 2024

adrinjalali and others added 2 commits December 4, 2024 15:50

Update examples/release_highlights/plot_release_highlights_1_6_0.py

a529121

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

iter

f79a3fa

glemaitre reviewed Dec 5, 2024

glemaitre self-requested a review December 5, 2024 10:31

address review comments

26f4304

ogrisel reviewed Dec 5, 2024

address more review comments

d177910

jeremiedbb and others added 3 commits December 5, 2024 14:56

fix link

6349037

fix link

244a126

Highlight multiclass logistic regression with the newton-cholesky solver

f0669ce

ogrisel reviewed Dec 5, 2024

jeremiedbb and others added 4 commits December 5, 2024 16:35

Update examples/release_highlights/plot_release_highlights_1_6_0.py

3daaec9

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

fix link

2c78f52

Merge remote-tracking branch 'origin/release-highlights-1.6' into release-highlights-1.6

cb962e2

iter

e2208cd

glemaitre approved these changes Dec 6, 2024

ogrisel approved these changes Dec 6, 2024

ogrisel merged commit a23aef1 into scikit-learn:main Dec 6, 2024
30 checks passed

jeremiedbb added a commit that referenced this pull request Dec 6, 2024

DOC Release Highlights for version 1.6 (#30392)

7015ee6

Co-authored-by: adrinjalali <adrin.jalali@gmail.com> Co-authored-by: Loïc Estève <loic.esteve@ymail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
