ENH: improve validation for SGD models to accept l1_ratio=None when penalty is not `elasticnet` #30730

MarcBresson · 2025-01-28T15:41:06Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Providing l1_ratio only makes sense when the user sets penalty="elasticnet". The idea behind this PR is to make the l1_ratio parameter of both SGD models behave the same as the l1_ratio of LogisticRegression.

For now, I did something non-breaking, but ideally we would set the default value of SGDClassifier.l1_ratio to None. I'm waiting on feedback as to which path I should follow, since changing the default would be a breaking API change.

Here is the l1_ratio definition for logistic regression if you are curious.
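
The rule described above can be sketched in plain Python. This is a minimal illustration and not scikit-learn's implementation: the function name `check_l1_ratio` and the error messages are hypothetical, and the [0, 1] range check is assumed from the documented range of l1_ratio.

```python
def check_l1_ratio(penalty, l1_ratio):
    """Hypothetical helper: validate l1_ratio against the chosen penalty.

    Not scikit-learn code; the name and messages are illustrative only.
    """
    if l1_ratio is None:
        # None is only acceptable when the penalty does not use l1_ratio.
        if penalty == "elasticnet":
            raise ValueError("l1_ratio must be set when penalty is 'elasticnet'")
        return None
    # A numeric value is range-checked regardless of the penalty.
    if not 0.0 <= l1_ratio <= 1.0:
        raise ValueError(f"l1_ratio must be in [0, 1], got {l1_ratio!r}")
    return l1_ratio


# l1_ratio=None is accepted for penalties that ignore it.
print(check_l1_ratio("l2", None))
# A numeric value is still required for elasticnet.
print(check_l1_ratio("elasticnet", 0.15))
```

Under this sketch, only the combination of penalty="elasticnet" with l1_ratio=None is rejected for being unset; every other penalty tolerates None.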

Any other comments?

github-actions · 2025-01-28T15:42:44Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: c2e0b84. Link to the linter CI: here

adrinjalali · 2025-02-12T10:26:14Z

This also needs a test to check the behavior when None is passed, both with elasticnet and not.
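
One possible shape for such a test is a pair of case tables: one for parameter sets that should pass and one for those that should raise. The sketch below uses only the standard library. `validate` is a hypothetical stand-in for the estimator's parameter validation, and the case lists are illustrative rather than the ones added in this PR.

```python
def validate(penalty, l1_ratio):
    """Hypothetical stand-in for the estimator's parameter validation."""
    if penalty == "elasticnet" and l1_ratio is None:
        raise ValueError("l1_ratio must be set when penalty is 'elasticnet'")


# Parameter sets expected to pass validation.
PASSING = [
    {"penalty": "l2", "l1_ratio": None},
    {"penalty": "l1", "l1_ratio": None},
    {"penalty": "elasticnet", "l1_ratio": 0.15},
]

# Parameter sets expected to raise, with a fragment of the expected message.
FAILING = [
    ({"penalty": "elasticnet", "l1_ratio": None}, "l1_ratio must be set"),
]


def run_cases():
    """Check every case; return how many were checked."""
    for kwargs in PASSING:
        validate(**kwargs)  # must not raise
    for kwargs, fragment in FAILING:
        try:
            validate(**kwargs)
        except ValueError as exc:
            assert fragment in str(exc), f"unexpected message: {exc}"
        else:
            raise AssertionError(f"{kwargs} should have failed validation")
    return len(PASSING) + len(FAILING)


print(run_cases())
```

In a pytest-based suite the same tables would typically be fed to `pytest.mark.parametrize`, with `pytest.raises(ValueError, match=...)` covering the failing cases.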

adrinjalali

We could also add a changelog here.

sklearn/linear_model/_stochastic_gradient.py

sklearn/linear_model/tests/test_sgd.py

adrinjalali · 2025-02-18T10:00:41Z

Tests are failing here, and please avoid force pushing to the branch. Makes it harder to review. We squash and merge in the end anyway, so it doesn't matter how many commits you have here.

MarcBresson · 2025-02-18T10:29:47Z

Sorry for the wild rebase on the upstream main branch, I tend to forget to specify --no-ff

adrinjalali

Otherwise LGTM.

adrinjalali · 2025-02-19T09:32:49Z

doc/whats_new/v1.7.rst

Please follow this for adding a changelog: https://github.com/scikit-learn/scikit-learn/blob/main/doc/whats_new/upcoming_changes/README.md

adrinjalali · 2025-02-20T14:45:01Z

@OmarManzoor maybe you could have a look?

OmarManzoor

Thanks for the PR @MarcBresson

doc/whats_new/upcoming_changes/sklearn.linear_model/30730.enhancement.rst

doc/whats_new/v1.7.rst

OmarManzoor · 2025-02-21T10:14:41Z

sklearn/linear_model/tests/test_sgd.py

+        {"penalty": "l1", "l1_ratio": None},
+    ],
+)
+def test_sgd_passing_validation(klass, kwargs):


Suggested change

-def test_sgd_passing_validation(klass, kwargs):
+def test_sgd_passing_penalty_validation(klass, kwargs):
+    """Tests that acceptable values for the `penalty` parameter pass the
+    validation checks"""

OmarManzoor · 2025-02-21T10:14:53Z

sklearn/linear_model/tests/test_sgd.py

+        ),
+    ],
+)
+def test_sgd_failing_validation(klass, kwargs, err_msg):


Suggested change

-def test_sgd_failing_validation(klass, kwargs, err_msg):
+def test_sgd_failing_penalty_validation(klass, kwargs, err_msg):
+    """Tests that improper values for the `penalty` parameter raise on
+    validation"""

sklearn/linear_model/tests/test_sgd.py

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

jeremiedbb

I applied the requested changes and modified the tests to avoid checking what's already covered by the common tests.

LGTM. Thanks @MarcBresson

…enalty is not `elasticnet` (scikit-learn#30730) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai> Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

github-actions bot added the module:linear_model label Jan 28, 2025

adrinjalali approved these changes Feb 12, 2025

adrinjalali reviewed Feb 17, 2025

sklearn/linear_model/_stochastic_gradient.py (outdated)

sklearn/linear_model/tests/test_sgd.py (outdated)

MarcBresson added 6 commits February 17, 2025 13:42

ENH: improve validation for SGD models to accept l1_ratio=None when penalty is not `elasticnet`

f54f8c1

ENH: raise error in BaseSGD if penalty is elasticnet and l1_ratio is not specified

a9678be

TST: add test for SGD parameters validation

efdf8f6

ENH: use proxy function for l1_ratio in sgd

ec74036

TST: match error message for set of params with failing validation

9b527d9

DOC: add changelog for PR#30730

e234307

MarcBresson force-pushed the main branch from 2c62177 to e234307 Compare February 17, 2025 12:46

TST: use different error message for different cause of validation failure

e5f3a48

MarcBresson added 2 commits February 18, 2025 11:55

TST: fix regex for error matching

f6d035f

TST: fix regex for error matching (bis)

8756b49

adrinjalali reviewed Feb 19, 2025

MarcBresson added 2 commits February 19, 2025 18:12

DOC: update changelog with PR#30730

6bf8989

Merge remote-tracking branch 'upstream/main'

50486f3

adrinjalali added the Waiting for Second Reviewer label Feb 20, 2025

OmarManzoor reviewed Feb 21, 2025

jeremiedbb and others added 3 commits April 18, 2025 23:59

Merge remote-tracking branch 'upstream/main' into pr/MarcBresson/30730

3dad952

apply review comments + improve tests

750b3af

Co-authored-by: Omar Salman <omar.salman2007@gmail.com>

[azure parallel]

c2e0b84

jeremiedbb approved these changes Apr 18, 2025

jeremiedbb merged commit 9a6e90a into scikit-learn:main Apr 18, 2025
36 checks passed
