
MNT Change LinearRegression default tol to not break behavior #30688


Merged
3 commits merged into scikit-learn:main on Jan 23, 2025

Conversation

jeremiedbb (Member)

Fixes #30684

Now that LinearRegression has a tol parameter, we need to set it to a low value if we want to compare with the exact solution.
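To illustrate what "compare with the exact solution" means here, a minimal sketch with made-up data, assuming a scikit-learn build that includes the unreleased tol parameter from #30521 (it controls the stopping tolerance of the iterative lsqr solver used for sparse input):

```python
import numpy as np
from scipy import sparse
from sklearn.linear_model import LinearRegression

rng = np.random.RandomState(0)
X_dense = rng.randn(100, 10)
X = sparse.csr_matrix(X_dense)  # sparse input is solved iteratively with lsqr
y = rng.randn(100)

# Exact least-squares solution, computed in closed form for reference.
coef_exact = np.linalg.lstsq(X_dense, y, rcond=None)[0]

# A tight tol is needed for the iterative solution to match the exact one.
reg = LinearRegression(fit_intercept=False, tol=1e-16).fit(X, y)
np.testing.assert_allclose(reg.coef_, coef_exact, rtol=1e-6)
```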


github-actions bot commented Jan 21, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit 3b0c267.

ogrisel (Member) left a comment

LGTM!

ogrisel (Member) commented Jan 21, 2025

No need to backport to 1.6.X since #30521 was merged after the 1.6.0 release, and the docstring mentions that the new tol parameter is intended to be released as part of the future 1.7.

lesteve (Member) commented Jan 21, 2025

The change to the test seems fine, but at the same time this means that there is a behaviour change in #30521, right?

Maybe this should be mentioned in the changelog?

jeremiedbb (Member, Author)

> The change to the test seems fine, but at the same time this means that there is a behaviour change in #30521, right?
>
> Maybe this should be mentioned in the changelog?

Alternatively, we can set the default tol to 1e-6, which is the default for scipy's lsqr (atol=1e-06, btol=1e-06), so that we don't change the behavior.
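For reference, a small sketch (synthetic data, not from the PR) of the scipy defaults mentioned above: scipy.sparse.linalg.lsqr stops at atol=1e-06 and btol=1e-06 unless tighter tolerances are requested, so with the defaults it returns only an approximation of the exact solution.

```python
import numpy as np
from scipy.sparse.linalg import lsqr

rng = np.random.RandomState(0)
X = rng.randn(200, 5)
y = rng.randn(200)

# Exact least-squares solution for comparison.
coef_exact = np.linalg.lstsq(X, y, rcond=None)[0]

coef_default = lsqr(X, y)[0]                        # atol=btol=1e-6 by default
coef_tight = lsqr(X, y, atol=1e-12, btol=1e-12)[0]  # tighter stopping criterion

print(np.max(np.abs(coef_default - coef_exact)))  # approximate
print(np.max(np.abs(coef_tight - coef_exact)))    # much closer to exact
```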

jeremiedbb (Member, Author) commented Jan 22, 2025

Reading more of the issues and PRs that led to adding the tol parameter, I didn't see any specific motivation for setting it to 1e-4 by default. My guess is that it's the default tol of Ridge. The snippet in the original issue requires a tol at least as low as 1e-12 to pass, which is reflected in the common test.

So I think we can change the default to 1e-6 now, which is harmless since it hasn't been released yet, to avoid the behavior change. We can consider changing the default later to match Ridge if we really want to, but it doesn't feel necessary.

jeremiedbb changed the title from "TST Fix test for LinearRegression with sample_weights" to "MNT Change LinearRegression default tol to not break behavior" on Jan 22, 2025
lesteve (Member) commented Jan 22, 2025

Thanks for having a closer look!

I don't have a strong opinion on what is preferable:

  • tol=1e-6: more conservative, avoiding the behaviour change
  • tol=1e-4: consistent with the default Ridge tol (absent a stronger argument), but introduces a behaviour change. Some may argue that LinearRegression should not be used for anything serious (you should always use Ridge with a small penalty; see the sketch below), so maybe breaking backward compatibility is OK-ish
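As a side illustration of the "Ridge with a small penalty" point above (a minimal sketch with made-up data, not from the PR): as alpha approaches 0, Ridge's coefficients converge to those of LinearRegression.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.RandomState(0)
X = rng.randn(80, 3)
y = X @ np.array([1.5, -2.0, 0.5]) + 0.1 * rng.randn(80)

ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=1e-10).fit(X, y)  # near-zero penalty

# With alpha ~ 0 the two estimators agree to numerical precision.
print(np.max(np.abs(ols.coef_ - ridge.coef_)))
```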

jeremiedbb (Member, Author)

@ogrisel, @glemaitre, what do you think?

ogrisel (Member) commented Jan 23, 2025

Let's do the backward-compat option for now. My long-term plan would be to align LinearRegression better with Ridge, but this will involve more work.

```diff
@@ -72,7 +72,7 @@ def test_linear_regression_sample_weights(
     sample_weight = 1.0 + rng.uniform(size=n_samples)

     # LinearRegression with explicit sample_weight
-    reg = LinearRegression(fit_intercept=fit_intercept)
+    reg = LinearRegression(fit_intercept=fit_intercept, tol=1e-16)
```
Member

So I guess this is not needed if we go for backward compat, i.e. tol=1e-6 by default?

Suggested change
```diff
-    reg = LinearRegression(fit_intercept=fit_intercept, tol=1e-16)
+    reg = LinearRegression(fit_intercept=fit_intercept)
```

jeremiedbb (Member, Author)

I would keep it anyway, because it's kind of "lucky" that it works with a higher tol. The reason tol was added was that it made a sample-weight test fail, and this test also checks sample weights. Those tests require good convergence.
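To make the convergence point concrete, here is a sketch (made-up data, not the actual common test) of the kind of sample-weight equivalence check involved: the iterative fit with sample_weight has to match the closed-form weighted least-squares solution, which only holds if the solver is run to tight convergence.

```python
import numpy as np
from scipy import sparse
from sklearn.linear_model import LinearRegression

rng = np.random.RandomState(0)
X_dense = rng.randn(60, 4)
X = sparse.csr_matrix(X_dense)
y = rng.randn(60)
sw = 1.0 + rng.uniform(size=60)

# With a loose tol the solver can stop short of full convergence and make
# this comparison flaky; tol=1e-16 makes it reliable.
reg = LinearRegression(fit_intercept=False, tol=1e-16).fit(X, y, sample_weight=sw)

# Closed-form weighted least squares: coef = (X' W X)^{-1} X' W y
W = np.diag(sw)
coef_closed = np.linalg.solve(X_dense.T @ W @ X_dense, X_dense.T @ W @ y)
np.testing.assert_allclose(reg.coef_, coef_closed, rtol=1e-6)
```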

Member

OK fair enough!

lesteve (Member) commented Jan 23, 2025

Let's merge this one, thanks!

lesteve merged commit 73db8f1 into scikit-learn:main on Jan 23, 2025. 32 checks passed.

Successfully merging this pull request may close these issues.

⚠️ CI failed on Linux_Runs.pylatest_conda_forge_mkl (last failure: Jan 21, 2025) ⚠️