FIX Run common tests on SparseCoder #32077

FrancoisPgm · 2025-09-02T10:24:08Z

Reference Issues/PRs

Towards #26482
See also #27724 and #26691 which appear to be stalled.
Closes #27724
Closes #26691

What does this implement/fix? Explain your changes.

This runs the common estimator tests on SparseCoder.

To match the behavior expected by the common tests, n_components_ and n_features_in_ are changed from properties to attributes initialized in the fitmethod. validate_data is run in fit.

Specific dictionary arguments for checks are added in PER_ESTIMATOR_CHECK_PARAMS.

To be able to use a specific dictionary in check_set_output_transform, _yield_instances_for_check is used in test_set_output_transform.

github-actions · 2025-09-02T10:25:01Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: fbb9dbf. Link to the linter CI: here}

sklearn/decomposition/_dict_learning.py

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb · 2025-09-02T15:46:08Z

sklearn/decomposition/_dict_learning.py

-    @property
-    def n_features_in_(self):
-        """Number of features seen during `fit`."""
-        return self.dictionary.shape[1]
-


Note for reviewers: With this modification, n_features_in_ is now only set if fit is called. It makes this estimator follow our API and in line with this discussion #27724 (comment)

…te fit docstring, refactor check_array parameters

…kit-learn into SparseCoder_common_tests

jeremiedbb

A couple of suggestions for the docstrings. Looks good otherwise. Please also add a changelog entry. You can say that SparseCoder now follows the transformer API of scikit-learn and that now parameter and input validation is executed in fit (you can follow these instructions to write a changelog fragment https://github.com/scikit-learn/scikit-learn/blob/main/doc/whats_new/upcoming_changes/README.md)

sklearn/decomposition/_dict_learning.py

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

doc/whats_new/upcoming_changes/sklearn.decomposition/32077.enhancement.rst

jeremiedbb

LGTM, Thanks @FrancoisPgm

jeremiedbb · 2025-09-03T09:36:10Z

maybe @adrinjalali for a second review since you reviewed the previous attempt #27724.

…ancement.rst Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

adrinjalali · 2025-09-03T13:09:28Z

sklearn/utils/_test_common/instance_generator.py

@@ -711,6 +716,38 @@
        ],
    },
    SkewedChi2Sampler: {"check_dict_unchanged": dict(n_components=1)},
+    SparseCoder: {
+        "check_estimators_dtypes": dict(dictionary=rng.normal(size=(5, 5))),


is there a single case where we can set to have it pass all the tests?

dictionary is not a very friendly parameter because it needs to have a shape compatible with X, but all the checks have different Xs

adrinjalali · 2025-09-03T13:09:51Z

sklearn/tests/test_common.py

@@ -356,6 +360,7 @@ def test_set_output_transform(estimator):
            f"Skipping check_set_output_transform for {name}: Does not support"
            " set_output API"
        )
+    estimator = next(_yield_instances_for_check(check_set_output_transform, estimator))


seems to be a nice catch, but I think we should then go through all instances for this test, instead of only the first one.

Yes indeed, for the sparsecoder right now it's always one instance but it can be more. I'll add a for loop to go through all instances.

…rst one

jeremiedbb · 2025-09-04T12:33:53Z

doc/whats_new/upcoming_changes/sklearn.decomposition/32077.enhancement.rst

@@ -0,0 +1,2 @@
+- :class:`decomposition.SparseCoder` now follows the transformer API of scikit-learn.
+  In addition, the :meth:`fit` method now validates the input and parameters.


Please reference yourself as author of the PR: "By :user:`your name <your handle>`".

adrinjalali

Thanks!

add SparseCoder to estimators in common tests

f5380f5

github-actions bot added module:decomposition module:utils labels Sep 2, 2025

fix pandas and polars test

f346d11

jeremiedbb reviewed Sep 2, 2025

View reviewed changes

FrancoisPgm and others added 2 commits September 2, 2025 15:42

Update sklearn/decomposition/_dict_learning.py

d8aeb0a

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

Update sklearn/decomposition/_dict_learning.py

4ff1eb6

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb reviewed Sep 2, 2025

View reviewed changes

FrancoisPgm added 2 commits September 2, 2025 17:56

Add a test in fit that X and dictionary have the same n_feature, upda…

70e29b4

…te fit docstring, refactor check_array parameters

Merge branch 'SparseCoder_common_tests' of github.com:FrancoisPgm/sci…

419fd84

…kit-learn into SparseCoder_common_tests

jeremiedbb reviewed Sep 2, 2025

View reviewed changes

sklearn/decomposition/_dict_learning.py Outdated Show resolved Hide resolved

sklearn/decomposition/_dict_learning.py Outdated Show resolved Hide resolved

FrancoisPgm and others added 3 commits September 3, 2025 09:59

Update sklearn/decomposition/_dict_learning.py

efe5b9b

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

Update sklearn/decomposition/_dict_learning.py

7204c5e

Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

add changelog

88beb6a

jeremiedbb reviewed Sep 3, 2025

View reviewed changes

doc/whats_new/upcoming_changes/sklearn.decomposition/32077.enhancement.rst Outdated Show resolved Hide resolved

jeremiedbb approved these changes Sep 3, 2025

View reviewed changes

jeremiedbb added the Waiting for Second Reviewer First reviewer is done, need a second one! label Sep 3, 2025

Update doc/whats_new/upcoming_changes/sklearn.decomposition/32077.enh…

42f8be4

…ancement.rst Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

adrinjalali reviewed Sep 3, 2025

View reviewed changes

iterate over _yield_instances_for_check instead of taking just the fi…

c9b1689

…rst one

jeremiedbb reviewed Sep 4, 2025

View reviewed changes

add user name to changelog

fbb9dbf

adrinjalali approved these changes Sep 9, 2025

View reviewed changes

adrinjalali merged commit 68218f7 into scikit-learn:main Sep 9, 2025
36 checks passed

jeremiedbb mentioned this pull request Sep 9, 2025

Run common test for SparseCoder and FeatureUnion #26482

Closed

		@@ -0,0 +1,2 @@
		- :class:`decomposition.SparseCoder` now follows the transformer API of scikit-learn.
		In addition, the :meth:`fit` method now validates the input and parameters.

Uh oh!

FIX Run common tests on SparseCoder #32077

FIX Run common tests on SparseCoder #32077

Conversation

FrancoisPgm commented Sep 2, 2025 • edited by jeremiedbb Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Uh oh!

github-actions bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jeremiedbb Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

jeremiedbb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jeremiedbb left a comment

Choose a reason for hiding this comment

Uh oh!

jeremiedbb commented Sep 3, 2025

Uh oh!

adrinjalali Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

adrinjalali Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

FrancoisPgm Sep 3, 2025

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

FrancoisPgm commented Sep 2, 2025 •

edited by jeremiedbb

Loading

github-actions bot commented Sep 2, 2025 •

edited

Loading

jeremiedbb Sep 4, 2025 •

edited

Loading