FEA return final cross-validation score in `SequentialFeatureSelector` #31483

cboseak · 2025-06-04T20:54:32Z

Reference Issues/PRs

Fixes #31473 (Add option to return final cross-validation score in SequentialFeatureSelector)

What does this implement/fix? Explain your changes.

Added an attribute (e.g., `final_cv_score_`) that stores the mean cross-validation score of the final model with the selected features. This avoids having to run another cross-validation externally to get the final performance score.
- Currently, `SequentialFeatureSelector` performs cross-validation internally to decide which features to select, based on the scoring function. However, the final cross-validation score (e.g., recall) is not exposed by the fitted SFS object.
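For context, the external workaround that the description refers to can be sketched roughly as follows. This is only an illustration built on existing public scikit-learn APIs; the dataset, the estimator, and the parameter values are assumptions chosen for the example and are not taken from the PR.

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Example data and estimator (assumptions for illustration only).
X, y = load_iris(return_X_y=True)
estimator = KNeighborsClassifier(n_neighbors=3)

# SFS cross-validates candidate feature subsets internally while fitting,
# but the fitted object does not expose the score of the final subset.
sfs = SequentialFeatureSelector(
    estimator, n_features_to_select=2, cv=5, scoring="accuracy"
)
sfs.fit(X, y)

# Workaround: reduce X to the selected columns and cross-validate again.
X_selected = sfs.transform(X)
scores = cross_val_score(estimator, X_selected, y, cv=5, scoring="accuracy")

print("selected feature mask:", sfs.get_support())
print("mean CV score on selected features:", scores.mean())
```

The second `cross_val_score` call is the extra step the proposed change aims to make unnecessary, or at least more convenient.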

github-actions · 2025-06-04T20:55:32Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_Generated for commit: 1944c07. Link to the linter CI: here_


OmarManzoor

Thanks for the PR @cboseak

sklearn/feature_selection/tests/test_sequential.py

doc/whats_new/upcoming_changes/sklearn.feature_selection/31483.feature.rst

OmarManzoor

LGTM. Thanks @cboseak

sklearn/feature_selection/_sequential.py


cboseak · 2025-06-12T13:52:48Z

See latest changes to address your comments

adrinjalali

I'd need more opinions on this to see if we'd like to include it.

cc @scikit-learn/core-devs

sklearn/feature_selection/_sequential.py

doc/whats_new/upcoming_changes/sklearn.feature_selection/31483.feature.rst


adrinjalali

I don't mind the implementation as is, but I do wonder its usecases and whether it's useful to enough users.

Tagging for a second opinion: @OmarManzoor @adam2392

adrinjalali · 2025-06-18T11:18:57Z

sklearn/feature_selection/_sequential.py

+        scores : ndarray of shape (n_splits,)
+            Array of cross-validation scores for each split.
+        """
+        _raise_for_params(params, self, "get_final_cv_score")


please have a test for this.

OmarManzoor · 2025-06-18T11:43:13Z

When I approved this, I considered it being added as an attribute, but since that increases the fit time, I am not so sure about having a separate function that will still need to be called separately. Wouldn't that be similar to just calling the code within the function? I guess if it adds some convenience for users, we can add it.

adrinjalali · 2025-06-18T11:56:32Z

> Wouldn't that kind of be similar to just calling the code within the function?

Not sure which function you mean.

OmarManzoor · 2025-06-18T12:23:40Z

> Not sure which function you mean.

`get_final_cv_score`, the one that is added in this PR.

adam2392

IIUC, this is purely a convenience function, right?

The computation time to get the answer that you'd want is the same with or without the function.

In that case, my main criterion would be looking at whether this makes the API more usable. Is this function name also present in other feature selectors? If so, let's add it imo. If not, shouldn't we consolidate?

adam2392 · 2025-06-18T12:25:58Z

sklearn/feature_selection/_sequential.py

@@ -193,6 +193,21 @@ def __init__(
        self.cv = cv
        self.n_jobs = n_jobs

+    def _get_cv(self, y):


I don't see why this function is needed. Perhaps I'm missing something?

cboseak · 2025-06-18

It was a suggestion in one of the comments. Basically, we had duplicate code in two places (`cv = check_cv(self.cv, y, classifier=is_classifier(self.estimator))`), so it was moved into a function to clean it up.
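To illustrate what the deduplicated call resolves to, here is a minimal sketch. `resolve_cv` is a hypothetical stand-in for the PR's `_get_cv` helper, whose full body is not shown in this thread; only `check_cv` and `is_classifier` are existing scikit-learn functions, and the toy targets are assumptions.

```python
import numpy as np
from sklearn.base import is_classifier
from sklearn.linear_model import LinearRegression, LogisticRegression
from sklearn.model_selection import KFold, StratifiedKFold, check_cv


def resolve_cv(cv, estimator, y):
    """Hypothetical stand-in for `_get_cv`: build the splitter in one place."""
    return check_cv(cv, y, classifier=is_classifier(estimator))


# Toy targets (assumptions): a balanced binary target and a continuous one.
y_class = np.array([0, 1] * 10)
y_reg = np.linspace(0.0, 1.0, 20)

# With an integer `cv`, classifiers get a stratified splitter and
# regressors get a plain K-fold splitter.
clf_cv = resolve_cv(5, LogisticRegression(), y_class)
reg_cv = resolve_cv(5, LinearRegression(), y_reg)

print(type(clf_cv).__name__, type(reg_cv).__name__)
```

Keeping this logic in a single helper means the splitter used during feature selection and the one used for the final score cannot drift apart.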

cboseak · 2025-06-23T15:32:59Z

I wanted to check in on this one. What do we need to do to finish this PR? I can make any updates needed either way.

adrinjalali · 2025-07-02T14:06:11Z

> In that case, my main criterion would be looking at whether this makes the API more usable. Is this function name also present in other feature selectors? If so, let's add it imo. If not, shouldn't we consolidate?

I agree with @adam2392 here that it'd be nice to consider where else this could be used. Since we don't add estimator-level functions lightly, I'd be happy if you could investigate, @cboseak, for a consistent API across estimators.
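As one way to frame the consistency question, the sketch below shows that the same computation can be written once for any fitted selector that exposes `transform`. `final_cv_scores` is a hypothetical helper invented for this illustration; it is not part of scikit-learn or of this PR, and the dataset and estimator are assumptions.

```python
from sklearn.base import clone
from sklearn.datasets import load_iris
from sklearn.feature_selection import RFE, SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score


def final_cv_scores(fitted_selector, estimator, X, y, cv=5, scoring=None):
    """Hypothetical helper: per-split CV scores on the selected features."""
    X_selected = fitted_selector.transform(X)
    return cross_val_score(clone(estimator), X_selected, y, cv=cv, scoring=scoring)


X, y = load_iris(return_X_y=True)
estimator = LogisticRegression(max_iter=1000)

# Two different selectors; the helper only relies on `transform`.
selectors = {
    "SequentialFeatureSelector": SequentialFeatureSelector(
        estimator, n_features_to_select=2, cv=5
    ),
    "RFE": RFE(estimator, n_features_to_select=2),
}

results = {}
for name, selector in selectors.items():
    selector.fit(X, y)
    results[name] = final_cv_scores(selector, estimator, X, y, cv=5)
    print(name, results[name].mean())
```

Because nothing here is specific to `SequentialFeatureSelector`, a shared utility or a common method across selectors would be one possible answer to the consolidation question raised above.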

github-actions bot added the module:feature_selection label Jun 4, 2025

cboseak added 5 commits June 4, 2025 21:17

per issue 31473, return final cross-validation score in SequentialFea…

1be5946

…tureSelector

linting

2a27ec9

linting

f9231ba

linting

7ed2e54

add changelog entry

05bf68f

cboseak force-pushed the issue-31473 branch from 239fbcd to 05bf68f Compare June 5, 2025 02:17

betatim changed the title ~~per issue 31473, return final cross-validation score in SequentialFea…~~ FEA return final cross-validation score in SequentialFea… Jun 6, 2025

Merge branch 'main' into issue-31473

8fef31e

OmarManzoor reviewed Jun 10, 2025


add suggested changes

e73d1b1

OmarManzoor approved these changes Jun 11, 2025


OmarManzoor added the "Waiting for Second Reviewer" label (First reviewer is done, need a second one!) Jun 11, 2025

Merge branch 'main' into issue-31473

3edfd4f

adrinjalali reviewed Jun 12, 2025


sklearn/feature_selection/_sequential.py (outdated, resolved)

sklearn/feature_selection/_sequential.py (outdated, resolved)

adrinjalali removed the "Waiting for Second Reviewer" label (First reviewer is done, need a second one!) Jun 12, 2025

cboseak added 3 commits June 12, 2025 06:51

move final_cv_score_ to function get_final_cv_score to make it op…

7ecdc1a

…tional

update versionadded to 1.8 on get_final_cv_score

39b517b

fix linting errors

21dd9b0

adrinjalali changed the title ~~FEA return final cross-validation score in SequentialFea…~~ FEA return final cross-validation score in SequentialFeatureSelector Jun 12, 2025

update documentation comment

8881186

adrinjalali reviewed Jun 12, 2025


doc/whats_new/upcoming_changes/sklearn.feature_selection/31483.feature.rst (outdated, resolved)

cboseak and others added 3 commits June 12, 2025 09:06

Update doc/whats_new/upcoming_changes/sklearn.feature_selection/31483…

83c7534

….feature.rst update based on PR suggestions Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Create _get_cv function to remove redundant code. update `get_final…

e6e2f58

…_cv_score` to return raw values instead of mean

update test_get_final_cv_score_method test to match latest change

1944c07

adrinjalali reviewed Jun 18, 2025


adam2392 reviewed Jun 18, 2025


adrinjalali changed the title ~~FEA return final cross-validation score in SequentialFeatureSelector~~ FEA return final cross-validation score in SequentialFeatureSelector Jul 2, 2025
