ENH Adds get_feature_names_out to neighbors module #22212

Micky774 · 2022-01-14T15:52:08Z

Reference Issues/PRs

Addresses #21308

What does this implement/fix? Explain your changes.

Implements the get_feature_names_out method to the neighbors module

Any other comments?

This is a work in progress

Initial implementation of `get_feature_names_out`

Micky774 · 2022-01-14T16:43:39Z

Since both transformers map from a collection of feature vectors to a distance graph, the "output features" would just be the distance to each point of the fitted data right? This would be very verbose and potentially unwieldy for what I imagine the intended use of get_feature_names_out.

So the two obvious (not necessarily great) solutions to this I see are:

Make it so that each output feature name is the distance to point in the fitted data (e.g. output features = columns of distance graph).
Make it so there is a single output feature, which is just the entire distance graph.

Part of me thinks the more "romantic" solution would be to have n_neighbors -many output features, where each describes the (sorted) n_neighbors many non-zero distances but I don't really think that makes sense given that the output of the transformers is a distance graph.

I'm leaning towards the first option (output features = columns of distance graph).

sklearn/neighbors/_base.py

ogrisel · 2022-01-14T17:48:50Z

For KNeighborsTransformer, n_neighbors is fixed a priori. That should answer your comment above.

thomasjpfan

Thanks for the PR!

sklearn/neighbors/_base.py

- Moved `_ClassNamePrefixFeaturesOutMixin` to specific transformers - Added `_ClassNamePrefixFeaturesOutMixin` to NCA - Added `self._n_features_out` to NCA

Micky774 · 2022-01-14T18:40:21Z

For KNeighborsTransformer, n_neighbors is fixed a priori. That should answer your comment above.

I guess my question is more-so: since the outputs of KNeighborsTransformer and RadiusNeighborsTransformer are distance graphs, is it sufficient to treat them as returning n_samples_fit_ output features, and to then assign them class-prefixed feature names, as does my current implementation?

thomasjpfan · 2022-01-14T20:13:30Z

I guess my question is more-so: since the outputs of KNeighborsTransformer and RadiusNeighborsTransformer are distance graphs, is it sufficient to treat them as returning n_samples_fit_ output features, and to then assign them class-prefixed feature names, as does my current implementation?

Yup, the number of feature names out must match the amount of output features from transform.

sklearn/neighbors/_nca.py

sklearn/neighbors/_base.py

ogrisel

https://github.com/scikit-learn/scikit-learn/pull/22212/files#r785147042 still needs to be addressed. Once done, I think this PR LGTM.

thomasjpfan

Thanks for the update @Micky774 !

We need tests to check the actual names of get_feature_names_out. See:

scikit-learn/sklearn/cross_decomposition/tests/test_pls.py

Lines 613 to 614 in 1d69784

    
           def test_pls_feature_names_out(Klass): 
        
               """Check `get_feature_names_out` cross_decomposition module."""

sklearn/neighbors/_nca.py

sklearn/neighbors/_base.py

Micky774 · 2022-01-19T06:27:27Z

Added in the unit tests! Thank you for all your active feedback, and for your patience in this. Let me know if there's anything else I should add here!

ogrisel

I merged the main branch to resolve the conflict in the changelog. I also made this an enhancement instead of an API change because those features names method did not exist under a different name in the last released scikit-learn version.

One more code style detail, but otherwise, LGTM.

sklearn/neighbors/tests/test_nca.py

ogrisel

I wanted to approve this PR in previous review, instead of just commenting.

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

thomasjpfan

LGTM

Added get_feature_names_out support to neighbors module

631df1e

Initial implementation of `get_feature_names_out`

github-actions bot added the module:neighbors label Jan 14, 2022

Brought sklearn/neighbors/_base.py into style compliance

e861631

Micky774 mentioned this pull request Jan 14, 2022

Implement get_feature_names_out for all estimators #21308

Closed

14 tasks

Added _n_features_out attribute to NeighborsBase

1d360bf

ogrisel reviewed Jan 14, 2022

View reviewed changes

sklearn/neighbors/_base.py Outdated Show resolved Hide resolved

thomasjpfan reviewed Jan 14, 2022

View reviewed changes

sklearn/neighbors/_base.py Outdated Show resolved Hide resolved

Micky774 added 2 commits January 14, 2022 13:27

Improved implementation of get_feature_names_out

97d2dbd

- Moved `_ClassNamePrefixFeaturesOutMixin` to specific transformers - Added `_ClassNamePrefixFeaturesOutMixin` to NCA - Added `self._n_features_out` to NCA

Added full calculation of _n_features_out in NCA

968c568

Updated whats_new doc

0b24bcc

thomasjpfan reviewed Jan 14, 2022

View reviewed changes

sklearn/neighbors/_nca.py Outdated Show resolved Hide resolved

sklearn/neighbors/_base.py Outdated Show resolved Hide resolved

Moved _n_features_out creation to fit methods

0064d94

Micky774 changed the title ~~[WIP] ENH Adds get_feature_names_out to neighbors module~~ ENH Adds get_feature_names_out to neighbors module Jan 14, 2022

ogrisel reviewed Jan 17, 2022

View reviewed changes

Micky774 added 2 commits January 17, 2022 11:47

Merge branch 'main' into feature_names_neighbors

c84a407

Moved self._n_features_out calculation to transformers in neighbors

4b12ebb

thomasjpfan reviewed Jan 18, 2022

View reviewed changes

sklearn/neighbors/_nca.py Outdated Show resolved Hide resolved

sklearn/neighbors/_base.py Outdated Show resolved Hide resolved

Micky774 added 4 commits January 19, 2022 00:57

Improved nca._n_features_out strategy

fba68f2

Fixed typo in calculating NCA._n_features_out

c03ec3f

Added unit tests for get_feature_names_out in neighbors module

45c56c2

Updated test names

222cddd

Merge branch 'main' into feature_names_neighbors

d35cd5e

ogrisel reviewed Jan 19, 2022

View reviewed changes

sklearn/neighbors/tests/test_nca.py Outdated Show resolved Hide resolved

ogrisel approved these changes Jan 19, 2022

View reviewed changes

Update sklearn/neighbors/tests/test_nca.py

0b8acdc

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

thomasjpfan approved these changes Jan 20, 2022

View reviewed changes

thomasjpfan changed the title ~~ENH Adds get_feature_names_out to neighbors module~~ ENH Adds get_feature_names_out to neighbors module Jan 20, 2022

thomasjpfan merged commit 330881a into scikit-learn:main Jan 20, 2022

Micky774 deleted the feature_names_neighbors branch February 22, 2022 18:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH Adds get_feature_names_out to neighbors module #22212

ENH Adds get_feature_names_out to neighbors module #22212

Uh oh!

Micky774 commented Jan 14, 2022

Uh oh!

Micky774 commented Jan 14, 2022 •

edited

Loading

Uh oh!

Uh oh!

ogrisel commented Jan 14, 2022

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Micky774 commented Jan 14, 2022 •

edited

Loading

Uh oh!

thomasjpfan commented Jan 14, 2022

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

Micky774 commented Jan 19, 2022

Uh oh!

ogrisel left a comment

Uh oh!

Uh oh!

ogrisel left a comment

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

	def test_pls_feature_names_out(Klass):
	"""Check `get_feature_names_out` cross_decomposition module."""

Uh oh!

ENH Adds get_feature_names_out to neighbors module #22212

ENH Adds get_feature_names_out to neighbors module #22212

Uh oh!

Conversation

Micky774 commented Jan 14, 2022

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

Micky774 commented Jan 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ogrisel commented Jan 14, 2022

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Micky774 commented Jan 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thomasjpfan commented Jan 14, 2022

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Micky774 commented Jan 19, 2022

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Micky774 commented Jan 14, 2022 •

edited

Loading

Micky774 commented Jan 14, 2022 •

edited

Loading