RFC generalised Pipeline.get_feature_names #6424

jnothman · 2016-02-23T00:35:57Z

There has been some demand for Pipeline.get_feature_names (#2007, #5172, #6421) for the case where the last element in the pipeline is a feature extractor. Following on from #6372, we instead shall make get_feature_names able to transform some list of input features in the general case. I propose the following behaviour:

Pipeline.get_feature_names may be called with a list input_features as an argument only if all its estimators support get_feature_names with an argument. The initial input_features is transformed iteratively through the estimators.
Pipeline.get_feature_names may be called without an argument only if a suffix of its estimators support get_feature_names. The first of that suffix may or may not accept input_features, and the remainder must accept input_features; the output of the first get_feature_names call is iteratively modified by downstream transformers' get_feature_names.
- To be cautious until we find a use-case otherwise, get_feature_names will not be supported in the case that get_feature_names is available before (but not adjacent to) that suffix.
Otherwise, a ValueError is raised. Or: should the attribute become invisible, as for predict et al.?

The text was updated successfully, but these errors were encountered:

amueller · 2016-02-24T22:47:02Z

agreed on 1) and 2).
For three: maybe an AttriubuteError: the last step has no get_feature_names

jnothman · 2016-02-24T23:29:13Z

Do you mean an AttributeError if the last step has no get_feature_names? The problem with the AttributeError is that the definition currently allows for get_feature_names that does not take an argument. Testing for this when doing the attribute lookup is fairly heavy. (Though I suspect that we will require get_feature_names to take an argument, even if unused, in any estimator where the pipeline functionality is sought.)

amueller · 2016-02-25T16:14:11Z

Ah, I didn't think about that. But these are two different errors, right? one is there is no post-fix with get_features_names and the other is feature_names was passed and there is no post-fix that takes feature_names.

thomasjpfan · 2021-12-10T03:41:45Z

Now that we released Pipeline.get_feature_names_out in 1.0, I think this issue can be closed.

jnothman mentioned this issue Feb 23, 2016

Transformative get_feature_names for various transformers #6425

Closed

11 tasks

kmike mentioned this issue Oct 9, 2016

Scikit-learn Pipeline support TeamHG-Memex/eli5#15

Open

amueller mentioned this issue Nov 20, 2018

RFC Implement Pipeline get feature names #12627

Closed

3 tasks

jackwellsxyz mentioned this issue Mar 30, 2020

pipeline.get_feature_names() #16807

Closed

thomasjpfan closed this as completed Dec 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC generalised Pipeline.get_feature_names #6424

RFC generalised Pipeline.get_feature_names #6424

jnothman commented Feb 23, 2016

amueller commented Feb 24, 2016

jnothman commented Feb 24, 2016

amueller commented Feb 25, 2016

thomasjpfan commented Dec 10, 2021

RFC generalised Pipeline.get_feature_names #6424

RFC generalised Pipeline.get_feature_names #6424

Comments

jnothman commented Feb 23, 2016

amueller commented Feb 24, 2016

jnothman commented Feb 24, 2016

amueller commented Feb 25, 2016

thomasjpfan commented Dec 10, 2021