ENH: Add feature_names_ property to PolynomialFeatures #6216

maniteja123 · 2016-01-23T15:57:32Z

Added a feature_names_ property to PolynomialFeatures in preprocessing. See issue #6185. I am not sure this is the expected solution; please let me know if anything else needs to be done. Thanks.

aniryou · 2016-01-28T11:44:43Z

There are multiple output features; your approach doesn't take that into consideration.

For a degree-2 polynomial with 3 input features, we need:
['bias', 'X1', 'X2', 'X3', 'X1^2', 'X1*X2', 'X1*X3', 'X2^2', 'X2*X3', 'X3^2']

I generated it with:

@property
def feature_names_(self):
    # Assumes `import numpy as np` and scikit-learn's check_is_fitted are in scope.
    check_is_fitted(self, 'n_input_features_')

    def pstr(p):
        # p is one row of powers_: the exponent of each input feature.
        if np.count_nonzero(p) == 0:
            return 'bias'
        idx = np.nonzero(p)[0]  # avoid shadowing the builtin `vars`
        exps = p[idx]
        vstrs = ['X' + str(v + 1) for v in idx]
        estrs = ['^' + str(e) if e > 1 else '' for e in exps]
        return '*'.join(v + e for v, e in zip(vstrs, estrs))

    return [pstr(power) for power in self.powers_]
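
For anyone who wants to check the expected list without a fitted estimator, here is a minimal standalone sketch of the same naming scheme. It assumes the terms are enumerated with `itertools.combinations_with_replacement`, grouped by total degree; the function name `polynomial_feature_names` and the `prefix` parameter are made up for illustration and are not part of scikit-learn.

```python
from collections import Counter
from itertools import combinations_with_replacement


def polynomial_feature_names(n_features, degree, prefix="X"):
    """Return one name per polynomial term, e.g. 'X1^2' or 'X1*X2'.

    Hypothetical helper: mirrors the naming in the snippet above,
    but does not depend on NumPy or scikit-learn.
    """
    names = []
    for d in range(degree + 1):
        # Each combo is a sorted tuple of feature indices, repeats allowed.
        for combo in combinations_with_replacement(range(n_features), d):
            if not combo:
                names.append("bias")  # the empty product is the constant term
                continue
            counts = Counter(combo)  # feature index -> exponent
            terms = []
            for idx in sorted(counts):
                exp = counts[idx]
                terms.append(f"{prefix}{idx + 1}" + (f"^{exp}" if exp > 1 else ""))
            names.append("*".join(terms))
    return names


print(polynomial_feature_names(3, 2))
```

Calling it with 3 features and degree 2 gives the ten names listed above, in the same order.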

maniteja123 · 2016-01-28T11:50:33Z

Thanks for clarifying. So the property expected is the output feature mapping in terms of the polynomials of input features. Will do that and let you know.

maniteja123 · 2016-01-28T16:59:03Z

Hi everyone, the suggestion by @aniryou seems to be solving the use case here. I don't think I have enough exposure in this domain to decide on the best approach to take. Please let me know if it would be ideal to go ahead with his idea. I will proceed as per the consensus reached here. Thanks.

amueller · 2016-02-16T20:36:50Z

sklearn/preprocessing/data.py

@@ -1151,6 +1151,10 @@ class PolynomialFeatures(BaseEstimator, TransformerMixin):
        features is computed by iterating over all suitably sized combinations
        of input features.

+    feature_names_ : list, shape [n_input_features_]
+        Represents the names of input features
+        It is of the form ``['X1', 'X2', 'X3'...]``


They should be lower-case. Upper case indicates matrices. Also, these are the output features, right?

jakevdp · 2016-02-16T21:01:59Z

I think @aniryou's version is the better one to use. Also, I agree that this should be a get_feature_names function, as this is already used in a couple of other places in the package, and is expected in a transformer by the pipeline code.
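
To make the method-versus-property point concrete, here is a rough sketch of what a `get_feature_names` method could look like on a fitted object. `PolyNamesSketch` is a hypothetical stand-in, not scikit-learn's `PolynomialFeatures`, and the optional `input_features` argument and the lower-case default names are assumptions for illustration only.

```python
from itertools import combinations_with_replacement


class PolyNamesSketch:
    """Hypothetical stand-in for a transformer; not scikit-learn's class."""

    def __init__(self, degree=2):
        self.degree = degree

    def fit(self, X):
        # Only the number of columns matters for naming.
        self.n_input_features_ = len(X[0])
        return self

    def get_feature_names(self, input_features=None):
        # A method (rather than a property) can take the caller's own names.
        if not hasattr(self, "n_input_features_"):
            raise RuntimeError("call fit before get_feature_names")
        if input_features is None:
            input_features = [f"x{i + 1}" for i in range(self.n_input_features_)]
        names = []
        for d in range(self.degree + 1):
            for combo in combinations_with_replacement(input_features, d):
                if not combo:
                    names.append("bias")
                    continue
                terms = []
                for name in dict.fromkeys(combo):  # unique names, order kept
                    exp = combo.count(name)
                    terms.append(name + (f"^{exp}" if exp > 1 else ""))
                names.append("*".join(terms))
        return names


est = PolyNamesSketch(degree=2).fit([[1.0, 2.0]])
print(est.get_feature_names())
print(est.get_feature_names(["age", "height"]))
```

A method also leaves room for the caller to supply real column names, which a read-only property cannot do.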

maniteja123 · 2016-02-17T02:43:52Z

Thanks for clarifying. It would indeed be better if someone experienced completes it. Sorry I didn't get to do the right thing here. Closing this as it is superseded by #6372.

ENH: Add feature_names_ property to PolynomialFeatures

48d5a92

amueller reviewed Feb 16, 2016
amueller mentioned this pull request Feb 16, 2016

[MRG+2] add get_feature_names to PolynomialFeatures #6372

Merged

maniteja123 closed this Feb 17, 2016
