-
-
Notifications
You must be signed in to change notification settings - Fork 26k
Implemented Supervised PCA #5196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
of the model for classification based on logistic regression. References: Bair, Eric, et al. "Prediction by supervised principal components." Journal of the American Statistical Association 101.473 (2006).
Thanks for the PR. Also, is this seems to be the same as It is not that easy to do for LogisticRegression, but it will be with #4242. |
Hello, It is similar but not equivalent. Supervised PCA also contains an extra step of filtering out useless attributes. So, the steps are
The first step is something that can require a few lines of code, so having a new model makes life a bit easier :) In the book "The Elements of Statistical Learning" there were some examples on where this technique would be better against elastic net. I am not sure if they contained examples that include LDA. I can look into it. Best regards, |
Sorry, I misread the code then. |
Ah, ok, you do drop the number of components. And they use univariate selection. So it is I don't think this particular pipeline deserves it's own estimator. |
It should probably be an example, then.
|
…hef, concordance correlation chef, example of concordance vs pearson Added the following: 1) Improved version of supervised PCA 2) Example of supervised PCA against LDA and QDA 3) Example of supervised PCA against elasticNet 4) Pearson and concordance correlation coefficients 5) Example where the concordance correlation coefficient can be better than Pearson
There's already a somewhat similar example on scikit-learn's documentation. It's only missing the PCA step. |
closing as no reply and no excitement. This is a pretty straight-forward pipeline imho. |
Implemented Supervised PCA algorithm by Bair et al. plus an extension of the model for classification based on logistic regression.
References: Bair, Eric, et al. "Prediction by supervised principal
components." Journal of the American Statistical Association 101.473
(2006).