Implemented shrinkage LDA classifier. #3105

cbrnr · 2014-04-24T13:02:26Z

@kazemakase and I implemented shrinkage LDA (see also my comment in #1649). Note that unlike lda.LDA, our implementation does only classification and not transformation (dimensionality reduction). Note that we are not using the existing implementation because shrinkage is not possible with SVD.

We would be happy if someone could review the code.

mbillingr · 2014-04-24T15:54:31Z

Here is an example of the shrinkage LDA in action:

(20 training samples, 100 test samples, 50 repetitions, normally distributed features)

@cle1109 the lambda function makes the build fail on python 2.
Any idea what's causing the TypeError: unorderable types: type() < type() on python 3?

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

sLDA: example, tests and fixes

coveralls · 2014-04-28T07:39:10Z

Coverage remained the same when pulling 2edb383 on cle1109:sldac into 463cea5 on scikit-learn:master.

larsmans · 2014-04-28T20:46:41Z

Does shrinkage mean regularization? (Statistics terminology is driven me crazy...)

larsmans · 2014-04-28T20:47:16Z

sklearn/slda.py

@@ -0,0 +1,235 @@
+"""
+The :mod:`sklearn.slda` module implements Shrinkage LDA.


If it's an LDA variant, it should go in sklearn.lda.

larsmans · 2014-04-28T20:51:36Z

Also, why is this a separate estimator? Can't this feature be added to the existing LDA class?

cbrnr · 2014-04-28T21:15:25Z

Yes, you are correct, shrinkage means regularization (we apply it to the covariance estimate). I can put it in sklearn.lda. I will fix the init function, and this should also be done for the original class in sklearn.lda(I took the code from there).

Concerning creating a separate estimator, I did not want to change the original class to avoid breaking anything. There are some problems when we also want to have LDA transformation with our implementation. To use shrinkage, we have to work with covariance estimates. The original LDA class uses SVD instead, which does have some advantages. For example, to compute the transformation, we would have to solve a generalized eigenvalue problem. In an underdetermined case, this works well (that's what shrinkage is used for), but it fails when using no shrinkage (eigh raises an exception I believe). In contrast, this does work with the current SVD implementation.

In addition, I'm not sure how to handle issue #1649 with our code. It looks like our implementation isn't robust to scaling either (same like the SVD implementation). However, with our implementation, this problem is not limited to the underdetermined case, but also seems to occur in the overdetermined case. This is not to say that the algorithm does not work; after all, it's the standard LDA algorithm described in Duda & Hart, which everyone seems to be using.

larsmans · 2014-04-28T21:18:56Z

You can avoid breaking working code by hardening the tests--I prefer new features to be added to existing estimators, as long as the classes don't become too heavy. Could you try to add the functionality to the existing class to see if it works?

amueller · 2014-04-29T02:04:10Z

Maybe a stupid question, but could you explain why it is not possible to use shrinkage in the SVD implementation by @ksemb?

cbrnr · 2014-04-29T06:30:58Z

Because we shrink covariance matrices used in the generalized eigenvalue problem. SVD does not work on covariance matrices, but instead decomposes the data directly into singular values.

It would be great if we could use shrinkage with SVD, but I believe this is not possible (correct me if I'm wrong).

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

mbillingr · 2014-04-29T07:26:27Z

The reason we put the implementation in it's own class and module (for now) is that we made two fundamental changes to the original LDA.

Different API: lda.LDA is both, a classifier and a transform, while slda.SLDA is a classifier only.
Different implementation: SVD vs. covariance estimation

Once we know how to tackle these differences we can merge the different LDAs according to your guidlines.

Would be great to know if SVD can be shrunk somehow, but I don't believe that's possible either.

cbrnr · 2014-04-29T07:32:57Z

Still we could put the SLDA class in the lda module. However, because of these issues, it would be very cumbersome to merge everything into one class. Even creating a base LDA class would not be feasible, because we couldn't reuse anything.

How would you like us to proceed?

cbrnr · 2014-06-03T06:44:36Z

@ksemb, memory consumption could be an issue. What is a typical ballpark number of features in your field?

Concerning your second argument, I don't think that non-diagonal regularization is limited to SVD. In the case of Ledoit-Wolf shrinkage, we only modify the diagonal, but we could use other regularization methods (such as Tikhonov regularization).

Do you have a suggestion on how we could move forward with the two LDA classifier implementations? I am kind of confused, because there seem to be two issues at the moment:

SVD vs. covariance-based algorithms
Issue Univariate variate scaling in LDA #1649

I think we should focus on the first point here, and discuss the second issue separately.

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

…sldac Conflicts: sklearn/tests/test_lda.py

mmbannert · 2014-06-27T09:02:25Z

I just wanted to briefly point out that I'm very glad that someone wrote code for carrying out shrinkage LDA in scikit-learn. Before I found this thread, I tried implementing it myself but I was not familiar with the implementation of LDA via SVD instead of explicit covariance matrices. If it isn't possible to perform shrinkage in the SVD scenario, I'd strongly favor the algorithm that calculates covariance matrices (because, from a pragmatic point of view, if the data are big, there is no benefit in computational efficiency if the estimators are unstable)

mblondel · 2014-08-11T10:24:22Z

I think automatic selection of the regularization constant by cross-validation is also useful. What you really want is to maximize classification accuracy on unseen data (generalization performance). I would thus add an alpha parameter (like other regularized classifiers) which can take a numerical value (e.g. alpha=1e-3) or a string (e.g. alpha="ledoit-wolf").

mbillingr · 2014-08-11T10:47:23Z

This would certainly make sense. Note that we no longer work on this PR, but continue in #3523. Could you move your suggestion over there, please?

Btw, I vaguely remember that Blankertz proofed in his shrinkage-LDA paper that the ledoit-wolf solution is optimal. If that is case cross-validation would probably not be required.

mblondel · 2014-08-11T10:53:54Z

I'd guess it is optimal on training but not on test data, isn't it? CV uses validation data so the result would be different.

mbillingr · 2014-08-11T11:40:57Z

Well, in that case 100% overfitting would be optimal :)

Sorry, I don't remember the details, but I'm sure that guy knew what he did. Anyway, I might be wrong. In that case a numerical parameter would certainly be useful.

mblondel · 2014-08-11T12:33:15Z

What is the title of the paper?

mbillingr · 2014-08-11T13:26:42Z

Blankertz et al. "Single-Trial Analysis and Classification of ERP Components - a Tutorial", NeuroImage, 2010.

I looked at the paper again and have to admit I was mistaken. There is no proof, and they even state that cross-validation could get give better results :) Sorry for the fuss.

cbrnr · 2014-08-12T06:53:45Z

I made a new comment in #3523, please continue the discussion over there.

Clemens Brunner and others added 3 commits April 24, 2014 14:50

Implemented shrinkage LDA classifier.

a9ab33d

sLDA example

db786bc

lda.score() instead of manual accuracy calculation

e20dc6b

Martin and others added 6 commits April 26, 2014 13:39

Changed class name from LDA to SLDA

9e700ba

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

Replaced lambda with a named function

03653c1

Added more tests for improved coverage

dedb370

Merge pull request #1 from kazemakase/sldac

06f0494

sLDA: example, tests and fixes

Updated example and style.

02c0ae8

Fixed Python 2 division issue.

2edb383

larsmans reviewed Apr 28, 2014
View reviewed changes

Clemens Brunner and others added 8 commits April 29, 2014 09:25

Implemented shrinkage LDA classifier.

52fbac2

sLDA example

0efc4df

lda.score() instead of manual accuracy calculation

657edd7

Changed class name from LDA to SLDA

db6f4df

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

Replaced lambda with a named function

ab272ff

Added more tests for improved coverage

7d94c7f

Updated example and style.

38849d6

Fixed Python 2 division issue.

7681662

Clemens Brunner and others added 17 commits June 3, 2014 14:01

Implemented shrinkage LDA classifier.

86384b5

sLDA example

ce3b119

lda.score() instead of manual accuracy calculation

6353b0a

Changed class name from LDA to SLDA

176470f

The duplicate LDA (lda.LDA and slda.LDA) caused problems with some automated tests.

Replaced lambda with a named function

b9753b0

Added more tests for improved coverage

f88231e

Updated example and style.

507e47e

Fixed Python 2 division issue.

b096de5

Implemented shrinkage LDA classifier.

a3d7303

sLDA example

1285ac0

Replaced lambda with a named function

1ccb555

Added more tests for improved coverage

79fb3a6

Updated tests to reflect changes in input validation.

7f3d22b

Renamed SLDA to ShrinkageLDA and moved it to lda.py.

dd2244d

Merge branch 'sldac' of https://github.com/cle1109/scikit-learn into …

a9ad7be

…sldac Conflicts: sklearn/tests/test_lda.py

Corrected doctest.

10b1d6f

Corrected errors introduced by manual merge.

73999e2

amueller mentioned this pull request Jul 29, 2014

Bug in the scikit-learn LDA function - results with non-zero correlation #3500

Closed

cbrnr mentioned this pull request Aug 1, 2014

[MRG+1] LDA refactoring #3523

Merged

cbrnr closed this Aug 1, 2014

cbrnr deleted the sldac branch September 22, 2014 14:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented shrinkage LDA classifier. #3105

Implemented shrinkage LDA classifier. #3105

cbrnr commented Apr 24, 2014

mbillingr commented Apr 24, 2014

coveralls commented Apr 28, 2014

larsmans commented Apr 28, 2014

larsmans Apr 28, 2014

larsmans commented Apr 28, 2014

cbrnr commented Apr 28, 2014

larsmans commented Apr 28, 2014

amueller commented Apr 29, 2014

cbrnr commented Apr 29, 2014

mbillingr commented Apr 29, 2014

cbrnr commented Apr 29, 2014

cbrnr commented Jun 3, 2014

mmbannert commented Jun 27, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

cbrnr commented Aug 12, 2014

		@@ -0,0 +1,235 @@
		"""
		The :mod:`sklearn.slda` module implements Shrinkage LDA.

Implemented shrinkage LDA classifier. #3105

Implemented shrinkage LDA classifier. #3105

Conversation

cbrnr commented Apr 24, 2014

mbillingr commented Apr 24, 2014

coveralls commented Apr 28, 2014

larsmans commented Apr 28, 2014

larsmans Apr 28, 2014

Choose a reason for hiding this comment

larsmans commented Apr 28, 2014

cbrnr commented Apr 28, 2014

larsmans commented Apr 28, 2014

amueller commented Apr 29, 2014

cbrnr commented Apr 29, 2014

mbillingr commented Apr 29, 2014

cbrnr commented Apr 29, 2014

cbrnr commented Jun 3, 2014

mmbannert commented Jun 27, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

mblondel commented Aug 11, 2014

mbillingr commented Aug 11, 2014

cbrnr commented Aug 12, 2014