[MRG+1] LDA refactoring #3523

Merged
merged 56 commits into scikit-learn:master from cle1109:slda on Dec 19, 2014

Conversation

@cbrnr (Contributor) commented Aug 1, 2014

OK, here is a new version of the LDA class with (optional) shrinkage. The one in #3105 was already very messy, so I thought I'd start from scratch. By default, nothing has changed and the old SVD-based code is used.

@ogrisel (Member) commented Aug 1, 2014

This PR will need to be rebased on top of the current master, and the input validation needs to be updated to use the new utilities, see:

Also, as @larsmans said in the previous PR, the narrative doc in ./doc/modules/lda_qda.rst needs to be updated to highlight this new option.

Finally, it would be great to write an example in the examples folder that plots the impact of using shrinkage on prediction accuracy, and to use that plot in the narrative documentation.
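
A rough sketch of what such an example could look like (an assumption on my part: this uses the modern class name LinearDiscriminantAnalysis and the solver/shrinkage API the PR later converged on, plus a purely illustrative synthetic dataset):

```python
# Sketch: compare LDA test accuracy with and without Ledoit-Wolf shrinkage
# when the number of features is large relative to the number of samples.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.RandomState(0)
n_train, n_test, n_features = 20, 200, 100

def make_data(n_samples):
    # Two classes that differ only in the mean of the first feature.
    X = rng.randn(n_samples, n_features)
    y = rng.randint(2, size=n_samples)
    X[y == 1, 0] += 2.0
    return X, y

X_train, y_train = make_data(n_train)
X_test, y_test = make_data(n_test)

plain = LinearDiscriminantAnalysis(solver='lsqr', shrinkage=None)
shrunk = LinearDiscriminantAnalysis(solver='lsqr', shrinkage='auto')

print("no shrinkage:", plain.fit(X_train, y_train).score(X_test, y_test))
print("Ledoit-Wolf :", shrunk.fit(X_train, y_train).score(X_test, y_test))
```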

Once all these changes are made, please change the title of this PR from [WIP] to [MRG] for final review.

@ogrisel ogrisel changed the title New LDA class with shrinkage [WIP] New LDA class with shrinkage Aug 1, 2014
@mbillingr (Contributor)

@cle1109 I've updated examples/plot_lda.py from the old PR to use the new LDA API: 2ef847f.
You can pull it from the slda branch in my fork.

@cbrnr cbrnr changed the title [WIP] New LDA class with shrinkage [MRG] New LDA class with shrinkage Aug 5, 2014
@coveralls (Coverage Status)

Changes Unknown when pulling 5a18f80 on cle1109:slda into scikit-learn:master.

@cbrnr (Contributor, Author) commented Aug 5, 2014

OK, I think this should be it, everything should be done.

@coveralls (Coverage Status)

Changes Unknown when pulling e610f87 on cle1109:slda into scikit-learn:master.

@coveralls (Coverage Status)

Changes Unknown when pulling 64dc88b on cle1109:slda into scikit-learn:master.

Shrinkage LDA can be used by setting the ``use_covariance`` parameter of the
:class:`lda.LDA` class to 'ledoitwolf'. This automatically determines the
optimal shrinkage parameter in an analytic way following the lemma introduced by
Ledoit and Wolf.
Member: add ref here

Member: Forget it, it's right below. My bad.
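
For context, the analytic shrinkage referenced in the doc snippet above is the Ledoit-Wolf estimator, which scikit-learn also exposes standalone; a minimal sketch of what it computes (the reconstruction formula below reflects my understanding of the estimator, not code from this PR):

```python
# Sketch: ledoit_wolf returns a shrunk covariance matrix and the analytically
# determined shrinkage intensity in [0, 1].
import numpy as np
from sklearn.covariance import ledoit_wolf

rng = np.random.RandomState(0)
X = rng.randn(30, 50)  # more features than samples: the empirical covariance is singular

shrunk_cov, shrinkage = ledoit_wolf(X)
emp_cov = np.cov(X, rowvar=False, bias=True)

# The shrunk estimate blends the empirical covariance with a scaled identity:
#   shrunk_cov = (1 - shrinkage) * emp_cov + shrinkage * mu * I,  mu = trace(emp_cov) / n_features
mu = np.trace(emp_cov) / X.shape[1]
reconstructed = (1 - shrinkage) * emp_cov + shrinkage * mu * np.eye(X.shape[1])
print("shrinkage intensity:", shrinkage)
print("matches formula:", np.allclose(shrunk_cov, reconstructed))
```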

@coveralls (Coverage Status)

Changes Unknown when pulling 807c83f on cle1109:slda into scikit-learn:master.

# centered group data
Xgc = Xg - meang
Xc.append(Xgc)
if self.use_covariance is None:
Member: I would extract the two solvers as private functions (or private methods if you need access to the object).

@cbrnr (Contributor, Author) commented Aug 12, 2014

OK, so here's my take on the discussions in #3500 and #3105 (might I suggest moving the discussion here, since this is where our latest code is).

I really like the idea of introducing solver and alpha parameters. The default value for solver could be SVD (the current implementation, with the bug in the transform method fixed). The other option could be covariance. The alpha parameter should work in both cases, but the value ledoit_wolf only makes sense in the latter.

I support the idea of introducing the possibility to optimize the parameter via cross-validation.

The other question is how close to the textbook we want to be in the covariance method. As @kazemakase pointed out, we don't need to solve the eigenvalue problem for classification. That's also how most algorithms are implemented, because inverting the between scatter matrix could be a problem (see also the discussion at Cross Validated).

So in principle we could go ahead and start implementing it according to all suggestions here. The only thing that needs to be clarified first is how we do both classification and transform in the covariance method.

@mblondel mblondel changed the title [MRG] New LDA class with shrinkage [WIP] LDA refactoring Aug 12, 2014
@mblondel (Member)

You need some sort of eigen/singular value decomposition for the transform, but you can fit the classifier with a matrix inverse or an equation solver.
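
A minimal numpy/scipy sketch of that idea (following the standard textbook form of the LDA decision function, not necessarily the exact code in this PR): prediction only needs the solution of a linear system with the shared covariance, while transform would additionally require an eigendecomposition.

```python
# Sketch: fit the LDA classifier by solving a linear system (the "lsqr" idea);
# no eigendecomposition is needed for prediction.
import numpy as np
from scipy import linalg

def fit_lda_lsqr(X, y, priors=None):
    classes = np.unique(y)
    if priors is None:
        priors = np.array([np.mean(y == c) for c in classes])
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    # Pooled within-class covariance.
    Xc = np.vstack([X[y == c] - means[i] for i, c in enumerate(classes)])
    cov = Xc.T @ Xc / (len(X) - len(classes))
    # Solve cov @ coef.T = means.T instead of inverting cov explicitly.
    coef = linalg.lstsq(cov, means.T)[0].T
    intercept = -0.5 * np.einsum('ij,ij->i', means, coef) + np.log(priors)
    return classes, coef, intercept

def predict_lda(X, classes, coef, intercept):
    return classes[np.argmax(X @ coef.T + intercept, axis=1)]
```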

Instead of introducing a fit_transform option as I previously suggested, I think we could introduce two solvers covariance-lsqr and covariance-eigen. The transform method would only be available if covariance-eigen was used.

We could also add a covariance-cg solver which uses conjugate gradient instead of lsqr for solving the least squares problem / system of equations. This should make it possible to solve the problem without materializing the covariance matrix in memory (see the _solve_sparse_cg function in the Ridge module). This can of course be addressed in another PR.
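
A sketch of that matrix-free idea (the function below is illustrative; only _solve_sparse_cg exists in the Ridge module): conjugate gradient only needs covariance-vector products, which can be evaluated from the centered data matrix directly.

```python
# Sketch: solve cov @ w = b with conjugate gradient, computing the product
# cov @ v as Xc.T @ (Xc @ v) / (n_samples - 1) without forming cov itself.
import numpy as np
from scipy.sparse.linalg import LinearOperator, cg

def solve_without_covariance(Xc, b):
    n_samples, n_features = Xc.shape

    def matvec(v):
        # In practice a shrinkage/regularization term would be added here to
        # keep the system well conditioned when n_samples < n_features.
        return Xc.T @ (Xc @ v) / (n_samples - 1)

    op = LinearOperator((n_features, n_features), matvec=matvec)
    w, info = cg(op, b)
    return w
```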

(I took the liberty to change the title of this PR and add a TODO list.)

@mblondel (Member)

Actually, we can drop the covariance- prefix. This way, the names will be consistent with the Ridge module.

@mblondel (Member)

Just svd. The fact that lsqr and eigen use the covariance matrix can be documented in the docstring. In the Ridge module, we give a short description of each solver in the docstring:
https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/linear_model/ridge.py#L215.

@cbrnr (Contributor, Author) commented Aug 12, 2014

OK! (I deleted my comment because I saw it in the todo list at the top)

@mbillingr (Contributor)

I have a comment on one of the items in the TODO list:

test that solvers return the same results (up to numerical errors)

Numerical errors can blow up and cause big differences in classification results if the feature space is high-dimensional. That is because these errors cause a small amount of uncertainty in the orientation of the hyperplane. The volume of this uncertainty increases exponentially with dimensionality. The bigger this volume, the higher the chance of unseen samples lying on opposite sides of the hyperplane in different implementations.

In short, classification results can differ noticeably due to numerical errors. Something to keep in mind when designing the tests.
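
One way to write such a test (assuming the modern class name LinearDiscriminantAnalysis and the svd/lsqr/eigen solver names discussed above; it compares predictions rather than raw coefficients, to stay robust to the hyperplane-orientation issue described here):

```python
# Sketch: check that the solvers agree up to a small tolerance on
# low-dimensional, well-separated data where orientation uncertainty is small.
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def test_solvers_agree():
    X, y = make_blobs(n_samples=200, centers=3, n_features=5, random_state=0)
    preds = {}
    for solver in ('svd', 'lsqr', 'eigen'):
        preds[solver] = LinearDiscriminantAnalysis(solver=solver).fit(X, y).predict(X)
    # Allow a small fraction of disagreements near the decision boundary
    # instead of requiring exact equality.
    for solver in ('lsqr', 'eigen'):
        assert np.mean(preds[solver] != preds['svd']) < 0.01
```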


Parameters
----------
n_components : int
    Number of components (< n_classes - 1) for dimensionality reduction

priors : array, optional, shape = [n_classes]
    Priors on classes

Member: Docstring for intercept_ is missing. This is an array of size n_classes.

@cbrnr (Contributor, Author) commented Dec 16, 2014

@agramfort, I am very confident now that the code does what it is supposed to do :-). The minor issue of differently scaled scalings_ should be addressed in a follow-up PR (it doesn't affect the functionality, but for consistency it would be nice if both solvers produced the same coefficients). What do you think (in particular @ogrisel)?

@amueller (Member)

I think we should merge this. Maybe squash the commits; there are quite a few.

@cbrnr (Contributor, Author) commented Dec 17, 2014

Do you want me to squash the commits (into only one?), or will you do that?

@agramfort (Member)

@amueller?

I don't mind pushing these commits as they are... let's merge this ASAP.

@GaelVaroquaux (Member)

I don't mind pushing these commits as they are... let's merge this ASAP.

+1

@amueller (Member)

OK, it looks like we don't do squashing any more... I felt that was helpful for bugfix releases, but OK.

amueller added a commit that referenced this pull request Dec 19, 2014
[MRG+1] LDA refactoring
@amueller amueller merged commit 1aa5a9f into scikit-learn:master Dec 19, 2014
@GaelVaroquaux (Member)

I felt that was helpful for bugfix releases but ok....

I agree with you. I think that we should encourage it, but I don't feel that it is mandatory.

@agramfort (Member)

🍻 @cle1109 !

@cbrnr (Contributor, Author) commented Dec 20, 2014

😄 - awesome! Thank you guys, I really learned a lot while working on this PR. Looking forward to contributing more soon!

@amueller (Member)

This fixes #1649, right? Or only for the new solvers?

@agramfort (Member)

I think so

@amueller (Member)

@agramfort, is that in reply to "fixes #1649" or to "only for the new solvers"?

@amueller amueller mentioned this pull request Oct 9, 2015