[MRG+1] FIX consistency of memory layout for linear CD solver #5337


Merged

Conversation

@ogrisel (Member) commented Oct 2, 2015

This should fix #5013.

This PR fixes several related consistency issues in the handling of the memory layout in models using the linear coordinate descent solvers (with or without a precomputed Gram matrix).

Also, I made the expectations w.r.t. memory layout explicit in the Cython prototypes directly, which should prevent re-introducing regressions in the future.

@arthurmensch I would appreciate a review on this, as I touched a lot of code you changed when introducing the ability to skip input checks. In particular, if you have your benchmark scripts at hand, it would be great if you could check that I do not re-introduce unwanted redundant input checks.
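As an illustration of the layout contract being made explicit, here is a minimal NumPy-level sketch (a hypothetical helper, not the PR's actual Cython prototypes, whose real expectations live in the .pyx signatures): enforce the expected order at the call boundary instead of assuming it inside the solver.

    import numpy as np

    def as_solver_input(X, order='F'):
        # Hypothetical helper: make the memory-layout expectation explicit
        # at the boundary. np.asarray with an explicit `order` is a no-op
        # when the array already matches and copies otherwise.
        X = np.asarray(X, dtype=np.float64, order=order)
        flag = 'F_CONTIGUOUS' if order == 'F' else 'C_CONTIGUOUS'
        assert X.flags[flag]
        return X

    X = np.random.rand(10, 3)            # C-ordered by default
    X_f = as_solver_input(X, order='F')  # copied into Fortran order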

@ogrisel ogrisel added this to the 0.17 milestone Oct 2, 2015
@ogrisel (Member, Author) commented Oct 2, 2015

BTW, I still get a memory error reported by valgrind when I use OpenBLAS from Ubuntu 14.04, but I think this is in OpenBLAS itself, and it does not seem to cause an error at our level. There might be a bug in OpenBLAS, but I think this PR fixes the issue reported in #5013.

@ogrisel (Member, Author) commented Oct 2, 2015

Travis revealed that the contiguity / ordering depends on the version of numpy. I am on it.
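For context, a small sketch of the pattern in question: whether the result of this kind of double fancy indexing is C-contiguous has not been stable across numpy versions, so the robust move is to test the flag (or force a copy) rather than assume it.

    import numpy as np

    a = np.arange(16, dtype=np.float64).reshape(4, 4)
    mask = np.arange(4) != 2
    sub = a[mask].T[mask]             # double fancy indexing, as in graph_lasso
    print(sub.flags['C_CONTIGUOUS'])  # True or False depending on numpy version
    sub = np.ascontiguousarray(sub)   # robust: copies only when needed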

@ogrisel (Member, Author) commented Oct 2, 2015

Should be fixed. My last commit might reveal a bug in graph lasso with old versions of numpy...

@agramfort (Member)

LGTM

@arthurmensch are you ok?

@arthurmensch (Contributor)

I'll review and bench it this evening.

@ogrisel ogrisel changed the title [MRG] FIX consistency of memory layout for linear CD solver [MRG+1] FIX consistency of memory layout for linear CD solver Oct 4, 2015
@ogrisel ogrisel force-pushed the fix-coordinate-descent-memory-layout branch from 0b49e26 to c3a3e69 on October 4, 2015 16:38
check_input : bool, default True
    Skip input validation checks, including on the Gram matrix when provided,
    assuming they are handled by the caller when check_input=False.

Contributor:

OK, so I guess it is okay to present this flag to the user?

Member (Author):

Yes, I think so: check_input flags are already present in several other functions/methods in the scikit-learn code base.
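For readers unfamiliar with the pattern, a minimal sketch of how such a flag is typically used (a hypothetical toy solver, not scikit-learn's actual API): validate once up front, then skip the per-call checks in a hot loop.

    import numpy as np
    from sklearn.utils import check_array

    def toy_solver(X, y, check_input=True):
        # Hypothetical solver: validation is skipped when the caller
        # guarantees the inputs are already checked.
        if check_input:
            X = check_array(X, dtype=np.float64, order='F')
            y = check_array(y, ensure_2d=False, dtype=np.float64)
        return np.linalg.lstsq(X, y, rcond=None)[0]

    rng = np.random.RandomState(0)
    X = check_array(rng.rand(20, 3), dtype=np.float64, order='F')  # checked once
    y = rng.rand(20)
    for _ in range(10):                          # hot loop: no redundant checks
        w = toy_solver(X, y, check_input=False)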

@arthurmensch (Contributor)

I see no performance regression on dictionary learning, so 👍

Looking back on _pre_fit, I feel that this line is not very clean

    if hasattr(precompute, '__array__') and (
            fit_intercept and not np.allclose(X_mean, np.zeros(n_features))
            or normalize and not np.allclose(X_std, np.ones(n_features))):

despite being important for performance. But that is another issue.
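One possible cleaner spelling of the same test (a sketch of a refactor, not merged code): the user-supplied Gram matrix is only reusable when X was not re-centered or re-scaled, so naming those conditions makes the intent explicit.

    import numpy as np

    def gram_is_stale(precompute, X_mean, X_std, fit_intercept, normalize):
        # A precomputed Gram matrix no longer matches X once X has been
        # centered (fit_intercept) or scaled (normalize) during _pre_fit.
        is_gram = hasattr(precompute, '__array__')
        recentered = fit_intercept and not np.allclose(X_mean, 0.0)
        rescaled = normalize and not np.allclose(X_std, 1.0)
        return is_gram and (recentered or rescaled)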

@ogrisel (Member, Author) commented Oct 5, 2015

Great, thanks for the reviews; merging. I will not update what's new, as this issue was not present in 0.16.

ogrisel added a commit that referenced this pull request Oct 5, 2015

[MRG+1] FIX consistency of memory layout for linear CD solver
@ogrisel ogrisel merged commit 1025982 into scikit-learn:master Oct 5, 2015
@@ -205,7 +205,8 @@ def graph_lasso(emp_cov, alpha, cov_init=None, mode='cd', tol=1e-4,
     d_gap = np.inf
     for i in range(max_iter):
         for idx in range(n_features):
-            sub_covariance = covariance_[indices != idx].T[indices != idx]
+            sub_covariance = np.ascontiguousarray(
+                covariance_[indices != idx].T[indices != idx])
Member:

Shouldn't this be

mask = indices != idx
covariance_[np.ix_(mask, mask)]
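A quick sanity check (a sketch) that the two spellings pick the same submatrix for a symmetric matrix, and that np.ix_ already yields a C-contiguous copy in a single indexing operation:

    import numpy as np

    rng = np.random.RandomState(0)
    A = rng.rand(5, 5)
    covariance_ = A + A.T                  # covariance matrices are symmetric
    indices = np.arange(5)
    idx = 2
    mask = indices != idx
    old = np.ascontiguousarray(covariance_[mask].T[mask])
    new = covariance_[np.ix_(mask, mask)]  # one indexing op, contiguous result
    assert np.allclose(old, new)
    assert new.flags['C_CONTIGUOUS']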

Member:
@MechCoder do you want to see if that is better and do a quick PR?

@amueller (Member)

A deterministic regression test would be nice.
Thanks so much for fixing this!
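For instance, a test along these lines could pin the behavior down (a sketch of the idea, not the test ultimately added): fit the same data in C and Fortran order and require identical coefficients.

    import numpy as np
    from sklearn.linear_model import Lasso

    def test_memory_layout_invariance():
        rng = np.random.RandomState(42)
        X = rng.rand(30, 4)
        y = rng.rand(30)
        # The fitted coefficients must not depend on the input layout.
        coef_c = Lasso(alpha=0.1).fit(np.ascontiguousarray(X), y).coef_
        coef_f = Lasso(alpha=0.1).fit(np.asfortranarray(X), y).coef_
        np.testing.assert_allclose(coef_c, coef_f)

    test_memory_layout_invariance()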

Development

Successfully merging this pull request may close these issues.

Random segfault under windows in sklearn.decomposition.tests.test_sparse_pca.test_fit_transform
5 participants