Change graph_lasso to exploit block diagonal structure #4787

bnaul · 2015-05-29T04:17:21Z

I took a stab at implementing the optimization described here: the block diagonal structure of the graphical lasso solution can be identified by thresholding the sample covariance, and the exact solution is found by solving the graphical lasso for each block separately. The authors find that there is a huge speedup when the solution is very sparse; when the solution is mostly dense, the results are basically the same (or very slightly slower due to the extra thresholding step). This modification was made in the glasso R package some time ago; timing results are given in the paper above, but I also ran a test of my implementation with p=1000, n=100 and a block diagonal population covariance matrix.

Couple of questions:

the glasso R package was changed to only use this algorithm, so I followed the same convention and did not allow the user to choose whether to perform the block diagonal screening procedure. It would be very easy to add this, I'm just not sure if there's a case where it would ever be desired.
There's a bug in our connected_components function that was fixed a while ago in scipy (BUG: make a fully connected csgraph from an ndarray with no zeros scipy/scipy#3819). I included this in my commit, but maybe it should be a separate pull request?
Does the overall logic make sense here? I only added a couple of comments but if it's not clear what's going on then I can try to clarify.

amueller · 2015-06-01T17:31:26Z

Thanks for the PR. This looks like a great addition. We have a bit of a backlog on contributions, unfortunately. Maybe @GaelVaroquaux can find time to look at this one, as he created the original estimator.

bnaul · 2015-06-01T17:46:48Z

No problem, obviously nothing urgent.

On Mon, Jun 1, 2015 at 10:32 AM Andreas Mueller notifications@github.com
wrote:

Thanks for the PR. This looks like a great addition. We have a bit of a
backlog on contributions, unfortunately. Maybe @GaelVaroquaux
https://github.com/GaelVaroquaux can find time to look at this one, as
he created the original estimator.

—
Reply to this email directly or view it on GitHub
#4787 (comment)
.

NelleV · 2016-08-09T04:48:59Z

Hi Brett,

Do you mind rebasing master onto the branch? The PR is kind of old :)

Cheers,
N

agramfort · 2017-06-06T16:03:52Z

@bnaul can you rebase? I will have a look this week

bnaul · 2017-06-14T22:31:26Z

@agramfort rebased! Travis is only unhappy because of flake8 failures but those are all just lines that were already too long in the old version.

NelleV · 2017-06-20T19:27:07Z

@agramfort any luck in reviewing this?

agramfort · 2017-06-20T19:43:06Z

no luck soon... sprint is over and priorities changed :(

The block diagonal components of the graphical lasso solution can be identified by thresholding the same covariance matrix; the solution can then be found by solving a subproblem corresponding to each component, which can be much faster if the graph is very sparse. See http://faculty.washington.edu/dwitten/Papers/jcgs.2011.pdf for details.

stevendbrown · 2017-10-01T00:24:58Z

@bnaul Testing this modification with my completely-non-zero 1288x1288 input matrix has a runtime ~2x higher than the current master version of the function, and spends 80% of the runtime on the sub_covariance = np.ascontiguousarray(...) call. I am trying to merge this version of graph_lasso with the changes in #9858, but you will likely be able to implement the fix to memory allocation time better than I.

bnaul force-pushed the block_glasso branch from ed1227e to 1b9349a Compare May 29, 2015 04:40

amueller added the Enhancement label Jun 1, 2015

bnaul force-pushed the block_glasso branch from 1b9349a to 2a06cc5 Compare May 18, 2016 21:22

bnaul force-pushed the block_glasso branch from 2a06cc5 to cafba6b Compare August 9, 2016 05:10

bnaul force-pushed the block_glasso branch 3 times, most recently from 406f6f3 to cfdb3fd Compare June 14, 2017 22:30

bnaul force-pushed the block_glasso branch from cfdb3fd to 17d70df Compare July 13, 2017 17:43

stevendbrown mentioned this pull request Sep 30, 2017

[MRG+1] Reduce runtime of graph_lasso #9858

Merged

github-actions bot added the module:covariance label Mar 2, 2020

cmarmo added Needs Benchmarks A tag for the issues and PRs which require some benchmarks help wanted labels Sep 21, 2020

Base automatically changed from master to main January 22, 2021 10:48

bnaul closed this Nov 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Change graph_lasso to exploit block diagonal structure #4787

Change graph_lasso to exploit block diagonal structure #4787

Uh oh!

bnaul commented May 29, 2015

Uh oh!

amueller commented Jun 1, 2015

Uh oh!

bnaul commented Jun 1, 2015

Uh oh!

NelleV commented Aug 9, 2016

Uh oh!

agramfort commented Jun 6, 2017

Uh oh!

bnaul commented Jun 14, 2017

Uh oh!

NelleV commented Jun 20, 2017

Uh oh!

agramfort commented Jun 20, 2017 via email

Uh oh!

stevendbrown commented Oct 1, 2017 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Change graph_lasso to exploit block diagonal structure #4787

Change graph_lasso to exploit block diagonal structure #4787

Uh oh!

Conversation

bnaul commented May 29, 2015

Uh oh!

amueller commented Jun 1, 2015

Uh oh!

bnaul commented Jun 1, 2015

Uh oh!

NelleV commented Aug 9, 2016

Uh oh!

agramfort commented Jun 6, 2017

Uh oh!

bnaul commented Jun 14, 2017

Uh oh!

NelleV commented Jun 20, 2017

Uh oh!

agramfort commented Jun 20, 2017 via email

Uh oh!

stevendbrown commented Oct 1, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

stevendbrown commented Oct 1, 2017 •

edited

Loading