Rebase of barnes-hut TSNE #4887

amueller · 2015-06-22T19:55:24Z

Trying to rebase #4025.

amueller · 2015-08-03T20:53:24Z

If this implements a transform method, it should inherit from TransformerMixin which will give it additional tests.

gatapia · 2015-08-06T05:54:23Z

I'm trying this pull request out just to see if it reduces memory usage, and I'm not having any luck. This is the code I'm using; you'll see that my data is pretty conservative:

X = manifold.TSNE(2, method='barnes_hut').fit_transform(np.random.random(size=(40000, 2)))

This eats up 32GB of RAM in short order. I'm pretty sure I applied the pull request correctly; otherwise method='barnes_hut' would throw an error, no?

I was under the impression that the barnes hut implementation was much more memory friendly, am I correct?

Edit: this is on Windows 64bit

amueller · 2015-08-06T14:58:44Z

@gatapia in principle yes, but unfortunately not in this PR. There is still work to be done to use sparse matrices to reduce the memory footprint. In its current form, it "only" speeds up the computation by about one or two orders of magnitude. Any help with moving to sparse matrices is very welcome.
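
For a rough sense of why dense storage hurts at this size, here is a back-of-the-envelope sketch. It only does arithmetic and does not touch the PR's code. The helper names, the float64 / int32 CSR layout, and the choice of 90 neighbors per point (three times a perplexity of 30) are assumptions made for illustration.

```python
# Rough memory estimate: dense n x n matrix vs. sparse k-nearest-neighbor
# matrix. Helper names and the CSR layout are illustrative assumptions.

def dense_bytes(n_samples, itemsize=8):
    """Bytes needed for one dense n x n array of float64 values."""
    return n_samples * n_samples * itemsize


def sparse_knn_bytes(n_samples, n_neighbors, itemsize=8, index_itemsize=4):
    """Bytes needed for a CSR matrix with n_neighbors entries per row."""
    nnz = n_samples * n_neighbors
    data = nnz * itemsize                       # stored values
    indices = nnz * index_itemsize              # column index per value
    indptr = (n_samples + 1) * index_itemsize   # row pointers
    return data + indices + indptr


n = 40000  # number of samples in the report above
k = 90     # assumed neighbors per point (3 * perplexity of 30)

print("dense : %.1f GB" % (dense_bytes(n) / 1e9))
print("sparse: %.1f MB" % (sparse_knn_bytes(n, k) / 1e6))
```

Under these assumptions a single dense 40000 x 40000 array takes about 12.8 GB, so a few such arrays alive at once would be in the range of the 32GB reported above, while the sparse neighbor matrix is on the order of tens of megabytes.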

amueller · 2015-09-09T21:49:42Z

I'd really like to have this in 0.17. I removed the transform method because I'm not sure what the API should be there. While the current version could be improved, it gives 10x-100x speedups. I'll post some benchmarks soon.

amueller · 2015-09-09T22:17:43Z

amueller · 2015-09-09T22:18:37Z

On 5 classes of the digits it's a factor of 2, on all classes it's a factor of 3, and for the s-curve it's a factor of 10.

amueller · 2015-09-09T22:21:22Z

Not sure I'm entirely convinced the new example says a lot, though.

amueller · 2015-09-10T19:57:35Z

checks pass :) I'm thinking about polishing or removing the faces example, otherwise this should be good.

ogrisel · 2015-09-11T08:46:13Z

> I'm thinking about polishing or removing the faces example, otherwise this should be good.

+1: either simplify it to only keep the best / fastest performing model or just remove it (a polished version can be re-added later).

ogrisel · 2015-09-11T08:47:45Z

In particular, it's important to have some explanation that analyzes the obtained results: are they good, and is the identity or the pose of the faces preserved by the 2D embedding? Here it's not very clear to me.

amueller · 2015-09-11T15:15:44Z

What do you think about the rest?

ogrisel · 2015-09-11T16:07:06Z

I think it's already a good improvement upon what we currently have in master. We can work on the memory usage fix in another PR.

ogrisel · 2015-09-11T16:07:35Z

+1 for merge once the face example is removed (or simplified and explained).

amueller · 2015-09-11T17:18:50Z

OK I'll remove it for now and merge.

Renamed example file & added comments
FEAT Barnes-Hut t-SNE
Vectorized the math calls -- 3x faster on neg grad
Draft of n-dimensional barnes-hut; failing tests
Tests pass, error being computed in cython
Added error calculation to pos gradient
Changed cases where points are very close to be clearer
Clearer variable names

Rebase of barnes-hut TSNE

amueller force-pushed the tsne_fixes branch from b81602e to e8ddf51 Compare June 22, 2015 20:19

amueller mentioned this pull request Jun 22, 2015

[MRG] Cemoody/bhtsne Barnes-Hut t-SNE #4025

Closed

24 tasks

amueller force-pushed the tsne_fixes branch 2 times, most recently from 57c3f06 to 8fabe50 Compare June 22, 2015 22:14

amueller force-pushed the tsne_fixes branch 4 times, most recently from 4dafbbb to 2366fcb Compare August 3, 2015 20:47

amueller force-pushed the tsne_fixes branch 2 times, most recently from 7ebf045 to 141cf52 Compare September 9, 2015 21:48

amueller added this to the 0.17 milestone Sep 9, 2015

cemoody and others added 2 commits September 11, 2015 13:19

add necessary blas files.

1ce9ba3

amueller force-pushed the tsne_fixes branch from 371ab10 to 1ab64d5 Compare September 11, 2015 17:20

Minor fixes, remove transform for now.

fe49161

amueller force-pushed the tsne_fixes branch from 1ab64d5 to fe49161 Compare September 11, 2015 17:24

amueller added a commit that referenced this pull request Sep 20, 2015

Merge pull request #4887 from amueller/tsne_fixes

770d089

Rebase of barnes-hut TSNE

amueller merged commit 770d089 into scikit-learn:master Sep 20, 2015

AlexanderFabisch mentioned this pull request Mar 6, 2016

TSNE - n_iter_without_progress not working #6450

Closed

amueller deleted the tsne_fixes branch May 19, 2017 20:29
