[MRG] Fix RCA_Supervised sklearn compat test #198

Merged
merged 3 commits on May 13, 2019

Conversation

wdevazelhes
Member

@wdevazelhes wdevazelhes commented May 2, 2019

I noticed that we didn't test RCA_Supervised's scikit-learn compatibility.

A first problem in scikit-learn's check_estimator that prevented the test from passing was that RCA_Supervised couldn't form the default num_chunks number of chunks on one of the toy datasets that scikit-learn runs. Fixing num_chunks=2 ensures a normal use of the algorithm is tested (if we had put 1 chunk it would have become a corner case) and is robust when we cannot form many chunks from the data.

A second problem was just a bug when the input was int: we couldn't subtract the mean from chunk_data in place (it was an array of ints, and numpy raises an error in this case). Casting this array to float fixes it.
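The second bug can be reproduced in isolation; this is a hypothetical minimal sketch, not the PR's code:

```python
import numpy as np

# chunk_data is an int array; subtracting its (float) mean in place
# fails because numpy cannot cast the float result back into the
# existing int array.
chunk_data = np.array([[1, 2], [3, 4]])
try:
    chunk_data -= chunk_data.mean(axis=0)
except TypeError:
    error_raised = True

# Casting to float first makes the in-place subtraction valid.
chunk_data = chunk_data.astype(float)
chunk_data -= chunk_data.mean(axis=0)
```

numpy's ufunc casting error is a subclass of TypeError, which is why the except clause above catches it.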

@wdevazelhes wdevazelhes changed the title Add checks for labels when having pairs [MRG] Fix RCA_Supervised sklearn compat test May 2, 2019
wdevazelhes pushed a commit to wdevazelhes/metric-learn that referenced this pull request May 2, 2019
# through slices hence we do a copy. We will also need to
# ensure the data is float so that we can subtract the
# mean on it
chunk_data = data[chunk_mask].astype(float, copy=True)
Contributor

Logical indexing always creates a copy, so the copy=True isn't necessary here. In fact, astype() defaults to returning a new copy each time, so what we really want here is copy=False to avoid unnecessary copies.

Member Author

That's right, thanks
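The copy semantics discussed above can be checked directly; a small sketch, not part of the PR:

```python
import numpy as np

data = np.arange(12, dtype=float).reshape(4, 3)
chunk_mask = np.array([True, False, True, True])

# Boolean-mask indexing returns a new array, never a view...
selected = data[chunk_mask]
assert not np.shares_memory(selected, data)

# ...so with copy=False, astype can return the selection as-is when
# the dtype already matches, avoiding a second, redundant copy.
assert selected.astype(float, copy=False) is selected

# The default copy=True always allocates a fresh array.
assert selected.astype(float) is not selected
```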

# check_estimator(RCA_Supervised)
def test_rca(self):
    def stable_init(self, num_dims=None, pca_comps=None,
                    chunk_size=2, preprocessor=None):
Contributor

I'd use **kwargs here

Member Author

I agree that would be better in general, but here I think it might be useful for dRCA to have the same argument names as the real RCA, so that if scikit-learn has checks that depend on RCA's arguments, they are taken into account. What do you think?
Though that's not the case here, so maybe yes, we can put **kwargs for simplicity.

Contributor

I'm not sure how scikit-learn's testing works, but that seems plausible. This is fine as-is.
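The wrapping pattern under discussion can be sketched with a dummy stand-in for RCA_Supervised (the class, argument names, and defaults below are illustrative, not metric-learn's actual signature):

```python
# Dummy estimator standing in for RCA_Supervised.
class RCA_Supervised:
    def __init__(self, num_chunks=100, chunk_size=2, preprocessor=None):
        self.num_chunks = num_chunks
        self.chunk_size = chunk_size
        self.preprocessor = preprocessor

# stable_init mirrors RCA's signature but pins num_chunks=2 so that
# check_estimator's tiny toy datasets can always form the chunks.
def stable_init(self, num_dims=None, pca_comps=None,
                chunk_size=2, preprocessor=None):
    RCA_Supervised.__init__(self, num_chunks=2, chunk_size=chunk_size,
                            preprocessor=preprocessor)

# The mocked class would then be passed to check_estimator.
dRCA_Supervised = type('dRCA_Supervised', (RCA_Supervised,),
                       {'__init__': stable_init})
est = dRCA_Supervised()
```

Keeping the same parameter names as the real estimator is what preserves any signature-based checks scikit-learn might run, which is the trade-off against a plain **kwargs forwarding.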

@bellet
Member

bellet commented May 9, 2019

Can you briefly explain the problem and its solution in the PR description?

@wdevazelhes
Member Author

Can you briefly explain the problem and its solution in the PR description?

Yes, sorry. Done.

@bellet
Member

bellet commented May 9, 2019

Thanks. Why is this stable_init thing needed?

@wdevazelhes
Member Author

wdevazelhes commented May 9, 2019

Thanks. Why is this stable_init thing needed?

In fact there was also another bug: the default n_chunks was too big for some toy problems in scikit-learn's check_estimator, so stable_init fixing n_chunks=2 is a way to fix it (I updated the PR description; I had forgotten there was this bug too).

@bellet
Member

bellet commented May 9, 2019

OK. I guess this could be fixed more robustly by making sure that n_chunks*chunk_size is not larger than the number of points, or by having the chunk generation procedure break gracefully when no new chunk can be created. But let's keep this for another PR.
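The first suggestion could look like the following; `safe_n_chunks` is a hypothetical helper, not what this PR (or metric-learn) implements:

```python
def safe_n_chunks(n_points, chunk_size, requested_n_chunks):
    # Cap the number of chunks so that n_chunks * chunk_size never
    # exceeds the number of available points.
    return min(requested_n_chunks, n_points // chunk_size)

# With 7 points and chunks of size 2, at most 3 full chunks fit,
# regardless of how many were requested.
n = safe_n_chunks(7, chunk_size=2, requested_n_chunks=100)
```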


@wdevazelhes
Member Author

Merging, since the Travis tests are green and there are 2 approvals.

@wdevazelhes wdevazelhes merged commit 9f73250 into scikit-learn-contrib:master May 13, 2019
bellet pushed a commit that referenced this pull request Jun 12, 2019
* Remove initialization of the data for RCA

* Add deprecated flag for supervised version too

* Remove comment saying we'll do PCA

* Add ChangedBehaviorWarning and do tests

* improve change behavior warning

* Update message in case covariance matrix is not invertible

* FIX: still ignore testing RCA while fixed in #198

* Some reformatting

* Fix test string

* TST: add test for warning message when covariance is not definite

* Address #194 (comment)