
[MRG] Adding fit_transform #26


Merged

Conversation

bhargavvader
Contributor

bhargavvader commented Sep 8, 2016

@perimosocordiae , could you have a look and see if this is the correct idea/direction to go in?
And this sort of fit_transform would be wanted for all the algorithms, correct?

This is with respect to #25.

@bhargavvader
Contributor Author

bhargavvader commented Sep 8, 2016

I think what I've currently done is a bit verbose - maybe just something like this:

def fit_transform(self, X, Y=None):
    self.fit(X, Y)
    return self.transform(X)

would be better?

@perimosocordiae
Contributor

I think you might be able to define a generic fit_transform() method on the base class, similar to your example:

def fit_transform(self, *args, **kwargs):
    self.fit(*args, **kwargs)
    return self.transform()

Some subclasses might need special treatment, but that should be a good start.

We'll also want to add a test case for each algorithm's subclass.

@bhargavvader
Contributor Author

@perimosocordiae which subclasses might need special treatment? I made the change and tried it out on all the algorithms and it seems fine.

Should the test format be similar to what is already done in metric_learn_test?

@perimosocordiae
Contributor

The docstring needs some work. Maybe just describe the function as calling .fit() and then returning the result of .transform(). There should be a note pointing the reader to .fit() for the expected arguments, specifically because the generic *args, **kwargs parameters don't tell the user anything. Also, the docstring should specify that it returns the metric-transformed input data, not the transformation matrix.
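
A minimal sketch of what such a docstring could look like, assuming the generic base-class signature proposed above (the wording here is illustrative, not final):

def fit_transform(self, *args, **kwargs):
    """Calls .fit() and then returns the result of .transform().

    Parameters
    ----------
    *args, **kwargs : passed straight through to this estimator's .fit();
        see that method's documentation for the expected arguments.

    Returns
    -------
    X_transformed : array-like
        The input data transformed by the learned metric (not the
        transformation matrix itself).
    """
    self.fit(*args, **kwargs)
    return self.transform()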

For testing, it would be nice to have a simple loop that checks the result of calling .fit_transform() vs manually calling .fit() and then .transform() for all methods. Be sure to use some small set of test data, so that the test case doesn't take very long to run. This could be another file in the test/ directory. I'll probably rename the existing test to something like test_iris.py, and your new test file could be named test_fit_transform.py.
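
For illustration, a rough sketch of such a loop; the stand-in estimator and the random data below are placeholders for the project's actual learners and a small fixed dataset:

import numpy as np

class CenteringTransformer:
    # Stand-in for a metric learner, used only to show the test pattern.
    def fit(self, X, y=None):
        self.X_ = X
        self.mean_ = X.mean(axis=0)
        return self

    def transform(self, X=None):
        if X is None:
            X = self.X_
        return X - self.mean_

    def fit_transform(self, *args, **kwargs):
        self.fit(*args, **kwargs)
        return self.transform()

def test_fit_transform_matches_fit_then_transform():
    rng = np.random.RandomState(1234)
    X = rng.rand(20, 4)                            # small data keeps the test fast
    y = rng.randint(0, 2, 20)
    for make_estimator in [CenteringTransformer]:  # swap in each algorithm's class
        expected = make_estimator().fit(X, y).transform()
        result = make_estimator().fit_transform(X, y)
        assert np.allclose(expected, result)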

@bhargavvader
Contributor Author

bhargavvader commented Sep 13, 2016

Funnily enough, the tests fail for ITML, SDML, RCA, LSML and LFDA when I compare doing .fit() and then .transform() against doing fit_transform(). I'm still going to push with the failing tests and the reworked docstrings - could you check and see why they might be failing? Is it to do with a random seed, maybe?

edit: I'll change the specifics of the tests; this is just a rough outline of what they might look like.

@perimosocordiae
Contributor

The test failures look like a random seed issue, though I haven't verified on my end.

@bhargavvader
Contributor Author

How do you suggest getting around the random seed issue? Can I pass a seed to the algorithm, or would declaring it outside before running .fit() and .transform() work as well?

@bhargavvader
Contributor Author

bhargavvader commented Sep 15, 2016

@perimosocordiae could you please help verify the source of the errors?
While lsml, sdml, and itml use random_subset for constraints, rca and lfda don't.
So, some questions:

  1. Could we allow passing a random seed to the algorithms which use random_subset? It could always come in handy when users want to replicate results (and for testing, of course) - see the sketch after these questions.

  2. What is going on with rca and lfda? Also, funnily, lfda returns the same matrix but with the signs reversed when I use fit_transform. Any clue why this may be so?
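
As a sketch of what passing a seed through might look like (this borrows scikit-learn's check_random_state helper; the random_subset signature below is hypothetical, not the project's actual one):

import numpy as np
from sklearn.utils import check_random_state

def random_subset(n_samples, num_constraints, random_state=None):
    # Hypothetical helper: sample constraint indices reproducibly.
    rng = check_random_state(random_state)   # accepts None, an int seed, or a RandomState
    return rng.choice(n_samples, size=num_constraints, replace=False)

# Passing the same seed twice yields the same constraints:
assert (random_subset(100, 10, random_state=42) == random_subset(100, 10, random_state=42)).all()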

@perimosocordiae
Contributor

I made a new issue (#33) for your first point. RCA is affected by the same issue because the chunks method also uses random numbers.

LFDA is based on an eigendecomposition, and eigenvectors are only determined up to sign, so that's why you're seeing the sign flipping. Both +v and -v are equivalent, which makes things trickier to test. We could choose a convention, say that the first value of each eigenvector must be positive, which would make LFDA reproducible without changing its results.
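
A minimal sketch of that convention, assuming the learned components are stored as rows of a NumPy array (the function name is made up for illustration):

import numpy as np

def fix_eigenvector_signs(components):
    # Flip each eigenvector (row) so its first entry is non-negative.
    # +v and -v span the same direction, so this only picks a reproducible
    # representative; it doesn't change the learned metric.
    components = np.asarray(components, dtype=float)
    signs = np.sign(components[:, 0])
    signs[signs == 0] = 1.0           # leave zero-leading rows as they are
    return components * signs[:, np.newaxis]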

@bhargavvader
Contributor Author

Thanks - this answers everything I wanted to know. Will get to this over the weekend.

@bhargavvader
Contributor Author

bhargavvader commented Sep 29, 2016

@perimosocordiae, for some reason the tests still fail despite passing a seed (np.random.RandomState(1234)). Can you please have a look and see if I am doing something wrong?

@bhargavvader
Contributor Author

bhargavvader commented Sep 29, 2016

I thought I would fix it by adding random_state parameters to the adjacency-matrix and positive_negative_pairs calls in ITML and the others, but the failure still comes up. Needs a closer look.

@bhargavvader
Contributor Author

ping @perimosocordiae could you have a look please?

@perimosocordiae
Contributor

You need to reset the random state between the two times you use it (in the fit_transform test cases).
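
In the test, that pattern might look something like this (assuming the learners draw from NumPy's global random state; the helper name is a placeholder):

import numpy as np

def check_fit_transform_consistency(make_estimator, X, y, seed=1234):
    # Re-seed before each run so both code paths see identical random draws.
    np.random.seed(seed)
    expected = make_estimator().fit(X, y).transform()

    np.random.seed(seed)               # reset the state between the two uses
    result = make_estimator().fit_transform(X, y)

    assert np.allclose(expected, result)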

@bhargavvader
Contributor Author

Ah, okay. The tests are fixed now.

@bhargavvader changed the title from [WIP] Adding fit_transform to [MRG] Adding fit_transform on Oct 4, 2016
@bhargavvader
Contributor Author

@perimosocordiae , could you see if this is fine?
I could redo the Notebook with examples of fit_transform.

@perimosocordiae merged commit c5087d7 into scikit-learn-contrib:master on Oct 6, 2016
@perimosocordiae
Contributor

Looks good, thanks!
