Add documentation for the new API #133
Conversation
Hi, I just pushed an outline of the documentation so that we can discuss it here.
doc/outline.md
- Supervised Metric Learning: (add links to examples/images from examples at the right place in the description)
  - Problem setting
  - Input data (+ see Preprocessor section)
This is only going to cover simple usage of the preprocessor, with more details in the Preprocessor section, right?
Yes, something like: input data: a 2D array-like X (or 1D array-like of indices, see preprocessor section)
doc/outline.md
- Examples/Tutorials:
  - One example with faces (prediction if same/different person)
  - One example of grid search to compare different algorithms (mmc, itml etc)
So the goal is to do the following in order:
- grid search for each metric learning algorithm
- grid search for a classifier (same classification algorithm)
- and then compare the results using classification metrics?
Right? Or are you only comparing the visualization after the metric transformation (if so, maybe this could be combined with the "Data visualisation" section)?
I was thinking of a grid search only for Weakly Supervised Algorithms like MMC, the score being the accuracy of predicting whether a pair is positive/negative, since this case is the most unusual for users accustomed to classic train/test sets of points rather than pairs. But indeed we could also run a grid search on pipelines (SupervisedMetricLearner, KNN) for instance, like a regular classifier, with a classification accuracy score.
And I don't know if you were referring to it in 1., but maybe for Supervised Learners we could also do a GridSearch with the scoring of Weakly Supervised Algorithms (we would split the points X in the traditional way, but on the test set we would sample some pairs and then evaluate the accuracy of `score_pairs`). I don't know whether this kind of scoring is a usual option for Supervised Metric Learners? @bellet any opinion on this?
And I agree that it would be interesting to see the impact of hyperparameter tuning on data visualization: maybe this could make a nice example in the data visualization section indeed.
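To make the first idea concrete, here is a rough sketch of what such a grid search could look like (the toy random pairs and the parameter grid are purely illustrative, and it assumes the pair classifier exposes an accuracy-like score on labeled pairs, as described above):

import numpy as np
from sklearn.model_selection import GridSearchCV
from metric_learn import MMC

# toy data: 20 pairs of 5-dimensional points with +1/-1 pair labels
rng = np.random.RandomState(42)
pairs = rng.randn(20, 2, 5)
y_pairs = rng.choice([1, -1], size=20)

# grid search directly on the weakly supervised learner; each fold is a
# split of the labeled pairs, and the score is the pair-label accuracy
grid = GridSearchCV(MMC(), param_grid={'max_iter': [50, 100]}, cv=3)
grid.fit(pairs, y_pairs)
print(grid.best_params_)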
doc/outline.md
- Problem setting
- Input data (+ See Preprocessor section)
- What you can do after fit (predict/score, transform...)
- Scikit-learn compatibility (compatible with grid search + link to example of grid search)
It might also be good to show a sklearn Pipeline
Yes, I agree, since a pipeline (Metric Learner, KNN), for instance, represents a major use case of metric learning for classification.
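As a rough illustration of that use case (NCA, KNN and iris are just stand-ins here, not a prescribed setup):

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from metric_learn import NCA

X, y = load_iris(return_X_y=True)

# learn a metric on the data, then classify with KNN in the transformed space
pipe = Pipeline([('metric_learner', NCA()), ('knn', KNeighborsClassifier())])
pipe.fit(X, y)
print(pipe.score(X, y))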
Hi, I just added what could be a minimal documentation for a first merge, including minimal descriptions of the preprocessor and the recent changes in the API of Weakly Supervised Metric Learners. I just need to polish the style a bit (align the text, format the examples sections well, try to make the sandwich example plot something) before merging, I think, but the content should not change, so you can already have a look at it. Basically I copied what was already present in the doc/docstrings into the new sections of the new doc, as we said @bellet and @nvauquie.
@perimosocordiae @terrytangyuan maybe you could review this PR? Then we could merge it as soon as possible.
Looks good to me!
Overall looks good. I left a few comments, but I think we can merge this pretty soon.
doc/preprocessor.rst
>>> for img_path in arr:
>>>     result.append(X[int(img_path[3:5])])
>>>     # transforms 'img01.png' into X[1]
>>> return np.array(result)
This example could be simplified a bit:
import numpy as np
from matplotlib.pyplot import imread  # or any other image reader
from metric_learn import NCA

def find_images(file_paths):
    # each file contains a small image to use as an input datapoint
    return np.row_stack([imread(f).ravel() for f in file_paths])

nca = NCA(preprocessor=find_images)
nca.fit(['img01.png', 'img00.png', 'img02.png'], [1, 0, 1])
Yes, and your snippet with the imread etc. is actually what we would really do in a real use case.
Done
doc/preprocessor.rst
>>> for img_path in arr:
>>>     result.append(X[int(img_path[3:5])])
>>>     # transforms 'img01.png' into X[1]
>>> return np.array(result)
You can re-use `find_images` from the other example above here.
Done
doc/preprocessor.rst
>>> y_pairs = np.array([1, -1])
>>>
>>> mmc = MMC(preprocessor=X)
>>> mmc.fit(pairs, y_pairs)
Maybe be explicit here that the `X` array is not used at all in this case.
I agree, I replaced it with a "not implemented yet" preprocessor, and wrote that the code would work anyway.
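Concretely, the snippet could look something like this (values loosely based on the quoted example; the point is only that the preprocessor is bypassed when the pairs are already given as a 3D array of points):

import numpy as np
from metric_learn import MMC

def preprocessor_wip(indices):
    # would normally fetch the points corresponding to `indices`
    raise NotImplementedError("This preprocessor does nothing yet.")

# the pairs are given directly as a 3D array of points,
# so the preprocessor is never called
pairs = np.array([[[1.2, 3.2], [2.3, 5.5]],
                  [[2.1, 0.6], [4.3, 3.3]]])
y_pairs = np.array([1, -1])

mmc = MMC(preprocessor=preprocessor_wip)
mmc.fit(pairs, y_pairs)  # works even though the preprocessor raises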
examples/README.txt
Examples
========

Below is a gallery of example of metric-learn use cases.
...a gallery of example metric-learn use cases.
Thanks, done
outline.md
@@ -0,0 +1,45 @@
documentation outline:
This looks more suitable for the wiki, or maybe a master issue for improving docs.
yes, maybe you can put this in a wiki
Thanks, yes, I forgot it was still here. That's right, I'll open a page on the wiki for that.
Thanks for the review @terrytangyuan and @perimosocordiae!
doc/preprocessor.rst
>>> from metric_learn import MMC
>>> def preprocessor_wip(array):
>>>     return NotImplementedError("This preprocessor does nothing yet.")
raise instead of return
Right, thanks
Thanks, looks good! I left some minor comments.
Additional remarks:
- are the docstrings up to date? For instance it does not show `preprocessor` as an argument of init
- code examples in the doc for weakly supervised algorithms use the supervised version! so this should be changed
- the previous code examples could be used in the supervised section to document the supervised version of the weakly supervised algorithms (can add later)
- like you said it would be nice to show the figures for the sandwich examples (but that can also be done later)
- finally, there is an empty "module contents" section in the package overview part
Quick start
===========
maybe just add a sentence or two to briefly describe what the code snippet does (compute cross validation score of NCA on iris dataset)
Agreed, I've added a quick description and also modified the example, since it actually didn't work (for now we don't have a scoring for cross-validation on supervised metric learners), so I updated the example with a pipeline nca + knn.
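For reference, the updated quick start could look roughly like this (exact estimators and parameters in the actual doc may differ):

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from metric_learn import NCA

X, y = load_iris(return_X_y=True)

# cross-validate an (NCA -> KNN) pipeline: the metric is learned on each
# training fold, and KNN accuracy is measured on the corresponding test fold
scores = cross_val_score(make_pipeline(NCA(), KNeighborsClassifier()), X, y, cv=5)
print(scores.mean())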
doc/introduction.rst
machine learning dedicated to automatically constructing optimal distance
metrics.

This package contains efficient Python implementations of several popular
emphasize the scikit-learn compatible aspect
You're right, done
Each metric learning algorithm supports the following methods:

- ``fit(...)``, which learns the model.
missing a few other generic methods, like `score_pairs` etc?
Indeed, thanks. I've added `score_pairs` and replaced `transformer()` with the new `transformer_from_metric(metric)`. I didn't put `_prepare_inputs` since it is private, nor `check_preprocessor`, because it is public for now but should probably rather be private?
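For the record, a minimal sketch of how `score_pairs` would be used (assuming, as discussed above, that it takes a 3D array of pairs and returns the learned distance of each pair, i.e. the Euclidean distance after `transform`):

import numpy as np
from sklearn.datasets import load_iris
from metric_learn import NCA

X, y = load_iris(return_X_y=True)
nca = NCA()
nca.fit(X, y)

# distances between two pairs of points under the learned metric
pairs = np.array([[X[0], X[1]], [X[0], X[2]]])
print(nca.score_pairs(pairs))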
doc/preprocessor.rst
Array-like
----------
You can specify ``preprocessor=X`` where ``X`` is an array-like containing the
dataset of points. In this case, the estimator will be able to take as
the fit method of the estimator?
I agree it may confuse users to say "the estimator will". I'll also say "predict etc", since preprocessors can also be used for prediction/scoring etc
done
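Concretely, the array-like case would look something like this (made-up points; the point is that with `preprocessor=X`, both fit-time and prediction/scoring-time methods can be fed indices into `X` instead of raw points):

import numpy as np
from metric_learn import MMC

# the dataset of points that the indices refer to
X = np.array([[1.2, 3.2], [2.3, 5.5], [2.1, 0.6], [4.3, 3.3]])

# pairs are given as pairs of indices into X;
# the preprocessor fetches the corresponding points
pairs_of_indices = np.array([[0, 1], [2, 3]])
y_pairs = np.array([1, -1])

mmc = MMC(preprocessor=X)
mmc.fit(pairs_of_indices, y_pairs)
# scoring-time methods accept the same index-based format
print(mmc.score_pairs(pairs_of_indices))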
Preprocessor
============

Estimators in metric-learn all have a ``preprocessor`` option at instantiation.
maybe briefly explain the default behavior (when `preprocessor=None`)
I agree, done
doc/supervised.rst
.. rubric:: References:

`Information-theoretic Metric Learning <http://machinelearning.wustl.edu/mlpapers/paper_files/icml2007_DavisKJSD07.pdf>`_ Jason V. Davis, et al.
this is not the right reference (this is for ITML)
Thanks, didn't see that
Done
doc/weakly_supervised.rst
``preprocessor``, which will go fetch and form the tuples. This allows to
give more general indicators than just indices from an array (for instance
paths in the filesystem, name of records in a database etc...) See section
:ref:`preprocessor` for more details on how to use the preprocessor.
link to section does not seem to work (at least not on the version I compiled locally)
Thanks, that's because I forgot to put the right formatting for referencing sections :p
Done
doc/weakly_supervised.rst
1. ITML
-------

Information Theoretic Metric Learning, Kulis et al., ICML 2007
This is Davis et al.
Indeed, thanks, done
doc/weakly_supervised.rst
.. todo:: add more details on `_Supervised` classes

1. ITML
the numbering of the section for each algorithm has an issue (e.g., shows 3.4.1. 1. ITML)
Thanks, done
doc/supervised.rst
.. todo:: Covariance is unsupervised, so its doc should not be here.

:class:`Covariance` does not "learn" anything, rather it calculates
it would be nice to have these references link to the docstring page
Thanks, indeed I used the wrong way to reference them
Done
Thanks for all the changes! Two additional minor comments ;-)
doc/introduction.rst
the domain.
Distance metric learning (or simply, metric learning) is the sub-field of
machine learning dedicated to automatically constructing optimal distance
metrics.
I suggest two changes here:
- "optimal" is not great (wrt what is it optimal?). I would instead say "to automatically construct task-specific distance metrics from (weakly) supervised data".
- the new quick start example below makes me realize that we should clearly mention the relation between distance metric learning and embedding/representation learning. We could add a sentence like:
"The learned distance metric often corresponds to a Euclidean distance in a new embedding space, hence distance metric learning can be seen as a form of representation learning."
- "optimal" is not great (wrt what is it optimal?). I would instead say "to automatically construct task-specific distance metrics from (weakly) supervised data".
I agree, done
- the new quick start example below makes me realize that we should clearly mention the relation between distance metric learning and embedding/representation learning. We could add a sentence like:
"The learned distance metric often corresponds to a Euclidean distance in a new embedding space, hence distance metric learning can be seen as a form of representation learning."
I agree, for now I just added this sentence, but indeed we should emphasize this more, maybe in the future with some section in the docs, and also maybe this will get clearer with examples of metric learners used as transformers (examples of dimensionality reduction for instance)
doc/introduction.rst
:math:`D`-dimensional learned metric space :math:`X L^{\top}`,
in which standard Euclidean distances may be used.
- ``transform(X)``, which applies the aforementioned transformation.
- ``score_pairs`` which returns the similarity of pairs of points.
Maybe clarify the inputs to `score_pairs`, and "similarity of pairs of points" --> "distance between pairs of points"?
That's right, done
Thanks for the review @bellet! To answer your remarks:
This should be updated now that I merged back the updated
I agree, I'll update them, for instance with this kind of example:

from sklearn.datasets import fetch_lfw_pairs
from metric_learn import MMC

dataset = fetch_lfw_pairs()
pairs, y = dataset.pairs, dataset.target
pairs = pairs.reshape(pairs.shape[0], 2, -1)
y = 2*y - 1  # we want +1 to indicate similar pairs and -1 dissimilar pairs

mmc = MMC()
mmc.fit(pairs, y)

Or the same but maybe with a smaller artificial dataset.
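For instance, a smaller artificial stand-in could be something like this (random data, purely illustrative):

import numpy as np
from metric_learn import MMC

# a small artificial stand-in for the LFW pairs: 30 pairs of 10-dimensional points
rng = np.random.RandomState(0)
pairs = rng.randn(30, 2, 10)
y = rng.choice([1, -1], size=30)  # +1: similar pair, -1: dissimilar pair

mmc = MMC()
mmc.fit(pairs, y)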
I agree, I've added a simple example with
Yes, I'm still trying to understand why it's failing :p
Yes, it was already there in the previous docs, but I'm still trying to understand how this modules and submodules part is generated. @perimosocordiae maybe an idea on that?
Looks good, but I would indeed use a smaller dataset
Maybe this should be put in the supervised section instead? Linking "weakly-supervised algorithm" in the text to the appropriate section, and maybe renaming "_Supervised version" to "Supervised versions of weakly-supervised algorithms".
maybe the last reply in sphinx-doc/sphinx#3177?
I've updated the docs with some toy dataset
Ah yes, I hadn't fully read your previous comment
I've updated the examples in the weakly supervised section as you said
Turns out it comes from the file
Turns out the problem with
I guess we should be ready to merge! :)
It's likely that we aren't using good parameter settings for SDML in that example. We should file a bug to investigate further, though.
Docs are merged! Now we can merge
For now there is nothing added, but I'll update it soon with some draft of documentation.