[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907

imaculate · 2016-06-18T19:40:34Z

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

…umentation

jnothman · 2016-06-18T21:11:45Z

sklearn/svm/tests/test_svm.py

        clf.fit(iris.data, iris.target)

        prob_predict = clf.predict_proba(iris.data)
        assert_array_almost_equal(
            np.sum(prob_predict, 1), np.ones(iris.data.shape[0]))
        assert_true(np.mean(np.argmax(prob_predict, 1)
-                    == clf.predict(iris.data)) > 0.9)
+                            == clf.predict(iris.data)) > 0.9)


Usually we wouldn't go fixing up cosmetic things when submitting an unrelated PR. It makes the PR somewhat harder to review. But at least this PR is small and focussed

jnothman · 2016-06-18T21:12:18Z

Thanks for this. After a skim, it looks good, but I'll give it a closer look when I have time.

agramfort · 2016-06-19T07:19:02Z

sklearn/svm/tests/test_svm.py

+
+    assert np.linalg.norm(lsvr.coef_ - lsvr_no_weight.coef_)\
+        / np.linalg.norm(lsvr_no_weight.coef_) < .1
+    assert np.abs(score1 - score2) < 0.1


don't use assert but assert_less or assert_true

jnothman · 2016-06-22T13:52:38Z

Frustratingly, you'll need to rebase on master. Otherwise looks good to me, and you should mention the enhancement in doc/whats_new.rst

Replicate solution to scikit-learn@9a52077 except that `_pairwise` should always be `True` for `KernelCenterer` because it's supposed to receive a Gram matrix. This should make `KernelCenterer` usable in `Pipeline`s. Happy to add tests, just tell me what should be covered.

…g cython fused types (scikit-learn#6846)

Fixes scikit-learn#6860

This is a smoke test. Hence there is no point having cv=4

imaculate · 2016-06-22T15:29:40Z

@jnothman I tried to rebase, not sure if I did the right thing.

jnothman · 2016-06-22T23:58:40Z

Rebase usually involves something like:

$ git checkout master
$ git pull https://github.com/scikit-learn/scikit-learn/ master
$ git checkout linearsvr_sampleweight
$ git rebase master
# sort out any merge conflicts
$ git push -f https://github.com/imaculate/scikit-learn linearsvr_sampleweight

jnothman · 2016-06-22T23:58:52Z

I.e. it is not done correctly here

…umentation

…e test tolerance

imaculate · 2016-06-23T11:26:01Z

Done!

imaculate · 2016-06-23T11:26:26Z

Done!

jnothman · 2016-06-23T11:31:23Z

@agramfort, a quick review?

agramfort · 2016-06-23T14:02:27Z

thx @imaculate

imaculate · 2016-06-23T14:29:23Z

Pleasure! Thanks too for the guidance!

… and documentation. Fixes scikit-learn#6862 (scikit-learn#6907) * Make KernelCenterer a _pairwise operation Replicate solution to scikit-learn@9a52077 except that `_pairwise` should always be `True` for `KernelCenterer` because it's supposed to receive a Gram matrix. This should make `KernelCenterer` usable in `Pipeline`s. Happy to add tests, just tell me what should be covered. * Adding test for PR scikit-learn#6900 * Simplifying imports and test * updating changelog links on homepage (scikit-learn#6901) * first commit * changed binary average back to macro * changed binomialNB to multinomialNB * emphasis on "higher return values are better..." (scikit-learn#6909) * fix typo in comment of hierarchical clustering (scikit-learn#6912) * [MRG] Allows KMeans/MiniBatchKMeans to use float32 internally by using cython fused types (scikit-learn#6846) * Fix sklearn.base.clone for all scipy.sparse formats (scikit-learn#6910) * DOC If git is not installed, need to catch OSError Fixes scikit-learn#6860 * DOC add what's new for clone fix * fix a typo in ridge.py (scikit-learn#6917) * pep8 * TST: Speed up: cv=2 This is a smoke test. Hence there is no point having cv=4 * Added support for sample_weight in linearSVR, including tests and documentation * Changed assert to assert_allclose and assert_almost_equal, reduced the test tolerance * Fixed pep8 violations and sampleweight format * rebased with upstream

jnothman reviewed Jun 18, 2016
View reviewed changes

agramfort reviewed Jun 19, 2016
View reviewed changes

jnothman changed the title ~~Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862~~ [MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 Jun 22, 2016

fishcorn and others added 17 commits June 22, 2016 16:01

Adding test for PR scikit-learn#6900

0043885

Simplifying imports and test

069336e

updating changelog links on homepage (scikit-learn#6901)

039b6f3

first commit

f69fb7e

changed binary average back to macro

2d7929d

changed binomialNB to multinomialNB

1267f6d

emphasis on "higher return values are better..." (scikit-learn#6909)

f911bb6

fix typo in comment of hierarchical clustering (scikit-learn#6912)

1534d0c

[MRG] Allows KMeans/MiniBatchKMeans to use float32 internally by usin…

3c34fb3

…g cython fused types (scikit-learn#6846)

Fix sklearn.base.clone for all scipy.sparse formats (scikit-learn#6910)

2accd0c

DOC If git is not installed, need to catch OSError

a08a1fd

Fixes scikit-learn#6860

DOC add what's new for clone fix

943836c

fix a typo in ridge.py (scikit-learn#6917)

478614a

pep8

41000d5

TST: Speed up: cv=2

3dfb282

This is a smoke test. Hence there is no point having cv=4

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn

99392a8

imaculate added 5 commits June 23, 2016 13:11

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn

74414dc

Added support for sample_weight in linearSVR, including tests and doc…

e5e0320

…umentation

Changed assert to assert_allclose and assert_almost_equal, reduced th…

e9f2ff7

…e test tolerance

Fixed pep8 violations and sampleweight format

ae39622

rebased with upstream

65d1d93

imaculate force-pushed the linearsvr_sampleweight branch from c67fd55 to 65d1d93 Compare June 23, 2016 11:25

jnothman added the Waiting for Reviewer label Jun 23, 2016

agramfort merged commit 3cc7fea into scikit-learn:master Jun 23, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907

[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907

imaculate commented Jun 18, 2016

jnothman Jun 18, 2016

jnothman commented Jun 18, 2016

agramfort Jun 19, 2016

jnothman commented Jun 22, 2016

imaculate commented Jun 22, 2016

jnothman commented Jun 22, 2016

jnothman commented Jun 22, 2016

imaculate commented Jun 23, 2016

imaculate commented Jun 23, 2016

jnothman commented Jun 23, 2016

agramfort commented Jun 23, 2016

imaculate commented Jun 23, 2016

[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907

[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907

Conversation

imaculate commented Jun 18, 2016

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

jnothman Jun 18, 2016

Choose a reason for hiding this comment

jnothman commented Jun 18, 2016

agramfort Jun 19, 2016

Choose a reason for hiding this comment

jnothman commented Jun 22, 2016

imaculate commented Jun 22, 2016

jnothman commented Jun 22, 2016

jnothman commented Jun 22, 2016

imaculate commented Jun 23, 2016

imaculate commented Jun 23, 2016

jnothman commented Jun 23, 2016

agramfort commented Jun 23, 2016

imaculate commented Jun 23, 2016