-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MRG+1] Added support for sample_weight in linearSVR, including tests and documentation. Fixes #6862 #6907
Conversation
clf.fit(iris.data, iris.target) | ||
|
||
prob_predict = clf.predict_proba(iris.data) | ||
assert_array_almost_equal( | ||
np.sum(prob_predict, 1), np.ones(iris.data.shape[0])) | ||
assert_true(np.mean(np.argmax(prob_predict, 1) | ||
== clf.predict(iris.data)) > 0.9) | ||
== clf.predict(iris.data)) > 0.9) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Usually we wouldn't go fixing up cosmetic things when submitting an unrelated PR. It makes the PR somewhat harder to review. But at least this PR is small and focussed
Thanks for this. After a skim, it looks good, but I'll give it a closer look when I have time. |
|
||
assert np.linalg.norm(lsvr.coef_ - lsvr_no_weight.coef_)\ | ||
/ np.linalg.norm(lsvr_no_weight.coef_) < .1 | ||
assert np.abs(score1 - score2) < 0.1 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
don't use assert but assert_less or assert_true
Frustratingly, you'll need to rebase on master. Otherwise looks good to me, and you should mention the enhancement in |
Replicate solution to scikit-learn@9a52077 except that `_pairwise` should always be `True` for `KernelCenterer` because it's supposed to receive a Gram matrix. This should make `KernelCenterer` usable in `Pipeline`s. Happy to add tests, just tell me what should be covered.
…g cython fused types (scikit-learn#6846)
This is a smoke test. Hence there is no point having cv=4
@jnothman I tried to rebase, not sure if I did the right thing. |
Rebase usually involves something like:
|
I.e. it is not done correctly here |
c67fd55
to
65d1d93
Compare
Done! |
1 similar comment
Done! |
@agramfort, a quick review? |
thx @imaculate |
Pleasure! Thanks too for the guidance! |
… and documentation. Fixes scikit-learn#6862 (scikit-learn#6907) * Make KernelCenterer a _pairwise operation Replicate solution to scikit-learn@9a52077 except that `_pairwise` should always be `True` for `KernelCenterer` because it's supposed to receive a Gram matrix. This should make `KernelCenterer` usable in `Pipeline`s. Happy to add tests, just tell me what should be covered. * Adding test for PR scikit-learn#6900 * Simplifying imports and test * updating changelog links on homepage (scikit-learn#6901) * first commit * changed binary average back to macro * changed binomialNB to multinomialNB * emphasis on "higher return values are better..." (scikit-learn#6909) * fix typo in comment of hierarchical clustering (scikit-learn#6912) * [MRG] Allows KMeans/MiniBatchKMeans to use float32 internally by using cython fused types (scikit-learn#6846) * Fix sklearn.base.clone for all scipy.sparse formats (scikit-learn#6910) * DOC If git is not installed, need to catch OSError Fixes scikit-learn#6860 * DOC add what's new for clone fix * fix a typo in ridge.py (scikit-learn#6917) * pep8 * TST: Speed up: cv=2 This is a smoke test. Hence there is no point having cv=4 * Added support for sample_weight in linearSVR, including tests and documentation * Changed assert to assert_allclose and assert_almost_equal, reduced the test tolerance * Fixed pep8 violations and sampleweight format * rebased with upstream
… and documentation. Fixes scikit-learn#6862 (scikit-learn#6907) * Make KernelCenterer a _pairwise operation Replicate solution to scikit-learn@9a52077 except that `_pairwise` should always be `True` for `KernelCenterer` because it's supposed to receive a Gram matrix. This should make `KernelCenterer` usable in `Pipeline`s. Happy to add tests, just tell me what should be covered. * Adding test for PR scikit-learn#6900 * Simplifying imports and test * updating changelog links on homepage (scikit-learn#6901) * first commit * changed binary average back to macro * changed binomialNB to multinomialNB * emphasis on "higher return values are better..." (scikit-learn#6909) * fix typo in comment of hierarchical clustering (scikit-learn#6912) * [MRG] Allows KMeans/MiniBatchKMeans to use float32 internally by using cython fused types (scikit-learn#6846) * Fix sklearn.base.clone for all scipy.sparse formats (scikit-learn#6910) * DOC If git is not installed, need to catch OSError Fixes scikit-learn#6860 * DOC add what's new for clone fix * fix a typo in ridge.py (scikit-learn#6917) * pep8 * TST: Speed up: cv=2 This is a smoke test. Hence there is no point having cv=4 * Added support for sample_weight in linearSVR, including tests and documentation * Changed assert to assert_allclose and assert_almost_equal, reduced the test tolerance * Fixed pep8 violations and sampleweight format * rebased with upstream
Reference Issue
What does this implement/fix? Explain your changes.
Any other comments?
…umentation