BUG ensure monotonic property of lerp in numpy.percentile #15098


Closed
wants to merge 15 commits

Conversation

@glemaitre

closes #14685

Change the way percentiles are computed (and linearly interpolated) to ensure the monotonicity property.
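[Editor's note] A minimal sketch of the idea behind the fix (not the PR's exact code; the helper name is made up): the naive form a + (b - a) * t can lose monotonicity near t = 1 due to floating-point rounding, so the interpolation is instead computed from whichever endpoint is nearer.

```python
import numpy as np

def lerp_monotonic(a, b, t):
    """Interpolate between a (t=0) and b (t=1), computing from the
    nearer endpoint so the result is monotonic in t and reproduces
    both endpoints exactly."""
    a, b, t = np.asarray(a), np.asarray(b), np.asarray(t)
    # naive form, accurate for t < 0.5
    r = a + (b - a) * t
    # symmetric form, accurate for t >= 0.5
    return np.where(t >= 0.5, b - (b - a) * (1 - t), r)
```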

@glemaitre (Author)

ping @seberg, @eric-wieser, @arthertz, who might be interested in having a look at this.

@glemaitre (Author)

Are there other regression tests that would be meaningful?
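[Editor's note] One candidate regression check (a sketch, not taken from the PR): since the linked bug report is that np.percentile output is not sorted, asserting that the result is non-decreasing for increasing q exercises exactly that property.

```python
import numpy as np

# For increasing quantiles q, the interpolated percentiles must
# themselves be non-decreasing; the bug in gh-14685 was that
# rounding in the interpolation could violate this.
rng = np.random.default_rng(0)
a = rng.random(100)
q = np.linspace(0, 100, 1001)
p = np.percentile(a, q)
assert np.all(np.diff(p) >= 0), "percentile output must be sorted"
```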

@glemaitre (Author)

The failures do not seem to be related to the changes in this PR.

@ogrisel (Contributor) left a comment

Small comments:

@mattip mattip requested a review from eric-wieser December 23, 2019 07:46
@seberg previously approved these changes Dec 23, 2019

@seberg (Member) left a comment

Looks good to me. @eric-wieser want to squash merge it when you are happy? Since percentile should normally do most of its work in looking up the indices and not in the calculation, I do not think I would worry about speed.
I think we do not need release notes here.

@glemaitre (Author)

Tests are passing

@glemaitre (Author)

@eric-wieser do you require any further changes?

@seberg seberg self-requested a review January 16, 2020 23:18
@mattip (Member)

mattip commented Jan 23, 2020

@seberg ping. This is both approved and "pending review"

Comment on lines 3936 to 3937
x1 = np.moveaxis(x1, axis, 0)
x2 = np.moveaxis(x2, axis, 0)
(Member)

If you kept these lines, and did them before the computation, then you'd be able to do:

r_above = np.add(x1, diff_x2_x1 * weights_above, out=out)
r_below = np.subtract(x2, diff_x2_x1 * weights_below, out=r_above, where=weights_above < 0.5)

@seberg (Member) left a comment

OK, I mainly wanted to look at the formula, and the logic seems fine to me. The test failures are real, and so is the issue with out being a scalar. I think reorganizing the calculation to make it closer to the original (moving it to where the add was) should fix that issue.


# ensure axis with q-th is first
x1 = np.moveaxis(x1, axis, 0)
x2 = np.moveaxis(x2, axis, 0)
(Member)

Move this before the diff calculation, I think.

(Member)

You'll need to call moveaxis on the weight too, I think

(Member)

Wait, do we even need this? We already moved the axis above...

I think perhaps the take should be axis=0

(Member)

Ah, axis is already 0 here, so this is a no-op

r_above = np.add(x1, diff_x2_x1 * weights_above, out=out)
r_below = np.subtract(
    x2, diff_x2_x1 * weights_below, out=r_above,
    where=weights_above < 0.5
)
(Member)

This is fun, but in that case please just call it r (or something longer) rather than above and below, and add a comment that we first calculate everything for "above" and then replace it with the "below" version.

Are there tests for the combinations of "zerod" and out=... being given? This smells to me like we could get into trouble here. It is probably possible to move the if zerod block before these calculations to avoid that problem.
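[Editor's note] The out=/where= pattern under discussion can be illustrated standalone (the variable names mirror the diff, but the values here are made up). A ufunc call with where= leaves the masked-out entries of out untouched, so the "above" result survives wherever the condition is False.

```python
import numpy as np

x1 = np.array([0.0, 0.0, 0.0])
x2 = np.array([10.0, 10.0, 10.0])
t = np.array([0.25, 0.5, 0.75])
diff = x2 - x1

# compute the "above" form everywhere ...
r = np.add(x1, diff * t)
# ... then overwrite in place, only where the "below" form
# (measured from the other endpoint) is the accurate one
np.subtract(x2, diff * (1 - t), out=r, where=t >= 0.5)
# r is now [2.5, 5.0, 7.5]
```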

(Member)

I think the point is we made everything 1d at the top of the function, so the zerod block should remain below - to convert the final result back to 0d.

Perhaps we should push forward on the leavewrapped thing at some point.

(Member)

But out is never made 1-D, so we need to be careful?

(Member)

Yeah, this is a good example of the need for leavewrapped.


if out is not None:
    r = add(x1, x2, out=out)
    out[...] = r_above
    r = out
(Member)

Assignment to out should not be necessary here, the above operation is already in-place (i.e. similar to the old branch, which did not do this).

(Member)

@seberg is the double-assignment cleanup a blocker?

(Member)

Not on its own, but I think the above also needs some cleanups, maybe I should just put it on my todo list to do it myself.

@charris (Member)

charris commented Jan 29, 2020

Tests are failing.

@WarrenWeckesser (Member)

It would be nice to get this working.

@glemaitre, will you be able to resume working on this? There are a few issues still to be resolved.

@seberg (Member)

seberg commented Apr 10, 2020

@glemaitre do you have time/want to pick this up? Otherwise I am happy to drag it over the finish line.

@glemaitre (Author)

@seberg I am sorry, I don't think I will get any time to finish this up in the next 3 weeks. If you think you can finalize it in the meantime, do not hesitate.

@seberg seberg self-requested a review April 29, 2020 20:55
@seberg seberg dismissed their stale review April 29, 2020 20:56

needs some fixups and re-review.

@eric-wieser (Member)

Perhaps we ought to extract a lerp helper function here?

@seberg (Member)

seberg commented May 17, 2020

Yeah, we could create a hidden ufunc; it should be simple enough. Maybe I will look at it at the beginning of the week.

@eric-wieser (Member)

For now I've got a local patch that extracts it to a pure python function, which builds upon #16274.

@eric-wieser (Member)

Alright, my cleanups are in - there's now a private _lerp function which this diff can be confined to.

@seberg (Member)

seberg commented Jun 27, 2020

Fixed in gh-16273. Thanks @glemaitre, the credit for figuring out how best to fix this goes fully to you!

@seberg seberg closed this Jun 27, 2020
Successfully merging this pull request may close these issues.

BUG: numpy.percentile output is not sorted
7 participants