Fix quantile empty 29315 #29326

imran4444shaik · 2025-07-05T18:12:45Z

Description

This PR fixes two related issues with np.quantile():

Empty array handling: Makes np.quantile([], 0.5) return np.nan consistently with np.median([]) instead of raising IndexError
Integer overflow: Fixes incorrect results for integer arrays with large values (e.g., np.array([32767, -1], dtype=np.int16))

Changes

Added explicit empty array check in _quantile() that returns NaN/NaT-filled array
Added safe float conversion for integer arrays before interpolation
Added tests verifying both fixes

Impact

Fixes BUG: quantile inconsitent with median for size=0 #29315 (quantile inconsistent with median for size=0)
Fixes integer overflow cases while maintaining backward compatibility
Matches median behavior for both edge cases
No effect on normal floating-point/datetime cases

Testing

Added test cases for:

Empty arrays of all supported types
Integer arrays with overflow potential
Verification against median results
Existing functionality remains unchanged

Before:
>>> np.quantile([], 0.5)
IndexError
>>> np.quantile(np.array([32767,-1], dtype=np.int16), 0.5)
49151.0  # Wrong due to overflow

After:
>>> np.quantile([], 0.5)
nan  # Matches median behavior
>>> np.quantile(np.array([32767,-1], dtype=np.int16), 0.5)
16383.0  # Correct, matches median

Fixes numpy#29315 by making return consistently with instead of raising an IndexError. The fix: 1. Explicitly checks for empty arrays (size=0) in _quantile() 2. Returns NaN/NaT-filled array with correct shape and dtype 3. Maintains consistency with median behavior for empty inputs 4. Preserves all existing functionality for non-empty arrays Handles all numeric, datetime and timedelta dtypes appropriately.

Fixes integer overflow in quantile calculation by converting integer arrays to float64 before interpolation. This ensures: 1. Correct calculation for extreme values (e.g., [32767, -1] in int16) 2. Consistent results with median for integer inputs 3. No effect on floating-point or datetime types The fix handles all integer, unsigned, and boolean types by safely casting to float before interpolation operations while maintaining existing behavior for other types. Matches median's behavior of returning float for integers.

jorenham · 2025-07-05T18:43:16Z

Personally I would a fail-fast approach, and have this raise an appropriate error. Since this is size-dependent, this won't help when applying this over an axis: Either all returned values are nan, or none of them are. So I don't see any advantage for returning nan, instead of raising an error.

imran4444shaik added 3 commits July 5, 2025 23:25

STY: Remove whitespace from blank line in test file

f209015

tylerjereddy added the component: numpy.lib label Jul 6, 2025

melissawm added this to NumPy first-time contributor PRs Jul 7, 2025

melissawm moved this to Pending authors' response in NumPy first-time contributor PRs Jul 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix quantile empty 29315 #29326

Fix quantile empty 29315 #29326

imran4444shaik commented Jul 5, 2025

Uh oh!

jorenham commented Jul 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Fix quantile empty 29315 #29326

Are you sure you want to change the base?

Fix quantile empty 29315 #29326

Conversation

imran4444shaik commented Jul 5, 2025

Description

Changes

Impact

Testing

Uh oh!

jorenham commented Jul 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jorenham commented Jul 5, 2025 •

edited

Loading