API: Remove broadcasting ambiguity from np.linalg.solve #25914


Merged: 5 commits into numpy:main on Mar 6, 2024

Conversation

@asmeurer (Member) commented Mar 1, 2024

Previously the np.linalg.solve documentation stated:

a : (..., M, M) array_like
    Coefficient matrix.
b : {(..., M,), (..., M, K)}, array_like

however, this is inherently ambiguous. For example, if a has shape (2, 2, 2) and b has shape (2, 2), b could be treated as a (2,) stack of (2,) column vectors, in which case the result should have shape (2, 2), or as a single 2x2 matrix, in which case the result should have shape (2, 2, 2).

NumPy resolved this ambiguity in a confusing way, which was to treat b as (..., M) whenever b.ndim == a.ndim - 1, and as (..., M, K) otherwise.
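The two dispatch rules can be contrasted with a pure-shape sketch (the helper names and formulation here are illustrative, not NumPy's actual implementation):

```python
def uses_vector_path_old(a_ndim, b_ndim):
    # NumPy < 2.0: b treated as a stack of vectors whenever
    # b.ndim == a.ndim - 1
    return b_ndim == a_ndim - 1

def uses_vector_path_new(a_ndim, b_ndim):
    # NumPy >= 2.0: b treated as a single vector iff it is exactly 1-D
    return b_ndim == 1

# The ambiguous case above: a.shape == (2, 2, 2), b.shape == (2, 2)
print(uses_vector_path_old(3, 2))  # True:  result shape (2, 2)
print(uses_vector_path_new(3, 2))  # False: result shape (2, 2, 2)
```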

A much more consistent way to handle this ambiguity is to treat b as a single vector if and only if it is 1-dimensional, i.e., use

b : {(M,), (..., M, K)}, array_like

This is consistent with, for instance, the matmul operator, which only uses the special 1-D vector logic if an operand is exactly 1-dimensional, and treats the operands as (stacks of) 2-D matrices otherwise.
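For reference, matmul's exactly-1-D special case is easy to see from shapes alone (this behaviour is the same on any recent NumPy):

```python
import numpy as np

a = np.ones((3, 2, 2))

# A 1-D operand gets the vector treatment: the vector axis is appended,
# the product computed, and the axis removed again.
v = np.ones(2)
print((a @ v).shape)   # (3, 2)

# A 2-D operand is a matrix, never a stack of vectors.
m = np.ones((2, 2))
print((a @ m).shape)   # (3, 2, 2)
```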

This updates np.linalg.solve() to use this behavior.

This is a backwards compatibility break, as any instance where the b array has more than one dimension and exactly one fewer dimension than the a array will now use the matrix logic, potentially returning a different result with a different shape. The previous behavior can be manually emulated with something like

np.linalg.solve(a, b[..., None])[..., 0]

since b as a (M,) vector is equivalent to b as a (M, 1) matrix (or the user could manually import and use the internal solve1() gufunc which implements the b-as-vector logic).
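That equivalence can be sketched with an unstacked a, where the behaviour is identical before and after this change:

```python
import numpy as np

a = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([9.0, 8.0])

x_vec = np.linalg.solve(a, b)                 # b as an (M,) vector
x_mat = np.linalg.solve(a, b[:, None])[:, 0]  # same b as an (M, 1) matrix

print(np.allclose(x_vec, x_mat))  # True
```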

This change aligns the solve() function with the array API, which resolves this broadcasting ambiguity in this way. See
https://data-apis.org/array-api/latest/extensions/generated/array_api.linalg.solve.html#array_api.linalg.solve and data-apis/array-api#285.

Fixes #15349
Fixes #25583

@asmeurer (Member, Author) commented Mar 2, 2024

spin test is passing for me locally, but doesn't seem to be including the tests that are failing on CI. Is spin test not the proper way to run the test suite?

@rgommers (Member) commented Mar 3, 2024

spin test is passing for me locally, but doesn't seem to be including the tests that are failing on CI. Is spin test not the proper way to run the test suite?

It is. spin test -m full -- numpy/linalg/tests/test_linalg.py -k TestSolve will reproduce the failures. When tests are marked with @pytest.mark.slow they only run when you add the -m full.

@rgommers (Member) commented Mar 3, 2024

The failures are for:

>>> a = np.ones((3, 0, 0))
>>> b = np.ones((3, 0))

>>> np.linalg.solve(a, b)  # with `main`
array([], shape=(3, 0), dtype=float64)

>>> np.linalg.solve(a, b)  # with this PR
ValueError: solve: Input operand 1 has a mismatch in its core dimension 0, with gufunc signature (m,m),(m,n)->(m,n) (size 3 is different from 0)

Second failure is the same, but then with shapes (3, 2, 2) / (3, 2).

@seberg added this to the 2.0.0 release milestone on Mar 4, 2024
@asmeurer (Member, Author) commented Mar 4, 2024

That's really annoying. Why are the full tests not run by default? The full test suite only takes two and a half minutes to run on my machine. And this option isn't even documented in spin test -h.

@asmeurer (Member, Author) commented Mar 4, 2024

The linalg tests are formulated in a rather odd way. All the test cases have b matrices, but almost no tests use them. Only solve and lstsq do, and of them, only solve uses these particular "generalized" cases where a and b are stacked. All the other cases take b as an argument but ignore it, e.g.,

class InvCases(LinalgSquareTestCase, LinalgGeneralizedSquareTestCase):
    def do(self, a, b, tags):
        a_inv = linalg.inv(a)
        assert_almost_equal(dot_generalized(a, a_inv),
                            identity_like_generalized(a))
        assert_(consistent_subclass(a_inv, a))

asmeurer added 2 commits March 4, 2024 16:11
The dot_generalized() function is no longer needed. It was primarily there to
handle the odd case for solve(), but the updated logic for that is now put in
TestSolve itself, and dot_generalized is replaced with matmul() everywhere
else.
@asmeurer (Member, Author) commented Mar 4, 2024

OK, I've updated the tests. The linalg test cases had some odd dot_generalized helper which seemed to exist primarily to handle the odd solve() behavior. I've replaced it with matmul everywhere and moved the (updated) special handling of the solve logic into the TestSolve test itself.

The linalg test cases are still not great, but I'm not sure I'm up for trying to improve them here. Like I said, they all oddly reuse test cases with a and b even for functions that only test a. A probably more pertinent issue is that none of them really test broadcasting at all. I suppose that sort of thing might be standard since broadcasting logic is always handled by the (g)ufunc, but in this case, it might be a good idea to test it explicitly since solve() isn't exactly a gufunc, but rather two different gufuncs depending on the dimensionality of b.
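A minimal sketch of such an explicit broadcasting test, using a case whose dispatch is unambiguous under both the old and the new rules (the test name is illustrative):

```python
import numpy as np

def test_solve_broadcasting():
    # Stacked a with a matching stacked matrix rhs broadcasts over the
    # leading stack axis; b.ndim != a.ndim - 1 and b.ndim != 1, so both
    # the old and new rules take the matrix path here.
    a = np.stack([np.eye(2)] * 3)   # shape (3, 2, 2)
    b = np.ones((3, 2, 5))          # shape (3, 2, 5)
    assert np.linalg.solve(a, b).shape == (3, 2, 5)

test_solve_broadcasting()
```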

@mattip (Member) commented Mar 6, 2024

Aside:

And this option isn't even documented in spin test -h

For me spin test --help has the line

By default, spin will run -m 'not slow'. To run the full test suite, use spin -m full

@mattip (Member) left a comment
Historically, there is a difference between the broadcasting rules of np.dot and np.matmul; maybe the current behaviour is due to that discrepancy somehow. The rule here and in the Array API is much clearer.

This does need a release note. I sent a message to the mailing list since it is an API change.

@stefanv (Contributor) commented Mar 6, 2024

Agreed that reducing this ambiguity is a good idea, especially since we have the opportunity to do so with 2.0.

@mattip (Member) commented Mar 6, 2024

Let's merge this, we can revert if the mailing list conversation gets heated (unlikely).

@mattip merged commit 7fc3d0f into numpy:main on Mar 6, 2024
@mattip (Member) commented Mar 6, 2024

Thanks @asmeurer

@asmeurer (Member, Author) commented Mar 6, 2024

Ah, I completely forgot that I never wrote a release note for this. Do you still want me to do that, or did you take care of it?

@rgommers (Member) commented Mar 8, 2024

Ah, I completely forgot that I never wrote a release note for this. Do you still want me to do that, or did you take care of it?

For completeness: the release note is included in gh-25937. Thanks for this PR!
