ENH: isclose/allclose: support array_like `atol`/`rtol` #24878

mdhaber · 2023-10-07T15:08:46Z

gh-14320 reported that np.allclose accepted vector atol but failed when a NaN was present.
gh-14343 began to add full support for array atol/rtol in allclose/isclose, but became inactive.
@seberg asked if anyone would like to finish it, hence this PR.

I preserved the original commits and made the requested changes. I also fixed the case of non-array array_like rtol/atol and strengthened the tests.

numpy/core/tests/test_numeric.py

doc/release/upcoming_changes/14343.future_change.rst

numpy/core/numeric.py

mhvk

It all looks pretty good, but looking more closely I felt a simplification was possible - see inline.

mhvk · 2023-10-12T00:45:52Z

numpy/core/numeric.py

@@ -2357,8 +2359,10 @@ def within_tol(x, y, atol, rtol):
        # lib.stride_tricks, though, so we can't import it here.
        x = x * ones_like(cond)
        y = y * ones_like(cond)
+        atol_f = np.broadcast_to(atol, x.shape)[finite] if atol.shape else atol


So this is weird, above it is noted we are not allowed broadcast_arrays, but here we use broadcast_to. So, clearly that comment is out of date, which means we can just do (following the definition of cond - rest can go):

x, y, atol, rtol = np.broadcast_arrays(x, y, atol, rtol, subok=True) cond[finite] = within_tol(x[finite], y[finite], atol[finite], rtol[finite])

(note addition of subok=True - useful for subclasses)

Could you check whether the tests still pass with this?

Yup, this looks better to me.

It turns out that in NumPy main, isclose is very flexible about the shape of atol and rtol when there are no NaNs.

import numpy as np np.isclose([1, 2], [[2], [1]], atol=[[[0.01]], [[2]]]) # array([[[False, True], # [ True, False]], # # [[ True, True], # [ True, True]]])

Before the last commit, this would have failed if there were a NaN in one of the arrays because atol cannot be broadcast to the shape of x (which is really the braodcasted shape of x and y at this point in the code).

Now this will work even when there are NaNs.

np.isclose([np.nan, 2], [[2], [np.nan]], atol=[[[0.01]], [[3]]], equal_nan=True) # array([[[False, True], # [ True, False]], # # [[False, True], # [ True, False]]])

mhvk

This looks all good to me! The only thing that I think should still be done, though, is to squash the commits - maybe to two to continue to give credit to the earlier attempt.

mhvk

Great, thanks!

mhvk

Looks good except I think the code can be simplified even more! See inline comment.

mhvk · 2023-10-28T16:35:53Z

numpy/_core/numeric.py

@@ -2361,7 +2360,8 @@ def within_tol(x, y, atol, rtol):
        x = x * ones_like(cond)


These two lines and the comment above can go, since we actually broadcast x and y already above.

….isclose

seberg · 2023-10-29T08:06:01Z

numpy/_core/numeric.py

@@ -2341,8 +2341,7 @@ def within_tol(x, y, atol, rtol):
        with errstate(invalid='ignore'), _no_nep50_warning():
            return less_equal(abs(x-y), atol + rtol * abs(y))

-    x = asanyarray(a)
-    y = asanyarray(b)
+    x, y, atol, rtol = np.broadcast_arrays(a, b, atol, rtol, subok=True)


Hmmmpf, this is one of the cases similar to gh-24957. So, atol and rtol stop being Python scalars (with NEP 50) or 0-d objects (without NEP 50) and that changes promotion.

We need a solution for that, right now, we could skip it add logic for them being type(atol) in (int, float, complex). The question remains a bit whether we want a way to say mark_was_weak_scalar(array), to turn on weak promotion for an array, but... I don't really want such a mark to survive for long and it is unclear what operations should retain it.

I would think that if NumPy is hitting this, then other libraries and users will hit it as well. Maybe we should rethink the transition and how we provide (or not) backward incompatibility shims.

I guess the casting could be done earlier - something like

dtype=np.result_type(a, b, atol, rtol) atol = np.asanyarray(atol, dtype) rtol = np.asanyarray(rtol, dtype)

and then broadcast?

If broadcast is a common culprit, we might add a new keyword argument that does this implicitly?

If broadcast is a common culprit, we might add a new keyword argument that does this implicitly?

~~Yes, please!~~ This is something Sebastian and I have discussed before.

Oops, but I've wanted the promotion to follow NEP50, so it would be nice if the user could choose which rules to follow.

Yes, that would be the suggestion: to allow the dtype of arrays created from python scalars to follow NEP 50 promotion rules.

There are different approaches:

just make the current stuff a bit more convenient.

Try to tag on the info to an array.

Just deal with it here, the isinstance check and special code paths are not that terrible (and also useful for micro optimization probably).

To me no solution is obvious, unfortunately. Tagging the info on to the array or dtype, means that you have a very tricky situation of how to drag the info around and mainly about when to preserve and when to drop it.
In this case for example, you are calling arr[boolean_idx] creating a copy, should that drop to prevent accidentally returning such an array? Assignments at least could be prevent by making it readonly, but... Also the special path to make arr > -1 work for such an array is another tricky thing.
I can somewhat think of ways to do this, but I am not sure they will be very pleasant.

Moving resolution to earlier (maybe with a helper) can help. But in some cases might also not be quite what we want because it makes sense to ask isclose(2 meters, 1.9 meters, atol=0.1, rtol=0.1 meters). But it is less clear that the: dtype = result_type(meters, float) is reasonable (for the atol)?

mhvk · 2023-10-30T16:20:30Z

Taking discussion out of code: You are right, result_type doesn't work for things like result_type(input-quantity, rtol-float).

I think in the end, perhaps one just has to write more carefully... Looking at this code, really the two different paths for finite and non-finite are not needed if one is a bit more careful, which means one can avoid the broadcasting altogether -- see mdhaber#1 (@mdhaber - I made this PR to your branch so that we can keep credit for both you and the earlier commit).

From that, perhaps it is useful to have a general helper routine that does asanyarray except passes through python scalars? I now do

    x, y, atol, rtol = (
        a if isinstance(a, (int, float, complex)) else asanyarray(a)
        for a in (a, b, atol, rtol))

Though it may be that we simply have to see how things pan out.

p.s. Just for completeness, the NEP 50 changes in astropy were not that large - astropy/astropy#15525 - with some code becoming more and other code becoming less obvious.

mhvk

Reapproving, though given that one of the commits is now mine, I won't merge but let @seberg have another look as well.

mattip · 2023-11-13T05:37:42Z

Thanks @mdhaber, @mhvk.

tylerjereddy · 2023-11-15T18:55:30Z

We did notice something related to this downstream in SciPy per: scipy/scipy#19526

I didn't open a NumPy issue, because it may actually be an improvement to sensitivity, can't tell just yet..

mdhaber · 2023-11-15T22:18:46Z

The failure is

E   Mismatched elements: 1 / 1 (100%)
E   Max absolute difference among violations: nan
E   Max relative difference among violations: nan
E    ACTUAL: array(-8.113438e+295-infj)
E    DESIRED: array(-8.113438e+295-infj)

I think this should still pass; the new failure is not an intended effect of the PR. It is probably failing because the magnitudes are not finite, so exact equality comparison is performed, but presumably the real parts are not exactly equal. What is surprising is that the new logic is similar to the old logic in this respect, so it's not jumping out at me why it didn't fail before. I'll take a closer look soon.

Update:
In the old logic:

        x = x * ones_like(cond)
        y = y * ones_like(cond)

(where x and y are np.asarray([-8.113438e+295-np.inf*1j]) and cond is a boolean array) turns x and y into complex NaNs.

import numpy as np
x = np.asarray([-8.113438e+295-np.inf*1j])
cond = np.asarray([True])
x * np.ones_like(cond)  # array([nan+nanj])

Then the NaN comparison logic kicked in, and the test passed. This was not exactly correct behavior either because the original inputs would be considered close even if the real parts were not close. So I think this case has exposed a bug in both the old and new code.

The obvious workaround to me is to perform real and imaginary part comparison separately, but maybe there is a more elegant solution.

mdhaber · 2023-11-22T18:03:06Z

@mhvk would you like to adapt your solution to work with complex numbers in which either the real or complex part is infinite?

mhvk · 2023-11-23T06:10:06Z

I'm actually not sure what the expectation should be: the current test assumes either x, y are finite, or, if not, that they should be the same infinite (same sign). Following that logic, for complex, I guess we can that say that the infinite should point in the same direction (i.e., any of 8 possibilities). Still, for the case in question, of x=finite ± infinite j, I guess one would then logically have to ignore the finite values altogether. Is that what we want?

On the other hand, one can see isclose as a way to look for machine rounding errors, in which case the answer is clearer: just check real and imaginary separately if the other is infinite. But then, we do not do that for reals: the maximum number representable is not treated as the same as infinite, even if it is just a rounding error away. But it is not obvious to me one would ever want to do isclose(x.real, y.real) & isclose(x.imag, y.imag) - it is important that rtol goes with abs(y) which includes both real and imaginary.

Anyway, I think it makes most sense to check for "same infinity" (ignoring real for -8.113438e+295+0j-np.inf*1j), which would make the scipy tests pass again, but probably best if we first agree this is OK...

p.s. It is all a bit weird: if I do,

np.asarray([-8.113438e+295-np.inf*1j])
# array([nan-infj])

This actually is a python problem:

-8.113438e+295-np.inf*1j
(nan-infj)

So, it looks like one has to construct the test values explicitly assigning real and imaginary...

mhvk · 2023-11-23T06:35:55Z

This is what the directional infinity check would look like:

    with errstate(invalid='ignore'), _no_nep50_warning():
        result = less_equal(abs(x-y), atol + rtol * abs(y))
        if not np.all(finite_y := isfinite(y)):
            result &= finite_y
            # We allow matches for infinities pointing in the same direction.
            if x.dtype.kind != "c" and y.dtype.kind != "c":
                result |= (x == y)
            else:
                # For complex, 8 ways to point in the same direction.
                x_real_inf = isinf(x.real)
                x_imag_inf = isinf(x.imag)
                match_real_inf = x_real_inf & (x.real == y.real)
                match_imag_inf = x_imag_inf & (x.imag == y.imag)
                neither_real_inf = ~(x_real_inf | isinf(y.real))
                neither_imag_inf = ~(x_imag_inf | isinf(y.imag))
                result |= match_real_inf & (match_imag_inf | neither_imag_inf)
                result |= match_imag_inf & neither_real_inf

            if equal_nan:
                result |= isnan(x) & isnan(y)

github-actions bot added the 01 - Enhancement label Oct 7, 2023

mdhaber commented Oct 7, 2023

View reviewed changes

numpy/core/tests/test_numeric.py Outdated Show resolved Hide resolved

doc/release/upcoming_changes/14343.future_change.rst Outdated Show resolved Hide resolved

mhvk reviewed Oct 8, 2023

View reviewed changes

numpy/core/numeric.py Outdated Show resolved Hide resolved

mdhaber changed the title ~~ENH: iscloses/allclose: support array_like atol/rtol~~ ENH: isclose/allclose: support array_like atol/rtol Oct 8, 2023

mhvk reviewed Oct 12, 2023

View reviewed changes

mhvk reviewed Oct 14, 2023

View reviewed changes

mdhaber force-pushed the gh14343 branch from 2c5b7d2 to f204c02 Compare October 14, 2023 17:09

mhvk approved these changes Oct 14, 2023

View reviewed changes

mdhaber requested a review from mhvk October 27, 2023 03:42

mhvk reviewed Oct 28, 2023

View reviewed changes

maxwell-aladago and others added 2 commits October 28, 2023 23:40

EHN: adding support for vector atol and rtol in np.all_isclose and np…

64fa171

….isclose

MAINT: isclose/allclose: simplify support for array-like tolerances

3030159

mdhaber force-pushed the gh14343 branch from 51cb733 to 3030159 Compare October 29, 2023 06:45

seberg reviewed Oct 29, 2023

View reviewed changes

mhvk mentioned this pull request Oct 30, 2023

Ensure NEP 50 promotion works for isclose mdhaber/numpy#1

Open

mdhaber changed the base branch from main to maintenance/1.0.3.x November 10, 2023 22:08

mdhaber changed the base branch from maintenance/1.0.3.x to main November 10, 2023 22:08

MAINT: ensure NEP 50 promotion is not broken in isclose and allclose

9212ec1

mdhaber force-pushed the gh14343 branch from 916e4a1 to 9212ec1 Compare November 10, 2023 22:15

mdhaber requested a review from seberg November 10, 2023 23:15

mhvk approved these changes Nov 12, 2023

View reviewed changes

mattip merged commit fc8292d into numpy:main Nov 13, 2023

mhvk mentioned this pull request Nov 23, 2023

CI, MAINT: special test_intermediate_overlow failure with latest NumPy (bisected) scipy/scipy#19526

Open

mattip mentioned this pull request Jul 3, 2024

EHN: Add support for vector atol and rtol in np.all_isclose and np… #14343

Closed

		@@ -2361,7 +2360,8 @@ def within_tol(x, y, atol, rtol):
		x = x * ones_like(cond)

Uh oh!

ENH: isclose/allclose: support array_like atol/rtol #24878

ENH: isclose/allclose: support array_like atol/rtol #24878

Uh oh!

Conversation

mdhaber commented Oct 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mhvk left a comment

Choose a reason for hiding this comment

Uh oh!

mhvk Oct 12, 2023

Choose a reason for hiding this comment

Uh oh!

mdhaber Oct 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhvk left a comment

Choose a reason for hiding this comment

Uh oh!

mhvk left a comment

Choose a reason for hiding this comment

Uh oh!

mhvk left a comment

Choose a reason for hiding this comment

Uh oh!

mhvk Oct 28, 2023

Choose a reason for hiding this comment

Uh oh!

seberg Oct 29, 2023

Choose a reason for hiding this comment

Uh oh!

mattip Oct 29, 2023

Choose a reason for hiding this comment

Uh oh!

mhvk Oct 29, 2023

Choose a reason for hiding this comment

Uh oh!

mdhaber Oct 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhvk Oct 29, 2023

Choose a reason for hiding this comment

Uh oh!

seberg Oct 30, 2023

Choose a reason for hiding this comment

Uh oh!

mhvk commented Oct 30, 2023

Uh oh!

mhvk left a comment

Choose a reason for hiding this comment

Uh oh!

mattip commented Nov 13, 2023

Uh oh!

tylerjereddy commented Nov 15, 2023

Uh oh!

mdhaber commented Nov 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdhaber commented Nov 22, 2023

Uh oh!

mhvk commented Nov 23, 2023

Uh oh!

mhvk commented Nov 23, 2023

Uh oh!

Uh oh!

ENH: isclose/allclose: support array_like `atol`/`rtol` #24878

ENH: isclose/allclose: support array_like `atol`/`rtol` #24878

mdhaber commented Oct 7, 2023 •

edited

Loading

mdhaber Oct 14, 2023 •

edited

Loading

mdhaber Oct 29, 2023 •

edited

Loading

mdhaber commented Nov 15, 2023 •

edited

Loading