Conversation

mdhaber (Contributor) commented Sep 20, 2024:

Reference issue

Closes gh-18295
Supersedes gh-18424
May be used to address gh-19521/gh-19549 and make log_softmax array API compatible. (The idea of using log1p is essentially the same, but I think softmax needs its own implementation.)

What does this implement/fix?

gh-18295 reported that logsumexp can lose precision when one element is much bigger than the rest, especially when the exponential of it is close to 1. This improves the precision as described in the issue and linked paper.
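
The failure mode is easy to reproduce with plain NumPy. The sketch below is illustrative only — the function names and the shift-by-max formulation are assumptions for demonstration, not the SciPy implementation: when one element dominates, the correction term inside the log falls far below machine epsilon, so `log(1 + tiny)` rounds to 0 while `log1p(tiny)` keeps it.

```python
import numpy as np

def logsumexp_naive(a):
    # Classic shift-by-max formulation: m + log(sum(exp(a - m))).
    m = np.max(a)
    return m + np.log(np.sum(np.exp(a - m)))

def logsumexp_log1p(a):
    # Split off the maximum element and apply log1p to the sum of the
    # remaining (tiny) exponentials, which stays accurate near 0.
    m = np.max(a)
    rest = np.delete(a, np.argmax(a))
    return m + np.log1p(np.sum(np.exp(rest - m)))

a = np.array([0.0, -60.0, -60.0])
print(logsumexp_naive(a))   # 0.0 -- the ~1.75e-26 correction is lost
print(logsumexp_log1p(a))   # ~1.75e-26, i.e. about 2*exp(-60)
```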

Additional information

gh-18424 was out of date after converting logsumexp to the array API. For instance, xp.max does not work on the real component of complex arrays, so conversion of some parts would not be trivial. Also, there were some unresolved comments about the complexity, so I chose to start from scratch.

Also, logsumexp was getting quite complicated as it was. I found it challenging to work within the existing structure, so I refactored to simplify (26bb631) before getting started with the upgrade.

I'll add a review that documents the math inline with the code.

@mdhaber added the enhancement (a new feature or improvement) and scipy.special labels Sep 20, 2024
@mdhaber changed the title from "Gh18295" to "ENH: special.logsumexp: improve lost precision when one element is bigger than the rest" Sep 20, 2024
@mdhaber changed the title to "ENH: special.logsumexp: improve precision when one element is much bigger than the rest" Sep 20, 2024
a[b == 0] = -xp.inf

# Scale by real part for complex inputs, because this affects
# the magnitude of the exponential.
if xp_size(a) == 0:

mdhaber (Contributor, Author) commented:

Make size-0 arrays a special case. It's possible that they can be made to work with the existing code again; I'd be happy to review that as a simple follow-up. But conceptually, it's simpler to treat the special case here so we don't need to complicate the algorithmic code.
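
For reference, the convention the size-0 branch has to reproduce follows from empty reductions; a minimal sketch (not the PR's code):

```python
import numpy as np

a = np.empty((0,))
# The sum over an empty set is 0 and log(0) is -inf, so logsumexp of a
# size-0 array is -inf; errstate silences the expected divide warning.
with np.errstate(divide='ignore'):
    result = np.log(np.sum(np.exp(a)))
print(result)  # -inf
```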

Comment on lines +117 to +123
# Deal with shape details - reducing dimensions and convert 0-D to scalar for NumPy
out = xp.squeeze(out, axis=axis) if not keepdims else out
sgn = xp.squeeze(sgn, axis=axis) if (sgn is not None and not keepdims) else sgn
out = out[()] if out.ndim == 0 else out
sgn = sgn[()] if (sgn is not None and sgn.ndim == 0) else sgn

return (out, sgn) if return_sign else out

mdhaber (Contributor, Author) commented:

For simplicity, _logsumexp uses keepdims throughout and always returns sgn (which may be None if it won't be used). Reduce the axes away, convert to 0d, and choose what to return at the end.
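
The pattern being described can be sketched as follows (`reduce_then_squeeze` is a hypothetical name used only for illustration; the real `_logsumexp` does considerably more):

```python
import numpy as np

def reduce_then_squeeze(a, axis, keepdims=False):
    # Compute with keepdims=True so intermediates broadcast against `a`...
    m = np.max(a, axis=axis, keepdims=True)
    out = m + np.log(np.sum(np.exp(a - m), axis=axis, keepdims=True))
    # ...then reduce the axes away and convert 0-D to scalar at the end.
    out = out if keepdims else np.squeeze(out, axis=axis)
    return out[()] if out.ndim == 0 else out

a = np.arange(6.0).reshape(2, 3)
print(reduce_then_squeeze(a, axis=1).shape)                 # (2,)
print(reduce_then_squeeze(a, axis=1, keepdims=True).shape)  # (2, 1)
```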


# Deal with shape details - reducing dimensions and convert 0-D to scalar for NumPy
out = xp.squeeze(out, axis=axis) if not keepdims else out
sgn = xp.squeeze(sgn, axis=axis) if (sgn is not None and not keepdims) else sgn

A maintainer (Member) commented:

This is annoyingly complicated, but I guess we can't remove keepdims now. Is it necessary to do this for sgn? I suspect so; otherwise you wouldn't have this line here.

mdhaber (Contributor, Author) commented Sep 20, 2024:

> but I guess we can't remove keepdims now

It is easier to do the calculations internally with keepdims=True. The annoying thing is supporting keepdims=False, the default, but it is very natural for the user to want this function to behave like any other reducing operation.

It looked a little simpler in the old implementation, but this is how it was working - it used keepdims throughout (because that's much more convenient) and squeezed at the end.

mdhaber (Contributor, Author) commented Sep 20, 2024:

It's possible that some of this could be removed by letting the last reducing operation in _logsumexp eliminate the axes (if keepdims=False). For simplicity, I just decided to ignore that possibility for now and separate these details in logsumexp from the math stuff in _logsumexp.

mdhaber (Contributor, Author) commented Sep 20, 2024:

Will fix the vectorization bug (in _elements_and_indices_with_max_real) that is causing the failures shortly. Done.
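
For context, a helper with a name like `_elements_and_indices_with_max_real` has to select, along an axis, the element whose real part is largest. A hypothetical vectorized sketch (not the PR's code), using `argmax` over the real component:

```python
import numpy as np

def elements_with_max_real(a, axis):
    # argmax over the real part, vectorized along `axis`; keepdims lets
    # take_along_axis pick out the corresponding complex elements.
    i = np.argmax(np.real(a), axis=axis, keepdims=True)
    return np.take_along_axis(a, i, axis=axis), i

a = np.array([[1 + 9j, 2 + 0j],
              [5 - 1j, 3 + 3j]])
vals, idx = elements_with_max_real(a, axis=1)
print(vals.ravel())  # [2.+0.j 5.-1.j] -- max by real part, not by modulus
```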

mdhaber (Contributor, Author) commented Sep 20, 2024:

The failure in the Array API job surprised me, given that there was no problem with array_api_strict; but see data-apis/array-api-strict#62.

lucascolley (Member) left a review:

Thanks Matt, just a couple questions.

@lucascolley added this to the 1.15.0 milestone Sep 21, 2024
@lucascolley added the needs-release-note label (a maintainer should add a release note written by a reviewer/author to the wiki) Sep 21, 2024

lucascolley (Member) commented:

Thanks Matt and for the review Stéfan!

apaszke commented Sep 23, 2024:

FYI it seems like this change has had a pretty big impact on the values of imaginary components of results: #21610

@mdhaber removed the needs-release-note label Nov 17, 2024
Successfully merging this pull request may close these issues.

BUG: special: Loss of precision in logsumexp
5 participants