promql: Re-introduce direct mean calculation #16773

beorn7 · 2025-06-24T14:09:48Z

See the commit description for details, as well as the discussion in #16714.

(The very short version is that I was wrong assuming that the incremental mean calculation combined with Kahan summation in the improved way introduced in #16569 is always at least as good as the previous approach.)

/cc @charleskorn @crush-on-anechka

promql/functions.go

krajorama

LGTM, but I'm not super familiar with the subject, so I suggest another pair of eyes.

beorn7 · 2025-06-25T13:17:24Z

Cleaned up the commits.

beorn7 · 2025-06-25T13:18:29Z

@charleskorn might be a suitable reviewer. (His findings ultimately triggered this.)

promql/functions.go

bboreham · 2025-06-25T14:25:47Z

Can you summarise what the net difference of #16569 and this PR is, besides adding tests?

beorn7 · 2025-06-25T15:20:39Z

> Can you summarise what the net difference of #16569 and this PR is, besides adding tests?

I did that in the commit description:

This commit brings back direct mean calculation (for `avg` and
`avg_over_time`) but isn't an outright revert of https://github.com/prometheus/prometheus/pull/16569. It keeps the
improved incremental mean calculation and features generally a bit
cleaner code than before.

Also, this commit...

- ...updates the lengthy comment explaining the whole situation and
  trade-offs.

- ...divides the running sum and the Kahan compensation term
  separately (in direct mean calculation) to avoid the (unlikely)
  possibility that sum and Kahan compensation together overflow
  float64.

- ...uncomments the tests that should now work again on darwin/arm64.

- ...uncomments the test that should now reliably yield the
  (inaccurate) value 0 on all hardware platforms. Also, the test
  description has been updated accordingly.

- ...adds avg_over_time tests for zero and one sample in the range.
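
The approach described in the commit message can be sketched roughly as follows. This is a minimal illustration and not the code from `promql/functions.go`: the names `mean` and `kahanSumInc`, the plain `[]float64` input, and the choice to return NaN for an empty input are all assumptions made for the example. It sums directly with Kahan (Neumaier) compensation, divides the running sum and the compensation term separately at the end, and switches to incremental mean calculation only once the running sum would overflow float64.

```go
package main

import (
	"fmt"
	"math"
)

// kahanSumInc adds inc to sum and returns the new sum together with the
// updated compensation term c (Kahan summation with Neumaier's improvement).
func kahanSumInc(inc, sum, c float64) (newSum, newC float64) {
	t := sum + inc
	switch {
	case math.IsInf(t, 0):
		// The sum overflowed, so the compensation term is meaningless now.
		c = 0
	case math.Abs(sum) >= math.Abs(inc):
		c += (sum - t) + inc
	default:
		c += (inc - t) + sum
	}
	return t, c
}

// mean averages vals directly (compensated sum divided by count) and only
// switches to incremental mean calculation once the running sum overflows.
// Returning NaN for an empty slice is an assumption of this sketch.
func mean(vals []float64) float64 {
	if len(vals) == 0 {
		return math.NaN()
	}
	var (
		// Pre-set the first sample so the loop starts with the second.
		sum, count  = vals[0], 1.0
		avg, c      float64
		incremental bool
	)
	for i, v := range vals[1:] {
		count = float64(i + 2)
		if !incremental {
			newSum, newC := kahanSumInc(v, sum, c)
			if !math.IsInf(newSum, 0) {
				sum, c = newSum, newC
				continue
			}
			// Overflow: carry on with the mean of the samples seen so far.
			incremental = true
			avg = sum / (count - 1)
			c /= count - 1
		}
		// Incremental update: rescale the old mean and add the new share.
		q := (count - 1) / count
		avg, c = kahanSumInc(v/count, q*avg, q*c)
	}
	if incremental {
		return avg + c
	}
	// Divide sum and compensation separately so that adding the two
	// before the division cannot overflow float64.
	return sum/count + c/count
}

func main() {
	fmt.Println(mean([]float64{1, 2, 3}))
	fmt.Println(mean([]float64{1e100, 1, -1e100}))
	fmt.Println(mean([]float64{math.MaxFloat64, math.MaxFloat64}))
}
```

In the second call, a plain running sum would lose the `1` entirely because `1e100 + 1` rounds back to `1e100`; the compensation term is what preserves it. The third call exercises the overflow path, where the direct sum becomes infinite and the sketch falls back to the incremental update.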

charleskorn

LGTM modulo suggestion for another test case.

Thanks for fixing this @beorn7.

promql/promqltest/testdata/aggregators.test

beorn7 · 2025-06-27T12:35:20Z

Thanks. I'll merge on green.

beorn7 requested review from bboreham and krajorama June 24, 2025 14:09

beorn7 requested a review from roidelapluie as a code owner June 24, 2025 14:09

beorn7 removed the request for review from roidelapluie June 24, 2025 14:10

beorn7 mentioned this pull request Jun 24, 2025

Float histograms: implement methods for Add/Sub operations using Kahan summation #15687

Open

krajorama reviewed Jun 25, 2025

promql/functions.go (outdated, resolved)

beorn7 force-pushed the beorn7/promql branch from 4aa2cb8 to ec0945b June 25, 2025 12:32

krajorama approved these changes Jun 25, 2025

beorn7 force-pushed the beorn7/promql branch from ec0945b to 4f900fb June 25, 2025 13:17

bboreham reviewed Jun 25, 2025

promql/functions.go (resolved)

charleskorn approved these changes Jun 26, 2025

promql/promqltest/testdata/aggregators.test (resolved)

beorn7 force-pushed the beorn7/promql branch from 4f900fb to 39896b3 June 26, 2025 13:34

bboreham approved these changes Jun 27, 2025

beorn7 added 3 commits June 27, 2025 14:34

promql: Add test cases for direct mean calculation

2b3fc1f

These demonstrate that direct mean calculation has some merits after all.

Signed-off-by: beorn7 <beorn@grafana.com>

promql: Remove falsified comment from test

f71daa7

The test in question actually worked fine even before #16569. The finding reported in the comment has turned out to be caused by something else.

Signed-off-by: beorn7 <beorn@grafana.com>

beorn7 force-pushed the beorn7/promql branch from 39896b3 to ce809e6 June 27, 2025 12:34

beorn7 enabled auto-merge June 27, 2025 12:35

bboreham mentioned this pull request Jun 27, 2025

Prepare release 3.5.0-rc.0 #16778

Merged

beorn7 merged commit 9e73fb4 into main Jun 27, 2025
43 checks passed

beorn7 deleted the beorn7/promql branch June 27, 2025 12:57

beorn7 mentioned this pull request Jun 27, 2025

promql: Numerical accuracy issues with mean calculation #16714

Closed

crush-on-anechka added a commit to crush-on-anechka/prometheus that referenced this pull request Jul 10, 2025

Fix average calculation to match prometheus#16773 behavior

12b7d96

Signed-off-by: Aleksandr Smirnov <5targazer@mail.ru>
