Issue #6272: Window Scaled Repartitioning #6366

hawkfish · 2023-02-19T03:22:33Z

There is no need to copy the old partitioned data when repartitioning.
But if we have started to combine, then repartitioning will duplicate data,
so don't resize then.

There is no need to copy the old partitioned data when repartitioning. But if we have started to combine, then repartitioning will duplicate data, so don't resize then.

Mytherin · 2023-02-20T09:05:10Z

Thanks for the PR!

This still does not seem to solve the problem completely, though.

If I add a loop to the test:

# name: test/sql/window/test_window_repartition.test_slow
# description: Window reparitioning at scale
# group: [window]

statement ok
PRAGMA enable_verification

statement ok
create table df as 
	select d, i v1 
	from 
		range(date '2017-01-01', date '2020-12-31', interval '1' day) t(d), 
		range(3000) i
	;

loop i 0 100

query I
select count(*) 
from (
	select percent_rank() over (partition by d order by v1) as rank_v1 
	from df
);
----
4380000

endloop

Then running it I eventually get an off-by-a small amount still.

[0/1] (0%): test/sql/window/test_window_repartition.test_slow                   ================================================================================
Wrong result in query! (test/sql/window/test_window_repartition.test_slow:18)!
================================================================================
select count(*) 
from (
	select percent_rank() over (partition by d order by v1) as rank_v1 
	from df
);
================================================================================
Mismatch on row 1, column 1
4380253 <> 4380000
================================================================================
Expected result:
================================================================================
4380000
================================================================================
Actual result:
================================================================================
4380253

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
unittest is a Catch v2.13.7 host application.
Run with -? for options

-------------------------------------------------------------------------------
test/sql/window/test_window_repartition.test_slow
-------------------------------------------------------------------------------
/Users/myth/Programs/duckdb-bugfix/test/sqlite/test_sqllogictest.cpp:178
...............................................................................

test/sql/window/test_window_repartition.test_slow:18: FAILED:
explicitly with message:
  0

[1/1] (100%): test/sql/window/test_window_repartition.test_slow                 
===============================================================================
test cases:  1 |  0 passed | 1 failed
assertions: 21 | 20 passed | 1 failed

This amount always seems to be less than the vector size, so perhaps something happening with a resize while a vector is already in-flight?

Mytherin · 2023-02-21T08:18:49Z

Disabling all in-flight resizing seems to fix the issue, but this may be too heavy-handed of a solution:

diff --git a/src/execution/operator/aggregate/physical_window.cpp b/src/execution/operator/aggregate/physical_window.cpp
index fbd7865f02..958c574efe 100644
--- a/src/execution/operator/aggregate/physical_window.cpp
+++ b/src/execution/operator/aggregate/physical_window.cpp
@@ -170,7 +170,7 @@ private:
 
 void WindowGlobalSinkState::ResizeGroupingData(idx_t cardinality) {
        //      Have we started to combine? Then just live with it.
-       if (grouping_data && !grouping_data->GetPartitions().empty()) {
+       if (grouping_data) {
                return;
        }
        //      Is the average partition size too large?

lnkuiper · 2023-02-22T10:18:30Z

I was able to reproduce this locally, and this seems to fix it:

void PartitionedColumnData::FlushAppendState(PartitionedColumnDataAppendState &state) {
        for (idx_t i = 0; i < state.partition_buffers.size(); i++) {
                auto &partition_buffer = *state.partition_buffers[i];
                if (partition_buffer.size() > 0) {
                        partitions[i]->Append(partition_buffer);
                        partition_buffer.Reset(); // Add this line!
                }
        }
}

The buffer needed to be reset. Nowhere else in the codebase did we use the append state after flushing it, so this was not a problem before.

Fix reentry problem in PartitionedColumnData::FlushAppendState. Make stress test more brutal and reliable.

…into window-pct-range

Richard Wesley added 2 commits February 19, 2023 16:19

Issue duckdb#6272: Window Scaled Repartitioning

0d76983

There is no need to copy the old partitioned data when repartitioning. But if we have started to combine, then repartitioning will duplicate data, so don't resize then.

Merge branch 'master' into window-pct-range

31ba73d

hawkfish requested a review from lnkuiper February 19, 2023 03:22

Mytherin linked an issue Feb 20, 2023 that may be closed by this pull request

percent_rank bugs #6272

Closed

2 tasks

Mytherin and others added 6 commits February 22, 2023 11:20

Reset buffer (thanks @lnkuiper)

191fc2a

Format

465f04a

Issue duckdb#6272: Window Scaled Repartitioning

62f3d15

Fix reentry problem in PartitionedColumnData::FlushAppendState. Make stress test more brutal and reliable.

Merge branch 'window-pct-range' of https://github.com/hawkfish/duckdb …

1aa0177

…into window-pct-range

Merge branch 'master' into window-pct-range

ad9f390

Update test_window_repartition.test_slow

1a47cf2

Mytherin merged commit d58ab18 into duckdb:master Feb 23, 2023

hawkfish deleted the window-pct-range branch March 3, 2023 21:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue #6272: Window Scaled Repartitioning #6366

Issue #6272: Window Scaled Repartitioning #6366

Uh oh!

hawkfish commented Feb 19, 2023

Uh oh!

Mytherin commented Feb 20, 2023 •

edited

Loading

Uh oh!

Mytherin commented Feb 21, 2023

Uh oh!

lnkuiper commented Feb 22, 2023

Uh oh!

Uh oh!

Issue #6272: Window Scaled Repartitioning #6366

Issue #6272: Window Scaled Repartitioning #6366

Uh oh!

Conversation

hawkfish commented Feb 19, 2023

Uh oh!

Mytherin commented Feb 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mytherin commented Feb 21, 2023

Uh oh!

lnkuiper commented Feb 22, 2023

Uh oh!

Uh oh!

Mytherin commented Feb 20, 2023 •

edited

Loading