Mytherin
Collaborator

We should bail out when dict_size * DICTIONARY_THRESHOLD is bigger than the number of rows in the chunk, not the other way around.

Note that in practice this code doesn't really get triggered right now because we only emit dictionary vectors with ids.
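
To make the direction of the comparison concrete, here is a minimal sketch of the corrected check. It is not the actual DuckDB code: the function name `ShouldBailOutOfDictionary`, the parameter names, and the value `2` for `DICTIONARY_THRESHOLD` are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical value, chosen only for this sketch; the real constant is
// defined in the DuckDB source and may differ.
constexpr uint64_t DICTIONARY_THRESHOLD = 2;

// Hypothetical helper: returns true when the dictionary is too large relative
// to the chunk for the dictionary-based path to be worthwhile.
bool ShouldBailOutOfDictionary(uint64_t dict_size, uint64_t chunk_rows) {
    // Guard the multiplication below against unsigned overflow.
    assert(dict_size <= UINT64_MAX / DICTIONARY_THRESHOLD);
    // Bail out when the scaled dictionary size exceeds the row count,
    // not when it is smaller.
    return dict_size * DICTIONARY_THRESHOLD > chunk_rows;
}
```

Under these assumed values, a dictionary of 2000 entries against a 2048-row chunk bails out (4000 > 2048), while a dictionary of 100 entries does not (200 is not greater than 2048).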

Mytherin referenced this pull request Feb 23, 2025
…and use this in the aggregate HT to cache look-ups
@waitinfuture

> we only emit dictionary vectors with ids

Thanks for the explanation!

@Mytherin Mytherin merged commit 77a4b98 into duckdb:v1.2-histrionicus Feb 24, 2025
47 of 48 checks passed
krlmlr added a commit to duckdb/duckdb-r that referenced this pull request Mar 7, 2025
Aggregation: For dictionaries without an id - use the correct threshold to bail-out on using the dictionary (duckdb/duckdb#16364)
@Mytherin Mytherin deleted the invertcondition branch April 2, 2025 09:24