Skip to content

Conversation

Tishj
Copy link
Contributor

@Tishj Tishj commented Apr 2, 2025

This PR fixes #16942

As mentioned here DuckDB does not follow the IEEE-754 behavior for nan comparisons, instead it makes NaN the biggest value in the floating point ordering.

This PR adjusts the pyarrow dataset filters we create when pushing DuckDB table filters in to pyarrow.

@Tishj Tishj force-pushed the pyarrow_filter_pushdown_for_nan branch from ec21ac2 to d3970ae Compare April 2, 2025 10:24
@Tishj Tishj changed the base branch from main to v1.2-histrionicus April 2, 2025 10:25
@Tishj Tishj marked this pull request as draft April 2, 2025 10:25
@Tishj Tishj marked this pull request as ready for review April 2, 2025 10:25
@Mytherin Mytherin merged commit c4b2780 into duckdb:v1.2-histrionicus Apr 2, 2025
20 of 21 checks passed
@Mytherin
Copy link
Collaborator

Mytherin commented Apr 2, 2025

Thanks! Looks great

krlmlr added a commit to duckdb/duckdb-r that referenced this pull request Apr 8, 2025
Allow overriding the Printer::Print output destination (duckdb/duckdb#16909)
[Python] Fix PyArrow filter pushdown for NaN (duckdb/duckdb#16952)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Can't filter for NaN in polars replacement scan
2 participants