pdet
Contributor

@pdet pdet commented Jan 28, 2025

With strict_mode set to True, the CSV parser will only accept files that consistently use either \r\n, \r, or \n as line endings. Any mixture of these line endings will result in a failure, accompanied by a suggestion to set strict_mode=False.

Fix: #15783
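
For illustration only, the rule described above can be sketched in a few lines of Python. This is not the code from this PR (the DuckDB CSV parser is written in C++), and the names `line_endings_used` and `check_line_endings` are hypothetical. The sketch also ignores quoting, so a newline inside a quoted field is counted like any other, which is an assumption a real parser would not make.

```python
from __future__ import annotations

import re

# Matches one line terminator. "\r\n" is listed first so that it is
# counted as a single terminator rather than as "\r" followed by "\n".
_NEWLINE = re.compile(rb"\r\n|\r|\n")


def line_endings_used(data: bytes) -> set[bytes]:
    """Return the distinct line terminators that occur in data."""
    return set(_NEWLINE.findall(data))


def check_line_endings(data: bytes, strict_mode: bool = True) -> None:
    """Raise ValueError if strict_mode is on and data mixes line terminators.

    Hypothetical helper: it mirrors the behaviour described in the PR text,
    including the hint to disable strict_mode, but not DuckDB's actual code.
    """
    found = line_endings_used(data)
    if strict_mode and len(found) > 1:
        kinds = ", ".join(sorted(repr(ending) for ending in found))
        raise ValueError(
            f"mixed line endings ({kinds}); "
            "set strict_mode=False to accept this file"
        )


if __name__ == "__main__":
    check_line_endings(b"a,b\r\n1,2\r\n")                   # consistent: accepted
    check_line_endings(b"a,b\r\n1,2\n", strict_mode=False)  # mixed, but lenient
    try:
        check_line_endings(b"a,b\r\n1,2\n")                 # mixed and strict
    except ValueError as err:
        print(err)
```

A file that uses a single terminator throughout passes in either mode; only a mixture is rejected, and only when `strict_mode` is on.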

Mytherin added a commit that referenced this pull request Feb 11, 2025
At some point, one of the CI refactors disabled the Parallel CSV Tests.

This PR makes the CI run them again. All tests are slow, and they take
about 20 seconds, so the extra flag is not really necessary.

The file `data/csv/mixed_new_line.csv` is breaking, but should be fixed
with: #15959

Hence I'm skipping it for now.
@duckdb-draftbot duckdb-draftbot marked this pull request as draft February 26, 2025 13:01
@pdet pdet marked this pull request as ready for review February 26, 2025 13:01
@pdet pdet requested a review from Tishj February 26, 2025 13:01
@Mytherin
Collaborator

This still seems to be bundling too many files

@duckdb-draftbot duckdb-draftbot marked this pull request as draft March 11, 2025 21:04
@pdet pdet marked this pull request as ready for review March 11, 2025 21:04
@duckdb-draftbot duckdb-draftbot marked this pull request as draft March 11, 2025 21:29
@pdet pdet marked this pull request as ready for review March 11, 2025 21:30
@Mytherin
Collaborator

Thanks! Looks good now - can you just resolve the merge conflict?

@duckdb-draftbot duckdb-draftbot marked this pull request as draft March 12, 2025 08:59
@pdet pdet marked this pull request as ready for review March 12, 2025 09:00
@pdet
Contributor Author

pdet commented Mar 13, 2025

> Thanks! Looks good now - can you just resolve the merge conflict?

Done! :-)

@Mytherin Mytherin merged commit a441204 into duckdb:main Mar 18, 2025
50 checks passed
@Mytherin
Collaborator

Thanks!

krlmlr added a commit to duckdb/duckdb-r that referenced this pull request May 15, 2025
Make CSV Parser strict_mode=True fail on a mix of new line delimiters. (duckdb/duckdb#15959)
[Julia] support binding for vectors (duckdb/duckdb#16701)
JSON Reader: make read_position atomic so this can be read by the progress bar while processing the JSON file (duckdb/duckdb#16692)
krlmlr added a commit to duckdb/duckdb-r that referenced this pull request May 16, 2025
Make CSV Parser strict_mode=True fail on a mix of new line delimiters. (duckdb/duckdb#15959)
[Julia] support binding for vectors (duckdb/duckdb#16701)
JSON Reader: make read_position atomic so this can be read by the progress bar while processing the JSON file (duckdb/duckdb#16692)
krlmlr added a commit to duckdb/duckdb-r that referenced this pull request May 17, 2025
Make CSV Parser strict_mode=True fail on a mix of new line delimiters. (duckdb/duckdb#15959)
[Julia] support binding for vectors (duckdb/duckdb#16701)
JSON Reader: make read_position atomic so this can be read by the progress bar while processing the JSON file (duckdb/duckdb#16692)
@pdet pdet deleted the rfc_4180_stricter branch May 28, 2025 11:00
Development

Successfully merging this pull request may close these issues.

CSV reader with new_line:='\r\n' should reject file with bare \r unless rfc_4180:=false
2 participants