Skip to content

sniff_csv quote character detection regression in 1.2.0 #16215

@galloviolet

Description

@galloviolet

What happens?

On 1.1.3, sniff_csv detects " as the quote character in the following .csv file. On 1.2.0, sniff_csv detects ' as the quote character, which seems to be a regression.

.csv contents:

datecol,textcol
1/1/19,"text"
1/2/19,"text"
1/3/19,"text"
1/4/19,"text"
1/5/19,"text"
1/6/19,"text, with comma"
1/9/19,'text'

To Reproduce

FROM sniff_csv('myfile.csv', null_padding=true);

OS:

macOS Sequoia 15.2, arm64

DuckDB Version:

1.2.0

DuckDB Client:

Python

Hardware:

No response

Full Name:

Dustin Gallo

Affiliation:

DataChat

What is the latest build you tested with? If possible, we recommend testing with the latest nightly build.

I have tested with a stable release

Did you include all relevant data sets for reproducing the issue?

Yes

Did you include all code required to reproduce the issue?

  • Yes, I have

Did you include all relevant configuration (e.g., CPU architecture, Python version, Linux distribution) to reproduce the issue?

  • Yes, I have

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions