Skip to content

UTF-8 with BOM not working properly #2908

@kimmolinna

Description

@kimmolinna

When you load a test.csv file with UTF-8 BOM into table

D CREATE TABLE test AS SELECT * FROM read_csv_auto('test.csv');
D SELECT * FROM test;
┌─────┬────────┐
│ id │  name  │
├─────┼────────┤
│ 1   │ Mark   │
│ 2   │ Hannes │
└─────┴────────┘

everything seems to fine until you want to select the first column:

D SELECT id FROM test;
Error: Binder Error: Referenced column "id" not found in FROM clause!
Candidate bindings: "test. id", "test.name"

and then you notice that there is one extra character in the name of the first column. The file is working without a problem if you change the encoding of the file to UTF-8.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions