Skip to content

gcs filesystem query that worked in 0.9.2 no longer works in 0.10.0 #10698

@cpcloud

Description

@cpcloud

What happens?

I cannot read GCS data using the filesystem integration, in DuckDB version 0.10. This used to work in 0.9.

I get this exception:

duckdb.duckdb.HTTPException: HTTP Error: HTTP GET error on 'https://ibis-examples.s3.amazonaws.com/data/band_members.csv.gz' (HTTP 403)

To Reproduce

import duckdb
import fsspec

con = duckdb.connect()
con.load_extension("httpfs")

gcs = fsspec.filesystem("gcs")
con.register_filesystem(gcs)
con.sql("from 'gcs://ibis-examples/data/band_members.csv.gz'")

The strange thing is that when I avoid loading the httpfs extension, this code works:

import duckdb
import fsspec

con = duckdb.connect()
# con.load_extension("httpfs")

gcs = fsspec.filesystem("gcs")
con.register_filesystem(gcs)
con.sql("from 'gcs://ibis-examples/data/band_members.csv.gz'")

OS:

Linux x86_64

DuckDB Version:

0.10.0

DuckDB Client:

Python

Full Name:

Phillip Cloud

Affiliation:

Voltron Data

Have you tried this on the latest nightly build?

I have tested with a release build (and could not test with a nightly build)

Have you tried the steps to reproduce? Do they include all relevant data and configuration? Does the issue you report still appear there?

  • Yes, I have

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions