Databricks: Prevent parsing error when reading from a streaming file #6910

cpwithers · 2025-05-21T11:16:05Z

Brief summary of the change made

Add STREAM as a pre table keyword for the databricks dialect as this is causing parsing issues due to the parser mistaking the keyword and function as a table name and alias:

Taking this example query: SELECT * FROM STREAM read_files('gs://my-bucket/avroData', includeExistingFiles => false);

The parser was producing the following:

[L:  1, P: 10]      |            from_clause:
// Removed
[L:  1, P: 15]      |                                naked_identifier:             'STREAM'
[L:  1, P: 21]      |                        whitespace:                           ' '
[L:  1, P: 22]      |                        alias_expression:
[L:  1, P: 22]      |                            naked_identifier:                 'read_files'

After this change the output is:

[L:  1, P: 10]      |            from_clause:
// Removed
[L:  1, P: 15]      |                    from_expression_element:
[L:  1, P: 15]      |                        keyword:                              'STREAM'
[L:  1, P: 21]      |                        whitespace:                           ' '
[L:  1, P: 22]      |                        table_expression:
[L:  1, P: 22]      |                            table_reference:
[L:  1, P: 22]      |                                naked_identifier:             'READ_FILES'

Fixes: #6414

Are there any other side effects of this change that we should be aware of?

None

Pull Request checklist

Please confirm you have completed any of the necessary steps below.
Included test cases to demonstrate any code changes, which may be one or more of the following:
- .yml rule test cases in test/fixtures/rules/std_rule_cases.
- .sql/.yml parser test cases in test/fixtures/dialects (note YML files can be auto generated with tox -e generate-fixture-yml).

keraion · 2025-05-21T14:15:37Z

test/fixtures/dialects/databricks/select_from_read_file.sql

+    modifiedBefore => current_date());
+
+-- Reads a streaming table
+SELECT * FROM STREAM read_files('gs://my-bucket/avroData', includeExistingFiles => false);


nit: newline

fixed, thanks.

…file. Fixes: sqlfluff#6414

github-actions · 2025-05-21T16:58:05Z

Coverage Results ✅

Name    Stmts   Miss  Cover   Missing
-------------------------------------
TOTAL   19692      0   100%

249 files skipped due to complete coverage.

keraion

LGTM. Thanks for the contribution!

…qlfluff#6910) Co-authored-by: Chris Withers <chris.withers@flagstoineim.com>

keraion reviewed May 21, 2025

View reviewed changes

feat(databricks): Prevent parsing issue when reading from a streamed …

1e18b62

…file. Fixes: sqlfluff#6414

cpwithers force-pushed the feat/databricks-dialect-update branch from b9e53b8 to 1e18b62 Compare May 21, 2025 15:40

keraion approved these changes May 22, 2025

View reviewed changes

keraion added this pull request to the merge queue May 22, 2025

Merged via the queue into sqlfluff:main with commit 518970e May 22, 2025
28 checks passed

cpwithers deleted the feat/databricks-dialect-update branch May 22, 2025 08:36

thomascjohnson pushed a commit to thomascjohnson/sqlfluff that referenced this pull request Jun 17, 2025

Databricks: Prevent parsing error when reading from a streaming file (s…

797a0be

…qlfluff#6910) Co-authored-by: Chris Withers <chris.withers@flagstoineim.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Databricks: Prevent parsing error when reading from a streaming file #6910

Databricks: Prevent parsing error when reading from a streaming file #6910

Uh oh!

cpwithers commented May 21, 2025

Uh oh!

keraion May 21, 2025 •

edited

Loading

Uh oh!

cpwithers May 21, 2025

Uh oh!

github-actions bot commented May 21, 2025

Uh oh!

keraion left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Databricks: Prevent parsing error when reading from a streaming file #6910

Databricks: Prevent parsing error when reading from a streaming file #6910

Uh oh!

Conversation

cpwithers commented May 21, 2025

Brief summary of the change made

Are there any other side effects of this change that we should be aware of?

Pull Request checklist

Uh oh!

keraion May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cpwithers May 21, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented May 21, 2025

Coverage Results ✅

Uh oh!

keraion left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

keraion May 21, 2025 •

edited

Loading