[SPIKE] Don't truncate query CSVs #1820

fgregg · 2022-09-26T17:27:01Z

Relates to #526

This is a minimal set of changes needed for having query CSVs attempt to download all the rows.

What's good about it is the minimalism.

What's bad about it:

We are abusing the _size argument to indicate we don't want truncation, which isn't the most obvious thing. Additionally, there are various checks that make sure the "_size" URL parameter is a positive integer, which we are relying on to prevent overloading.
The default CSV on a table page will use the max_returned_rows argument. Changing this could be a breaking change, since that's currently a place that has some facilities for pagination. Additionally, i think there's a limit under the hood somewhere which if we removed could lead to sql timeouts
There are similar reasons for leaving the current streaming method alone, as the current methods could allow for downloading very large files that could have a sql timeout if we tried to get them in one go.

📚 Documentation preview 📚: https://datasette--1820.org.readthedocs.build/en/1820/

codecov · 2022-09-26T20:32:47Z

Codecov Report

Base: 92.50% // Head: 92.51% // Increases project coverage by +0.01% 🎉

Coverage data is based on head (9bead2a) compared to base (eff1124).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1820      +/-   ##
==========================================
+ Coverage   92.50%   92.51%   +0.01%     
==========================================
  Files          35       35              
  Lines        4400     4406       +6     
==========================================
+ Hits         4070     4076       +6     
  Misses        330      330

Impacted Files	Coverage Δ
datasette/app.py	`94.11% <ø> (ø)`
datasette/views/base.py	`94.80% <100.00%> (+0.05%)`	⬆️
datasette/views/database.py	`95.29% <100.00%> (+0.06%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

fgregg · 2022-09-27T00:03:09Z

the pattern in this PR max_returned_rows control the maximum rows rendered through html and json, and the csv render bypasses that.

i think it would be better to have each of these different query renderers have more direct control for how many rows to fetch, instead of relying on the internals of the execute method.

generally, users will not want to paginate through tens of thousands of results, but often will want to download a full query as json or as csv.

fgregg added 2 commits September 26, 2022 13:18

don't truncate csvs

1d715e9

pass through db.get

21cad83

fgregg closed this Sep 26, 2022

fgregg added 3 commits September 26, 2022 13:36

merge

e01920d

typo

31cc5e2

typo and blacken

b2fcb07

fgregg reopened this Sep 26, 2022

fgregg closed this Sep 26, 2022

fgregg added 2 commits September 26, 2022 14:20

overload _size

767e6cf

doc string and cleaner diff

ddcc8bc

fgregg reopened this Sep 26, 2022

fgregg changed the title ~~No limit csv~~ Don't truncate query CSVs Sep 26, 2022

fgregg changed the title ~~Don't truncate query CSVs~~ [SPIKE] Don't truncate query CSVs Sep 26, 2022

fgregg force-pushed the no_limit_csv branch from feea811 to ddcc8bc Compare September 26, 2022 21:12

fgregg marked this pull request as draft September 27, 2022 01:06

fgregg added 2 commits October 6, 2022 14:23

use inspect data for hash and file size

b4b92df

Merge branch 'cache_hash' into no_limit_csv

92bc146

fgregg force-pushed the no_limit_csv branch from 96fe5f2 to 92bc146 Compare October 6, 2022 20:48

fgregg added 8 commits October 6, 2022 16:50

Merge remote-tracking branch 'upstream/master' into no_limit_csv

c3cacd9

make hash and cached_size lazy properties

e907b52

move hash property near size

c3a6940

Merge remote-tracking branch 'upstream/main' into cache_hash

c12447e

Merge branch 'cache_hash' into no_limit_csv

6346e95

ro hack

5006378

hack out immutable in another place

bd7f714

typo

9bead2a

fgregg closed this Oct 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[SPIKE] Don't truncate query CSVs #1820

[SPIKE] Don't truncate query CSVs #1820

Uh oh!

fgregg commented Sep 26, 2022 •

edited

Loading

Uh oh!

codecov bot commented Sep 26, 2022 •

edited

Loading

Uh oh!

fgregg commented Sep 27, 2022

Uh oh!

Uh oh!

Uh oh!

[SPIKE] Don't truncate query CSVs #1820

[SPIKE] Don't truncate query CSVs #1820

Uh oh!

Conversation

fgregg commented Sep 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Sep 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

fgregg commented Sep 27, 2022

Uh oh!

Uh oh!

fgregg commented Sep 26, 2022 •

edited

Loading

codecov bot commented Sep 26, 2022 •

edited

Loading