
Conversation

quentingodeau (Contributor)

Hello,

While debugging I saw that the BufferedFileWriter was splitting incoming buffers into smaller ones before writing them.

# Test data
curl https://us-prd-motherduck-open-datasets.s3.amazonaws.com/hacker_news/parquet/hacker_news_2021_2022.parquet -o data/hacker_news_2021_2022.parquet

# Tests
for i in {1..10}; do rm out.parquet; time ./build/debug/duckdb -c "COPY (SELECT * FROM '../data/hacker_news_2021_2022.parquet' ORDER BY 'id') TO './out.parquet'"; done >with_changes.log 2>&1

Here are the results. I was expecting a bigger improvement...

Before the change
without_changes.log

After
with_changes.log

But I still propose this PR because, depending on the underlying filesystem, it may be preferable to have larger buffers.

Flush(); // Flush the buffer before writing everything else
}
idx_t remaining_to_write = write_size - to_copy;
fs.Write(*handle, (void *)(buffer + to_copy), remaining_to_write);
quentingodeau (Contributor, Author)

This is the only point that bothers me... I used a C-style cast here because the input data is of type const std::uint8_t...

@Tishj (Contributor), Mar 16, 2024

That looks fine to me; it's important that the + to_copy is done before the cast, and you did just that 👍
However, we don't like having C-style casts; use reinterpret_cast instead here.

quentingodeau (Contributor, Author)

In this case, unfortunately, it's a const_cast operation :(
cf. 3931b0b

// Perform direct IO; the buffer to write is larger than the internal buffer,
// so it does not make sense to split a larger buffer into smaller ones
idx_t to_copy = 0;
if (offset != 0) {
Tishj (Contributor)

Can this comment be clarified further?
It sounds like we're not appending to our buffer and flushing it as is, but in reality we're filling our buffer up to FILE_BUFFER_SIZE and then flushing it.

quentingodeau (Contributor, Author)

Is it better? 3931b0b

@github-actions github-actions bot marked this pull request as draft March 17, 2024 10:04
@quentingodeau quentingodeau marked this pull request as ready for review March 17, 2024 10:06
@samansmink (Contributor) left a comment

Hey @quentingodeau thanks for the PR! I added one comment, the rest looks good!

@github-actions github-actions bot marked this pull request as draft March 19, 2024 06:12
@quentingodeau quentingodeau marked this pull request as ready for review March 19, 2024 06:16
@Mytherin (Collaborator) left a comment

Thanks! Looks good - this is a good idea. One comment:

to_copy = FILE_BUFFER_SIZE - offset;
memcpy(data.get() + offset, buffer, to_copy);
offset += to_copy;
Flush(); // Flush the buffer before writing everything else
Mytherin (Collaborator)

Can we call Flush() followed by fs.Write() of the entire incoming buffer below without first copying part of the buffer into the existing buffer? I don't quite see the benefit of first copying part of the buffer into the original file buffer, and it complicates the code here. Two write calls are happening in both cases.

quentingodeau (Contributor, Author)

In my opinion, the underlying FS expects buffers sized to at least the configured size. On a remote FS I think this may impact performance (depending, of course, on the implementation), and I wanted to avoid the underlying FS adding an extra layer of buffering.
This is only my opinion and you have the final word on this. Please confirm whether or not you want me to make the change :)

Mytherin (Collaborator)

I can see that - let's keep it as-is then

@Mytherin (Collaborator)

Thanks!

@Mytherin Mytherin merged commit 2626cb2 into duckdb:main Mar 21, 2024
github-actions bot pushed a commit to duckdb/duckdb-r that referenced this pull request Mar 21, 2024
Merge pull request duckdb/duckdb#11203 from quentingodeau/feature/buffer-writer
krlmlr added a commit to duckdb/duckdb-r that referenced this pull request Mar 23, 2024
Merge pull request duckdb/duckdb#11203 from quentingodeau/feature/buffer-writer
github-actions bot pushed a commit to duckdb/duckdb-r that referenced this pull request Mar 28, 2024
Merge pull request duckdb/duckdb#11203 from quentingodeau/feature/buffer-writer