Implementation changes to BufWriter #78551

Lucretiel · 2020-10-30T02:30:40Z

This PR contains some proposed implementation updates to BufWriter, with a focus on reducing total device write calls in the average case. These updates are based on lessons learned from the design of LineWriterShim, as well as some discussions with @workingjubilee on this topic.

There are three main changes to how BufWriter works:

write_all now fully fills the buffer before flushing. Previously, it would fill the buffer as much as possible without splitting any incoming writes. However, we assume that a caller of write_all is foremost interested in minimizing write calls to the underlying device, which we can best achieve by maximizing the size of buffers we flush. Of course, incoming data that exceeds the total size of the buffer is still forwarded directly to the underlying device.
- Conversely, we assume that a caller of write would prefer to have unbroken writes if possible (that is, would prefer Ok(n) where n == buf.len()). The implementation of write is therefore left unchanged with regard to buffer filling.
write_vectored is unchanged when inner.is_write_vectored(). However, in the case where the underlying device doesn't offer any specialization, we now take care to buffer together all the incoming sub-bufs, even if the total size exceeds our buffer, and only fall back to directly forwarded writes in the case that an individual slice exceeds our buffer size. As a result, BufWriter now also unconditionally returns true for is_write_vectored.
Added flush_buf_vectored(), a new internal method that is similar to flush_buf, but additionally attempts to use vectored operations to send the new incoming data along with the existing buffered data in a single operation. I took care to ensure that this never results in more system calls than it would have already; in particular, it immediately stops forwarding as soon as the existing buffer is fully forwarded, even if 0 new bytes have been sent. write and write_all were refactored to take advantage of it, and write_vectored was slightly modified to forward to write if exactly 1 buffer is being written.
Additionally, as a much more minor change, removed several unnecessary uses of the self.panicked guard fences. BufWriter::panicked is about preventing duplicate writes of buffered data, so it isn't necessary to fence writes to the underlying device when the buffer is known to be empty.

Follow up items

Tasks

Test the new behavior

Open questions

Stuff I want to call out and ensure is addressed in review:

Do the assumptions underpinning the BufWriter changes make sense.
Would it make sense for BufWriter to unconditionally return true for is_write_buffered, since it specializes vectored writes by buffering them together, even when the underlying device offers no such specialization?

This was split off to a separate PR from #78515

Changes to some BufWriter Write methods, with a focus on reducing total device write calls by fully filling the buffer in some cases

rust-highfive · 2020-10-30T02:30:44Z

r? @joshtriplett

(rust_highfive has picked a reviewer for you, use r? to override)

library/std/src/io/buffered/bufwriter.rs

m-ou-se · 2020-10-30T10:54:04Z

Do the assumptions underpinning the BufWriter changes make sense.

I'm slightly worried about splitting things from write_all when the data could fit in the buffer if it was flushed first.

The documentaion here says:

This method will continuously call write until there is no more data to be written or an error of non-ErrorKind::Interrupted kind is returned.

Strictly speaking it doesn't specify that it will call write on the whole buffer, but I think that's what it implies. It's not a weird assumption to make that a write_all() to a BufWriter with less than .capacity() bytes will be not be split. It might be fine to break this assumption, but right now there's no way to avoid it with write_all, as BufWriter does not expose anything like a write_out_the_internal_buffer() function that makes room for the next write_all() without also calling flush() on the underlying buffer.

Would it make sense for BufWriter to unconditionally return true for is_write_buffered, since it specializes vectored writes by buffering them together, even when the underlying device offers no such specialization?

The documentation says:

Determines if this Writeer has an efficient write_vectored implementation.

If a Writeer does not override the default write_vectored implementation, code using it may want to avoid the method all together and coalesce writes into a single buffer for higher performance.

So I'd say yes, this write_vectored implementation qualifies. You'd not want a user of BufWriter to manually buffer things to call write instead of write_vectored.

- BufWriter now makes use of vectored writes when flushing; it attempt to write both buffered data and incoming data in a single operation when it makes sense to do so. - LineWriterShim takes advantage of BufWriter's new "vectored flush" operation in a few places - Fixed a failing test. No new tests yet.

- Added BufWriter::available; used it to simplify various checks - Refactored write to make it more simple; extensively recommented it

- Fixed bugs in write implementation; decompressed it to make the flow more clear. - Replaced several uses of .capacity() with .available(); in these cases they're identical (because they always occur after a completed flush) but available makes more sense conceptually.

library/std/src/io/buffered/bufwriter.rs

Lucretiel · 2020-11-01T02:09:43Z

I'm slightly worried about splitting things from write_all when the data could fit in the buffer if it was flushed first.

Yeah, I went back on forth on this. The rationale I settled on is that, while write tries as much as is reasonable to not split the incoming buffer, a caller of write_all is probably more interested in all the data being processed as efficiently as possible, since it's known that write_all will loop as much as it needs to to ingest all the given data.

So I'd say yes, this write_vectored implementation qualifies. You'd not want a user of BufWriter to manually buffer things to call write instead of write_vectored.

Done!

Lucretiel · 2020-11-01T02:12:35Z

I added a section to the PR description, but I wanted to call it out in a new comment: I revised the PR so that write and write_all now take advantage of vectored writes when possible. Basically they'll pair the buffer and the incoming bytes into a vectored write when flushing.

library/std/src/io/buffered/bufwriter.rs

mzabaluev · 2020-11-07T22:10:46Z

library/std/src/io/buffered/bufwriter.rs

+        if let Some(buf) = only_one(bufs, |b| !b.is_empty()) {
+            // If there's exactly 1 incoming buffer, `Self::write` can make
+            // use of self.inner.write_vectored to attempt to combine flushing
+            // the existing buffer with writing the new one.
+            self.write(buf)


I'm not sure this special case is worth optimizing for. write_vectored is used when there are multiple slices to write, and all the other paths handle occasionally empty slices correctly. The added branching here adds overhead to the most typical case of multiple non-empty slices.

Hmm. I'll see if I can whip up a benchmark; my instinct is that branch prediction means that this overhead will be completely negligible (since, as you said, most of the time there'll be several inputs.)

The major point of differentiation between write and write_vectored here doesn't have anything to do with empty slices; it's that write gets to use flush_buf_vectored.

I went back and forth a lot on this, but I'm coming around to getting rid of it since, yeah, callers will likely end up calling the correct method. I've reviewed my own use of write_vectored in this PR (flush_buf_vectored etc) and confirmed that this usually ends up being the case. I'd love to get another opinion though (@m-ou-se?)

library/std/src/io/buffered/bufwriter.rs

Dylan-DPC-zz · 2020-11-24T19:02:16Z

@Lucretiel any updates on this (i know it is waiting on review but if you can address the last review before we get this reviewed by @joshtriplett it would be better)

Lucretiel · 2020-11-24T21:23:21Z

Yeah, my apologies, I've been busy with other stuff and honestly the election kind of took over all of my free cognitive space. I can follow up with this at least by this weekend.

Lucretiel · 2020-11-29T01:08:28Z

Strictly speaking it doesn't specify that it will call write on the whole buffer, but I think that's what it implies. It's not a weird assumption to make that a write_all() to a BufWriter with less than .capacity() bytes will be not be split. It might be fine to break this assumption, but right now there's no way to avoid it with write_all, as BufWriter does not expose anything like a write_out_the_internal_buffer() function that makes room for the next write_all() without also calling flush() on the underlying buffer.

The main reason I'm willing to break them up in this case is that write_all has always expressed to me "do whatever it takes to write the entire buffer". In particular, because write_all freely retries writes, it makes sense to me that a caller would be fine with the writes being split if the underlying system does so.

Additionally, by far the most common caller of write_all is going to be write! and friends, and in that case I'd assume the caller is especially uninterested in non-split writes, since it's going to end up doing several tiny writes anyway.

Lucretiel · 2020-11-29T18:30:23Z

Found a corner-case bug in this implementation, please don't merge until I update & write a regression test

- Found and fixed a bug where write_vectored could, in some circumstances, forward a write directly to the inner writer (skipping the buffer) without first flushing the buffer. - Added a regression test for this bug.

Lucretiel · 2020-11-29T19:19:19Z

Fixed

bors · 2020-12-09T04:31:44Z

☔ The latest upstream changes (presumably #78768) made this pull request unmergeable. Please resolve the merge conflicts.

Note that reviewers usually do not review pull requests until merge conflicts are resolved! Once you resolve the conflicts, you should change the labels applied by bors to indicate that your PR is ready for review. Post this as a comment to change the labels:

@rustbot modify labels: +S-waiting-on-review -S-waiting-on-author

tgnottingham · 2020-12-10T11:38:17Z

I'm working on some changes that may significantly improve BufWriter performance, incidentally as part of using BufWriter more heavily in the compiler. The changes are simple, but may be going in a different direction from this PR. They may be reconcilable though. I'll try to review and comment when I have a chance.

Dylan-DPC-zz · 2021-01-11T22:37:27Z

@Lucretiel any updates?

Dylan-DPC-zz · 2021-03-02T01:17:39Z

thanks for taking the time to contribute. I have to close this due to inactivity. If you wish and you have the time you can open a new PR with these changes and we'll take it from there. Thanks

Implementation changes to BufWriter

6a6a830

Changes to some BufWriter Write methods, with a focus on reducing total device write calls by fully filling the buffer in some cases

rust-highfive assigned joshtriplett Oct 30, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 30, 2020

Lucretiel mentioned this pull request Oct 30, 2020

Switchable buffering for Stdout #78515

Closed

9 tasks

camelid added T-libs Relevant to the library team, which will review and decide on the PR/issue. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. labels Oct 30, 2020

m-ou-se reviewed Oct 30, 2020

View reviewed changes

Lucretiel added 6 commits October 30, 2020 10:35

Edge case optimized in write_vectored

e961fe1

Various additional updates

41575e4

- Added BufWriter::available; used it to simplify various checks - Refactored write to make it more simple; extensively recommented it

Missing import

c022282

Elaborated on the exit condition commentary of write_vectored

414748d

the8472 reviewed Oct 30, 2020

View reviewed changes

library/std/src/io/buffered/bufwriter.rs Outdated Show resolved Hide resolved

Lucretiel mentioned this pull request Oct 31, 2020

Trim methods on slices rust-lang/rfcs#2547

Open

the8472 reviewed Oct 31, 2020

View reviewed changes

library/std/src/io/buffered/bufwriter.rs Show resolved Hide resolved

Added tests; fixed bugs

b7ca9f1

Lucretiel added 4 commits November 1, 2020 00:17

revert pointless change to Drop

0a9721c

x.py fmt still disagrees with rustfmt from the editor

4ee3bca

Replacee TODO with FIXME, per tidy

9d23026

Update comments in tests

89dc1ac

m-ou-se mentioned this pull request Nov 5, 2020

Use is_write_vectored to optimize the write_vectored implementation for BufWriter #78768

Merged

mzabaluev reviewed Nov 5, 2020

View reviewed changes

library/std/src/io/buffered/bufwriter.rs Show resolved Hide resolved

mzabaluev reviewed Nov 6, 2020

View reviewed changes

library/std/src/io/buffered/bufwriter.rs Outdated Show resolved Hide resolved

mzabaluev reviewed Nov 7, 2020

View reviewed changes

Cleanup & Optimize only_one function

5ef0d7a

simplify tail! macro

3ef40ea

Fix bug in BufWriter::write_vectored

6a73f0f

- Found and fixed a bug where write_vectored could, in some circumstances, forward a write directly to the inner writer (skipping the buffer) without first flushing the buffer. - Added a regression test for this bug.

tgnottingham mentioned this pull request Dec 11, 2020

Optimize BufWriter #79930

Merged

JohnCSimon added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jan 11, 2021

JohnCSimon added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Feb 8, 2021

JohnCSimon added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Mar 1, 2021

Dylan-DPC-zz closed this Mar 2, 2021

Dylan-DPC-zz added S-inactive Status: Inactive and waiting on the author. This is often applied to closed PRs. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Mar 2, 2021

the8472 mentioned this pull request Jan 31, 2023

io: soften ‘at most one write attempt’ requirement in io::Write::write #107200

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation changes to BufWriter #78551

Implementation changes to BufWriter #78551

Lucretiel commented Oct 30, 2020 •

edited

Loading

rust-highfive commented Oct 30, 2020

m-ou-se commented Oct 30, 2020

Lucretiel commented Nov 1, 2020

Lucretiel commented Nov 1, 2020

mzabaluev Nov 7, 2020

Lucretiel Nov 8, 2020 •

edited

Loading

Lucretiel Nov 29, 2020

Dylan-DPC-zz commented Nov 24, 2020

Lucretiel commented Nov 24, 2020

Lucretiel commented Nov 29, 2020

Lucretiel commented Nov 29, 2020

Lucretiel commented Nov 29, 2020

bors commented Dec 9, 2020

tgnottingham commented Dec 10, 2020

Dylan-DPC-zz commented Jan 11, 2021

Dylan-DPC-zz commented Mar 2, 2021

Implementation changes to BufWriter #78551

Implementation changes to BufWriter #78551

Conversation

Lucretiel commented Oct 30, 2020 • edited Loading

Follow up items

Tasks

Open questions

rust-highfive commented Oct 30, 2020

m-ou-se commented Oct 30, 2020

Lucretiel commented Nov 1, 2020

Lucretiel commented Nov 1, 2020

mzabaluev Nov 7, 2020

Choose a reason for hiding this comment

Lucretiel Nov 8, 2020 • edited Loading

Choose a reason for hiding this comment

Lucretiel Nov 29, 2020

Choose a reason for hiding this comment

Dylan-DPC-zz commented Nov 24, 2020

Lucretiel commented Nov 24, 2020

Lucretiel commented Nov 29, 2020

Lucretiel commented Nov 29, 2020

Lucretiel commented Nov 29, 2020

bors commented Dec 9, 2020

tgnottingham commented Dec 10, 2020

Dylan-DPC-zz commented Jan 11, 2021

Dylan-DPC-zz commented Mar 2, 2021

Lucretiel commented Oct 30, 2020 •

edited

Loading

Lucretiel Nov 8, 2020 •

edited

Loading