Add stateless dictionary support #216
Conversation
Enable up to 8K dictionary for stateless compression.
I can't use this in WebSockets due to the 8 KB limit, unfortunately, since I need to support at least the default 32 KB rolling window. I suppose I could fall back. 🤔
Why? 8K should work just as well as 32K.
Not all clients support configurable window sizes. E.g. every other Go WebSocket library does not, since the standard library does not allow configuring it.
It's possible I'm confused, but as far as I understand, the window size must be the same on both client and server, so the truncation that would be performed here wouldn't work.
I think you may be misunderstanding what the dictionary is for. It is purely there for efficiency, and the compressor can have any history size; it only affects how well the data compresses. You can use the current stateless implementation and it will work fine with any client. It will just compress slightly worse.
https://godoc.org/compress/flate#NewWriterDict
If the dictionary is purely for efficiency, why do the docs state that the decompressor must be initialized with the same dictionary?
That is for a pre-shared dictionary, meaning something that is initialized before the stream starts. The decoder must have the dictionary, since the encoder may reference it. But the encoder does not have to reference the dictionary at all, or only parts of it, which is why an 8K dictionary is fine even if the decoder buffers up to 32K. For blocks, the entire 32K will be referenced, because the dictionary is a sliding window from the current decompression point. You are just limiting how far back the encoder can reference data. The decoder must be able to handle references 32K back, but from the point of view of the compressor, how far back to reference is just a matter of efficiency.
In terms of the documentation, what it means is that the decoder must have the dictionary to decompress the data correctly.
Ahh makes perfect sense. Will test in my lib this weekend. |
@nhooyr Have you had a chance to test it? |
Thanks for reminding me. I just finished the rewrite with compression. Here's the commit in which I add your library. All my tests pass, including the entire autobahn test suite with race detection enabled, so it definitely works well. I can't comment on performance yet, as I haven't written benchmarks. As soon as I do, I'll let you know.
The one thing I was curious about is why it doesn't return the number of bytes written. I need this information to accurately implement io.Writer: if an error occurs, it's possible not all of the bytes were written.
Updated commit: coder/websocket@7dbe93c |
@nhooyr Cool. I will take a peek :)
It doesn't really make much sense to keep track of that, since there isn't a 1:1 mapping between input and output. x bytes in will map to y bytes out, and a partial write will result in garbage data. Most writers consider a failed write unrecoverable, including the stdlib, which persists the error. If you want to keep track of it, you can buffer the output so writes cannot fail. But in general I find that trying to recover a broken write is unreliable at best. I'll leave that to you.
Fair. I noticed the stdlib writer just returns n = 0 if any error occurs.
Why don't we just encapsulate that into StatelessDeflate and have it return
I'm not sure what problem you are trying to solve, but there is also NewStatelessWriter if you want regular Write calls.
See #222 |
My benchmarks indicate your library is 2x faster than the stdlib, which is sweet. For non-stateless compression, it seems to allocate 2x as much, though. See #107 (comment). Regarding stateless compression, I'm seeing 19301 B allocated per op in my benchmark for echoing a 512-byte random message with an 8 KB dictionary. With no dictionary, it's around 100 B per op. I'm not sure how much memory it uses internally between calls; there doesn't seem to be an easy way to measure, as it uses a pool to keep memory usage down.
I disabled the bitWriterPool pool and I'm seeing 36931 B/op, which seems really good, too good to be true. Is there any other pool I need to disable?
@nhooyr The actual allocations depend on whether the content is expected to compress. You are probably hitting this: Line 108 in b7ccab8
This is an early bailout: if the content cannot be compressed, no Huffman tables are generated and the content is just written uncompressed.
So I changed my benchmark to use
Although no dictionary now allocates 15 KB/op. |
Nvm, ignore my last comment; it was due to the bit writer pool being disabled.
On https://github.com/nhooyr/websocket/tree/compress and with the bit writer pool enabled in klauspost/compress.
It's also definitely compressing: I'm seeing 16 total bytes written (including the websocket header) for the 512-byte message.
@nhooyr Did you ever doubt that :D |
Honestly, I did; the amount of memory being allocated is impressively low.
It can use the stack since it is stateless, plus other tricks, since it is only really useful for smaller blocks. It is (of course) quite inefficient, but something is better than nothing, and the speed is decent. Most important, of course, is that it doesn't keep memory between calls.