
Timeline for IANA dictionary registry? #1669

Closed
cldellow opened this issue Jul 2, 2019 · 11 comments

@cldellow

cldellow commented Jul 2, 2019

Hello - apologies if this isn't the best venue for this question. Please redirect me if that's the case!

https://tools.ietf.org/html/rfc8478#section-6.3 alludes to work in progress to provide pre-built dictionaries designed to optimize compressing certain types of content.

I have a use case (compressing many HTML files) that benefits from a dictionary -- even a small dictionary -- trained on HTML files. However, I'm hesitant to define an out-of-band process for distributing the dictionary and to become the steward for such a file, especially if such a standard dictionary may be coming soon, anyway.
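For concreteness, here is a minimal sketch of the effect a shared dictionary has on small documents like these. It uses Python's stdlib `zlib` preset-dictionary support as a stand-in for zstd (the mechanism is analogous), and the boilerplate "dictionary" and sample page are made-up illustrations, not a real trained dictionary:

```python
import zlib

# Hypothetical "dictionary": boilerplate shared by many HTML pages.
# A real zstd dictionary would be trained on a corpus of samples;
# zlib's preset-dictionary support just illustrates the same effect.
html_dict = (b'<!DOCTYPE html><html><head><meta charset="utf-8">'
             b'<title></title></head><body><div class="content">')

page = (b'<!DOCTYPE html><html><head><meta charset="utf-8">'
        b'<title>Example</title></head><body><div class="content">'
        b'Hello, world!</div></body></html>')

def compress(data, zdict=None):
    c = zlib.compressobj(level=9, zdict=zdict) if zdict else zlib.compressobj(level=9)
    return c.compress(data) + c.flush()

plain = compress(page)
with_dict = compress(page, html_dict)

# The dictionary lets the compressor back-reference the shared
# boilerplate instead of emitting it as literals.
print(len(plain), len(with_dict))

d = zlib.decompressobj(zdict=html_dict)
assert d.decompress(with_dict) + d.flush() == page
```

The gain is largest for small files, where shared boilerplate dominates the payload — which is exactly the case a trained HTML dictionary targets.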

I suspect an HTML dictionary would be one of the standard ones registered, so I was wondering - is there any publicly-shareable timeline for when such dictionaries may be available?

Thanks!

@Cyan4973
Contributor

Cyan4973 commented Jul 3, 2019

Unfortunately, don't expect it to happen "soon".
To serve your own needs, you are better off today designing your own mechanism (we currently do the same at Facebook).
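Such a private mechanism might look like the following sketch (all names hypothetical; again using stdlib `zlib` in place of zstd): each payload is prefixed with the ID of the dictionary it was compressed with, and the dictionaries themselves are distributed out of band to both client and server.

```python
import hashlib
import zlib

# Hypothetical out-of-band registry: dict_id -> dictionary bytes.
# Both sides must obtain the dictionaries through a separate channel.
DICTS = {}

def register(dictionary: bytes) -> bytes:
    # Content-address the dictionary so IDs are stable across deployments.
    dict_id = hashlib.sha256(dictionary).digest()[:8]
    DICTS[dict_id] = dictionary
    return dict_id

def pack(data: bytes, dict_id: bytes) -> bytes:
    # Prefix the payload with the dictionary ID it was compressed with.
    c = zlib.compressobj(zdict=DICTS[dict_id])
    return dict_id + c.compress(data) + c.flush()

def unpack(payload: bytes) -> bytes:
    dict_id, body = payload[:8], payload[8:]
    d = zlib.decompressobj(zdict=DICTS[dict_id])
    return d.decompress(body) + d.flush()

did = register(b'<html><head><title>')
msg = b'<html><head><title>Hi</title></head></html>'
assert unpack(pack(msg, did)) == msg
```

This is essentially the stewardship burden cldellow described: whoever runs the registry owns dictionary versioning and distribution.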

The topic of generic dictionaries for web content has been discussed and stalled here:
mozilla/standards-positions#105

I still believe it's a good idea, especially as we have been able to measure gains, and they were substantial. But it will require some time to win the argument.

@cldellow
Author

cldellow commented Jul 8, 2019

Thank you for the context + link to that thread! I can see how it's a bit of a political mess. :(

@cldellow cldellow closed this as completed Jul 8, 2019
@andrew-aladev

Hello.

Unfortunately, don't expect it to happen "soon".

It is frustrating. I thought Facebook knew what competition it was jumping into. The developers created an awesome compression library with support for multiple dictionaries. Now Facebook needs to do the kind of epic, continually repeated research that Google did for brotli, and create a registry of web-optimized dictionaries.

In the long run Facebook could beat Google here, because the web mutates and a static brotli dictionary goes stale. But Facebook has stopped and doesn't want to fund such research.

Please let me know if I am wrong. I will be happy to be wrong.

@Cyan4973
Contributor

The limitations do not come from Facebook side.

Actually, Facebook is already able to use dictionary compression over HTTP, though it is constrained to its own private environment (own client, own server). The HTTP ecosystem is very large, and it takes a lot of time to convince all actors that this innovation is a good thing for the web. Expect some progress on this topic in the future (we are actively working on it; it's not abandoned), but at a pace compatible with the size of an ecosystem as gigantic as HTTP.

@andrew-aladev

@Cyan4973, hello. Can you please ask at Facebook for something like a rough estimate of the target integration steps? Thank you.

@Cyan4973
Contributor

This does not depend on Facebook.
More critical actors for such a topic are standards bodies such as the IETF and the W3C.

I would love to have such an estimate; in fact, we are working our way toward one through direct contributions, with the active participation of @felixhandte.

But at this stage, it's too early, and we don't have any yet.

@felixhandte
Contributor

Hi @andrew-aladev,

I am working on this topic. I'm not sure how you came to the conclusion that we had abandoned this direction. (Certainly, my bank account disproves your assertion that Facebook isn't spending money to figure this out.)

What do you want me to tell you? It's a hard problem, both technically (especially w.r.t. security) and in terms of driving consensus and adoption. We welcome (constructive) contributions on either of those fronts.

At any rate, I'll be at IETF 106 to discuss the progress we've made and the plan going forward.

@andrew-aladev

Hi @felixhandte, I have a suggestion for the RFC: please add a special encoding type, zstd-no-dictionary. That would make it possible to integrate it everywhere today (web browsers too). Otherwise it is not possible, because regular zstd would require a dictionary for decompression.

Web browsers with regular zstd support released today won't be able to decompress content from 2025, because it will require dictionaries.

@felixhandte
Contributor

The plan is the opposite: as I described in the caniuse thread, the RFC does not standardize the use of a dictionary. Responses with Content-Encoding: zstd should not use a dictionary. If and when a dictionary-based scheme is standardized for HTTP, it will use a different content-coding identifier.

@andrew-aladev

andrew-aladev commented Oct 14, 2019

Sorry, it is not clear to me; I can't see anything about dictionaries in section 6.2 (Content Encoding) of the RFC. Are you sure about that?

@felixhandte
Contributor

I agree that the text of the RFC is not as clear about this as it could have been. I can look into inserting something in a future version of the document.

I am pretty confident that I can speak authoritatively about Zstd and HTTP. We are not going to ship an extension to the spec that breaks existing clients and servers... That would be pretty obviously stupid. So any standardized way of using dictionaries will require at least one of: a totally different content-coding identifier, or extra negotiation beyond Accept-Encoding: zstd.
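The compatibility point can be demonstrated concretely. In this stdlib `zlib` sketch (standing in for zstd; the sample dictionary and data are made up), a payload compressed against a preset dictionary cannot be decoded by a dictionary-unaware client — which is exactly why serving it under the plain content-coding identifier would break existing clients:

```python
import zlib

dictionary = b'<html><body><div class="content">'
data = b'<html><body><div class="content">hello</div></body></html>'

# Compress against a preset dictionary.
c = zlib.compressobj(zdict=dictionary)
payload = c.compress(data) + c.flush()

# A client that only implements the plain coding cannot decode it.
plain_client_failed = False
try:
    zlib.decompress(payload)
except zlib.error:
    plain_client_failed = True

# A client that negotiated the dictionary out of band can.
d = zlib.decompressobj(zdict=dictionary)
roundtrip = d.decompress(payload) + d.flush()
assert plain_client_failed and roundtrip == data
```

Hence the requirement for either a distinct content-coding identifier or extra negotiation: the dictionary-unaware decoder must never receive a dictionary-compressed payload.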
