366559 - Brotli Accept-Encoding/Content-Encoding

Reporter

Description

18 years ago

Adding LZMA as a transfer-encoding method for HTTP offers the prospect of 30 - 40% greater compression for page downloads, compared to existing gzip compression, and hence could produce significantly increased page-load speed for mobile devices with restricted bandwidth. This would also benefit users in developing countries, where high-speed access is expensive or unavailable: for example, for the version of Firefox in the OLPC laptop.

Example: On three representative web pages downloaded from the web, I got the following results (figures represent the total length of the three files when each is compressed individually):

Uncompressed: 201185 bytes, 100%
gzip: 60911 bytes, 30% of original
lzma: 44111 bytes, 22% of original, a factor of 1.38 less than gzip

Thus, the extra compression offers a 38% increase in download speed relative to gzip encoding, or, to put it another way, could offer a 28% reduction in page download time. Since LZMA, like gzip, is a streaming compression technique, these increases would apply to incremental as well as total page load time.

Advantages:
* LZMA offers a significant speedup for bandwidth-restricted devices, where page load speed is most critical
* Would only affect the HTTP transfer code, would be completely transparent to every other part of the application
* HTTP Accept-transfer-encoding mechanism allows for complete forwards and backwards compatibility with all existing web server and client implementations
* Code is small, won't add much to browser footprint
* Will improve both total and time-to-first-rendering page load times
* Appears to be unencumbered by patent issues (please check!)
* Free software implementations readily available
* To reiterate: 30%+ speedup!

Possible disadvantages (and mitigating factors):
* LZMA is not yet standardized (but open source implementations are widely available, so implement first, then standardize -- perhaps call the method x-mozilla-lzma until it is standardized)
* not yet supported by any webservers (this is a chicken-and-egg situation: support in Firefox will certainly drive adoption: in particular, just adding LZMA compression in Apache would instantly make LZMA available for 50% of the web server installed base)
* memory footprint and CPU overhead need investigating (but implementation cannot make anything worse than status quo, even in the worst case, since neither web servers nor browsers would be forced to use it, so it can be switched off where necessary, and both CPU speed and memory are getting bigger much faster than download speeds for mobile devices -- also, since the CPU is often idle while waiting for download data, this may still increase download speeds even on relatively slow platforms: needs benchmarking)
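The arithmetic in the description can be checked with a short sketch. The byte counts are the reporter's; the function name and dictionary keys are hypothetical, and the "time reduction" figure assumes transfer time is proportional to bytes transferred, which ignores latency and round trips.

```python
def compression_summary(original: int, gzip_size: int, lzma_size: int) -> dict:
    """Derive the ratios quoted in the description from raw byte counts."""
    size_factor = gzip_size / lzma_size  # how many times smaller lzma is than gzip
    return {
        "gzip_fraction": gzip_size / original,
        "lzma_fraction": lzma_size / original,
        "size_factor": size_factor,
        # Assumes download time scales linearly with payload size.
        "time_reduction": 1 - 1 / size_factor,
    }


summary = compression_summary(201185, 60911, 44111)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Under that linear assumption, a factor of 1.38 in size corresponds to a 38% higher effective transfer rate and roughly a 28% shorter download, which is why the two percentages in the description differ.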

Neil Harris

Reporter

Updated

18 years ago

Component: General → Networking: HTTP

OS: Linux → All

Product: Firefox → Core

QA Contact: general → networking.http

Hardware: PC → All

Version: unspecified → Trunk

Neil Harris

Reporter

Comment 1

18 years ago

The reference implementation for LZMA is available under the LGPL. See http://www.7-zip.org/sdk.html

The busybox code appears to have a GPLv2 implementation of an LZMA decoder. It's surprisingly small: about 500 lines of C. See http://www.busybox.net/cgi-bin/viewcvs.cgi/trunk/busybox/archival/libunarchive/decompress_unlzma.c?rev=17164&view=markup

This seems almost too small. Perhaps I've missed something and this isn't the whole thing?

Neil Harris

Reporter

Comment 2

18 years ago

After a bit more code inspection: no, the busybox code does not seem to use any symbols it does not define, except for things like reading/writing/malloc/free/error handling, and some simple macros.

Neil Harris

Reporter

Comment 3

18 years ago

There seems to be another LGPL'd implementation at http://tukaani.org/lzma/

Jesse Ruderman

18 years ago

Yet more: it looks like the RTT of TCP connections can be determined using getsockopt(fd,SOL_TCP, TCP_INFO, ...) and looking in the tcpi_rtt field in the returned structure. This could be used to flag long-latency connections at either the web server or client end, allowing the LZMA capability to be turned on or off at either end on a per-connection basis, depending on whether it would give an anticipated advantage.
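A minimal sketch of the RTT lookup described above, written in Python rather than C. It assumes the Linux `struct tcp_info` layout, in which `tcpi_rtt` (in microseconds) is the 16th 32-bit field after an 8-byte header; the helper names, the 104-byte buffer size, and the 100 ms threshold are all hypothetical choices for illustration, not something proposed in this bug.

```python
import socket
import struct

# Assumption: Linux struct tcp_info layout. Eight bytes of small fields come
# first, then 32-bit counters; tcpi_rtt is the 16th of those counters.
TCPI_RTT_OFFSET = 8 + 15 * 4


def rtt_from_tcp_info(raw: bytes) -> int:
    """Extract tcpi_rtt (smoothed RTT, microseconds) from a raw tcp_info buffer."""
    if len(raw) < TCPI_RTT_OFFSET + 4:
        raise ValueError("tcp_info buffer too short")
    (rtt_us,) = struct.unpack_from("=I", raw, TCPI_RTT_OFFSET)
    return rtt_us


def socket_rtt_us(sock: socket.socket) -> int:
    """Ask the kernel for a connected TCP socket's RTT (Linux only)."""
    raw = sock.getsockopt(socket.IPPROTO_TCP, socket.TCP_INFO, 104)
    return rtt_from_tcp_info(raw)


def should_offer_lzma(rtt_us: int, threshold_us: int = 100_000) -> bool:
    """Hypothetical policy: only offer LZMA on long-latency connections."""
    return rtt_us >= threshold_us
```

On a connected socket, `should_offer_lzma(socket_rtt_us(sock))` would give the per-connection flag the comment describes; where the threshold should sit would need the benchmarking mentioned in the description.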

Michael Buckley

Comment 14

12 years ago

Google's QUIC protocol is another sign that concentrating on the smallest possible data payload is the right way to achieve speed, by reducing the number of packets and round trips: they appear to be going as far as using FEC to reduce the round trips needed to recover from packet loss. This would be entirely compatible with initiatives such as QUIC and SPDY, as far as I can see.

Luke Deller

12 years ago

could definitely be interesting. should add it to the accept-encoding header (when pref'd on). does the library exist (and was your build patch tested with) for all of windows, linux, os x?

Luke Deller

Comment 30

12 years ago

Attached patch: patch with adjusted diff settings (Part 1: code) (obsolete)

adjusting hg diff settings as per Justin's comment

Attachment #733317 - Attachment is obsolete: true

Luke Deller

Comment 31

12 years ago

Updated patches:
- tested on both Windows (MSVC10) and Linux (Mint 14/amd64/gcc-4.7.2)
- by default uses the XZ Embedded library imported into the mozilla source tree, but if configured with --with-system-lzma then it will use the system's liblzma library instead
- adds two new prefs: "converter.xz.disabled" and "converter.xz.memory_limit_mb"

Luke Deller

Comment 40

12 years ago

Attached patch: patch (part 3: test) (obsolete)

add a unit test (based on the existing test for Content-Encoding: gzip)

Luke Deller

Updated

12 years ago

Attachment #737131 - Flags: feedback?(mcmanus)

Jason Duell

12 years ago

Comment on attachment 737130: patch (part 0: import XZ embedded)

It's a fairly short import - that's good :)

you'll want to get gerv involved for licensing and maybe ted too to decide on where it should be from a build perspective.

it would be good if you could measure how many bytes this adds to an opt windows build... both for the download and code size.

Attachment #737130 - Flags: feedback+

Patrick McManus [:mcmanus]

Assignee

Comment 45

12 years ago

Comment on attachment 737131: patch (part 1: code)

splinter won't let me provide inline comments to this patch, so I apologize for the confusing format of this patch feedback.

CreateNewHTTPXzConvFactory() - I know that's copy and pasted, but we can fix some stuff up here.. use NS_ENSURE_ARGS instead of checking !aResult explicitly.. you've also got trailing whitespace in here. also "inst" can be managed with a nsRefPtr instead of as plain pointer.. that will help you plug the leak you've got right now in the QI-fails error path. (just replace RELEASE(inst) with inst.forget())

you change nsNetStartup to fail if nsHttpXzConv::Init() fails. Can we be more robust here? I'd rather networking could continue without xz :)

For the new files nsHttpXzConv.[cpp/h] I would prefer
* 2 space indents
* sorted include directives (unless something has a dep)
* name the file Foo instead of nsFoo. Same thing for the classes
* put everything in the namespace mozilla::net
(you used old code as a template and the current conventions have changed without updating all the old code)

can you make sure you need all those includes?

you do an hg cp to create nsHttpXzConv.h - don't do that in this case. Just use regular cp and hg add it.

I don't care to support system LZMA (MOZ_NATIVE_LZMA) if we have imported this code into the tree - that's too confusing and hurts testing by increasing diversity. (it also hurts rapid release). Just use the one embedded and drop the macros and build support for system versions.

ironically mInitialised is uninitialised :)

I'm trying to understand why you have the mLazyInitLock and relatedly why the prefobserver is defined to be THREADSAFE_ISUPPORTS (instead of regular ISUPPORTS). When can this code be called from a non-main thread?

Your handling of the disabled pref doesn't seem sufficient. We'll just throw errors. We need to remove it from the Accept header request line when it's disabled.

I think you leak smPrefObserver. (which should be named sPrefObserver)

put the nsHttpConv::Init() method after the ctor and dtor

make mListener a nsCOMPtr and then you don't need to init, addref and release it manually

You don't do anything with mSyncConvContext except hold a reference to it. I think it's fine to delete that var entirely.

mInpBuffer and mOutBuffer can be converted to nsAutoArrayPtr<unsigned char> to simplify their management

I hate to make comments about syntax, but instead of putting "if (foo)" and "{" on separate lines you should do "if (foo) {" (one line) to stay consistent with new necko code.. there is a lot of this in ondataavailable()

the coding guidelines also say to not do "if (foo == NULL)" just say "if (!foo)".. also if you do need to say NULL, say nullptr instead.

use c++ casts instead of (unsigned char *)

I would probably set a minimum size for the first allocation instead of letting streamlen be the floor.. that's because you try and reuse that same buffer in future ODA calls. 8KB maybe.

not checking the return value of iStr->Read() is surely a mistake

"(uint64_t)smPrefObserver->mXzMemoryLimitMB*1048576;" needs whitespace

use std::min instead of XPCOM_MIN

I don't understand why the handling of LZMA_BUF_ERROR is the same as LZMA_OK

instead of __nsHTTPXzConv__h__ please use mozilla_net_HttpXzConv_h

Attachment #737131 - Flags: feedback?(mcmanus) → feedback+

Patrick McManus [:mcmanus]

Assignee

Comment 46

12 years ago

Comment on attachment 737132: patch (part 2: build)

you need somebody like ted to help with this patch.. I think a compile disable switch is good - but we shouldn't support a system copy of the library in that case.

::: modules/libpref/src/init/all.js
@@ +927,5 @@
>
> // Enable http compression: comment this out in case of problems with 1.1
> // NOTE: support for "compress" has been disabled per bug 196406.
> // NOTE: separate values with comma+space (", "): see bug 576033
> +pref("network.http.accept-encoding", "xz, gzip, deflate");

this change should be part of the streamconv patch not the build system one

also, make xz last in the list.

::: netwerk/streamconv/converters/Makefile.in
@@ +53,5 @@
> include $(topsrcdir)/config/rules.mk
>
> DEFINES += -DIMPL_NS_NET
> +
> +CFLAGS += -O0

I'm guessing this isn't the right place to do this

Patrick McManus [:mcmanus]

Assignee

Comment 47

12 years ago

Comment on attachment 737233: patch (part 3: test)

I really appreciate the test.

don't do the hg cp thing here.

it would be good if the test set the prefs you rely on explicitly.

Attachment #737233 - Flags: feedback+

Luke Deller

12 years ago

(In reply to Luke Deller from comment #49)
> constructed, which I expect can happen in parallel if there are concurrent
> HTTP requests both requiring xz decoding.

all the decode happens serialized on the main thread. (the requests can be interleaved on the main thread, but the decoder won't actually run in parallel). So you can just MOZ_ASSERT(NS_IsMainThread())

> > Your handling of the disabled pref doesn't seem sufficient. We'll just throw
> > errors. We need to remove it from the Accept header request line when its
> > disabled.
>
> There was already a separate pref to remove it from the Accept header
> request line: network.http.accept-encoding.

Right.. I think if the disabled pref is set that it should scrub the accept-encoding line. I don't actually care for the fact that we expose the accept header as a pref at all but it's kind of too late to change that (addons, etc..) given it's not a huge problem.

> (I would have preferred this pref to be able to remove the xz filter from
> the stream converter service entirely, but looking at
> netwerk/streamconv/src/nsStreamConverterService.cpp it seemed that there is
> not a mechanism to do that currently)

you'll need to do it over in httphandler.
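The pref interaction discussed here (scrub "xz" from the accept-encoding value when the disabled pref is set, otherwise list it last) can be sketched as a plain string transformation. This is an illustration in Python, not the Gecko code: the function name is hypothetical, and it assumes a simple comma-separated token list without q-values.

```python
def update_accept_encoding(value: str, xz_disabled: bool) -> str:
    """Return the accept-encoding value with "xz" removed or moved to the end."""
    # Split on commas and drop empty tokens and surrounding whitespace.
    tokens = [token.strip() for token in value.split(",") if token.strip()]
    # Always remove any existing "xz" so it is never duplicated.
    tokens = [token for token in tokens if token.lower() != "xz"]
    if not xz_disabled:
        tokens.append("xz")  # reviewer's request: xz goes last in the list
    # Values are joined with comma+space, as the pref comment requires.
    return ", ".join(tokens)


print(update_accept_encoding("xz, gzip, deflate", xz_disabled=True))
print(update_accept_encoding("gzip, deflate", xz_disabled=False))
```

Removing the token before conditionally re-adding it keeps the function idempotent, so it could be called on every pref change without accumulating duplicates.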

Luke Deller

Comment 52

12 years ago

Attached patch: patch (part 0: import XZ embedded) (obsolete)

minor adjustment to xz_config.h for MSVC boolean from upstream

Attachment #737130 - Attachment is obsolete: true

Luke Deller

Comment 53

12 years ago

Attached patch: patch (part 1: code) (obsolete)

Updated to address feedback. Initialisation needed to be reworked: updating the "network.http.accept-encoding" pref in response to changes in the "converter.xz.disabled" pref requires our pref observer to be registered at app startup rather than just the first time someone uses xz encoding. I put an initialisation call in nsHttpHandler::Init; is that appropriate?

Attachment #737131 - Attachment is obsolete: true

Attachment #754472 - Flags: review?(mcmanus)

Luke Deller

Comment 54

12 years ago

Attached patch: patch (part 2: build) (obsolete)

removed support for system liblzma as per feedback

Attachment #754475 - Flags: review?(ted)

Luke Deller

Updated

12 years ago

Attachment #737132 - Attachment is obsolete: true

Luke Deller

Comment 55

12 years ago

Attached patch: patch (part 3: test) (obsolete)

address feedback: don't do hg cp; explicitly set prefs

Attachment #737233 - Attachment is obsolete: true

Attachment #754476 - Flags: review?(mcmanus)

Luke Deller

Updated

12 years ago

Attachment #754466 - Flags: review?(gerv)

Gervase Markham [:gerv]

Comment 56

12 years ago

Comment on attachment 754466: patch (part 0: import XZ embedded)

r=gerv; no additional action needed for code which has been placed in the public domain/given universal permissions using an appropriate notice, which this is.

Gerv

Attachment #754466 - Flags: review?(gerv) → review+

Neil Harris

Reporter

Comment 57

12 years ago

(In reply to Patrick McManus [:mcmanus] from comment #43)
> (In reply to Neil Harris from comment #0)
> > * To reiterate: 30%+ speedup!
>
> a 30% reduction in bytes is not a 30% speedup. Also, to be fair, this only
> applies to resources that are compressible with this scheme and the bulk of
> a page's byte count is in images which are not applicable here. Indeed less
> than 30% of the page is generally text to start with.

Agreed -- the majority of bytes associated with a page are usually compressed images and fonts, and better text compression won't help with these.

My point here is that the arrival of the initial HTML page content, and to a lesser extent stylesheets and javascript, is the critical path for the first flash of page content on a new page, which is one of the most salient features for user perception of site responsiveness, and these resources are all compressible by LZMA: already-compressed fonts, images and so on can be loaded lazily.

Better compression of text resources will be even more effective when moving from one page to another on a site where fonts, stylesheets, scripts, site-wide images etc. are already in the cache, and the HTML content and per-page images are the only things that change from page to page.

LZMA transfer may also potentially be able to speed up loading of text-encoded dynamic content like JSON resources.

Arco Santosini

Comment 58

12 years ago

I mentioned this patch in wikipedia (http://en.wikipedia.org/wiki/HTTP_compression#Content-coding_tokens):

lzma[citation needed] - Firefox and Gecko will be supporting LZMA compression, this is particularly interesting for smartphones and tablets where bandwidth is limited: LZMA has a very high compression ratio compared to gzip (patch discussed in [1]).

I think it is important to make this patch widely known so that http servers and other browsers might find interest in supporting this compression algorithm.

Justin Lebar (not reading bugmail)

Comment 59

12 years ago

> Firefox and Gecko will be supporting LZMA compression

Although I'm excited about LZMA, I don't think we have committed to anything at this point.

Justin Lebar (not reading bugmail)

Comment 60

12 years ago

I'd edit the wikipedia article myself, but I think that would count as original research. :)

Patrick McManus [:mcmanus]

Assignee

Comment 61

12 years ago

(In reply to Arco Santosini from comment #58)
> I mentioned this patch in wikipedia
> (http://en.wikipedia.org/wiki/HTTP_compression#Content-coding_tokens):
> lzma[citation needed] - Firefox and Gecko will be supporting LZMA

you really shouldn't say that until it is landed.

Luke Deller

12 years ago

(In reply to Ted Mielczarek [:ted.mielczarek] from comment #63)

Thanks for looking at this Ted

> There isn't a system libxz available on Linux, right?

Not for "XZ Embedded" - its README explicitly warns against making a shared library, as it does not commit to any API/ABI stability across versions.

There is a system liblzma available on Linux that we could use there. It is part of the "XZ Utils" source package that also provides the "xz" command line utility. This is from the same upstream maintainer as "XZ Embedded".

The earlier version of my patch had a --with-system-lzma configure option to use this, and some #define/#ifdefs in the source to gloss over slight differences in the API between liblzma and XZ Embedded. Patrick's feedback was to remove this:

(Patrick McManus from comment #45)
> I don't care to support system LZMA (MOZ_NATIVE_LZMA) if we have imported this
> code into the tree - that's too confusing and hurts testing by increasing
> diversity. (it also hurts rapid release). Just use the one embedded and drop
> the macros and build support for system versions.

(not currently active) Ted Mielczarek

Comment 66

12 years ago

Okay, thanks for the info. This is just the sort of thing that Linux distro maintainers will inevitably ask for. If we're committing to only supporting XZ embedded then it doesn't sound like an issue.

Luke Deller

Comment 67

12 years ago

12 years ago

Attachment #754472 - Flags: review?(mcmanus) → review-

Luke Deller

Comment 71

12 years ago

Thanks for getting to this Patrick. I have been on vacation for a couple of weeks, back now and looking at addressing your comments. On a couple of your points:

(In reply to Patrick McManus [:mcmanus] from comment #70)
> More substantively, to move forward with this we need some kind of partner
> on the server side.. it doesn't need to be a big thing - it can just be a
> patch included in some other open source project... or maybe even a pledge
> to do so. I suggested before that you approach the mod_pagespeed folks and
> gauge their interest - have you done that or something else? I don't want to
> include code that doesn't have a dance partner and can't find one either.

Does Apache's existing content negotiation feature suffice for this? This is what I have been using for testing. I put some notes on how to set this up on the wiki page mentioned above: https://wiki.mozilla.org/LZMA2_Compression

I was a bit shy of approaching the mod_pagespeed people as I wasn't sure whether this functionality would fit better into mod_deflate (or maybe a new module similar to mod_deflate). I can pursue dynamic compression further if you think it is worthwhile at this stage.

> @@ +175,5 @@
> > +{
> > +  if (!sPrefObserver) {
> > +    sPrefObserver = new HTTPXzConvPrefObserver();
> > +  }
> > +  nsCOMPtr<nsIObserver> prefObserver = sPrefObserver; // this adds a ref
>
> can't you just get rid of the global sPrefObserver and replace it with
> mPrefObserver = new HTTPXzConvPrefObserver(); and then use mPrefObserver in
> its place.. that also lets you get rid of the already_AddRefed return value
> and the forget()

We need this preference observer to remain alive throughout the lifetime of the application, so that any changes to the disabled/enabled pref will result in the accept-encoding pref being updated. The observer will be constructed when the static method containing this code (HTTPXzConv::InitPrefObserver) is called from nsHttpHandler.cpp at startup, and it is destroyed when the preference service releases its reference to it at shutdown.

The member variable mPrefObserver in the HTTPXzConv class is just me being defensive: the HTTPXzConv instance holds a reference to the observer so that it cannot be destroyed while the converter is alive. Would you rather drop mPrefObserver? Otherwise do you think this approach is reasonable?

Alex Xu

Comment 72

12 years ago

Attached patch: patch (part 1: code) + whitespace fixes (obsolete)

I believe I have fixed all of the whitespace issues. I understand that in Mozilla culture, patches are usually not changed by people other than the original submitter. However, I'd like to poke this into getting moving again. :) I hope I have not offended.

Attachment #791070 - Flags: feedback?(mcmanus)

Attachment #791070 - Flags: feedback?(ldeller)

Alex Xu

Updated

12 years ago

Attachment #791070 - Attachment description: 366559-code.patch → patch (part 1: code) + whitespace fixes

Attachment #791070 - Attachment filename: 366559-code.patch → 366559-code-ws.patch

Jonathan Kew [:jfkthame]

Comment 73

12 years ago

Before we decide to press ahead and deploy this, I think we should also consider the new Brotli format under development by Google's data-compression team. See my post in dev.platform for further information.[1] ISTM that Brotli might be a better choice overall, given the substantially faster decompression rates it offers. [1] https://groups.google.com/forum/#!topic/mozilla.dev.platform/CBhSPWs3HS8

(dormant account)

Comment 74

12 years ago

11 years ago

Comment on attachment 8359769: patch (part 2: build)

::: modules/xz-embedded/moz.build
@@ +4,5 @@
> +# License, v. 2.0. If a copy of the MPL was not distributed with this
> +# file, You can obtain one at http://mozilla.org/MPL/2.0/.
> +
> +EXPORTS += [
> +    'xz.h',

As I said before, if you're only using this header in one place it might be better to just use LOCAL_INCLUDES in that place instead.

Attachment #8359769 - Flags: review?(ted) → review+

Patrick McManus [:mcmanus]

Assignee

Comment 80

11 years ago

to move forward here we're going to need some kind of server side support that indicates someone really wants to run this on the web. Being able to configure apache to do so, but having no real evidence of anyone willing to do so isn't really good enough to add a new format to the web. We'll want to be able to evaluate the results. I'm going to clear the review flags until there is evidence of that. But don't take this the wrong way - I'd like to give it a try.

Patrick McManus [:mcmanus]

Assignee

Updated

11 years ago

Attachment #8359763 - Flags: review?(mcmanus)

Patrick McManus [:mcmanus]

Assignee

Updated

11 years ago

Attachment #8359780 - Flags: review?(mcmanus)

Luke Deller

Comment 81

11 years ago

(In reply to Ted Mielczarek [:ted.mielczarek] from comment #79)
> As I said before, if you're only using this header in one place it might be
> better to just use LOCAL_INCLUDES in that place instead.

Oh I meant to run this by you: I found that I would need to add LOCAL_INCLUDES to two different moz.build files so does your preference remain the same in this case?

(xz.h is included into HTTPXzConv.h which is included into both netwerk/streamconv/converters/HTTPXzConv.cpp and netwerk/protocol/http/nsHttpHandler.cpp, so I would need to add LOCAL_INCLUDES to both netwerk/streamconv/converters/moz.build and netwerk/protocol/http/moz.build)

Luke Deller

Comment 82

11 years ago

(In reply to Patrick McManus [:mcmanus] from comment #80)
> to move forward here we're going to need some kind of server side support
> that indicates someone really wants to run this on the web. Being able to
> configure apache to do so, but having no real evidence of anyone willing to
> do so isn't really good enough to add a new format to the web. We'll want to
> be able to evaluate the results.

Ok - I had been hoping that the existing support for this in Apache via content negotiation would suffice initially, but wasn't sure of how you felt about that, so thanks for clarifying and I will pursue something more as you suggest.

(not currently active) Ted Mielczarek

Comment 83

11 years ago

(In reply to Luke Deller from comment #81)
> Oh I meant to run this by you: I found that I would need to add
> LOCAL_INCLUDES to two different moz.build files so does your preference
> remain the same in this case?

I guess do whatever's easiest. Traditionally EXPORTS was "public headers", but that distinction is fuzzy nowadays.

Patrick McManus [:mcmanus]

Assignee

Updated

11 years ago

Attachment #791070 - Flags: feedback?(mcmanus)

James Willcox (:snorp) (jwillcox@mozilla.com) (he/him)

Comment 84

11 years ago

Patrick, can we get the ball rolling on this again? I would like to use this with the proxy server. Luke, are you interested in working on this again? The patches mostly seem to apply and work. I just needed a couple tweaks in SpdyStream so that we actually send 'accept-encoding' headers. In SPDY, it's assumed you have gzip and deflate available.

Flags: needinfo?(mcmanus)

Flags: needinfo?(ldeller)

James Willcox (:snorp) (jwillcox@mozilla.com) (he/him)

Comment 85

11 years ago

A git branch with the relevant patches applied on top of a recent gecko is here: https://github.com/snorp/gecko-dev/tree/xz

Patrick McManus [:mcmanus]

Assignee

Comment 86

11 years ago

                 gzip     lzma (% smaller than gzip)   brotli (% smaller than gzip)
Voxatron:        396k     309k (28%)                   343k (13%)
Osmos:           654k     481k (26%)                   542k (17%)
Zenbound2:       757k     548k (27%)                   609k (20%)
Democracy3:      892k     617k (30%)                   703k (21%)
FTL:             1426k    978k (31%)                   1128k (20%)
Dustforce DX:    1560k    948k (39%)                   1126k (28%)
Super Hexagon:   1562k    1081k (30%)                  1261k (19%)
AaaaaaaAAAaaa:   4952k    3076k (37%)                  3719k (25%)
Jack Lumber:     5520k    3352k (39%)                  4051k (26%)

Regarding decompression speed: if it happens on a background thread in a pipelined fashion (it does right?) AND if we assume more than one core (which, based on http://stats.unity3d.com/web/cpu.html, is a pretty reasonable assumption these days, at least for some application domains like gaming) (AND we ignore power), then it seems like all that matters is that lzma can decompress faster than the network can download. I don't have any data on in-memory decompression speed of lzma; do you?

Also: do you know if the encode speed of the brotli code you linked to is representative? I ask because encoding is massively slower than lzma. Like, for AaaaaAAaaa and Jack Lumber, it took ~2m20s whereas xz took ~12s (this is after I changed the makefile to build with -O3).
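One way to get rough in-memory decompression numbers is a small timing harness like the sketch below. It uses Python's standard `zlib` (the deflate algorithm underlying gzip) and `lzma` modules on synthetic JavaScript-like text; the sample data, the helper name, and the repeat count are assumptions for illustration, and the results will not match a native C++ decoder or a real asset corpus.

```python
import lzma
import time
import zlib


def decompress_throughput(decompress, blob: bytes, raw_len: int, repeats: int = 5) -> float:
    """Return the best observed rate in MB/s of uncompressed output."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        out = decompress(blob)
        elapsed = time.perf_counter() - start
        if len(out) != raw_len:
            raise ValueError("round trip produced the wrong length")
        best = min(best, elapsed)
    # Guard against a zero reading from a coarse timer.
    return raw_len / max(best, 1e-9) / 1e6


# Synthetic, highly repetitive sample; real pages compress differently.
sample = b"".join(b"function f%d(x) { return x + %d; }\n" % (i, i) for i in range(50000))
gz_blob = zlib.compress(sample, 9)
xz_blob = lzma.compress(sample)

results = {
    "zlib": decompress_throughput(zlib.decompress, gz_blob, len(sample)),
    "lzma": decompress_throughput(lzma.decompress, xz_blob, len(sample)),
}
for name, rate in results.items():
    print(f"{name}: {rate:.1f} MB/s of output")
```

Measuring against the uncompressed size makes the figure directly comparable with a network rate once it is divided by the compression ratio.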

Patrick McManus [:mcmanus]

Assignee

Comment 105

11 years ago

I continue to be open to the prospect of adding a non-deflate based decoder and the negotiation for it as part of Accept-Encoding. I've said upthread a few times that what we need is a content provider willing to make a real use case available for negotiation that way - then we can see how it works out. Without a dance partner this is just a lot of buffer management code in the tree and that's never a good thing :) I don't have my heart set on either lzma or brotli (and I reserve the right to set my heart later :)) but in either event someone would probably have to put an unbitrotted patch set forward.

Jyrki Alakuijala

Comment 106

11 years ago

Numbers for decompression speed for a corpus of asm.js files with C++ on a modern high-end desktop cpu (E5-1650 v2 @ 3.50GHz):

gzip 370.15 MB/s, compression density 4.745x
brotli 348.21 MB/s, compression density 6.264x
lzma 120.59 MB/s, compression density 7.309x

Decompression speeds are measured from the uncompressed size.

Some fuzzy opinions around the topic:

Handheld devices have lesser cpus. To approximate this, we can look at a 2005 2.4 GHz AMD CPU that showed 65-83 MB/s for gzip decompression and 20-33 MB/s for LZMA. http://tukaani.org/lzma/benchmarks.html

A decompression speed of 20 MB/s with a 7x compression ratio corresponds to a network speed of 23 Mbit/s. If one has a faster network connection, then decompression will slow it down to an effective 23 Mbit/s. This is even more annoying if the file is coming from a local cache or a near-by network proxy -- or the local cache is forced to hold an uncompressed copy.

These compression density numbers for asm.js are not representative for html docs; there brotli and lzma are more evenly matched for compression density, but brotli still decompresses faster. In a multi-lingual reference corpus that I use for html, brotli is 2 % more dense than lzma and 17 % more dense than gzip. Minified JavaScript benefits less from improved compression, only about 10 % with Brotli vs gzip and 11 % with LZMA vs gzip, but there one still has to pay the full cost of slower decompression with LZMA.
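The 23 Mbit/s figure follows from dividing the decompression rate (measured in uncompressed bytes) by the compression ratio and converting bytes to bits. A minimal sketch of that arithmetic, with a hypothetical function name:

```python
def max_effective_network_mbit(decompress_mb_per_s: float, density: float) -> float:
    """Network rate (Mbit/s) at which the decoder becomes the bottleneck.

    decompress_mb_per_s is measured in uncompressed output, so the decoder
    consumes compressed input at decompress_mb_per_s / density.
    """
    compressed_mb_per_s = decompress_mb_per_s / density
    return compressed_mb_per_s * 8  # megabytes to megabits


print(round(max_effective_network_mbit(20, 7), 1))  # 22.9
```

Any connection faster than this delivers compressed bytes more quickly than the decoder can consume them, which is the slowdown the comment describes.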

Boyan Ilianov

Comment 107

11 years ago

Hey guys can you try out using this implementation? https://github.com/nmrugg/LZMA-JS

Yury Delendik (:yury)

Updated

10 years ago

Comment 108

10 years ago

(In reply to Boyan Ilianov from comment #107)
> Hey guys can you try out using this implementation?
> https://github.com/nmrugg/LZMA-JS

I created some tests, see: https://github.com/pypyjs/pypyjs.github.io/issues/4#issuecomment-111549828

Jyrki Alakuijala

Comment 109

10 years ago

I want to share an update about the performance numbers on the asm.js corpus as we have improved brotli's decompression speed and compression density. Gzip is here with a 15 bit window, while lzma, lzham and brotli are applied with a 24 bit window, each algorithm with its highest quality setting.

gzip 370.15 MB/s, compression density 4.745x
brotli 392.73 MB/s, compression density 7.071x
lzma 120.59 MB/s, compression density 7.284x
lzham 242.14 MB/s, compression density 7.008

For this corpus brotli compresses 3 % less than lzma, 1 % more than lzham, and 33 % more densely than gzip.
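The closing percentages appear to compare compressed output sizes rather than density ratios; that reading is an assumption, but it reproduces all three figures from the densities listed. A short sketch, with a hypothetical helper name:

```python
# Compression densities (uncompressed size / compressed size) from the comment.
densities = {"gzip": 4.745, "brotli": 7.071, "lzma": 7.284, "lzham": 7.008}


def output_size_change(density: float, baseline_density: float) -> float:
    """Fractional change in compressed size versus a baseline (negative = smaller)."""
    # Compressed size is proportional to 1 / density.
    return baseline_density / density - 1


for other in ("gzip", "lzma", "lzham"):
    change = output_size_change(densities["brotli"], densities[other])
    print(f"brotli output vs {other}: {change * 100:+.1f}%")
```

Read this way, brotli's output is about a third smaller than gzip's, about 3% larger than lzma's, and about 1% smaller than lzham's.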

Patrick McManus [:mcmanus]

Assignee

10 years ago

(In reply to Jyrki Alakuijala from comment #109)
> I want to share an update about the performance numbers on the asm.js corpus
> as we have improved brotli's decompression speed and compression density.
> Gzip is here with a 15 bit window, while lzma, lzham and brotli are applied
> with a 24 bit window, each algorithm with its highest quality setting.
>
> gzip 370.15 MB/s, compression density 4.745x
> brotli 392.73 MB/s, compression density 7.071x
> lzma 120.59 MB/s, compression density 7.284x
> lzham 242.14 MB/s, compression density 7.008
>
> For this corpus brotli compresses 3 % less than lzma, 1 % more than lzham,
> and 33 % more densely than gzip.

Since the low decompression speed of LZMA has been the principal objection to implementing this, I think this is a game changer. It's clear from the above that Brotli now looks like being the superior algorithm, as it has near-LZMA compression density at near-gzip decompression speed. So let's go for Brotli!

alpha_one_x86

Comment 112

10 years ago

It's not a problem for static files, where the compressed output is cached. I live in Bolivia, where a private server is hosted on 50KB/s upload. There the compression speed only needs to be greater than 1MB/s to saturate the inter-city connection...

Neil Harris

Reporter

Comment 113

•

10 years ago

I should just clarify my position: I'm _not_ saying we shouldn't deploy LZMA in favour of Brotli; I think we should [experimentally at first] deploy both, and trial this in the field with real-world webservers to find out the scale of the real speed advantages, which I think will be significant in many cases, particularly in the common cases of mobile or developing countries where RTTs are high and available bandwidth is low.

Neil Harris

Reporter

Updated

•

10 years ago

Summary: Firefox/Gecko should support LZMA or Brotli as an HTTP transfer-encoding method → Firefox/Gecko should support LZMA and/or Brotli as an HTTP transfer-encoding method

Neil Harris

Reporter

Comment 114

•

10 years ago

One final (for now) aside: we should also look at the possibility of supporting SDCH (see https://engineering.linkedin.com/shared-dictionary-compression-http-linkedin ) which potentially offers even higher degrees of compression, but at the cost of a lot of added complexity and inter-relationships within the web platform, and also some possible security risks if weak hashes were used. But since implementing SDCH would be a radically different issue from the one in this bug, this isn't the place to discuss it any further; I will look at filing a separate bug for SDCH support.

alpha_one_x86

Comment 115

•

10 years ago

I will add brotli to: http://catchchallenger.first-world.info/wiki/Quick_Benchmark:_Gzip_vs_Bzip2_vs_LZMA_vs_XZ_vs_LZ4_vs_LZO
But there is clearly a lack of benchmarks/tests on the internet.

Jyrki Alakuijala

Comment 116

•

10 years ago

#115 Quick_Benchmark is about compressing a single 445 MB file, while the average web html document is 5000 times smaller. If you want to compare with an html file, you can look at the compression density for 'cp.html' in the Canterbury corpus: https://quixdb.github.io/squash-benchmark/

The '3 % less compression density than lzma' results I showed in #109 were for an asm.js corpus. For a multi-lingual html corpus with 93 languages brotli compresses 11 % more densely than lzma. Brotli needs less warm-up time and is relatively more efficient for small (roughly 1 MB or less) files.

#114 Brotli's current encoder and decoder implementations have an interface for the custom dictionaries in SDCH (sandwich), so later we could serve brotli sandwich, too.
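To compare codecs on document-sized inputs rather than on one huge file, a small harness along these lines could be used. It is only a sketch: it uses zlib and lzma from the Python standard library (brotli is not in the standard library, so it is left out), and the sample HTML string is a made-up placeholder, not data from this bug. No results are claimed here; real measurements need a real corpus.

```python
import lzma
import zlib
from typing import Callable


def compression_density(data: bytes, compress: Callable[[bytes], bytes]) -> float:
    """Return original size divided by compressed size for one input."""
    compressed = compress(data)
    return len(data) / len(compressed)


# Hypothetical stand-in for a small HTML document.
SAMPLE_HTML = b"<p>hello world</p>\n" * 500

CODECS = {
    "zlib (level 9)": lambda d: zlib.compress(d, 9),
    "lzma (xz container)": lzma.compress,
}

if __name__ == "__main__":
    for name, codec in CODECS.items():
        print(f"{name}: {compression_density(SAMPLE_HTML, codec):.2f}x")
```

Container overhead (headers, checksums) is a fixed cost per file, which is one reason results on small files can differ from results on a 445 MB input.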

Chris Peterson [:cpeterson]

Updated

•

10 years ago

Keywords: feature

Patrick McManus [:mcmanus]

Assignee

•

10 years ago

Summary: Firefox/Gecko should support LZMA and/or Brotli as an HTTP transfer-encoding method → Brotli Accept-Encoding/Content-Encoding

Jonathan Kew [:jfkthame]

Comment 129

•

•

10 years ago

Attachment #8663699 - Flags: review?(daniel) → review+

alpha_one_x86

Comment 134

•

10 years ago

For one sample of my typical content, brotli is 25% bigger than lzma.

Daniel Stenberg [:bagder]

Comment 135

•

10 years ago

Comment on attachment 8663700 [details] [diff] [review]
patch 6, support different content encodings for http vs https

Review of attachment 8663700 [details] [diff] [review]:
-----------------------------------------------------------------

::: netwerk/protocol/http/nsHttpHandler.cpp
@@ +1164,5 @@
>         nsXPIDLCString acceptEncodings;
>         rv = prefs->GetCharPref(HTTP_PREF("accept-encoding"),
>                                   getter_Copies(acceptEncodings));
>         if (NS_SUCCEEDED(rv))
> +            SetAcceptEncodings(acceptEncodings, false);

Opportunity to add braces for the conditional expression?

Attachment #8663700 - Flags: review?(daniel) → review+
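The idea behind patch 6, advertising a different Accept-Encoding list depending on whether the request is secure, can be sketched as follows. This is not the Gecko code: the function name and the two default strings are assumptions for illustration (the secure list uses the 'br' token agreed on later in this bug), and the real values come from preferences.

```python
from urllib.parse import urlsplit

# Assumed defaults for illustration only; the actual values are read from prefs.
ACCEPT_ENCODING_PLAIN = "gzip, deflate"
ACCEPT_ENCODING_SECURE = "gzip, deflate, br"


def accept_encoding_for(url: str) -> str:
    """Pick the Accept-Encoding request header value based on the URL scheme."""
    scheme = urlsplit(url).scheme.lower()
    # Only secure requests advertise brotli, to stay clear of middleboxes
    # that might mishandle an unknown content coding on plain http.
    if scheme == "https":
        return ACCEPT_ENCODING_SECURE
    return ACCEPT_ENCODING_PLAIN


if __name__ == "__main__":
    for url in ("http://example.com/", "https://example.com/"):
        print(url, "->", accept_encoding_for(url))
```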

Chris Bentzel

Comment 136

•

10 years ago

You likely already know this, but we also plan to support Brotli for Content-Encoding in Chromium. We plan to restrict it to https like your plans to avoid many of the middlebox problems encountered when rolling out SDCH-over-HTTP a few years back. For fuzzing - we also want to step up fuzzing on Chromium side especially since the current use of Brotli-for-WOFF is in a sandbox and the Content-Encoding case will not be. Always good to have multiple fuzzers (especially since it would feed into shared brotli code base) but wondering if there's a way to combine efforts here.

Daniel Stenberg [:bagder]

Comment 137

•

10 years ago

Comment on attachment 8663701 [details] [diff] [review]
patch 7, content-encoding brotli for https

Review of attachment 8663701 [details] [diff] [review]:
-----------------------------------------------------------------

::: netwerk/streamconv/converters/nsHTTPCompressConv.cpp
@@ +156,5 @@
> +  nsHTTPCompressConv *self = static_cast<nsHTTPCompressConv *>(closure);
> +  *countRead = 0;
> +
> +  // this is documented in brotli/dec/decode.h
> +  const uint32_t kOutSize = 128 * 1024;

I would like this constant slightly better explained/motivated so that we can avoid having to look up that header and also, I did look in the header and it wasn't immediately obvious!

Attachment #8663701 - Flags: review?(daniel) → review+
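For readers wondering what a fixed output size such as kOutSize is for, the general pattern is a streaming decoder that is drained through a bounded output buffer, however large the decompressed body turns out to be. The sketch below shows that pattern with zlib from the Python standard library as a stand-in, since brotli is not in the standard library; it is not the brotli decoder API, and the function name is hypothetical.

```python
import zlib
from typing import Iterator

K_OUT_SIZE = 128 * 1024  # same numeric value as kOutSize in the quoted patch


def decode_in_chunks(compressed: bytes, out_size: int = K_OUT_SIZE) -> Iterator[bytes]:
    """Yield decompressed chunks of at most `out_size` bytes each (out_size > 0)."""
    decoder = zlib.decompressobj()
    pending = compressed
    while not decoder.eof:
        # max_length caps the output of this call; input that could not be
        # processed yet is handed back in unconsumed_tail.
        chunk = decoder.decompress(pending, out_size)
        pending = decoder.unconsumed_tail
        if chunk:
            yield chunk
        elif not pending and not decoder.eof:
            raise ValueError("truncated compressed stream")


if __name__ == "__main__":
    body = bytes(range(256)) * 4096  # 1 MiB of sample data
    sizes = [len(c) for c in decode_in_chunks(zlib.compress(body))]
    print(len(sizes), "chunks, largest", max(sizes), "bytes")
```

The bound keeps memory use per call predictable, which is the usual motivation for a constant like this; why 128 KiB specifically was chosen is not stated in this review.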

Patrick McManus [:mcmanus]

Assignee

Updated

•

10 years ago

Flags: needinfo?(ldeller)

Patrick McManus [:mcmanus]

Assignee

Comment 138

•

10 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/c4b11255892f6af81bca578e1787b645a7326a8d
bug 366559 - patch 1, update brotli snapshot r=jfkthame

https://hg.mozilla.org/integration/mozilla-inbound/rev/2a30f1edd862d3a949fe26b0dda6df03149e5d04
bug 366559 - patch 2, fix nsHTTPCompressConv indentation r=bagder

https://hg.mozilla.org/integration/mozilla-inbound/rev/6bfd3ac42ef0907c4c4d473a9287df9709bb2706
bug 366559 - patch 3, fix nsHTTPCompressConv bracing style r=bagder

https://hg.mozilla.org/integration/mozilla-inbound/rev/9ce35eb8d2c4224929fd18fda7d4bdc6755533fc
bug 366559 - patch 4, fix nsHTTPCompressConv namespace r=bagder

https://hg.mozilla.org/integration/mozilla-inbound/rev/f504caa27f0aed5764fb5e58e1a3595756c6edef
bug 366559 - patch 5, fix nsHTTPCompressConv manual addref r=bagder

https://hg.mozilla.org/integration/mozilla-inbound/rev/5e0a3850571f6b84ca301bd0f4bb2a1712730e94
bug 366559 - patch 6, support different content encodings for http vs https r=bagder

https://hg.mozilla.org/integration/mozilla-inbound/rev/4d45c4323570afd1a19695dea9f3c148a88891a4
bug 366559 - patch 7, content-encoding brotli for https r=bagder

Patrick McManus [:mcmanus]

Assignee

Comment 139

•

10 years ago

looks like the push gave the android build #include pain. will revert fix and resubmit

Patrick McManus [:mcmanus]

Assignee

Comment 140

•

10 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/0d687549721e4f0d8902ec4e9704488b0b052cc2 bug 366559 - backout due to android build bustage patch 7 on CLOSED TREE r=backout

Tyson Smith [:tsmith] (PTO)

Updated

•

10 years ago

Depends on: fuzzing-brotli

Patrick McManus [:mcmanus]

Assignee

Comment 141

•

10 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=d455966ac83e

Tyson Smith [:tsmith] (PTO)

•

10 years ago

Keywords: leave-open

Wes Kocher (:KWierso) (Not reading bugmail; email directly if needed)

Comment 143

•

10 years ago

https://hg.mozilla.org/mozilla-central/rev/c4b11255892f
https://hg.mozilla.org/mozilla-central/rev/2a30f1edd862
https://hg.mozilla.org/mozilla-central/rev/6bfd3ac42ef0
https://hg.mozilla.org/mozilla-central/rev/9ce35eb8d2c4
https://hg.mozilla.org/mozilla-central/rev/f504caa27f0a
https://hg.mozilla.org/mozilla-central/rev/5e0a3850571f
https://hg.mozilla.org/mozilla-central/rev/4d45c4323570
https://hg.mozilla.org/mozilla-central/rev/0d687549721e

Patrick McManus [:mcmanus]

Assignee

Comment 144

•

10 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/f8e7377270a1614eff46ff5ca9f26b198f6ac222 bug 366559 - patch 7, content-encoding brotli for https r=bagder

Patrick McManus [:mcmanus]

Assignee

Updated

•

10 years ago

Keywords: leave-open

Carsten Book [:Tomcat]

Comment 145

•

10 years ago

https://hg.mozilla.org/mozilla-central/rev/f8e7377270a1

Status: ASSIGNED → RESOLVED

Closed: 10 years ago

status-firefox44: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla44

Jean-Yves Perrier [:teoli]

Updated

•

10 years ago

Keywords: dev-doc-needed

Patrick McManus [:mcmanus]

Assignee

•

10 years ago

and of course gerv's "but I didn't mean it that way" will be part of the discourse - as it always is. better off being professional and not trying to toe that line.

Jyrki Alakuijala

Comment 151

•

10 years ago

would you be fine with 'br' ?

Flags: needinfo?(jyrki.alakuijala)

Patrick McManus [:mcmanus]

Assignee

•

10 years ago

(In reply to Patrick McManus [:mcmanus] from comment #152)
> (In reply to Jyrki Alakuijala from comment #151)
> > would you be fine with 'br' ?
>
> sounds good - thanks. I'll make that change and let the content-provider I
> know is working with this know.

That's excellent. Thank you.

dE

Comment 155

•

10 years ago

Thanks for the fix. Now that a major browser supports it, we may see more adoption of it.

Jyrki Alakuijala

Comment 157

•

10 years ago

I have asked a feminist friend from the North American culture-sphere, and she advised against bro. We have found a compromise that satisfies us, so we don't need to discuss this further. Even if we don't understand why people are upset from our cultural standpoint, they would be (unnecessarily) upset and this is enough reason not to use it.

Comment hidden (advocacy)

Comment hidden (off-topic)

Gervase Markham [:gerv]

Updated

•

10 years ago

Restrict Comments: true

Daniel Veditz [:dveditz]

Comment 166

•

10 years ago

I'll call this sec-review+ because we now have a brotli fuzzer that has successfully uncovered several bugs and can be added to our on-going fuzzing efforts.

Flags: sec-review?(twsmith) → sec-review+

Michal Purzynski [:michal`] (use NEEDINFO)

Comment 167

•

10 years ago

I don't like it as just "br" for the reasons Jamal gave above. We need to find a new name.

(not reading bugmail) Nick Desaulniers [:\n]

Updated

•

10 years ago

Keywords: devrel-needed

Jean-Yves Perrier [:teoli]

Comment 168

•

10 years ago

Updated:
https://developer.mozilla.org/en-US/docs/Web/HTTP/Content_negotiation#The_Accept-Encoding_header
https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Content-Encoding
and
https://developer.mozilla.org/en-US/Firefox/Releases/44#HTTP

Keywords: dev-doc-needed → dev-doc-complete
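On the server side, the negotiation those pages describe comes down to reading the Accept-Encoding request header and choosing a coding to name in Content-Encoding. A minimal sketch, assuming a server that can produce 'br' and 'gzip' and prefers them in that order; the function names are hypothetical and the parser is simplified (it honours q-values and the '*' wildcard but does not cover every corner of the HTTP grammar):

```python
from typing import Dict, Optional, Sequence


def parse_accept_encoding(header: str) -> Dict[str, float]:
    """Return a {coding: q-value} map parsed from an Accept-Encoding value."""
    weights: Dict[str, float] = {}
    for item in header.split(","):
        item = item.strip()
        if not item:
            continue
        name, *params = [part.strip() for part in item.split(";")]
        q = 1.0  # a coding listed without a q parameter has weight 1
        for param in params:
            key, _, value = param.partition("=")
            if key.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed weight as "not acceptable"
        weights[name.lower()] = q
    return weights


def choose_encoding(header: str, supported: Sequence[str] = ("br", "gzip")) -> Optional[str]:
    """Pick the best supported coding; None means send the body uncompressed."""
    weights = parse_accept_encoding(header)
    best, best_q = None, 0.0
    for coding in supported:  # server preference order breaks ties
        q = weights.get(coding, weights.get("*", 0.0))
        if q > best_q:
            best, best_q = coding, q
    return best


if __name__ == "__main__":
    print(choose_encoding("gzip, deflate, br"))  # br
    print(choose_encoding("gzip, deflate"))      # gzip
```

A server that picks a coding this way would normally also send "Vary: Accept-Encoding" so that caches keep the variants apart.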

Neil Harris

Reporter

Comment 169

•

10 years ago

(In reply to Michal Purzynski [:michal`] (use NEEDINFO) from comment #167)
> I don't like it as just "br" for the reasons Jamal gave above. We need to
> find a new name.

I think "br" (2 characters) is fine if you think of it as being like "gz" (also 2). On the other hand, "brotli" (5 characters) is fine if you think of it as being like "gzip" (4). Also, if you are using HTTP/2, which is likely to be the dominant protocol within a few years, once you've used it once, it will then be cached and header-compressed for the rest of the life of a connection, so it's not as if the three extra characters in "brotli" would eat 3 more header bytes per transaction in perpetuity.

Neil Harris

Reporter

Comment 170

•

10 years ago

(In reply to Neil Harris from comment #169)
> (In reply to Michal Purzynski [:michal`] (use NEEDINFO) from comment #167)
> > I don't like it as just "br" for the reasons Jamal gave above. We need to
> > find a new name.
>
> I think "br" (2 characters) is fine if you think of it as being like "gz"
> (also 2). On the other hand, "brotli" (5 characters) is fine if you think of
> it as being like "gzip" (4). Also, if you are using HTTP/2, which is likely
> to be the dominant protocol within a few years, once you've used it once, it
> will then be cached and header-compressed for the rest of the life of a
> connection, so it's not as if the three extra characters in "brotli" would
> eat 3 more header bytes per transaction in perpetuity.

Agh! s/5/6/, s/3/4/
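The corrected counts are easy to confirm. The sketch below just measures token and header-value lengths in bytes as they would appear in an uncompressed HTTP/1.1 header; the two full Accept-Encoding strings are assumed examples, not values taken from this bug.

```python
def header_bytes(value: str) -> int:
    """Number of bytes a header value occupies on the wire when sent uncompressed."""
    return len(value.encode("ascii"))


TOKENS = ("gz", "br", "gzip", "brotli")
TOKEN_BYTES = {token: header_bytes(token) for token in TOKENS}

# Extra cost per request of the long token over the short one.
EXTRA_BYTES = TOKEN_BYTES["brotli"] - TOKEN_BYTES["br"]

if __name__ == "__main__":
    print(TOKEN_BYTES)
    print("extra bytes per request:", EXTRA_BYTES)
    # Assumed example header values:
    print(header_bytes("gzip, deflate, br"), header_bytes("gzip, deflate, brotli"))
```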

Masatoshi Kimura [:emk]

•

9 years ago

Blocks: 1242904

Frederik Braun [:freddy]

Updated

•

9 years ago

Depends on: 1243724

[:philipp]

Updated

•

9 years ago

Depends on: 1261318

(Part 1: code) Add support for LZMA2 decompression "Content-Encoding: xz" using liblzma | 12 years ago | Luke Deller | 14.45 KB, patch
(Part 2: build) Add support for LZMA2 decompression "Content-Encoding: xz" using liblzma | 12 years ago | Luke Deller | 4.33 KB, patch
patch with adjusted diff settings (Part 1: code) | 12 years ago | Luke Deller | 33.80 KB, patch
patch with adjusted diff settings (Part 2: build) | 12 years ago | Luke Deller | 5.62 KB, patch
patch (part 0: import XZ embedded) | 12 years ago | Luke Deller | 89.69 KB, patch | mcmanus: feedback+
patch (part 1: code) | 12 years ago | Luke Deller | 24.64 KB, patch | mcmanus: feedback+
patch (part 2: build) | 12 years ago | Luke Deller | 12.83 KB, patch
patch (part 3: test) | 12 years ago | Luke Deller | 4.61 KB, patch | mcmanus: feedback+
patch (part 0: import XZ embedded) | 12 years ago | Luke Deller | 89.70 KB, patch | gerv: review+
patch (part 1: code) | 12 years ago | Luke Deller | 23.91 KB, patch | mcmanus: review-
patch (part 2: build) | 12 years ago | Luke Deller | 6.75 KB, patch | ted: review-
patch (part 3: test) | 12 years ago | Luke Deller | 4.53 KB, patch | mcmanus: review+
patch (part 2: build) | 12 years ago | Luke Deller | 6.09 KB, patch | ted: review+
patch (part 1: code) + whitespace fixes | 12 years ago | Alex Xu | 24.48 KB, patch
patch (part 1: code) | 11 years ago | Luke Deller | 24.59 KB, patch
patch (part 2: build) | 11 years ago | Luke Deller | 5.46 KB, patch | ted: review+
patch (part 3: test) | 11 years ago | Luke Deller | 4.49 KB, patch
patch 1, update brotli snapshot | 10 years ago | Patrick McManus [:mcmanus] | 207.03 KB, patch | jfkthame: review+
patch 2, fix nsHTTPCompressConv indentation | 10 years ago | Patrick McManus [:mcmanus] | 33.01 KB, patch | bagder: review+
patch 3, fix nsHTTPCompressConv bracing style | 10 years ago | Patrick McManus [:mcmanus] | 13.20 KB, patch | bagder: review+
patch 4, fix nsHTTPCompressConv namespace | 10 years ago | Patrick McManus [:mcmanus] | 4.89 KB, patch | bagder: review+
patch 5, fix nsHTTPCompressConv manual addref | 10 years ago | Patrick McManus [:mcmanus] | 1.20 KB, patch | bagder: review+
patch 6, support different content encodings for http vs https | 10 years ago | Patrick McManus [:mcmanus] | 11.23 KB, patch | bagder: review+
patch 7, content-encoding brotli for https | 10 years ago | Patrick McManus [:mcmanus] | 25.68 KB, patch | bagder: review+