Open Bug 625383 Opened 14 years ago Updated 8 months ago

HttpChunkedDecoder should not memcpy all the data bytes

Tracking

()

Status:

NEW

Project Flags:

Performance Impact

none

People

(Reporter: mcmanus, Unassigned, NeedInfo)

References

(Blocks 1 open bug)

Details

(Whiteboard: [necko-backlog])

Attachments

(1 file)

optimize http chunked decoder memmoves 10 years ago Patrick McManus [:mcmanus] 2.23 KB, patch	u408661 : review+	Details \| Diff \| Splinter Review

Patrick McManus [:mcmanus]

Reporter

Description

•

14 years ago

in nsHttpChunkedDecoder.cpp when we need to identify a new chunk we parse the byte stream into a chunk len and then remove those bytes from the stream with memmove by moving the rest of the buffer up by a few bytes.

http://mxr.mozilla.org/mozilla-central/source/netwerk/protocol/http/nsHttpChunkedDecoder.cpp#99

That can be a large copy of data. And if it might contain more than just the next chunk's data - it could contain chunk headers after that. Which means we will have to copy some of it yet again.

Exactly how much pain this is depends on the chunk sizes, how full our buffers are and probably how much damage it does it l2/l3 cache. Ironically, underbuffering probably helps us here.

Nonetheless - it can add up to something significant. On https://developer.mozila.org/en-US/mobile I see almost 100KB of these copies to just load that page.

It will take more than just changes to nsHttpChunkedDecoder, but we can reorient things to not assume they have to read from the beginning of the buffer and avoid the copies all together.

Patrick McManus [:mcmanus]

Reporter

Comment 1

•

10 years ago

The fundamental blocker here is that our apis tend to be of the form "here is my buffer of size N, fill it please"... so the network layer in this case has done a large read into the user buffer and then needs to excise the chunk markers from it. (which requires memmoving all the subsequent bytes up to cover the gaps). The result is pretty much all chunked data gets memcopied.

our http/2 code faces a similar problem.. frame headers (and control frames) constantly are mixed in the data stream. It addresses this by limiting reads to frame header size to figure out the size of the data chunk, and then reading the chunk size. No moves required.

http/1 chunked is a bit messier because it is not a binary frame format; but I've done something very similar with this patch. When looking for a chunk size read either 1KB (when getting all the headers) or 6 bytes (when aligned at an unparsed chunksize marker).. when we have a parsed chunk size then just read up to the remaining chunk size. This minimizes the amount of memmove

https://tbpl.mozilla.org/?tree=Try&rev=0a69e8d07c31

The first page I ran this on reduced memmoves from 1,865KB to 12KB while adding about 500 syscalls. I think that's a win - trivial syscalls are trivial, but memory bandwidth is always a bottleneck.

Patrick McManus [:mcmanus]

Reporter

Comment 2

•

10 years ago

Attached patch optimize http chunked decoder memmoves — Details — Splinter Review

Attachment #8370062 - Flags: review?(hurley)

Patrick McManus [:mcmanus]

Reporter

Updated

•

10 years ago

Assignee: nobody → mcmanus

Status: NEW → ASSIGNED

u408661

Updated

•

10 years ago

Attachment #8370062 - Flags: review?(hurley) → review+

Patrick McManus [:mcmanus]

Reporter

Comment 3

•

10 years ago

remote:   https://hg.mozilla.org/integration/mozilla-inbound/rev/73caaea5844f

Carsten Book [:Tomcat]

Comment 4

•

10 years ago

https://hg.mozilla.org/mozilla-central/rev/73caaea5844f

Status: ASSIGNED → RESOLVED

Closed: 10 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla30

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Updated

•

10 years ago

Depends on: 971001

Kartikaya Gupta (email:kats@mozilla.staktrace.com)

Comment 5

•

10 years ago

Note that this change was backed out because of bug 971001.

https://hg.mozilla.org/integration/mozilla-inbound/rev/a9f3f9263459

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Ryan VanderMeulen [:RyanVM]

Comment 6

•

10 years ago

Merge of backout:
https://hg.mozilla.org/mozilla-central/rev/a9f3f9263459

Target Milestone: mozilla30 → ---

Patrick McManus [:mcmanus]

Reporter

Updated

•

8 years ago

Assignee: mcmanus → nobody

Whiteboard: [necko-backlog]

Firefox Bug Husbandry Bot

Comment 7

•

7 years ago

Bulk change to priority: https://bugzilla.mozilla.org/show_bug.cgi?id=1399258

Priority: -- → P1

Firefox Bug Husbandry Bot

Comment 8

•

7 years ago

Bulk change to priority: https://bugzilla.mozilla.org/show_bug.cgi?id=1399258

Priority: P1 → P3

BMO Automation

Updated

•

2 years ago

Severity: normal → S3

Valentin Gosu [:valentin] (he/him)

Comment 9

•

1 year ago

Considering the reason this got backed out, it's probably not the way to go.
But in any case, we should figure out if this is still something worth pursuing, especially since we now have H2/H3.
I suspect we might end up wontfix-ing this issue.

Status: REOPENED → NEW

Performance Impact: --- → ?

Flags: needinfo?(acreskey)

Andrew Creskey [:acreskey]

Comment 10

•

11 months ago

Keeping the ni' on myself as I'm hoping to look deeper into H2/H3 perf.

Blocks: necko-perf

Performance Impact: ? → none

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

HttpChunkedDecoder should not memcpy all the data bytes

Categories

(Core :: Networking: HTTP, enhancement, P3)

Tracking

()

People

(Reporter: mcmanus, Unassigned, NeedInfo)

References

(Blocks 1 open bug)

Details

(Whiteboard: [necko-backlog])

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Updated

Updated

Comment 3

Comment 4

Updated

Comment 5

Comment 6

Updated

Comment 7

Comment 8

Updated

Comment 9

Comment 10

Attachment

General

Description

File Name

Content Type