91242 - CSS parsing is 5.5% of startup time

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Reporter

Description


24 years ago

CSS parsing, according to a jprof profile I just took on Linux, is about 5.5% of startup time. The time spent can be divided into two parts (although actually splitting the profile between them would be hard): string manipulation and data structure construction.

There are two approaches I can think of to the problem that we spend a lot of time mucking about with strings:

1) Improve the string use within the CSS parser by using the new string code and doing less copying. This could probably result in a significant improvement in the string manipulation part.

2) Store the CSS stylesheet data structures in the fastload file (see bug 68045). This would mean reconstructing the data structures out of the fastload file. I'm not sure what the performance of the fastload parsing is. Unlike JS script compilation, the processing done by the CSS parser is mostly very simple, so I'm not sure how big the win of replacing one parser with another would be, although there are certainly some things it would improve.

I need to look more at the data structure construction part. I'm not sure how much there really is beyond spending time in malloc.

I'll attach a profile of the CSS parsing segment of startup.
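As a rough illustration of approach 1 (doing less copying), the sketch below splits a CSS declaration into property and value without allocating new strings. It is not the Mozilla string code: it uses standard C++17 `std::string_view` as a stand-in, and the helper names `Trim` and `SplitDeclaration` are hypothetical.

```cpp
#include <cassert>
#include <string_view>
#include <utility>

// Hypothetical helper: returns a view of `s` without leading/trailing
// whitespace. No characters are copied; the view points into the input.
std::string_view Trim(std::string_view s) {
    const char* ws = " \t\r\n\f";
    auto first = s.find_first_not_of(ws);
    if (first == std::string_view::npos) return std::string_view{};
    auto last = s.find_last_not_of(ws);
    return s.substr(first, last - first + 1);
}

// Hypothetical helper: splits "property: value" at the first colon.
// Both halves are views into the caller's buffer, so the caller must
// keep that buffer alive while the views are in use.
std::pair<std::string_view, std::string_view>
SplitDeclaration(std::string_view decl) {
    auto colon = decl.find(':');
    if (colon == std::string_view::npos) {
        return {Trim(decl), std::string_view{}};
    }
    return {Trim(decl.substr(0, colon)), Trim(decl.substr(colon + 1))};
}
```

The trade-off is the same one that comes up with any copy avoidance: the views are only valid while the underlying buffer lives, so lifetime has to be managed by the caller.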

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Reporter

Comment 1


24 years ago

Attached file: profile of the CSS parsing part of startup

Pierre Saslawsky

Comment 2


24 years ago

glazman worked on reducing string manipulation a few months ago; he may have more info on this. I think that we still do 2 or 3 copies of the data. Marking Future for now, but will re-prioritize when David comes back with more stats.

Status: NEW → ASSIGNED

Keywords: perf

Priority: -- → P3

Summary: CSS parsing is 5.5% of startup time → [perf]CSS parsing is 5.5% of startup time

Target Milestone: --- → Future

Pierre Saslawsky

Comment 3


24 years ago

On a related matter, this is what Marc Attinasi wrote about a year ago in news://news.mozilla.org/3905EE19.361F497B%40netscape.com

----

I was looking into the SheetLoadData and CSSLoaderImpl string use and I found an interesting issue. Clearly, the two are using nsStrings to the exact same extent... The CSSLoader is duplicating the data coming in from the net (resulting in triplicate data, actually).

The CSSLoader uses the SheetLoadData to load the data. The SheetLoadData gets data from the netwerk and then makes a copy of it in an nsString, decoded in the correct charset. The decoded string is then passed from the SheetLoadData to the CSSLoader, which makes another nsString copy of it to pass to StringUnicharInputStream.

It seems that we could (should?) eliminate the second copy of the string data in the CSSLoaderImpl::DidLoadStyle method; however, the current implementation of the StringUnicharInputStream takes ownership of the string, so we may have to change that (we could change the way the LoadData passes the string to the Loader so that it passes ownership and allows the StringUnicharInputStream to delete the original too, which seems simpler actually). This should be a huge benefit; I'll look into the change. It could cut in half the string allocations related to loading the style sheets.
To summarize, this is the current situation:

Netwerk --- (data as char*) -->
SheetLoadData --- (copies data to stack-based nsString) -->
CSSLoader --- (makes another copy of the nsString passed in from SheetLoadData) -->
StringUnicharInputStream --- (takes ownership of nsString and deletes when done) --|

Possible optimization (eliminates one nsString copy of the data):

Netwerk --- (data as char*) -->
SheetLoadData --- (copies data to heap-allocated nsString) -->
CSSLoader --- (takes ownership of nsString passed in from SheetLoadData) -->
StringUnicharInputStream --- (takes ownership of nsString and deletes when done) --|

OK, I tried the change and it works correctly, as you would expect. This should result in 572K less bloat at load. This one was pretty easy to fix in the CSSLoader, but in general it is a bigger problem to control ownership effectively and at the same time avoid duplicating data; it seems the more we avoid duplicating data, the murkier the ownership issues become. I'll check changes to CSSLoader and SheetLoadData in as soon as I have them reviewed and the tree is open.

To answer a question posed below, all of my style memory stats are from Viewer, not from Mozilla. Also, my data shows memory footprint at steady-state, not transient data like the BloatStats show (from what little I understand about the bloat stats...).
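The ownership hand-off described in the quoted post can be sketched in modern C++ as follows. This is only an illustration under assumptions: `std::unique_ptr<std::u16string>` stands in for a heap-allocated nsString, and `DecodeToHeap`, `MakeStream`, and `OwningStringStream` are hypothetical names, not the actual SheetLoadData, CSSLoaderImpl, or StringUnicharInputStream code. The decoding step simply widens bytes, which is correct only for ISO-8859-1 input.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for StringUnicharInputStream: it owns the decoded
// text and frees it when the stream is destroyed.
class OwningStringStream {
public:
    explicit OwningStringStream(std::unique_ptr<std::u16string> text)
        : text_(std::move(text)) {}

    // Copies up to `count` code units into `out`; returns how many were read.
    std::size_t Read(char16_t* out, std::size_t count) {
        std::size_t n = std::min(count, text_->size() - pos_);
        text_->copy(out, n, pos_);
        pos_ += n;
        return n;
    }

    // Exposes the buffer address so callers can check that no copy was made.
    const char16_t* Data() const { return text_->data(); }

private:
    std::unique_ptr<std::u16string> text_;
    std::size_t pos_ = 0;
};

// Hypothetical stand-in for the SheetLoadData decoding step: the one
// unavoidable copy, from network bytes into a heap-allocated string.
// Assumes ISO-8859-1 input, so each byte widens to one code unit.
std::unique_ptr<std::u16string> DecodeToHeap(const std::string& bytes) {
    auto text = std::make_unique<std::u16string>();
    text->reserve(bytes.size());
    for (unsigned char c : bytes) {
        text->push_back(static_cast<char16_t>(c));
    }
    return text;
}

// Hypothetical stand-in for the loader step: it takes ownership of the
// decoded string and passes it straight to the stream without copying.
OwningStringStream MakeStream(std::unique_ptr<std::u16string> decoded) {
    return OwningStringStream(std::move(decoded));
}
```

Expressing the hand-off as a move makes the ownership explicit in the types: after `MakeStream(std::move(decoded))` the caller's pointer is null, so there is no ambiguity about who deletes the buffer.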

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Reporter