43457 - MEM: Style Context Sharing needs to be improved to increase sharing and reduce performance degradation

Reporter

Description

•

24 years ago

Style Context Data sharing currently saves a lot of memory at the expense of a moderate performance hit. We need to improve the sharing design to minimize the performance hit and to increase the space savings. Some initial ideas include:

- Allocate StyleContextData out of a Pool or Arena to minimize heap allocations
- Share style data across documents: presumably this means storing the style context cache (or a style data cache) at a more global scope than the style set currently used to root the collection.
- Enable the FastCaching algorithm: this requires closing the encapsulation-hole of GetMutableStyleData and having the style system manage changes to style data effectively (updating the CRC, updating the style context cache, possibly validating the style data is consistent)

This bug is intended to track ideas and log results of experimentation and prototyping of designs that improve style context sharing in terms of performance and memory footprint (including bloat).

Marc Attinasi

Reporter

Comment 1

•

24 years ago

Added depends on bug 39618 since there is valuable information in that bug that pertains to this one.

Status: NEW → ASSIGNED

Depends on: 39618

Target Milestone: --- → M20

rbs

Comment 2

•

24 years ago

Adding myself to the Cc list...

leger

Comment 3

•

24 years ago

Adding perf keyword

Keywords: perf

Marc Attinasi

Reporter

Comment 4

•

24 years ago

Moving all non-nsbeta3 bugs to future milestone: these will be worked on after beta3/rtm.

Target Milestone: M20 → Future

Neelakanth Nadgir

Comment 5

•

24 years ago

adding myself to interest list

Pierre Saslawsky

Assignee

Updated

•

24 years ago

Keywords: embed

Marc Attinasi

Reporter

Comment 6

•

•

24 years ago

Moving to Mozilla 1.0 - this cannot be completed in the next week or two...

Target Milestone: mozilla0.8 → mozilla1.0

Axel Hecht

Comment 15

•

24 years ago

I tried to apply this patch (to test XSLT), to no avail. I got around the whitespace mess, but the filenames are just not found. I need a commandline to apply this on unix, I tried all the commandline opts I know. Btw, XSLT is targeted to go into the default builds for 0.9, if this patch is the code that caused us to regress this will be pretty serious. Axel

Daniel Bratell

Comment 16

•

24 years ago

Are the performance problems solved now?

Heikki Toivonen (remove -bugzilla when emailing directly)

•

24 years ago

I will soon have another big checkin. Reassigned to myself. Set milestone to mozilla0.8.1 (March 13th).

Assignee: attinasi → pierre

•

24 years ago

For those who want to review the code or test the diffs, the basic principles are the following:

1) Change of interfaces to modify the style data: GetMutableStyleData() was replaced by 2 methods: ReadMutableStyleData() and WriteMutableStyleData(). Look into nsIMutableStyleContext.h for more info on the helper classes and the implementation.

2) Memory footprint reduction: In nsStyleContext.cpp, the nsStyleContextData now contains pointers to the 15 individual style structures, instead of the structures themselves. When a pointer is null, it means that we are inheriting the data from the parent.

3) Simplification of the code: The 15 pointers in nsStyleContextData are stored as a union of individual pointers and an array of generic pointers. The array makes it possible to loop through the structures instead of duplicating the code 15 times in switch() statements. This simplification required a slightly unfortunate change in the definition of nsStyleStruct (in nsStyleStruct.h) where we now expose methods that should be hidden from everybody but the StyleContext. It's a small drawback for a big gain.
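Points 2 and 3 can be illustrated with a minimal sketch. This is not the code from the patch: the struct names, the two-entry ID enum, and the parent-chain lookup in Get() are assumptions made for illustration, and the union described above is replaced here by a plain array of generic pointers so the example stays free of type punning.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for the real style structs (2 shown instead of 15).
struct StyleStruct { virtual ~StyleStruct() = default; };
struct StyleFont : StyleStruct { int size = 12; };
struct StyleColor : StyleStruct { unsigned rgb = 0; };

enum StyleStructID : std::size_t { eStyle_Font = 0, eStyle_Color, eStyle_Count };

// Holds one generic pointer per style struct instead of the structs themselves.
struct StyleContextData {
  std::array<const StyleStruct*, eStyle_Count> slots{};  // all null initially
  const StyleContextData* parent = nullptr;

  // A null slot means "inherited": walk up until some ancestor owns the data.
  // (The walk itself is an assumption about how inheritance is resolved.)
  const StyleStruct* Get(StyleStructID id) const {
    for (const StyleContextData* d = this; d; d = d->parent) {
      if (d->slots[id]) return d->slots[id];
    }
    return nullptr;
  }

  // The array lets us loop over all structs instead of writing a switch().
  std::size_t CountOwned() const {
    std::size_t n = 0;
    for (const StyleStruct* s : slots) {
      if (s) ++n;
    }
    return n;
  }
};
```

A child context that only overrides the font would own one slot and resolve every other struct through its parent, which is where the footprint reduction would come from.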

Mike Shaver (:shaver emeritus)

•

24 years ago

nsStyleContextDebug.cpp and nsStyleContextDebug.h should be stored in the same directory as nsStyleContext.cpp (mozilla/content/base/src)

amardare

Comment 33

•

24 years ago

Adding myself to the list.

Heikki Toivonen (remove -bugzilla when emailing directly)

•

24 years ago

Attached file: nsStyleContext.cpp with fixes for BIDI and XUL

Hixie (not reading bugmail)

Updated

•

24 years ago

Blocks: 72381

Marc Attinasi

Reporter

Comment 39

•

24 years ago

Pierre, what testing have you done on this? Specifically, have you run the style/block/table regression tests? Have you run John Morrison's Load Time tests? Just curious.

Pierre Saslawsky

Assignee

•

24 years ago

I posted a request for review last Monday but got basically no response. My partner Marc is way too busy fighting other fires in Layout. Could someone please help me to review these changes? I would especially appreciate if someone could build and play a bit with Mozilla on Unix. Thanks!

Brendan Eich [:brendan]

Comment 44

•

24 years ago

Sounds like testing is more important than reviewing at this stage. Pierre, I can try an up-to-date patch -- should I just detach the diffs and the whole files and use them, or are they now slightly out of date with respect to top of trunk? Cc'ing jrgm, who should run the patch too, if he's able. /be

John Morrison

Comment 45

•

24 years ago

To build it, I need (?):

attach_id=28072 -- diffs to layout/content, but remove the nsStyleContext*.(h|cpp) stuff
attach_id=28118 -- nsStyleContextDebug.cpp
attach_id=28119 -- nsStyleContextDebug.h
attach_id=28142 -- nsStyleContext.cpp (updated version)

where the last three are whole files, and go in mozilla/content/base/src. Is that correct?

I will do this on win32, as my linux box is (currently) the test server, and my mac build-fu is weak. However, if someone can, running the gecko perf tests (which are more specific to this bug); and the block/table/layout regression tests would be good if people are concerned.

Pierre Saslawsky

Assignee

Comment 46

•

24 years ago

I haven't built in the past 4 days but according to Bonsai & CVS, it should still be ok. To merge the changes:

- apply the diffs
- add nsStyleContextDebug.cpp/.h to mozilla/content/base/src
- overwrite nsStyleContext.cpp

John Morrison

•

24 years ago

Attached patch: updated diffs, all in a single file

John Morrison

Comment 51

•

24 years ago

Sorry, I finally got these runs done. As pierre mentioned, the times are about 4% slower, on either the first load (variable range), or a subsequent visit when cached (fairly consistent across the board). (This was for a tree pulled ~11pm last night, built, ran test, patched, built, ran test 2, on win2k).

Chris Waterson

Comment 52

•

24 years ago

Hrm, that's a pretty steep price to pay. What were the savings again?

chris hofmann

Comment 53

•

24 years ago

jrgm, still got that build? can you provide a pointer to it? how 'bout we get curt to do a special mem test run to look at that graph?

John Morrison

•

24 years ago

Pierre has an interesting idea here, about disabling style context sharing when we enable his style data sharing. However, it is not clear what the savings would be for style data sharing if style context sharing is disabled - it may not yield the same level of memory benefit. Also, style context sharing is actually faster than what I had originally measured. This is largely due to the faster StyleContextCache being put in, which I believe resulted in only a 5% slowdown for Viewer (and even less for SeaMonkey, reported anecdotally). I'd love to see what the performance cost of style context sharing is for SeaMonkey, and now we have the tools to do that (eg. jrgm).

I suggest running tests to characterize the performance and memory gains/losses across two dimensions:

With and Without context sharing
With and Without data sharing

If we see that the footprint gains due to style context sharing alone are less than the gains due to style data sharing alone, and the performance gain from disabling style context sharing makes style data sharing's performance loss acceptable (or zero), then we should turn off context sharing and turn on style data sharing.

I think this is a good approach because style context sharing can be enabled or disabled with a #define in one file, so it is easy to keep in the tree while we work on performance issues in style data sharing.

(I tried to make this more confusing, but this is as obfuscated as I could get it...)

rbs

Comment 62

•

24 years ago

[mid-air collision - resubmitting - however, I noted that marc raised similar points to mine]

BTW, there haven't been many comments about the interaction of your "StyleData sharing" and the previous "StyleContext sharing". Could you brief us a little bit on what is happening on this front? Is there any inter-dependence? When both are used, do your changes entail an extra overhead in the implementation of the style context sharing? If both are used, does that lead to "double" (or some other factor) extra savings? Do your "low-level" styledata changes provide an independent alternative that aptly removes the need for the more expensive "higher-level" style-context sharing?

[I am just asking these questions as an indication of the things that some might find useful to understand. Thanks]

Pierre Saslawsky

Assignee

Comment 63

•

24 years ago

I tried out the win32-talkback drop (in fact I am sending this comment with it). No weirdness to report so far. Yep, pierre, now that instrumentation has shown an improved perf, your call makes sense. Indeed, the style data sharing is "inclusive" of the style context sharing (i.e., the very fact that two style contexts are identical means that all their style data are identical and shared). By single-mindedly focusing on style data sharing, maybe more consolidation and efficiency can be implemented in due time before 1.0.

Chris Waterson

Comment 74

•

24 years ago

So pierre is measuring a speedup, yet at one point I thought Netscape QA measured a slowdown (I'm guessing with jrgm's test suite). Pierre, I presume you've only tried this on Mac? Might other platforms be affected differently? (e.g., because malloc() is more expensive on Win32/Linux due to lock contention?) It'd be good to understand why we're seeing different reports. (Maybe the delta is just too small to realistically resolve with our tools, in which case, performance should be considered a non-issue.)

Out of curiosity, what do you see on larger pages? (e.g., nsCSSFrameConstructor.cpp in LXR, tinderbox, table stress test.)

Do we know the impact of this stuff on chrome? (Menus, tree, hover, tooltips, etc.) Maybe hyatt could help you assess that.

Are you still looking for r= and sr=? Or are those covered?

rbs

Comment 75

•

24 years ago

Apparently this is the first time it is tested in isolation (without SC sharing)

Pierre Saslawsky

Assignee

Comment 76

•

24 years ago

Chris,

- It is indeed the first time StyleData sharing is tested independently from StyleContext sharing.
- I tested on the Mac only.
- Results may vary depending on your platform but if experience is any indication, they should really not vary by a lot. For instance, when I predicted a slowdown lower than 5% for SC + SD sharing on the Mac, jrgm found around 4% on other platforms.
- I will test larger pages later today.
- I don't know the impact on the chrome but if experience is any indication, it will be good.
- There has not been any formal review because _you_ and other people declined to reply to my repeated requests. So with Brendan's suggestion, I went through the process of a landing, with verification builds handed over to QA before checking in. I also supported Marc's standpoint that for these changes a thorough testing would be much more important than a formal review. Anyhow, if you'd like to review now, please be my guest: the diffs are attached. If you want the exact code, I can attach new diffs where StyleContext sharing is disabled.

Daniel (Leaf) Nunes

Comment 77

•

24 years ago

Were jrgm's tests run using optimized builds with these changes, or debug? Was the comparison made to a debug or optimized build? Could that account for the performance hit?

Chris Waterson

Comment 78

•

24 years ago

pierre, I apologize if you felt that I ignored you. (IIRC, when you asked me to sr=, I replied, asking if Marc liked the changes. You said something along the lines of ``let's just get a Unix build to work first''.) If you'd still like me to sr= these changes, attach the latest diffs to the bug and I'll go through them.

Pierre Saslawsky

Assignee

•

24 years ago

Never mind - I see what is going on. Sorry for the noise.

Chris Waterson

•

24 years ago

Scott: I posted my comments before I read yours. I agree with you and Chris on #1. If you prefer, I can implement it before checking in but since it doesn't change anything in the execution, I would not want that to cause another round of verification builds for QA.

About the mutable style, my main guideline in all the declarations was to have as little an impact as possible in the code outside the StyleContext. GetMutableStyleData() is called in approximately 130 places in 35 files, and the pointers to the mutable style are used in probably hundreds of places (and fortunately nowhere in a persistent way).

The thing that will make you frown the most is probably #5. If you want to give it a try, you're welcome. I sincerely tried many solutions with pure virtual methods or just virtual methods (like having a second base class that the StyleXXXImpl structures could inherit from, etc...) but given the facts that:

- customers had to be able to use an object created by the StyleContext (ReadMutableStyleData)
- the StyleContext had to be able to use an object created by the customer (GetStyle)
- customers could be in another DLL (Layout)

I believe I ended up with a very honorable solution but if there is a better one, I'm confident you will find it (beware of crashes when virtual tables don't match).

rbs

Comment 89

•

24 years ago

I will cast my vote for #1 to be addressed before check-in. Motivation: strike the iron while it is hot... Now that all eyes are on this, let's make (or at least try to make) the most of the brainstorming that is going on and the eagerness around w.r.t. reaping the savings of this style data sharing.

Chris Waterson

Comment 90

•

24 years ago

> 1) Override the operators in nsMutableStyleXXX instead of nsAutoStyleStruct:

What rbs said: you're essentially redefining the API at this point (for all practical purposes, callers must use your nsMutableStyleXXX objects). If we don't fix this now, you'll be chasing the tree as people start writing new code that uses the new API.

One thing that I forgot to ask last night was, what is the linkage impact of these new classes? Does layout need to link against content now? (I guess not, since you've not changed any makefiles AFAICT.) Which would mean the code probably gets duplicated in each of content and layout. (Tangent: we need to redouble our efforts to get content & layout converted to using NS_IMPL_NSGETMODULE so they can be recombined for release.)

> 2) GetStyleXXXFrom(mStyleContext):
> ----------------------------------
> I prefer the current notation where it is clear that the helper classes are in
> fact constructing something,

Agreed. That makes sense.

> 5) Pure virtual functions in nsStyleStruct:
> -------------------------------------------
> It was my idea at first, of course. Unfortunately it would have required
> all the classes that inherit from nsStyleStruct to declare and implement
> these methods.

What style structs *don't* need this? Why?

> 14) Member variable instead of pooling:
> ---------------------------------------
> To minimize the impact on the existing code, I wanted ReadMutableStyleData()
> to behave exactly like the current GetMutableStyleData() with the only
> difference that the caller must call WriteMutableStyleData() when he's
> done with the changes.

But because of the nsMutableStyleXXX classes, no sane person would call [Read|Write]MutableStyleData() by hand, would they? (Are we bending over backwards to support a non-use case?) I'll have to read the code more carefully to really understand what you're trying to do here.

More comments later.

Pierre Saslawsky

Assignee

•

24 years ago

To be honest, if you are looking for better solutions for the declaration of nsStyleStruct, the closest thing I found involved a big reshuffling of all the StyleXXXImpl and quite a bit of modifications in the StyleContext too:

- Leave nsStyleStruct the way it is.
- Use the nsStyleXXX structures for the only purpose of passing data back and forth with the customers, meaning that when we receive nsStyleXXX, we have to copy it into a StyleXXXImpl before we do anything (perf?).
- Make the StyleXXXImpl structures not inherit from nsStyleStruct but from another base class that implements these 4 functions.
- Declare a nsStyleXXX as a member variable in its corresponding StyleXXXImpl.

It sounds simple like that but I think there was a problem (besides the reshuffling) and I can't remember what.

Chris Waterson

Comment 103

•

24 years ago

With the patches applied, looking at a tree, it's generally much easier to understand. :-)

> I maintain (you will be interested!) that SetStyle() is evil, as well as
> any API where we would have to accept a structure allocated by the
> customer with the purpose of modifying the style context. This is for
> almost the same reason as above: the implementations are done in the
> StyleXXXImpl, which are a certain way private to the StyleContext, and
> when writing into the StyleContext we need to receive back objects that
> we have created in order to be able to talk to them (ie. objects that
> implement our code).

I find this argument to be pretty weak. I think the *real* issue is that this has been designed around trying to use C++ vtable dispatch in the back-end implementation, and that's led you down this garden path.

The style structs are the interface by which your consumers communicate with you about style data. Even if you did, say, need to compute a whole bunch of extra information for your own private use (which you don't), I still don't think the argument holds water. You don't need to *create* the style struct to be able to ``talk'' to it efficiently. You need to know its style struct ID. Think of *that* as your vtable.

If you find the pervasive `switch' logic too burdensome (I would!), then roll your own dispatch! Places where you've used C++'s built-in virtual dispatch should simply use the style struct ID to do hand-rolled dispatch: use the style struct IDs as indexes into a table of function pointers, and a *lot* of the ugly switch statement code will go away.

I don't think that this would really entail much change to what you've already got. But, it would

1. Get rid of the need for you to have the nsStyleStruct implement any virtual methods. (This is unsightly anyway: it's propagated implementation detail out to a place where nobody should know or care about it.)
2. Get rid of the ad hoc style struct pool, which is another nail in the layout re-entrancy coffin.
3. Still allow you to achieve switch-less nirvana.
4. Still allow you to use your own private storage, if the day comes where style maintains ``extra'' computed information that's not visible to consumers of the API.

What do you think?
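The hand-rolled dispatch suggested above might look roughly like the following sketch. It is only an illustration of the idea: the struct names, the two operations (equals and reset), and the table layout are hypothetical and do not come from any patch on this bug.

```cpp
#include <cassert>
#include <cstddef>

enum StyleStructID { eStyle_Font = 0, eStyle_Color, eStyle_Count };

// Plain data structs: no base class and no virtual methods (hypothetical).
struct StyleFont { int size; };
struct StyleColor { unsigned rgb; };

// One row of the hand-rolled "vtable": operations keyed by style struct ID.
struct StyleOps {
  std::size_t size;
  bool (*equals)(const void* a, const void* b);
  void (*reset)(void* p);
};

static bool FontEquals(const void* a, const void* b) {
  return static_cast<const StyleFont*>(a)->size ==
         static_cast<const StyleFont*>(b)->size;
}
static bool ColorEquals(const void* a, const void* b) {
  return static_cast<const StyleColor*>(a)->rgb ==
         static_cast<const StyleColor*>(b)->rgb;
}
static void FontReset(void* p) { static_cast<StyleFont*>(p)->size = 12; }
static void ColorReset(void* p) { static_cast<StyleColor*>(p)->rgb = 0; }

// The table is indexed directly by StyleStructID, so callers need no switch().
static const StyleOps kStyleOps[eStyle_Count] = {
  { sizeof(StyleFont), FontEquals, FontReset },
  { sizeof(StyleColor), ColorEquals, ColorReset },
};

inline bool StyleEquals(StyleStructID id, const void* a, const void* b) {
  return kStyleOps[id].equals(a, b);
}
inline void StyleReset(StyleStructID id, void* p) {
  kStyleOps[id].reset(p);
}
```

With a table like this, the style context can operate on a struct allocated by the caller as long as it knows the ID, which is the property point 1 relies on.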

Pierre Saslawsky

Assignee

Comment 104

•

24 years ago

You are correct about the origin of the problem, I wanted to keep the C++ dispatch. And I still do: I'll give a second look at what I described above because when I tried it, I had in fact the StyleXXXImpl inheriting both from a base class that declared the pure virtual methods and from the 'empty' nsStyleStruct. If we cut the inheritance between nsStyleStruct and StyleXXXImpl, we are left with quite a bit of reorg in the StyleXXXImpl, but I prefer to put the burden on these objects rather than on the rest of the code. The benefits would be the same, with in addition the C++ dispatch and inheritance, and less changes in StyleContext and StyleContextData. Same question: what do you think? And what do you mean by "Still allow you to use your own private storage..."? How can I have private storage if I receive a structure that I did not create?

Pierre Saslawsky

Assignee

•

24 years ago

Attached patch: use C++ template to declare helper classes

Chris Waterson

Comment 116

•

24 years ago

Just an idea. Might this be a more elegant way to declare the helper classes?

Chris Waterson

Comment 117

•

•

24 years ago

Still, I would be curious to see if what I noticed with globals vs stack can be confirmed on other platforms. I'll make diffs but not until late.

rbs

Comment 128

•

24 years ago

Regarding the [im]mutable question, let me chime in to point to a comment of Peter Linss that I once saw in bug 1230. Basically, he was saying that there were two pathways to the style structs, and only some blessed objects were allowed to use the mutable pathway.

On the same topic, in a reply to an e-mail message where I asked if MathML frames could use the GetMutable API to alter their resolved style data [I asked this because I was hoping that the MathML frames could change their style contexts so as to reflect MathML attributes that are not supported by the Style System], he replied something along the lines: "Nope, unless you are a style rule". And he said that he was planning to remove that method (that's why it was deprecated BTW), and via this process to fix the bad callers (I guess by this he meant calls that are modifying the style context outside the style resolution -- these can be seen on pierre's migration patch: the mutations at the frame-level, e.g., [nsGfxTextControl,Table,nsCSS]Frame).

Pierre might better explain the Right Thing that should be done for these random mutations: it is a costly style re-resolution in the associated sub-trees to bring the style tree back in sync. I guess that may be why Peter Linss wanted to avoid the proliferation of this practice, and to only allow straight mutations by the blessed objects during the controlled style resolution process.

Hope these comments provide some insights into waterson's point 1 and to the genesis of the problem.

rbs

•

24 years ago

Attached patch: partial diffs that use globals instead of the stack for Read/Write

John Morrison

Comment 133

•

24 years ago

Okay, so that system was not (is not?) well. I had to clean a bunch of stuff up to get it to run sanely at all (e.g., a virus program that meant that deleting an old profile took >5min). I'm still suspicious of the state of that machine, so I don't think these numbers mean a heck of a lot (e.g., it is nominally a faster machine with more memory than what waterson tested, but the times are slower than what he measured). But, here they are anyways (I only did 3 cycles; first is uncached, 2 and 3 are cached):

          avg     med     max     min     #1      #2      #3
before    9674    9694    25181   1694    9864    9283    9878
after     9842    9772    26890   1691    9851    9871    9805
          -168    -78     -1709   3       13      -588    73
          -1.7%   -0.8%   -6.8%   0.2%    0.1%    -6.3%   0.7%
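For readers checking the last two rows: they appear to be the "before" value minus the "after" value, and that difference as a percentage of "before", so a negative number means the patched build was slower. A small sketch of that arithmetic, with a helper name (PercentChange) that is purely illustrative:

```cpp
#include <cassert>
#include <cmath>

// Signed change relative to the "before" measurement, in percent.
// Negative means the "after" run took longer (a slowdown).
inline double PercentChange(double before, double after) {
  return (before - after) / before * 100.0;
}
```

For the average column this gives (9674 - 9842) / 9674, which is about -1.7%, and for cycle #2 it gives (9283 - 9871) / 9283, which is about -6.3%.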

Pierre Saslawsky

Assignee

•

24 years ago

Ooops, I'm leaving on sabbatical in a bit more than 2 weeks, not 3. Yeehaaa!

Brendan Eich [:brendan]

Comment 140

•

24 years ago

drivers@mozilla.org don't see the need to get the API changes in for 0.9 -- the patches here are backed up, as well as on your machine. It's very hard right now to get any review and testing cycles, because everyone is rushing for the 0.9 deadline. We'll have plenty of confounding and causative variables to untangle tomorrow, and for a week (until 0.9 releases).

Do we know exactly what the performance effects of the API-only patch are? Unless you get a windows build and jrgm tests going fast, I don't know those effects. They should be zero, but Murphy was an optimist. We don't need any more risk here, even if it is tiny -- we don't even have the "brainprint" to take these changes into the mix.

Sorry, and we will get them in after 0.9, with performance improvements too, I hope -- possibly even next week if you, waterson, et al. can pin down where the cycles are going and optimize them away.

/be

Pierre Saslawsky

Assignee

Comment 141

•

24 years ago

Sorry, I can't commit myself to work a week on this or even 3 days, I think. I have more than a hundred bugs to triage before I leave (out of 170+), and more than a couple of them would be nice to fix for 0.9.1 or 1.0. I'm not sure I'll be back before the tree closes for 1.0.

Checking in the API would allow someone else to work on the style system memory model much more easily, as only 3 files would be different. The change of API would consist in changing the calls from:

    nsStyleXXX myStyle = myContext->GetMutableStyleData(eStyleStruct_XXX);

to:

    nsMutableStyleXXX myStyle(myContext);

where the template for nsMutableStyleXXX would be:

    template <class T, nsStyleStructID SID>
    class basic_nsAutoMutableStyle {
    protected:
      T mStyleStruct;
    public:
      basic_nsAutoMutableStyle(nsIMutableStyleContext* aContext) {
        aContext->GetMutableStyleData(SID, &mStyleStruct);
      }
      basic_nsAutoMutableStyle(nsIStyleContext* aContext) {
        aContext->GetMutableStyleData(SID, &mStyleStruct);
      }
      T* get() { return &mStyleStruct; }
      T* operator->() { return get(); }
      T& operator*() { return *get(); }
    };

As you can see, it's pretty much empty.

A point to consider is that whatever the risk of checking in this empty code might be, it would better be managed if I were still around for a couple of weeks to cover any problems that may arise. After all, if risk there is, it would make even less sense to finally proceed to do the checkin the day before I leave.

I would check it in now without hesitation but it's up to drivers@mozilla.org to balance the risk involved by any code, even empty, and the benefit to the Community of making it possible for its members to work on something important.

Anyhow, API or not, now or later, a full resolution is in the Future for me. Marking as such.

Status: NEW → ASSIGNED

Target Milestone: mozilla0.9 → Future

Pierre Saslawsky

Assignee

Comment 142

•

24 years ago

Of course my point was that this mostly empty code doesn't need any performance testing and to answer your concerns, it was reviewed (and even contributed to) by waterson.

Pierre Saslawsky

Assignee

Comment 143

•

24 years ago

Based on the current API, an even emptier code would be:

    template <class T, nsStyleStructID SID>
    class basic_nsAutoMutableStyle {
    protected:
      T* mStyleStruct;
    public:
      basic_nsAutoMutableStyle(nsIMutableStyleContext* aContext) {
        mStyleStruct = aContext->GetMutableStyleData(SID);
      }
      basic_nsAutoMutableStyle(nsIStyleContext* aContext) {
        mStyleStruct = aContext->GetMutableStyleData(SID);
      }
      T* get() { return mStyleStruct; }
      T* operator->() { return get(); }
      T& operator*() { return *get(); }
    };
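To see how such a wrapper would be used by callers, here is a self-contained sketch that compiles outside the Mozilla tree. Everything in it is a stand-in: MockStyleContext, DemoStyleFont, DemoStyleColor and the AutoMutableStyle template are hypothetical names that only mirror the shape of the template quoted above, and the void* return type of the mock's GetMutableStyleData() is an assumption, not the real interface.

```cpp
#include <cassert>

enum DemoStyleStructID { eDemo_Font = 0, eDemo_Color, eDemo_Count };

// Hypothetical style structs owned by the mock context below.
struct DemoStyleFont { int mSize = 12; };
struct DemoStyleColor { unsigned mColor = 0; };

// Mock of a style context that hands out pointers to its mutable style data.
class MockStyleContext {
 public:
  void* GetMutableStyleData(DemoStyleStructID aSID) {
    switch (aSID) {
      case eDemo_Font: return &mFont;
      case eDemo_Color: return &mColor;
      default: return nullptr;
    }
  }
 private:
  DemoStyleFont mFont;
  DemoStyleColor mColor;
};

// Same shape as the pointer-based template above: the constructor fetches
// the struct, and callers then use the object like a pointer.
template <class T, DemoStyleStructID SID>
class AutoMutableStyle {
 protected:
  T* mStyleStruct;
 public:
  explicit AutoMutableStyle(MockStyleContext* aContext)
      : mStyleStruct(static_cast<T*>(aContext->GetMutableStyleData(SID))) {}
  T* get() { return mStyleStruct; }
  T* operator->() { return get(); }
  T& operator*() { return *get(); }
};

typedef AutoMutableStyle<DemoStyleFont, eDemo_Font> MutableStyleFont;
typedef AutoMutableStyle<DemoStyleColor, eDemo_Color> MutableStyleColor;
```

A caller would write `MutableStyleFont font(&context); font->mSize = 18;` instead of fetching and casting the struct by hand. Because every access goes through the wrapper, the memory model behind it could later change without touching the call sites.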

Chris Waterson

•

24 years ago

Time to weigh in here. I am of the opinion that all of the efforts taken so far to reduce the style system's footprint, although admirable, are moving in the wrong direction. None of the patches that have been produced have addressed the fundamental problem of the style system, namely that it eagerly resolves all possible values for every style context.

The sharing strategies employed thus far continue to work under a fundamentally broken assumption: that all style data must be computed and only after a match is found can data be thrown away. This patch adds even more complicated sharing logic, and although it achieves a runtime footprint reduction, it does nothing to reduce the obscene amount of churn that the style system generates as it resolves all of these property values.

I have been working on a proof of concept for a new system of rule matching that uses lazy resolution and that uses other tricks to reduce both footprint and performance costs. I've gotten far enough with the code that I'm confident it will be not only a large performance win, but that it will also be a large footprint win.

I would like to work on developing this new rule-matching strategy for real, but the problem I have now is that I'm working off the current trunk, and have touched the same files (in particular, I've rewritten style contexts and the way rules are matched). If this patch goes in, I'll be in a world of hurt from a merge perspective.

Given that I'd like to continue work on this new system, I'd prefer it if this patch did not go into the tree. I don't think it moves in the right direction, and I don't think it addresses the fundamental performance problems that plague the style system.
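The contrast between eager and lazy resolution can be shown with a generic sketch. This is not the proof of concept mentioned in the comment, whose design is not described here. It only shows the general technique of computing a style struct the first time it is requested and caching the result, and all names in it are hypothetical.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <utility>

enum StyleStructID : std::size_t {
  eStyle_Font = 0, eStyle_Color, eStyle_Margin, eStyle_Count
};

// Placeholder for a resolved style struct.
struct StyleStruct { int value = 0; };

class LazyStyleContext {
 public:
  using Resolver = std::function<StyleStruct(StyleStructID)>;

  explicit LazyStyleContext(Resolver aResolve) : mResolve(std::move(aResolve)) {}

  // Resolve a struct only on first access; later calls reuse the cached copy.
  const StyleStruct& Get(StyleStructID id) {
    if (!mSlots[id]) {
      mSlots[id].reset(new StyleStruct(mResolve(id)));
      ++mResolved;
    }
    return *mSlots[id];
  }

  // Number of structs that have actually been computed so far.
  std::size_t ResolvedCount() const { return mResolved; }

 private:
  Resolver mResolve;
  std::array<std::unique_ptr<StyleStruct>, eStyle_Count> mSlots;
  std::size_t mResolved = 0;
};
```

An eager context would run the resolver for every struct at construction time. In this sketch a context that is only ever asked for its color pays for one resolution and one allocation.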

David Hyatt

•

24 years ago

Apparently the bloat went down 3 Mb on Linux and 4 Mb on other flavors of Unix. I'm stunned, I did not expect the numbers to be so high. People say the application still works. Let's wait for some more testing and performance metrics from QA.

David Hyatt

Comment 160

•

24 years ago

Here is the email that I sent to you, brendan, attinasi, and waterson. Maybe you don't recall it, but I sent it straight to you, and you even responded to it.

=======

I was thinking about the style system sharing stuff last night, and I came up with an idea on how to generalize techniques used in the outliner widget to make our style sharing code even faster.

What I would propose is that the style system have two levels of cache: the first line of defense would be a new cache that I'm going to outline momentarily, and the second line of defense would be the style sharing code that pierre and attinasi have done. This new cache complements the existing cache, and both are necessary for maximal footprint reduction and allocation reduction. I'll call the new cache the level 1 cache, and the existing cache, the level 2 cache.

Here's the basic idea for the level 1 cache. The style system maintains a state machine that treats a sorted list of rules as an input word. This state machine transitions on the rules in the input word until all have been processed. The resultant final state contains all of the style data for that input word.

On a miss, i.e., when no style data is found at the final state, you then check the level 2 cache. You have to allocate the style data and perform the CRC computation to find matching style data structs that can then be cached at the final state in the level 1 cache.

The first level cache will allow us to avoid even doing the allocations of temporary data that we then throw away, and we won't have to do a CRC computation (which gets expensive since there are many things to compare).

Thoughts? I would be very interested in translating the outliner state machine into the style system if you guys think it's a good idea. I have all the data structures for this already implemented and tested.

I think this works really well, because it complements the code you guys have already written. Level 1 (the new cache) makes sure the same set of rules gives you an immediate answer with no allocation. Level 2 (the existing cache) ensures that different sets of rules that map to similar style data end up being shared.

Dave (hyatt@netscape.com)

=====

I presented this idea to you on 4/4/2001, one month ago. Both you and attinasi responded. I describe the exact approach in this email of treating the sorted list of rules as an input word, fueling a state machine/lexicographic tree. To claim that I have been working on this in a vacuum without mentioning my ideas is completely ridiculous.
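The level 1 cache described in the email can be pictured as a lexicographic tree keyed on the sorted rule list. The sketch below is one possible reading of that description, not the outliner code it refers to: rule identities are reduced to integers, style data is a placeholder struct, and the class and method names are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <memory>
#include <vector>

using RuleID = int;            // stand-in for a pointer to a style rule
struct StyleData { int id; };  // placeholder for shared, resolved style data

// Each state has one transition per rule; the final state reached after
// consuming the whole "input word" holds the style data for that rule list.
class RuleWordCache {
  struct State {
    std::map<RuleID, std::unique_ptr<State>> next;
    const StyleData* data = nullptr;
  };
  State mRoot;
  std::size_t mStates = 1;

 public:
  // Follow the transitions; a missing transition or an empty final state
  // is a miss, and the caller falls back to the level 2 cache.
  const StyleData* Lookup(const std::vector<RuleID>& sortedRules) const {
    const State* s = &mRoot;
    for (RuleID r : sortedRules) {
      auto it = s->next.find(r);
      if (it == s->next.end()) return nullptr;
      s = it->second.get();
    }
    return s->data;
  }

  // After a miss has been resolved through level 2, remember the result
  // at the final state so the same rule list hits immediately next time.
  void Store(const std::vector<RuleID>& sortedRules, const StyleData* data) {
    State* s = &mRoot;
    for (RuleID r : sortedRules) {
      std::unique_ptr<State>& child = s->next[r];
      if (!child) {
        child.reset(new State());
        ++mStates;
      }
      s = child.get();
    }
    s->data = data;
  }

  std::size_t StateCount() const { return mStates; }
};
```

On a hit, no temporary style data is allocated and no CRC is computed, which is the saving the email attributes to level 1. Rule lists that share a prefix also share states, so the tree grows only with the distinct suffixes.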

David Hyatt

•

24 years ago

I fixed the nsTraceRefcnt but the tinderboxen still show a gain of 2Mb, sometimes 3Mb. I think all the structures are accounted for, if someone wants to double check. Otherwise it's not 4Mb but I'll still take it. An interesting test would be to get the bloat stats for StyleContext + StyleData sharing. Any volunteer? Enable SHARE_STYLECONTEXTS in nsIStyleSet.h line #47 and recompile Content, Layout and Editor.

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Comment 171

•

24 years ago

r=dbaron on attachment 33121, although technically you should have a MOZ_DECL_CTOR_COUNTER for each one as well (but it's currently a no-op and I don't think that's likely to change)

Pierre Saslawsky

Assignee

Updated

•

24 years ago

Blocks: 78961

Pierre Saslawsky

Assignee

Comment 172

•

24 years ago

I checked in the MOZ_COUNT_CTOR yesterday (my apologies for this crass ignorance) and as I wrote, the bloat seems to be 2 or 3Mb lower. Here is the data I collected from the various tinderboxen:

The "before" numbers come from before my checkin on 05/03 at 06:12.
The "after" numbers come from after my checkin on 05/03 at 19:18.

-------------------------------------------------------
T-BOX        OS      BEFORE     AFTER      GAIN
-------------------------------------------------------
coffee       linux   26 - 27    24 -(25)   -2 (+)
shrike       linux   29 -(30)   (27)- 28   -2 (-)
speedracer   sun     27 - 28    24 - 25    -3
bismark      sun     29         26 - 27    -2 (+)
cement       irix    31         29         -2
monkeypox    linux   29         26         -3
muerte       bsd     30         27         -3
nebiros      sun     29         26         -3
senna        linux   29         26         -3
-------------------------------------------------------

2Mb or 3Mb less bloat is enough of a step in the right direction. Closed as Fixed. The debate shall continue under bug 78961 and in the n.p.m.style newsgroup at news://news.mozilla.org/3AF314D7.4C896990%40netscape.com

I would like to thank all the 34 people copied on this bug for their interest and their participation, with special thanks to the QA folks for their dedication, jrgm for his kindness, Brendan for his wisdom, and Waterson for his volunteerism. If you want to continue to follow the debate, don't forget to add your email to the CC list under bug 78961.

Status: ASSIGNED → RESOLVED

Closed: 24 years ago

Resolution: --- → FIXED

Eugene Savitsky

Comment 173

•

24 years ago

I actually do not see any memory usage improvements... For me it is now worse than it was. I can very quickly get to 70Mb of memory used by Mozilla after the check-in. Can someone confirm? PS: On Win98.

Madhur Bhatia

Updated

•

22 years ago

Whiteboard: [whitebox]

list of modified files (Pierre Saslawsky, 24 years ago; 1.27 KB, text/plain)
diffs of the breakup of nsStyleSpacing (Pierre Saslawsky, 24 years ago; 175.62 KB, patch)
list of modified files (Pierre Saslawsky, 24 years ago; 1.02 KB, text/plain)
diffs of the breakup of nsStyleData + footprint reduction (Pierre Saslawsky, 24 years ago; 255.84 KB, patch)
nsStyleContext.cpp (Pierre Saslawsky, 24 years ago; 137.10 KB, text/plain)
nsStyleContextDebug.cpp (Pierre Saslawsky, 24 years ago; 25.98 KB, text/plain)
nsStyleContextDebug.h (Pierre Saslawsky, 24 years ago; 8.65 KB, text/plain)
nsStyleContext.cpp with fixes for BIDI and XUL (Pierre Saslawsky, 24 years ago; 137.20 KB, text/plain)
updated diffs, all in a single file (Pierre Saslawsky, 24 years ago; 252.90 KB, patch)
diffs with fixes for 2 regressions in BODY and TABLE (Pierre Saslawsky, 24 years ago; 257.70 KB, patch)
Performance chart from John Morrison: comparison Before/After changes (Pierre Saslawsky, 24 years ago; 15.74 KB, image/gif)
diffs with StyleData sharing only (Pierre Saslawsky, 24 years ago; 252.95 KB, patch)
compiliation errors on linux (Chris Waterson, 24 years ago; 11.40 KB, text/plain)
diffs for nsCSSStyleRule.cpp (Pierre Saslawsky, 24 years ago; 6.31 KB, patch)
diffs after code review (Pierre Saslawsky, 24 years ago; 281.25 KB, patch)
use C++ template to declare helper classes (Chris Waterson, 24 years ago; 11.96 KB, patch)
win2k; "first visit" loads (John Morrison, 24 years ago; 15.10 KB, image/gif)
win2k (500mhz/128mb); 'already cached' loads (John Morrison, 24 years ago; 14.43 KB, image/gif)
linux (266MHz/128MB); 'already cached' loads (John Morrison, 24 years ago; 15.10 KB, image/gif)
partial diffs that use globals instead of the stack for Read/Write (Pierre Saslawsky, 24 years ago; 207.66 KB, patch)
patch for TraceRefcnt'ing of blobs (Pierre Saslawsky, 24 years ago; 5.93 KB, patch)