Closed Bug 1367635 Opened 7 years ago Closed 7 years ago

stylo: share reset structs during the style traversal

Categories

(Core :: CSS Parsing and Computation, defect, P4)

Tracking

RESOLVED FIXED
Tracking Status
firefox57 --- fixed

People

(Reporter: n.nethercote, Assigned: heycam)

References

(Blocks 2 open bugs)

Details

Attachments

(13 files, 4 obsolete files)

28.21 KB, patch
Details | Diff | Splinter Review
179.74 KB, patch
Details | Diff | Splinter Review
59 bytes, text/x-review-board-request (11 review requests: 1 with heycam: review+, 10 with emilio: review+)
Details
I did a DMD run on the Obama Wikipedia page, which included these measurements:

> Unreported {
>   7,836 blocks in heap block record 2 of 3,899
>   2,507,520 bytes (2,507,520 requested / 0 slop)
>   Individual block sizes: 320 x 7,836
>   3.18% of the heap (19.67% cumulative)
>   7.02% of unreported (43.38% cumulative)
>   Allocated at {
>     #01: replace_malloc (/home/njn/moz/mi2/memory/replace/dmd/DMD.cpp:1278 (discriminator 2))
>     #02: alloc::heap::exchange_malloc (/checkout/src/liballoc/heap.rs:138)
>     #03: alloc::boxed::{{impl}}::new<style::stylearc::ArcInner<style::gecko_properties::ComputedValues>> (/checkout/src/liballoc/boxed.rs:238)
>     #04: style::matching::PrivateMatchMethods::cascade_with_rules<style::gecko::wrapper::GeckoElement> (/home/njn/moz/mi2/servo/components/style/matching.rs:225)
>     #05: style::matching::PrivateMatchMethods::cascade_internal<style::gecko::wrapper::GeckoElement> (/home/njn/moz/mi2/servo/components/style/matching.rs:274)
>     #06: style::matching::PrivateMatchMethods::cascade_primary<style::gecko::wrapper::GeckoElement> (/home/njn/moz/mi2/servo/components/style/matching.rs:294)
>     #07: style::matching::MatchMethods::match_and_cascade<style::gecko::wrapper::GeckoElement> (/home/njn/moz/mi2/servo/components/style/matching.rs:621)
>     #08: style::traversal::compute_style<style::gecko::wrapper::GeckoElement,style::gecko::traversal::RecalcStyleOnly> (/home/njn/moz/mi2/servo/components/style/traversal.rs:766)
>     #09: style::traversal::recalc_style_at<style::gecko::wrapper::GeckoElement,style::gecko::traversal::RecalcStyleOnly> (/home/njn/moz/mi2/servo/components/style/traversal.rs:629)
>     #10: style::gecko::traversal::{{impl}}::process_preorder (/home/njn/moz/mi2/servo/components/style/gecko/traversal.rs:47)
>   }
> }
>
> Unreported {
>   4,402 blocks in heap block record 3 of 3,899
>   1,408,640 bytes (1,408,640 requested / 0 slop)
>   Individual block sizes: 320 x 4,402
>   1.79% of the heap (21.46% cumulative)
>   3.94% of unreported (47.32% cumulative)
>   Allocated at {
>     #01: replace_malloc (/home/njn/moz/mi2/memory/replace/dmd/DMD.cpp:1278 (discriminator 2))
>     #02: alloc::heap::exchange_malloc (/checkout/src/liballoc/heap.rs:138)
>     #03: alloc::boxed::{{impl}}::new<style::stylearc::ArcInner<style::gecko_properties::ComputedValues>> (/checkout/src/liballoc/boxed.rs:238)
>     #04: geckoservo::glue::Servo_ComputedValues_Inherit (/home/njn/moz/mi2/servo/ports/geckolib/glue.rs:1256)
>     #05: mozilla::ServoStyleSet::ResolveStyleForText(nsIContent*, nsStyleContext*) (/home/njn/moz/mi2/layout/style/ServoStyleSet.cpp:380)
>     #06: mozilla::ServoRestyleManager::TextPostTraversalState::ComputeStyle(nsIContent*) (/home/njn/moz/mi2/layout/base/ServoRestyleManager.cpp:179 (discriminator 2))
>     #07: mozilla::ServoRestyleManager::ProcessPostTraversalForText(nsIContent*, nsStyleChangeList&, mozilla::ServoRestyleManager::TextPostTraversalState&) (/home/njn/moz/mi2/layout/base/ServoRestyleManager.cpp:458 (discriminator 1))
>     #08: mozilla::ServoRestyleManager::ProcessPostTraversal(mozilla::dom::Element*, nsStyleContext*, mozilla::ServoStyleSet*, nsStyleChangeList&) (/home/njn/moz/mi2/layout/base/ServoRestyleManager.cpp:433)
>     #09: mozilla::ServoRestyleManager::ProcessPostTraversal(mozilla::dom::Element*, nsStyleContext*, mozilla::ServoStyleSet*, nsStyleChangeList&) (/home/njn/moz/mi2/layout/base/ServoRestyleManager.cpp:430 (discriminator 1))
>     #10: mozilla::ServoRestyleManager::ProcessPostTraversal(mozilla::dom::Element*, nsStyleContext*, mozilla::ServoStyleSet*, nsStyleChangeList&) (/home/njn/moz/mi2/layout/base/ServoRestyleManager.cpp:430 (discriminator 1))
>   }
> }

That's about 4 MiB of Arc<ComputedValues>. The above output shows that each Arc<ComputedValues> allocation is 320 bytes. This includes:

- 23 Arc<> fields, each 8 bytes, giving 184 bytes

- The cached_system_font field, which is Option<ComputedSystemFont>.  ComputedSystemFont has 16 fields, all with type Au(?), so that's another 64 (for the Au fields) + 8 (for the Option tag) bytes.

- The other four fields look like they're about another 8 words, so about 64 bytes.

All that adds up to 320, roughly. That's a lot more than nsStyleContext, which is the equivalent Gecko type.
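For reference, here is the rough arithmetic behind that 320-byte figure, as a minimal sketch assuming 64-bit pointers and 4-byte Au values (the exact field layout above is a guess):

fn main() {
    // 23 Arc<_> fields, one machine word each on 64-bit.
    let arc_fields = 23 * 8;
    // cached_system_font: Option<ComputedSystemFont>, taken as 16 Au (i32)
    // fields plus a word for the Option discriminant/padding.
    let cached_system_font = 16 * 4 + 8;
    // The remaining handful of fields, roughly another 8 words.
    let other_fields = 8 * 8;
    assert_eq!(arc_fields + cached_system_font + other_fields, 320);
}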
Flags: needinfo?(manishearth)
https://github.com/servo/servo/pull/17030 incidentally fixes the ComputedSystemFont stuff. Emilio moved it to the context and we don't use it anyway.

We're probably not sharing Arcs as much as we could. Unsure.

How large is nsStyleContext?
Flags: needinfo?(manishearth)
nsStyleContext itself is about 20 words.  If you have any reset structs that are not cacheable in the rule tree, you add another 15 words when you allocate the mCachedResetData.  But I'm pretty sure that in most cases mCachedResetData is null.
So, the lion's share of the issue here is, as bz points out in comment 2, the fact that we don't cache reset structs in the rule tree. We could consider doing this, which would improve perf and memory to some degree at the expense of complexity. But note that we only save on memory to the extent that we have a lot of RuleNode sharing.

Marking this as something that doesn't block us from shipping, but as a source of memory or perf improvements if we're scrounging.
Priority: -- → P4
Summary: stylo: ComputedValues is too big → stylo: consider caching reset structs in the rule tree and shrinking ServoComputedValues
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #4)
> So, the lion's share of the issue here is, as bz points out in comment 2,
> the fact that we don't cache reset structs in the rule tree. We could
> consider doing this, which would improve perf and memory to some degree at
> the expense of complexity. But note that we only save on memory to the
> extent that we have a lot of RuleNode sharing.

Note also that this would force us to recreate the rule tree on every resize of any page with viewport units, and probably in a lot of other situations where we don't need to right now.
See bug 1367854 comment 19, which suggests (but does not yet prove) that we may need to do this.
Bobby and I discussed an approach to this after the call today.

Gecko makes the decision to cache data in the rule tree per struct.  For Stylo, we could do this, but making the decision for all reset structs at once (i.e. cache all or none) will be easier.

So we could do this in two stages.

The first stage would be to support caching of reset data with no conditions (i.e., no dependency on font-size, writing mode, etc.).  Instead of having a separate struct/allocation for the reset structs that the rule node can point to (allocation during the cascade will hurt, and we'd need to addref all structs separately when caching), or storing the 15 struct pointers inline in the rule node (bloating rule nodes that can't cache any data, and still needing to addref all structs separately when caching), we can have a single pointer to a ComputedValues object that has the reset data we are considering caching.  This can just be the same ComputedValues object that we just computed and decided has reset data that is safe to cache in the rule node.  This avoids the allocation and the 15 refcount bumps when caching.  When we decide we have reset data that is safe to cache, we just CAS it into the field on the rule node.
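To make the first stage concrete, here is a minimal sketch of the CAS publication step, with simplified stand-in types; the field and method names are hypothetical rather than the actual Servo API, and the refcount/cycle bookkeeping discussed in the next paragraph is left out.

use std::ptr;
use std::sync::Arc;
use std::sync::atomic::{AtomicPtr, Ordering};

struct ComputedValues; // stand-in for the real Servo type

struct RuleNode {
    // Hypothetical field: set at most once, pointing at a ComputedValues
    // whose reset structs are safe to cache with no conditions.
    cached_reset_style: AtomicPtr<ComputedValues>,
}

impl RuleNode {
    // Try to publish `style` as this node's cached reset data.  The winning
    // thread donates its reference; losers just drop theirs and keep going.
    fn try_cache_reset_style(&self, style: Arc<ComputedValues>) {
        let new = Arc::into_raw(style) as *mut ComputedValues;
        let result = self.cached_reset_style.compare_exchange(
            ptr::null_mut(), new, Ordering::AcqRel, Ordering::Acquire);
        if result.is_err() {
            // Someone else cached a value first; reclaim our reference.
            unsafe { drop(Arc::from_raw(new)) };
        }
    }

    fn cached_style(&self) -> Option<&ComputedValues> {
        let ptr = self.cached_reset_style.load(Ordering::Acquire);
        if ptr.is_null() { None } else { Some(unsafe { &*ptr }) }
    }
}

fn main() {
    let node = RuleNode { cached_reset_style: AtomicPtr::new(ptr::null_mut()) };
    node.try_cache_reset_style(Arc::new(ComputedValues));
    assert!(node.cached_style().is_some());
}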

One tricky thing is that if we hold a strong reference to the ComputedValues, then this causes a cycle.  But we could probably make the rule tree GC aware of this, and still add rule nodes to the free list if they only have a single refcount and we know that it is due to the cached ComputedValues' pointer back to the rule node itself.

The second stage would be to look at caching with conditions.  Like Gecko, we could have a linked list or some other structure to store the different variations of cached data.  In many cases I think we would still only have a single cached ComputedValues (perhaps with conditions).  So we could still probably avoid additional allocations until we have more than one cached on the rule node.  The field on the rule node would become a tagged pointer.  The conditions for the "only one cached ComputedValues" case should probably be a separate field on the rule node then too.  Having multiple cached computed values does make the rule tree GC more complicated, though, as it's not just a single possible strong reference that we need to ignore.

We could have a separate Arc-like object that supports two separate refcounts.  But maybe the rule tree free node tracking hack is good enough.
Emilio, given your concerns in bug 1367854 comment 21 and onward, what do you think about the proposed scheme in comment 7?
Flags: needinfo?(emilio+bugs)
Summary: stylo: consider caching reset structs in the rule tree and shrinking ServoComputedValues → stylo: consider caching style structs in the rule tree
Assignee: nobody → cam
I've talked with Cameron a bit (need to run now): http://logs.glob.uno/?c=mozilla%23servo&s=8+Aug+2017&e=8+Aug+2017#c727021

Basically, I think a separate cache would be a better option. If the rule node cache were optimally implemented, the out-of-band sharing potentially gets us a bit less sharing, as discussed in the conversation linked above, but I think it's worth it because we can get both more sharing in the shorter term and fewer headaches in general.

Advantages:

 * More self-contained.
 * No need to care about rebuilding / clearing cached data from the rule tree on viewport resizes / root font size changes / etc.
 * No need to track cyclic references on the rule tree, which is simpler and seems harder to mess up.
 * We can implement restrictions (caching stuff depending on font-size / writing-mode / etc.) trivially, which would allow way more sharing short term, and also experimenting with other improvements if we can think of them.

As mentioned, it has some potential drawbacks:

 * You share per-thread, not globally, so potentially slightly less sharing. I don't think it's really relevant, and with the other approach you can end up doing wasted work anyway when racing to insert cached data on the rule tree.

 * No sharing across traversals. Seems hard to tell how much it matters in practice, but I suspect not much given we restyle less than Gecko in response to dynamic changes.

The source for Blink disabling this sharing (and style sharing in general) for some elements matching hover / active / etc. is [1], fwiw.

[1]: https://cs.chromium.org/chromium/src/third_party/WebKit/Source/core/css/SelectorChecker.cpp?l=924&rcl=242e2067b136691816302d52dd63ac1614f9d843
Flags: needinfo?(emilio+bugs)
One disadvantage I forgot to mention of the in-TLS-cache approach is that it means we can't avoid the extra allocation when adding an entry to the cache.

(In reply to Emilio Cobos Álvarez [:emilio] from comment #9)
> Advantages:
> 
> [...]

All these simplicity-related advantages are attractive.

> As mentioned, it has some potential drawbacks:
> 
>  * You share per-thread, not globally, so potentially slightly less sharing.
> I don't think it's really relevant, and you can end up doing wasted work
> when trying to race to insert cached data on the rule tree with the other
> approach too anyway.

I guess that's a question of how much sharing we lose due to contention, racing to insert cached data, versus uncontended lookups that would've found cached data but don't because they're not on the same thread.

>  * No sharing across traversals. Seems hard to tell how much it matters in
> practice, but I suspect not much given we restyle less than Gecko in
> response to dynamic changes.

We can probably measure this in Gecko without too much trouble.

So I'm not too sure about which approach to try.  Maybe I'll do that measurement first.
(In reply to Cameron McCormack (:heycam) from comment #10)
> One disadvantage I forgot to mention of the in-TLS-cache approach is that it
> means we can't avoid the extra allocation when adding an entry to the cache.

Which extra allocation do you mean? If it's in a HashMap... Sure, I guess. But if it's an LRUCache like we use for a few other things, we can definitely make that an ArrayVec/SmallVec to avoid allocations in most cases.

> (In reply to Emilio Cobos Álvarez [:emilio] from comment #9)
> > Advantages:
> > 
> > [...]
> 
> All these simplicity-related advantages are attractive.

I personally think that they outweigh the drawbacks by a fair amount, but... :P
(In reply to Emilio Cobos Álvarez [:emilio] from comment #11)
> Which extra allocation do you mean? If it's in a HashMap... Sure, I guess.
> But if it's a LRUCache like we do for a few other things, we definitely can
> make that an ArrayVec/SmallVec to avoid allocations in most cases.

Yeah, I was thinking it would be a HashMap.  I'm not sure how much caching ability we'd lose by using an LRUCache instead.
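For comparison, here is a minimal sketch of the kind of thread-local cache being discussed, using a fixed-size inline buffer with round-robin eviction as a stand-in for the LRUCache/ArrayVec mentioned above; all type, field, and key names are hypothetical.

use std::cell::RefCell;
use std::sync::Arc;

const CACHE_SIZE: usize = 32;

// Stand-in for the shared reset structs an entry would hold.
struct CachedResets;

// What an entry is keyed on: the rule node plus the inputs that conditional
// caching depends on (parent font-size and writing mode).
#[derive(Clone, Copy, PartialEq)]
struct Key {
    rule_node_addr: usize,
    parent_font_size: i32, // app units
    writing_mode: u8,
}

struct ResetStructCache {
    // Fixed inline storage, so lookups and insertions never allocate.
    entries: [Option<(Key, Arc<CachedResets>)>; CACHE_SIZE],
    next: usize, // round-robin eviction; a real LRU would reorder on hit
}

impl ResetStructCache {
    fn new() -> Self {
        ResetStructCache { entries: std::array::from_fn(|_| None), next: 0 }
    }

    fn lookup(&self, key: &Key) -> Option<Arc<CachedResets>> {
        self.entries
            .iter()
            .flatten()
            .find(|(k, _)| k == key)
            .map(|(_, v)| v.clone())
    }

    fn insert(&mut self, key: Key, value: Arc<CachedResets>) {
        self.entries[self.next] = Some((key, value));
        self.next = (self.next + 1) % CACHE_SIZE;
    }
}

thread_local! {
    // One cache per style worker thread; nothing is shared across threads.
    static RESET_CACHE: RefCell<ResetStructCache> = RefCell::new(ResetStructCache::new());
}

fn main() {
    let key = Key { rule_node_addr: 0x1000, parent_font_size: 16 * 60, writing_mode: 0 };
    RESET_CACHE.with(|cache| {
        let mut cache = cache.borrow_mut();
        if cache.lookup(&key).is_none() {
            cache.insert(key, Arc::new(CachedResets));
        }
        assert!(cache.lookup(&key).is_some());
    });
}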
I spent some time pondering this, and came to the conclusion that we should cache in the rule tree.

Our current memory regression relative to Gecko is probably the biggest question mark hanging over what we've built. Any areas where we don't achieve Gecko parity will require us to do a lot of extra work to justify the tradeoff, and I want to minimize that.

So memory is important, and I'm wary of under-powering the optimization that we're currently hoping will be the silver bullet that closes the gap. There are two reasons why I'm concerned that a TLS cache would under-power us:
(1) Not sharing across threads. We've measured a significant difference in ComputedValues sharing between sequential and parallel, and that gap is measurable on AWSY. It might be that the impact of doing ComputedValues sharing per-thread is worse because of the whole transitive cousin sharing thing, but it's hard to be very certain about that.
(2) Not sharing across traversals. We can't quite do something similar to bug 1370604, because we won't be traversing the entire tree and the location of reset structs is mostly independent of tree structure. Maybe the particular subtree will be enough, or maybe not.

So I can't say with certainty that the TLS cache won't be good enough. But we have reasons why it might not be, and our "competition" (i.e. Gecko) is using precise sharing. We don't have time to do this twice, so I find it hard to justify not using precise sharing in stylo if we have the opportunity to.

My take on the downsides:
* Complexity/self-contained-ness: Definitely some of this, though adding another caching mechanism isn't free of complexity either, and we already have the rule node when we're doing the cascade. Checking whether the cache field is non-null is easy, and CASing the resulting ComputedValues into the cache field is easy too. The refcounting complexity is just adding an is_zero_refcount accessor that treats one as zero if the cache field is non-null (see the sketch after this list). I do agree that multiple cache entries will make this trickier though.
* Clearing stale cache data out of the rule tree: This is certainly annoying and unfortunate, but only because we're trying to share across traversals, which the other solution doesn't give us. If we didn't care about that, we could solve this by just tagging the cache with the restyle generation.
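A minimal sketch of that is_zero_refcount idea, with hypothetical field names standing in for the real rule tree types:

use std::sync::atomic::{AtomicPtr, AtomicUsize, Ordering};

struct ComputedValues; // stand-in

struct RuleNode {
    refcount: AtomicUsize,
    cached_reset_style: AtomicPtr<ComputedValues>, // hypothetical cache field
}

impl RuleNode {
    // The GC would treat a node as unreferenced if the only remaining strong
    // reference is the one its own cached ComputedValues holds back to it.
    fn is_effectively_unreferenced(&self) -> bool {
        let refs = self.refcount.load(Ordering::Acquire);
        let has_cached_self_ref = !self.cached_reset_style.load(Ordering::Acquire).is_null();
        refs == 0 || (refs == 1 && has_cached_self_ref)
    }
}

fn main() {
    let node = RuleNode {
        refcount: AtomicUsize::new(0),
        cached_reset_style: AtomicPtr::new(std::ptr::null_mut()),
    };
    assert!(node.is_effectively_unreferenced());
}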
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #13)
> I spent some time pondering this, and came to the conclusion that we should
> cache in the rule tree.
> 
> Our current memory regression relative to Gecko is probably the biggest
> question mark hanging over what we've built. Any areas where we don't
> achieve Gecko parity will require us to do a lot of extra work to justify
> the tradeoff, and I want to minimize that.
> 
> So memory is important, and I'm wary of under-powering the optimization that
> we're currently hoping will be the silver bullet that closes the gap. There
> are two reason why I'm concerned that a TLS cache would under-power us:
> (1) Not sharing across threads. We've measured a significant difference in
> ComputedValues sharing between sequential and parallel, and that gap is
> measurable on AWSY. It might be that the impact of doing ComputedValues
> sharing per-thread is worse because of the whole transitive cousin sharing
> thing, but it's hard to be very certain about that.
> (2) Not sharing across traversals. We can't quite do something similar to be
> 1370604, because we won't be traversing the entire tree and the location of
> reset structs is mostly independent of tree structure. Maybe the particular
> subtree will be enough, or maybe not.

However, I think there's a point that you haven't considered enough (maybe you have, but you haven't said so in your comment), and it's that we don't have a great plan to cache with a given set of restrictions in the rule tree. This may mean that we get better sharing with a thread-local cache that supports keying on the parent font-size and writing-mode than with a rule-node cache that doesn't.

This last bit depends on the content, of course, but font-size-relative units aren't rare at all.
(In reply to Emilio Cobos Álvarez [:emilio] from comment #14)
> (In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #13)
> > I spent some time pondering this, and came to the conclusion that we should
> > cache in the rule tree.
> > 
> > Our current memory regression relative to Gecko is probably the biggest
> > question mark hanging over what we've built. Any areas where we don't
> > achieve Gecko parity will require us to do a lot of extra work to justify
> > the tradeoff, and I want to minimize that.
> > 
> > So memory is important, and I'm wary of under-powering the optimization that
> > we're currently hoping will be the silver bullet that closes the gap. There
> > are two reason why I'm concerned that a TLS cache would under-power us:
> > (1) Not sharing across threads. We've measured a significant difference in
> > ComputedValues sharing between sequential and parallel, and that gap is
> > measurable on AWSY. It might be that the impact of doing ComputedValues
> > sharing per-thread is worse because of the whole transitive cousin sharing
> > thing, but it's hard to be very certain about that.
> > (2) Not sharing across traversals. We can't quite do something similar to be
> > 1370604, because we won't be traversing the entire tree and the location of
> > reset structs is mostly independent of tree structure. Maybe the particular
> > subtree will be enough, or maybe not.
> 
> However, I think there's a point that you haven't considered enough (maybe
> you have, but you haven't said that in your comment), and it's that we don't
> have a great plan to cache with a given set of restrictions in the rule
> tree. This may mean that we probably get better sharing with a thread-local
> cache that supports keying on the parent font-size and writing-mode, than
> with a rule-node cache that doesn't.

I think this is heycam's "second stage" in comment 7. It seems to me that the usual singly-linked list allocate+CAS would work fine here. It would be some complexity, but again I'm willing to pay complexity here for more-precise sharing.
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #15)
> > However, I think there's a point that you haven't considered enough (maybe
> > you have, but you haven't said that in your comment), and it's that we don't
> > have a great plan to cache with a given set of restrictions in the rule
> > tree. This may mean that we probably get better sharing with a thread-local
> > cache that supports keying on the parent font-size and writing-mode, than
> > with a rule-node cache that doesn't.
> 
> I think this is heycam's "second stage" in comment 7. It seems to me that
> the usual singly-linked list allocate+CAS would work fine here. It would be
> some complexity, but again I'm willing to pay complexity here for
> more-precise sharing.

Also, once we measure how much we need the conditional sharing and in what situations, we _could_ consider a hybrid approach where the conditional sharing lived in TLS, and the unconditional sharing lived in the rule tree. It would all depend on the measurements, which are easier to do when unconditional sharing is done.
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #15)
> I think this is heycam's "second stage" in comment 7. It seems to me that
> the usual singly-linked list allocate+CAS would work fine here. It would be
> some complexity, but again I'm willing to pay complexity here for
> more-precise sharing.

Sure, but I'm not sure how easy this is at all...

To be honest I'm not sure how exactly the parallel aggregation of references, depending on whether we have cached data or not, is going to work... Seems like there are a fair amount of situations to handle, and a lot of things that could go wrong, race, etc... (leaking being the least bad of the possible "bad" outcomes).

If you suddenly also need to track how many cached entries you have (which would be the "second stage"), it becomes considerably more complicated...

But even supposing we have a clear design and algorithm for that, getting it exactly right isn't trivial, and even less if we're talking about getting it exactly right for 57, with all the other stuff we need to do.

I'm just saying that getting that "second stage" working before 57 seems fairly complicated to me (happy to be proven wrong!), and that the thread-local cache is likely to give better sharing (and more consistent sharing across pages) than only the "first stage" of it. Plus all the other advantages.

But anyway. Another thing I thought we need to check is whether our root font-size setup (storing it on the device, etc.) works fine with the rule tree cache... At first I thought it wouldn't in the cross-doc getComputedStyle case (because we pull it from the global state instead of from the parent style context chain), but after thinking a bit more I don't think the rule tree cache would impact anything in that sense... So I think we're fine in that regard.
(In reply to Emilio Cobos Álvarez [:emilio] from comment #17)
> (In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #15)
> > I think this is heycam's "second stage" in comment 7. It seems to me that
> > the usual singly-linked list allocate+CAS would work fine here. It would be
> > some complexity, but again I'm willing to pay complexity here for
> > more-precise sharing.
> 
> Sure, but I'm not sure how easy is this at all...
> 
> To be honest I'm not sure how exactly the parallel aggregation of references
> depending on whether we have cached data or not going to work... Seems like
> there are a fair amount of situations to handle, and a lot of things that
> could go wrong, race, etc... (leaking being the least bad of the possible
> "bad" outcomes).
> 
> If you suddenly also need to track how many cached entries do you have
> (which would be the "second stage"), it becomes fairly more complicated...
> 
> But even supposing we have a clear design and algorithm for that, getting it
> exactly right isn't trivial, and even less if we're talking about getting it
> exactly right for 57, with all the other stuff we need to do.
> 
> I'm just saying that getting that "second stage" working before 57 seems
> fairly complicated to me (happy to be proven wrong!), and that the
> thread-local cache is likely to give better sharing (and more regular across
> pages) than only the "first stage" of it. Plus all the other advantages.

I still have a different gut feeling about the complexity level.

But how about this: we do precise sharing for the non-conditional case, and then measure how much we would gain by the conditional sharing, and try to estimate whether TLS sharing would be acceptable for that case. If it is, we could consider the hybrid approach in comment 16.

> 
> But anyway. Other thing I though we need to check is whether our root
> font-size setup (storing it on the device, etc.) works fine with the rule
> tree cache... At first I thought it wouldn't in the cross-doc
> getComputedStyle case (because we pull it from the global state instead of
> from the parent style context chain), but after thinking a bit more I don't
> think the rule tree cache would impact anything in that sense... So I think
> we're fine in that regard.

Great!
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #18)
> I still have a different gut feeling about the complexity level.
> 
> But how about this: we do precise sharing for the non-conditional case, and
> then measure how much we would gain by the conditional sharing, and try to
> estimate whether TLS sharing would be acceptable for that case. If it is, we
> could consider the hybrid approach in comment 16.

Maintaining two different cache systems doesn't make me super-happy, and I think it'd be better to measure first...

But if you really really want to do the rule tree cache, I guess that's an option.
I have a preliminary patch for unconditional caching.  On the full page HTML spec, I get style struct memory usage like the following.  Without the patch:

│  │  │  │  ├───31.08 MB (09.32%) -- servo-style-structs
│  │  │  │  │   ├──10.82 MB (03.25%) ── Display
│  │  │  │  │   ├───8.14 MB (02.44%) -- (14 tiny)
│  │  │  │  │   │   ├──2.79 MB (00.84%) ── Position
│  │  │  │  │   │   ├──2.01 MB (00.60%) ── TextReset
│  │  │  │  │   │   ├──1.24 MB (00.37%) ── Border
│  │  │  │  │   │   ├──0.44 MB (00.13%) ── Background
│  │  │  │  │   │   ├──0.37 MB (00.11%) ── UserInterface
│  │  │  │  │   │   ├──0.37 MB (00.11%) ── Padding
│  │  │  │  │   │   ├──0.32 MB (00.10%) ── Margin
│  │  │  │  │   │   ├──0.27 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.11 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.09 MB (00.03%) ── Content
│  │  │  │  │   │   ├──0.07 MB (00.02%) ── UIReset
│  │  │  │  │   │   ├──0.05 MB (00.01%) ── List
│  │  │  │  │   │   ├──0.02 MB (00.01%) ── sundries
│  │  │  │  │   │   └──0.01 MB (00.00%) ── Column
│  │  │  │  │   ├───6.52 MB (01.95%) ── Text
│  │  │  │  │   └───5.59 MB (01.68%) ── Font

With the patch:

│  │  │  │  ├───24.43 MB (07.51%) -- servo-style-structs
│  │  │  │  │   ├───6.79 MB (02.09%) -- (12 tiny)
│  │  │  │  │   │   ├──2.53 MB (00.78%) ── Position
│  │  │  │  │   │   ├──1.19 MB (00.37%) ── Border
│  │  │  │  │   │   ├──1.08 MB (00.33%) ── TextReset
│  │  │  │  │   │   ├──0.51 MB (00.16%) ── Background
│  │  │  │  │   │   ├──0.37 MB (00.11%) ── UserInterface
│  │  │  │  │   │   ├──0.31 MB (00.10%) ── Padding
│  │  │  │  │   │   ├──0.29 MB (00.09%) ── Margin
│  │  │  │  │   │   ├──0.27 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.11 MB (00.04%) ── Effects
│  │  │  │  │   │   ├──0.08 MB (00.02%) ── Content
│  │  │  │  │   │   ├──0.05 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   ├───6.58 MB (02.02%) ── Text
│  │  │  │  │   ├───5.68 MB (01.75%) ── Font
│  │  │  │  │   └───5.37 MB (01.65%) ── Display
I still think caching them is a bad idea...

How are we going to handle style attribute changes and such, given we don't create new rule nodes for them?

Should we start doing that? That'd be unfortunate I think...

I think it'd be nice not having to throw away that optimization, at least for animation rule nodes and such (though note that we currently do create different rule nodes because we create different declaration blocks).
Flags: needinfo?(cam)
(In reply to Emilio Cobos Álvarez [:emilio] from comment #21)
> How are we going to do for style attribute changes and such, given we don't
> create new rule nodes for them?

We could disable caching in style attribute rule nodes or their descendants.
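A sketch of that bail-out check, assuming a hypothetical per-rule source tag; the real code would presumably consult the rule node's cascade level or style source instead:

// Hypothetical source tag per entry on the rule chain.
#[derive(PartialEq)]
enum Source {
    UserAgentRule,
    AuthorRule,
    StyleAttribute,
}

struct RuleChainNode<'a> {
    source: Source,
    parent: Option<&'a RuleChainNode<'a>>,
}

// Reset structs would only be cached if no rule on the chain came from a
// style attribute, since those rule nodes are reused when the attribute
// changes and the cached data would go stale.
fn may_cache_reset_structs<'a>(mut node: &'a RuleChainNode<'a>) -> bool {
    loop {
        if node.source == Source::StyleAttribute {
            return false;
        }
        match node.parent {
            Some(parent) => node = parent,
            None => return true,
        }
    }
}

fn main() {
    let root = RuleChainNode { source: Source::UserAgentRule, parent: None };
    let author = RuleChainNode { source: Source::AuthorRule, parent: Some(&root) };
    let style_attr = RuleChainNode { source: Source::StyleAttribute, parent: Some(&author) };
    assert!(may_cache_reset_structs(&author));
    assert!(!may_cache_reset_structs(&style_attr));
}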
Flags: needinfo?(cam)
I just did a quick prototype of the TLS cache, since I really think it's the right tradeoff now, and it should give better caching than the unconditional rule node cache approach. The results seem to be better, even in the parallel case on an 8-core machine. It took me a few hours to prototype it.

Patch:

│  │  │  │  ├───21.74 MB (06.70%) -- servo-style-structs
│  │  │  │  │   ├───6.97 MB (02.15%) ── Text
│  │  │  │  │   ├───6.04 MB (01.86%) ── Display
│  │  │  │  │   ├───5.95 MB (01.83%) ── Font
│  │  │  │  │   └───2.78 MB (00.86%) -- (14 tiny)
│  │  │  │  │       ├──0.71 MB (00.22%) ── Position
│  │  │  │  │       ├──0.55 MB (00.17%) ── TextReset
│  │  │  │  │       ├──0.39 MB (00.12%) ── Border
│  │  │  │  │       ├──0.38 MB (00.12%) ── UserInterface
│  │  │  │  │       ├──0.28 MB (00.09%) ── Color
│  │  │  │  │       ├──0.11 MB (00.03%) ── Background
│  │  │  │  │       ├──0.09 MB (00.03%) ── Margin
│  │  │  │  │       ├──0.08 MB (00.03%) ── Padding
│  │  │  │  │       ├──0.06 MB (00.02%) ── UIReset
│  │  │  │  │       ├──0.05 MB (00.02%) ── Effects
│  │  │  │  │       ├──0.05 MB (00.01%) ── List
│  │  │  │  │       ├──0.02 MB (00.01%) ── sundries
│  │  │  │  │       ├──0.01 MB (00.00%) ── Content
│  │  │  │  │       └──0.01 MB (00.00%) ── Column

Patch + STYLO_THREADS=1:

│  │  │  │  ├───13.46 MB (04.61%) -- servo-style-structs
│  │  │  │  │   ├───4.13 MB (01.42%) ── Display
│  │  │  │  │   ├───3.96 MB (01.36%) ── Text
│  │  │  │  │   ├───3.78 MB (01.30%) ── Font
│  │  │  │  │   └───1.60 MB (00.55%) -- (12 tiny)
│  │  │  │  │       ├──0.43 MB (00.15%) ── Position
│  │  │  │  │       ├──0.27 MB (00.09%) ── TextReset
│  │  │  │  │       ├──0.26 MB (00.09%) ── Border
│  │  │  │  │       ├──0.26 MB (00.09%) ── UserInterface
│  │  │  │  │       ├──0.14 MB (00.05%) ── Color
│  │  │  │  │       ├──0.05 MB (00.02%) ── Margin
│  │  │  │  │       ├──0.05 MB (00.02%) ── Padding
│  │  │  │  │       ├──0.04 MB (00.01%) ── Effects
│  │  │  │  │       ├──0.03 MB (00.01%) ── Background
│  │  │  │  │       ├──0.03 MB (00.01%) ── UIReset
│  │  │  │  │       ├──0.03 MB (00.01%) ── List
│  │  │  │  │       └──0.02 MB (00.01%) ── sundries

No patch:

│  │  │  │  ├───31.39 MB (09.19%) -- servo-style-structs
│  │  │  │  │   ├──11.04 MB (03.23%) ── Display
│  │  │  │  │   ├───8.28 MB (02.43%) -- (14 tiny)
│  │  │  │  │   │   ├──2.88 MB (00.85%) ── Position
│  │  │  │  │   │   ├──2.01 MB (00.59%) ── TextReset
│  │  │  │  │   │   ├──1.27 MB (00.37%) ── Border
│  │  │  │  │   │   ├──0.46 MB (00.13%) ── Background
│  │  │  │  │   │   ├──0.38 MB (00.11%) ── Padding
│  │  │  │  │   │   ├──0.34 MB (00.10%) ── UserInterface
│  │  │  │  │   │   ├──0.32 MB (00.09%) ── Margin
│  │  │  │  │   │   ├──0.26 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.11 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.11 MB (00.03%) ── Content
│  │  │  │  │   │   ├──0.07 MB (00.02%) ── UIReset
│  │  │  │  │   │   ├──0.05 MB (00.01%) ── List
│  │  │  │  │   │   ├──0.02 MB (00.01%) ── sundries
│  │  │  │  │   │   └──0.01 MB (00.00%) ── Column
│  │  │  │  │   ├───6.52 MB (01.91%) ── Text
│  │  │  │  │   └───5.55 MB (01.62%) ── Font

I don't know what Cameron's patch looks like; maybe I'm worrying about complexity that just isn't there, but I suspect that's not the case, so I'd really, really prefer to take the thread-local cache approach if the rule node thing is complex.
(In reply to Emilio Cobos Álvarez [:emilio] from comment #23)
> Created attachment 8897909 [details] [diff] [review]
> Quick-n-dirty TLS cache prototype (with restrictions)
> 
> I just did a quick prototype of TLS cache, since I really think it's the
> right tradeoff now, and should give better caching that the unconditional
> rule node cache approach, and the results seem to be better, even in the
> parallel case on an 8-core machine.

So to be clear, it's not quite a fair comparison, since Cameron's patches only handle unconditional sharing.

That said, having this patch is a great benchmark to compare against. Cameron, can you try applying emilio's patch, and:
(1) reproduce his numbers from comment 23.
(2) disable his conditional sharing, and see how the unconditional sharing performance compares with your patch?

To recap from comment 13, my concerns with the TLS approach were (a) no sharing across threads, and (b) no sharing across traversals. Now that we have emilio's patches, it's probably relatively straightforward to measure how problematic (a) is in practice. (b) is harder, because we haven't really dug into the cross-restyle sharing case (and haven't fixed bug 1370604). But if the TLS approach looks good for (a), we can potentially investigate that.

Any other considerations I'm missing?
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #24)
> (In reply to Emilio Cobos Álvarez [:emilio] from comment #23)
> > Created attachment 8897909 [details] [diff] [review]
> > Quick-n-dirty TLS cache prototype (with restrictions)
> > 
> > I just did a quick prototype of TLS cache, since I really think it's the
> > right tradeoff now, and should give better caching that the unconditional
> > rule node cache approach, and the results seem to be better, even in the
> > parallel case on an 8-core machine.
> 
> So to be clear, it's not quite a fair comparison, since Cameron's patches
> only handle unconditional sharing.

Right, but that's the point I've been trying to make since comment 9, really: implementing this in a way that works better than the unconditional sharing is easier and cleaner. Especially if we want it for 57. Like, this is reasonably close to green[1] (it seems from that link that I just forgot to mark a few things as uncacheable on style fixups, and when using CSS variables; yay for that test!).

Again, I really may be overestimating the complexity of caching stuff conditionally on rule nodes in parallel, but to know that I'd need to see the code.

> To recap from comment 13, my concerns with the TLS approach were (a) no
> sharing across threads, and (b) no sharing across traversals. Now that we
> have emilio's patches, it's probably relatively straightforward to measure
> how problematic (a) is in practice. (b) is harder, because we haven't really
> dug into the cross-restyle sharing case (and haven't fixed bug 1370604). But
> if the TLS approach looks good for (a), we can potentially investigate that.

I think that given style sharing is a considerable memory win without (a) or (b), there's no reason this wouldn't also be true for this case. And again, I'd rather do it this way if it's easier to tweak, maintain, and prove correct than the rule node cache.

I also think we still have more sizable memory wins to implement, like bug 1385154.

And again, really sorry if I'm really overestimating the complexity of the rule node caching, but doing it conditionally, in parallel, before 57, and correctly (without any kind of tricky race) looks kinda hard to me.

Note that there were a few races on the initial rule tree that (even though harmless in that particular case) weren't caught until months after the initial landing.

[1]: https://treeherder.mozilla.org/#/jobs?repo=try&revision=00631bac21ab20fd862db6cbd531426b25f7640f&selectedJob=123627958
Thanks for writing that prototype patch to compare against, Emilio.  It's good to make a decision with numbers and code to compare. :-)

FWIW I think the unconditional caching that I have so far is not particularly complex.  Adding the conditions support will be more complex.  Let's see how it compares once I write it today.  These are the numbers I'm getting for the same patch, including one where I just make sharing happen in cases where we'd have conditions, as an approximation.


Without patch:

│  │  │  │  │  ├───33.78 MB (09.87%) -- servo-style-structs
│  │  │  │  │  │   ├──11.98 MB (03.50%) ── Display
│  │  │  │  │  │   ├───6.76 MB (01.98%) ── Text
│  │  │  │  │  │   ├───5.95 MB (01.74%) -- (13 tiny)
│  │  │  │  │  │   │   ├──2.06 MB (00.60%) ── TextReset
│  │  │  │  │  │   │   ├──1.39 MB (00.41%) ── Border
│  │  │  │  │  │   │   ├──0.68 MB (00.20%) ── Background
│  │  │  │  │  │   │   ├──0.45 MB (00.13%) ── Padding
│  │  │  │  │  │   │   ├──0.36 MB (00.10%) ── Margin
│  │  │  │  │  │   │   ├──0.32 MB (00.09%) ── UserInterface
│  │  │  │  │  │   │   ├──0.27 MB (00.08%) ── Color
│  │  │  │  │  │   │   ├──0.15 MB (00.04%) ── Content
│  │  │  │  │  │   │   ├──0.12 MB (00.04%) ── Effects
│  │  │  │  │  │   │   ├──0.07 MB (00.02%) ── UIReset
│  │  │  │  │  │   │   ├──0.05 MB (00.02%) ── List
│  │  │  │  │  │   │   ├──0.02 MB (00.01%) ── sundries
│  │  │  │  │  │   │   └──0.01 MB (00.00%) ── Column
│  │  │  │  │  │   ├───5.66 MB (01.65%) ── Font
│  │  │  │  │  │   └───3.43 MB (01.00%) ── Position

With patch:

│  │  │  │  ├───24.52 MB (07.53%) -- servo-style-structs
│  │  │  │  │   ├───6.73 MB (02.07%) -- (12 tiny)
│  │  │  │  │   │   ├──2.36 MB (00.73%) ── Position
│  │  │  │  │   │   ├──1.32 MB (00.41%) ── Border
│  │  │  │  │   │   ├──1.12 MB (00.34%) ── TextReset
│  │  │  │  │   │   ├──0.44 MB (00.14%) ── Background
│  │  │  │  │   │   ├──0.38 MB (00.12%) ── UserInterface
│  │  │  │  │   │   ├──0.31 MB (00.10%) ── Padding
│  │  │  │  │   │   ├──0.28 MB (00.09%) ── Margin
│  │  │  │  │   │   ├──0.27 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.11 MB (00.04%) ── Effects
│  │  │  │  │   │   ├──0.07 MB (00.02%) ── Content
│  │  │  │  │   │   ├──0.05 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   ├───6.71 MB (02.06%) ── Text
│  │  │  │  │   ├───5.74 MB (01.76%) ── Font
│  │  │  │  │   └───5.34 MB (01.64%) ── Display

Ignoring conditions:

│  │  │  │  ├───19.74 MB (06.13%) -- servo-style-structs
│  │  │  │  │   ├───7.04 MB (02.19%) ── Text
│  │  │  │  │   ├───6.68 MB (02.08%) -- (12 tiny)
│  │  │  │  │   │   ├──2.41 MB (00.75%) ── Display
│  │  │  │  │   │   ├──1.34 MB (00.42%) ── Border
│  │  │  │  │   │   ├──1.05 MB (00.33%) ── Position
│  │  │  │  │   │   ├──0.71 MB (00.22%) ── TextReset
│  │  │  │  │   │   ├──0.39 MB (00.12%) ── UserInterface
│  │  │  │  │   │   ├──0.28 MB (00.09%) ── Color
│  │  │  │  │   │   ├──0.14 MB (00.04%) ── Padding
│  │  │  │  │   │   ├──0.12 MB (00.04%) ── Background
│  │  │  │  │   │   ├──0.10 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.07 MB (00.02%) ── Margin
│  │  │  │  │   │   ├──0.04 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   └───6.02 MB (01.87%) ── Font

Ignoring conditions + STYLO_THREADS=1:

│  │  │  │  ├────9.51 MB (03.34%) -- servo-style-structs
│  │  │  │  │    ├──6.41 MB (02.25%) -- (13 tiny)
│  │  │  │  │    │  ├──2.57 MB (00.90%) ── Font
│  │  │  │  │    │  ├──1.39 MB (00.49%) ── Display
│  │  │  │  │    │  ├──0.88 MB (00.31%) ── Position
│  │  │  │  │    │  ├──0.74 MB (00.26%) ── Border
│  │  │  │  │    │  ├──0.26 MB (00.09%) ── TextReset
│  │  │  │  │    │  ├──0.17 MB (00.06%) ── UserInterface
│  │  │  │  │    │  ├──0.11 MB (00.04%) ── Color
│  │  │  │  │    │  ├──0.08 MB (00.03%) ── Effects
│  │  │  │  │    │  ├──0.06 MB (00.02%) ── Background
│  │  │  │  │    │  ├──0.05 MB (00.02%) ── Padding
│  │  │  │  │    │  ├──0.05 MB (00.02%) ── Margin
│  │  │  │  │    │  ├──0.03 MB (00.01%) ── List
│  │  │  │  │    │  └──0.02 MB (00.01%) ── sundries
│  │  │  │  │    └──3.10 MB (01.09%) ── Text



Regardless of which approach we use, I think both our numbers show that we get good additional savings (50% more) from conditional caching.


As an aside, I realised this morning that with cached reset structs, we can easily implement Gecko's "start struct" optimization, where we begin with cached reset structs from an ancestor rule node, and just apply/cascade the declarations from the descendants.  Whether this is a win or not I'm not sure, but might be worth measuring.
I've now got a very messy patch with a lock-free linked list for storing the conditional data in the rule tree.  The numbers I'm getting are:

STYLO_THREADS=4, no caching

│  │  │  │  ├───29.33 MB (09.02%) -- servo-style-structs
│  │  │  │  │   ├──10.22 MB (03.14%) ── Display
│  │  │  │  │   ├───7.83 MB (02.41%) -- (13 tiny)
│  │  │  │  │   │   ├──2.86 MB (00.88%) ── Position
│  │  │  │  │   │   ├──1.81 MB (00.56%) ── TextReset
│  │  │  │  │   │   ├──1.12 MB (00.35%) ── Border
│  │  │  │  │   │   ├──0.46 MB (00.14%) ── Background
│  │  │  │  │   │   ├──0.35 MB (00.11%) ── Padding
│  │  │  │  │   │   ├──0.35 MB (00.11%) ── UserInterface
│  │  │  │  │   │   ├──0.28 MB (00.09%) ── Margin
│  │  │  │  │   │   ├──0.25 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.12 MB (00.04%) ── Content
│  │  │  │  │   │   ├──0.11 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.06 MB (00.02%) ── UIReset
│  │  │  │  │   │   ├──0.04 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   ├───6.06 MB (01.86%) ── Text
│  │  │  │  │   └───5.23 MB (01.61%) ── Font

STYLO_THREADS=4, caching with no conditions

│  │  │  │  ├───24.03 MB (07.14%) -- servo-style-structs
│  │  │  │  │   ├───6.86 MB (02.04%) -- (12 tiny)
│  │  │  │  │   │   ├──2.65 MB (00.79%) ── Position
│  │  │  │  │   │   ├──1.21 MB (00.36%) ── Border
│  │  │  │  │   │   ├──1.01 MB (00.30%) ── TextReset
│  │  │  │  │   │   ├──0.54 MB (00.16%) ── Background
│  │  │  │  │   │   ├──0.35 MB (00.11%) ── UserInterface
│  │  │  │  │   │   ├──0.32 MB (00.09%) ── Padding
│  │  │  │  │   │   ├──0.26 MB (00.08%) ── Margin
│  │  │  │  │   │   ├──0.26 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.11 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.09 MB (00.03%) ── Content
│  │  │  │  │   │   ├──0.04 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.00%) ── sundries
│  │  │  │  │   ├───6.37 MB (01.89%) ── Text
│  │  │  │  │   ├───5.47 MB (01.62%) ── Font
│  │  │  │  │   └───5.33 MB (01.58%) ── Display


STYLO_THREADS=4, caching with conditions

│  │  │  │  ├───16.12 MB (05.18%) -- servo-style-structs
│  │  │  │  │   ├───5.80 MB (01.86%) ── Text
│  │  │  │  │   ├───5.28 MB (01.69%) -- (12 tiny)
│  │  │  │  │   │   ├──1.76 MB (00.56%) ── Display
│  │  │  │  │   │   ├──0.99 MB (00.32%) ── Border
│  │  │  │  │   │   ├──0.97 MB (00.31%) ── Position
│  │  │  │  │   │   ├──0.57 MB (00.18%) ── TextReset
│  │  │  │  │   │   ├──0.34 MB (00.11%) ── UserInterface
│  │  │  │  │   │   ├──0.24 MB (00.08%) ── Color
│  │  │  │  │   │   ├──0.10 MB (00.03%) ── Background
│  │  │  │  │   │   ├──0.09 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.09 MB (00.03%) ── Padding
│  │  │  │  │   │   ├──0.06 MB (00.02%) ── Margin
│  │  │  │  │   │   ├──0.04 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   └───5.05 MB (01.62%) ── Font

STYLO_THREADS=1, no caching

│  │  │  │  ├───19.74 MB (06.65%) -- servo-style-structs
│  │  │  │  │   ├───8.73 MB (02.94%) -- (14 tiny)
│  │  │  │  │   │   ├──2.96 MB (01.00%) ── Font
│  │  │  │  │   │   ├──2.67 MB (00.90%) ── Position
│  │  │  │  │   │   ├──0.86 MB (00.29%) ── TextReset
│  │  │  │  │   │   ├──0.76 MB (00.26%) ── Border
│  │  │  │  │   │   ├──0.45 MB (00.15%) ── Background
│  │  │  │  │   │   ├──0.26 MB (00.09%) ── Padding
│  │  │  │  │   │   ├──0.19 MB (00.06%) ── UserInterface
│  │  │  │  │   │   ├──0.15 MB (00.05%) ── Margin
│  │  │  │  │   │   ├──0.13 MB (00.04%) ── Content
│  │  │  │  │   │   ├──0.12 MB (00.04%) ── Color
│  │  │  │  │   │   ├──0.10 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.03 MB (00.01%) ── UIReset
│  │  │  │  │   │   ├──0.03 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   ├───7.55 MB (02.54%) ── Display
│  │  │  │  │   └───3.46 MB (01.16%) ── Text

STYLO_THREADS=1, caching with no conditions

│  │  │  │  ├───15.26 MB (05.19%) -- servo-style-structs
│  │  │  │  │   ├───5.02 MB (01.71%) -- (12 tiny)
│  │  │  │  │   │   ├──2.43 MB (00.83%) ── Position
│  │  │  │  │   │   ├──0.76 MB (00.26%) ── Border
│  │  │  │  │   │   ├──0.51 MB (00.17%) ── Background
│  │  │  │  │   │   ├──0.42 MB (00.14%) ── TextReset
│  │  │  │  │   │   ├──0.22 MB (00.07%) ── Padding
│  │  │  │  │   │   ├──0.19 MB (00.07%) ── UserInterface
│  │  │  │  │   │   ├──0.14 MB (00.05%) ── Margin
│  │  │  │  │   │   ├──0.13 MB (00.04%) ── Color
│  │  │  │  │   │   ├──0.10 MB (00.04%) ── Effects
│  │  │  │  │   │   ├──0.09 MB (00.03%) ── Content
│  │  │  │  │   │   ├──0.03 MB (00.01%) ── List
│  │  │  │  │   │   └──0.01 MB (00.00%) ── sundries
│  │  │  │  │   ├───3.58 MB (01.22%) ── Display
│  │  │  │  │   ├───3.58 MB (01.22%) ── Text
│  │  │  │  │   └───3.08 MB (01.05%) ── Font

STYLO_THREADS=1, caching with conditions

│  │  │  │  ├───10.69 MB (03.70%) -- servo-style-structs
│  │  │  │  │   ├───4.16 MB (01.44%) -- (12 tiny)
│  │  │  │  │   │   ├──1.61 MB (00.56%) ── Display
│  │  │  │  │   │   ├──0.91 MB (00.31%) ── Position
│  │  │  │  │   │   ├──0.76 MB (00.26%) ── Border
│  │  │  │  │   │   ├──0.26 MB (00.09%) ── TextReset
│  │  │  │  │   │   ├──0.19 MB (00.07%) ── UserInterface
│  │  │  │  │   │   ├──0.13 MB (00.04%) ── Color
│  │  │  │  │   │   ├──0.08 MB (00.03%) ── Effects
│  │  │  │  │   │   ├──0.07 MB (00.02%) ── Background
│  │  │  │  │   │   ├──0.06 MB (00.02%) ── Padding
│  │  │  │  │   │   ├──0.05 MB (00.02%) ── Margin
│  │  │  │  │   │   ├──0.03 MB (00.01%) ── List
│  │  │  │  │   │   └──0.02 MB (00.01%) ── sundries
│  │  │  │  │   ├───3.52 MB (01.22%) ── Text
│  │  │  │  │   └───3.01 MB (01.04%) ── Font


Obviously the reset structs are the ones to look at.  And for my patch the number of threads shouldn't affect the caching.  (The small differences are probably due to the style sharing cache, I guess.)

Anyway, I'll clean the patch up tomorrow and attach it so we can make a determination, before putting in the effort to get try green with it.
Attached patch rule node cache with conditions (obsolete) — Splinter Review
So this is what I've got after cleaning things up.  One thing still doesn't work, which is the clearing of the cached data during rule node gc.  And since this is really my first time writing code that needs to care about memory ordering in more than a trivial way, what I'm using is probably wrong.  But the patch should give you an idea of the complexity to weigh against the memory (and by extension time) savings.

Feedback welcome, Emilio and Bobby.
Comment on attachment 8898720 [details] [diff] [review]
rule node cache with conditions

Review of attachment 8898720 [details] [diff] [review]:
-----------------------------------------------------------------

FWIW, I asked Cam about the rest of the patch (since this doesn't call put_cached_style anywhere), and he told me that he only cleaned up the cache storage bit of the patch and not the others, which is fine. The rest should look similar enough I guess :)

I took a quick look at the patch before lunch... I fail to see how it works right now, but that's probably due to lack of context, I guess?

Overall looks reasonable. There are a few things that are going to be funny to prove correct though, which is the kind of thing I'm worried about... :(

::: servo/components/style/rule_tree/mod.rs
@@ +1065,3 @@
>              debug!("GC'ing {:?}", weak.ptr());
> +            let node = &*weak.ptr();
> +            node.clear_cached_styles();

should probably assert that the refcount is zero by here...

@@ +1087,5 @@
> +
> +        // FIXME(heycam): Can't we do this iteration without bumping the
> +        // refcounts of all rule nodes?
> +        for child in self.get().iter_children() {
> +            child.upgrade().clear_cached_styles();

Yeah, we kinda cheat for insertion to avoid this too, and presumably you can too, as long as you're sure they don't get deleted first... I'd need to think a bit hard about the ordering properties of the free list to see if this is fine to call during the GC though...

@@ +1424,5 @@
> +                      "cached ComputedValues must be for the same rule node");
> +
> +        let (cv, stored) = self.get().cached_style.put(style);
> +        if stored {
> +            self.get().refcount.fetch_sub(1, Ordering::Relaxed);

Well, this is fine because we don't do anything more complex in StrongRuleNode::drop, if the refcount is > 1, I suppose... Still seems somewhat nasty.

::: servo/components/style/rule_tree/style_cache.rs
@@ +89,5 @@
> +                StyleCacheValue::Single(cv) => {
> +                    // We attempted to store a value, but there was already a
> +                    // single value stored.
> +                    unsafe {
> +                        if (*cv).cache_condition_flags == flags {

I'm a bit unsure of how this works, as of right now, without seeing the rest of the patch. In particular, `cache_condition_flags` seems to be a bitfield with HAVE_FONT_SIZE / HAVE_WRITING_MODE / etc.

Also, that relies on storing something else in the computed values, but I suppose that's fine (though slightly more invasive...).

Does that field also contain the actual font-size / writing-mode / etc? Otherwise, how do the conditions work at all?

@@ +150,5 @@
> +                    }
> +
> +                    // Create a new linked list node to point to the existing
> +                    // one, and attempt to store it.
> +                    let head = Box::new(StyleCacheList {

This may make a lot of very short-lived allocations, but maybe / probably that's ok.

@@ +198,5 @@
> +                    unsafe {
> +                        // The transmute is needed since we're dealing with const
> +                        // pointers, but Box::{from_raw,into_raw} only deals with
> +                        // mut pointers.
> +                        item = Box::<StyleCacheList>::from_raw(mem::transmute(list));

FYI, you can do `as *mut _` instead, I think.

@@ +293,5 @@
> +
> +impl Iterator for StyleCacheListIter {
> +    type Item = *const ComputedValues;
> +
> +    fn next(&mut self) -> Option<Self::Item> {

So this list is iterated without any kind of synchronization to look for the same cache_flags during insertion, and nothing prevents any other inserter from mutating the list.

If I'm right this is kinda fine (would need to check memory orderings and such I guess), because we only insert a new head of the list, right?

If so, your patch doesn't guarantee that we insert each set of conditions just once, is that also right? (but that's probably ok).

I guess that means that insertion can get somewhat expensive when multiple people are racing to insert in a list, since you need to iterate the whole list each insertion attempt... But hopefully that's rare enough I suppose?
A couple of quick answers before I move to the kitchen...

(In reply to Emilio Cobos Álvarez [:emilio] from comment #29)
> ::: servo/components/style/rule_tree/style_cache.rs
> @@ +89,5 @@
> > +                StyleCacheValue::Single(cv) => {
> > +                    // We attempted to store a value, but there was already a
> > +                    // single value stored.
> > +                    unsafe {
> > +                        if (*cv).cache_condition_flags == flags {
> 
> I'm a bit unsure of how does this work, as of right now, without seeing the
> rest of the patch. In particular, `cache_condition_flags` seems to be a
> bitfield with HAVE_FONT_SIZE / HAVE_WRITING_MODE / etc.
> 
> Also, that relies on storing something else in the computed values, but I
> suppose that's fine (though slightly more invasive...).
> 
> Does that field also contain the actual font-size / writing-mode / etc?
> Otherwise, how do the conditions work at all?

I was originally planning to have a RuleNodeCacheConditions object, just like in your patch (and in Gecko), but I figured in the end that it would be nice if I could embed the conditions in the ComputedValues object itself.  The two values we can depend on, the font-size property value and the WritingMode, can just be read off the Box struct and the ComputedValues object itself.  (In Gecko, we potentially have a dependency on *inherited* font-size, but that's only for the font-size property itself, and since we are only caring about reset structs here, we don't need to worry about that.)  So yes, cache_condition_flags is just a u8 of UNCACHEABLE / HAVE_FONT_SIZE / HAVE_WRITING_MODE flags.
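As a sketch, the flag field could look something like this; the constant names follow the ones mentioned above, the values and the helper are illustrative, and a real lookup would additionally compare the actual font-size / writing-mode values read off the cached ComputedValues:

// Illustrative values; names follow the flags mentioned above.
const UNCACHEABLE: u8 = 1 << 0;
const HAVE_FONT_SIZE: u8 = 1 << 1;
const HAVE_WRITING_MODE: u8 = 1 << 2;

// Two entries were computed under the same set of conditions only if neither
// is uncacheable and the flag sets are identical.
fn same_condition_set(a: u8, b: u8) -> bool {
    a & UNCACHEABLE == 0 && b & UNCACHEABLE == 0 && a == b
}

fn main() {
    assert!(same_condition_set(HAVE_FONT_SIZE, HAVE_FONT_SIZE));
    assert!(!same_condition_set(HAVE_FONT_SIZE, HAVE_FONT_SIZE | HAVE_WRITING_MODE));
    assert!(!same_condition_set(UNCACHEABLE, UNCACHEABLE));
}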

> @@ +150,5 @@
> > +                    }
> > +
> > +                    // Create a new linked list node to point to the existing
> > +                    // one, and attempt to store it.
> > +                    let head = Box::new(StyleCacheList {
> 
> This may make a lot of very short-lived allocations, but maybe / probably
> that's ok.

Yeah, we can do the XXX comment thing just below if that's an issue.

> 
> @@ +198,5 @@
> > +                    unsafe {
> > +                        // The transmute is needed since we're dealing with const
> > +                        // pointers, but Box::{from_raw,into_raw} only deals with
> > +                        // mut pointers.
> > +                        item = Box::<StyleCacheList>::from_raw(mem::transmute(list));
> 
> FYI, you can do `as *mut _` instead, I think.

Huh, didn't realize you could "as" from const to mut pointers.

> @@ +293,5 @@
> > +
> > +impl Iterator for StyleCacheListIter {
> > +    type Item = *const ComputedValues;
> > +
> > +    fn next(&mut self) -> Option<Self::Item> {
> 
> So this list is iterated without any kind of synchronization to look for the
> same cache_flags during insertion, and nothing prevents any other inserter
> to mutate the list.
> 
> If I'm right this is kinda fine (would need to check memory orderings and
> such I guess), because we only insert a new head of the list, right?

That's right.

> If so, your patch doesn't guarantee that we insert each set of conditions
> just once, is that also right? (but that's probably ok).

I think it is guaranteed that we only get the same set of conditions once.  Before we attempt to write a new value, we check if the existing value (or any of the list items) has a ComputedValues with matching condition flags.

> I guess that means that insertion can get somewhat expensive when multiple
> people are racing to insert in a list, since you need to iterate the whole
> list each insertion attempt... But hopefully that's rare enough I suppose?

Given the check for identical conditions, I think it shouldn't be a problem.  If we wanted to be clever, we could, the next time around the loop, only check up until the list node that was the previous iteration's current value.
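To make the insertion scheme concrete, here is a minimal self-contained sketch of the prepend-only list with a re-scan before each CAS attempt, which is what gives the at-most-once guarantee for each set of conditions; the types, field names, and memory orderings are simplified assumptions, not the code from the attached patch.

use std::sync::atomic::{AtomicPtr, Ordering};

struct ComputedValues {
    cache_condition_flags: u8, // as in the patch discussion
}

struct ListNode {
    value: *const ComputedValues,
    next: *mut ListNode,
}

struct StyleCacheList {
    head: AtomicPtr<ListNode>,
}

impl StyleCacheList {
    // Store `value` unless an entry with the same condition flags already
    // exists.  Only the head is ever replaced, so readers can walk the list
    // with no extra synchronization; a failed CAS means another thread
    // prepended something, and the re-scan before the next attempt is what
    // keeps each set of conditions stored at most once.
    fn put(&self, value: *const ComputedValues) -> *const ComputedValues {
        let flags = unsafe { (*value).cache_condition_flags };
        loop {
            let head = self.head.load(Ordering::Acquire);

            // Scan the current list for an entry with matching conditions.
            let mut cur = head;
            while !cur.is_null() {
                let existing = unsafe { (*cur).value };
                if unsafe { (*existing).cache_condition_flags } == flags {
                    return existing; // someone beat us to it; reuse theirs
                }
                cur = unsafe { (*cur).next };
            }

            // No match: try to prepend a new node pointing at the old head.
            let new = Box::into_raw(Box::new(ListNode { value, next: head }));
            match self.head.compare_exchange(head, new, Ordering::AcqRel, Ordering::Acquire) {
                Ok(_) => return value,
                // Lost the race: free our node and go around again.
                Err(_) => unsafe { drop(Box::from_raw(new)) },
            }
        }
    }
}

fn main() {
    let list = StyleCacheList { head: AtomicPtr::new(std::ptr::null_mut()) };
    let a = Box::into_raw(Box::new(ComputedValues { cache_condition_flags: 0 }));
    let b = Box::into_raw(Box::new(ComputedValues { cache_condition_flags: 0 }));
    // A second put with the same flags returns the first entry.
    assert_eq!(list.put(a), a as *const _);
    assert_eq!(list.put(b), a as *const _);
}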
Originally I had hoped that we could store a whole ComputedValues in the RuleNode's cache, and when we first produce a ComputedValues and decide to store in the cache, if we don't need to do any StyleAdjuster work on it, then we could return another reference to that same ComputedValues from apply_declarations.  Because the ComputedValues objects held in the RuleNode's cache would have a strong reference to the RuleNode itself, it seemed like we could just mess with the RuleNode's refcount when adding and removing entries from the cache to counteract this, and have refcount = 0 mean "there are no StrongRuleNode references aside from any in the cache".

Unfortunately, I don't think this scheme works, because we are caching ComputedValues objects, and really what we want to know (when messing with the refcount) is "is this a unique ComputedValues object", because only then do we want to discount its own strong RuleNode reference.  But if we can return a reference to this same ComputedValues object, then that means we can have Arc<ComputedValues> out in the wild referencing a RuleNode with refcount = 0.  This would mean during GC we would need to check whether this 0 really meant that we could destroy the RuleNode or not.  That check would involve looking at the current cache entries on the RuleNode and counting the number that have unique Arc<ComputedValues>s.  This is do-able, but would complicate the free list, since we would have many more RuleNodes in there with refcount = 0 which we would then determine aren't eligible for deletion.

It's an additional complication that the visited values have their own StrongRuleNode reference.

Anything more clever than this would need coordination with Arc<ComputedValues>, I guess, which would be even less clean.  And would probably exceed my desired cleverness limit anyway.

So for now I'm going to store ComputedValues in the RuleNode, but without a RuleNode reference (or visited_style) in it.  And measure the memory impact of that.  Note that my measurements from comment 27 did not try to use the same ComputedValues object for storing in the cache and immediately returning it from apply_declarations (when there were no style adjustments), so I don't think this changes the memory saving of the patch so far.

(Changing to store a special cache entry object instead of a ComputedValues object, which would just have the 15 reset struct pointers plus the dependent conditional data, could save a bit more.  This is probably a more marginal saving, though, so I'll save that for later.  Or I might do it, depending on how quickly I can flush out these last oranges.)
Attached patch rule node cache with conditions (obsolete) — Splinter Review
This patch, which I reworked a bit so it doesn't store entire ComputedValues objects in the cache, doesn't do as well as I expected.  I haven't managed to work out why it does worse than the TLS cache with STYLO_THREADS=4.
Attachment #8898720 - Attachment is obsolete: true
I dumped some stats from my patch on the full page HTML spec:

    227 cacheable 
    130 cacheable HAVE_FONT_SIZE
     17 cacheable HAVE_FONT_SIZE | HAVE_WRITING_MODE
      8 cacheable HAVE_WRITING_MODE
   1932 UNCACHEABLE due to border-bottom-color: currentcolor
    256 UNCACHEABLE due to border-right-color: currentcolor
   5117 UNCACHEABLE due to is_root()
      4 UNCACHEABLE due to margin-right: 5rem
   1071 UNCACHEABLE due to mutable style source
  98250 UNCACHEABLE due to pseudo
  11821 UNCACHEABLE due to text-decoration-color: currentcolor
    383 UNCACHEABLE due to vertical-align: inherit
  43702 UNCACHEABLE due to visited-dependent
  51083 used cached reset style

So 382 unique sets of reset structs, and 51083 times we re-use them.  That seems pretty good.  We don't know what specific structs were mutated there, so we can't directly correlate those to memory savings.

As Emilio mentions in a comment in his patch, it might be worth also caching visited dependent contexts.  The standout one above is probably the pseudo, where I'm faulting out of caching if we're computing for any pseudo.  The fact that we have all the eager pseudos to cascade probably doesn't help.  But really I'm only skipping pseudos because of pseudo restrictions, where we'd get different cascade results.  But now that I think about that, we shouldn't need to worry about that at all, since the fact that we have a rule node for the pseudo rule should be enough to distinguish it from other, non-pseudo things... so without that I get:

    248 cacheable 
    162 cacheable HAVE_FONT_SIZE
     17 cacheable HAVE_FONT_SIZE | HAVE_WRITING_MODE
     10 cacheable HAVE_WRITING_MODE
     99 UNCACHEABLE due to background-color: inherit
   1919 UNCACHEABLE due to border-bottom-color: currentcolor
   2405 UNCACHEABLE due to border-right-color: currentcolor
      1 UNCACHEABLE due to display: inherit
   4878 UNCACHEABLE due to is_root()
      4 UNCACHEABLE due to margin-right: 5rem
   1020 UNCACHEABLE due to mutable style source
      2 UNCACHEABLE due to opacity: inherit
   7915 UNCACHEABLE due to overflow-clip-box: inherit
  11526 UNCACHEABLE due to text-decoration-color: currentcolor
     15 UNCACHEABLE due to text-decoration-line: inherit
    136 UNCACHEABLE due to text-overflow: inherit
     82 UNCACHEABLE due to vector-effect: inherit
    353 UNCACHEABLE due to vertical-align: inherit
  40738 UNCACHEABLE due to visited-dependent
 130826 used cached reset style

I also checked how much it's wastefully caching all-default reset structs, but it's not that often:

    420 cached all default structs

I realize that I have some restrictions that Emilio's patch doesn't (e.g. the currentcolor ones).  (Also it looks like there's a small mistake in that patch where "initial" rather than "inherit" for reset properties is treated as implying uncacheability.  But given the relatively few "inherit" values I see above, that probably doesn't make a material difference to the memory usage.)


I'll do one more round of memory measurements with the pseudo thing fixed (for a comparison), and one where we're also handling visited-dependent style, and see where we stand.  If my patch still isn't clearly better at that point I think we should go for Emilio's.
(In reply to Cameron McCormack (:heycam) from comment #33)
>    1071 UNCACHEABLE due to mutable style source

An interesting thing to note is that this one is required in my patch (to handle style="" declarations changing underneath us), because we retain the cached data for subsequent traversals.  Emilio's patch does not need to worry about that.
(In reply to Cameron McCormack (:heycam) from comment #33)
> I'll do one more round of memory measurements with the pseudo thing fixed
> (for a comparison), and one where we're also handling visited-dependent
> style, and see where we stand.  If my patch still isn't clearly better at
> that point I think we should go for Emilio's.

Ok. If we do, I think we should make sure we understand why we can't achieve the same degree of sharing as gecko with the precise approach. The 7MB delta between emilio's patch and gecko accounts for 25-35% of our AWSY regression, and would put a bound on how close we can get.
(In reply to Cameron McCormack (:heycam) from comment #33)
> But now that I think about that, we
> shouldn't need to worry about that at all, since the fact that we have a
> rule node for the pseudo rule should be enough to distinguish it from other,
> non-pseudo things...

This is wrong, per the nine-year-old bug 469227 comment 9, since we have the same rule node for multiple selectors on the one rule, and one selector can have a restricting pseudo while the other does not.  Still, we can handle this straightforwardly by turning off caching if we need to apply any restrictions.  (Or store which restricting pseudo we had in our conditions, but this isn't common enough to worry about.)
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #35)
> Ok. If we do, I think we should make sure we understand why we can't achieve
> the same degree of sharing as gecko with the precise approach. The 7MB delta
> between emilio's patch and gecko accounts for 25-35% of our AWSY regression,
> and would put a bound on how close we can get.

Some more guesses:

* the fact that we don't decide to cache per struct, but for all reset structs at once (might be a factor)
* the fact that we can't cache entirely-specified inherited structs in the rule tree (less likely)
* Gecko's lazy struct resolution (might be...)


And here are some HTML spec measurements as of now:

STYLO_THREADS=4 heycam: 14.83 MB
STYLO_THREADS=4 emilio: 22.09 MB
STYLO_THREADS=1 heycam: 11.11 MB
STYLO_THREADS=1 emilio: 12.42 MB
                 Gecko:  2.87 MB

This is after I switched the initial/inherit handling in Emilio's patch.  Since we're basically on par with STYLO_THREADS=1, I think I've found the cases where I was missing sharing in my patch.  No AWSY measurements on the latest patches yet.

End of my day now, but tomorrow I'm going to do some instrumentation of Gecko to see whether any of those guesses above might be correct.
(In reply to Cameron McCormack (:heycam) from comment #37)
> * the fact that we don't decide to cache per struct, but for all reset
> structs at once (might be a factor)

With my current patch we are failing to cache reset data due to explicit inherit values in 9,589 cases.  I haven't recorded whether we would have been able to successfully cache all the other reset structs on the same rule node.  The total weight of all reset structs is 1,992 bytes.  The average weight of all reset structs but one, if we assume only one is uncacheable, is 1,860 bytes.  So, under that assumption, that would give an average total of ~17 MB due to missed caching.  The real value is probably going to be a bit less.

> * the fact that we can't cache entirely-specified inherited structs in the
> rule tree (less likely)

In the HTML spec, Gecko caches 48,171 fully specified inherited structs in the rule tree, for a total saving of 192,684 bytes.  My guess is that these are all Color structs.

> * Gecko's lazy struct resolution (might be...)

In the HTML spec, forcing all style structs to be computed in the document only caused around 300 more to be created.


Caching per struct seems like the right thing to look at.
Well, that helped slightly, but not nearly as much as I'd hoped.

Another place where I think we are missing out on some sharing is anonymous box style data.  The HTML spec has a lot of them:

   4204 ResolveInheritingAnonymousBoxStyle :-moz-table-column
   3180 ResolveInheritingAnonymousBoxStyle :-moz-cell-content
   1278 ResolveInheritingAnonymousBoxStyle :-moz-table-row
    947 ResolveInheritingAnonymousBoxStyle :-moz-table-wrapper
    947 ResolveInheritingAnonymousBoxStyle :-moz-table-column-group
    908 ResolveInheritingAnonymousBoxStyle :-moz-table-row-group
     78 ResolveInheritingAnonymousBoxStyle :-moz-svg-text
        ...

They are unique ComputedValues objects, i.e. we don't use a cached anonymous box style in ServoStyleSet::ResolveNonInheritingAnonymousBoxStyle.

A bunch of these just have a |display: whatever| on them, so should be shareable.  But notably, the ::-moz-cell-content rule also has |text-overflow: inherit; overflow-clip-box: inherit;|, which causes Text and Box not to be shareable.  Text is 80 bytes, which is 254,400 bytes of lost sharing, and Box is 416 bytes, which is 1,322,880 bytes of lost sharing.

::-moz-table-wrapper is also interesting:

  http://searchfox.org/mozilla-central/rev/89e125b817c5d493babbc58ea526be970bd3748e/layout/style/res/ua.css#24

So that's all of Box, Margin, Position, and Effects that aren't cacheable due to inherit values.  That's a total of 936 bytes per ::-moz-table-wrapper, so 886,392 bytes of missed sharing.

Gecko is probably sharing a bunch of these styles with its style context sharing mechanism.  In Stylo, the style sharing cache won't be used for anonymous boxes.

The other top anonymous boxes other than ::-moz-cell-content and ::-moz-table-wrapper don't have uncacheable reset structs, but the inability to share the ComputedValues themselves (assuming they could be -- they might have different ancestor chains that wouldn't be cousin-shareable) could be up to ~2 MB.

Why is bug 1367904 ineffective here?  Is it because we don't have cousin shared ServoStyleContext objects?
(In reply to Cameron McCormack (:heycam) from comment #39)
> Why is bug 1367904 ineffective here?  Is it because we don't have cousin
> shared ServoStyleContext objects?

I mean bug 1368290.
My current patch, which adds conditions for the explicitly inherited reset properties mentioned in ::-moz-cell-content, only gets STYLO_THREADS=1 down to 10.21 MB.

│  │  │  │  ├───10.21 MB (03.29%) -- servo-style-structs
│  │  │  │  │   ├───3.69 MB (01.19%) ── Text
│  │  │  │  │   ├───3.29 MB (01.06%) ── Font
│  │  │  │  │   └───3.24 MB (01.04%) -- (12 tiny)
│  │  │  │  │       ├──1.54 MB (00.50%) ── Display
│  │  │  │  │       ├──0.70 MB (00.22%) ── Border
│  │  │  │  │       ├──0.43 MB (00.14%) ── Position
│  │  │  │  │       ├──0.21 MB (00.07%) ── UserInterface
│  │  │  │  │       ├──0.13 MB (00.04%) ── Color
│  │  │  │  │       ├──0.05 MB (00.02%) ── Background
│  │  │  │  │       ├──0.05 MB (00.01%) ── Margin
│  │  │  │  │       ├──0.04 MB (00.01%) ── Effects
│  │  │  │  │       ├──0.03 MB (00.01%) ── TextReset
│  │  │  │  │       ├──0.03 MB (00.01%) ── List
│  │  │  │  │       ├──0.03 MB (00.01%) ── sundries
│  │  │  │  │       └──0.01 MB (00.00%) ── Padding

With Nick's help in bug 1393636, we now have a breakdown of Gecko style struct usage per struct.  This is what I get currently in Gecko:

│  │  │  │  ├────2.99 MB (01.06%) -- (4 tiny)
│  │  │  │  │    ├──2.78 MB (00.99%) -- gecko-style-structs
│  │  │  │  │    │  ├──0.80 MB (00.29%) ── Display
│  │  │  │  │    │  ├──0.73 MB (00.26%) ── Font
│  │  │  │  │    │  ├──0.62 MB (00.22%) ── Text
│  │  │  │  │    │  ├──0.23 MB (00.08%) ── Position
│  │  │  │  │    │  ├──0.12 MB (00.04%) ── TextReset
│  │  │  │  │    │  ├──0.11 MB (00.04%) ── UserInterface
│  │  │  │  │    │  ├──0.04 MB (00.01%) ── Border
│  │  │  │  │    │  ├──0.03 MB (00.01%) ── Background
│  │  │  │  │    │  ├──0.02 MB (00.01%) ── sundries
│  │  │  │  │    │  ├──0.02 MB (00.01%) ── List
│  │  │  │  │    │  ├──0.02 MB (00.01%) ── Margin
│  │  │  │  │    │  ├──0.01 MB (00.00%) ── Padding
│  │  │  │  │    │  └──0.01 MB (00.00%) ── Effects

And writing out Stylo's usage for the top structs (with my current patch) as a percentage of Gecko struct usage, it's:

  Text:     595.2%
  Font:     450.7%
  Display:  192.5%
  Position: 187.0%
  UI:       190.9%

Display and Position would probably go down a bit if we allowed rule node cache conditions to depend on arbitrary explicitly inherited properties (which would handle ::-moz-table-wrapper).

But why is the usage of Text and Font (two inherited structs) so high?
(In reply to Cameron McCormack (:heycam) from comment #41)
> │  │  │  │  │   ├───3.29 MB (01.06%) ── Font

To put that in perspective, that's ~5.6 Font structs per element in the document...
Ah, that's because there's a bare

  ::before { font: ... }

rule in the style sheet.  I haven't yet rebased over bug 1385154, which will help with this.
The current patch, along with bug 1385154 and the bug 1367854 comment 24 idea, gets us down to 5.25 MB with STYLO_THREADS=1.  Still no decrease in explicit/resident sizes as reported by AWSY.  I need to look into that and understand what's going on.
(In reply to Cameron McCormack (:heycam) from comment #39)
> A bunch of these just have a |display: whatever| on them, so should be
> shareable.  But notably, the ::-moz-cell-content rule also has
> |text-overflow: inherit; overflow-clip-box: inherit;|, which causes Text and
> Box not to be shareable.  Text is 80 bytes, which is 254,500 bytes of lost
> sharing, and Box is 416 bytes, which is 1,322,880 bytes of lost sharing.
> 
> ::-moz-table-wrapper is also interesting:
> 
>  
> http://searchfox.org/mozilla-central/rev/
> 89e125b817c5d493babbc58ea526be970bd3748e/layout/style/res/ua.css#24
> 
> So that's all of Box, Margin, Position, and Effects that aren't cacheable
> due to inherit values.  That's a total of 936 bytes per
> ::-moz-table-wrapper, so 886,392 bytes of missed sharing.
> 
> Gecko is probably sharing a bunch of these styles with its style context
> sharing mechanism.  In Stylo, the style sharing cache won't be used for
> anonymous boxes.

IIUC, Gecko checks whether all properties for a struct are inherit or initial, regardless of whether the struct itself is inherit or initial. That mechanism may work especially well for this kind of case, I suppose.
For these anonymous boxes, not all properties in the one (reset) struct are set to inherit, so unfortunately it doesn't help.
Attached patch rule node cache with conditions (obsolete) — Splinter Review
Current patch.
Attachment #8900555 - Attachment is obsolete: true
Attached patch rule node cache with conditions (obsolete) — Splinter Review
Rebased.
Attachment #8901735 - Attachment is obsolete: true
We're going to land the TLS struct sharing patch.  Will push that (for me to review) and some followup patches (for Emilio to review) in a moment.
In terms of AWSY effect, it performs slightly better after these fixes.  (Presumably due to not faulting out of sharing for values of inherited properties.)
Attachment #8902610 - Attachment is obsolete: true
Comment on attachment 8907459 [details]
Bug 1367635 - Part 1: Add a TLS-based style struct caching mechanism.

https://reviewboard.mozilla.org/r/179144/#review184264
Attachment #8907459 - Flags: review?(cam) → review+
Comment on attachment 8907460 [details]
Bug 1367635 - Part 2: Set writing mode dependency and uncacheable state only for non-inherited properties.

https://reviewboard.mozilla.org/r/179146/#review184286
Attachment #8907460 - Flags: review?(emilio) → review+
Comment on attachment 8907463 [details]
Bug 1367635 - Part 5: Make structs uncacheable if logical float/clear property values are used.

https://reviewboard.mozilla.org/r/179152/#review184294

::: servo/components/style/properties/longhand/box.mako.rs:281
(Diff revision 1)
>          fn to_computed_value(&self, context: &Context) -> computed_value::T {
>              let ltr = context.style().writing_mode.is_bidi_ltr();
>              // https://drafts.csswg.org/css-logical-props/#float-clear
>              match *self {
> -                SpecifiedValue::inline_start if ltr => computed_value::T::left,
> -                SpecifiedValue::inline_start => computed_value::T::right,
> +                SpecifiedValue::inline_start => {
> +                    context.rule_cache_conditions.borrow_mut()

Maybe we could reduce the number of borrow_mut() calls by adding methods on `Context` instead... your call.
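
Something like this, roughly, with stand-in types (the real Context and RuleCacheConditions live in the style crate, and the method/field names here are guesses):

    use std::cell::RefCell;

    #[derive(Default)]
    struct RuleCacheConditions { depends_on_writing_mode: bool }

    struct Context { rule_cache_conditions: RefCell<RuleCacheConditions> }

    impl Context {
        // One forwarding method so callers don't repeat borrow_mut() at
        // every use site.
        fn note_writing_mode_dependency(&self) {
            self.rule_cache_conditions.borrow_mut().depends_on_writing_mode = true;
        }
    }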
Attachment #8907463 - Flags: review?(emilio) → review+
Comment on attachment 8907459 [details]
Bug 1367635 - Part 1: Add a TLS-based style struct caching mechanism.

https://reviewboard.mozilla.org/r/179144/#review184298

::: servo/components/style/rule_cache.rs:71
(Diff revision 1)
> +}
> +
> +/// A TLS cache from rules matched to computed values.
> +pub struct RuleCache {
> +    // FIXME(emilio): Consider using LRUCache or something like that?
> +    map: FnvHashMap<StrongRuleNode, Vec<(RuleCacheConditions, Arc<ComputedValues>)>>,

While at it, given how we use this, this could be a SmallVec.
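
Something along these lines, say (the inline capacity is a guess, and the types here are just stand-ins for the real ones):

    use std::sync::Arc;
    use fnv::FnvHashMap;
    use smallvec::SmallVec;

    struct StrongRuleNode;       // stand-ins for the real types
    struct RuleCacheConditions;
    struct ComputedValues;

    // Keep the first few (conditions, values) pairs inline per rule node;
    // most nodes only ever see a handful of distinct condition sets.
    type Entries = SmallVec<[(RuleCacheConditions, Arc<ComputedValues>); 3]>;
    type RuleCacheMap = FnvHashMap<StrongRuleNode, Entries>;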
Comment on attachment 8907464 [details]
Bug 1367635 - Part 6: Add writing mode dependency if special MozLength keywords are used.

https://reviewboard.mozilla.org/r/179154/#review184302

::: servo/components/style/values/computed/length.rs:731
(Diff revision 2)
>          match *self {
>              specified::MozLength::LengthOrPercentageOrAuto(ref lopoa) => {
>                  MozLength::LengthOrPercentageOrAuto(lopoa.to_computed_value(context))
>              }
>              specified::MozLength::ExtremumLength(ref ext) => {
> +                debug_assert!(context.for_non_inherited_property.is_some(),

Huh, too bad, this could've been derived :/
Attachment #8907464 - Flags: review?(emilio) → review+
Comment on attachment 8907465 [details]
Bug 1367635 - Part 7: Make structs uncacheable if currentcolor is used on properties not using ComplexColor storage.

https://reviewboard.mozilla.org/r/179156/#review184310

I wonder if your derive approach would be cleaner (just calling it from `cascade`, and tweaking the right RuleCacheConditions)... If it's too much of a hassle to change it at this point, maybe you could file an issue on it?
Attachment #8907465 - Flags: review?(emilio) → review+
(In reply to Emilio Cobos Álvarez [:emilio] from comment #76)
> I wonder if your derive approach would be cleaner (just calling it from
> `cascade`, and tweaking the right RuleCacheConditions)... If it's too much
> of a hassle to change it at this point, maybe you could file an issue on it?

I will file a followup for moving to a derive-based thing.  I think the currentcolor changes aren't very different in the derived version though.
Comment on attachment 8907466 [details]
Bug 1367635 - Part 8: Don't cache structs if they have been adjusted.

https://reviewboard.mozilla.org/r/179158/#review184314

::: servo/components/style/properties/properties.mako.rs:2741
(Diff revision 2)
>          % if property.ident == "display":
>          self.flags.insert(::properties::computed_value_flags::INHERITS_DISPLAY);
>          % endif
>  
> +        % if not property.style_struct.inherited:
> +        self.modified_reset = true;

This could move to the `INHERITS_RESET_STYLE` block. Or even just use that flag.

::: servo/components/style/properties/properties.mako.rs:2912
(Diff revision 2)
> +        self.modified_reset = false;
> +    }
> +
> +    /// Returns whether we have mutated any reset structs since the the last
> +    /// time `clear_modified_reset` was called.
> +    pub fn modified_reset(&self) -> bool {

no need to be `pub`, here and everywhere else.
Attachment #8907466 - Flags: review?(emilio) → review+
Comment on attachment 8907459 [details]
Bug 1367635 - Part 1: Add a TLS-based style struct caching mechanism.

https://reviewboard.mozilla.org/r/179144/#review184322

::: servo/components/style/properties/gecko.mako.rs:266
(Diff revision 1)
>  impl ComputedValuesInner {
> +    /// Whether we're a visited style.
> +    pub fn is_style_if_visited(&self) -> bool {
> +        self.flags.contains(IS_STYLE_IF_VISITED)
> +    }
> +

self-nit: there's an extra newline here.

::: servo/components/style/stylesheets/viewport_rule.rs:712
(Diff revision 1)
>          let provider = get_metrics_provider_for_product();
>  
>          let default_values = device.default_computed_values();
>  
> +        // FIXME(emilio): remove Default::default(),
> +        let mut conditions = Default::default();

Can you just address the fixme (here and above) and use `RuleCacheConditions::default`? I was being somewhat lazy because I wanted to get it done fast.
Comment on attachment 8907467 [details]
Bug 1367635 - Part 9: Don't use rule cache for property-restricted pseudo-elements.

https://reviewboard.mozilla.org/r/179160/#review184320

::: servo/components/style/properties/properties.mako.rs:3224
(Diff revision 2)
>              custom_properties, &inherited_custom_properties);
>  
> +    // A pseudo-element with property restrictions can result in different
> +    // computed values if it's also used for a non-pseudo. Don't cache any
> +    // structs in this case, or use existing cached structs.
> +    if pseudo.and_then(|p| p.property_restriction()).is_some() {

Should this be done instead in `RuleCache::find` (where we do the :visited handling)?

What prevents us from inserting them into the cache? That's done outside of `cascade`, so this probably also needs handling in `insert_if_possible`.

The builder already tracks the pseudo, so it should be easy.
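
Roughly the shape I have in mind, with stand-in types (the real RuleCache/StyleBuilder live in the style crate, and the names here are guesses):

    struct PseudoElement { property_restriction: Option<u32> }
    struct StyleBuilder<'a> { pseudo: Option<&'a PseudoElement> }

    struct RuleCache;

    impl RuleCache {
        // Restricted pseudo-elements neither read from nor write to the cache.
        fn applies_to(builder: &StyleBuilder) -> bool {
            builder.pseudo.map_or(true, |p| p.property_restriction.is_none())
        }

        fn find(&self, builder: &StyleBuilder) -> Option<()> {
            if !Self::applies_to(builder) {
                return None;
            }
            // ... look up by rule node and matching conditions ...
            None
        }

        fn insert_if_possible(&mut self, builder: &StyleBuilder) {
            if !Self::applies_to(builder) {
                return;
            }
            // ... insert the newly cascaded structs ...
        }
    }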
Attachment #8907467 - Flags: review?(emilio)
Comment on attachment 8907461 [details]
Bug 1367635 - Part 3: Record the property we are computing on computed::Context, if it's a non-inherited one.

https://reviewboard.mozilla.org/r/179148/#review184324

I wasn't sure why you needed the property, which is why I reviewed the following patches first. Looks good.

::: servo/components/style/properties/helpers.mako.rs:317
(Diff revision 1)
>                      panic!("variables should already have been substituted")
>                  }
>                  _ => panic!("entered the wrong cascade_property() implementation"),
>              };
>  
> +            context.for_non_inherited_property =

not sure if I love the `for_`. But I guess it's fine.
Attachment #8907461 - Flags: review?(emilio) → review+
Comment on attachment 8907462 [details]
Bug 1367635 - Part 4: Make structs uncacheable if ex/ch units are used.

https://reviewboard.mozilla.org/r/179150/#review184326
Attachment #8907462 - Flags: review?(emilio) → review+
(In reply to Emilio Cobos Álvarez [:emilio] from comment #80)
> ::: servo/components/style/properties/properties.mako.rs:3224
> (Diff revision 2)
> >              custom_properties, &inherited_custom_properties);
> >  
> > +    // A pseudo-element with property restrictions can result in different
> > +    // computed values if it's also used for a non-pseudo. Don't cache any
> > +    // structs in this case, or use existing cached structs.
> > +    if pseudo.and_then(|p| p.property_restriction()).is_some() {
> 
> Should this be done instead in `RuleCache::find` (where we do the :visited
> handling)?

Sounds good.

> What prevents us from inserting them into the cache? That's done outside of
> `cascade`, so this probably also needs handling in `insert_if_possible`.

Oops, yes.  Anyway, makes sense to put these checks in the RuleCache functions themselves.
Comment on attachment 8907468 [details]
Bug 1367635 - Part 11: Don't re-cache structs if we just got them from the cache.

https://reviewboard.mozilla.org/r/179162/#review184328

I don't understand why this is needed. This is not a big allocation. Is it just to save the `malloc` of the `HashMap`'s buffer? I believe that may not necessarily always be a win.

If this were a huge stack type that we boxed, like the style sharing cache, I'd get it (though I still don't like it).
Attachment #8907468 - Flags: review?(emilio)
Comment on attachment 8907469 [details]
Bug 1367635 - Part 10: Don't cache structs with custom property references.

https://reviewboard.mozilla.org/r/179164/#review184330
Attachment #8907469 - Flags: review?(emilio) → review+
(In reply to Emilio Cobos Álvarez [:emilio] from comment #84)
> I don't understand why this is needed. This is not a big allocation. Is it
> just to save the `malloc` of the `HashMap`'s buffer? I believe that may not
> necessarily always be a win.
> 
> If this were a huge stack type that we boxed, like the style sharing cache,
> I'd get it (though I still don't like it).

Bobby (since you requested this)?
Flags: needinfo?(bobbyholley)
(In reply to Cameron McCormack (:heycam) from comment #86)
> (In reply to Emilio Cobos Álvarez [:emilio] from comment #84)
> > I don't understand why this is needed. This is not a big allocation. Is it
> > just to save the `malloc` of the `HashMap`'s buffer? I believe that may not
> > necessarily always be a win.
> > 
> > If this were a huge stack type that we boxed, like the style sharing cache,
> > I'd get it (though I still don't like it).
> 
> Bobby (since you requested this)?

HashMap buffers aren't small. The initial allocation size is 32 entries, which includes both key and value, so especially if we have some SmallVecs in there (which we totally should), the buffer would be > 1k.

That said, this is probably less of an issue now that we have the adaptive traversal. When I originally suggested this I imagined that we could just stick it in our already-persisted storage, but Cameron pointed out that it would mess up the abstractions a bit.

So just measure the impact on tiny-traversals-singleton. If the number doesn't move, I'm fine to skip this. If it does, we should do this or something similar.
Flags: needinfo?(bobbyholley)
(In reply to Bobby Holley (:bholley) (busy with Stylo) from comment #87)
> So just measure the impact on tiny-traversals-singleton. If the number
> doesn't move, I'm fine to skip this. If it does, we should do this or
> something similar.

It didn't make much difference.  On my machine, 1935 without that patch, 1924 with it.  I'll drop it.
Comment on attachment 8907468 [details]
Bug 1367635 - Part 11: Don't re-cache structs if we just got them from the cache.

https://reviewboard.mozilla.org/r/179162/#review184838
Attachment #8907468 - Flags: review?(emilio) → review+
Comment on attachment 8907467 [details]
Bug 1367635 - Part 9: Don't use rule cache for property-restricted pseudo-elements.

https://reviewboard.mozilla.org/r/179160/#review184842
Attachment #8907467 - Flags: review?(emilio) → review+
This landed: https://github.com/servo/servo/commit/874cb0d9df44e62a78d427f22f234a13227d07f8
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Renaming this bug, since we're not actually caching the structs in the rule tree anymore.
Summary: stylo: consider caching style structs in the rule tree → stylo: share reset structs during the style traversal