Closed Bug 685438 Opened 13 years ago Closed 13 years ago

Avoid wasted space in nsTArray_base due to jemalloc rounding up

Categories: Core :: XPCOM, defect
Severity: normal
Status: RESOLVED FIXED
Target Milestone: mozilla9

Reporter: n.nethercote
Assignee: justin.lebar+bug
Whiteboard: [MemShrink:P2][clownshoes]
Attachments: 1 file, 4 obsolete files

nsTArray_base<Alloc>::EnsureCapacity has this code:

  Header *header;
  if (UsesAutoArrayBuffer()) {
    // Malloc() and copy
    header = static_cast<Header*>
             (Alloc::Malloc(sizeof(Header) + capacity * elemSize));
    if (!header)
      return PR_FALSE;

    memcpy(header, mHdr, sizeof(Header) + Length() * elemSize);
  } else {
    // Realloc() existing data
    size_type size = sizeof(Header) + capacity * elemSize;
    header = static_cast<Header*>(Alloc::Realloc(mHdr, size));
    if (!header)
      return PR_FALSE;
  }

AFAICT |capacity| is always a power-of-two, which means we're allocating slightly more than a power-of-two, which jemalloc rounds up to the next class size.  E.g. 1024 -> 1032 -> 2048.  I'm seeing this quite a lot relating to CSS stuff (e.g. below nsCSSRuleProcessor::CascadeSheet).

Is it possible to store the header separately?  It's late in the day and nsTArray is intimidating to a newcomer, so I don't yet know the answer to that question.

Oh, I found this:

 // nsTArray_base stores elements into the space allocated beyond
 // sizeof(*this).  This is done to minimize the size of the nsTArray
 // object when it is empty.
 struct NS_COM_GLUE nsTArrayHeader
 {   
   static nsTArrayHeader sEmptyHdr;
   
   PRUint32 mLength;
   PRUint32 mCapacity : 31;
   PRUint32 mIsAutoArray : 1;
 };

That may minimize the size of zero-length arrays, but it really hurts with larger arrays.

This bug is probably highly related to bug 682735;  I freely admit to not having followed the details and current situation with that bug, but it would be nice if the above behaviour could be taken into consideration.
BTW, after opening Gmail and TechCrunch I see that there is almost 8MB of *cumulative* waste from this problem.  If a lot of that cumulative waste is also from long-lived allocations (as CSS stuff tends to be), this could be a big win for memory usage.
Whiteboard: [MemShrink][clownshoes]
> AFAICT |capacity| is always a power-of-two

It's a power of two times the original capacity passed to the constructor or the first EnsureCapacity call, right?

In practice, that will also be a power of 2, of course.

> e.g. below nsCSSRuleProcessor::CascadeSheet

Where below there?  There are some temporary arrays there that can get long, and some long-lived ones that shouldn't get long (and that we're about to make into auto arrays)... would be good to know which ones you're particularly seeing.  That said, it may be worth waiting for the existing bugs on the CSS arrays to be sorted out before measuring that.
Er, just read comment 1.

Does "cumulative" mean "total rounding-up of all allocations including ones that have already been deallocated since then" in this case?
> Does "cumulative" mean "total rounding-up of all allocations including ones that have already been 
> deallocated since then" in this case?

I think so.

I'd looked at this, but then I decided it wasn't as big a problem as all the malloc'ing, because the largest I could get the sum of malloc_usable_size over live nsTArrays' heap allocations was 4MB out of 200MB heap-allocated, and at most half of that is waste.  But I do think this is worth fixing, either as part of bug 682735 or separately.

> Is it possible to store the header separately?  It's late in the day and nsTArray is 
> intimidating to a newcomer, so I don't yet know the answer to that question.

One could.  It's done this way because it's one malloc for header + data rather than two; it also increases the locality of accesses.

I think the solution is probably just to change how TArray chooses its capacity.
> But I do think this is worth fixing, 

If only because, as you pointed out in another bug, this causes unnecessary churn in the allocator.
(In reply to Boris Zbarsky (:bz) from comment #2)
> > AFAICT |capacity| is always a power-of-two
> 
> It's a power of two times the original capacity passed to the constructor or
> the first EnsureCapacity call, right?
> 
> In practice, that will also be a power of 2, of course.

Yes.  But capacity*elemSize is not necessarily a power-of-two.  I've found that's a big remaining source of slop bytes in the various array, vector, and hash table types we have.  It's a bit tricky to fix, though (well, arrays/vectors are easier than hash tables), and beyond the scope of this bug, because in the case I'm seeing capacity*elemSize *is* a power-of-two.


> > e.g. below nsCSSRuleProcessor::CascadeSheet
> 
> Where below there?

I'm seeing lots of stack traces like this:

 track (jemalloc.c:3780) 
 realloc (jemalloc.c:3839)
 moz_xrealloc (mozalloc.cpp:135)
 nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity(unsigned int, unsigned int) (nsTArray.h:89)
 CascadeRuleEnumFunc(mozilla::css::Rule*, void*) (nsTArray.h:811)
 nsVoidArray::EnumerateForwards(int (*)(void*, void*), void*) (nsVoidArray.cpp:724)
 nsCSSRuleProcessor::CascadeSheet(nsCSSStyleSheet*, CascadeEnumData*) (nsCOMArray.h:65)
 nsCSSRuleProcessor::RefreshRuleCascade(nsPresContext*) (nsCSSRuleProcessor.cpp:3034)
 nsCSSRuleProcessor::GetRuleCascade(nsPresContext*) (nsCSSRuleProcessor.cpp:2992)
 nsCSSRuleProcessor::RulesMatching(AnonBoxRuleProcessorData*) (nsCSSRuleProcessor.cpp:2261)
 int EnumRulesMatching<AnonBoxRuleProcessorData>(nsIStyleRuleProcessor*, void*) (nsStyleSet.cpp:475)

The line numbers are a bit screwy because of all the inlining, but you can work out what's going on if you squint a bit.


> Does "cumulative" mean "total rounding-up of all allocations including ones
> that have already been deallocated since then" in this case?

Sorry, I wasn't very clear.  For every heap allocation, I print out a line showing the requested size and the actual size.  But I don't do that for deallocations (because the requested size has been lost).  So, to answer your question:  yes.


(In reply to Justin Lebar [:jlebar] from comment #5)
> > But I do think this is worth fixing, 
> 
> If only because, as you pointed out in another bug, this causes unnecessary
> churn in the allocator.

I realized that DMD's output could tell me how much of this waste is long-lived.  For a (64-bit) run where I opened 5 Gmail tabs, we're wasting a shade over 1MB.  And it's spread out in lots of 1KB and 2KB chunks, which is the worst-case scenario (because we have lots of under-utilized pages).  Definitely worth fixing, IMHO.

(It's also worth noting that the waste only gets bad when the arrays are at least 512B in size;  below that jemalloc's quantum size is 16 bytes so any waste from rounding is proportionally much less.)
> I'm seeing lots of stack traces like this:

OK, so I'm assuming this is the array in the RuleByWeightEntry.  That's one of the ones I looked at in bug 681755; you can see the log for it at https://bug681755.bugzilla.mozilla.org/attachment.cgi?id=555536 (the first line is the total number of arrays seen there; each subsequent line has a count followed by a length, sorted by length).  Bug 681755 comment 8 mentions that a small TArray would be good here, but there's a _very_ long tail...  Each entry in this array is two words, which on a 64-bit system is 16 bytes, so once we're over 32 entries we start running into jemalloc's non-quantized allocations.  That log shows plenty of stuff over 32 entries.  ;)  Not unexpected, given what's in these arrays (all CSS rule+selector pairs with a given weight).

The one piece of good news is that this is a transient data structure; it goes away before RefreshRuleCascade returns.

The other good news is that we don't actually _need_ an array here.  We just need some sort of order-preserving (in the sense that we can add stuff to it and then treat it as a FIFO) storage that we can add a bunch of things to, then go through them in order.  We add once, then go through once.  Ideally both would be reasonably fast, of course.  So if there is a better data structure to use here, or any other change that would improve something useful, it's totally doable.  Again, I'm not sure how much of a concern things are here given that this is a transient data structure.
I have patches for this which I'll post soon.
Assignee: nobody → justin.lebar+bug
OS: Linux → All
Hardware: x86_64 → All
Attached patch Patch v1 (obsolete) — Splinter Review
Attached patch Part 2. v1 (obsolete) — Splinter Review
Hm.  This is causing build errors on Android because

 * The patch requires nsTArray<E> to know sizeof(E) so we can statically know the denominator of a division in the TArray code when we resize the array to a power of 2.  (sizeof(E) is usually a power of 2, so it's easy to divide by; but if we don't template on sizeof(E), we'll end up doing a much slower division op.)

 * FontListEntry is an IPDL-defined type (defined in PContent.ipdl)
 * gfxAndroidPlatform.h creates nsTArray<FontListEntry>
 * Therefore gfxAndroidPlatform.h needs to include PContent.h; forward-declaring class FontListEntry doesn't cut it.
 * If you include PContent.h, you have to do so before prtypes.h.
 * Now every file which includes gfxPlatform.h, which includes gfxAndroidPlatform.h, needs to include gfxPlatform.h as the first include.

I'll fix up the headers, but I hope we don't mind silly ordering requirements like this.
Actually, this isn't so bad; gfxPlatform.cpp includes gfxAndroidPlatform.h, but gfxPlatform.h doesn't.  So the weird ordering only applies to files which include gfxAndroidPlatform.h.
Attached patch Part 1. v2 (obsolete) — Splinter Review
Attachment #561267 - Flags: review?(bzbarsky)
Attachment #560940 - Attachment is obsolete: true
Attached patch Part 2 (fix up code). v2 (obsolete) — Splinter Review
Attachment #561268 - Flags: review?(bzbarsky)
Attachment #560942 - Attachment is obsolete: true
I'm really not a great reviewer for this, esp. if you want it fast.  Can you pick on someone who's actually read the tarray guts carefully before?
Whiteboard: [MemShrink][clownshoes] → [MemShrink:P2][clownshoes]
Comment on attachment 561267 [details] [diff] [review]
Part 1. v2

Roc, do you want to do the honors?
Attachment #561267 - Flags: review?(bzbarsky) → review?(roc)
Attachment #561268 - Flags: review?(bzbarsky) → review?(roc)
Are we making ElemSize a template parameter of nsTArray_base solely to avoid an integer division when allocating? That seems unnecessary.
(In reply to Robert O'Callahan (:roc) (Mozilla Corporation) from comment #17)
> Are we making ElemSize a template parameter of nsTArray_base solely to avoid
> an integer division when allocating? That seems unnecessary.

I was worried about ARM's software division.  I figured its cost might be comparable to a lock-free malloc path, but I have absolutely no data to back that up.
I had no idea integer division required software on ARM. There are lots of places we do integer division...

Apparently there's code to do division in less than 45 cycles:
http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0378c/Chdifaha.html
and apparently the worst-case for the default algorithm is less than 100 cycles. I doubt that dominates nsTArray allocation. Let's just go with the simple code.
Isn't the whole point of nsTArray_base to be the code that can be compiled just once, so that the templates don't bloat code size?
Yes, although you could argue that making the size the template parameter has less code-size impact than using the full type, since lots of types have the same size.
(In reply to Robert O'Callahan (:roc) (Mozilla Corporation) from comment #21)
> Yes, although you could argue that making the size the template parameter
> has less code-size impact than using the full type, since lots of types have
> the same size.

And indeed that's what I saw in codesighs; I should have mentioned that here.

I'll rework the patch to use a true division.
Attached patch Patch v3Splinter Review
Attachment #561356 - Flags: review?(roc)
We don't need part 2 anymore, I assume.
Attachment #561267 - Attachment is obsolete: true
Attachment #561267 - Flags: review?(roc)
Attachment #561268 - Attachment is obsolete: true
Attachment #561268 - Flags: review?(roc)
Target Milestone: --- → mozilla9
Hm, comment didn't go through last night, it appears.

> We don't need part 2 anymore, I assume.

Correct.

Inbound: https://hg.mozilla.org/integration/mozilla-inbound/rev/069e927983a9

This doesn't appear to have moved any of our benchmarks, memory or CPU.  That's not surprising, given that the amount of wasted space is usually small, both as a fraction of a single array's size and as a fraction of the sum of all arrays' sizes.  I doubt there's a large effect on fragmentation, but even if there were, our tests don't test for that.
Whiteboard: [MemShrink:P2][clownshoes] → [MemShrink:P2][clownshoes][inbound]
https://hg.mozilla.org/mozilla-central/rev/069e927983a9
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Whiteboard: [MemShrink:P2][clownshoes][inbound] → [MemShrink:P2][clownshoes]
Comment on attachment 561356 [details] [diff] [review]
Patch v3

>+  const PRUint32 pageSizeBytes = 12;
>+  const PRUint32 pageSize = 1 << pageSizeBytes;
>+
>+  PRUint32 minBytes = capacity * elemSize + sizeof(Header);
>+  PRUint32 bytesToAlloc;
>+  if (minBytes >= pageSize) {
>+    // Round up to the next multiple of pageSize.
>+    bytesToAlloc = pageSize * ((minBytes + pageSize - 1) / pageSize);
>+  }
>+  else {
>+    // Round up to the next power of two.  See
>+    // http://graphics.stanford.edu/~seander/bithacks.html
>+    bytesToAlloc = minBytes - 1;
>+    bytesToAlloc |= bytesToAlloc >> 1;
>+    bytesToAlloc |= bytesToAlloc >> 2;
>+    bytesToAlloc |= bytesToAlloc >> 4;
>+    bytesToAlloc |= bytesToAlloc >> 8;
>+    bytesToAlloc |= bytesToAlloc >> 16;
I wonder whether the compiler is clever enough to notice that bytesToAlloc >> 16 is always zero because bytesToAlloc < pageSize.
If minBytes == 0, |minBytes - 1| will underflow, and then the >> 16 matters.  And you can't prove that minBytes > 0, because |capacity * elemSize + sizeof(Header)| might overflow to 0.

If I take out the minBytes - 1, then gcc optimizes the extra shift away.
What does it do if you special-case minBytes == 0 to return 1? The branch itself should be very predictable, given that it should never evaluate as true (though it's technically correct, as the nearest integer power-of-two for 0 is 1).

By the way, I assume there's a bytesToAlloc++; at the end of that sequence? Otherwise it'll always return 2^n - 1.
To be clear, I'm not particularly interested in saving the one left-shift extra instruction.  One would need to go after the division first; see comment 18.
This may have caused TP5 private bytes to move on Linux x86 and Linux x64.  Strangely enough, the x86 and x64 plots switched places.

If graphserver [1] is to be believed, this happened immediately after a NPOTB push [2]!

I'm investigating...

[1] http://graphs-new.mozilla.org/graph.html#tests=[[90,63,1],[90,63,12],[90,63,15],[90,63,13],[90,63,14]]&sel=none&displayrange=7&datatype=running
[2] http://hg.mozilla.org/integration/mozilla-inbound/rev/c7233b484b95
Apparently this range straddles the Flash upgrade releng did yesterday.

I'll push to try with this patch backed out, but I suspect it's OK.
(In reply to Justin Lebar [:jlebar] from comment #30)
> To be clear, I'm not particularly interested in saving the one left-shift
> extra instruction.  One would need to go after the division first; see
> comment 18.

I guess you could avoid the division by doing something like

bytesToAlloc = ((minBytes - 1) & ~(pageSize - 1)) + pageSize;
The division by pageSize is optimized to that code.  It's the division by elemSize that the compiler can't optimize.