631035 - Tinderbox log sends memory usage through the roof

Reporter

Description

•

14 years ago

Viewing a specific full tinderbox log sent my memory usage shooting up to >1.5gb. About:memory showed no obvious culprits on a repeat run which only pushed it to >1gb: Overview Memory mapped: 1,223,688,192 Memory in use: 1,202,889,248 Other Information malloc/allocated 1,202,891,752 malloc/mapped 1,223,688,192 malloc/committed 1,223,688,192 malloc/dirty 249,856 js/gc-heap 37,748,736 js/string-data 7,567,536 js/mjit-code 21,001,594 storage/sqlite/pagecache 33,448,616 storage/sqlite/other 1,286,888 images/chrome/used/raw 0 images/chrome/used/uncompressed 125,644 images/chrome/unused/raw 0 images/chrome/unused/uncompressed 0 images/content/used/raw 694,316 images/content/used/uncompressed 3,686,000 images/content/unused/raw 0 images/content/unused/uncompressed 0 layout/all 39,504,685 layout/bidi1, 916 gfx/surface/image 91,816

Josh Matthews [:jdm]

Reporter

Updated

•

14 years ago

blocking2.0: --- → ?

Boris Zbarsky [:bzbarsky]

Comment 1

•

14 years ago

There's clearly _something_ that's using a bunch of memory here that's not listed in that output. The DOM here contains a 30MB textnode. That's not included in the above breakdown. It might be 60MB if there's non-ASCII text in it. I _think_ layout/all is supposed to include the frame tree and style data and such. That might be plausible; if there's no bidi in that file (which may be an incorrect assumption) there should be at least 20.5MB of text frames on a 32-bit system. Were the numbers above 32 or 64 bit? That still leaves us a few hundred MB short, right?

Josh Matthews [:jdm]

Reporter

Updated

•

14 years ago

Summary: Tinderbox log send memory usage through the roof → Tinderbox log sends memory usage through the roof

Josh Matthews [:jdm]

Reporter

Comment 2

•

14 years ago

These are 32bit numbers, fwiw.

Nicholas Nethercote [inactive]

Comment 3

•

14 years ago

What's the URL of the tinderbox log? I tried one and it only went up to 560MB and I had several other tabs open already.

Josh Matthews [:jdm]

Reporter

Comment 4

•

14 years ago

See the URL field. Also, http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1296696733.1296700907.12147.gz&fulltext=1 . The best candidates for the hogging logs in my experience is ones containing process crashes for some reason.

Nicholas Nethercote [inactive]

Comment 5

•

14 years ago

On my Mac, I see 956 MB (mapped) to start with, dropping down to 540 MB. On my Linux64 box, I see 460 MB dropping down to 262 MB.

Boris Zbarsky [:bzbarsky]

Comment 6

•

14 years ago

262mb should be totally believable. We should get bugs filed to log the amounts of DOM and textrun data into about:memory.

Nicholas Nethercote [inactive]

Comment 7

•

14 years ago

Attached file Massif results — Details

These are the ones worth looking at: 17.15% (361,126,656B) gfxTextRun::AllocateStorage(void const*&, unsigned int, unsigned int) (mozalloc.h:247) 12.27% (258,379,116B) nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity(unsigned int, unsigned int) (nsTArray.h:88) 11.55% (243,269,632B) gfxTextRun::CopyGlyphDataFrom(gfxTextRun*, unsigned int, unsigned int, unsigned int, int) (mozalloc.h:241) 08.03% (169,025,536B) PresShell::AllocateFrame(nsQueryFrame::FrameIID, unsigned long) (nsPresShell.cpp:2098) 06.26% (131,864,700B) nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity(unsigned int, unsigned int) (nsTArray.h:84) 05.48% (115,343,360B) nsTextFragment::Append(unsigned short const*, unsigned int) (nsMemory.h:71) 03.59% (75,497,472B) nsHtml5TreeBuilder::accumulateCharacters(unsigned short const*, int, int) (mozalloc.h:241) 03.23% (67,956,228B) nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity(unsigned int, unsigned int) (nsTArray.h:84) You can look at the attached file for the full results, which includes the call stacks that each of the above functions appeared in. Looks like all the space is being used by reflow.

Boris Zbarsky [:bzbarsky]

Comment 8

•

14 years ago

gfxTextRun::AllocateStorage is obvious. nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity is almost all the text run's BreakAndMeasureText calling GetTabWidths. gfxTextRun::CopyGlyphDataFrom is also clear. PresShell::AllocateFrame is all continuing textframes (that should be the 20+MB; that being 8% matches njn's numbers). nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity is the same as the previous one, in GetTabWidths (so in total GetTabWidths is 18.5% here). nsTextFragment::Append is the DOM node storage for that 30 or 60 MB string; I'm a little surprised this is smaller than the text frame amount, actually, unless there's a _bunch_ of bidi stuff involved. nsHtml5TreeBuilder::accumulateCharacters should be a transient allocation, I wouls hope. nsTArray_base<nsTArrayDefaultAllocator>::EnsureCapacity is GetAdjustedSpacingArray, under textrun text measurement. njn, are those total allocations or what's still live at the end?

Nicholas Nethercote [inactive]

Comment 9

•

14 years ago

Those numbers are from the point of peak memory usage. You can see in the attached file that Massif took 82 snapshots, only one of which (the peak) has the full details. I didn't wait around for things to subside, though the graph shows that memory usage had dropped a bit between the page load finishing and me exiting.

Boris Zbarsky [:bzbarsky]

Comment 10

•

14 years ago

Ah, ok. We cache textruns, but evict on a timer.

Nicholas Nethercote [inactive]

Comment 11

•

14 years ago

bz, sounds like there's nothing surprising you here. Is that right?

Boris Zbarsky [:bzbarsky]

Comment 12

•

14 years ago

Well, the size of the textrun allocations is a bit surprising to me. It looks like the textrun stuff taken all together is about 10x bigger than the original text. On the other hand, if this is an incremental load, we might have been throwing away and rebuilding textruns as we go... The other things in comment 7 are about as expected, yes.

Josh Matthews [:jdm]

Reporter

Comment 13

•

14 years ago

I hope I'm not misunderstanding, but sitting around waiting for 30 seconds would show me a drop of ~300mb, bringing me down to about 1GB on the larger runs. I didn't see any other significant memory changes after that until I closed the tab, at which point everything went back to reasonable levels.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 14

•

14 years ago

Textruns make a copy of the text (1 or 2 bytes per character) and store at least 4 bytes per character of glyph position data. So textrun storage in AllocateStorage being 3x the DOM storage is expected, which is about what we see in comment #7. As Boris said, it's a time-based cache with a 30s timeout. It's very hard to squeeze more out of the glyph position data, that's already packed very carefully. With significant work we could probably eliminate the text copy for preformatted text at least --- doable, but not easy. I guess because we have really large textruns we are likely encounter some weird characters that trigger a "detailed glyph" record, so in gfxTextRun::CopyGlyphDataFrom we create an mDetailedGlyphs array --- one pointer per character --- so assuming 32bit that's twice the text memory, again looking about right in comment #7. We could use arraylets or something similar here to save almost all the memory without much effort, in the case where you have a few non-ASCII characters in a sea of ASCII (common). GetTabWidths looks like huge, huge fail on my part. I basically optimized for the case where there are no preformatted tabs and didn't think about what happens once there's even one preformatted tab. Loading one sample tbox log I see exactly one preformatted tab, naturally :-). 1) mTabWidths is an array of gfxFloat, i.e. doubles, when it should be float (or maybe int). 2) We allocate a tabwidths array for each textframe belonging to a textrun with a preformatted tab, whether the text in that frame has a tab or not. (i.e. every text frame in this example) We can easily avoid allocating an array for the textframes (think lines) that have no tab. That would basically make those allocations go away for this workload. Right now it's 8 bytes per char!

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 15

•

14 years ago

Also 3) The tabwidths arrays never ever go away. That probably explains what Josh observes in comment #13. We could fix that but once we've fixed #2 it won't matter for this workload. I suspect the HTML5 parser made these bugs worse because it puts all the text into a single text node, which makes us get giant textruns.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 16

•

14 years ago

I'll try to find someone to fix the second and third paragraphs of comment #14.

Robert O'Callahan (:roc) (email my personal email if necessary)

Updated

•

14 years ago

Assignee: nobody → jfkthame

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 17

•

14 years ago

More precisely, what I'm suggesting for mDetailedGlyphs is making it an nsAutoArrayPtr<nsAutoArrayPtr<DetailedGlyph>>, where each inner array is at most 256 entries long (or some other power of 2). For GetTabWidths, we should switch it from gfxFloat to float. Then we should treat absence of a TabWidthProperty() as meaning "all zeroes". Then GetTabWidths would only allocate an array and set the property if it finds a tab character in the text (otherwise the array entries must be all zero).

Massif results 14 years ago Nicholas Nethercote [inactive] 102.28 KB, text/plain		Details
patch, optimize storage of DetailedGlyph records in gfxTextRun 14 years ago Jonathan Kew [:jfkthame] 40.68 KB, patch		Details \| Diff \| Splinter Review
patch, v2 - optimize storage of DetailedGlyph records 14 years ago Jonathan Kew [:jfkthame] 37.72 KB, patch	roc : review+	Details \| Diff \| Splinter Review
part 2 v1 - optimize storage of tab widths 14 years ago Jonathan Kew [:jfkthame] 7.85 KB, patch		Details \| Diff \| Splinter Review
part 2, v2 - optimize storage of tab widths, updated as per comments 14 years ago Jonathan Kew [:jfkthame] 8.87 KB, patch		Details \| Diff \| Splinter Review
reftest for changing the tab width 14 years ago Jonathan Kew [:jfkthame] 1.76 KB, patch		Details \| Diff \| Splinter Review
part 2, v3 - optimize storage of tab widths, updated as per comment 43 14 years ago Jonathan Kew [:jfkthame] 10.92 KB, patch	roc : review+ roc : approval2.0+	Details \| Diff \| Splinter Review
reftests for changing tab widths 14 years ago Jonathan Kew [:jfkthame] 4.89 KB, patch	roc : review+	Details \| Diff \| Splinter Review
Massif results on EnsureCapacity 14 years ago Nicholas Nethercote [inactive] 17.20 KB, application/octet-stream		Details