678940 - [Linux] Firefox 6 consumes a lot of system memory just after start when using layers.acceleration.force-enabled=true

Reporter

Description

•

13 years ago

User Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/535.1 (KHTML, like Gecko) Chrome/13.0.782.112 Safari/535.1 Steps to reproduce: I started firefox normally. My system is Arch Linux - kernel Linux 3.0.1, firefox from testing repository. Actual results: Firefox consumed all of system memory. Showing of dialogs is really slow. Firefox tend to freeze.

rbalent

Reporter

Updated

•

13 years ago

Summary: Firefox 6 consumes all of memory just after start → Firefox 6 consumes all system memory just after start

Matthias Versen [:Matti]

Comment 1

•

13 years ago

Please try it with http://support.mozilla.com/en-US/kb/Safe+Mode

rbalent

Reporter

Comment 2

•

13 years ago

Thanks, it works perfect in safe mode. But where could be the problem? I have all add-ons and plugins disabled in normal mode and it's absolutely unusable.

rbalent

Reporter

Comment 3

•

13 years ago

I created new Firefox profile and it's working properly in normal mode. So this issue is happening only on my profile migrated from Firefox 5.

rbalent

Reporter

Comment 4

•

13 years ago

This two errors are showing just after loading the old profile: Can't find symbol 'glXBindTexImageEXT' Can't find symbol 'glXReleaseTexImageEXT'

Matthias Versen [:Matti]

Comment 5

•

13 years ago

That makes no sense if it works in the safemode and a new profile but not in the normal mode without extensions. Do you really have all extensions disabled ? In that case i would try to to disable the hardware acceleration under tools/options/advanced/general but that doesn't explain why it works with a new profile.

rbalent

Reporter

Comment 6

•

13 years ago

Yes, I am sure I have all extension disabled, but you were right with hardware acceleration. I disabled it and Firefox works properly with my old profile. But it doesn't explain why it works OK with new profile and hardware acceleration enabled. Additional info: I have Intel GMA HD Integrated Graphics and xf86-video-intel 2.15.0 graphic driver from Arch Linux repository.

rbalent

Reporter

Comment 7

•

13 years ago

Menus and bookmark folders are showing slow too. When they are loading, there is a lot I/O operation and it freezes my computer for a few seconds. iotop is showing 100% UI. This is the second symptom. With disabled hardware acceleration, I/O operations are normal.

:aceman

Comment 8

•

13 years ago

Maybe you have some features of HW acceleration forcefully enabled in the old profile, that are not enabled automatically (on linux) in the new profile even when HW accel is enabled. Something like 'layers.acceleration.force-enabled' ?

rbalent

Reporter

Comment 9

•

13 years ago

Nice catch, it was my problem - layers.acceleration.force-enabled. When I set it to false, everything works perfectly.

:aceman

Comment 10

•

13 years ago

I think layers still has problems on Linux therefore it is not enabled by default (see https://wiki.mozilla.org/Blocklisting/Blocked_Graphics_Drivers). I have also seen this (FF using insane amounts of memory with one page loaded) when experimenting with HW accel and layers (ATI card). However I don't know if this is a known problem (I haven't found a bug for it). So let's leave this bug open and I'll do some more tests myself.

Summary: Firefox 6 consumes all system memory just after start → Firefox 6 consumes all system memory just after start when using layers.acceleration.force-enabled

:aceman

Comment 11

•

13 years ago

OK, I can see this too. When forcing layers enabled, on FF 9a1, ATI Radeon HD 4350. Firefox after start with the about:home page open takes 1.2GB resident memory (RES in top). I don't know if that is a real problem, as it is a non-default configuration and maybe still unsupported.

Status: UNCONFIRMED → NEW

Component: General → Graphics

Ever confirmed: true

Product: Firefox → Core

QA Contact: general → thebes

Hardware: x86_64 → All

Ryan S Kingsbury

Comment 12

•

13 years ago

I can confirm this on an Arch Linux x64 system with Intel HD graphics, kernel 3.04 and firefox package from [extra] repository (the standard, stable release). In my case FF didn't use *all* the system memory, but the --heap-unclassified portion was 2.2 GB compared to about 200 MB after disabling the layers.acceleration.force-enabled option. Attaching the output of glxinfo in case it helps

Ryan S Kingsbury

Comment 13

•

13 years ago

Attached file output of glxinfo — Details

:aceman

Comment 14

•

13 years ago

Yes, it does not explicitly take all memory available on the system, it probably just needs to allocate some big number (like 1-2GB, maybe dependent on driver), which incidentally may be all the user has. Even the reporter didn't say it crashed due to OOM or anything, it was just slow (maybe swapping).

Blocks: ogl-linux-beta, 680817

Summary: Firefox 6 consumes all system memory just after start when using layers.acceleration.force-enabled → [Linux] Firefox 6 consumes a lot of system memory just after start when using layers.acceleration.force-enabled=true

Marco Castelluccio [:marco]

Comment 15

•

13 years ago

I think resolving this bug could be helped by reducing heap-unclassified numbers in about:memory.

Depends on: DarkMatter

:aceman

Comment 16

•

13 years ago

Good idea, I'll see what about:memory shows when this happens.

:aceman

Comment 17

•

13 years ago

about:memory just after Firefox start (Firefox 10). Main Process Explicit Allocations 454.65 MB (100.0%) -- explicit ├──424.98 MB (93.47%) -- heap-unclassified ├───23.79 MB (05.23%) -- js │ ├──15.22 MB (03.35%) -- compartment([System Principal], 0xffffffffb3052000) │ │ ├───8.55 MB (01.88%) -- gc-heap │ │ │ ├──4.39 MB (00.97%) -- objects │ │ │ │ ├──2.50 MB (00.55%) -- non-function │ │ │ │ └──1.89 MB (00.42%) -- (1 omitted) │ │ │ └──4.16 MB (00.92%) -- (6 omitted) │ │ └───6.67 MB (01.47%) -- (8 omitted) │ ├───4.57 MB (01.01%) -- (7 omitted) │ └───4.00 MB (00.88%) -- stack ├────3.78 MB (00.83%) -- storage │ └──3.78 MB (00.83%) -- sqlite │ └──3.78 MB (00.83%) -- (10 omitted) └────2.10 MB (00.46%) -- (9 omitted) Other Measurements 0.05 MB -- gfx-surface-image 6.60 MB -- gfx-surface-xlib 437.11 MB -- heap-allocated 438.29 MB -- heap-committed 0.26% -- heap-committed-unallocated-fraction 0.08 MB -- heap-dirty 22.88 MB -- heap-unallocated 2 -- js-compartments-system 1 -- js-compartments-user 11.00 MB -- js-gc-heap 0.17 MB -- js-gc-heap-arena-unused 1.00 MB -- js-gc-heap-chunk-clean-unused 1.77 MB -- js-gc-heap-chunk-dirty-unused 0.00 MB -- js-gc-heap-decommitted 24.50% -- js-gc-heap-unused-fraction 1.14 MB -- js-total-analysis-temporary 1.55 MB -- js-total-mjit 5.26 MB -- js-total-objects 1.94 MB -- js-total-scripts 2.23 MB -- js-total-shapes 3.93 MB -- js-total-strings 0.08 MB -- js-total-type-inference 46 -- page-faults-hard 167,463 -- page-faults-soft 471.43 MB -- resident 673.85 MB -- vsize

Keywords: footprint

Marco Castelluccio [:marco]

Updated

•

13 years ago

Depends on: 598875

Krzysztof Kotlenga

Assignee

Comment 18

•

13 years ago

Attached patch Avoid allocating a huge array by using a hashtable instead (obsolete) — Details — Splinter Review

Something that was not supposed "to become a significant memory issue" (see the comment in the code) unfortunately has become one. The problem was introduced in "Bug 567626 - fix up opengl layers backend". The offending line is mUniformValues.SetLength(maxloc+1); The reason is that maxloc, as returned by fGetUniformLocation() happens to be pretty big number (usually between 300000 and 500000 on my machine). This can easily sum up to over a gigabyte of memory just by running over menus (because a new GL context is being created for every widget and this uniform "cache" is not being shared between them). The attached patch is probably a hack quality (this is my first patch and I'm not really a C++ programmer).

Benoit Jacob [:bjacob] (mostly away)

Comment 19

•

13 years ago

This is worrying indeed. cc'ing people. If maxloc is the max value returned by GetUniformLocation then mUniformValues.SetLength(maxloc+1); is indeed a recipe of uncontrollable large memory usage.

Whiteboard: MemShrink?

Nicholas Nethercote [inactive]

Comment 20

•

13 years ago

Does this only happen when force-enabled is true? Presumably this option is only rarely turned on? I'm just asking because the answers to these questions will help with the MemShrink prioritization. Thanks.

Marco Castelluccio [:marco]

Comment 21

•

13 years ago

Nicholas, force-enabled is the only way to use layers acceleration under Linux for now. So this problem will probably happen when bug 594876 will be resolved (if the video card and the driver in question won't be blacklisted).

:aceman

Comment 22

•

13 years ago

(In reply to Krzysztof Kotlenga from comment #18) > Created attachment 585261 [details] [diff] [review] You need to set a flag on the attachment to request review of it from some Core developer (see https://wiki.mozilla.org/Modules/Core (Graphics)). Set the flag as 'review ?' and his email address. Thanks for looking into this.

Benoit Jacob [:bjacob] (mostly away)

Comment 23

•

13 years ago

(In reply to Marco Castelluccio from comment #21) > Nicholas, force-enabled is the only way to use layers acceleration under > Linux for now. So this problem will probably happen when bug 594876 will be > resolved (if the video card and the driver in question won't be blacklisted). Except that it seems that this bug can also affect all GL layers, not only on Linux. GL layers are default on Mac and soon on Android. The large memory usage only happens when uniform location ids grow large, which is very implementation-dependent (depends on drivers).

Krzysztof Kotlenga

Assignee

Updated

•

13 years ago

Attachment #585261 - Flags: review?(joe)

:aceman

Updated

•

13 years ago

Assignee: nobody → k.kotlenga

Status: NEW → ASSIGNED

Benoit Jacob [:bjacob] (mostly away)

Comment 24

•

13 years ago

The present bug is in code that (IIUC) tries to be an optimization by caching the values of uniforms, so that we don't have to call glGetUniform. What data (bug number?) do we have to support the need for this optimization?

Nicholas Nethercote [inactive]

Updated

•

13 years ago

Whiteboard: MemShrink? → [MemShrink:P2]

Joe Drew (not getting mail)

Comment 25

•

13 years ago

Yeah, I'd much rather just remove the cache altogether. Vlad added this as part of bug 567626, saying it was to "avoid unnecessary state changes", but it was my understanding that the correct way to use OpenGL was to always set stuff, so it would surprise me if this measurably improved performance. (Note that I'm ready to be surprised!)

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Comment 26

•

13 years ago

Well, the correct way to use GL is to do everything in your power to avoid state changes being seen by GL. Many drivers are good at optimizing no-op changes, but they also often assume that you're going to be doing that yourself. Calling glGet *anything* is performance death. Uniforms may be cached client-side by some drivers, but really no code that cares about performance should ever call glGet anything. Changing the value of a uniform may require the entire uniform block to be sent to the GPU with the next draw call, for example; will that be avoided by the driver for a no-op change? Dunno, but do you really want to find out? :)

Joe Drew (not getting mail)

Comment 27

•

13 years ago

Still, I'd like to see numbers. Obviously we want to avoid glGet; I also would like to avoid adding a hash table, since even though that's O(1) there's a big difference between that and just using a pointer. So, I'd like to see numbers with what happens without a uniform cache on a variety of drivers before I r+ this patch.

Benoit Jacob [:bjacob] (mostly away)

Comment 28

•

13 years ago

(In reply to Joe Drew (:JOEDREW!) from comment #27) > So, I'd like to see numbers with what happens without a uniform cache on a > variety of drivers before I r+ this patch. I would suggest that since this patch fixes a potentially large memory consumption bug, it's worth taking as a stopgap solution; but do require filing a followup bug to continue investigating the best solution. Maybe the really good solution, if LayerManagerOGL wants to store and quickly read the values of uniforms, is to store each uniform as a separate data structure and hold them individually, rather than store them all in a single (array or map) data structure and have to look them up by id? Akin to what WebGL does, with the WebGLUniformLocation class.

Benoit Jacob [:bjacob] (mostly away)

Comment 29

•

13 years ago

Erm, except that WebGLUniformLocation doesn't actually store a copy of its value, so that was a bad example. I rather meant something like: struct GLUniformLocation { GLint location; GLenum type; GLsizei size; void *datal };

Joe Drew (not getting mail)

Comment 30

•

13 years ago

Except that'd be even worse, wouldn't it? An array to binary search?

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Comment 31

•

13 years ago

No, because each program that's in use will have a known finite number of uniforms and types. (In fact, I thought I implemented it like that originally? With the value being directly on the uniform? It's fuzzy...) One thing to try would be to just unconditionally do a glUniform call. The number of layers and layer renders is unlikely to be large, so this should probably not have a huge impact on perf.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Comment 32

•

13 years ago

Oh, I did, I see what the issue is (the uniform location vs. index issue that I mentioned in the comment). Hmmm. Using an actual object for each uniform like WebGL would work, but it would complicate the API -- callers would need to set uniforms by object and not by location, and keep a pointer around (with associated memory management complications) to reference it. I would try getting rid of the caching entirely and see what happens. Can be tested with GL layers on on some heavy layer-producing web content -- any difference is likely to be most easily seen on mobile (assuming the baseline perf is ok to begin with), otherwise I'd try under Windows.

Joe Drew (not getting mail)

Comment 33

•

13 years ago

Comment on attachment 585261 [details] [diff] [review] Avoid allocating a huge array by using a hashtable instead Review of attachment 585261 [details] [diff] [review]: ----------------------------------------------------------------- Yeah, ok. Krzysztof, there's nothing wrong with this patch—it looks correct—but I think what we'd rather do to fix this bug is to remove the cache altogether. (We can add different caches, if it's needed, later.) Do you think you'd be able to put that sort of patch together?

Attachment #585261 - Flags: review?(joe) → review-

Krzysztof Kotlenga

Assignee

Comment 34

•

13 years ago

Attached patch Remove caching of uniform values — Details — Splinter Review

Just code removal and minor comment fixes. I don't know how to get some meaningful numbers to compare, so no numbers for now.

Attachment #586005 - Flags: review?(joe)

Joe Drew (not getting mail)

Updated

•

13 years ago

Attachment #585261 - Attachment is obsolete: true

Joe Drew (not getting mail)

Comment 35

•

13 years ago

Comment on attachment 586005 [details] [diff] [review] Remove caching of uniform values Review of attachment 586005 [details] [diff] [review]: ----------------------------------------------------------------- Lovely!

Attachment #586005 - Flags: review?(joe) → review+

Mozilla RelEng Bot

Comment 36

•

13 years ago

Try run for 4a721abc57b1 is complete. Detailed breakdown of the results available here: https://tbpl.mozilla.org/?tree=Try&rev=4a721abc57b1 Results (out of 271 total builds): exception: 2 success: 243 warnings: 24 failure: 2 Builds (or logs if builds failed) available at: http://ftp.mozilla.org/pub/mozilla.org/firefox/try-builds/jdrew@mozilla.com-4a721abc57b1

Mozilla RelEng Bot

Comment 37

•

13 years ago

Try run for 4a721abc57b1 is complete. Detailed breakdown of the results available here: https://tbpl.mozilla.org/?tree=Try&rev=4a721abc57b1 Results (out of 271 total builds): exception: 2 success: 243 warnings: 24 failure: 2 Builds (or logs if builds failed) available at: http://ftp.mozilla.org/pub/mozilla.org/firefox/try-builds/jdrew@mozilla.com-4a721abc57b1

Mozilla RelEng Bot

Comment 38

•

13 years ago

Try run for 4a721abc57b1 is complete. Detailed breakdown of the results available here: https://tbpl.mozilla.org/?tree=Try&rev=4a721abc57b1 Results (out of 271 total builds): exception: 2 success: 243 warnings: 24 failure: 2 Builds (or logs if builds failed) available at: http://ftp.mozilla.org/pub/mozilla.org/firefox/try-builds/jdrew@mozilla.com-4a721abc57b1

Mozilla RelEng Bot

Comment 39

•

13 years ago

Try run for 4a721abc57b1 is complete. Detailed breakdown of the results available here: https://tbpl.mozilla.org/?tree=Try&rev=4a721abc57b1 Results (out of 271 total builds): exception: 2 success: 243 warnings: 24 failure: 2 Builds (or logs if builds failed) available at: http://ftp.mozilla.org/pub/mozilla.org/firefox/try-builds/jdrew@mozilla.com-4a721abc57b1

Mozilla RelEng Bot

Comment 40

•

13 years ago

Try run for 4a721abc57b1 is complete. Detailed breakdown of the results available here: https://tbpl.mozilla.org/?tree=Try&rev=4a721abc57b1 Results (out of 271 total builds): exception: 2 success: 243 warnings: 24 failure: 2 Builds (or logs if builds failed) available at: http://ftp.mozilla.org/pub/mozilla.org/firefox/try-builds/jdrew@mozilla.com-4a721abc57b1

Joe Drew (not getting mail)

Comment 41

•

13 years ago

Those failures all look intermittent/fine. This is ready for checkin.

Keywords: checkin-needed

Dão Gottwald [:dao]

Comment 42

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/a3a0382b5de8

Keywords: checkin-needed

Target Milestone: --- → mozilla12

Ed Morley [:emorley]

Comment 43

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/a3a0382b5de8 Thanks for the patch Krzysztof! Hope to see you on IRC in #developers soon, where we can find you other things to work on if you are interested? :-)

Status: ASSIGNED → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Ed Morley [:emorley]

Comment 44

•

13 years ago

In case you hadn't spotted it, you got a mention here: http://blog.mozilla.com/nnethercote/2012/01/11/memshrink-progress-week-30/ :-)

:aceman

Updated

•

13 years ago

Status: RESOLVED → VERIFIED

Gregory Pappas [:gregp]

Updated

•

2 years ago

Duplicate of this bug: 711171

output of glxinfo 13 years ago Ryan S Kingsbury 12.08 KB, text/plain		Details
Avoid allocating a huge array by using a hashtable instead 13 years ago Krzysztof Kotlenga 5.83 KB, patch	joe : review-	Details \| Diff \| Splinter Review
Remove caching of uniform values 13 years ago Krzysztof Kotlenga 8.00 KB, patch	joe : review+	Details \| Diff \| Splinter Review