589905 - Investigate remote-prefs performance

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Description

•

14 years ago

Attached patch Dumb instrumentation of OOP prefs — Details — Splinter Review

When looking at IPC logs a while back, I noticed that we fetch quite a few prefs during startup+creating <browser>+loading page, something like 900 IIRC.  I added some dumb instrumentation to the prefs code (attached patch), and ran a little test that loaded the browser, navigated to google.com, then shut down the browser.  The entire test took 6-7secs, and the instrumentation said we spent about 1sec in prefs code.  This setup is flawed for a lot of reasons, but 1sec seems high enough to investigate further.

I also instrumented to see how many requests were unique to see if caching would be worthwhile.  I think the instrumentation wasn't quite right, but out of ~1200 requests during start-load-shutdown, ~600 were unique, so caching won't much help startup perf.

Since we seem to have a total of ~1000 prefs, it seems like a good impl would be for the content to load or receive all the pref values at once, then add a listener for a change to any pref (on the assumption that pref changes are relatively rare).  As a stop-gap fix, we could have the content process just read in the pref file directly.  A better impl would be to serialize the DB and send it from chrome->content.  Best would probably be to map the pref file into shmem somehow, and send a ref to shmem to the content process, but bsmedberg tells me that prefs are in a JAR now, so the simple mmap() impl isn't going to cut the mustard.

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 1

•

14 years ago

This is a nice-to-have for fennec 2.0b1, but we need better profiling to know if blocking? and if so, what status, is appropriate.

dwitte@gmail.com

Updated

•

14 years ago

Assignee: nobody → dwitte

dwitte@gmail.com

Updated

•

14 years ago

Blocks: 592666

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 2

•

14 years ago

Knowing how much this affects us in the field is definitely a blocker.  If it doesn't affect us, optimizing isn't a blocker.  If it is, then optimizing is a blocker.

tracking-fennec: --- → ?

dwitte@gmail.com

Comment 3

•

14 years ago

-> dougt per request.

Assignee: dwitte → doug.turner

Doug Turner (:dougt)

Assignee

Comment 4

•

14 years ago

Attached patch patch v.1 — Details — Splinter Review

first pass at remoting all preference in one go.

1) removes existing e10s preferences api

2) adds new sync "readprefs" api that returns all of the preferences as a string.  This string is basically what we would write out to disk if we were flushing preferences.

3) adds a new api to the internal interface of the preferences service called "serializePreferences".  This enumerates the preferences hash table and returns a string (basically what we would write to disk...) of both user and default perfs.

4) added a new save option for pref_savePref to allow us get access to the default preferences (things that wouldn't normally be written to the user preferences file.

It is a pretty small change.. However, it is untested and we need to get some measurements on how it performs.  I am uploading it now to get some early feedback.

Attachment #477415 - Flags: feedback?

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Attachment #477415 - Flags: feedback? → feedback?(dwitte)

Doug Turner (:dougt)

Assignee

Comment 5

•

14 years ago

Attached patch tests patches (obsolete) — Details — Splinter Review

Attachment #477737 - Flags: review?

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Attachment #477737 - Flags: review? → review?(dwitte)

Doug Turner (:dougt)

Assignee

Comment 6

•

14 years ago

Looks like it works as expected.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 7

•

14 years ago

I guess you need to propagate dynamic pref changes in some manner?

Doug Turner (:dougt)

Assignee

Comment 8

•

14 years ago

roc - yeah, that already exists, check out:

http://mxr.mozilla.org/mozilla-central/source/modules/libpref/src/nsPrefBranch.cpp#751

and

http://mxr.mozilla.org/mozilla-central/source/dom/ipc/ContentChild.cpp#82

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 9

•

14 years ago

Wait ... if we're going to load the entire pref DB in the content process, why not just re-use all the existing pref listener machinery and drop the code in ContentChild?  We could set up a single listener for the entire content process, that listens to all pref changes and updates the content-process DB.

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 10

•

14 years ago

If we don't do that, then the content-process pref and chrome-process DBs will get out of sync.

Doug Turner (:dougt)

Assignee

Comment 11

•

14 years ago

we did.  The only code that i left was: the stuff to prevent write access in the child, and the code that allow the child process to get notifications.

on synchronization:

when the child process starts, we hand off a serialization of the db.  from that point forward, any preference changes in the parent are reflected to the child via the remote observer.  The shouldn't be any sync problems.

Maybe I shouldnt have put those mxr links into this bug as it may have caused confusion.  Please check out the first wip patch, and let me know if there is an issue.

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 12

•

14 years ago

OK, cool.  The links threw me off, yeah.

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Blocks: 588050

Benjamin Smedberg

Comment 13

•

14 years ago

I really don't think serializing all this as a string makes a lot of sense. Can we use IPDL array/union types to do it instead? That should be less expensive in terms of memory and have less chance of serialization/parsing errors.

Doug Turner (:dougt)

Assignee

Comment 14

•

14 years ago

yeah, not very proud of that.

I so not think there is going to be any serialization/parsing errors -- this is the same format as what is written to the disk.  Do we have serialization/parsing errors now?

I'd hate to write more code to serialize since libpref already does this just fine and has been used for a very long time.

however, we probably can optimize by preallocating a buffer, instead.  I can put up another patch that changes the serialization to something less allocatey... however, lets get dwitte (or someone else) review this patch as-is.

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Attachment #477415 - Flags: feedback?(dwitte) → review?(dwitte)

dwitte@gmail.com

Comment 15

•

14 years ago

Comment on attachment 477415 [details] [diff] [review]
patch v.1

>diff --git a/modules/libpref/public/nsIPrefService.idl b/modules/libpref/public/nsIPrefService.idl

>+  string serializePreferences();

Make this return an ACString? No need to throw around extra allocations with char*.

>diff --git a/modules/libpref/src/nsPrefBranch.cpp b/modules/libpref/src/nsPrefBranch.cpp

> NS_IMETHODIMP nsPrefBranch::PrefIsLocked(const char *aPrefName, PRBool *_retval)
> {

>+  if (GetContentChild()) {
>+    NS_ERROR("cannot check lock pref from content process");
>+    return NS_ERROR_NOT_AVAILABLE;

Hmm, howcome? Is this going to be a problem?

>+    ContentChild* cpc = ContentChild::GetSingleton();
>+    nsCAutoString prefs;
>+    cpc->SendReadPrefs(&prefs);
>+
>+    PrefParseState ps;
>+    PREF_InitParseState(&ps, PREF_ReaderCallback, NULL);
>+    nsresult rv = PREF_ParseBuf(&ps, prefs.get(), prefs.Length());
>+    PREF_FinalizeParseState(&ps);

OK for now, but file a followup to serialize this directly as a hash? Doesn't need to block beta, but maybe final.

>+NS_IMETHODIMP nsPrefService::SerializePreferences(char** aPrefsOut)
>+{
>+  nsCAutoString prefs;

Yeah, just use an nsACString& arg, and ditch the intermediary.

>+  char** valueArray = (char **)PR_Calloc(sizeof(char *), gHashTable.entryCount);

new char*[gHashTable.entryCount]?

>+  if (!valueArray)
>+    return NS_ERROR_OUT_OF_MEMORY;

Infallible new.

>+  char** walker = valueArray;
>+  for (PRUint32 valueIdx = 0; valueIdx < gHashTable.entryCount; valueIdx++, walker++) {
>+    if (*walker) {

Should probably assert that it's nonnull, and not bother release-build checking.

>+  PR_Free(valueArray);

delete[] valueArray?

> PLDHashOperator
> pref_savePref(PLDHashTable *table, PLDHashEntryHdr *heh, PRUint32 i, void *arg)
> {

>+    if (!pref->key) {
>+        return PL_DHASH_NEXT;
>+    }

No braces, per file style.

I don't understand why this would happen with your patch but not without. Are you just being safe or is there a reason? If no reason, probably shouldn't change it. This is stable code. ;)

r=dwitte with followup bug.

Attachment #477415 - Flags: review?(dwitte) → review+

dwitte@gmail.com

Comment 16

•

14 years ago

Comment on attachment 477737 [details] [diff] [review]
tests patches

>diff --git a/gfx/layers/ThebesLayerBuffer.cpp b/gfx/layers/ThebesLayerBuffer.cpp

>+  aTarget->ResetClip();

Probably not, right? :)

>diff --git a/modules/libpref/test/unit_ipc/test_simpleIPC.js b/modules/libpref/test/unit_ipc/test_simpleIPC.js

Need lots more tests here -- observers especially. Can you copy/paste the code from the existing xpcshell test and adapt the relevant bits?

Canceling request pending new patch.

Attachment #477737 - Flags: review?(dwitte)

Shawn Wilsher :sdwilsh

Comment 17

•

14 years ago

Any chance this could be done on top of the patch in blocker bug 580790 so I don't have to unbitrot it from ipc changes again?

Doug Turner (:dougt)

Assignee

Comment 18

•

14 years ago

Feel free

Brad Lassey [:blassey] (use needinfo?)

Updated

•

14 years ago

tracking-fennec: ? → 2.0b2+

Whiteboard: [fennec-checkin-postb1]

dwitte@gmail.com

Comment 19

•

14 years ago

I'm not sure what [fennec-checkin-postb1] means, but if it means what I think it means, this isn't ready. It needs tests which aren't finished yet.

Doug Turner (:dougt)

Assignee

Comment 20

•

14 years ago

[fennec-checkin-postb1] doesn't apply to this bug.

Whiteboard: [fennec-checkin-postb1]

Brad Lassey [:blassey] (use needinfo?)

Comment 21

•

14 years ago

(In reply to comment #20)
> [fennec-checkin-postb1] doesn't apply to this bug.

why?

Doug Turner (:dougt)

Assignee

Comment 22

•

14 years ago

(1) the patch needs to be cleaned up before pushing -- see the comments.
(2) the tests need to be more comprehensive.

I suppose it will be checked in post b1 (just like everything not on the b1 blocker list), but I don't want anyone seeing that flag and pulling the trigger on this before it is complete.

Doug Turner (:dougt)

Assignee

Comment 23

•

14 years ago

Dan, thanks for the review:

Regarding lock pref - since we aren't setting from the child, i didn't think locking was important.  I took a quick look and didn't see any actual users of it.

The walking code was cribbed from else where in this file, so it might be better to convert both, or leave this one as is.  The point being consistency.

I will file a follow up to address the serialization.  We can do better, but will be shadowed by the gain by simply caching.

dwitte@gmail.com

Comment 24

•

14 years ago

(In reply to comment #23)
> Regarding lock pref - since we aren't setting from the child, i didn't think
> locking was important.  I took a quick look and didn't see any actual users of
> it.

Fair enough; as long as it throws I'm fine with that.

> The walking code was cribbed from else where in this file, so it might be
> better to convert both, or leave this one as is.  The point being consistency.

OK, fine as it is.

> I will file a follow up to address the serialization.  We can do better, but
> will be shadowed by the gain by simply caching.

Yeah. Maybe azakai can do a quick test of how long the serialization is taking (just wrap the parent walking code in PR_Now()'s, and the child reading code) as an upper bound on how much better we could make it? If it's significant we can final+ it.

Alon Zakai (:azakai)

Comment 25

•

14 years ago

(In reply to comment #24)
> Yeah. Maybe azakai can do a quick test of how long the serialization is taking
> (just wrap the parent walking code in PR_Now()'s, and the child reading code)
> as an upper bound on how much better we could make it? If it's significant we
> can final+ it.

For the entire serialization sync message + deserialization after, looks like 90ms in an optimized build, which is 2.8% of the entire time it takes to load cnn.com.

dwitte@gmail.com

Comment 26

•

14 years ago

So that's including the sync message time? What if we take that out? (We're going to be paying that no matter what.)

Doug Turner (:dougt)

Assignee

Comment 27

•

14 years ago

in fennec, we startup the content process asynchronously when we startup.

Alon Zakai (:azakai)

Comment 28

•

14 years ago

The overhead of the sync message itself? That is close to negligible here, a few ms or less.

dwitte@gmail.com

Comment 29

•

14 years ago

So the time just for serialization + deserialization is 85 - 90ms? That's significant. If 5% is a blocker, we should probably block final on 2.8% as well. :)

Doug Turner (:dougt)

Assignee

Comment 30

•

14 years ago

Attached patch test patch v.2 — Details — Splinter Review

this tests preferences set after the child is created.

dan - many of the tests in test_libPrefs are not designed to work in the child.  For example, all of the set/lock preferences do not work.  This ipc test tests the bare minimum that is required of preferences in fennec. It will test the 3 basic types of preferences set before the child is created.  It will also test the same times set after the child is created as you suggested (good idea).

Also, it is fair to note, that the pattern i used to test in the child probably could be factored out.  It allows you to just run a "verification" function in the child process without having to have a separate file.

Attachment #477737 - Attachment is obsolete: true

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Attachment #478450 - Flags: review?(jmaher)

dwitte@gmail.com

Comment 31

•

14 years ago

Comment on attachment 478450 [details] [diff] [review]
test patch v.2

Nice, the observer tests look good.

Doug Turner (:dougt)

Assignee

Comment 32

•

14 years ago

create 599545 to follow up on serialization perf.

dwitte@gmail.com

Comment 33

•

14 years ago

Comment on attachment 478450 [details] [diff] [review]
test patch v.2

>+function verify_existing_prefs() {
>+  const Ci = Components.interfaces;
>+  const Cc = Components.classes;
>+  dump("verify_existing_prefs\n");

Extraneous?

>+function verify_observed_prefs() {
>+  const Ci = Components.interfaces;
>+  const Cc = Components.classes;
>+  dump("verify_observed_prefs\n");

Here too.

r=dwitte

Attachment #478450 - Flags: review?(jmaher) → review+

Doug Turner (:dougt)

Assignee

Comment 34

•

14 years ago

the context that those two methods are being executing in doesn't have those defined.

dwitte@gmail.com

Comment 35

•

14 years ago

I mean the dump. :)

Doug Turner (:dougt)

Assignee

Comment 36

•

14 years ago

then you would be correct!  I'll remove.

Stuart Parmenter

Updated

•

14 years ago

tracking-fennec: 2.0b2+ → 2.0b1+

Doug Turner (:dougt)

Assignee

Comment 37

•

14 years ago

http://hg.mozilla.org/mozilla-central/rev/f584b2db2a7e
http://hg.mozilla.org/mozilla-central/rev/a7300f9d11ae

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 38

•

14 years ago

The old pref observer code in ContentChild.cpp that I thought this patch would obsolete is still around.  Any reason for that?

Doug Turner (:dougt)

Assignee

Comment 39

•

14 years ago

yes, it needs to stay.  We serialize the entire preferences hashtable, then send it to the child process.  After this point, for all preference changes in the parent, we will notify the child using the existing "old pref observer code" in the ContentChild.

Do you see a way for us to remove this?

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Reporter

Comment 40

•

14 years ago

Looks like honest-to-goodness dead code to me: http://mxr.mozilla.org/mozilla-central/search?string=addremoteprefobserver

Josh Matthews [:jdm]

Updated

•

14 years ago

Blocks: 599900

Doug Turner (:dougt)

Assignee

Updated

•

14 years ago

Blocks: 600110

Doug Turner (:dougt)

Assignee

Comment 41

•

14 years ago

i'll remove in bug 600110!

dwitte@gmail.com

Updated

•

14 years ago

Blocks: 599545

Dumb instrumentation of OOP prefs 14 years ago Chris Jones [:cjones] inactive; ni?/f?/r? if you need me 6.88 KB, patch		Details \| Diff \| Splinter Review
patch v.1 14 years ago Doug Turner (:dougt) 24.76 KB, patch	dwitte : review+	Details \| Diff \| Splinter Review
tests patches 14 years ago Doug Turner (:dougt) 2.18 KB, patch		Details \| Diff \| Splinter Review
test patch v.2 14 years ago Doug Turner (:dougt) 2.64 KB, patch	dwitte : review+	Details \| Diff \| Splinter Review