871596 - investigate serialization/deserialization IPC overhead

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Description

12 years ago

Our IPC code is directly in the critical path, especially on B2G (e.g. time from finger down to the action being reflected on the screen has IPC right in the middle of it). For some unrelated things, I was looking at the serialization/deserialization code. The higher-level code (e.g. structs etc.) calls down to lower level primitives, which eventually get to code in Pickle (e.g. ReadInt32) and friends. This code isn't in a header file, so likely doesn't get inlined, and each one looks something like:

    bool Pickle::ReadInt32(void** iter, int32_t* result) const {
      DCHECK(iter);
      if (!*iter)
        *iter = const_cast<char*>(payload());

      if (!IteratorHasRoomFor(*iter, sizeof(*result)))
        return false;

      memcpy(result, *iter, sizeof(*result));

      UpdateIter(iter, sizeof(*result));
      return true;
    }

IteratorHasRoomFor() does checks on the current iter pointer, the message header, and the remaining size to make sure that we can read sizeof(*result) from it.

This all seems like a large amount of overhead to me. There's no reason why we shouldn't be able to generate code to directly [de]serialize all the primitive data types straight in the higher-level generated message/struct code for this, including doing one size check early on, instead of calling multiple layers to eventually get to a ReadInt32 above.
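To make the "one size check early on" idea concrete, here is a minimal sketch rather than what the IPDL compiler actually emits. `TouchEvent`, `Reader`, `ReadChecked`, `ReadTouchEventPerField` and `ReadTouchEventOneCheck` are all hypothetical names invented for illustration; the sketch assumes every field has a fixed size and ignores Pickle's header and alignment handling.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical message payload with three fixed-size fields.
struct TouchEvent {
  int32_t x;
  int32_t y;
  uint32_t timeStamp;
};

// Hypothetical cursor over a received payload (not the real Pickle iterator).
struct Reader {
  const char* cur;
  const char* end;
};

// Per-field style: every primitive read repeats the bounds check,
// roughly like the ReadInt32-shaped code quoted above.
inline bool ReadChecked(Reader& r, void* out, size_t n) {
  if (static_cast<size_t>(r.end - r.cur) < n) return false;
  memcpy(out, r.cur, n);
  r.cur += n;
  return true;
}

inline bool ReadTouchEventPerField(Reader& r, TouchEvent* out) {
  return ReadChecked(r, &out->x, sizeof(out->x)) &&
         ReadChecked(r, &out->y, sizeof(out->y)) &&
         ReadChecked(r, &out->timeStamp, sizeof(out->timeStamp));
}

// Single-check style: the wire size is a compile-time constant, so one
// comparison covers every field and the copies run unconditionally.
inline bool ReadTouchEventOneCheck(Reader& r, TouchEvent* out) {
  constexpr size_t kWireSize =
      sizeof(out->x) + sizeof(out->y) + sizeof(out->timeStamp);
  if (static_cast<size_t>(r.end - r.cur) < kWireSize) return false;
  const char* p = r.cur;
  memcpy(&out->x, p, sizeof(out->x));
  p += sizeof(out->x);
  memcpy(&out->y, p, sizeof(out->y));
  p += sizeof(out->y);
  memcpy(&out->timeStamp, p, sizeof(out->timeStamp));
  p += sizeof(out->timeStamp);
  r.cur = p;  // Advance only once the whole struct has been read.
  return true;
}
```

One behavioral difference worth noting: on a short buffer the single-check version leaves the cursor untouched, while the per-field version may already have consumed some fields before failing.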

Chris Jones [:cjones] inactive; ni?/f?/r? if you need me

Comment 1

12 years ago

Word of warning: this code has never shown up on profiles, and my gut intuition is that it's not hot enough to bother with. There are other parts of IPC that are more expensive than this, like the work covered by bug 787363. We would definitely want to start here with a testcase.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 2

12 years ago

Yeah, I haven't seen it in profiles either, which is a good argument for not poking into it. But at the same time, by inspection, it's not how you'd want to do it for performance. Testcases would definitely be good (I may poke, and get a standalone IPC setup going), as would just force-inlining all the Pickle functions and looking at the generated code that the compiler spits out for the higher-level message sending functions.
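As a starting point for such a testcase, a rough in-process timing harness might look like the sketch below. It is only an assumption about what a useful testcase could measure: it exercises deserialization alone, with no actual IPC channel, and `ReadInt32Checked`, `SumPerReadCheck`, `SumSingleCheck`, `MakeBuffer` and `TimeNs` are hypothetical helpers, not Pickle APIs.

```cpp
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical stand-in for a Pickle-style read: one bounds check per value.
static bool ReadInt32Checked(const char*& cur, const char* end, int32_t* out) {
  if (static_cast<size_t>(end - cur) < sizeof(*out)) return false;
  memcpy(out, cur, sizeof(*out));
  cur += sizeof(*out);
  return true;
}

// Sums every int32 in buf, checking the remaining size before each read.
int64_t SumPerReadCheck(const std::vector<char>& buf) {
  const char* cur = buf.data();
  const char* end = cur + buf.size();
  int64_t sum = 0;
  int32_t v;
  while (ReadInt32Checked(cur, end, &v)) sum += v;
  return sum;
}

// Sums every int32 in buf after computing the element count once up front.
int64_t SumSingleCheck(const std::vector<char>& buf) {
  const size_t n = buf.size() / sizeof(int32_t);
  const char* base = buf.data();
  int64_t sum = 0;
  for (size_t i = 0; i < n; ++i) {
    int32_t v;
    memcpy(&v, base + i * sizeof(v), sizeof(v));
    sum += v;
  }
  return sum;
}

// Builds a buffer holding the int32 values 0, 1, ..., count - 1.
std::vector<char> MakeBuffer(size_t count) {
  std::vector<char> buf(count * sizeof(int32_t));
  for (size_t i = 0; i < count; ++i) {
    int32_t v = static_cast<int32_t>(i);
    memcpy(buf.data() + i * sizeof(v), &v, sizeof(v));
  }
  return buf;
}

// Runs fn(buf) iters times and returns the elapsed nanoseconds. Results are
// accumulated into *sink so the compiler cannot discard the work.
template <typename Fn>
int64_t TimeNs(Fn fn, const std::vector<char>& buf, int iters, int64_t* sink) {
  auto start = std::chrono::steady_clock::now();
  for (int i = 0; i < iters; ++i) *sink += fn(buf);
  auto stop = std::chrono::steady_clock::now();
  return std::chrono::duration_cast<std::chrono::nanoseconds>(stop - start)
      .count();
}
```

Calling `TimeNs(SumPerReadCheck, buf, iters, &sink)` and `TimeNs(SumSingleCheck, buf, iters, &sink)` on the same buffer gives two numbers to compare. Any difference would still need confirming against the generated code and a real profile, since an optimizing compiler may already inline and merge the checks in a toy like this.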

Ben Turner (not reading bugmail, use the needinfo flag!)

Comment 3

12 years ago

It might also be useful to make sure that we have some IPC stuff in our PGO automation script. Maybe then we could just let the compiler take care of this.

(no longer active)

Comment 4

12 years ago

PGO won't help on B2G, but it's true that our current PGO profile gathering script doesn't do a whole lot of useful things (including examining the IPC code).

rewrite Pickle's alignment mechanism to be more obviously optimizable (12 years ago, Nathan Froyd [:froydnj], 5.67 KB, patch)
part 1 - rewrite Pickle's alignment mechanism to be more obviously optimizable (12 years ago, Nathan Froyd [:froydnj], 5.50 KB, patch, bent.mozilla : review+)
part 2 - add a new memberAlignmentType to replace the scattered uint32_t alignments (12 years ago, Nathan Froyd [:froydnj], 7.13 KB, patch, bent.mozilla : review+)
part 3 - replace memcpys of POD datatypes in Pickle with something smarter (12 years ago, Nathan Froyd [:froydnj], 9.87 KB, patch)
part 4 - un-generalize alignment required for Read/Write in Pickle (12 years ago, Nathan Froyd [:froydnj], 7.09 KB, patch)
part 3 - replace memcpys of POD datatypes in Pickle with something smarter (12 years ago, Nathan Froyd [:froydnj], 9.75 KB, patch)
part 3 - replace memcpys of POD datatypes in Pickle with something smarter (12 years ago, Nathan Froyd [:froydnj], 10.00 KB, patch, bent.mozilla : review+)