Closed Bug 1271102 Opened 4 years ago Closed 2 years ago

Release assert when sending large (>128MB) IPC messages

Categories: Core :: IPC, defect, P1, critical
Platform: Unspecified / Windows 8

Status: RESOLVED FIXED
Tracking Status:
  e10s + ---
  firefox48 --- unaffected
  firefox49 + fixed
  firefox50 + fixed

Reporter: dbaron; Unassigned
References: Depends on 1 open bug
Keywords: 4 keywords; Whiteboard: btpp-meta
Attachments: 1 file

This bug was filed from the Socorro interface and is report bp-5b3756e3-1086-4cd8-94b2-2a98e2160506.
=============================================================

A new topcrash started in build 20160506052823.
https://crash-stats.mozilla.com/signature/?product=Firefox&release_channel=nightly&platform=Windows&date=%3E%3D2016-04-01&signature=mozilla%3A%3Aipc%3A%3AProcessLink%3A%3ASendMessageW

It's clearly a regression from:
https://hg.mozilla.org/mozilla-central/rev/5cdbf605f972

It's not clear to me how the crashes should usefully be categorized, though.

Are initial segments of the stack (but with more than one function) a useful way to categorize them?  (i.e., should we add things to prefix_signature_re?)  Or something else?


FWIW, I looked at 5 reports, and there were two pairs of reports that were similar.  Two reports had the stack top:


0 	xul.dll 	mozilla::ipc::ProcessLink::SendMessageW(IPC::Message*) 	ipc/glue/MessageLink.cpp:161
1 	xul.dll 	mozilla::ipc::MessageChannel::DispatchMessageW(IPC::Message const&) 	ipc/glue/MessageChannel.cpp:1607
2 	xul.dll 	mozilla::ipc::MessageChannel::OnMaybeDequeueOne() 	ipc/glue/MessageChannel.cpp:1560
3 	xul.dll 	nsRunnableMethodImpl<bool ( mozilla::ipc::MessageChannel::*)(void), 0, 1>::Run() 	obj-firefox/dist/include/nsThreadUtils.h:741


and two had:


0 	xul.dll 	mozilla::ipc::ProcessLink::SendMessageW(IPC::Message*) 	ipc/glue/MessageLink.cpp:161
1 	xul.dll 	mozilla::ipc::MessageChannel::Send(IPC::Message*) 	ipc/glue/MessageChannel.cpp:780
2 	xul.dll 	mozilla::dom::PBrowserChild::SendInvokeDragSession(nsTArray<mozilla::dom::IPCDataTransfer> const&, unsigned int const&, nsCString const&, unsigned int const&, unsigned int const&, unsigned int const&, unsigned char const&, int const&, int const&) 	obj-firefox/ipc/ipdl/PBrowserChild.cpp:2127
3 	xul.dll 	nsDragServiceProxy::InvokeDragSessionImpl(nsISupportsArray*, nsIScriptableRegion*, unsigned int) 	widget/nsDragServiceProxy.cpp:61
Flags: needinfo?(erahm)
To the extent that we can't fix it quickly, we should classify these by the message type, I think. That's probably not available in any annotations, and perhaps not in the minidump at all currently.
We changed the behavior here to enforce the maximum message size on the sender side rather than the receiver side. In theory we're not causing more crashes, just crashing sooner.
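The sender-side enforcement amounts to a hard size check at Send() time. A minimal sketch of that check, assuming the 128 MiB limit from this bug; the constant name follows IPC::Channel::kMaximumMessageSize from the crash signature, but everything else here is illustrative, not the actual Mozilla code:

```cpp
#include <cstddef>

// Illustrative limit, matching the 128 MiB value discussed in this bug.
constexpr std::size_t kMaximumMessageSize = 128 * 1024 * 1024;

// In the real code an oversized message trips a MOZ_RELEASE_ASSERT in
// ProcessLink::SendMessage and crashes the process; here we just report
// whether a message of the given size would pass the sender-side check.
bool WouldSendSucceed(std::size_t aMessageSize) {
  return aMessageSize < kMaximumMessageSize;
}
```

Before this change the same limit was enforced on the receiving side, so the sender could complete Send() and the crash happened in the other process.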

I could either add the message name to the assertion message or I could add an annotation (I'm not sure what the process for that is though).

I also saw structured clone in some of the stacks:

> 0 	xul.dll 	mozilla::ipc::ProcessLink::SendMessageW(IPC::Message*) 	ipc/glue/MessageLink.cpp:161
> 1 	xul.dll 	mozilla::ipc::MessageChannel::Send(IPC::Message*) 	ipc/glue/MessageChannel.cpp:780
> 2 	xul.dll 	mozilla::dom::PBrowserParent::SendAsyncMessage(nsString const&, nsTArray<mozilla::jsipc::CpowEntry> const&, IPC::Principal const&, mozilla::dom::ClonedMessageData const&) 	obj-firefox/ipc/ipdl/PBrowserParent.cpp:231
> 3 	xul.dll 	nsFrameLoader::DoSendAsyncMessage(JSContext*, nsAString_internal const&, mozilla::dom::ipc::StructuredCloneData&, JS::Handle<JSObject*>, nsIPrincipal*) 	dom/base/nsFrameLoader.cpp:2681
Flags: needinfo?(erahm)
If you do add an annotation, it would be nice to have the size of the message in there, like we do for NS_ABORT_OOM.
Annotating has a problem: it's main-thread-only in content processes. I would guess that the IPC messaging layer runs off the main thread (I need to confirm this); if that's the case, there's not much I can do.
If the above page prints

"progress 1"
"The transfer is complete."

then the crash did not occur.

Running in non-e10s mode (setting browser.tabs.remote.autostart.2 to false) avoids the crash, so this looks e10s-related.
Yes, this assertion is caused by an IPC message being very large, so they are mostly going to be e10s-only. Your particular crash is in message manager, for what it is worth. The other recognizable one I see in this thread is in drag and drop code, which sends images serialized as a string. I should file a bug about using shared memory for that.

Message manager will be harder to fix. Maybe the message manager could reject large messages somehow, and throw an exception.
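The "reject large messages and throw an exception" idea could look roughly like this. This is a hypothetical sketch, not the message manager's actual API (the real follow-up was pursued in bug 1272423); the function and constant names are invented:

```cpp
#include <cstddef>
#include <stdexcept>

// Hypothetical limit mirroring the IPC channel's maximum message size.
constexpr std::size_t kMaxMessageSize = 128 * 1024 * 1024;

// Hypothetical message-manager entry point: reject an oversized payload
// with an exception the calling script can catch, instead of letting it
// reach the IPC layer and trip the process-killing release assert.
void SendMessageOrThrow(std::size_t aPayloadSize) {
  if (aPayloadSize >= kMaxMessageSize) {
    throw std::length_error("message manager: message too large");
  }
  // ... serialize and hand the message to the IPC channel ...
}
```

The design point is simply to move the failure from an unrecoverable process abort to a recoverable script-visible error.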

(In reply to Eric Rahm [:erahm] from comment #2)
> We changed the behavior here to enforce the maximum message size on the
> sender side rather than the receiver side. In theory we're not causing more
> crashes, just crashing sooner.

You also decreased the message size limit. Maybe we should back that out until some of these cases are more under control? Though sending messages between 128 MB and 256 MB is often going to result in an OOM crash on 32-bit systems.
Depends on: 1272018
I'm hitting this crash when trying to analyze something with the gecko profiler addon, using custom profiler settings with an increased sample buffer.
not blocking on this, we prefer to fix the individual cases where we allocate large blocks for individual messages.
Priority: -- → P1
Keywords: meta
Depends on: 1272423
Yeah, I think bug 1093357 will let us avoid giant IPC messages for a lot of the various IPC users that are sending giant gobs of data.
Depends on: 1093357
(In reply to Andrew McCreight [:mccr8] from comment #11)
> Yeah, I think bug 1093357 will let us avoid giant IPC messages for a lot of
> the various IPC users that are sending giant gobs of data.

Keep in mind that bug 1093357 is only supporting pipes from child-to-parent at first.  At least one of the stacks above seems to be sending from parent-to-child.
Summary: Crashes in mozilla::ipc::ProcessLink::SendMessageW, starting 2016-05-06, from MOZ_RELEASE_ASSERT(msg->size() < IPC::Channel::kMaximumMessageSize) → Release assert when sending large (>128MB) IPC messages
This crash seems to show up without the W on OSX. It is the number one crash on OSX for 05-11 builds (with only 5 crashes).
Crash Signature: [@ mozilla::ipc::ProcessLink::SendMessageW] → [@ mozilla::ipc::ProcessLink::SendMessageW] [@ mozilla::ipc::ProcessLink::SendMessage ]
Blocks: 1273258
Crash Signature: [@ mozilla::ipc::ProcessLink::SendMessageW] [@ mozilla::ipc::ProcessLink::SendMessage ] → [@ mozilla::ipc::ProcessLink::SendMessageW] [@ mozilla::ipc::ProcessLink::SendMessage] [@ xul.dll@0xccfa63 | strspn]
I'm hitting this crash on Windows during a lot of different activities: trying to view a JS file in the debugger, visiting a page that stores asset files in the IndexedDB cache, or running other pages that call postMessage.

In comment 9, it is stated "we prefer to fix the individual cases ...". Does that mean we prefer to open a separate bug entry for each distinct crashing call stack leading to SendMessage?
Assuming yes to my above question, I went ahead and filed the two separate call stack crashes as

bug 1274074 - console.log()ging too much data can crash e10s via IPC call from mozilla::dom::ProcessGlobal::SendAsyncMessage

and

bug 1274075 - IndexedDB ObjectStore AddOrPut operation crashes when attempting to send too much data via an IPC pipe

The call stacks are separate from comment 0, so they likely warrant reports of their own.
Depends on: 1274404
Depends on: 1274706
Also marked down a third detected call site in addition to those two from comment 15:

bug 1275062 - IndexedDB IDB Request Send__delete__ operation crashes when attempting to send too much data via an IPC pipe
Depends on: 1275062
Depends on: 1274075
Depends on: 1274074
Duplicate of this bug: 1119836
Depends on: 1275398
(In reply to Jim Mathies [:jimm] from comment #9)
> not blocking on this, we prefer to fix the individual cases where we
> allocate large blocks for individual messages.

I would like to make a second attempt at getting this to block.

The size of the data that the parent process sends to the child can be controlled from a web page, as the IndexedDB bugs that Jukka mentioned demonstrate.  This means that you can write a web page that causes the parent to send more than 128 megs of data and crash the whole browser, which seems very bad.

Also, this API is exposed to chrome JS as this call stack shows https://crash-stats.mozilla.com/report/index/ddafba04-04f3-41d5-a810-cde2c2160602 which means that add-ons can also crash Firefox in this way.

I think we need to fix this issue.
On branches without bug 1262671, which just landed, I can't imagine that many users on 32-bit systems will actually survive trying to send a 128MB message.

We could also revert to what should have been the prior behavior by bumping the limit to 256MB. This will still let a webpage crash the browser by sending a lot of data, but of course there are lots of other ways to OOM kill the browser from content.

(In reply to :Ehsan Akhgari (busy, don't ask for review please) from comment #18)
> Also, this API is exposed to chrome JS as this call stack shows
> https://crash-stats.mozilla.com/report/index/ddafba04-04f3-41d5-a810-
> cde2c2160602 which means that add-ons can also crash Firefox in this way.

That should be addressed in bug 1272423, by just throwing an exception.
Depends on: 1277664
Depends on: 1277681
Depends on: 1277744
Whiteboard: btpp-meta
It seems like indexeddb blobs are the last remaining issue here. That will probably be hard to fix quickly. I think a good short-term fix here is to boost the limit back to 256MB. That can be done as an M9 bug.
It appears to be hard to fix some sources of >128 MiB messages (e.g. IndexedDB),
so revert back to a 256MiB limit for the short term.
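Schematically, the revert is a one-constant change. The constant names below are illustrative (the real limit lives in IPC::Channel), and the transient-footprint estimate is an assumption sketching the 32-bit OOM concern raised in comment 19, where a message may be held in more than one buffer while being serialized and sent:

```cpp
#include <cstddef>

// The two limits discussed in this bug; names here are illustrative.
constexpr std::size_t kOldLimit = 128 * 1024 * 1024;       // from bug 1268616
constexpr std::size_t kRevertedLimit = 256 * 1024 * 1024;  // this patch

// Assumption: a message can transiently exist in ~2 copies during
// serialization and send, so a 256 MiB message can need ~512 MiB of
// contiguous allocations, a large slice of a 32-bit address space.
constexpr std::size_t TransientFootprint(std::size_t aSize) {
  return 2 * aSize;
}
```

This is why raising the limit only trades the release assert for a likely OOM on 32-bit builds without the allocation fixes from bug 1262671.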

Review commit: https://reviewboard.mozilla.org/r/58308/diff/#index_header
See other reviews: https://reviewboard.mozilla.org/r/58308/
Attachment #8760913 - Flags: review?(wmccloskey)
Attachment #8760913 - Flags: review?(wmccloskey) → review+
Pushed by jryans@gmail.com:
https://hg.mozilla.org/integration/mozilla-inbound/rev/04beffefe3a7
Revert back to 256 MiB message limit. r=billm
Comment on attachment 8760913 [details]
Bug 1271102 - Revert back to 256 MiB message limit.

Approval Request Comment
[Feature/regressing bug #]:
bug 1268616
[User impact if declined]:
occasional crashes in games and mail sites
[Describe test coverage new/current, TreeHerder]:
reverting to the previous setting. well tested.
[Risks and why]: 
low, setting a maximum message size limit back to its original value.
[String/UUID change made/needed]:
none
Attachment #8760913 - Flags: approval-mozilla-beta?
Attachment #8760913 - Flags: approval-mozilla-aurora?
This isn't on beta.
Attachment #8760913 - Flags: approval-mozilla-beta?
Looks like this was fixed in 50, comment 24. Tracking since this looks like a recent regression.
Comment on attachment 8760913 [details]
Bug 1271102 - Revert back to 256 MiB message limit.

Revert to previous size limit to prevent crashes, please uplift to aurora
Attachment #8760913 - Flags: approval-mozilla-aurora? → approval-mozilla-aurora+
Since uplift to aurora, there are still new crash reports coming in with this signature. Are there still remaining issues for a followup bug?
Flags: needinfo?(dbaron)
:dbaron is out, let's redirect comment 30.
Flags: needinfo?(dbaron) → needinfo?(continuation)
(In reply to Liz Henry (:lizzard) (needinfo? me) from comment #30)
> Since uplift to aurora, there are still new crash reports coming in with
> this signature. Are there still remaining issues for a followup bug?

The patch wasn't expected to fix all of these crashes. It just increases a limit from 128MB to 256MB. The open bugs blocking this one address some of the common remaining issues.
Flags: needinfo?(continuation)
(In reply to Andrew McCreight [:mccr8] from comment #32)
> (In reply to Liz Henry (:lizzard) (needinfo? me) from comment #30)
> > Since uplift to aurora, there are still new crash reports coming in with
> > this signature. Are there still remaining issues for a followup bug?
> 
> The patch wasn't expected to fix all of these crashes. It just increases a
> limit from 128MB to 256MB. The open bugs blocking this one address some of
> the common remaining issues.

I guess this is why I'm not able to download files > 256 MB from mega.nz without crashing.
Do we know when we regressed this?
This bug makes Cleopatra (Gecko Profiler) cause reproducible tab crashes in many cases. It stops me from using the profiler. Should I file a bug against Cleopatra?
I wonder if there's been a regression in the last few days, I remember being able to download the files a few days ago, but I can't anymore.

kael, when did the crashes you're experiencing start to happen?
(In reply to Marco Castelluccio [:marco] from comment #33)
> (In reply to Andrew McCreight [:mccr8] from comment #32)
> > (In reply to Liz Henry (:lizzard) (needinfo? me) from comment #30)
> > > Since uplift to aurora, there are still new crash reports coming in with
> > > this signature. Are there still remaining issues for a followup bug?
> > 
> > The patch wasn't expected to fix all of these crashes. It just increases a
> > limit from 128MB to 256MB. The open bugs blocking this one address some of
> > the common remaining issues.
> 
> I guess this is why I'm not able to download files > 256 MB from mega.nz
> without crashing.
> Do we know when we regressed this?

I haven't had any trouble with large file downloads. I've brought down ISOs from MSDN, for example, without issue. I'll check again to be sure. Can you post crash report IDs for this?

(In reply to K. Gadd (:kael) from comment #34)
> This bug makes Cleopatra (Gecko Profiler) cause reproducible tab crashes in
> many cases. It stops me from using the profiler. Should I file a bug against
> Cleopatra?

Yes, please do.
(In reply to Marco Castelluccio [:marco] from comment #35)
> I wonder if there's been a regression in the last few days, I remember being
> able to download the files a few days ago, but I can't anymore.
> 
> kael, when did the crashes you're experiencing start to happen?

I haven't used Cleopatra in months before today, sorry. If I don't have the GPU and Main Thread I/O options turned on, it doesn't crash, presumably because the profiles are smaller. And it seems to be intermittent, unfortunately. Bug 1294712 filed against it. Thanks.
(In reply to Jim Mathies [:jimm] from comment #36)
> I haven't had any trouble with large file downloads. I've brought down isos
> from msdn for example without issue. I'll check again to be sure. Can you
> post crash report ids for this?

I believe file downloads from mega.nz aren't "normal" downloads. They're stored in a blob first.

Here's a report: https://crash-stats.mozilla.com/report/index/e405c34e-e7fd-42fd-b30d-ef6a02160812.
See Also: → 1288997
Depends on: 1288997
See Also: 1288997
Depends on: 1306331
Could we augment the mozilla::ipc::ProcessLink::SendMessage and mozilla::ipc::ProcessLink::SendMessageW signatures with the ipc_message_name?
It would give us a better matching between signatures and bugs blocking this bug.
Flags: needinfo?(ted)
Not if we don't have it available as a crash annotation, no. We'd have to add that first. Ting-Yu did something similar in bug 1301022, but I don't think it's exactly what you'd want here.
Flags: needinfo?(ted)
marco pointed out that we did in fact add that annotation in bug 1274404. I don't see any reason why we couldn't use that for signatures. You'd want to file a Socorro bug on that.
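Using that annotation to augment the signature could be sketched as below. All names here are hypothetical: the real annotation landed in bug 1274404 and the real signature logic lives in Socorro's server-side processor, not in Firefox:

```cpp
#include <map>
#include <string>

// Toy annotation store standing in for the crash reporter's
// annotation API; names are illustrative only.
std::map<std::string, std::string> gCrashAnnotations;

void AnnotateCrashReport(const std::string& aKey, const std::string& aValue) {
  gCrashAnnotations[aKey] = aValue;
}

// Sketch of the proposal: append the annotated IPC message name to the
// base crash signature so each oversized message type buckets separately.
std::string AugmentedSignature(const std::string& aBase) {
  auto it = gCrashAnnotations.find("IPCMessageName");
  return it == gCrashAnnotations.end() ? aBase : aBase + " | " + it->second;
}
```

With per-message-name buckets, each blocking bug can be matched to its own signature instead of all crashes collapsing into ProcessLink::SendMessage.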
Depends on: 1306449
(In reply to Ted Mielczarek [:ted.mielczarek] from comment #41)
> You'd want to file a Socorro bug on that.

I've now filed bug 1306449 for that.
Bug 1306449 was fixed, so I've added the new signatures to the blocking bugs.
This was part of our memory reduction work for shipping e10s. Closing this meta out.
Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Removing leave-open keyword from resolved bugs, per :sylvestre.
Keywords: leave-open