Closed Bug 790826 Opened 13 years ago Closed 13 years ago

Heap-use-after-free in mozilla::plugins::parent

Tracking

(firefox16 unaffected, firefox17+ fixed, firefox18+ fixed, firefox-esr10 unaffected)

Status:

RESOLVED FIXED

Milestone:

mozilla19

Tracking Flags:

               Tracking  Status
firefox16      ---       unaffected
firefox17      +         fixed
firefox18      +         fixed
firefox-esr10  ---       unaffected

People

(Reporter: inferno, Assigned: gfritzsche)

References

Details

(4 keywords, Whiteboard: [adv-track-main17+][asan])

Attachments

(6 files, 1 obsolete file)

Testcase | 13 years ago | Abhishek Arya | 2.56 KB, text/html
reduced testcase | 13 years ago | Bob Clary [:bc] (inactive) | 933 bytes, text/html
Fix race issue on NPObjWrappers | 13 years ago | Georg Fritzsche [:gfritzsche] | 762 bytes, patch | benjamin: review+, abillings: sec-approval+
ASAN output and call stack | 13 years ago | Georg Fritzsche [:gfritzsche] | 8.50 KB, text/plain
Check for pso teardown nested within destroy | 13 years ago | Benjamin Smedberg | 1.82 KB, patch
gdb session with adress traces | 13 years ago | Georg Fritzsche [:gfritzsche] | 24.78 KB, text/plain
GDB trace session | 13 years ago | Georg Fritzsche [:gfritzsche] | 29.50 KB, text/plain

Abhishek Arya

Reporter

Description

•

13 years ago

Attached file Testcase — Details

Reproduces on trunk. For reproducing reliably, just run > 10 simultaneous firefox instances.

=================================================================
==21539== ERROR: AddressSanitizer heap-use-after-free on address 0x7f8bcb1c0088 at pc 0x7f8c1f38cac0 bp 0x7fffbeb1a2f0 sp 0x7fffbeb1a2e8
WRITE of size 4 at 0x7f8bcb1c0088 thread T0
    #0 0x7f8c1f38cac0 in mozilla::plugins::parent::_releaseobject(NPObject*) src/dom/plugins/base/nsNPAPIPlugin.cpp:1389
0x7f8bcb1c0088 is located 8 bytes inside of 32-byte region [0x7f8bcb1c0080,0x7f8bcb1c00a0)
freed by thread T0 here:
    #0 0x42b630 in free ??:0
    #1 0x7f8c1f3e57bd in NPObjWrapperPluginDestroyedCallback(PLDHashTable*, PLDHashEntryHdr*, unsigned int, void*) src/dom/plugins/base/nsJSNPRuntime.cpp:1944
previously allocated by thread T0 here:
    #0 0x42b6f0 in __interceptor_malloc ??:0
    #1 0x7f8c236873a8 in moz_xmalloc src/memory/mozalloc/mozalloc.cpp:57
    #2 0x7f8c1f96a10b in mozilla::plugins::PluginScriptableObjectParent::CreateProxyObject() src/dom/plugins/ipc/PluginScriptableObjectParent.cpp:557
    #3 0x7f8c1f9c4f32 in mozilla::plugins::PPluginModuleParent::OnMessageReceived(IPC::Message const&) src/objdir-ff-asan/ipc/ipdl/PPluginModuleParent.cpp:858
    #4 0x7f8c1f978d8d in mozilla::ipc::AsyncChannel::OnDispatchMessage(IPC::Message const&) src/ipc/glue/AsyncChannel.cpp:473
    #5 0x7f8c1f9d3228 in mozilla::plugins::PPluginInstanceParent::CallNPP_GetValue_NPPVpluginScriptableNPObject(mozilla::plugins::PPluginScriptableObjectParent**, short*) src/objdir-ff-asan/ipc/ipdl/PPluginInstanceParent.cpp:374
    #6 0x7f8c1f9490a0 in mozilla::plugins::PluginInstanceParent::NPP_GetValue(NPPVariable, void*) src/dom/plugins/ipc/PluginInstanceParent.cpp:1104
    #7 0x7f8c1f95a48d in mozilla::plugins::PluginModuleParent::NPP_GetValue(_NPP*, NPPVariable, void*) src/dom/plugins/ipc/PluginModuleParent.cpp:709
Shadow byte and word:
  0x1ff179638011: fd
  0x1ff179638010: fd fd fd fd fd fd fd fd
More shadow bytes:
  0x1ff179637ff0: fd fd fd fd fd fd fd fd
  0x1ff179637ff8: fd fd fd fd fd fd fd fd
  0x1ff179638000: fa fa fa fa fa fa fa fa
  0x1ff179638008: fa fa fa fa fa fa fa fa
=>0x1ff179638010: fd fd fd fd fd fd fd fd
  0x1ff179638018: fd fd fd fd fd fd fd fd
  0x1ff179638020: fa fa fa fa fa fa fa fa
  0x1ff179638028: fa fa fa fa fa fa fa fa
  0x1ff179638030: fd fd fd fd fd fd fd fd
Stats: 331M malloced (365M for red zones) by 595516 calls
Stats: 49M realloced by 25909 calls
Stats: 292M freed by 330351 calls
Stats: 158M really freed by 206564 calls
Stats: 536M (137309 full pages) mmaped in 134 calls
  mmaps by size class: 8:344043; 9:49146; 10:16380; 11:20470; 12:4096; 13:2048; 14:1536; 15:384; 16:576; 17:1280; 18:272; 19:40; 20:20;
  mallocs by size class: 8:461364; 9:70728; 10:23906; 11:25624; 12:5405; 13:2947; 14:2126; 15:554; 16:764; 17:1730; 18:303; 19:45; 20:20;
  frees by size class: 8:220906; 9:56741; 10:19146; 11:21782; 12:4052; 13:2704; 14:1865; 15:492; 16:612; 17:1711; 18:281; 19:42; 20:17;
  rfrees by size class: 8:136492; 9:35705; 10:11257; 11:16093; 12:2517; 13:1371; 14:1234; 15:312; 16:395; 17:1147; 18:34; 19:6; 20:1;
Stats: malloc large: 2098 small slow: 3007
==21539== ABORTING

Benjamin Smedberg

Updated

•

13 years ago

Component: General → Plug-ins

Keywords: sec-critical

Product: Firefox → Core

Benjamin Smedberg

Comment 1

•

13 years ago

bc, could you try and make this testcase smaller? I can't tell whether the aria or CSS is really relevant to the crash (I suspect it's not).

Assignee: nobody → georg.fritzsche

Bob Clary [:bc] (inactive)

Comment 2

•

13 years ago

After crashing immediately on my first try, I'm having problems reproducing at all. :-( I'll see about getting better reproducibility and then will work on reduction.

Benjamin Smedberg

Updated

•

13 years ago

Blocks: 791798

Georg Fritzsche [:gfritzsche]

Assignee

Comment 3

•

13 years ago

(In reply to Bob Clary [:bc:] from comment #2)
> I'll see about getting better reproducibility

Did you have any luck with this?

Bob Clary [:bc] (inactive)

Comment 4

•

13 years ago

No, but I've been out with the flu for a few days. I'll try to look at this again today.

Bob Clary [:bc] (inactive)

Comment 5

•

13 years ago

Attached file reduced testcase — Details

Daniel Veditz [:dveditz]

Comment 7

•

13 years ago

Georg: does the reduced testcase demonstrate the problem for you? You can find links to ASAN try builds here: http://people.mozilla.org/~choller/firefox/asan/

status-firefox18: --- → affected

tracking-firefox18: --- → +

Georg Fritzsche [:gfritzsche]

Assignee

Comment 8

•

13 years ago

(In reply to Daniel Veditz [:dveditz] from comment #7)
> Georg: does the reduced testcase demonstrate the problem for you?

Sorry for the delay. Yes, it does reproduce (inconsistently), and also with custom builds. However, I haven't been able to resolve the issue yet.

Georg Fritzsche [:gfritzsche]

Assignee

Comment 9

•

13 years ago

Attached patch Fix race issue on NPObjWrappers — Details — Splinter Review

The heap-use-after-free is apparently caused by a race in nsJSNPRuntime:

* NPObjWrapperPluginDestroyedCallback() deallocating the NPObject
vs.
* NPObjWrapper_Finalize() leading to DelayedReleaseGCCallback() and a releaseref on the NPObject

This patch fixes the issue, but I'm unsure whether I'm overlooking any side effects. Benjamin, can you review this?

Attachment #666632 - Flags: review?(benjamin)

Daniel Veditz [:dveditz]

Updated

•

13 years ago

Keywords: csec-uaf

Benjamin Smedberg

Comment 10

•

13 years ago

This code is all running on the main thread. Which function runs first in this crash scenario?

If it's NPObjWrapperPluginDestroyedCallback and then NPObjWrapper_Finalize, this shouldn't happen: NPObjWrapperPluginDestroyedCallback calls ::JS_SetPrivate(entry->mJSObj, nullptr), so the finalizer should do nothing.

If it's the other way around, I'm still not clear on what's happening:

* NPObjWrapper_Finalize enqueues the object with sDelayedReleases->AppendElement(npobj) and removes it from sNPObjWrappers.
* Then, on plugin destruction, NPObjWrapperPluginDestroyedCallback should not be called for this object because it was already removed from sNPObjWrappers!

Unless the GC which causes NPObjWrapper_Finalize happens *within* the hashtable enumeration (nsJSNPRuntime::OnPluginDestroy), which could cause sNPObjWrappers to become corrupted. The only JSAPI call within the enumeration is JS_SetPrivate, which I believe never triggers GC or finalizers, but I'd love for somebody to confirm that definitively.

I think this needs more understanding.
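The first ordering in this comment (destroy callback, then finalizer) can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToyWrapper`, `ClearPrivate`, and `Finalize` are hypothetical stand-ins for the JS object's private slot, the effect of `JS_SetPrivate(obj, nullptr)`, and `NPObjWrapper_Finalize`. None of it is Mozilla code.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a JS wrapper object that holds a plugin object
// in its private slot. Toy model only, not Mozilla code.
struct ToyWrapper {
    int* priv;  // models the JS private slot; nullptr means "detached"
};

// Models the effect of JS_SetPrivate(obj, nullptr) in the destroy callback.
inline void ClearPrivate(ToyWrapper& w) { w.priv = nullptr; }

// Models the finalizer: it queues a delayed release only if the wrapper
// still points at a plugin object. Returns true if something was queued.
inline bool Finalize(ToyWrapper& w, std::vector<int*>& delayedReleases) {
    if (w.priv == nullptr) {
        return false;  // already detached by plugin teardown: nothing to do
    }
    delayedReleases.push_back(w.priv);
    w.priv = nullptr;
    return true;
}
```

In this model, a wrapper whose private slot was cleared first never reaches the delayed-release queue, which is why that ordering alone should not produce the crash.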

Georg Fritzsche [:gfritzsche]

Assignee

Comment 11

•

13 years ago

Attached file ASAN output and call stack — Details

The ASAN error callstack and output I'm seeing.

Georg Fritzsche [:gfritzsche]

Assignee

Updated

•

13 years ago

Attachment #666632 - Flags: review?(benjamin)

Benjamin Smedberg

Comment 12

•

13 years ago

When I try this testcase in debug non-valgrind/ASAN builds, I always end up crashing but not in something directly related:

> xul.dll!DoDeferredRelease<nsISupports *>(array={...}) Line 527 C++
  xul.dll!XPCJSRuntime::GCCallback(rt=0x074dd368, status=JSGC_END) Line 733 C++
  mozjs.dll!Collect(rt=0x074dd368, incremental=true, budget=0x0000000000000000, gckind=GC_NORMAL, reason=TRANSPLANT) Line 4654 C++
  mozjs.dll!js::GCFinalSlice(rt=0x074dd368, gckind=GC_NORMAL, reason=TRANSPLANT) Line 4693 C++
  mozjs.dll!js::FinishIncrementalGC(rt=0x074dd368, reason=TRANSPLANT) Line 178 C++
  mozjs.dll!JS_TransplantObject(cx=0x0c42a390, origobjArg=0x11331040, targetArg=0x12737040) Line 1557 C++
  xul.dll!xpc::TransplantObject(cx=0x0c42a390, origobj=0x11331040, target=0x12737040) Line 693 C++
  xul.dll!nsGlobalWindow::SetNewDocument(aDocument=0x0ece9790, aState=0x00000000, aForceReuseInnerWindow=false) Line 1992 C++
  xul.dll!DocumentViewerImpl::InitInternal(aParentWidget=0x00000000, aState=0x00000000, aBounds={...}, aDoCreation=true, aNeedMakeCX=true, aForceSetNewDocument=true) Line 928 C++
  xul.dll!DocumentViewerImpl::Init(aParentWidget=0x00000000, aBounds={...}) Line 678 C++
  xul.dll!nsDocShell::SetupNewViewer(aNewViewer=0x0e64f930) Line 8009 C++
  xul.dll!nsDocShell::Embed(aContentViewer=0x0e64f930, aCommand=0x5af9890a, aExtraInfo=0x00000000) Line 6070 C++
  xul.dll!nsDocShell::CreateContentViewer(aContentType=0x0e5d0020, request=0x0cb60f58, aContentHandler=0x0e5c3920) Line 7796 C++
  xul.dll!nsDSURIContentListener::DoContent(aContentType=0x0e5d0020, aIsContentPreferred=false, request=0x0cb60f58, aContentHandler=0x0e5c3920, aAbortProcess=0x0035d24f) Line 122 C++
  xul.dll!nsDocumentOpenInfo::TryContentListener(aListener=0x0eec4db8, aChannel=0x0cb60f58) Line 655 C++
  xul.dll!nsDocumentOpenInfo::DispatchContent(request=0x0cb60f58, aCtxt=0x00000000) Line 356 C++
  xul.dll!nsDocumentOpenInfo::OnStartRequest(request=0x0cb60f58, aCtxt=0x00000000) Line 248 C++
  xul.dll!nsBaseChannel::OnStartRequest(request=0x0ec52400, ctxt=0x00000000) Line 731 C++
  xul.dll!nsInputStreamPump::OnStateStart() Line 417 C++
  xul.dll!nsInputStreamPump::OnInputStreamReady(stream=0x09cbf9b0) Line 368 C++
  xul.dll!nsInputStreamReadyEvent::Run() Line 83 C++
  xul.dll!nsThread::ProcessNextEvent(mayWait=false, result=0x0035d577) Line 612 C++

I also tried a debugging patch which would catch the "nested GC" error I mentioned, and it's not triggering anything.

Benjamin Smedberg

•

13 years ago

Attached file GDB trace session — Details

Traced a bit more and apparently the object is added again after being torn down:

* nsNPObjWrapper::GetNewOrUsed() -> adds to sNPObjWrappers
* NPObjWrapper_Finalize() -> removes from sNPObjWrappers, schedules delayed release
* nsNPObjWrapper::GetNewOrUsed() -> adds to sNPObjWrappers
...
* NPObjWrapperPluginDestroyedCallback() -> free object
* DelayedReleaseGCCallback() -> access free'd object
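The traced sequence can be replayed in a small model to show why the second registration matters. This is a hedged sketch, not Mozilla code: `ToyRuntime` and `ReplayTrace` are hypothetical names, plugin objects are plain integer ids, and "freeing" an object just removes its id from a set, so no real memory is touched.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Toy model of the traced sequence; not Mozilla code. A plugin object is
// an integer id, and "freeing" it removes the id from `live`.
struct ToyRuntime {
    std::set<int> live;        // ids of objects that have not been freed
    std::set<int> wrappers;    // stand-in for sNPObjWrappers
    std::vector<int> delayed;  // stand-in for the delayed-release queue

    // Models nsNPObjWrapper::GetNewOrUsed(): registers a wrapper.
    void GetNewOrUsed(int id) {
        live.insert(id);
        wrappers.insert(id);
    }

    // Models NPObjWrapper_Finalize(): unregisters and schedules a release.
    void Finalize(int id) {
        wrappers.erase(id);
        delayed.push_back(id);
    }

    // Models NPObjWrapperPluginDestroyedCallback(): frees every object
    // that is still registered.
    void PluginDestroyed() {
        for (int id : wrappers) live.erase(id);
        wrappers.clear();
    }

    // Models DelayedReleaseGCCallback(): returns how many queued releases
    // would touch an object that has already been freed.
    int RunDelayedReleases() {
        int stale = 0;
        for (int id : delayed) {
            if (live.count(id) == 0) ++stale;
        }
        delayed.clear();
        return stale;
    }
};

// Replays the five traced steps for one object and reports the number
// of stale releases.
inline int ReplayTrace() {
    ToyRuntime rt;
    rt.GetNewOrUsed(1);
    rt.Finalize(1);
    rt.GetNewOrUsed(1);  // the same object is wrapped again
    rt.PluginDestroyed();
    return rt.RunDelayedReleases();
}
```

In this model the stale release only appears when the object is re-registered between finalization and plugin teardown. Dropping the second `GetNewOrUsed` call leaves nothing for the destroy step to free, and the queued release finds the object still live.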

Attachment #669604 - Attachment is obsolete: true

Georg Fritzsche [:gfritzsche]

Assignee

Comment 19

•

13 years ago

So... is the NPObjWrapper_Finalize() call at that time fine? Assuming that, does it make sense to clean any remaining delayed releases from nsJSNPRuntime::OnPluginDestroy()?
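The cleanup idea floated here could look roughly like the following. This is a minimal sketch of the idea only, and it is not a description of the patch that eventually landed: `PurgeDelayedReleases` is a hypothetical helper, and integer ids stand in for NPObject pointers.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <iterator>
#include <set>
#include <vector>

// Hypothetical helper: drops every queued release whose target is in the
// set of objects that plugin teardown is about to free, and returns how
// many queue entries were dropped. Toy model only, not Mozilla code.
inline std::size_t PurgeDelayedReleases(std::vector<int>& delayed,
                                        const std::set<int>& destroyed) {
    // Move the entries to keep to the front, preserving their order.
    auto firstRemoved = std::remove_if(
        delayed.begin(), delayed.end(),
        [&destroyed](int id) { return destroyed.count(id) != 0; });
    std::size_t removed =
        static_cast<std::size_t>(std::distance(firstRemoved, delayed.end()));
    delayed.erase(firstRemoved, delayed.end());
    return removed;
}
```

Purging at teardown time means a later delayed-release pass only sees objects the teardown did not free. Whether that is safe for a real plugin depends on ownership rules this sketch does not model.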

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 20

•

13 years ago

In bug 729760 we started allowing JS/Gecko code to run in between finalization of JS objects and the GC_END callback.

I don't understand why nsNPObjWrapper::GetNewOrUsed can return the NPObject after we've decided to release it. It looks like this is the call that's obtaining the NPObject:

  nsresult rv = GetValueFromPlugin(NPPVpluginScriptableNPObject, &npobj);

Is there a rule that it's allowed to return the given NPObject until we call _releaseobject on it?

Benjamin Smedberg

•

13 years ago

Attachment #666632 - Flags: review?(benjamin) → review+

Benjamin Smedberg

Comment 24

•

13 years ago

Comment on attachment 666632 [details] [diff] [review]
Fix race issue on NPObjWrappers

How easily can the security issue be deduced from the patch?
I suspect it would be quite difficult to figure out how to reproduce the problem and even more difficult to exploit. You'd have to combine a heapsmash with a GC timing attack to weaponize this, and that's fiendishly difficult.

Do comments in the patch, the check-in comment, or tests included in the patch paint a bulls-eye on the security problem?
No

Which older supported branches are affected by this flaw? If not all supported branches, which bug introduced the flaw?
Bug 791798 seems to indicate that this is a regression in 17 from around 27-28 August, but we're not sure and nothing in the range seems obviously incorrect. Analysis says this could date all the way back to incremental GC, but the crashes aren't showing up on 16.

Do you have backports for the affected branches?
The patch here should work for 17.

How likely is this patch to cause regressions; how much testing does it need?
This patch feels fairly safe to me.

Attachment #666632 - Flags: sec-approval?

Daniel Veditz [:dveditz]

Updated

•

13 years ago

status-firefox-esr10: --- → unaffected

status-firefox16: --- → unaffected

status-firefox17: --- → affected

tracking-firefox17: --- → +

Al Billings [:abillings - ex-MoCo]

Updated

•

13 years ago

Attachment #666632 - Flags: sec-approval? → sec-approval+

Benjamin Smedberg

Comment 25

•

13 years ago

I'm going to put this patch on bug 791798, a public topcrash bug, and land it from there so that it looks less enticing.

Benjamin Smedberg

Comment 26

•

13 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/f6753216c72b

Target Milestone: --- → mozilla19

Benjamin Smedberg

Comment 27

•

13 years ago

https://hg.mozilla.org/mozilla-central/rev/f6753216c72b

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Bill McCloskey [inactive unless it's an emergency] (:billm)

Comment 28

•

13 years ago

This may be related to bug 785806 and bug 787885. Can you request approval for branches, Benjamin?

Benjamin Smedberg

Comment 29

•

13 years ago

On Monday when I've verified it against crash-stats, yes.

Benjamin Smedberg

Comment 30

•

13 years ago

Landed on aurora/beta.

status-firefox17: affected → fixed

status-firefox18: affected → fixed

u279076

Updated

•

13 years ago

Keywords: testcase, verifyme

Al Billings [:abillings - ex-MoCo]

Updated

•

13 years ago

Whiteboard: [adv-track-main17+]

Daniel Veditz [:dveditz]

•

3 years ago

Product: Core → Core Graveyard

David Lawrence [:dkl]

Updated

•

1 year ago

Keywords: reporter-external
