Closed Bug 667915 Opened 13 years ago Closed 13 years ago

Assertion failure: pc_ >= script->code && pc_ < script->code + script->length | Assertion failure: compartment mismatched | Crash [@ moz_free | NS_Free_P nsMemory::Free(void*) XPCJSStackFrame::~XPCJSStackFrame() ]

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla8

People

(Reporter: bc, Assigned: luke)

References

(
URL
)

Details

(Keywords: assertion, crash, reproducible, Whiteboard: [inbound])

Attachments

(3 files, 1 obsolete file)

Fix AutoCompartment::enter 13 years ago Luke Wagner [:luke] 981 bytes, patch	mrbkap : review+	Details \| Diff \| Splinter Review
Fix FixupArity 13 years ago Luke Wagner [:luke] 8.53 KB, patch		Details \| Diff \| Splinter Review
don't let content oom chrome (WIP) 13 years ago Luke Wagner [:luke] 46.46 KB, patch		Details \| Diff \| Splinter Review
don't let content oom chrome 13 years ago Luke Wagner [:luke] 77.75 KB, patch	Waldo : review+ mrbkap : review+	Details \| Diff \| Splinter Review

Bob Clary [:bc] (inactive)

Reporter

Description

•

13 years ago

1. http://www.autoteile-immler.com/THULE+973+BackPac+Fahrradtr%e4ger+inkl.+Montagekit+973-15/a5730277_u161/

2. stop or continue slow script.

3. Crash with multiple signatures or assertions.

Mac OS X 10.5

Assertion failure: pc_ >= script->code && pc_ < script->code + script->length, at /work/mozilla/builds/nightly/mozilla/js/src/vm/Stack.cpp:978

Linux 64 bit

Assertion failure: compartment mismatched

Windows XP and 7

moz_free | NS_Free_P nsMemory::Free(void*) XPCJSStackFrame::~XPCJSStackFrame() XPCJSStackFrame::`vector deleting destructor'(unsigned int) + 0xe XPCJSStackFrame::Release()

Bob Clary [:bc] (inactive)

Reporter

Comment 1

•

13 years ago

stack for Mac: https://bugzilla.mozilla.org/attachment.cgi?id=542476

Luke Wagner [:luke]

Assignee

Comment 2

•

13 years ago

So check this out: the code chews up the whole stack (causing an exception to be thrown), when its time to report the exception, we try to push a dummy frame from AutoEnterCompartment, THAT also hits the end of the stack, tries to throw, but now the exception's compartment doesn't match the cx's, hence the assert.

Luke Wagner [:luke]

Assignee

Comment 3

•

13 years ago

Attached patch Fix AutoCompartment::enter — Details — Splinter Review

This fixes the compartment assert (which I can repro on linux64 by disabling all jits): AutoCompartment::enter switches the compartment before pushing the dummy frame (hence the mismatch when the CompartmentChecker asserts that GetGlobalForScopeChain (fp->scopeChain() in this case) is in cx->compartment).  I'm a bit confused why it was written this way in the first place; perhaps there is some hidden constraint here?

Assignee: general → luke

Status: NEW → ASSIGNED

Attachment #542601 - Flags: review?(mrbkap)

Blake Kaplan (:mrbkap) (inactive)

Comment 4

•

13 years ago

Comment on attachment 542601 [details] [diff] [review]
Fix AutoCompartment::enter

My only thought would be a fear that we'd have to read cx->compartment in order to allocate stack space for the dummy frame, but that doesn't appear to be the case, so r=mrbkap.

Attachment #542601 - Flags: review?(mrbkap) → review+

Luke Wagner [:luke]

Assignee

Comment 5

•

13 years ago

Attached patch Fix FixupArity — Details — Splinter Review

FixupArity calls getFixupFrame which reports despite a trashed regs.pc.

With these patches, I don't get any asserts.  But this does expose a problem I've been a'fearin', namely, that content can use up the stack and OOM unsuspecting chrome JS.  I'm going to think about this.

Attachment #542615 - Flags: review?(dvander)

Luke Wagner [:luke]

Assignee

Comment 6

•

13 years ago

Attached patch don't let content oom chrome (WIP) (obsolete) — Details — Splinter Review

So its actually pretty easy for content to oom chrome.  For example, this page will use up all the stack space such that the slow-script dialog will fail, eventually locking up the entire browser while running at 100% CPU:

  <script>
  function foo() {
    try {
      foo()
    } catch(e) {
      while (true) ;
    }
  </script>

The attached patch fixes this test case and also causes the page in comment 0 to have a normal slow-script dialog.  The approach is to report a smaller stack size to any script that doesn't have system principals.  That way, content ooms early and, when chrome runs, it still has plenty of space.  Still a bit of work to do.

Luke Wagner [:luke]

Assignee

Comment 7

•

13 years ago

Attached patch don't let content oom chrome — Details — Splinter Review

Mostly same patch but with build fixes for windows and a JSAPI test of the new mechanism.  This patch subsumes the other two and includes shell test-cases to trigger the specific asserts in the previous two patches.

All in all, I'm pretty happy to have this patch: its relatively simple and it fixes a long-standing concern since the introduction of the contiguous stack.  Incidentally, the other three browsers all show issues with the testcase above:
 - IE hangs and needs to be force-quit
 - Safari hangs and then shows what is probably supposed to be their slow-script dialog, but its a tiny empty rectangle. Fortunately, their default-close behavior kills the script.
 - Chrome UI remains responsive (process isolation ftw), but the tab is unresponsive and doesn't display the slow-script dialog that normally is shown; it has to be killed by clicking the tab's 'x'.

Blake: could you check out the xpconnect/caps changes and see if that all look sane?
Waldo: can you look at the rest?

Attachment #543023 - Attachment is obsolete: true

Attachment #543291 - Flags: review?(mrbkap)

Attachment #543291 - Flags: review?(jwalden+bmo)

Luke Wagner [:luke]

Assignee

Updated

•

13 years ago

Attachment #542615 - Flags: review?(dvander)

Luke Wagner [:luke]

Assignee

Comment 8

•

13 years ago

Green on try (once "if (getMaxArgs)" is added to regress-350256-02.js).

Jeff Walden [:Waldo]

Comment 9

•

13 years ago

Comment on attachment 543291 [details] [diff] [review]
don't let content oom chrome

Review of attachment 543291 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/vm/Stack.h
@@ +1357,5 @@
>       * LeaveTree requires stack allocation to rebuild the stack. There is no
>       * good way to handle an OOM for these allocations, so this function checks
>       * that OOM cannot occur using the size of the TraceNativeStorage as a
> +     * conservative upper bound. Despite taking a 'cx', this function does not
> +     * report an error if it returns 'false'.

I'd pull the last sentence into its own paragraph; it blends in a bit here.

@@ +1362,2 @@
>       */
> +    inline bool ensureEnoughSpaceToEnterTrace(JSContext *cx, MaybeReportError report);

This method's always called with report == NO_REPORT_ERROR.  Is that for documentation purposes?  I'm not sure I see anything clearly better, but maybe there is something.

Attachment #543291 - Flags: review?(jwalden+bmo) → review+

Blake Kaplan (:mrbkap) (inactive)

Updated

•

13 years ago

Attachment #543291 - Flags: review?(mrbkap) → review+

Bob Clary [:bc] (inactive)

Reporter

Comment 10

•

13 years ago

another url: http://www.dualsimtelefone.de/Dual-Sim-Telefone-Anycool-T728-Dualsim-Touchscreen-Handy-MP3-MP4-NEU-OVP/a31177604_u4357_z922d8337-124d-4a11-a0d4-68e54fedfb5a/

let the script continue.

Luke Wagner [:luke]

Assignee

Comment 11

•

13 years ago

(In reply to comment #10)
Thanks for the test-case.  With the patch, I can repeatedly click 'Continue' and eventually 'Stop' and the page is responsive with no asserts.

Luke Wagner [:luke]

Assignee

Comment 12

•

13 years ago

http://hg.mozilla.org/integration/mozilla-inbound/rev/28be8df0deb7

Whiteboard: [inbound]

Nicholas Nethercote [inactive]

Updated

•

13 years ago

Blocks: 669123

Marco Bonardo [:mak]

Comment 13

•

13 years ago

http://hg.mozilla.org/mozilla-central/rev/28be8df0deb7
http://hg.mozilla.org/mozilla-central/rev/0661e7077dce

Status: ASSIGNED → RESOLVED

Closed: 13 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla8

Justin Lebar (not reading bugmail)

Updated

•

13 years ago

Depends on: 669890

Comment 14

•

13 years ago

The patch aside, why is firefox even hanging on those pages?  For me, neither chrome 13 nor ie 9 hang on those pages, and opera 11.5 only momentarily pauses on the comment 10 page.  So while this patch helps firefox recover, it would be nice if there was a follow-up to address the root problem.

Luke Wagner [:luke]

Assignee

Comment 15

•

13 years ago

The root problem is we don't have content/chrome process separation.  Rest assured, work is underway: https://wiki.mozilla.org/Electrolysis.

A secondary root problem is that, if we have an error while running the operation callback, apparently we either (1) lose it or (2) choose not to halt the running script.  That sounds like a nice bug to file, if you'd like.

Comment 16

•

13 years ago

(In reply to comment #15)

Thanks, Luke.

> A secondary root problem is that, if we have an error while running the
> operation callback, apparently we either (1) lose it or (2) choose not to
> halt the running script.  That sounds like a nice bug to file, if you'd like.

Don't understand enough to file that one.  Any takers?

Luke Wagner [:luke]

Assignee

Comment 17

•

13 years ago

Filed bug 670183.

To be clear, I think the problem fixed by this bug is as "root" of a problem as the other two things; it fixes a class of DOSes.

Bob Clary [:bc] (inactive)

Reporter

Comment 19

•

13 years ago

Luke, is this really fixed on Beta/Firefox 8? I see a very similar crash (stack overflow) with oodles of 

XPCJSStackFrame::Release()
nsCOMPtr<nsIStackFrame>::~nsCOMPtr<nsIStackFrame>()
XPCJSStackFrame::~XPCJSStackFrame()

on the stack when visiting http://www.in-town.nl/web/ I don't see it in the automation for Aurora/9, or Nightly/10.

Luke Wagner [:luke]

Assignee

Comment 20

•

13 years ago

That stack trace would almost certainly have been fixed by bug 676934.  (Basically, we increased the scripted stack depth max (to something comparable to other engines) and it looks like XPCJSStackFrame was doing a recursive destructor call for each such frame which apparently is enough to overflow the C stack.  Bug 676934 just bounded the number of XPCJSStackFrames we create.)

You need to log in before you can comment on or make changes to this bug.