Closed Bug 1366296 Opened 7 years ago Closed 6 years ago

Intermittent devtools/client/netmonitor/test/browser_net_filter-02.js | application crashed [@ 0xf2d3dab477][@ js::TypeSet::GetValueType(JS::Value const &)]

Categories

(Core :: JavaScript: GC, defect)

defect
Not set
critical

Tracking

()

RESOLVED INCOMPLETE
Tracking Status
firefox-esr52 --- unaffected
firefox53 --- unaffected
firefox54 --- unaffected
firefox55 + fix-optional
firefox56 --- ?

People

(Reporter: intermittent-bug-filer, Assigned: jonco)

References

Details

(5 keywords)

Crash Data

Looks s-s to me (rax = 0xfffe4b4b4b4b4b4b). Philor pointed out bug 1366083 and bug 1366153 as well, which also have 4b4b4b4b on their stacks. Jon, any idea what might have caused this in the last couple days?
Group: javascript-core-security
Component: JavaScript Engine → JavaScript: GC
Flags: needinfo?(jcoppeard)
I can't see a smoking gun in js/src in the last few days.
Flags: needinfo?(jcoppeard)
[Tracking Requested - why for this release]: sec-high regression

It should be possible to bisect one of these on TreeHerder. Bug 1366083 is happening in a single test suite, for instance.
tracking as new regression in 55
This is happening in the same test directory as bug 1366083, which suggests that they are the same underlying issue.
See Also: → 1366153
See Also: → 1366083
This looks like it is happening on 3 different platforms, Win32, Win64 and OSX.

Jon, could you maybe take a look? It seems odd that at devtools change that is JS-only might be causing this.
Blocks: 1365635
Flags: needinfo?(jcoppeard)
Investigating.  This might be an interaction between incremental object finalization (bug 1352430) and bug 1189822.
FWIW, bug 1366083 seems like the most frequent variant.
I added some assertions and pushed a try build which shows that it's possible to have DOM proxies with expandos but without a preserved wrapper.  This would produce crashes like these.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=52a3466149d1c7184cf932c1887a91365f004409&group_state=expanded&selectedJob=101699377
I wasn't able to track this down.  

The failures in bug 1366083 and bug 1366153 have dropped to nothing in the last week, but I wouldn't be surprised if there was still a problem here.

The only suspicious thing I found was that during unlinking nsWrapperCache::ReleaseWrapper can be called more than once for a DOM node that has a JS proxy object and expandoAndGeneration (we rely on the wrapper being preserved to mark the expando object).  However I wasn't able to observe any calls to get the expando object in between.
Flags: needinfo?(jcoppeard)
Are you going to keep investigating?  I will mark this fix-optional for 55, for now, since nothing seems immediately actionable.
Flags: needinfo?(jcoppeard)
(In reply to Liz Henry (:lizzard) (needinfo? me) from comment #11)
I'm not currently investigating this.  I wasn't able to find a link with bug 1352430.  

The associated intermittent test failures bug 1366083 and bug 1366153 have stopped happening but crashes with this signature are still occurring.

The crashes mostly seem to happen doing type monitoring after coming out of JIT code.  Jan, do you know who would be a good person to investigate further?
Flags: needinfo?(jcoppeard) → needinfo?(jdemooij)
Sean since you're on Linux, maybe you can try to repro with rr?
Flags: needinfo?(jdemooij) → needinfo?(sstangl)
Jan, I'm not able to reproduce the failure on Fedora x86_64. That may be expected: Comment 6 indicates that the failure is only on other platforms.
Flags: needinfo?(sstangl)
(In reply to Sean Stangl [:sstangl] from comment #15)
> Jan, I'm not able to reproduce the failure on Fedora x86_64. That may be
> expected: Comment 6 indicates that the failure is only on other platforms.

OK but maybe we can use Try debugging? We should fix this bug somehow.
Flags: needinfo?(sstangl)
Hi Jon:

I have assigned these security bugs to you to reassign them to appropriate developers in your team to investigate and fix them.

Thanks!

Wennie
Assignee: nobody → jcoppeard
>Jan, I'm not able to reproduce the failure on Fedora x86_64. That may be expected: Comment 6 indicates that the failure is only on other platforms.

You can get a loaner machine from ServiceNow with the right configuration if that's easier to debug than on try.
Do we know if this crash is still occurring? It looks like Bug 1366083 had some diagnostic patches landed to look into the issue, but no movement there either, and an extremely small failure rate.
I think marking it incomplete is okay...
Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → INCOMPLETE
Group: javascript-core-security
Flags: needinfo?(sstangl)
You need to log in before you can comment on or make changes to this bug.