Closed Bug 495464 Opened 15 years ago Closed 13 years ago

[tracking bug] Full crash data on talos and unittests

Categories

(Release Engineering :: General, defect, P3)

defect

Tracking

(Not tracked)

RESOLVED INCOMPLETE

People

(Reporter: catlee, Assigned: coop)

References

Details

(Whiteboard: [talos][unittest])

Our current minidumps aren't always sufficient to track down the source of crashes (see bug #495353).

We should be collecting full crash data, packaging that up with the build and debug symbols, and putting it somewhere where developers can examine it.

We need to continue to include the stack trace in the log; sometimes you can tell what's going wrong just from the stack trace.
So, I think what we can do here is make Breakpad write a dump, then pass off to the OS exception handler, which can write a full dump/core/whatever. We already have a setting in our code (exposed only on OS X, at the moment) to do precisely this:
http://mxr.mozilla.org/mozilla-central/source/toolkit/crashreporter/nsExceptionHandler.cpp#167
see the "defaults write" bit here:
https://developer.mozilla.org/en/Environment_variables_affecting_crash_reporting

We could easily expose this on all platforms via an environment variable.
This would be a quantum leap in our ability to debug these sorts of crashes.  It is difficult to overstate the value.  Can I donate a kidney?
Marking as future for now - seems like a pretty involved project and we haven't got the resources at the moment.  We are working on bits and pieces of this but we won't be able to roll this out any time soon.
Component: Release Engineering → Release Engineering: Future
Mass move of bugs from Release Engineering:Future -> Release Engineering. See
http://coop.deadsquid.com/2010/02/kiss-the-future-goodbye/ for more details.
Component: Release Engineering: Future → Release Engineering
Priority: -- → P3
See also bug 544062, which is about saving the minidumps from unittest runs for later inspection, which would get us partway to this.
Whiteboard: [[talos][unittest]
Whiteboard: [[talos][unittest] → [talos][unittest]
Assignee: nobody → jhford
Priority: P3 → P4
Whiteboard: [talos][unittest] → [talos][unittest][triagefollowup]
(In reply to comment #6)
> See also bug 544062, which is about saving the minidumps from unittest runs for
> later inspection, which would get us partway to this.

This single bug is masking a lot of complexity, I think. I agree that the final outcome seems full of win, but I'm unsure how to get there from here. 

Ted, I guess these questions are for you: Once bug 544062 is resolved, what other steps are required? Are those steps different based on platform? 

Comment #2 seems to explain part of it, but it sounds like there is mozilla-central work that would need to happen first too.
Whiteboard: [talos][unittest][triagefollowup] → [talos][unittest]
(In reply to comment #7)
> (In reply to comment #6)
> > See also bug 544062, which is about saving the minidumps from unittest runs for
> > later inspection, which would get us partway to this.
> 
> This single bug is masking a lot of complexity, I think. I agree that the final
> outcome seems full of win, but I'm unsure how to get there from here. 

I am going to turn this into a tracking bug for work to get this done for our three supported platforms, windows, osx and linux.
No longer depends on: 640343
Hardware: x86 → All
Summary: Full crash data on talos and unittests → [tracking bug] Full crash data on talos and unittests
Assignee: jhford → nobody
Priority: P4 → P5
Assignee: nobody → jhford
I don't really see the point of having this tracking bug now that we have the platform-specific ones, the Mac one having already been WONTFIXed.

I'm going to mine this bug for any platform-specific info, add it to the appropriate platform bugs, and then close this one.
Assignee: jhford → coop
Priority: P5 → P3
No longer depends on: 640343
No longer depends on: 640345
No longer depends on: 642167
Broken into smaller per-platform chunks in bug 640343 and 640345.
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → INCOMPLETE
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.