589955 - Submit empty crash reports when we can't get a backtrace

Reporter

Description

•

14 years ago

When we crash because we're out of memory, we can't reliably create a backtrace because the backtracer library can't allocate any memory. This results in us creating an empty minidump, which we then don't send to breakpad - it's lost forever.

In order to let us get some idea of how bad our memory issues are, we should always submit these empty reports, and Breakpad should mark down as much information as it has about the machine (OS, build id, etc).

Joe Drew (not getting mail)

Reporter

Comment 1

•

14 years ago

We should really get this done for Beta 5, especially considering that we now trap exceptions on 64-bit OSes, many of which can be out of memory.

blocking2.0: --- → ?

(not currently active) Ted Mielczarek

Assignee

Comment 2

•

14 years ago

If we write out an empty minidump (which is currently the case in these conditions), then we do try to submit it, we just fail because the Windows sender code doesn't cope with zero byte files (filed as bug 427446). However, Socorro currently relies on the minidump to provide the OS information, so the resulting crash report would not be very useful. If you want to add more information, we probably shouldn't do so in the crashed process, since it's in a really bad state if we fail like this. We could make the crash reporter client figure this out and add some information, I suppose. Socorro would need to be modified to know to use this additional information instead of relying on the minidump.

Dave Townsend [:mossop]

Comment 3

•

14 years ago

Presuming we can't get it done for b5 now

blocking2.0: ? → beta6+

(not currently active) Ted Mielczarek

Assignee

Comment 4

•

14 years ago

I don't think this is worthwhile blocking Firefox 4 on (or fixing at all, actually). We get empty crash reports on OS X/Linux, and the only thing they tell us is "we have problems". This isn't going to help us actually fix the problems. The main impetus for this bug was the D2D memory leak, which was fixed in bug 589809, so I don't think we're any worse off now than we were in previous versions. The right fix for all of this is bug 587729, which is probably too much effort/too risky to get into FF4 now, but I'd like to try to get it done for 4.next.

dwitte@gmail.com

Comment 5

•

14 years ago

How much memory are we talking here? Would it be feasible to preallocate some or all of it at startup, so we can guarantee success (unless we OOM on startup!) at crash time?

dwitte@gmail.com

Comment 6

•

14 years ago

Alternatively, we could fire the XPCOM memory pressure topic, and cross our fingers that it gives us back enough. (And if not, go make more services aware of it.)

Benjamin Smedberg

Comment 7

•

14 years ago

No, we are not going to enter XPCOM from within a crash handler.

AFAICT, the problem here is that we don't know exactly who is allocating the memory, so a reserve is very difficult to manage properly.

dwitte@gmail.com

Comment 8

•

14 years ago

Yeah, nevermind my comment about XPCOM. That didn't exactly make sense. :/

(not currently active) Ted Mielczarek

Assignee

Comment 9

•

14 years ago

The Linux and OS X dump generators are careful not to allocate memory. On Windows, we call DbgHelp!MinidumpWriteDump, which is apparently not as careful. I have no idea what kind of memory it allocates or how much, though.

OS: Linux → Windows XP

Joe Drew (not getting mail)

Reporter

Comment 10

•

14 years ago

We could just preallocate 1 or 2 MB, then free it before writing minidumps.

Johnathan Nightingale [:johnath]

Comment 11

•

14 years ago

I'm moving this off beta6, which joe agrees with. I'd sort of like to not block at all - but Joe's gonna work on articulating why "knowing we crashed without any data about where or why" matters enough to justify that block.

blocking2.0: beta6+ → final+

Joe Drew (not getting mail)

Reporter

Comment 12

•

14 years ago

That is certainly news to me, but I can do that!

Right now we have absolutely no information about crashes in which we fail to generate a crash report. We don't know whether we hit OOM conditions (presuming that is the only time we generate empty crash reports) all the time, some of the time, or almost never. We also don't know whether any changes we have made make it worse or better. This is a sorry state of affairs, and we shouldn't have to deal with it!

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Comment 13

•

14 years ago

Indeed -- the reason is that with the data that we get right now, we might see, let's say, a 2% daily crash rate out of 1000 users.  But the /real/ rate could be like 10%.  We'd behave very differently if that was the case, including doing things like prioritizing actually collecting more data here (e.g. out-of-process minidump writing and similar).  But right now whatever that delta is is just entirely lost, we never see it.

(not currently active) Ted Mielczarek

Assignee

Comment 14

•

14 years ago

Okay. I can fix this (I believe the only actual bug here is that the submission code fails to handle zero byte minidumps on Windows), but I really think it's not going to be as useful as you think it is. (Take a look at all the (null signature) crashes we have on OS X, and the lack of progress there.)

Assignee: nobody → ted.mielczarek

Mike Beltzner [:beltzner, not reading bugmail]

Comment 15

•

14 years ago

While I understand comment 14, I do think that comment 13 wins the day. We need to know the overall crashiness, as it will be a metric on which we base our release readiness.

blocking2.0: final+ → betaN+

(not currently active) Ted Mielczarek

Assignee

Comment 17

•

14 years ago

Patch up for review upstream:
http://breakpad.appspot.com/243001

(not currently active) Ted Mielczarek

Assignee

Comment 18

•

14 years ago

Patch landed upstream, will land in m-c shortly:
http://code.google.com/p/google-breakpad/source/detail?r=743

(not currently active) Ted Mielczarek

Assignee

Comment 19

•

14 years ago

Pushed to m-c:
http://hg.mozilla.org/mozilla-central/rev/68529f865a6e

There's still a Socorro bug that makes it impossible to view individual reports from zero-byte minidumps (bug 607810), but they ought to show up in topcrash reports.

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Bugzilla

Quick Search

Submit empty crash reports when we can't get a backtrace

Categories

(Toolkit :: Crash Reporting, defect)

Tracking

()

People

(Reporter: joe, Assigned: ted)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Comment 14

Comment 15

Comment 17

Comment 18

Comment 19