Closed Bug 726570 Opened 12 years ago Closed 12 years ago

Deploy new minidump_stackwalk to solve stackwalking issues with MSVC2010

Categories

(Socorro :: Backend, task)

task
Not set
normal

Tracking

(Not tracked)

VERIFIED FIXED

People

(Reporter: ted, Assigned: rhelmer)

Attachments

(4 files, 1 obsolete file)

Since we switched to VC++ 2010, we've been seeing some bad stacks in certain cases. It looks like the new compiler is producing stack unwind information that Breakpad doesn't know how to handle. There are two specific things that are causing problems:
1) A new operator in the unwind expressions: @. This appears to be an alignment operator.
2) A new built-in value: .raSearch. We haven't narrowed down exactly what the difference is between this and .raSearchStart.
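
To make item 1 concrete, here is a minimal sketch of how a postfix unwind expression with an alignment operator could be evaluated. It assumes that `a b @` truncates `a` down to a multiple of `b` (with `b` a power of two), which is only our current reading of the operator. The function name, the sample expression, and the register value are made up for illustration; this is not Breakpad's evaluator, and it omits the dereference operator and .raSearch entirely.

```python
def evaluate_postfix(expression, variables):
    """Evaluate a simplified postfix unwind expression.

    Supports integer literals, named variables, '+', '-', '=' (assignment)
    and '@' (ASSUMED here to mean "align down to a power-of-two boundary").
    Returns a new dict containing the original and assigned variables.
    """
    mask = 0xFFFFFFFF  # model 32-bit x86 registers
    env = dict(variables)
    stack = []

    def value_of(token):
        # Intermediate results are already ints; otherwise look up a
        # variable, and finally fall back to parsing a numeric literal.
        if isinstance(token, int):
            return token
        if token in env:
            return env[token]
        return int(token, 0)

    for token in expression.split():
        if token == "=":
            value = value_of(stack.pop())
            name = stack.pop()  # assignment target stays an unevaluated name
            env[name] = value
        elif token in ("+", "-", "@"):
            right = value_of(stack.pop())
            left = value_of(stack.pop())
            if token == "+":
                result = left + right
            elif token == "-":
                result = left - right
            else:
                # Assumed semantics: clear the low bits so the result is a
                # multiple of `right` (only valid for powers of two).
                result = left & ~(right - 1)
            stack.append(result & mask)
        else:
            stack.append(token)
    return env


# Hypothetical expression: $T0 = align_down($ebp - 4, 16)
result = evaluate_postfix("$T0 $ebp 4 - 16 @ =", {"$ebp": 0x0012FF7C})
print(hex(result["$T0"]))
```

If the real semantics turn out to be different (for example rounding up instead of down), only the `@` branch would need to change.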

I have a WIP Breakpad patch that adds support for both of these, but I'm not sure whether it's correct or whether it will fix the problem.
Very much a WIP. I don't know if the @ implementation is correct, and I'm pretty sure the value of .raSearch isn't perfect.
I put my patch for the @ operator up for review upstream:
http://breakpad.appspot.com/349002/

bsmedberg has a patch to make .raSearch handling work which he's also going to upstream.
Assignee: nobody → benjamin
The .raSearch fix is up at http://breakpad.appspot.com/349003/

I need the @ patch to land before I can upload a useful patch for the remaining issue, since it touches/refactors the same code blocks.
Attachment #596661 - Attachment is obsolete: true
http://code.google.com/p/google-breakpad/source/detail?r=926
http://code.google.com/p/google-breakpad/source/detail?r=927

This bug is now ready to morph into a deployment bug. We need minidump_stackwalk recompiled against breakpad r927 and deployed to staging.
Assignee: benjamin → server-ops
Component: General → Server Operations: Web Operations
Product: Socorro → mozilla.org
QA Contact: general → cshields
Version: unspecified → other
Summary: Stackwalking issues with MSVC2010 → Deploy new minidump_stackwalk to solve stackwalking issues with MSVC2010
(In reply to Benjamin Smedberg  [:bsmedberg] from comment #6)
> http://code.google.com/p/google-breakpad/source/detail?r=926
> http://code.google.com/p/google-breakpad/source/detail?r=927
> 
> This bug is now ready to morph into a deployment bug. We need
> minidump_stackwalk recompiled against breakpad r927 and deployed to staging.

We now have a jenkins job for breakpad:
https://ci.mozilla.org/job/breakpad/52/

The Socorro job automatically pulls the last successful build (which is that one):
https://github.com/mozilla/socorro/blob/1c80b6cb7afcc0072d7b6a7010398d2cc0fcf906/scripts/build.sh#L12
https://ci.mozilla.org/job/breakpad/lastSuccessfulBuild/?

So I think we don't need to do anything special here; this should automatically go out on the next Socorro release (and should already be on the dev server, crash-stats-dev.allizom.org).
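
For anyone who wants to check which build would be picked up, the "last successful build" link above is a standard Jenkins permalink, and artifacts hang off it under `artifact/`. A small sketch of how such a URL is put together follows; the helper name and the artifact file name `breakpad.tar.gz` are assumptions for illustration, not taken from the build script.

```python
def last_successful_artifact_url(jenkins_base, job_name, artifact_path):
    """Build the Jenkins permalink for an artifact of a job's last successful build."""
    return "{}/job/{}/lastSuccessfulBuild/artifact/{}".format(
        jenkins_base.rstrip("/"),   # tolerate a trailing slash on the base URL
        job_name,
        artifact_path.lstrip("/"),  # artifact paths are relative to the workspace
    )


# Hypothetical artifact name; check the job page for the real one.
print(last_successful_artifact_url("https://ci.mozilla.org/", "breakpad", "breakpad.tar.gz"))
```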
rhelmer is going to help shepherd this onto stage so that it can go out on Wednesday with the scheduled release.
Assignee: server-ops → rhelmer
I'm going to attach a .dump and .extra file to test this with.
Attachment #601043 - Attachment mime type: application/xml → application/octet-stream
Component: Server Operations: Web Operations → Backend
Product: mozilla.org → Socorro
QA Contact: cshields → backend
Target Milestone: --- → 2.4.4
This is equivalent to the .extra file, but formatted as JSON so we can use the submitter tool.
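
For reference, a conversion like this can be sketched as follows, assuming the .extra file is plain `Key=Value` text with one annotation per line. The function name and the sample values are made up, and the exact JSON shape the submitter tool expects is not shown here.

```python
import json


def extra_to_json(extra_text):
    """Convert Key=Value lines (assumed .extra layout) into a JSON object string."""
    fields = {}
    for line in extra_text.splitlines():
        # Skip blank lines and anything that is not a Key=Value pair.
        if "=" not in line:
            continue
        # Split on the first '=' only, since values may themselves contain '='.
        key, value = line.split("=", 1)
        fields[key] = value
    return json.dumps(fields, indent=2, sort_keys=True)


# Made-up sample annotations, not the contents of the attached file.
sample = "ProductName=Firefox\nVersion=11.0\nURL=http://example.com/?a=b\n"
print(extra_to_json(sample))
```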
I inserted crashes but can't complete testing quite yet: we're doing a DB import on dev, and staging HBase seems to be down (bug 730986).
Status: NEW → ASSIGNED
Depends on: 730986
OK, here is an example:
https://crash-stats-dev.allizom.org/report/index/e80d6c06-0bdb-4072-a71a-baf5f2120229

Can someone help make sure this looks OK? I'd also like to make sure "normal" jobs work before marking this verified.
Status: ASSIGNED → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
KaiRo says comment 13 looks OK and seems to do no harm with other crashes:
https://crash-stats-dev.allizom.org/report/index/ad99fc9c-c0ab-4454-8df4-5b37d2120229
Status: RESOLVED → VERIFIED
Are you sure the symbols were present at processing time? Without actually having the processing log, this looks like the symbols weren't fully uploaded/synced yet.
Oh, ignore me, the google_breakpad symbols are in xul.dll also. I don't know.
KaiRo: those two crashes you linked are both 64-bit (note the amd64). We don't currently have useful unwind info in Win64 symbols. That's bug 548035.