Closed Bug 475500 Opened 16 years ago Closed 16 years ago

Need core dump for macosx/crashtest running patch in bug 474771

Categories

(Release Engineering :: General, defect, P1)

x86
macOS
defect

Tracking

(Not tracked)

RESOLVED INVALID

People

(Reporter: gal, Assigned: gal)

Details

When trying to land 474771 the macosx tinderbox crashed during the talos run. I was not able to reproduce the crash offline or using the try server. I need the core dump of a talos VM crashing from this patch, along with the executable (with debug info would be fantastic). 474771 is a P1 blocker for JS.
Requesting b+ since its blocking a blocker.
Blocks: 474771
Severity: normal → critical
Flags: blocking1.9.1?
Priority: -- → P1
No longer blocks: 474771
I don't know how to do this. Ted tells me that we don't keep symbols for hourly builds so getting a core dump from any old run would be basically useless. Are we supposed to manually do a build, upload symbols and then manually run it through Talos and wait for a crash? Didn't we have to do something like this for b1 or b2? Anyone remember what we did there?
Now that I think about it I think a bit more we may have given someone access to a Talos machine so they could reproduce it themselves. Not entirely sure about that. Can anyone confirm that?
bug 461020 - you pulled the try talos box out of production, scrubbed it, and gave it to mrbkap.
mrbkap got access to do something like this, once. /be
Okay, I'll have to defer to John and/or Alice here, they know what the process is.
Andreas: So this crashed on one Talos mac, but not a try Talos mac? These machines are imaged the same, so maybe this is intermittent crash? Can you try sending the same patch 3-4 times to the try server? If *any* of your try runs crash, we can pull that slave out of the pool for you to poke around (and not get clobbered by the next incoming request!). Let us know when you are starting to send jobs to try server, so we can be ready-and-waiting as the runs complete. (Comment#5, comment#6 are both correct: we did exactly the same when mrbkap was chasing an intermittent crash in bug#461020)
Last comment in 474771 indicates that it didn't even crash Talos the last time it was landed on TM -- got backed out for unit-test crash and reftest failure (!), so reducing this in priority and reassigning to Andreas until we're sure that we need it. Sorry for the noise, releng!
Assignee: nobody → gal
Severity: critical → normal
We submitted the patch last night again to confirm that its still crashing and talos cycled now and dep unit failed. I will try to reproduce the latter offline (we only tried talos the last 2 days, not dep unit, since that used to pass)
Ok, I am unable to reproduce this offline (again) using the following command line: ../../debug-build/dist/MinefieldDebug.app/Contents/MacOS/firefox -console -P default -reftest crashtests.list -no-remote Is there anything else we could do offline to avoid pulling a core straight from the VM running the test?
Summary: Need core dump for macosx/talos running patch in bug 474771 → Need core dump for macosx/crashtest running patch in bug 474771
Mac/unit on tracemonkey seems to be green since the patch landed there b0592ff0f21a/Tue Jan 27 16:41. But in the merge to m-c today there's a REFTEST TEST-UNEXPECTED-FAIL | file:///builds/moz2_slave/mozilla-central-macosx-unittest/build/layout/base/crashtests/403175-1.html | timed out waiting for reftest-wait to be removed (after onload fired) Is that the signature of this issue ?
Its hard to tell. We had a bunch of sporadic crashes in both TM and MC. We can't clearly associate any of them with any particular patch. I will invalide this bug for the time being. I think a mechanism to get crash info of the tinderboxes is in the works as we speak.
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → INVALID
Flags: blocking1.9.1?
Component: Release Engineering: Talos → Release Engineering
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.