Bug 824754 (t-snow-r4-0044): t-snow-r4-0044 problem tracking
Closed. Opened 12 years ago; closed 11 years ago.
Categories: Infrastructure & Operations Graveyard :: CIDuty (task, P3)
Tracking: (Not tracked)
Status: RESOLVED FIXED
People: Reporter: u429623; Unassigned
References
Whiteboard: [buildduty][buildslaves][capacity][badslave]
Attachments (2 files, 1 obsolete file)
2.66 KB, patch: armenzg: review+; coop: checked-in+
2.09 KB, patch: bhearsum: review+; armenzg: checked-in+
talos-r4-snow-046 is reported as "seeing a rash of mysterious crashes in debug tests" - see bug 824498 for details
Please run diagnostics to see if there are any hardware issues, and resolve as needed.
Regardless of hardware issues, please reimage the host before returning it to releng.
Summary: talos-r4-snow-046 → talos-r4-snow-046 showing inexplicable crashes, hardware suspected
Updated•12 years ago
colo-trip: --- → scl1
Comment 1•12 years ago
I did a regular reboot on this host and it came up with no problems; I didn't read the diagnostics part. Will get to this on Monday.
Updated•12 years ago
Assignee: server-ops-dcops → nobody
Component: Server Operations: DCOps → Release Engineering: Machine Management
QA Contact: dmoore → armenzg
Updated•12 years ago
Summary: talos-r4-snow-046 showing inexplicable crashes, hardware suspected → talos-r4-snow-046 problem tracking
Comment 2•12 years ago
This is now re-enabled in prod
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Comment 3•12 years ago
https://tbpl.mozilla.org/php/getParsedLog.php?id=19457159&tree=Mozilla-Inbound is exactly the same sort of mysterious crash as bug 824498
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Updated•12 years ago
Whiteboard: [buildduty][buildslaves][capacity] → [buildduty][buildslaves][capacity][badslave]
Comment 4•12 years ago
Disabled in slavealloc again.
coop: any ideas on what our next step is?
Flags: needinfo?(coop)
Comment 5•12 years ago
(In reply to Justin Wood (:Callek) from comment #4)
> disabled in slavealloc again.
>
> coop ideas on what our next step is?
This is the point where we usually need to replace the logic board. Please open an IT bug with dcops to start that process. Bonus points if you can batch it with other slaves that need the same attention.
Flags: needinfo?(coop)
Comment 6•12 years ago
Slave has been repaired, reimaged, and is back in service.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Comment 7•12 years ago
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 8•12 years ago
Comment 9•12 years ago
Comment 10•12 years ago
See also bug 824498 - it looks like we're still getting intermittent crashes on this machine.
Comment 11•12 years ago
Comment 12•12 years ago
Comment 13•12 years ago
Comment 14•12 years ago
I'm moving this slave to staging permanently and will mark it as such in slavealloc.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Comment 15•12 years ago
Swapping 1-for-1 slaves between staging and production. See previous comments in this bug for reasons why snow-046 is unsuitable for production.
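For context, the attached patch conceptually just moves hostnames between the Python slave lists that buildbot-configs uses for the production and staging pools. The sketch below is only an illustration of that kind of change; the file names and dictionary layout are assumptions, not the actual contents of the patch:

    # Hypothetical excerpt, not the real buildbot-configs files.
    # production_config.py (assumed name): slaves allowed to take production jobs
    SLAVES = {
        'snowleopard': [
            'talos-r4-snow-010',    # promoted from staging to production
            # 'talos-r4-snow-046' removed: unsuitable for production (this bug)
        ],
    }

    # staging_config.py (assumed name): slaves reserved for staging
    SLAVES = {
        'snowleopard': [
            'talos-r4-snow-046',    # kept in staging for further diagnosis
        ],
    }

The 1-for-1 swap keeps the production pool at the same capacity while the suspect machine stays out of developers' way.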
Assignee: nobody → coop
Status: RESOLVED → REOPENED
Attachment #720729 - Flags: review?(armenzg)
Resolution: FIXED → ---
Updated•12 years ago
Attachment #720729 - Flags: review?(armenzg) → review+
Comment 16•12 years ago
Can you please check whether snow-010 exists in the graphs production DB?
Comment 17•12 years ago
Comment on attachment 720729 [details] [diff] [review]
Move snow-010 to production and snow-046 to staging
Review of attachment 720729 [details] [diff] [review]:
-----------------------------------------------------------------
https://hg.mozilla.org/build/buildbot-configs/rev/8f4dfb408cd0
Attachment #720729 - Flags: checked-in+
Comment 18•12 years ago
I've rebooted both slaves into their respective pools.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Comment 19•12 years ago
Merged and reconfiguration completed.
Updated•12 years ago
Product: mozilla.org → Release Engineering
Comment 20•11 years ago
Not responding to PDU reboots.
Updated•11 years ago
Assignee: coop → nobody
Comment 21•11 years ago
Back in production after an HD replacement.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 11 years ago
Resolution: --- → FIXED
Comment 22•11 years ago
Disabled in slavealloc in the meantime.
Attachment #814148 - Flags: review?(bhearsum)
Updated•11 years ago
Attachment #814148 - Flags: review?(bhearsum) → review+
Comment 23•11 years ago
Comment on attachment 814148 [details] [diff] [review]
add snow-046 back to the production pool
https://hg.mozilla.org/build/buildbot-configs/rev/1952f6d18716
Attachment #814148 - Flags: checked-in+
Comment 24•11 years ago
test_stag_not_in_prod ... [FAIL]
===============================================================================
[FAIL]: test_slave_allocation.SlaveCheck.test_stag_not_in_prod
Traceback (most recent call last):
File "test/test_slave_allocation.py", line 33, in test_stag_not_in_prod
'declared as staging-only:\n%s' % '\n'.join(sorted(common_slaves))
twisted.trial.unittest.FailTest: Staging-only slaves should not be declared as production and vice versa. However, the following production slaves declared as staging-only:
talos-r4-snow-046
not equal:
a = set()
b = set(['talos-r4-snow-046'])
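For anyone reading along: test_stag_not_in_prod is essentially a set-intersection check between the production and staging-only slave lists, and the backed-out patch added snow-046 to production without also removing it from staging. A minimal sketch of that invariant, using illustrative data and helper names rather than the real buildbot-configs contents (and plain unittest rather than twisted.trial), would look something like this; with the overlapping sample data it reproduces the failure above:

    # Minimal sketch of the invariant; names and data are assumptions.
    import unittest

    # Assumed shape: {platform: [hostnames]} -- illustrative data only.
    PRODUCTION_SLAVES = {'snowleopard': ['talos-r4-snow-010', 'talos-r4-snow-046']}
    STAGING_ONLY_SLAVES = {'snowleopard': ['talos-r4-snow-046']}

    def all_hosts(config):
        # Flatten a {platform: [hostnames]} dict into a set of hostnames.
        return {host for hosts in config.values() for host in hosts}

    class SlaveCheck(unittest.TestCase):
        def test_stag_not_in_prod(self):
            common = all_hosts(PRODUCTION_SLAVES) & all_hosts(STAGING_ONLY_SLAVES)
            self.assertEqual(set(), common,
                'Staging-only slaves should not be declared as production and '
                'vice versa. However, the following production slaves declared '
                'as staging-only:\n%s' % '\n'.join(sorted(common)))

    if __name__ == '__main__':
        unittest.main()

The follow-up patch (attachment 814166) removes the host from the staging list as well, which makes the intersection empty and lets the check pass.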
Comment 25•11 years ago
Backed it out; here's the good patch, after testing it with test-masters.sh.
Attachment #814148 - Attachment is obsolete: true
Attachment #814166 - Flags: review?(bhearsum)
Updated•11 years ago
Attachment #814166 - Flags: review?(bhearsum) → review+
Comment 26•11 years ago
Comment on attachment 814166 [details] [diff] [review]
add snow-046 back to the production pool and remove it from staging
https://hg.mozilla.org/build/buildbot-configs/rev/1ec4a515adcf
Attachment #814166 - Flags: checked-in+
Updated•11 years ago
Assignee: nobody → armenzg
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 27•11 years ago
in production
Comment 28•11 years ago
Rebooted the machine into production and enabled it on slavealloc.
Assignee: armenzg → nobody
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
Comment 29•11 years ago
I'll want to be able to find the GC crash in https://tbpl.mozilla.org/php/getParsedLog.php?id=28895921&tree=Try again.
Comment 30•11 years ago
And the GC crash in https://tbpl.mozilla.org/php/getParsedLog.php?id=28906391&tree=Try
Comment 31•11 years ago
What do you mean? Is the slave doing something unexpected?
I don't know much about "finding the GC crash".
Comment 32•11 years ago
Garbage collection and cycle collection apparently do a good job of exercising RAM, and so typically a machine with bad RAM (or a CPU that's bad about talking to its RAM, or a single trace with a hairline crack in it between the CPU and the RAM, or whatever it may really be) will, along with hitting PPoD failures in reftests, hit a lot of GC crashes.
That's what this slave did, and the reason we had multiple bugs filed about crashes in tests that only happened on this slave, and that's the reason it was in staging rather than production with a slavealloc note saying not to put it in production.
Disabled in slavealloc, please do not bring it back to production without diagnosing what's actually wrong with the memory, fixing it, and running at least a hundred test runs in staging without a single unexplained unexpected not-seen-on-other-slaves crash.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 34•11 years ago
philor: any ideas on whether we could word this for developers so they could try to find an easy test case?
(In reply to Phil Ringnalda (:philor) from comment #32)
> Garbage collection and cycle collection apparently do a good job of
> exercising RAM, and so typically a machine with bad RAM (or a CPU that's bad
> about talking to its RAM, or a single trace with a hairline crack in it
> between the CPU and the RAM, or whatever it may really be) will, along with
> hitting PPoD failures in reftests, hit a lot of GC crashes.
>
> That's what this slave did, and the reason we had multiple bugs filed about
> crashes in tests that only happened on this slave, and that's the reason it
> was in staging rather than production with a slavealloc note saying not to
> put it in production.
>
> Disabled in slavealloc, please do not bring it back to production without
> diagnosing what's actually wrong with the memory, fixing it, and running at
> least a hundred test runs in staging without a single unexplained unexpected
> not-seen-on-other-slaves crash.
Comment 35•11 years ago
Sorry, that was casual phrasing on my part. I have no reason to believe that GC/CC are *better* at detecting bad RAM than memtest86 (which has been under development for almost 20 years, focusing on just that one task), they are simply the most likely part of our tests to wind up crashing. Running tests for 24 hours with me looking at the results will indeed show intermittent memory failures, along with lots of noise which is not from intermittent memory failures, but it's not a better way.
I think it would be far better to focus, first, on running memtest86 on this slave once, since I think any diagnostics on it would have been run before we started using memtest86 rather than Apple's memory diagnostics, and second, on having a plan to run it long enough to detect intermittent failures that don't show up in just one quick run.
Comment 36•11 years ago
Thanks philor :)
Comment 37•11 years ago
Memory analysis requested in bug 933886.
Comment 38•11 years ago
2013-01-03 - "Ran hardware diagnostic three times but did not find any issues. All hardware passed."
2013-01-15 - "Host has been reimaged."
2013-02-05 - "mysterious crashes"
2013-02-15 - logic board replaced
2013-02-17 - same issues
2013-10-02 - back from HD replacement
2013-10-09 - reported more issues
2013-11-11 - memtest did not find any issues ("after running memtest86+ multiple times")
I can only see us replacing the memory and giving it one more shot. After that, we should decommission it.
Comment 39•11 years ago
RAM has been replaced.
Putting back into production.
I will check tomorrow.
Assignee: nobody → armenzg
Comment 40•11 years ago
It looks good.
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
Updated•11 years ago
Assignee: armenzg → nobody
QA Contact: armenzg → bugspam.Callek
Alias: talos-r4-snow-046 → t-snow-r4-0044
Summary: talos-r4-snow-046 problem tracking → t-snow-r4-0044 problem tracking
Updated•7 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•5 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard