Nagios says some reference boxes are down

RESOLVED WORKSFORME

Status

Product: Infrastructure & Operations
Component: RelOps
Status: RESOLVED WORKSFORME
Opened: 7 years ago
Last modified: 5 years ago

People

(Reporter: nthomas, Unassigned)

(Reporter)

Description

7 years ago
Since 2011-05-29 17:30, which is a little after the network outage in SJC:

<nagios-sjc1> [99] talos-r3-fed64-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [01] talos-r3-fed-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [02] talos-r3-w7-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [03] win64-ix-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [06] win2k3-ref-img.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.253)
<nagios-sjc1> [07] moz2-linux64-ref.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.245)
<nagios-sjc1> [08] linux-ref-platform.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.251)
<nagios-sjc1> [09] t-r3-w764-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
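
A minimal sketch for triaging the alerts above: it groups them by the site suffix of each hostname, which helps show whether the failures share a cause. The regular expression is an assumption inferred from these eight lines only, not an official Nagios notification format, and `ALERT_RE` and `group_by_site` are hypothetical names introduced for illustration.

```python
import re
from collections import defaultdict

# Alert lines copied verbatim from the description above.
ALERTS = """\
<nagios-sjc1> [99] talos-r3-fed64-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [01] talos-r3-fed-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [02] talos-r3-w7-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [03] win64-ix-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
<nagios-sjc1> [06] win2k3-ref-img.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.253)
<nagios-sjc1> [07] moz2-linux64-ref.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.245)
<nagios-sjc1> [08] linux-ref-platform.build.sjc1 is DOWN: CRITICAL - Host Unreachable (10.2.71.251)
<nagios-sjc1> [09] t-r3-w764-ref.build.mtv1 is DOWN: PING CRITICAL - Packet loss = 100%
""".splitlines()

# Assumed line shape: "<source> [id] host is DOWN: reason".
ALERT_RE = re.compile(
    r"^<(?P<source>[^>]+)> \[(?P<id>\d+)\] (?P<host>\S+) is DOWN: (?P<reason>.+)$"
)


def group_by_site(lines):
    """Map each site suffix (e.g. 'mtv1') to a list of (host, reason) pairs."""
    groups = defaultdict(list)
    for line in lines:
        match = ALERT_RE.match(line.strip())
        if match is None:
            continue  # skip anything that does not look like an alert line
        host = match.group("host")
        site = host.rsplit(".", 1)[-1]  # last dotted label of the hostname
        groups[site].append((host, match.group("reason")))
    return dict(groups)


if __name__ == "__main__":
    for site, entries in sorted(group_by_site(ALERTS).items()):
        print(site, len(entries))
        for host, reason in entries:
            print("   ", host, "-", reason)
```

Grouped this way, the mtv1 hosts all report 100% packet loss, while the sjc1 hosts all report Host Unreachable.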
(Reporter)

Comment 1

7 years ago
This might just have been some downtimes expiring. For linux-ref-platform, the PING check has been down for more than 6 days and has been silenced, but the host itself has only been down for 11 hours.
These were all down for reimaging.
Status: NEW → RESOLVED
Last Resolved: 7 years ago
Resolution: --- → WORKSFORME
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations