Closed
Bug 914570
Opened 11 years ago
Closed 11 years ago
DNS is broken in scl1 (buildapi giving 502 bad gateway errors)
Categories
(mozilla.org Graveyard :: Server Operations, task)
mozilla.org Graveyard
Server Operations
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: emorley, Assigned: Usul)
References
Details
[12:26:17.595] GET https://secure.pub.build.mozilla.org/buildapi/self-serve/mozilla-central/rev/be1053dc223b?format=json [HTTP/1.1 502 Bad Gateway 718ms]
Breaks TBPL retriggers/cancellation etc.
Comment 1•11 years ago
|
||
10.12.48.19 (ns-vip.build.scl1.mozilla.com) isn't responding to DNS requests. From nagios there is also an ntp problem (ntp.build.mozilla.org). We should check DHCP too.
Assignee: nobody → server-ops
Component: Buildduty → Server Operations
Product: Release Engineering → mozilla.org
QA Contact: armenzg → shyam
Summary: buildapi giving 502 bad gateway errors → DNS is broken in scl1 (buildapi giving 502 bad gateway errors)
Assignee | ||
Comment 2•11 years ago
|
||
(In reply to Nick Thomas [:nthomas] from comment #1)
> 10.12.48.19 (ns-vip.build.scl1.mozilla.com) isn't responding to DNS
> requests. From nagios there is also an ntp problem (ntp.build.mozilla.org).
> We should check DHCP too.
dhcp checked on both admin nodes.
Assignee | ||
Updated•11 years ago
|
Assignee: server-ops → ludovic
Reporter | ||
Comment 4•11 years ago
|
||
Just mentioning in case people haven't seen the duped bug - this bug means that both completed, running and pending jobs are not showing up on TBPL, and as such all main trees are closed.
Assignee | ||
Comment 5•11 years ago
|
||
ssh -l root ns-vip.build.scl1.mozilla.com
ssh: connect to host ns-vip.build.scl1.mozilla.com port 22: No route to host
I can't find anything on this host in Inventory. Nor in mana : https://mana.mozilla.org/wiki/dosearchsite.action?queryString=ns-vip.build.scl1.mozilla.com
Assignee | ||
Comment 6•11 years ago
|
||
Thanks to arr we kicked keepalived.
Assignee | ||
Updated•11 years ago
|
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Comment 7•11 years ago
|
||
An outage report will be forthcoming with more details.
Comment 8•11 years ago
|
||
RelEng infra is all fixed up.
Reporter | ||
Comment 9•11 years ago
|
||
I've filed bug 914699 for a nagios alert for self-serve that emails sheriffs@
Comment 10•11 years ago
|
||
Bug 914735 added nagios checks for the DNS vip in SCL1. Previously we only had them on the two admin machines behind the vip.
Updated•10 years ago
|
Product: mozilla.org → mozilla.org Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•