Closed
Bug 710826
Opened 13 years ago
Closed 13 years ago
ganglia thinks signing{1,2}.build.scl1.mozilla.com are offline
Categories
(Infrastructure & Operations :: RelOps: General, task)
Infrastructure & Operations
RelOps: General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: catlee, Assigned: arich)
Details
http://ganglia1.build.scl1.mozilla.com/ganglia/?c=RelEngSCL1&h=signing1.build.scl1.mozilla.com&m=load_one&r=hour&s=descending&hc=4&mc=2
http://ganglia1.build.scl1.mozilla.com/ganglia/?c=RelEngSCL1&h=signing2.build.scl1.mozilla.com&m=load_one&r=hour&s=descending&hc=4&mc=2
These used to work, not sure what changed.
Reporter | ||
Comment 1•13 years ago
|
||
sorry, I should have added that these machines are up and healthy, it's just ganglia that's confused.
Assignee | ||
Comment 2•13 years ago
|
||
If I recall correctly, bhearsum added iptables filtering to these machines. Quite possibly that's what broke ganglia. If you turn off the filtering, do they come back (I don't want to do so since I don't want to break/interrupt whatever the filtering was set up for)?
Assignee | ||
Comment 3•13 years ago
|
||
Based on the fact that releng-mirror01 was also showing offline, I went digging and found that ganglia1 had the wrong netmask set. I corrected it and restarted gexecd and gmond on all of the feeder servers and the servers that were missing and everything appears to be back to normal now.
Assignee: server-ops-releng → arich
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Updated•12 years ago
|
Component: Server Operations: RelEng → RelOps
Product: mozilla.org → Infrastructure & Operations
You need to log in
before you can comment on or make changes to this bug.
Description
•