Nagios is repeatedly but not consistently having timeouts connecting to jenkins1.dmz.phx1.mozilla.com. Additionally, jenkins1 can't reach its NTP time server and appears to sometimes timeout connecting to the puppet master. I've looked on the server side and don't see any issues. Can you verify everything looks ok from the switch/firewall/whatever side?
Failed bond0 over to eth1 to see if that helps with the timeouts.
Assignee: network-operations → server-ops
Component: Server Operations: Netops → Server Operations
QA Contact: ravi → shyam
So far, it seems to have helped. One oustanding alert cleared immediately. It can now reach its time server. Something about the bond or interface must have been in a bad state.
Status: NEW → RESOLVED
Last Resolved: 5 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.