Closed
Bug 930024
Opened 11 years ago
Closed 11 years ago
Disconnects across multiple trees and platforms
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Tracking
(Not tracked)
RESOLVED
DUPLICATE
of bug 918677
People
(Reporter: emorley, Unassigned)
Details
For example:
https://tbpl.mozilla.org/?rev=8803e8c0ee3e
https://tbpl.mozilla.org/?rev=c0e6e76aafed
https://tbpl.mozilla.org/?tree=Mozilla-Inbound&rev=124fc1fcd3eb
All main non-try trees closed.
Comment 1•11 years ago
Looks like one of the VPN tunnels to us-east-1 went down at 10:27ET. Failed over to the other tunnel.
Comment 2•11 years ago
Tunnel 1  72.21.209.193  UP    2013-10-19 05:21 EDT  1 BGP ROUTES
Tunnel 2  72.21.209.225  DOWN  2013-10-23 10:27 EDT  IPSEC IS UP
Comment 3•11 years ago
(In reply to Chris AtLee [:catlee] from comment #1)
> Looks like one of the VPN tunnels to us-east-1 went down at 10:27ET. Failed
> over to the other tunnel.
Once we're out of the woods, where do you get to see that failure? I don't see anything on #buildduty. Thanks!
Comment 4•11 years ago
Tunnel 2 looks like it's back up.
Tunnel 1  72.21.209.193  UP  2013-10-19 05:21 EDT  1 BGP ROUTES
Tunnel 2  72.21.209.225  UP  2013-10-23 11:08 EDT  1 BGP ROUTES
Armen, this is from Amazon's VPC dashboard.
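For anyone wanting to work out how long Tunnel 2 was down from the two dashboard snapshots in comments 2 and 4, here is a rough Python sketch. The regular expression and the function names `parse_tunnel_line` and `outage_minutes` are made up for illustration; they only assume the status lines look like the ones pasted in this bug, and are not an Amazon API.

```python
import re
from datetime import datetime

# Tunnel 2 status lines copied from comments 2 and 4 of this bug.
BEFORE = "Tunnel 2 72.21.209.225 DOWN 2013-10-23 10:27 EDT IPSEC IS UP"
AFTER = "Tunnel 2 72.21.209.225 UP 2013-10-23 11:08 EDT 1 BGP ROUTES"

# Hypothetical pattern: tunnel number, outside IP, status,
# time of last status change (EDT), free-form details.
LINE_RE = re.compile(
    r"Tunnel (?P<num>\d+) (?P<ip>\S+) (?P<status>UP|DOWN) "
    r"(?P<changed>\d{4}-\d{2}-\d{2} \d{2}:\d{2}) EDT (?P<details>.+)"
)


def parse_tunnel_line(line):
    """Split one pasted status line into a dict of fields."""
    match = LINE_RE.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"unrecognised status line: {line!r}")
    record = match.groupdict()
    record["num"] = int(record["num"])
    # Both snapshots are in EDT, so naive datetimes can be subtracted safely.
    record["changed"] = datetime.strptime(record["changed"], "%Y-%m-%d %H:%M")
    return record


def outage_minutes(down_line, up_line):
    """Minutes between the DOWN transition and the following UP transition."""
    down = parse_tunnel_line(down_line)
    up = parse_tunnel_line(up_line)
    if down["num"] != up["num"]:
        raise ValueError("status lines refer to different tunnels")
    return int((up["changed"] - down["changed"]).total_seconds() // 60)


print(outage_minutes(BEFORE, AFTER))
```

The dashboard only shows the time of the last status change, so this measures the gap between the two reported transitions, not necessarily every flap in between.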
Comment 5•11 years ago
@timestamp,@source_host,@message
2013-10-23T14:27:53.000Z,fw1.releng.console.scl3.mozilla.net,%-RPD_BGP_NEIGHBOR_STATE_CHANGED: BGP peer 169.254.255.77 (External AS 7224) changed state from Established to Idle (event HoldTime)
This is the only important event that happened between 14:20 and 14:40 UTC on our infrastructure. Tunnels to AWS flap regularly (a lot more frequently than IPsec to any of our offices, for example).
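To line the firewall log up with the 10:27 ET time from comment 1, the UTC timestamp can be shifted by the EDT offset (UTC-4). Below is a minimal sketch that parses the CSV export above and pulls out the BGP state change. The function name `parse_bgp_events` and the message regex are assumptions based only on the single log line quoted here.

```python
import csv
import io
import re
from datetime import datetime, timedelta, timezone

# The log export quoted in comment 5.
LOG = """@timestamp,@source_host,@message
2013-10-23T14:27:53.000Z,fw1.releng.console.scl3.mozilla.net,%-RPD_BGP_NEIGHBOR_STATE_CHANGED: BGP peer 169.254.255.77 (External AS 7224) changed state from Established to Idle (event HoldTime)
"""

# Fixed offset for US Eastern Daylight Time, in effect on 2013-10-23.
EDT = timezone(timedelta(hours=-4), "EDT")

# Hypothetical pattern matching the message format seen in this bug.
STATE_RE = re.compile(
    r"BGP peer (?P<peer>\S+) \(External AS (?P<asn>\d+)\) changed state "
    r"from (?P<old>\w+) to (?P<new>\w+) \(event (?P<event>\w+)\)"
)


def parse_bgp_events(text):
    """Return one dict per BGP neighbor state change found in the export."""
    events = []
    for row in csv.DictReader(io.StringIO(text)):
        match = STATE_RE.search(row["@message"])
        if match is None:
            continue  # not a BGP state-change message
        utc = datetime.strptime(
            row["@timestamp"], "%Y-%m-%dT%H:%M:%S.%fZ"
        ).replace(tzinfo=timezone.utc)
        event = match.groupdict()
        event["host"] = row["@source_host"]
        event["utc_time"] = utc
        event["local_time"] = utc.astimezone(EDT)
        events.append(event)
    return events


for e in parse_bgp_events(LOG):
    print(e["local_time"].strftime("%H:%M %Z"), e["peer"], e["old"], "->", e["new"])
```

The HoldTime event at 14:27:53 UTC corresponds to 10:27 EDT, the same minute the VPC dashboard reports Tunnel 2 going down.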
Comment 6•11 years ago (Reporter)
Disconnects haven't occurred since, and the other tree carnage seems to be under control; reopening. Do we have a bug open for increasing the resilience of these VPN tunnels? (I forget)
Severity: blocker → critical
Comment 7•11 years ago
(In reply to Ed Morley [:edmorley UTC+1] from comment #6)
> Do we have a bug open for increasing the resilience of these VPN tunnels? (I
> forget)
Nothing more can be done on our side from what we can identify so far. We did open a new case with Amazon on the issue, and many more details are in Bug 918677. I'm going to close this one out in favor of the investigation going on in there.
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → DUPLICATE
Comment 8•11 years ago (Reporter)
sgtm, thank you :-)
Updated•6 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•4 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard