Closed Bug 481755 Opened 15 years ago Closed 15 years ago

Terminate all BGP sessions on border1 and restore in proper order

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: dmoore, Assigned: dmoore)

Details

Derek Moore [:dmoore]

Assignee

Description

•

15 years ago

In order to properly restore all BGP sessions on border1, it is necessary to establish them in a particular order to avoid overloading the CPU. In particular, the overhead of establishing a BGP session with Mzima can cause timeouts and flapping of our OSPF sessions with GNI. If our BGP session with GNI is already established, this can cause an unintentional reset which quickly leads to a fatal chain-reaction of BGP session failures.

It appears the desired bring-up order will be:

* Any2 peers
* Mzima #1
* Mzima #2
* Internal sessions
* GNI

Currently, only our session with GNI is active on border1. To bring all peers online, we will need to intentionally terminate this session and reactivate all peers in the above order.

Derek Moore [:dmoore]

Assignee

Comment 1

•

15 years ago

Note: Ideally, the OSPF issue will be resolved by experimenting with more liberal timeouts during a future maintenance window.

matthew zeier [:mrz]

Updated

•

15 years ago

Group: infra

Derek Moore [:dmoore]

Assignee

Comment 2

•

15 years ago

Process completed.

During the phased activation of BGP peers, CPU utilization remained higher than expected. border1 was switching a significant number (~75%) of packets in software, saturating the CPU. Consultation with Cisco TAC suggested a memory management problem with the TCAM. The quickest solution was a reboot of border1. After the reboot, all BGP peers were restored without issue and CPU utilization remained minimal.

A second Level3 gigabit circuit, previously disabled due to faulty media, was also enabled during this window.

Status: NEW → RESOLVED

Closed: 15 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

9 years ago

Product: mozilla.org → mozilla.org Graveyard

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Terminate all BGP sessions on border1 and restore in proper order

Categories

(mozilla.org Graveyard :: Server Operations, task)

Tracking

(Not tracked)

People

(Reporter: dmoore, Assigned: dmoore)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Updated

Comment 2

Updated