Status

mozilla.org Graveyard
Server Operations
--
minor
RESOLVED FIXED
9 years ago
2 years ago

People

(Reporter: mrz, Assigned: dmoore)

Tracking

Bug Flags:
needs-downtime +

Details

(Reporter)

Description

9 years ago
DURATION:

Sunday, January 11th from 8:00 PM to 11:00 PM.


PURPOSE:

Scheduled sequential downtime of the MPT core1 and core2 switches for IOS software upgrades. This IOS upgrade is required to support our next generation of integrated load balancing hardware modules.

SCHEDULE:

Prior to upgrade: Load 12.2(18)SXF11 firmware to local flash storage on each switch. Create running-config backups on local flash storage on each switch. Archive output of the "show module", "show test all", "show port status", "show channel", and "show trunk" commands for later review.

8:00 PM 01/11/09: Perform IOS upgrade of core1 by copying firmware to the bootflash and updating necessary boot variables. Reload core1 and monitor boot progress. Validate output of show module, test, port, channel, and trunk. Confirm ACE module is properly detected and initialized. Soak and monitor network for 1 hour.

9:00 PM 01/11/09: Perform IOS upgrade of core2, as above.

10:00 PM 01/11/09: Upgrade complete.
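The archive-then-validate step in the schedule can be sketched as a plain text diff of each command's output before and after the reload. This is only an illustration: the function name `diff_show_output` and the sample "show module" text are hypothetical and are not taken from the actual switches.

```python
import difflib


def diff_show_output(before, after, command):
    """Return unified-diff lines between pre- and post-upgrade output of one command."""
    return list(
        difflib.unified_diff(
            before.splitlines(),
            after.splitlines(),
            fromfile=command + " (before)",
            tofile=command + " (after)",
            lineterm="",
        )
    )


# Hypothetical captured output, invented for illustration only.
before = "Mod Ports Status\n1 48 Ok\n2 48 Ok\n"
after = "Mod Ports Status\n1 48 Ok\n2 48 PwrDown\n"

changes = diff_show_output(before, after, "show module")
for line in changes:
    print(line)
```

An empty result means the command output is unchanged across the reload; any line starting with "-" or "+" points at a module, port, channel, or trunk that needs a closer look.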
(Reporter)

Updated

9 years ago
Assignee: server-ops → dmoore
Flags: needs-downtime+
(Assignee)

Comment 1

9 years ago
Upgrade process has been postponed for one hour (to 9:00 PM), pending resolution of unrelated networking issues.
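A quick check that the postponed start still fits the announced window. This sketch assumes the one-hour slots from the original schedule are kept after the delay, which the comment does not state; `build_timeline` and the step labels are hypothetical names.

```python
from datetime import datetime, timedelta


def build_timeline(start, steps, slot=timedelta(hours=1)):
    """Assign each step a start time, one slot apart, beginning at `start`."""
    timeline = []
    current = start
    for step in steps:
        timeline.append((current, step))
        current += slot
    return timeline


STEPS = ["Upgrade core1", "Upgrade core2", "Upgrade complete"]

original_start = datetime(2009, 1, 11, 20, 0)  # 8:00 PM per the description
window_end = datetime(2009, 1, 11, 23, 0)      # 11:00 PM per the description

# Shift the whole plan by the one-hour postponement.
postponed = build_timeline(original_start + timedelta(hours=1), STEPS)
fits = postponed[-1][0] <= window_end

for when, step in postponed:
    print(when.strftime("%I:%M %p"), step)
print("fits window:", fits)
```

Under that assumption the final step lands exactly at the end of the window, leaving no slack for overruns.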
(Reporter)

Updated

9 years ago
Depends on: 473113
(Reporter)

Updated

9 years ago
Depends on: 473116
(Assignee)

Comment 2

9 years ago
Upgrade for core1 was completed within window. Core2 upgrade is postponed until dependencies have been resolved.
(Assignee)

Comment 3

9 years ago
More detail on core1 upgrade failure:

The IOS upgrade image was stored on core1's 1GB Compact Flash. Compact Flash was used because the internal supervisor bootflash was limited to 64MB, making it unable to hold two images simultaneously. The old IOS image was left untouched on the internal bootflash.

Core1 was reloaded with the intent of booting off of Compact Flash. Rommon was unable to read the Compact Flash card, halting the boot process. A manual attempt to revert to the internal bootflash revealed the old image to be corrupt or inaccessible. At that point, no images were available to boot core1.

A Compact Flash card formatted in IOS on core2 (rather than left with the factory default format) proved readable by core1's rommon. The card was formatted in core2, images were loaded onto it via TFTP, and it was then manually transferred to core1. Core1 was able to boot from the newly formatted card and returned to service.
(Reporter)

Comment 4

9 years ago
Yep. Log caught this when you moved cards around:

Jan 11 22:44:23 core2 3927937: Jan 11 22:44:21.978 PDT: %PCMCIAFS-SP-5-DIBERR: PCMCIA disk 0 is formatted from a different router or PC. A format in this router is required before  an image can be booted from this device
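For anyone grepping logs for this condition, here is a minimal sketch that splits the IOS message code into its parts. It assumes the usual %FACILITY[-SUBFACILITY]-SEVERITY-MNEMONIC layout; the names `IOS_MSG` and `parse_ios_message` are hypothetical.

```python
import re

# Assumed layout: %FACILITY[-SUBFACILITY]-SEVERITY-MNEMONIC: text
IOS_MSG = re.compile(
    r"%(?P<facility>[A-Z0-9_]+)"
    r"(?:-(?P<subfacility>[A-Z0-9_]+))?"
    r"-(?P<severity>[0-7])"
    r"-(?P<mnemonic>[A-Z0-9_]+):\s*(?P<text>.*)"
)


def parse_ios_message(line):
    """Return the message-code fields as a dict, or None if the line has no code."""
    match = IOS_MSG.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["severity"] = int(fields["severity"])
    return fields


# The log line quoted in the comment above.
line = (
    "Jan 11 22:44:23 core2 3927937: Jan 11 22:44:21.978 PDT: "
    "%PCMCIAFS-SP-5-DIBERR: PCMCIA disk 0 is formatted from a different "
    "router or PC. A format in this router is required before  an image "
    "can be booted from this device"
)

parsed = parse_ios_message(line)
print(parsed["facility"], parsed["subfacility"], parsed["severity"], parsed["mnemonic"])
```

Severity 5 is the "notifications" level, so a filter that only surfaces errors and above would not show this message.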
(Reporter)

Updated

9 years ago
Depends on: 473122
(Reporter)

Updated

9 years ago
Blocks: 475371
(Assignee)

Comment 5

8 years ago
core2 upgrade proceeding as planned, beginning at 17:00
(Assignee)

Comment 6

8 years ago
core2 upgrade completed.

Notes:

* A configuration mismatch prevented certain portchannels (7,8,10) from rebuilding properly. This was corrected manually.

* ns-15k is single-homed to core2, so it and all VIPs it services were temporarily unavailable during the reload.
Status: NEW → RESOLVED
Last Resolved: 8 years ago
Resolution: --- → FIXED
Product: mozilla.org → mozilla.org Graveyard