Closed Bug 1119613 Opened 9 years ago Closed 9 years ago

Adjust nagios alert levels for git1.dmz.scl3.mozilla.com:Load

Categories

(Infrastructure & Operations :: MOC: Service Requests, task)

Type: task
Priority: Not set
Severity: normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: hwine, Assigned: rwatson)

Details

Attachments

(1 file)

Git spawns many processes for small tasks, so git servers operate just fine with much higher load numbers than usual.

Please set the current "critical" level to be the new warning level, and make the critical level be 150.

Today's event alerted at a load of 113 (140 seen upon login). A previous alerting "non-event" was also around 104 (bug 1087597).

An event that did require intervention (but not a restart) had load numbers > 500 for more than 4 hours (see bug 1087640, attachment 8510038).

The proposed values would only have alerted for the case where action was needed.
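
As a rough check of that claim, the load values quoted in this bug can be compared against the proposed critical level of 150. This is only a sketch: the names `PROPOSED_CRITICAL`, `OBSERVED`, `is_critical`, and `critical_events` are made up for illustration, the value 500 stands in for the "> 500" event, and the "at or above the threshold" comparison is an assumption about how the load check evaluates. The new warning level is not modelled, because the current critical value is not stated here.

```python
# Sketch only: classify the load values quoted in this bug against the
# proposed critical level. All names here are hypothetical.

PROPOSED_CRITICAL = 150  # proposed critical level from this request

# Load values quoted in this bug; 500 is a lower bound for the "> 500" event.
OBSERVED = {
    "today's alert": 113,
    "seen upon login": 140,
    "bug 1087597 non-event": 104,
    "bug 1087640 event (> 500)": 500,
}


def is_critical(load, critical=PROPOSED_CRITICAL):
    """Return True when load is at or above the critical threshold (assumed rule)."""
    return load >= critical


def critical_events(observed, critical=PROPOSED_CRITICAL):
    """Return the names of observations that would have gone critical."""
    return [name for name, load in observed.items() if is_critical(load, critical)]


if __name__ == "__main__":
    for name, load in OBSERVED.items():
        state = "CRITICAL" if is_critical(load) else "not critical"
        print(f"{name}: load {load} -> {state}")
```

Under these assumptions, only the bug 1087640 event crosses the critical level; the 104, 113, and 140 readings stay below it.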

(We do expect to see higher load on git.mozilla.org as the FxOS release engineering team starts taking more of the build load. We may need to adjust further.)
1 yr max process load on git1
Assignee: nobody → rwatson
Worked with Ashish on this. Adjustments have been made. 

"Please set the current "critical" level to be the new warning level, and make the critical level be 150."
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED