Closed Bug 1018181 Opened 11 years ago Closed 11 years ago

PostgreSQL Number Files in wal_archive on socorro3.db.phx1.mozilla.com is WARNING: 20 Files - warning

Categories

(Data & BI Services Team :: DB: MySQL, task)

Other
Other
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: nagiosapi, Assigned: bjohnson)

References

()

Details

(Whiteboard: [id=nagios1.private.phx1.mozilla.com:332312])

Automated alert report from nagios1.private.phx1.mozilla.com: Hostname: socorro3.db.phx1.mozilla.com Service: PostgreSQL Number Files in wal_archive State: WARNING Output: 20 Files - warning Runbook: http://m.allizom.org/PostgreSQL+Number+Files+in+wal_archive
This alert has been happening regularly overnight.
Assignee: nobody → bjohnson
Component: Server Operations: MOC → Server Operations: Database
QA Contact: bpannabecker → scabral
The alert is a check that has been downtimed because it relies on a script that counts the number of files in the wal directory. This is a backup to streaming replication should the stream lose sync it can pickup the wal that is shipped to this directory. Now that more and more data is being pumped into socorro, more wal is being shipped and the settings should be modified as such.
We've adjusted the alerting parameters warn/crit from 15/100 to 100/200. This should solve the issue.
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Product: mozilla.org → Data & BI Services Team
You need to log in before you can comment on or make changes to this bug.