PostgreSQL Number Files in wal_archive nagios check fails on socorro3.db.phx1

RESOLVED FIXED

Status

RESOLVED FIXED
5 years ago
4 years ago

People

(Reporter: mpressman, Assigned: mpressman)

Tracking

Details

(Assignee)

Description

5 years ago
This check fails with
NRPE: Command 'check_numfiles' not defined
(Assignee)

Updated

5 years ago
Assignee: server-ops-database → mpressman
puppetized the check_numfiles script - 

Sheeri-Cabral:manifests scabral$ svn commit -m "making it so socorro dbs have the proper nagios plugin check_numfils.sh"
Sending        db.pp
Transmitting file data .
Committed revision 79687.

but the pgexperts.cfg has to be put in place, and I'm not sure socorro::db is the right place for it, because only the places with WAL in place should have it.(so maybe the check_numfiles itself should move to wherever that goes...)
(Assignee)

Comment 2

5 years ago
There is an error on tp-socorro01-master02 as well which fails with looking into /var/lib/pgsql/wal_archive since it was moved to /wal/9.2

The argument in pg-wal-num-files for service check_postgresql_wal_archive is hard coded to point to /var/lib/pgsql/wal_archive

So we need to do two things, one for socorro3.db.phx1, make sure that check_numfiles.sh exists in the proper location and edit the nagios check itself to make sure it knows where to look for WAL
(Assignee)

Comment 3

4 years ago
this is now working
Status: NEW → RESOLVED
Last Resolved: 4 years ago
Resolution: --- → FIXED
Product: mozilla.org → Data & BI Services Team
You need to log in before you can comment on or make changes to this bug.