Add SSL certificate check for mozilla-hispano.org to Nagios

Status

RESOLVED FIXED
5 years ago
4 years ago

People

(Reporter: bburton, Assigned: afernandez)

(Reporter)

Description

5 years ago
We managed the SSL certificate for the mozilla-hispano.org community. Unfortunately, because we don't normally add checks for these certificates and because GeoTrust failed to let us know it was expiring, it expired on us; see bug 925952.

Please add a Nagios check for the certificate.

Thanks
(Reporter)

Updated

5 years ago
See Also: → bug 925952
(Assignee)

Comment 1

5 years ago
HTTP/HTTPS and SSL certificate expiry checks added in r76357.
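For readers unfamiliar with how such a check is usually wired up, the following is a minimal sketch in Nagios object configuration. It is not the content of r76357, which this bug does not show. The command name `check_https_cert_expiry`, the `generic-service` template, and the service description are hypothetical; the 14-day threshold is borrowed from the figure mentioned later in this bug. The `-S`, `--sni`, and `-C` options are standard `check_http` plugin options.

```cfg
# Hypothetical command: raise an alert when the certificate served by $ARG1$
# is valid for fewer than $ARG2$ days.
define command {
    command_name    check_https_cert_expiry
    command_line    $USER1$/check_http -H $ARG1$ -S --sni -C $ARG2$
}

# Hypothetical service for the host discussed in this bug.
define service {
    use                     generic-service        ; assumed site-wide template
    host_name               www.mozilla-hispano.org
    service_description     SSL certificate expiry
    check_command           check_https_cert_expiry!www.mozilla-hispano.org!14
    contact_groups          sysalerts
}
```

When `-C` is given, `check_http` inspects only the certificate by default and does not fetch the URL, so this service would report on expiry independently of whether the site returns an HTTP error.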
Assignee: server-ops → afernandez
Status: NEW → RESOLVED
Last Resolved: 5 years ago
Resolution: --- → FIXED
Comment 2

(In reply to Brandon Burton [:solarce] from comment #0)
> We managed the SSL certificate for the mozilla-hispano.org community and
> unfortunately because we don't normally add checks for these and because
> Geotrust failed to let us know it was expiring, it did expire on us, see bug
> 925952
> 
> Please add a Nagios check for the certificate
> 
> Thanks

The reason we didn't add a check before is that the site is not hosted by us, and sysadmins should not be woken up at 3am on a weekend to be told that a site we don't host has an SSL certificate expiring in 14 days.

Aj, can we make sure this is business hours only, please?
Status: RESOLVED → REOPENED
Flags: needinfo?(afernandez)
Resolution: FIXED → ---
(Assignee)

Comment 3

5 years ago
Change committed in r76567:
@@ -21024,7 +21024,7 @@
     retry_interval          1
     max_check_attempts      2
     contact_groups          sysalerts
-    notification_period     24x7
+    notification_period     weekdayhours
     notification_options    d,u,r,s
     hostgroups              https-websites, http-websites
     }
@@ -21038,7 +21038,7 @@
     event_handler                   ping-event-handler
     check_command                   check-host-alive!10.22.75.42
     contact_groups                  sysalerts
-    notification_period             24x7
+    notification_period             weekdayhours
     notes_url                       http://m.allizom.org/PING
     }
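The diff above switches `notification_period` to `weekdayhours`, but the definition of that time period is not shown in this bug. As a rough sketch only, a Nagios timeperiod with that name could look like the following; the alias and the 09:00-17:00 hours are assumptions, not the production values.

```cfg
# Hypothetical definition; the real hours used in production are not given in this bug.
# Days that are not listed (saturday, sunday) contain no valid time ranges,
# so notifications are suppressed on weekends.
define timeperiod {
    timeperiod_name weekdayhours
    alias           Weekday business hours
    monday          09:00-17:00
    tuesday         09:00-17:00
    wednesday       09:00-17:00
    thursday        09:00-17:00
    friday          09:00-17:00
}
```

Note that `notification_period` only controls when notifications may be sent for the object it is set on; the checks themselves still run according to `check_period`.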
Status: REOPENED → RESOLVED
Last Resolved: 5 years ago
Flags: needinfo?(afernandez)
Resolution: --- → FIXED
This didn't quite work. I got paged on a Sunday morning, for a site we don't host or control, with:

<nagios-scl3> | Sun 08:31:51 PDT [5864] www.mozilla-hispano.org:HTTP is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 579 bytes in 0.346 second response time

r77118 has:

--- scl3.pp	(revision 77117)
+++ scl3.pp	(working copy)
@@ -6310,7 +6310,6 @@
             notification_period   => 'weekdayhours',
             hostgroups => [
                 'https-websites',
-                'http-websites',
             ]

Removing the ping check so that this won't alert when the site is down:

         'www.mozilla-hispano.org' => {
+            check_command => 'check_dummy!0',
             notification_period   => 'weekdayhours',
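The `check_dummy!0` value above relies on a command named `check_dummy` being defined in the Nagios configuration; that definition is not shown in this bug. A plausible sketch, assuming the standard `check_dummy` plugin is installed under `$USER1$`:

```cfg
# Hypothetical command definition. The check_dummy plugin simply exits with
# the state passed as its first argument (0 = OK, 1 = WARNING, 2 = CRITICAL).
define command {
    command_name    check_dummy
    command_line    $USER1$/check_dummy $ARG1$
}
```

Used as a host check with argument 0, this always reports the host as UP, so host DOWN or UNREACHABLE notifications never fire for it; service checks attached to the host still run.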
Product: mozilla.org → mozilla.org Graveyard