Closed Bug 926064 Opened 11 years ago Closed 11 years ago

Add SSL certificate check for mozilla-hispano.org to Nagios

Categories

(mozilla.org Graveyard :: Server Operations, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bburton, Assigned: afernandez)

References

Details

We managed the SSL certificate for the mozilla-hispano.org community and unfortunately because we don't normally add checks for these and because Geotrust failed to let us know it was expiring, it did expire on us, see bug 925952

Please add a Nagios check for the certificate

Thanks
See Also: → 925952
http/s+ssl expiring check added in r76357.
Assignee: server-ops → afernandez
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
(In reply to Brandon Burton [:solarce] from comment #0)
> We managed the SSL certificate for the mozilla-hispano.org community and
> unfortunately because we don't normally add checks for these and because
> Geotrust failed to let us know it was expiring, it did expire on us, see bug
> 925952
> 
> Please add a Nagios check for the certificate
> 
> Thanks

The reason we didn't add a check before is it's not hosted by us and sysadmins need not be woken up at 3am on a weekend to be told that a site that's not hosted by us has an SSL certificate expiring in 14 days. 

Aj, can we make sure this is business hours only please?
Status: RESOLVED → REOPENED
Flags: needinfo?(afernandez)
Resolution: FIXED → ---
Change committed in r76567;
@@ -21024,7 +21024,7 @@
     retry_interval          1
     max_check_attempts      2
     contact_groups          sysalerts
-    notification_period     24x7
+    notification_period     weekdayhours
     notification_options    d,u,r,s
     hostgroups              https-websites, http-websites
     }
@@ -21038,7 +21038,7 @@
     event_handler                   ping-event-handler
     check_command                   check-host-alive!10.22.75.42
     contact_groups                  sysalerts
-    notification_period             24x7
+    notification_period             weekdayhours
     notes_url                       http://m.allizom.org/PING
     }
Status: REOPENED → RESOLVED
Closed: 11 years ago11 years ago
Flags: needinfo?(afernandez)
Resolution: --- → FIXED
This didn't quite work, I got paged on a Sunday morning for < nagios-scl3> | Sun 08:31:51 PDT [5864] www.mozilla-hispano.org:HTTP is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 579 bytes in 0.346 second response time 

on a site we don't host or control. 

r77118 has :

--- scl3.pp	(revision 77117)
+++ scl3.pp	(working copy)
@@ -6310,7 +6310,6 @@
             notification_period   => 'weekdayhours',
             hostgroups => [
                 'https-websites',
-                'http-websites',
             ]
Removing ping check so that this won't alert when the site is down:

         'www.mozilla-hispano.org' => {
+            check_command => 'check_dummy!0',
             notification_period   => 'weekdayhours',
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.