Closed Bug 829169 Opened 11 years ago Closed 11 years ago

ftp nagios alerts going off shortly after unthrottling Firefox 18.0

Categories

(mozilla.org Graveyard :: Server Operations, task)

task
Not set
blocker

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bhearsum, Assigned: rbryce)

Details

Saw this from Nagios:
14:06 < nagios-scl3> Thu 11:06:19 PST [543] releases-zlb.vips.scl3.mozilla.com:http - service for ftp vips is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 
                     Internal Server Error - 630 bytes in 1.014 second response time (http://m.allizom.org/http+-+service+for+ftp+vips)
14:07 < nagios-scl3> Thu 11:07:09 PST [548] ftp3.dmz.scl3.mozilla.com:http - service for ftp vips is CRITICAL: Connection refused 
                     (http://m.allizom.org/http+-+service+for+ftp+vips)
14:07 < nagios-scl3> Thu 11:07:19 PST [550] ftp1.dmz.scl3.mozilla.com:http - service for ftp vips is CRITICAL: Connection refused 
                     (http://m.allizom.org/http+-+service+for+ftp+vips)
14:08 < nagios-scl3> Thu 11:08:28 PST [554] ftp6.dmz.scl3.mozilla.com:http - service for ftp vips is CRITICAL: Connection refused 
                     (http://m.allizom.org/http+-+service+for+ftp+vips)
14:09 < nagios-scl3> Thu 11:09:28 PST [559] ftp2.dmz.scl3.mozilla.com:http - service for ftp vips is CRITICAL: Connection refused 
                     (http://m.allizom.org/http+-+service+for+ftp+vips)
14:09 < nagios-scl3> Thu 11:09:38 PST [561] ftp1-zlb.vips.scl3.mozilla.com:https - service for ftp vips is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 
                     Internal Server Error - 630 bytes in 0.019 second response time (http://m.allizom.org/https+-+service+for+ftp+vips)
14:09 < nagios-scl3> Thu 11:09:39 PST [563] ftp4.dmz.scl3.mozilla.com:http - service for ftp vips is CRITICAL: Connection refused 
                     (http://m.allizom.org/http+-+service+for+ftp+vips)
14:11 < nagios-scl3> Thu 11:11:08 PST [567] releases-zlb.vips.scl3.mozilla.com:https - service for ftp vips is CRITICAL: CRITICAL - Socket timeout 
                     after 10 seconds (http://m.allizom.org/https+-+service+for+ftp+vips)


And I've seen complaints from some users too.
All trees closed, since jobs are failing with variations on https://tbpl.mozilla.org/php/getParsedLog.php?id=18680520&tree=Mozilla-Inbound
Escalating this to blocker since it prevents QA from qualifying updates.
Severity: critical → blocker
Assignee: server-ops → rbryce
Assignee: rbryce → server-ops
Blocks: 828236
Assignee: server-ops → mburns
We have noticed similar issues in recent weeks, when the apache config changes.  I had to restart 1 ftp server manually.  Still investigating, but this appears to be resolved.
Assignee: mburns → rbryce
No longer blocks: 828236
I think this is fixed now?
ftp1-6 are in working order now.  We do have a bug to resolve what I think is the underlying issue here.  Bug 826405
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Is it safe to reopen the trees?
(In reply to Rick Bryce [:rbryce] from comment #5)
> ftp1-6 are in working order now.  We do have a bug to resolve what I think
> is the underlying issue here.  Bug 826405

This is a legal bug, wrong bug #?
(In reply to Mounir Lamouri (:mounir) from comment #6)
> Is it safe to reopen the trees?

Done
(In reply to Ben Hearsum [:bhearsum] from comment #7)
> (In reply to Rick Bryce [:rbryce] from comment #5)
> > ftp1-6 are in working order now.  We do have a bug to resolve what I think
> > is the underlying issue here.  Bug 826405
> 
> This is a legal bug, wrong bug #?

Bug 826495
19b1 update testing seem fine on releasetest now.
Product: mozilla.org → mozilla.org Graveyard
You need to log in before you can comment on or make changes to this bug.