Closed Bug 1008413 Opened 10 years ago Closed 10 years ago

Bouncer.prod and Mozilla.org production/staging jobs (run from within the QA lab) see frequent 500s/503s

Categories

(Infrastructure & Operations Graveyard :: WebOps: Other, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED WORKSFORME

People

(Reporter: stephend, Unassigned)

Details

(Whiteboard: [fromAutomation])

Attachments

(3 files)

Attached file consoleText.txt
Jake tried to help us with this a bit earlier today, but we're still seeing it, and I want to make sure it's tracked, both for my team and for the project (Mozilla.org).

tl;dr: our bouncer.prod and the mozilla.com/.org production and staging jobs (run from within the QA lab) are seeing frequent 500s/503s.

<jakem> Service Protection Class banned-download: Too many concurrent connections from 63.245.221.36, dropped - 2 time(s)
<jakem> at 09/May/2014:11:09:28 -0700

http://selenium.qa.mtv2.mozilla.com:8080/view/Bouncer/job/bouncer.prod/38758/console
Flags: needinfo?(nmaul)
Moving this to the webops queue for Jake.
Assignee: infra → server-ops-webops
Component: Infrastructure: Other → WebOps: Other
QA Contact: jdow → nmaul
I added that IP to the allowed (un-throttled) list for bouncer, but not for mozilla.org... didn't realize you were getting similar problems for that, too.

I'll get that now. Will take a bit longer b/c it's in both datacenters.
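For readers unfamiliar with this kind of throttling, the behavior described above (connections dropped once one IP has too many open at once, unless that IP is on an allowed list) can be sketched roughly as below. This is only an illustration under assumptions: the class name `ConnectionThrottle`, its methods, and the limit of 2 are hypothetical, and this is not the load balancer's actual configuration or code.

```python
from collections import defaultdict


class ConnectionThrottle:
    """Hypothetical per-IP concurrent-connection limiter with an allowlist."""

    def __init__(self, max_concurrent, allowlist=()):
        self.max_concurrent = max_concurrent  # assumed limit, not the real value
        self.allowlist = set(allowlist)       # IPs that are never throttled
        self.active = defaultdict(int)        # open connections per throttled IP

    def try_open(self, ip):
        """Return True if a new connection from ip is accepted."""
        if ip in self.allowlist:
            return True  # allowlisted IPs bypass the limit and are not counted
        if self.active[ip] >= self.max_concurrent:
            return False  # too many concurrent connections: drop
        self.active[ip] += 1
        return True

    def close(self, ip):
        """Record that a previously accepted connection from ip has ended."""
        if ip not in self.allowlist and self.active[ip] > 0:
            self.active[ip] -= 1


# Without an allowlist entry, the third concurrent connection is dropped.
throttle = ConnectionThrottle(max_concurrent=2)
results = [throttle.try_open("63.245.221.36") for _ in range(3)]

# With the IP allowlisted, every connection is accepted.
unthrottled = ConnectionThrottle(max_concurrent=2, allowlist={"63.245.221.36"})
allowed = [unthrottled.try_open("63.245.221.36") for _ in range(3)]
```

In this sketch the allowlist is checked before any counting, which is why adding the QA lab's IP removes the drops without changing the limit for other clients.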
Severity: critical → major
Flags: needinfo?(nmaul)
Done as of now. Let me know if it gets any better. Dropping prio to prevent paging.
Severity: major → normal
(In reply to Jake Maul [:jakem] from comment #3)
> Done as of now. Let me know if it gets any better. Dropping prio to prevent
> paging.

Thx, Jake.  I think we can call this fixed!
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Should this be fixed on bouncer.prod? We are seeing similar failures in the Jenkins test runs.

http://selenium.qa.mtv2.mozilla.com:8080/view/All%20not%20B2G/job/bouncer.prod/39156/console
Flags: needinfo?(nmaul)
Jake, mind looking into the corresponding log entries for http://selenium.qa.mtv2.mozilla.com:8080/view/Bouncer/job/bouncer.prod/39186/console and friends?  Thanks!
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Switching to :jabba; can you help us figure out why tests run from our VLAN are experiencing 500s/ISEs from Bouncer, while the ones run from https://ci.mozilla.org/job/bouncer.prod/ don't?  Thanks!
Flags: needinfo?(nmaul) → needinfo?(jdow)
Looking at bouncer logs from around that time shows some connection failures to MySQL
Attached image bouncer-good.png
Hopefully I don't jinx us, but it seems to have been fixed between:

Success > Console Output #39517, May 14, 2014 12:43:28 PM
Failed > Console Output #39516, May 14, 2014 11:30:48 AM
Flags: needinfo?(jdow)
Not sure whether bug 1010383 or some other related work fixed this, so I'll mark it WFM for now.
Status: REOPENED → RESOLVED
Closed: 10 years ago
OS: Mac OS X → All
Hardware: x86 → All
Resolution: --- → WORKSFORME
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard