Closed Bug 856734 Opened 11 years ago Closed 11 years ago

mm-ci-master appears to be hung

Categories

(Mozilla QA Graveyard :: Infrastructure, defect)

defect
Not set
blocker

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: u279076, Assigned: rwood)

Details

Matt Wobensmith noticed that he hasn't received any 17.0.5esr results in the mozmill-ci dashboard since getting candidate builds on Friday. I checked the mozmill-ci emails and there are a lot of results with no dashboard URLs. When I try to load http://mm-ci-master.qa.scl3.mozilla.com:8080/ it takes a very very long time. There is also a considerable backlog of testruns which have been in queue for 3 hours.

I'm not sure how to debug this any further. This blocks our ability to qualify 17.0.5esr and 20.0RC (due to release tomorrow AM).
Rwood, time to put all that knowledge you learned from Henrik to action -- can you see if you can debug what's going on?  Perhaps the system just needs to be rebooted?
I tried restarting just jenkins but it wouldn't respond; so I did an entire restart of master. Master/jenkins etc. and dashboard now back online.
Site is accessible and I was able to kick off a test run, but pages only load partially and site appears almost too slow to be usable.
Hmmm... the site is running fine for me, and pages for the various slaves, etc. load fine and quickly...
Sadly, not for me. In fact, now the site connection is timing out completely.
(In reply to Matt Wobensmith from comment #5)
> Sadly, not for me. In fact, now the site connection is timing out completely.

It's nice and snappy now on my end. Matt, can you try clearing your cache?
It's up. I was someone switched off my VPN connection; now I'm back and site loads great.
Great, thank you Matt and thank you Rob for getting this fixed.
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Thank you Rob for taking care of it. One thing through... Have you observed the system load and processes before you restarted the whole machine?  I'm really eager to know what happened to understand the problem and so we can make sure it will not happen again. Rebooting the server is really the last resort. Thanks.
Assignee: nobody → rwood
Product: Mozilla QA → Mozilla QA Graveyard
You need to log in before you can comment on or make changes to this bug.