Closed
Bug 918803
Opened 12 years ago
Closed 12 years ago
buildbot-master42,43 have very few or no pandas attached
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: kmoir, Assigned: kmoir)
Details
On Wednesday I noticed that buildbot-master42 had no pandas attached. It should have 522-544, 258-307, 320-381. I reimaged the pandas with the android image, they attached and ran some jobs. The same problem is happening today. The master isn't showing any buildslaves attached and many of the devices have error.flgs. I verified the entries in the devices.json look good. The same thing appears to be happening for buildbot-master43, it looks like only has a couple of pandas attached.
Comment 1•12 years ago
|
||
kmoir, is this something you or pmoore could help investigate?
I don't think I will have time as buildduty to look into it as there are other bugs that I have to deal with.
| Assignee | ||
Comment 2•12 years ago
|
||
Yes, I'm looking at it. Bm43 looks good this morning, I'm investigating why bm42 continues to have problems.
| Assignee | ||
Updated•12 years ago
|
Assignee: nobody → kmoir
| Assignee | ||
Comment 3•12 years ago
|
||
Both masters were in the same state this morning, very few pandas attached.
I looked at the logs on mozpool and it looked like most of the pandas ran selftest.py which passed but then on the foopies a connection could not be made thus they didn't connect to the master.
http://mobile-imaging-010.p10.releng.scl1.mozilla.com/ui/log.html?device=panda-0546
They also didn't show the image as android in mozpool, only blank.
I'm not sure what's happening here, Callek do you have any suggestions? If not I'll talk to the ateam.
I reimaged the devices to get them to connect, since our current capacity is low with that many pandas down.
Flags: needinfo?(bugspam.Callek)
| Assignee | ||
Comment 4•12 years ago
|
||
I talked to Callek about this yesterday, he thinks that changing the scripts so that everything is rebooted via mozpool (bug 919533 and bug 889967) will resolve this problem. In the interim, I reimaged all the pandas and they seem to be staying up for the last 24 hours.
| Assignee | ||
Comment 5•12 years ago
|
||
This is not a problem any more. Not sure if this resolved itself since the devices are all managed by mozpool. Will work on resolving bug 919533 and bug 889967 in any case.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
Updated•12 years ago
|
Flags: needinfo?(bugspam.Callek)
Updated•7 years ago
|
Product: Release Engineering → Infrastructure & Operations
Updated•6 years ago
|
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•