Closed
Bug 1022267
Opened 11 years ago
Closed 11 years ago
B2G non-unified builds are falling way behind
Categories
(Release Engineering :: General, defect)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: RyanVM, Assigned: rail)
Details
Throughout the day, I've seen multiple instances where pending jobs from the *prior* periodic run still hadn't started when the next periodic run kicked off. Right now, I'm looking at m-c tip, where the Windows PGO build has completed and finished running its tests, but 5 of the 6 periodic B2G non-unified builds requested at the same time are still pending.
Reporter
Updated•11 years ago
Severity: normal → critical
Reporter
Comment 1•11 years ago
All 5 of the jobs in comment 0 are still pending.
Flags: needinfo?(rail)
Assignee
Comment 2•11 years ago
Looks like we don't start new instances for nonunified builders. I pushed https://hg.mozilla.org/build/cloud-tools/rev/5f550c1b1762.
Keeping an eye on it now.
Flags: needinfo?(rail)
Assignee
Comment 3•11 years ago
Looks better now:
2014-06-08 07:22:44,697 - b2g_mozilla-central_emulator-debug_nonunified instance type bld-linux64 slaveset frozenset([u'bld-linux64-spot-131', u'bld-linux64-spot-138', u'bld-linux64-spot-139'])
(which matches http://jacuzzi-allocator.pub.build.mozilla.org/v1/builders/b2g_mozilla-central_emulator-debug_nonunified)
2014-06-08 07:22:51,851 - 0 running for spot bld-linux64 frozenset([u'bld-linux64-spot-131', u'bld-linux64-spot-138', u'bld-linux64-spot-139']) (0 fresh)
2014-06-08 07:23:48,966 - need 1 spot bld-linux64 for slaveset frozenset([u'bld-linux64-spot-131', u'bld-linux64-spot-138', u'bld-linux64-spot-139'])
2014-06-08 07:23:52,824 - Spot request for bld-linux64-spot-131.build.releng.use1.mozilla.com (0.18)
2014-06-08 07:23:54,719 - bld-linux64 - started 1 spot instances for slaveset frozenset([u'bld-linux64-spot-131', u'bld-linux64-spot-138', u'bld-linux64-spot-139']); need 0
Assignee
Comment 4•11 years ago
https://tbpl.mozilla.org/?jobname=.*_nonunified looks much better now; at least there are no pending jobs.
Assignee
Comment 5•11 years ago
I see that we started some jacuzzied instances for m-i non-unified builds as well.
Assignee
Comment 6•11 years ago
I just checked the only pending non-unified build on m-i. All of the slaves in http://jacuzzi-allocator.pub.build.mozilla.org/v1/builders/b2g_mozilla-inbound_emulator-jb-debug_nonunified are up and doing something. Sounds expected.
Assignee
Comment 7•11 years ago
Assignee
Updated•11 years ago
Assignee: nobody → rail
Assignee
Comment 8•11 years ago
Also added flame builds in https://hg.mozilla.org/build/cloud-tools/rev/6e474160aa7b
Assignee
Comment 9•11 years ago
B2g-Inbound has no pending builds now...
Assignee
Comment 10•11 years ago
Same for m-i. The issue should be fixed now. I'll keep the bug open till tomorrow to make sure everything is OK.
Severity: critical → normal
Reporter
Comment 11•11 years ago
Thanks for the quick work, Rail. Much appreciated :)
Comment 12•11 years ago
How can we track this kind of problem more generally? In theory it should be possible to take the builder list from allthethings.json, determine which jobs run on AWS slaves (at least in part), and check that the buildermap handles them. Possibly running on Jenkins.
Assignee
Comment 13•11 years ago
(In reply to Nick Thomas [:nthomas] from comment #12)
> How can we track this kind of problem more generally? In theory it should
> be possible to take the builder list from allthethings.json, determine which
> jobs run on AWS slaves (at least in part), and check that the buildermap
> handles them. Possibly running on Jenkins.
I think we can use allthethings.json to build the map dynamically. I'll file a follow-up bug.
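A minimal sketch of the check discussed in comment 12, not the actual cloud-tools code. It assumes allthethings.json has a "builders" mapping whose entries carry a "slavepool" key and a "slavepools" mapping from pool id to slave names, that AWS slaves can be recognised by "-spot-" or "-ec2-" in their names, and that the buildermap is a dict of builder-name regexes to instance types. The function names and the sample data are hypothetical.

```python
import re


def aws_builders(allthethings, aws_markers=("-spot-", "-ec2-")):
    """Return builders whose slave pool contains at least one AWS slave.

    The JSON layout and the name markers are assumptions, see above.
    """
    pools = allthethings.get("slavepools", {})
    names = []
    for name, info in allthethings.get("builders", {}).items():
        slaves = pools.get(info.get("slavepool"), [])
        if any(marker in slave for slave in slaves for marker in aws_markers):
            names.append(name)
    return sorted(names)


def unmapped_builders(builder_names, buildermap):
    """Return the builders that no buildermap regex matches."""
    patterns = [re.compile(pattern) for pattern in buildermap]
    return [
        name for name in builder_names
        if not any(pattern.match(name) for pattern in patterns)
    ]


if __name__ == "__main__":
    # Hypothetical sample data; a real run would load allthethings.json.
    sample = {
        "builders": {
            "b2g_mozilla-central_emulator-debug": {"slavepool": "p1"},
            "b2g_mozilla-central_emulator-debug_nonunified": {"slavepool": "p1"},
        },
        "slavepools": {"p1": ["bld-linux64-spot-131"]},
    }
    sample_map = {r"b2g_.*_emulator-debug$": "bld-linux64"}
    for missing in unmapped_builders(aws_builders(sample), sample_map):
        print("no buildermap entry for", missing)
```

Any builder this reports would never get instances started for it, which is the symptom seen in this bug, so a periodic job could run the check and alert on non-empty output.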
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Updated•7 years ago
Component: General Automation → General