Closed Bug 661525 Opened 14 years ago Closed 14 years ago

Split up test masters by platform (again)

Categories

(Release Engineering :: General, defect, P2)

x86_64
Linux
defect

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: catlee, Assigned: catlee)

References

Details

(Whiteboard: [buildmasters])

Attachments

(8 files)

Our test master instances are huge (see bug 657954), and until we know why, we need to get them to be a bit slimmer. Let's start by setting up 3 test master instances, each handling one of fedora32/64, leopard/snowleopard, and winxp/win7. We'll also need separate pools in slavealloc for each of these platforms.
Depends on: 661650
filed bug 661650 for RelOps to create 3 new vm masters
Priority: -- → P3
Whiteboard: [buildmasters]
So this basically means pools tests-scl1-linux tests-scl1-darwin tests-scl1-windows which shouldn't be a problem. Let me know when the masters are set up and ready to go, and I'll add the pools. Then we can put the masters in those pools and start reassigning slaves to those pools, and finally delete the tests-scl1 pool.
During the reconfig today, bm04-tests and bm06-tests (possibly other test-masters as well) were unresponsive via http, but still running. bm04 reconfiged after multiple attempts; bm06 I had to gracefully shut down. The builder masters were also sluggish to reconfig, but on the whole I think the test masters overall needed a lot more help. Pretty sure this bug will help solve that sluggishness.
Assignee: nobody → catlee
Priority: P3 → P2
Do the puppet manifests or json need to indicate which platforms are running on a test master?
Attachment #538088 - Flags: review?(dustin)
Attachment #538091 - Flags: review?(bhearsum)
Comment on attachment 538091 [details] [diff] [review] json for buildbot-master{15,16} Review of attachment 538091 [details] [diff] [review]: -----------------------------------------------------------------
Attachment #538091 - Flags: review?(bhearsum) → review+
also fix slave port that should have landed a while ago
Attachment #538092 - Flags: review?(bhearsum)
Comment on attachment 538092 [details] [diff] [review] teach tests_localconfig.py about limit_platforms Review of attachment 538092 [details] [diff] [review]: -----------------------------------------------------------------
Attachment #538092 - Flags: review?(bhearsum) → review+
Attachment #538094 - Flags: review?(dustin)
Attachment #538094 - Flags: review?(dustin) → review+
Attachment #538088 - Flags: review?(dustin) → review+
Attachment #538091 - Flags: checked-in+
Depends on: 662994
Attachment #538088 - Flags: checked-in+
Attachment #538092 - Flags: checked-in+
Comment on attachment 538094 [details] slavealloc csv for new masters imported this into slavealloc, and marked the masters as disabled for now
Depends on: 663017
bm{15,16}-tests1-windows are now set up. they need the buildbot-configs patch above merged to production, and then updated on the masters. also waiting on nagios checks and mysql access. one those are ready, we can start the masters, and move some windows slaves into the tests-scl1-windows pool.
Attached patch enable bm15,16Splinter Review
Attachment #539229 - Flags: review?(lsblakk)
Attachment #539229 - Flags: review?(lsblakk) → review+
Attachment #539236 - Flags: review?(bhearsum)
Attachment #539229 - Flags: checked-in+
Attachment #539236 - Flags: review?(bhearsum) → review+
Attachment #539236 - Flags: checked-in+
We have a bit of a backlog of win64 talos & tests (currently 323) because the three active slaves are connected to buildbot-master05 and 06, which don't have win64 in the list of ACTIVE_PLATFORMS anymore. I'd shift them to test-master01, but I don't know how busy that is with the Tegras and don't want to destabilise that.
(In reply to comment #15) > We have a bit of a backlog of win64 talos & tests (currently 323) because > the three active slaves are connected to buildbot-master05 and 06, which > don't have win64 in the list of ACTIVE_PLATFORMS anymore. I'd shift them to > test-master01, but I don't know how busy that is with the Tegras and don't > want to destabilise that. I had lost track of where I read the comment. I dealt with these slaves yesterday by moving them to test-master01 for now.
Doesn't make sense to have BuildSlaves created that will never be used. This patch prevents creation of BuildSlaves for non-active platforms, with a bit of ugliness for current projects that rely on platforms being defined.
Attachment #540754 - Flags: review?(bhearsum)
Attachment #540766 - Flags: review?(dustin)
Attachment #540766 - Flags: review?(dustin) → review+
Attachment #540766 - Flags: checked-in+
Comment on attachment 540754 [details] [diff] [review] Remove slaves for non-active platforms Review of attachment 540754 [details] [diff] [review]: -----------------------------------------------------------------
Attachment #540754 - Flags: review?(bhearsum) → review+
Attachment #540754 - Flags: checked-in+
No longer blocks: 667734
No longer blocks: 663128
Splitting out just windows is good enough for now.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: