Closed
Bug 1198317
Opened 9 years ago
Closed 9 years ago
reduce the number of available b-2008-ix instances in TRY in order to force y-2008-spot instantiation
Categories
(Infrastructure & Operations :: RelOps: Puppet, task)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: grenade, Assigned: grenade)
References
Details
(Whiteboard: [windows][aws])
Attachments
(1 file)
5.29 KB,
image/png
|
Details |
disabled instances: b-2008-ix-0036 b-2008-ix-0039 b-2008-ix-0019 b-2008-ix-0038 b-2008-ix-0025 b-2008-ix-0054 b-2008-ix-0022 b-2008-ix-0058 b-2008-ix-0026 b-2008-ix-0059
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0036
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0039
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0019
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0038
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0022
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0025
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0026
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0059
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0054
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0058
Assignee | ||
Comment 1•9 years ago
|
||
disabled instances extended to include: b-2008-ix-0030 b-2008-ix-0174 b-2008-ix-0057 b-2008-ix-0023 b-2008-ix-0047 b-2008-ix-0046 b-2008-ix-0041 b-2008-ix-0044 b-2008-ix-0061 b-2008-ix-0055 b-2008-ix-0043 b-2008-ix-0049 b-2008-ix-0035 b-2008-ix-0029 b-2008-ix-0031 b-2008-ix-0062 b-2008-ix-0045
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0174
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0061
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0044
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0030
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0057
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0041
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0043
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0023
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0049
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0031
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0062
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0045
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0029
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0035
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0046
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0047
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0055
Assignee | ||
Comment 2•9 years ago
|
||
all machines returned to pool. will disable more tomorrow.
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0040
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0024
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0020
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0184
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0060
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0032
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0183
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0064
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0027
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0033
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0021
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0051
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0037
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0028
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0048
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0042
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0063
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0181
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0182
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0050
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0034
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0056
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0052
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0018
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0173
Assignee | ||
Comment 3•9 years ago
|
||
progress: - reduced ix capacity to a single instance (b-2008-ix-0043) - pushed win32, win64 m-c build to try (https://treeherder.mozilla.org/#/jobs?repo=try&revision=272cab1322fc) - observed messages in watch pending log indicating our max bid price (0.4) would not be successful - updated max bid price for y-2008 to 0.5 (https://github.com/mozilla/build-cloud-tools/pull/109) - observed successful spot requests in ec2 console (3 for use1, 3 for usw2, as expected/configured in slavealloc) - observed spot instances starting, successfully running userdata, naming themselves and mailing logs - now awaiting build output at https://ftp-ssl.mozilla.org/pub/mozilla.org/firefox/try-builds/rthijssen@mozilla.com-272cab1322fc
Assignee | ||
Comment 4•9 years ago
|
||
us-east-1: https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-001 https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-002 https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-003 us-west-2: https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-101 https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-102 https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-103
Assignee | ||
Comment 5•9 years ago
|
||
the us-east-1 instances appear to have hung mid build. rdp'ing to the instances (001 - 003) as cltbld shows this running but apparently going nowhere cmd prompt.
Attachment #8653370 -
Flags: feedback?(mcornmesser)
Assignee | ||
Comment 6•9 years ago
|
||
the us-west-2 instances have all terminated. I cannot find any evidence that they did any work before terminating (slave_health/treeherder). The PaperTrail logs end like this: Aug 27 02:09:48 y-2008-spot-101.try.releng.usw2.mozilla.com USER32: The process c:\windows\SysWOW64\shutdown.exe (Y-2008-SPOT-101) has initiated the shutdown of computer Y-2008-SPOT-101 on behalf of user Y-2008-SPOT-101\cltbld for the following reason: No title for this reason could be found Reason Code: 0x800000ff Shutdown Type: shutdown Comment: #015
Assignee | ||
Comment 7•9 years ago
|
||
I think we've demonstrated that the spinning up and terminating processes work. We obviously have work to do to get mozilla-build's undies untwisted, but that's another bug...
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Comment 8•9 years ago
|
||
There are alerts in #buildduty that indicate there's a buildbot misconfiguration/missing configuration: [sns alert] Thu 06:08:03 PDT buildbot-master78.bb.releng.usw2.mozilla.com watch_twistd_log.py: Count: 675 | First instance: 2015-08-27 05:28:27-0700 | Most recent instance: 2015-08-27 06:00:02-0700 | Twistd exception: twisted.cred.error.UnauthorizedLogin - unknown 10.132.67.67 [sns alert] Thu 06:08:03 PDT buildbot-master78.bb.releng.usw2.mozilla.com watch_twistd_log.py: Count: 681 | First instance: 2015-08-27 05:28:27-0700 | Most recent instance: 2015-08-27 06:00:01-0700 | Twistd exception: twisted.cred.error.UnauthorizedLogin - unknown 10.132.67.101 I've verified that those are windows spot instances 10.132.67.67 (y-2008-spot-103) and 10.132.67.101 (y-2008-spot-102)
Comment 9•9 years ago
|
||
All of the alerts were for use1 IPs, I didn't see any for usw2.
Assignee | ||
Updated•9 years ago
|
Attachment #8653370 -
Flags: feedback?(mcornmesser)
You need to log in
before you can comment on or make changes to this bug.
Description
•