Closed Bug 977032 Opened 10 years ago Closed 10 years ago

Integration Trees closed, high number of pending linux compile jobs about ±2 hour backlog

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task)

x86
All
task
Not set
major

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: cbook, Unassigned)

Details

seems the integration trees see a high number of pending linux build jobs and backlog of jobs is about 2 hours on m-i and fx-team. Closing integration trees for this
04:17 < mgerva> Tomcat|sheriffduty: there's a bug about  scl3 <-> usw2 link starting to degrade (Bug 975438)
04:18 < Tomcat|sheriffduty> mgerva: ok will comment there , thankx
Looks like maybe the spot instances are having problems puppetizing:

Feb 26 04:43:25 bld-linux64-ec2-005 puppet-agent[1268]: Could not request certificate: Error 400 on SERVER: this master is not a CA

the hostname mentioned in the log is not what FQDN is in spot_setup
trees reopened at 5:13 PST
Spot instances started since then seem to be working fine...maybe the error above is a red herring
Severity: blocker → major
Bug 975438 does not seem to have been related.
seems backlog is starting to build up again :/
The problem was due to DNS entries not being created for new spot instances. We've created those now, and are seeing good response times from the build farm now.
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard
You need to log in before you can comment on or make changes to this bug.