Closed
Bug 1363890
Opened 7 years ago
Closed 7 years ago
Many command queue alerts from large code pushes
Categories
(Release Engineering :: General, defect)
Release Engineering
General
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: nthomas, Assigned: nthomas)
References
Details
Attachments
(1 file)
879 bytes,
patch
|
catlee
:
review+
catlee
:
checked-in+
|
Details | Diff | Splinter Review |
Lots of alerts like <nagios-releng> Wed 14:54:57 PDT [4231] [] buildbot-master77.bb.releng.use1.mozilla.com:Command Queue is CRITICAL: 2 dead items recently, which turn out to be timeouts after 2 minutes trying to add large changes to the statusdb, eg 2017-05-10 14:00:14,579 - INSERT INTO sourcestamps (branch, revision, patch_id) VALUES (%s, %s, %s) 2017-05-10 14:00:14,579 - ('integration/autoland', 'eb62dc9d8524742ec288004e12d6380f1535c031', None) 2017-05-10 14:00:14,655 - INSERT INTO changes (number, branch, revision, who, comments, `when`) VALUES (%s, %s, %s, %s, %s, %s) 2017-05-10 14:00:14,655 - (9227481L, 'integration/autoland', 'eb62dc9d8524742ec288004e12d6380f1535c031', 'cbook@mozilla.com', 'Merge mozilla-central to autoland', datetime.datetime(2017, 5, 10, 13, 36, 40))2017-05-10 14:00:14,978 - INSERT INTO file_changes (file_id, change_id) VALUES (%s, %s) 2017-05-10 14:00:14,978 - ((3840659L, 6514173L), (979845L, 6514173L), (712747L, 6514173L), (1945983L, 6514173L), (1946189L, 6514173L)... A work around is to just increase the -m argument in /etc/init.d/command_runner (from 60 to 600), so it has time to finish once, then subsequent jobs for the change are quick. We can also look at if anything uses the file lists on changes; if not we can stop inserting them.
Assignee | ||
Comment 1•7 years ago
|
||
Workaround/stop gap solution to avoid manual work recovering dead queues.
Attachment #8866544 -
Flags: review?(catlee)
Updated•7 years ago
|
Attachment #8866544 -
Flags: review?(catlee) → review+
Comment 2•7 years ago
|
||
https://hg.mozilla.org/build/puppet/rev/b508b69c55a82f2d10590597797188cfe558b885 Bug 1363890: Increase command queue timeout to 10 minutes. r=catlee
Updated•7 years ago
|
Attachment #8866544 -
Flags: checked-in+
Updated•7 years ago
|
Status: NEW → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Updated•6 years ago
|
Component: General Automation → General
You need to log in
before you can comment on or make changes to this bug.
Description
•