Closed
Bug 840524
Opened 13 years ago
Closed 13 years ago
improve release runner failure e-mail
Categories
(Release Engineering :: Release Automation, defect)
Release Engineering
Release Automation
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: bhearsum, Assigned: bhearsum)
Details
(Whiteboard: [shipit])
Attachments
(1 file, 1 obsolete file)
1.17 KB,
patch
|
rail
:
review+
bhearsum
:
checked-in+
|
Details | Diff | Splinter Review |
1) Should include part or all of the log.
2) Should tell somewhere exactly where to find the full log on the machine it ran on.
#2 needs to include either - which credentials to use, or (better, imo), cltbld should have read access to the logs
Oh, or even better - iirc, the supervisord web UI provides access to some logs. That would allow the possibility of an eventual tunnel for non-build-vpn release runners.
Assignee | ||
Updated•13 years ago
|
Whiteboard: [kickoff]
Assignee | ||
Updated•13 years ago
|
Whiteboard: [kickoff] → [shipit]
Assignee | ||
Comment 3•13 years ago
|
||
This caused delay during 19.0b1 - we really need to fix it soon.
![]() |
||
Comment 4•13 years ago
|
||
I received this warning email from shipit:
'A buildbot master reconfig started at Wed, 20 Feb 2013 10:02:13 PST has been running for
0:10:00.382857 without completing.'
Can we enhance this email to include the name of the buildbot master whose reconfig is stuck?
Assignee | ||
Comment 5•13 years ago
|
||
(In reply to Hal Wine [:hwine] from comment #1)
> cltbld should have read access to the logs
100% agreed. These aren't puppetized yet (bug 836289) so I went ahead and did this by hand:
[cltbld@buildbot-master36 ~]$ tail -F /var/log/supervisor/release-runner.log
2013-02-22 13:45:14,910 - DEBUG - Sleeping for 30 seconds before polling again
2013-02-22 13:45:44,909 - DEBUG - Fetching release requests
2013-02-22 13:45:45,050 - INFO - No new releases: {'releases': []}
2013-02-22 13:45:45,050 - DEBUG - Sleeping for 30 seconds before polling again
2013-02-22 13:46:15,051 - DEBUG - Fetching release requests
2013-02-22 13:46:15,175 - INFO - No new releases: {'releases': []}
2013-02-22 13:46:15,175 - DEBUG - Sleeping for 30 seconds before polling again
2013-02-22 13:46:45,175 - DEBUG - Fetching release requests
2013-02-22 13:46:45,298 - INFO - No new releases: {'releases': []}
2013-02-22 13:46:45,298 - DEBUG - Sleeping for 30 seconds before polling again
Assignee | ||
Comment 6•13 years ago
|
||
Completely untested.
![]() |
||
Comment 7•13 years ago
|
||
Comment on attachment 717314 [details] [diff] [review]
send some output with the log
I think this is the easiest way to go.
As a possible another way to go would be sending CRITICAL and ERROR log messages via email.
Attachment #717314 -
Flags: feedback?(rail) → feedback+
Assignee | ||
Comment 8•13 years ago
|
||
(In reply to Rail Aliiev [:rail] from comment #7)
> Comment on attachment 717314 [details] [diff] [review]
> send some output with the log
>
> I think this is the easiest way to go.
>
> As a possible another way to go would be sending CRITICAL and ERROR log
> messages via email.
Hmm, I don't think this will catch all errors? Eg, uncaught exceptions.
Here's a tested version of my patch which seems to work.
Attachment #717314 -
Attachment is obsolete: true
Attachment #718018 -
Flags: review?(rail)
![]() |
||
Updated•13 years ago
|
Attachment #718018 -
Flags: review?(rail) → review+
Assignee | ||
Updated•13 years ago
|
Attachment #718018 -
Flags: checked-in+
Assignee | ||
Comment 9•13 years ago
|
||
I updated the checkout on bm36 and restarted release runner is supervisord.
(In reply to Hal Wine [:hwine] from comment #2)
> Oh, or even better - iirc, the supervisord web UI provides access to some
> logs. That would allow the possibility of an eventual tunnel for
> non-build-vpn release runners.
I think this is explicitly a separate bug. I don't feel strongly about it. If you do, please file it.
(In reply to Hal Wine [:hwine] from comment #1)
> #2 needs to include either - which credentials to use, or (better, imo),
> cltbld should have read access to the logs
cltbld has access to the logs.
(In reply to John Hopkins (:jhopkins) from comment #4)
> I received this warning email from shipit:
>
> 'A buildbot master reconfig started at Wed, 20 Feb 2013 10:02:13 PST has
> been running for
> 0:10:00.382857 without completing.'
>
> Can we enhance this email to include the name of the buildbot master whose
> reconfig is stuck?
This is also a separate bug IMO. I filed it in bug 847380.
Status: ASSIGNED → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Updated•12 years ago
|
Product: mozilla.org → Release Engineering
You need to log in
before you can comment on or make changes to this bug.
Description
•