Closed Bug 840524 Opened 13 years ago Closed 13 years ago

improve release runner failure e-mail

Categories

(Release Engineering :: Release Automation, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: bhearsum, Assigned: bhearsum)

Details

(Whiteboard: [shipit])

Attachments

(1 file, 1 obsolete file)

1) Should include part or all of the log. 2) Should tell somewhere exactly where to find the full log on the machine it ran on.
#2 needs to include either - which credentials to use, or (better, imo), cltbld should have read access to the logs
Oh, or even better - iirc, the supervisord web UI provides access to some logs. That would allow the possibility of an eventual tunnel for non-build-vpn release runners.
Whiteboard: [kickoff]
Whiteboard: [kickoff] → [shipit]
This caused delay during 19.0b1 - we really need to fix it soon.
I received this warning email from shipit: 'A buildbot master reconfig started at Wed, 20 Feb 2013 10:02:13 PST has been running for 0:10:00.382857 without completing.' Can we enhance this email to include the name of the buildbot master whose reconfig is stuck?
(In reply to Hal Wine [:hwine] from comment #1) > cltbld should have read access to the logs 100% agreed. These aren't puppetized yet (bug 836289) so I went ahead and did this by hand: [cltbld@buildbot-master36 ~]$ tail -F /var/log/supervisor/release-runner.log 2013-02-22 13:45:14,910 - DEBUG - Sleeping for 30 seconds before polling again 2013-02-22 13:45:44,909 - DEBUG - Fetching release requests 2013-02-22 13:45:45,050 - INFO - No new releases: {'releases': []} 2013-02-22 13:45:45,050 - DEBUG - Sleeping for 30 seconds before polling again 2013-02-22 13:46:15,051 - DEBUG - Fetching release requests 2013-02-22 13:46:15,175 - INFO - No new releases: {'releases': []} 2013-02-22 13:46:15,175 - DEBUG - Sleeping for 30 seconds before polling again 2013-02-22 13:46:45,175 - DEBUG - Fetching release requests 2013-02-22 13:46:45,298 - INFO - No new releases: {'releases': []} 2013-02-22 13:46:45,298 - DEBUG - Sleeping for 30 seconds before polling again
Attached patch send some output with the log (obsolete) — Splinter Review
Completely untested.
Assignee: nobody → bhearsum
Status: NEW → ASSIGNED
Attachment #717314 - Flags: feedback?(rail)
Comment on attachment 717314 [details] [diff] [review] send some output with the log I think this is the easiest way to go. As a possible another way to go would be sending CRITICAL and ERROR log messages via email.
Attachment #717314 - Flags: feedback?(rail) → feedback+
Attached patch send logsSplinter Review
(In reply to Rail Aliiev [:rail] from comment #7) > Comment on attachment 717314 [details] [diff] [review] > send some output with the log > > I think this is the easiest way to go. > > As a possible another way to go would be sending CRITICAL and ERROR log > messages via email. Hmm, I don't think this will catch all errors? Eg, uncaught exceptions. Here's a tested version of my patch which seems to work.
Attachment #717314 - Attachment is obsolete: true
Attachment #718018 - Flags: review?(rail)
Attachment #718018 - Flags: review?(rail) → review+
Attachment #718018 - Flags: checked-in+
I updated the checkout on bm36 and restarted release runner is supervisord. (In reply to Hal Wine [:hwine] from comment #2) > Oh, or even better - iirc, the supervisord web UI provides access to some > logs. That would allow the possibility of an eventual tunnel for > non-build-vpn release runners. I think this is explicitly a separate bug. I don't feel strongly about it. If you do, please file it. (In reply to Hal Wine [:hwine] from comment #1) > #2 needs to include either - which credentials to use, or (better, imo), > cltbld should have read access to the logs cltbld has access to the logs. (In reply to John Hopkins (:jhopkins) from comment #4) > I received this warning email from shipit: > > 'A buildbot master reconfig started at Wed, 20 Feb 2013 10:02:13 PST has > been running for > 0:10:00.382857 without completing.' > > Can we enhance this email to include the name of the buildbot master whose > reconfig is stuck? This is also a separate bug IMO. I filed it in bug 847380.
Status: ASSIGNED → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: