Closed Bug 570891 Opened 14 years ago Closed 14 years ago

Do not coalesce try server pushes

Categories

(Release Engineering :: General, defect)

x86
macOS
defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: ehsan.akhgari, Assigned: lsblakk)

References

Details

(Whiteboard: [tryserver])

Attachments

(3 files, 1 obsolete file)

Coalescing builds and unit test runs on normal branches makes a lot of sense, but it's a major inconvenience on the try server.  I've had this happen a lot of times when I pushed something to the try server and checked back in a couple of hours only to see that someone else pushed something less than than three minutes after me and they stole my builds and unit test runs.

Let's not do this on try!
Summary: Do not coalesce try server builds on the try server → Do not coalesce try server builds
Pushes within 3 minutes of each other ? Or some other time period ?
Summary: Do not coalesce try server builds → Do not coalesce try server pushes
We probably need to set treeStableTimer to 0 for try,
 http://hg.mozilla.org/build/buildbotcustom/file/default/misc.py#l519
(In reply to comment #1)
> Pushes within 3 minutes of each other ? Or some other time period ?

I'm not sure if it's 3 minutes.  I think you actually told me on irc once that it's set to 3 minutes.  :-)
Where these builds combined ?

bbirtles@mozilla.com
Tue Jun 08 17:42:57 2010 -0700	ef4c8b473e8c	Brian Birtles — Bug 492458 - SVG SMIL: Implement backwards seeking - Part 1 -interval and instance time filtering 
...
eakhgari@mozilla.com
Tue Jun 08 17:40:59 2010 -0700	3e5095185cea	Ehsan Akhgari — Bug 569709 - Figure out the max nr of entries we should store in the disk cache 

That's most recent push from you that I can find.
Well, actually it wasn't.  Tinderbox is so slow right now that I can't go back in history to find an example...
did a checkconfig on a local master instance to make sure this passes. Setting the treeStableTimer to None will make sure that each push creates its own change.
Assignee: nobody → lsblakk
Attachment #450466 - Flags: review?(nrthomas)
(In reply to comment #6)
> Ehsan is this an example of what you are hitting?

Yes, I guess so...
My patch doesn't address that talos issue though. generateTalosBranchObjects is merging those builds, which is strange.  Seems like two different issues though.
so this will handle the second issue - the merging of builds in talos.
Attachment #450485 - Flags: review?(anodelman)
Attachment #450485 - Flags: review?(anodelman) → review+
Comment on attachment 450466 [details] [diff] [review]
change no-merge scheduler to have None for treeStableTimer

>diff --git a/misc.py b/misc.py
>+        extra_args['treeStableTimer'] = None

I think this is going to give us a build for each changeset rather than push; we set 3 seconds on the old try. If testing proves me wrong then go ahead and ask for review from bhearsum/me as timezones dictate.
Attachment #450466 - Flags: review?(nrthomas)
You're right - None leads to a change per changeset, not per push.  Knocking this down to 3 seconds to match the way old try did it.
Attachment #450466 - Attachment is obsolete: true
Attachment #450763 - Flags: review?(nrthomas)
Attachment #450763 - Flags: review?(nrthomas) → review+
Comment on attachment 450763 [details] [diff] [review]
change no-merge scheduler to have 3 seconds for treeStableTimer

http://hg.mozilla.org/build/buildbotcustom/rev/ee4a6ca2e197
Attachment #450763 - Flags: checked-in+
Reconfig'd try master to pick up the new treeStableTimer.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
This just happened to me on a try push (867c3741e16d) - it got coalesced with the push right after it. Maybe this didn't get propagated to buildbot 0.8.0?
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
This was missed originally, and I landed a fix...but didn't update the masters :(

I've updated the master (pm01/scheduler_master), so this shouldn't happen any more.

Please reopen if that's not the case!
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Can we add an automated test for this? It hurt a lot :(
(In reply to comment #12)
> Created an attachment (id=450763) [details]
> change no-merge scheduler to have 3 seconds for treeStableTimer
> 
> You're right - None leads to a change per changeset, not per push.  Knocking
> this down to 3 seconds to match the way old try did it.

How were you getting a change per changeset?  Doesn't the hg poller only give us tips?

Just caught an instance of two tests arriving at the same time, so < 3 seconds, so merged.  Need to work harder on fixing this!
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
(In reply to comment #18)
> Can we add an automated test for this? It hurt a lot :(

I added a check that will e-mail us when it detects jobs that have != changes in them.
(In reply to comment #20)
> (In reply to comment #18)
> > Can we add an automated test for this? It hurt a lot :(
> 
> I added a check that will e-mail us when it detects jobs that have != changes
> in them.

I mean, != 1 change in them.
It also happened to me once after the patches landed in this bug.  My changeset c00bbd837385 was coalesced into the next changeset I pushed: d1653821d023.
(In reply to comment #22)
> It also happened to me once after the patches landed in this bug.  My changeset
> c00bbd837385 was coalesced into the next changeset I pushed: d1653821d023.

Those happened prior to me landing the changes on the master at ~19:00.
Lukas' test of treeStableTimer=None was done on buildbot 0.7.10, which didn't support treeStableTimer=None.  Scheduler.addChange was raising exceptions, and so the poller kept submitting the same change over and over since it wasn't saving its last seen revision.

I've tested it with buildbot 0.8.0 and it looks like it works fine.
Attachment #452293 - Flags: review?(lsblakk)
Attachment #452293 - Flags: review?(lsblakk) → review+
Comment on attachment 452293 [details] [diff] [review]
set treeStableTimer=None

773:d0f4f41ac108
Attachment #452293 - Flags: checked-in+
pm02/try-trunk-master and pm01/scheduler_master updated and reconfigured
I think this is fixed now.
Status: REOPENED → RESOLVED
Closed: 14 years ago14 years ago
Resolution: --- → FIXED
Product: mozilla.org → Release Engineering
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: