Closed Bug 1256439 Opened 9 years ago Closed 9 years ago

Marionette slowdown (runs twice that long) since implicit waits have been landed

Tracking

(firefox46 unaffected, firefox47 fixed, firefox48 fixed)

Status:

RESOLVED FIXED

Milestone:

mozilla48

Tracking Flags:

Tracking

Status

firefox46

---

unaffected

firefox47

---

fixed

firefox48

---

fixed

People

(Reporter: whimboo, Assigned: whimboo)

References

Details

(Keywords: perf, regression)

Attachments

(1 file)

MozReview Request: Bug 1256439 - Run implicit wait callback once before start polling. r?automatedtester 9 years ago Henrik Skupin [:whimboo][⌚️UTC+1] 58 bytes, text/x-review-board-request	automatedtester : review+	Details

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Description

•

9 years ago

We had bug 1244837 which fixed a massive slowdown for Firefox 47.0a1 and 46.0a2. But in the last days I can still see that Marionette runs about twice that long for me on my local machine and also all taskcluster builds. We are really wasting lots of machine power here. I wish we would have a perf test for comparison that we no longer cause those regressions. I will have a look at mozregression now.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Updated

•

9 years ago

status-firefox46: --- → unaffected

status-firefox47: --- → affected

status-firefox48: --- → affected

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 1

•

9 years ago

Here a comparison for the same kind of test: SUMMARY (46.0a2) ------- 0:18.47 LOG: MainThread INFO passed: 10 0:18.47 LOG: MainThread INFO failed: 0 0:18.47 LOG: MainThread INFO todo: 0 0:18.52 SUITE_END: MainThread SUMMARY (47.0a2) ------- 0:55.06 LOG: MainThread INFO passed: 10 0:55.06 LOG: MainThread INFO failed: 0 0:55.06 LOG: MainThread INFO todo: 0 0:55.10 SUITE_END: MainThread

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 2

•

9 years ago

Something in that pushlog (retrieved via nightly tester tools) should have caused it: https://hg.mozilla.org/releases/mozilla-aurora/pushloghtml?fromchange=383d2c72f4c77a81d44704272dcfbbe3c8cafca8&tochange=0b81120cfc00bfc33091dab4340b2e109a0964c1

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 3

•

9 years ago

Regression started with this merge from inbound to mozilla-central: https://hg.mozilla.org/mozilla-central/pushloghtml?fromchange=ad029c4e30b017efb2a255294b4528071d951430&tochange=c1e0d1890cfee9d86c8d566b0490053f21e0afc6 Testing changesets (aa5e5a12ce63 and 52b85b8b8c14) around the patch landing date of bug 1250102 gave me the real regression range.

Severity: normal → major

Flags: needinfo?(dburns)

Flags: needinfo?(ato)

Keywords: regressionwindow-wanted → perf

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 4

•

9 years ago

Maybe the reason is the following default interval? > function implicitlyWaitFor(func, timeout, interval = 100) { This could definitely slow down everything.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 5

•

9 years ago

Ok, that's definitely the problem here: https://dxr.mozilla.org/mozilla-central/source/testing/marionette/element.js#668 Shortening the interval to 10ms makes it faster so all in all it takes about 20s instead of 55s in my local build.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Updated

•

9 years ago

Blocks: 1250102

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 6

•

9 years ago

Using 0s for the interval is even worse the latest Firefox beta: 16s vs 9s

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 7

•

9 years ago

Hm. Just a side-note for this method. Do we leak a timer instance given that we only cancel it when resolving the promise?

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 8

•

9 years ago

The underlying issue here is the initial wait time of 100ms before the function gets evaluated the first time. I do not see why this is necessary. Instead we should do the first evaluation immediately and only wait if the callback fails. This will restore the former performance of Marionette and leaves all tests as passing.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 9

•

9 years ago

I will take it.

Assignee: nobody → hskupin

Status: NEW → ASSIGNED

Flags: needinfo?(dburns)

Flags: needinfo?(ato)

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 10

•

9 years ago

Here a measure of our firefox-ui-functional tests on my local machine with and without the upcoming fix. Without fix: 5:32.17 LOG: MainThread INFO passed: 92 5:32.17 LOG: MainThread INFO failed: 0 5:32.17 LOG: MainThread INFO todo: 1 5:32.21 SUITE_END: MainThread With fix: 2:44.97 LOG: MainThread INFO passed: 92 2:44.97 LOG: MainThread INFO failed: 0 2:44.97 LOG: MainThread INFO todo: 1 2:44.99 SUITE_END: MainThread Those are 332s vs. 165s. So we can indeed speed-up Marionette by a factor of 2x. I'm wrestling through exiting Marionette unit tests now given that a lot of them fail on my machine.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 11

•

9 years ago

Attached file MozReview Request: Bug 1256439 - Run implicit wait callback once before start polling. r?automatedtester — Details

As of now the implicit wait method waits the default interval before doing the first check against the passed in callback. This is not necessary and it can be changed for an immediate check. If that one fails we can still enter the timer driven loop and continue to wait. Review commit: https://reviewboard.mozilla.org/r/40093/diff/#index_header See other reviews: https://reviewboard.mozilla.org/r/40093/

Attachment #8730697 - Flags: review?(dburns)

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Updated

•

9 years ago

Summary: Marionette slowdown (runs twice that long) since Firefox 47.0 → Marionette slowdown (runs twice that long) since implicit waits have been landed

Andreas Tolfsen ❲:ato❳

Comment 12

•

9 years ago

https://reviewboard.mozilla.org/r/40093/#review36585 This change looks good to me, but can you please rename the commit message to be more descriptive? Instead of explaining what it should not do, try to explain the change. E.g. “Run implicit wait function once before polling” or something similar, and then explain what it’s doing wrong in the commit message.

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 13

•

9 years ago

Comment on attachment 8730697 [details] MozReview Request: Bug 1256439 - Run implicit wait callback once before start polling. r?automatedtester Review request updated; see interdiff: https://reviewboard.mozilla.org/r/40093/diff/1-2/

Attachment #8730697 - Attachment description: MozReview Request: Bug 1256439 - Implicit wait should not wait an interval before doing the first check. r?automatedtester → MozReview Request: Bug 1256439 - Run implicit wait callback once before start polling. r?automatedtester

David Burns :automatedtester

Comment 14

•

9 years ago

Comment on attachment 8730697 [details] MozReview Request: Bug 1256439 - Run implicit wait callback once before start polling. r?automatedtester https://reviewboard.mozilla.org/r/40093/#review36743

Attachment #8730697 - Flags: review?(dburns) → review+

Pulsebot

Comment 15

•

9 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/7314496982c0

Carsten Book [:Tomcat]

Comment 16

•

9 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/7314496982c0

Status: ASSIGNED → RESOLVED

Closed: 9 years ago

status-firefox48: affected → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla48

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Comment 17

•

9 years ago

Test-only change which fixes a regression in Marionette which is also necessary on current aurora.

Whiteboard: [checkin-needed-aurora]

David Burns :automatedtester

Comment 18

•

9 years ago

This has already been uplifted to Aurora https://hg.mozilla.org/releases/mozilla-aurora/rev/0a61b7f086d7

Whiteboard: [checkin-needed-aurora]

Henrik Skupin [:whimboo][⌚️UTC+1]

Assignee

Updated

•

9 years ago

status-firefox47: affected → fixed

BMO Automation

Updated

•

3 years ago

Product: Testing → Remote Protocol

You need to log in before you can comment on or make changes to this bug.