1493907 - Run Wd tests in headless mode

Reporter

Description

•

7 years ago

The WPT wdspec test type tests Firefox’ WebDriver (geckodriver + Marionette) implementation exhaustively. This would be a good test suite to run headlessly to discover problems with headless mode and prevent it from regressing.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 1

•

7 years ago

Also lots of people use Selenium and geckodriver with Firefox running in headless nowadays. So it would be very helpful to see any kind if regression as early as possible.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 2

•

7 years ago

I'm trying to enable headless mode by using `build-projects` in the following line: https://searchfox.org/mozilla-central/source/taskcluster/ci/test/web-platform.yml#155 But sadly `mach try fuzzy` doesn't recognize that change, so that I'm not able to push to try. Andrew, any idea what's wrong here?

Flags: needinfo?(ahal)

Andrew Halberstadt [:ahal]

Comment 3

•

7 years ago

Fuzzy's default is all tasks that run on mozilla-central. If you want to schedule a task that doesn't run on mozilla-central you need to pass --full.

Flags: needinfo?(ahal)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 4

•

7 years ago

Attached patch Run Wd tests in headless mode — Details — Splinter Review

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

7 years ago

Assignee: nobody → hskupin

Status: NEW → ASSIGNED

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 5

•

7 years ago

(In reply to Andrew Halberstadt [:ahal] from comment #3) > Fuzzy's default is all tasks that run on mozilla-central. If you want to > schedule a task that doesn't run on mozilla-central you need to pass --full. No, this doesn't help for the attached patch. It still doesn't list headless. Maybe using `built-projects` isn't correct here? Or do you see if something else could cause it?

Assignee: hskupin → nobody

Status: ASSIGNED → NEW

Flags: needinfo?(ahal)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

7 years ago

Assignee: nobody → hskupin

Status: NEW → ASSIGNED

Component: web-platform-tests → geckodriver

Andrew Halberstadt [:ahal]

Comment 6

•

7 years ago

If you don't see your task with |mach try fuzzy --full|, then it's not being generated by the taskgraph module. To verify you can run: $ ./mach taskgraph full | grep headless The full taskgraph is pre target task filtering, so run-on-projects shouldn't have any affect on what shows up in the full taskgraph.

Flags: needinfo?(ahal)

Andrew Halberstadt [:ahal]

Comment 7

•

7 years ago

I think you need to add 'web-platform-tests-wdspec-headless' to test-platforms.yml, and that's why it's not showing up.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

7 years ago

Depends on: 1370636

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 8

•

7 years ago

(In reply to Andrew Halberstadt [:ahal] from comment #7) > I think you need to add 'web-platform-tests-wdspec-headless' to > test-platforms.yml, and that's why it's not showing up. Oh, right. But this is only once source, and will work for Linux. For Mac and Windows I have to update the appropriate sets in test-sets.yml.

Priority: -- → P1

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 9

•

7 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=3a108f137a16121b213050a54074bb7cfa72caf4

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

7 years ago

Depends on: 1496409

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 10

•

7 years ago

As expected we see lots of hangs in that try push when minimizing and fullscreen' a window. Lets wait for the patch on bug 1492499 to be landed before continuing on this bug.

Depends on: 1492499

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 11

•

6 years ago

Now that bug 1492499 has been fixed I pushed another try build: https://treeherder.mozilla.org/#/jobs?repo=try&revision=d298689820576f510ab7ba49cbc0808aefe96707

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 12

•

6 years ago

Most of the window manipulation tests fail with failures like: > /webdriver/tests/maximize_window/maximize.py | test_restore_the_window - assert False That is actually exactly what we also see on wpt.fyi for all those commands. I assume that they only run in headless mode, and as such we haven't seen it yet ourselves.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 13

•

6 years ago

This failure is actually for `document.hidden`: > 05:54:34 INFO - > assert document_hidden(session) > 05:54:34 INFO - E assert False I will file a new bug for that particular issue.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 14

•

6 years ago

(In reply to Henrik Skupin (:whimboo) from comment #13) > This failure is actually for `document.hidden`: > > > 05:54:34 INFO - > assert document_hidden(session) > > 05:54:34 INFO - E assert False > > I will file a new bug for that particular issue. Actually as discussed in the WebDriver meeting yesterday, Andreas will go ahead and file that issue.

Flags: needinfo?(ato)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 15

•

6 years ago

Andreas, can you please follow-up on it? Thanks!

Andreas Tolfsen ❲:ato❳

Reporter

Comment 16

•

6 years ago

Filed https://bugzilla.mozilla.org/show_bug.cgi?id=1510305 about document.hidden in headless mode. I wonder if we should not just go ahead and enable WPT WebDriver tests in headless mode for the time being, with the failing tests marked as expected to fail?

Flags: needinfo?(ato)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 17

•

6 years ago

Is there a way to automatically generate the manifest files? Given the amount of failing tests I don't want to do it manually.

Andreas Tolfsen ❲:ato❳

Reporter

Comment 18

•

6 years ago

I know there’s a way. jgraham, is it documented anywhere?

Flags: needinfo?(james)

Andreas Tolfsen ❲:ato❳

Reporter

Comment 19

•

6 years ago

For context: What is being requested here is to take the test failure log from TC and pass it into "./mach wpt" to have it update the expected results, so that we can ignore the failing Wd tests in headless mode.

James Graham [:jgraham]

Comment 20

•

6 years ago

It's not fully documented, and possibly doesn't work out of the box with headless. You want a try run with both passing and failing examples (so the code knows what the condition is causing the fail. But in this case that won't work because we don't by-default use headless mode as a criterion. So you could either add "headless" to both lists at [1] or just use the failing logs and then use sed or whatever to update the generated criterion) To fetch the logs I have a tool [2] which you can install on the path and then fetchlogs try <sha1> --log-type wptreport --out-dir logs Once you have the logs then run ./mach wpt-udpate /path/to/logs/* [1] https://searchfox.org/mozilla-central/source/testing/web-platform/tests/tools/wptrunner/wptrunner/browsers/firefox.py#159 [2] https://github.com/jgraham/fetchlogs

Flags: needinfo?(james)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 21

•

6 years ago

Sounds like lot of work. Given that there are more important things on my plate, this will have to wait until the dependencies have been fixed, or someone else takes it.

Assignee: hskupin → nobody

Status: ASSIGNED → NEW

Priority: P1 → P2

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

6 years ago

Depends on: 1521179

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

6 years ago

Depends on: 1510305

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 22

•

6 years ago

Joel, what's our current situation with headless tests? I know that you disabled a lot of them, so I wonder if we still want to run those, and if yes, on which platforms. Maybe that helps us to get the wdspec ones landed easier.

Joel Maher ( :jmaher ) (UTC -8) (PTO back normal Nov 17)

Comment 23

•

6 years ago

headless tests are only run to ensure compatibility of headless mode, not to run in parallel or to safe resources. If there is a future investment into headless mode to make it support more of our needs for tests, then we could look at running normal tests as tier-2 and headless as tier-1 taking advantage of the faster runtime and possibility of parallel execution.

Andreas Tolfsen ❲:ato❳

Reporter

Comment 24

•

6 years ago

Given those constraints, running Wd in headless makes sense as
users are relying on using headless WebDriver and we want to avoid
any regressions in that area.

I should also add that—although not intended to be—Wd is probably
the best regression test suite we have for headless mode, considering
its scope is to ensure all things related to browser automation
works.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

6 years ago

Blocks: 1560181

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 25

•

6 years ago

Andreas, we could try to get the headless tests added, and simply mark those tests as expected fail where we know those are failing due to broken behavior in Firefox. I hope that those shouldn't be that many affected tests.

Enabling headless is important before we can get started with bug 1560181.

I just pushed a try build:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=db4d725fb86b5294d0d24e05d64ea9271644098a

Brendan Dahl [:bdahl]

Comment 26

•

6 years ago

Try run is looking better with a patch:

https://treeherder.mozilla.org/#/jobs?repo=try&revision=4b90f83a778aa26f792f0f8d94478e8834a96511

Still have one more failure:
TEST-UNEXPECTED-FAIL | /webdriver/tests/get_window_rect/get.py | test_payload - AssertionError: assert {'height': 60...x': 0, 'y': 0} == {'height': 600...100, 'y': 100}

Andreas Tolfsen ❲:ato❳

Reporter

Comment 27

•

6 years ago

(In reply to Henrik Skupin (:whimboo) [⌚️UTC+2] from comment #25)

Andreas, we could try to get the headless tests added, and simply
mark those tests as expected fail where we know those are failing
due to broken behavior in Firefox. I hope that those shouldn't be
that many affected tests.

Perfect! I agree with that approach.

(In reply to Brendan Dahl [:bdahl] from comment #26)

Try run is looking better with a patch:

Lovely, thanks for pitching in!

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 28

•

6 years ago

Note, that I will wait a bit more before marking tests as expected fail. After talking to Brendan yesterday he has to do some more verification if his patch doesn't produce any regression (which did happen in the past). If all goes well, we might be able to run nearly all the tests, which is fantastic!

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 29

•

6 years ago

Brendan, please let me know if there is something I could help with. We would appreciate if we could get at least the recent patch landed.

Flags: needinfo?(bdahl)

Brendan Dahl [:bdahl]

Updated

•

6 years ago

Depends on: 1562025

Brendan Dahl [:bdahl]

Comment 30

•

6 years ago

Patch is up in bug 1562025

Flags: needinfo?(bdahl)

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 31

•

6 years ago

Wonderful. Thanks a lot again. I will have a look once it landed, what the remaining test failure is related to, and if it's even a bug in the test.

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

6 years ago

Depends on: 1563161

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 32

•

6 years ago

So I investigated the two remaining failing tests. The payload one for GetWindowRect was just poorly written. After refactoring it, it works fine. For the negative coordinates test I filed bug 1563161, and will mark the test as expected fail for now when run under headless.

Assignee: nobody → hskupin

Status: NEW → ASSIGNED

Priority: P2 → P1

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 33

•

6 years ago

https://treeherder.mozilla.org/#/jobs?repo=try&revision=45acc639bc511ee17634431c0196f84b5b1a1226

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 34

•

6 years ago

Attached file Bug 1493907 - [wptrunner] Expose headless flag for expected meta data. r=#webdriver — Details

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 35

•

6 years ago

Attached file Bug 1493907 - [wdspec] Refactor payload test for "Get Window Rect". r=#webdriver (obsolete) — Details

Depends on D36723

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 36

•

6 years ago

Attached file Bug 1493907 - [wdspec] Mark remaining failing tests as expected fail for headless mode. r=#webdriver — Details

Depends on D36724

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Comment 37

•

6 years ago

Attached file Bug 1493907 - [wdspec] Run Wdspec tests for shippable builds in headless mode on all platforms. r=#webdriver — Details

Depends on D36725

Treeherder Bug Filer

Updated

•

6 years ago

Depends on: 1563248

Treeherder Bug Filer

Updated

•

6 years ago

Depends on: 1563251

Phabricator Automation

Updated

•

6 years ago

Attachment #9075598 - Attachment description: Bug 1493907 - [wdspec] Mark "test_negative_x_y" for "Set Window Rect" as expected fail under headless. r=#webdriver → Bug 1493907 - [wdspec] Mark remaining failing tests as expected fail for headless mode. r=#webdriver

Phabricator Automation

Updated

•

6 years ago

Attachment #9075597 - Attachment is obsolete: true

Pulsebot

Comment 38

•

6 years ago

Pushed by hskupin@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/0609705a3472 [wptrunner] Expose headless flag for expected meta data. r=webdriver-reviewers,ato https://hg.mozilla.org/integration/autoland/rev/d62f57d8e0b7 [wdspec] Mark remaining failing tests as expected fail for headless mode. r=webdriver-reviewers,ato https://hg.mozilla.org/integration/autoland/rev/60ee55f4c31d [wdspec] Run Wdspec tests for shippable builds in headless mode on all platforms. r=webdriver-reviewers,ato

Oana Pop-Rus

Comment 39

•

6 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/0609705a3472
https://hg.mozilla.org/mozilla-central/rev/d62f57d8e0b7
https://hg.mozilla.org/mozilla-central/rev/60ee55f4c31d

Status: ASSIGNED → RESOLVED

Closed: 6 years ago

status-firefox69: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla69

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

6 years ago

Blocks: 1563516

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 40

•

6 years ago

Created web-platform-tests PR https://github.com/web-platform-tests/wpt/pull/17724 for changes under testing/web-platform/tests

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 41

•

6 years ago

Can't merge web-platform-tests PR due to failing upstream checks: Github PR https://github.com/web-platform-tests/wpt/pull/17724 * Taskcluster (pull_request) (https://tools.taskcluster.net/task-group-inspector/#/J_LoqpdWSx2zK8yQuIs_ZA)

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 42

•

6 years ago

Can't merge web-platform-tests PR due to failing upstream checks: Github PR https://github.com/web-platform-tests/wpt/pull/17724 * Taskcluster (pull_request) (https://tools.taskcluster.net/task-group-inspector/#/O_agcpZATxm7XQLlD8aHwA)

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 43

•

6 years ago

Can't merge web-platform-tests PR due to failing upstream checks: Github PR https://github.com/web-platform-tests/wpt/pull/17724 * Taskcluster (pull_request) (https://tools.taskcluster.net/task-group-inspector/#/J5slsTOUSbCXbun1x6pL_A)

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 44

•

6 years ago

Upstream PR was closed without merging

Web Platform Test Sync Bot [:wpt-sync] (Matrix: #interop:mozilla.org)

Comment 45

•

6 years ago

Upstream PR merged

Pulsebot

Comment 46

•

6 years ago

Pushed by wptsync@mozilla.com: https://hg.mozilla.org/integration/mozilla-inbound/rev/264ce2118077 [wpt PR 17724] - [Gecko Bug 1493907] [wptrunner] Expose headless flag for expected meta data., a=testonly

Raul Gurzau (:RaulG)

Comment 47

•

6 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/264ce2118077

Henrik Skupin [:whimboo][⌚️UTC+2]

Assignee

Updated

•

3 years ago

No longer depends on: 1563251

Run Wd tests in headless mode 7 years ago Henrik Skupin [:whimboo][⌚️UTC+2] 1.18 KB, patch		Details \| Diff \| Splinter Review
Bug 1493907 - [wptrunner] Expose headless flag for expected meta data. r=#webdriver 6 years ago Henrik Skupin [:whimboo][⌚️UTC+2] 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493907 - [wdspec] Refactor payload test for "Get Window Rect". r=#webdriver 6 years ago Henrik Skupin [:whimboo][⌚️UTC+2] 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493907 - [wdspec] Mark remaining failing tests as expected fail for headless mode. r=#webdriver 6 years ago Henrik Skupin [:whimboo][⌚️UTC+2] 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493907 - [wdspec] Run Wdspec tests for shippable builds in headless mode on all platforms. r=#webdriver 6 years ago Henrik Skupin [:whimboo][⌚️UTC+2] 47 bytes, text/x-phabricator-request		Details \| Review