svg/animations/end-of-time-*-crash.html wpt tests run out of memory on linux CI
Categories
(Testing :: CI Configuration, defect)
Tracking
(firefox128 fixed, firefox129 fixed)
People
(Reporter: jcristau, Assigned: jcristau)
References
(Blocks 1 open bug)
Details
(Keywords: intermittent-failure)
Attachments
(2 files, 1 obsolete file)
phab-bot: approval-mozilla-beta+
The web-platform-tests-crashtest task on Linux CI appears to run out of memory frequently. On the old X11 / Ubuntu 18.04 workers, the task survives after a 45s timeout. On the newer Wayland / Ubuntu 22.04 workers, especially with a debug build, the worker itself seems to run out of memory more often than not.
https://profiler.firefox.com/from-url/https%3A%2F%2Ffirefox-ci-tc.services.mozilla.com%2Fapi%2Fqueue%2Fv1%2Ftask%2FCvcJevkOSbif9Fc73EUfGg%2Fruns%2F2%2Fartifacts%2Fpublic%2Ftest_info%2Fprofile_resource-usage.json/marker-chart/?globalTrackOrder=0&thread=0&timelineType=stack&v=10 is from one of the rare runs on Ubuntu 22.04 debug that did not die.
https://profiler.firefox.com/from-url/https%3A%2F%2Ffirefox-ci-tc.services.mozilla.com%2Fapi%2Fqueue%2Fv1%2Ftask%2FFg6CnG_zSDaYFtp_sErs8A%2Fruns%2F0%2Fartifacts%2Fpublic%2Ftest_info%2Fprofile_resource-usage.json/marker-chart/?globalTrackOrder=0&thread=0&timelineType=stack&v=10 is from a run on an Ubuntu 18.04 worker.
Should we use workers with more RAM, skip this test, change it so it's less memory-hungry, or something else?
Comment 1•1 year ago
Does more RAM mean a larger instance? Right now the Wayland machines are running on n2-standard-2, which is the same as what t-linux-large* runs on. So we are running on the same instance size; I am not sure whether the VM consumes more resources than docker, or whether 22.04/Wayland consumes more than 18.04/X11.
| Assignee | ||
Comment 2•1 year ago
They're both running out of memory, but the old worker seems to recover better than the new one.
Comment 3•1 year ago
If we are going to make 22.04 a real tier 1 platform, we need 2204-xlarge, so we should create that and give it a try.
Comment 6•1 year ago
Comment on attachment 9407981 [details]
Bug 1902979 - Don't dispatch SMIL events unless there are listeners r=smaug
Revision D214010 was moved to bug 1903214. Setting attachment 9407981 [details] to obsolete.
Comment 7•1 year ago
Bug 1903214 should help, since we'll no longer send any events in these testcases.
We could also reduce the number of runnables we create by keeping a queue of (target, event) pairs, with a single runnable per animation frame tick or some such. That runnable would then dispatch many events. Or something along those lines.
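To make the queueing idea concrete, here is a minimal sketch in Python rather than Gecko C++. Everything in it is hypothetical and for illustration only: `Target`, `BatchedDispatcher`, `queue_event` and `tick` are invented names, not Gecko APIs, and the actual patch may look quite different. It combines both ideas from this comment: events with no listeners are dropped up front, and all remaining (target, event) pairs share one flush per tick instead of one runnable each.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple

Listener = Callable[[str], None]


class Target:
    """Hypothetical event target; stands in for an animated element."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.listeners: Dict[str, List[Listener]] = {}

    def add_listener(self, event: str, callback: Listener) -> None:
        self.listeners.setdefault(event, []).append(callback)

    def dispatch(self, event: str) -> None:
        # Copy the list so listeners may add or remove listeners safely.
        for callback in list(self.listeners.get(event, [])):
            callback(event)


class BatchedDispatcher:
    """Queues (target, event) pairs and flushes them once per tick."""

    def __init__(self) -> None:
        self.pending: Deque[Tuple[Target, str]] = deque()
        self.flush_scheduled = False
        self.runnables_created = 0  # counts flush runnables, not events

    def queue_event(self, target: Target, event: str) -> None:
        # Skip the event entirely when nothing is listening for it.
        if not target.listeners.get(event):
            return
        self.pending.append((target, event))
        # Only the first queued event since the last tick needs a runnable.
        if not self.flush_scheduled:
            self.flush_scheduled = True
            self.runnables_created += 1

    def tick(self) -> int:
        """Run the single flush runnable; return the number of events sent."""
        self.flush_scheduled = False
        # Drain only what was queued before this tick; events queued by
        # listeners during dispatch wait for the next tick.
        count = len(self.pending)
        for _ in range(count):
            target, event = self.pending.popleft()
            target.dispatch(event)
        return count


if __name__ == "__main__":
    received: List[str] = []
    anim = Target("anim")
    anim.add_listener("repeatEvent", received.append)
    silent = Target("silent")  # no listeners, so its events are dropped

    dispatcher = BatchedDispatcher()
    for _ in range(1000):
        dispatcher.queue_event(anim, "repeatEvent")
        dispatcher.queue_event(silent, "repeatEvent")
    print(dispatcher.tick(), dispatcher.runnables_created)
```

In this sketch, 2000 queue requests result in one flush and 1000 dispatched events. The memory cost per pending event is one small tuple rather than a whole runnable object.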
| Assignee | ||
Comment 8•1 year ago
Looks like I misread the profiles yesterday. The Ubuntu 18.04 workers for wpt have 16GB of RAM, while the Wayland / 22.04 ones have only 8GB, which explains why they behave differently. The test still times out, but that's separate from killing the worker.
| Assignee | ||
Comment 9•1 year ago
This switches the wayland wpt tasks to run on xlarge workers, like the
corresponding x11 tasks.
Comment 11•1 year ago
This combined with bug 1903214 is looking great on autoland. Please request Beta approval when you get a chance.
| Assignee | ||
Comment 13•1 year ago
This switches the wayland wpt tasks to run on xlarge workers, like the
corresponding x11 tasks.
Original Revision: https://phabricator.services.mozilla.com/D214127
Comment 14•1 year ago
beta Uplift Approval Request
- User impact if declined: none
- Code covered by automated testing: yes
- Fix verified in Nightly: yes
- Needs manual QE test: no
- Steps to reproduce for manual QE testing: n/a
- Risk associated with taking this patch: low
- Explanation of risk level: test-only
- String changes made/needed: n/a
- Is Android affected?: no
Comment 15•1 year ago
Is this not being uplifted to beta? I saw the + and now the -.
Comment 16•1 year ago
See my comment in Phabricator.