Closed Bug 1543369 Opened 6 years ago Closed 6 years ago

Raptor cold page-load tests on Firefox android fennec

Categories

(Testing :: Raptor, enhancement, P1)

Version 3
enhancement

Tracking

(firefox69 fixed)

RESOLVED FIXED
mozilla69
Tracking Status
firefox69 --- fixed

People

(Reporter: alexandrui, Assigned: alexandrui, Mentored)

References

Details

Attachments

(1 file, 2 obsolete files)

Add Raptor support to run the tp6m page-load test suites on the android fennec app in cold-load mode

Assignee: nobody → alexandru.ionescu
Priority: -- → P1

I don't think we should expand the cold page-load tests yet until we determine exactly what cold page-load process to use, i.e. see Bug 153794.

Depends on: 153794

(In reply to Robert Wood [:rwood] from comment #3)

I don't think we should expand the cold page-load tests yet until we determine exactly what cold page-load process to use, i.e. see Bug 153794.

Bug typo, see Bug 1537941.

Depends on: 1537941
No longer depends on: 153794

(In reply to Robert Wood [:rwood] from comment #4)

(In reply to Robert Wood [:rwood] from comment #3)

I don't think we should expand the cold page-load tests yet until we determine exactly what cold page-load process to use, i.e. see Bug 153794.

Bug typo, see Bug 1537941.

As per discussion on IRC we will go ahead with the current page-load implementation for now, as Bug 1537941 won't be resolved quickly.

No longer depends on: 1537941

We are going to land the Raptor test definitions and taskcluster configs for cold page-load on Fenix first (Bug 1545175), and then we will mirror those test definitions here for Fennec.

Depends on: 1545175
Blocks: 1518975
Depends on: 1544758
Attachment #9059463 - Attachment is obsolete: true

Note: Raptor support for cold page-load tests for Fennec already exists; this bug is for adding the actual Raptor test INIs/taskcluster configs to turn this on in production where desired.

Attachment #9059463 - Attachment is obsolete: false
Attachment #9057236 - Attachment description: Bug 1543369 Raptor cold page-load tests on Firefox android fennec → Bug 1543369 Patch for fixing fennec try builds. ONLY FOR GATHERING RESULTS. Can't be landed.

Rob, I am working on bug 1549669 which needs to run every warm/cold test from raptor apps (fennec, geckoview, refbrow, fenix) to make sure there's no more regression. How can we synchronize with the code? I assume you only have locally the tasks you picked up from me.

Flags: needinfo?(rwood)

(In reply to Alexandru Ionescu from comment #9)

Rob, I am working on bug 1549669 which needs to run every warm/cold test from raptor apps (fennec, geckoview, refbrow, fenix) to make sure there's no more regression. How can we synchronize with the code? I assume you only have locally the tasks you picked up from me.

Sorry Alex I don't understand your question... I'm not working on this (I did try pushes to get numbers on Fenix v64 if that's what you mean, but they were just try pushes and not anything landed).

If you need fennec configs to test out your fix in bug 1549669 then you could make the patch here for the fennec configs/tests, make a new bookmark, apply your fennec configs/tests patch from this bug, and also apply your fix in bug 1549669 and push that to try in one push?

Flags: needinfo?(rwood)
Attachment #9057236 - Attachment is obsolete: true

Pushed this try build https://treeherder.mozilla.org/#/jobs?repo=try&revision=f50524475413560319940172f95f2cb82dfdc220&selectedJob=246613944
tp6m-1-fennec-cold on G5: the test itself passed, but the job failed due to a timeout; see the very bottom of this log https://taskcluster-artifacts.net/falXyZgBTTqz35B3RHDKIQ/0/public/logs/live_backing.log

Same test, on P2 Aarch64:
15:01:37 INFO - raptor-main finished
15:01:37 ERROR - Traceback (most recent call last):
15:01:37 INFO - File "/builds/task_1557932368/workspace/build/tests/raptor/raptor/raptor.py", line 1219, in <module>
15:01:37 INFO - main()
15:01:37 INFO - File "/builds/task_1557932368/workspace/build/tests/raptor/raptor/raptor.py", line 1182, in main
15:01:37 INFO - raptor.run_test(next_test, timeout=int(next_test['page_timeout']))
15:01:37 INFO - File "/builds/task_1557932368/workspace/build/tests/raptor/raptor/raptor.py", line 962, in run_test
15:01:37 INFO - self.run_test_teardown()
15:01:37 INFO - File "/builds/task_1557932368/workspace/build/tests/raptor/raptor/raptor.py", line 181, in run_test_teardown
15:01:37 INFO - self.check_for_crashes()
15:01:37 INFO - File "/builds/task_1557932368/workspace/build/tests/raptor/raptor/raptor.py", line 1102, in check_for_crashes
15:01:37 INFO - remote_dir = posixpath.join(self.device_profile, 'minidumps')
15:01:37 INFO - AttributeError: 'RaptorAndroid' object has no attribute 'device_profile'
15:01:37 INFO - certutil: function failed: SEC_ERROR_BAD_DATABASE: security library: bad database.
15:01:37 ERROR - Return code: 1
15:01:37 WARNING - setting return code to 1

Flags: needinfo?(rwood)
Flags: needinfo?(dave.hunt)
Flags: needinfo?(bob)
Attachment #9059463 - Attachment is obsolete: true
Attachment #9059463 - Attachment is obsolete: false
Attachment #9059463 - Attachment is obsolete: true

On successful test runs I see 30 * pausing 30 seconds to let browser settle... * messages which is 15 minutes of waiting. If you are going to run so many iterations of the cold tests you need to adjust the max run time to account for the waiting.
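A rough sketch of that arithmetic, in case it helps when sizing the task limit. The function names are hypothetical and not part of Raptor; the 1800 and 2700 inputs are the default and suggested max-run-time values discussed elsewhere in this bug, used here only as example inputs.

```python
# Hypothetical helpers (not Raptor code): estimate how much of a task's
# max run time is consumed purely by the post-launch settle pauses.

def settle_wait_seconds(browser_cycles, settle_seconds=30):
    """Total seconds spent in 'pausing ... to let browser settle' waits."""
    return browser_cycles * settle_seconds


def remaining_budget(max_run_time, browser_cycles, settle_seconds=30):
    """Seconds left for everything else (setup, page loads, teardown)."""
    return max_run_time - settle_wait_seconds(browser_cycles, settle_seconds)


if __name__ == "__main__":
    # 30 cycles * 30 s = 900 s, i.e. 15 minutes of pure waiting.
    print(settle_wait_seconds(30))
    # Budget left under an 1800 s limit versus a 2700 s limit.
    print(remaining_budget(1800, 30))
    print(remaining_budget(2700, 30))
```

With a 1800 second limit, half of the task time goes to settle pauses alone, which is why the cold jobs run so close to the limit.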

Please, please: someone needs to take care of the test teardown so that it doesn't access properties that were never initialized, like device_profile, when the test fails prematurely before all of the properties are defined.
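One possible shape for such a guard, as a minimal sketch only. RaptorAndroidSketch is a hypothetical stand-in, not the real RaptorAndroid class, and the profile path in the usage lines is made up; only the check_for_crashes and device_profile names come from the traceback in this bug.

```python
import posixpath


class RaptorAndroidSketch:
    """Hypothetical stand-in showing only the teardown guard pattern."""

    def __init__(self):
        # device_profile is deliberately NOT set here, mimicking a run that
        # failed before the device profile was created.
        pass

    def check_for_crashes(self):
        # getattr with a default avoids the AttributeError when setup never
        # got far enough to define device_profile.
        device_profile = getattr(self, "device_profile", None)
        if device_profile is None:
            # Nothing to inspect: no profile means no minidumps directory.
            return None
        return posixpath.join(device_profile, "minidumps")


if __name__ == "__main__":
    early_failure = RaptorAndroidSketch()
    print(early_failure.check_for_crashes())  # teardown no longer raises

    normal_run = RaptorAndroidSketch()
    normal_run.device_profile = "/sdcard/raptor-profile"  # made-up path
    print(normal_run.check_for_crashes())
```

An alternative would be to initialize device_profile to None in the constructor, so teardown can test it directly.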

Comment 12 is a mitmproxy failure: Bug 1549209, Bug 1549882

Flags: needinfo?(bob)

Wow yeah so fennec is even slower, so the jobs are exceeding the max time we have set (not the page cycle timeout). We can't really reduce the browser cycles down b/c of data noise.

Since this is landing for try only (or is this supposed to be on central and try??), I'd say increasing the overall taskcluster time limit for these jobs is fine. If you look at the logs, in most cases they are very close to finishing the last browser cycle. So I'd suggest setting:

max-run-time: 2700

You'll need to do that for each job / in each raptor.yml section you're adding because we don't want to change the default (1800) at the top.

ps. If you get a 'Task aborted - max run time exceeded' error please don't retrigger the job, it will still get that same error in most cases so retriggers are not necessary.

Flags: needinfo?(rwood)

:rwood, this is supposed to be scheduled on CI actually, to get a comparison of the pageload fennec v64 vs fenix.

Flags: needinfo?(dave.hunt)
Pushed by aionescu@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/2f20a0d737e1 Raptor cold page-load tests on Firefox android fennec r=Bebe

Same as https://bugzilla.mozilla.org/show_bug.cgi?id=1546323#c8 for bing-restaurants. I only got to land this. I can file a bug for disabling this test.

Flags: needinfo?(rwood)

(In reply to Alexandru Ionescu from comment #18)

Same as https://bugzilla.mozilla.org/show_bug.cgi?id=1546323#c8 for bing-restaurants. I only got to land this. I can file a bug for disabling this test.

This bug is for fennec (latest) not fennec v64, so not sure what you mean here. If this is a serious intermittent (on latest fennec) then you may want to consider disabling this particular test - depends on the frequency, your call. Thanks!

Flags: needinfo?(rwood)

Oh, sorry, it's the latest. Must have mixed them up a bit.

Status: NEW → RESOLVED
Closed: 6 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla69