Closed Bug 1632391 Opened 4 years ago Closed 4 years ago

Intermittent Android raptor-perftest Critical: Process has been unexpectedly closed

Categories

(Testing :: Raptor, defect, P1)

Version 3
ARM64
Android
defect

Tracking

(firefox-esr68 unaffected, firefox75 unaffected, firefox76 unaffected, firefox77 fixed)

RESOLVED FIXED
mozilla77
Tracking Status
firefox-esr68 --- unaffected
firefox75 --- unaffected
firefox76 --- unaffected
firefox77 --- fixed

People

(Reporter: intermittent-bug-filer, Assigned: whimboo)

References

(Depends on 1 open bug, Regression)

Details

(Keywords: intermittent-failure, regression)

Attachments

(2 files)

Filed by: aiakab [at] mozilla.com
Parsed log: https://treeherder.mozilla.org/logviewer.html#?job_id=298959283&repo=autoland
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/YZ_--I88RriqAm8u8Irrdg/runs/0/artifacts/public/logs/live_backing.log


[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell am start -W -n org.mozilla.geckoview_example/org.mozilla.geckoview_example.GeckoViewActivity -a android.intent.action.MAIN -d about:blank, timeout: None, root: False, timedout: None, exitcode: 0, output: Starting: Intent { act=android.intent.action.MAIN dat=about:blank cmp=org.mozilla.geckoview_example/.GeckoViewActivity }
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - Status: ok
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - Activity: org.mozilla.geckoview_example/.GeckoViewActivity
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - ThisTime: 387
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - TotalTime: 387
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - WaitTime: 390
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - Complete
[task 2020-04-23T02:51:46.628Z] 02:51:44 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell pidof org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output: 9408
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: received webext_loaded: raptor runner.js is loaded!
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: received webext_status: test name is: raptor-tp6m-bbc-geckoview-cold
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: received webext_status: test settings url is: http://127.0.0.1:57525/json/raptor-tp6m-bbc-geckoview-cold.json
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: received webext_status: starting raptorRunner
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: reading test settings from json/raptor-tp6m-bbc-geckoview-cold.json
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: sent test settings to webext runner
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - raptor-control-server Info: received webext_status: * pausing 1 seconds to let browser settle... *
[task 2020-04-23T02:51:46.628Z] 02:51:45 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell pidof org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output: 9408
[task 2020-04-23T02:51:59.818Z] 02:51:46 INFO - raptor-control-server Info: received webext_status: running 1 pagecycles of https://www.bbc.com/news/business-47245877
[task 2020-04-23T02:51:59.818Z] 02:51:46 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell pidof org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output: 9408
[task 2020-04-23T02:51:59.818Z] 02:51:47 INFO - raptor-control-server Info: received webext_status: begin page cycle 1
[task 2020-04-23T02:51:59.818Z] 02:51:47 INFO - raptor-control-server Info: received webext_status: found active tab with id 10001
[task 2020-04-23T02:51:59.818Z] 02:51:47 INFO - raptor-control-server Info: received webext_status: update tab 10001 for https://www.bbc.com/news/business-47245877
[task 2020-04-23T02:51:59.818Z] 02:51:47 INFO - raptor-control-server Info: received webext_status: tab 10001 updated
[task 2020-04-23T02:51:59.818Z] 02:51:47 INFO - raptor-control-server Info: received webext_status: ended page cycle 1
[task 2020-04-23T02:51:59.818Z] 02:51:48 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell pidof org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output: 9408
[task 2020-04-23T02:51:59.818Z] 02:51:49 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell pidof org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output: 9408
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: received webext_status: results received
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: received webext_results: {u'expected_browser_cycles': 15, u'subtest_unit': u'ms', u'name': u'raptor-tp6m-bbc-geckoview-cold', u'lower_is_better': True, u'measurements': {u'dcf': [1484], u'fcp': [1134], u'fnbpaint': [1064], u'loadtime': [2222]}, u'browser_cycle': u'9', u'subtest_lower_is_better': True, u'cold': True, u'browser': u'GeckoView Example 77.0a1 20200423005435', u'type': u'pageload', u'page': u'https://www.bbc.com/news/business-47245877', u'unit': u'ms', u'alert_threshold': 2}
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - perftest-results-handler Info: received results in RaptorResultsHandler.add
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: received webext_status: Not closing last Tab: 10001
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: received request to shutdown the browser
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: shutting down android app org.mozilla.geckoview_example
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell am force-stop org.mozilla.geckoview_example, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb shell_bool: adb -s FA7A51A00163 wait-for-device shell test -d /sdcard/raptor/profile/minidumps, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell sync, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb shell_bool: adb -s FA7A51A00163 wait-for-device shell test -d /sdcard/raptor/profile/minidumps, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb command_output: adb -s FA7A51A00163 wait-for-device pull /sdcard/raptor/profile/minidumps /tmp/tmpWS3h7O/minidumps, timeout: None, timedout: None, exitcode: 0, output: /sdcard/raptor/profile/minidumps/: 0 files pulled, 0 skipped.
[task 2020-04-23T02:51:59.818Z] 02:51:50 CRITICAL - raptor-perftest Critical: Process has been unexpectedly closed
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-webext-android Info: removing reverse socket connections
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - adb command_output: adb -s FA7A51A00163 wait-for-device reverse --remove-all, timeout: None, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-webext-android Info: skipping check_for_crashes: application has not been launched
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-mitmproxy Info: Stopping mitmproxy playback, killing process 794
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-mitmproxy Info: Successfully killed the mitmproxy playback process
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-mitmproxy Info: Reading confidence values from: /builds/task_1587610071/workspace/build/blobber_upload_dir/mitm_netlocs_mitm4-pixel2-fennec-bbc.json
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - perftest-results-handler Info: RaptorResultsHandler.add_supporting_data received mozproxy-replay data
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-webext Info: removing webext /builds/task_1587610071/workspace/build/tests/raptor/raptor/webextension/../../webext/raptor
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-webext-android Info: removing test folder for raptor: /sdcard/raptor
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell sync, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell rm -r /sdcard/raptor, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - adb shell_output: adb -s FA7A51A00163 wait-for-device shell sync, timeout: None, root: False, timedout: None, exitcode: 0, output:
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - adb shell_bool: adb -s FA7A51A00163 wait-for-device shell test -e /sdcard/raptor, timeout: None, root: False, timedout: None, exitcode: 1, output:
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpPvMnW4
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpfr9Qiq
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpnXcaKe
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpfxEydg
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpb2q1cz
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpO7vFs0
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpM9qkt3
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpZJ4l2s
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-perftest Info: Removing temporary directory: /tmp/tmpZS5eIz
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-control-server Info: shutting down control server
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - raptor-webext Info: finished
[task 2020-04-23T02:51:59.818Z] 02:51:51 ERROR - Return code: 1
[task 2020-04-23T02:51:59.818Z] 02:51:51 WARNING - setting return code to 1
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Killing logcat pid 601.
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Copying Raptor results to upload dir:
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - /builds/task_1587610071/workspace/build/blobber_upload_dir/perfherder-data.json
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Copying raptor results from /builds/task_1587610071/workspace/build/raptor.json to /builds/task_1587610071/workspace/build/blobber_upload_dir/perfherder-data.json
[task 2020-04-23T02:51:59.818Z] 02:51:51 CRITICAL - Error copying results /builds/task_1587610071/workspace/build/raptor.json to upload dir /builds/task_1587610071/workspace/build/blobber_upload_dir/perfherder-data.json
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - [Errno 2] No such file or directory: u'/builds/task_1587610071/workspace/build/raptor.json'
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Running post-action listener: _package_coverage_data
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Running post-action listener: _resource_record_post_action
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Running post-action listener: process_java_coverage_data
[task 2020-04-23T02:51:59.818Z] 02:51:51 INFO - Running post-action listener: stop_device
[task 2020-04-23T02:51:59.818Z] 02:51:54 INFO - Killing logcat pid 601.

Summary: Intermittent raptor-perftest Critical: Process has been unexpectedly closed → Intermittent Android raptor-perftest Critical: Process has been unexpectedly closed

So this is actually an expected shutdown:

[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: received request to shutdown the browser
[task 2020-04-23T02:51:59.818Z] 02:51:50 INFO - raptor-control-server Info: shutting down android app org.mozilla.geckoview_example
...
[task 2020-04-23T02:51:59.818Z] 02:51:50 CRITICAL - raptor-perftest Critical: Process has been unexpectedly closed

And I assume it's a race given that the process check happens at the exact same time as the control server is shutting down the app. Means the _is_shutting_down flag might not have been recognized by the other thread yet.

This is a regression from bug 1630037, which I thought would have already been fixed, given that my try build was green for that other bug. But as it looks like it's only affecting P2 devices, which are faster.

I will have a look.

Assignee: nobody → hskupin
Status: NEW → ASSIGNED
Keywords: regression
Priority: P5 → P1
Regressed by: 1630037

While investigating I just found that, which is not something how timeout calculation should be done:
https://searchfox.org/mozilla-central/source/testing/raptor/raptor/webextension/base.py#102,113,117-118

I will fix it along the lines.

Sleeping a second and increasing the elapsed time by 1,
doesn't mean that exactly a second has been passed-by.

For a better timeout handling the end time has to be
calculated first, and then checked against.

Note that failures on this bug are all happening for Pixel2 for aarch64 only. I just pushed a new try build with some updates:

https://treeherder.mozilla.org/#/jobs?repo=try&revision=a3c097b405e5542b94d1d3a7310af9a633b997c1

OS: Unspecified → Android
Hardware: Unspecified → ARM64
Depends on: 1633975
Pushed by hskupin@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/64937ae65eeb
[raptor] Fix timeout handling when shutting down the application. r=perftest-reviewers,sparky
https://hg.mozilla.org/integration/autoland/rev/1fd9c358e893
[raptor] Give enough time to sync the shutdown flags. r=perftest-reviewers,sparky
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla77
Blocks: 1631344
Regressions: 1635053
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: