1493225 - GeckoSession.goBack() can be unresponsive during a heavy page load.

Reporter

Description

•

6 years ago

This issue seems to be exacerbated on devices running the Snapdragon 821 chip set such as the Oculus Go and Google Pixel 1. This issue was specifically called out by Oculus during the Oculus App Store review of Firefox Reality. The issue gives the appearance that GeckoView is unable to go back until a page load has completed. Many pages at https://sketchfab.com/ download a great deal of data and often display this behavior. New devices that use the Snapdragon 835 chip set such as the Pixel 2 do not seem to display this issue to such a degree and are much more responsive. However, the 835 is a much higher performance chip set.

Randall Barker [:rbarker]

Reporter

Updated

•

6 years ago

Whiteboard: [geckoview:fxr]

David Bolter [:davidb] (NeedInfo me for attention)

Comment 1

•

6 years ago

Randall, what is the priority of this bug?

Flags: needinfo?(rbarker)

Randall Barker [:rbarker]

Reporter

Comment 2

•

6 years ago

It really only affects the Oculus Go (which happens to be are largest user base). There are also a lot of phones out there running the 82X (including the insanely popular Galaxy S7 I believe). This seems like it might be a good candidate for a mobile flow analysis?

Flags: needinfo?(rbarker)

David Bolter [:davidb] (NeedInfo me for attention)

Comment 3

•

6 years ago

Reminder to try with e10s on again and report findings. We may need to raise priority.

Flags: needinfo?(rbarker)

Priority: -- → P2

Whiteboard: [geckoview:fxr] → [geckoview:fxr:p1]

Randall Barker [:rbarker]

Reporter

Comment 4

•

6 years ago

I captured profiles from Firefox Reality 1.0.1 (FxR) release running on an Oculus Go which uses the 821 Snapdragon chipset. All were captured while loading sketchfab.com and pressing the back button while the page was loading. These first two were captured with out e10s: https://perfht.ml/2zSR4Wp https://perfht.ml/2zS2vxE And these two were captured with e10s: https://perfht.ml/2zSkLH6 https://perfht.ml/2zTi6gj The back button press on the Oculus Browser loading the same page on the same hardware is almost instant while in FxR, the lag between pressing the back button and the browser actually going back can be noticeably long.

Flags: needinfo?(rbarker)

Chris Peterson [:cpeterson]

Comment 5

•

6 years ago

@ QF team: do you see any obvious problems in the profiles Randall captured in comment 4? On the Oculus Go, the main thread is so busy that it can't respond to the back button to exit the browser.

status-firefox63: --- → wontfix

status-geckoview62: --- → wontfix

Whiteboard: [geckoview:fxr:p1] → [geckoview:fxr:p1][qf]

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 6

•

6 years ago

hmm, so, JS takes lots of time. But we could also improve the situation by forcing back/forward from UI to use higher priority tasks, at least when e10s is used (I don't know what kind of setup we have without e10s). And, we could check pending reloads/back/forward/etc in JS's interrupt handler, and if there is something pending, cancel the current JS. Or wait for Fission, which could just use another process.

Olli Pettay [:smaug][bugs@pettay.fi]

Updated

•

6 years ago

Depends on: 1497626

Jean Gong :jgong

Updated

•

6 years ago

Whiteboard: [geckoview:fxr:p1][qf] → [geckoview:fxr:p1][qf:js:investigate]

Jean Gong :jgong

Comment 7

•

6 years ago

Steve, can you have the JS team do a quick triage on this JS investigate bug. Thanks!

Flags: needinfo?(sdetar)

Chris Peterson [:cpeterson]

Comment 8

•

6 years ago

(In reply to Olli Pettay [:smaug] from comment #6) > hmm, so, JS takes lots of time. But we could also improve the situation by > forcing back/forward from UI to use higher priority tasks, at least when > e10s is used (I don't know what kind of setup we have without e10s). > And, we could check pending reloads/back/forward/etc in JS's interrupt > handler, and if there is something pending, cancel the current JS. Randall says using a higher priority task for the back/forward events would require core Gecko changes. Would those changes be helpful (or at least harmless) on desktop?

Summary: GeckoSesion.goBack() can be unresponsive during a heavy page load. → GeckoSession.goBack() can be unresponsive during a heavy page load.

Jean Gong :jgong

Comment 9

•

6 years ago

Hello Kannan, Can you take a look at this bug based on the js:investigate WB tag?

Flags: needinfo?(kvijayan)

Steven DeTar [:sdetar]

Updated

•

6 years ago

Flags: needinfo?(sdetar)

Kannan Vijayan [:djvj]

Comment 10

•

6 years ago

(In reply to Olli Pettay [:smaug] (high review load) from comment #6) > hmm, so, JS takes lots of time. But we could also improve the situation by > forcing back/forward from UI to use higher priority tasks, at least when > e10s is used (I don't know what kind of setup we have without e10s). > And, we could check pending reloads/back/forward/etc in JS's interrupt > handler, and if there is something pending, cancel the current JS. > > Or wait for Fission, which could just use another process. Is it valid semantics to cancel the current running JS at all times? I'm not familiar enough with expected behaviour on the web. I took a look at the profile and the main issue just seems to be that a lot of JS is being run at startup. There are no easy fixes to this given the runtime of the JS - there are a handful of execution spans >12s here. Even getting that down by a significant percentage would leave a lot of lag in responding to the back button. This seems like a situation where a higher level fix is appropriate: either terminate the JS via interrupt when the back-button is clicked. I was wondering if it would be possible on 'back' to "abandon" the current page to a ghost (hidden) window that terminates gracefully, while the previous page is loaded into a new container instantly. I don't have a great feel for which is more feasible / doable. It seems like the interrupt-JS is the simplest fix. > Randall says using a higher priority task for the back/forward events would require core Gecko changes. Would those changes be helpful (or at least harmless) on desktop? This shouldn't be harmful on desktop, and if anything should improve responsiveness there as well (just to a smaller degree). Randall: if I understand correctly, changing the priority of the back/forward task should improve things, but for those long-running JS slices it will still delay the back until they finish execution, right?

Flags: needinfo?(rjesup)

Flags: needinfo?(kvijayan)

Flags: needinfo?(bugs)

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 11

•

6 years ago

(In reply to Kannan Vijayan [:djvj] from comment #10) > Is it valid semantics to cancel the current running JS at all times? I'm > not familiar enough with expected behaviour on the web. well, we can "kill" a web page at any time, the same way as interrupt-js handler already does. > I was wondering if it would be possible on 'back' to "abandon" the current > page to a ghost (hidden) window that terminates gracefully, while the > previous page is loaded into a new container instantly. how is this different to interrupting JS approach? I would expect interrupted page to not enter bfcache, so it would be destroyed asap a new page is loaded. > Randall: if I understand correctly, changing the priority of the > back/forward task should improve things, but for those long-running JS > slices it will still delay the back until they finish execution, right? No. If we'd do IPC layer high priority task for back/forward, we could have a flag telling to js engine that when its interrupt handler runs, it would just stop running.

Flags: needinfo?(bugs)

Nancy Hang

Comment 12

•

6 years ago

Corresponding GH issue: https://github.com/MozillaReality/FirefoxReality/issues/682

Randell Jesup [:jesup] (needinfo me)

Comment 13

•

6 years ago

Randell != Randall I'm afraid... I think kannan meant to NI rbarker (Randall)

Flags: needinfo?(rjesup) → needinfo?(rbarker)

Randell Jesup [:jesup] (needinfo me)

Comment 14

•

6 years ago

I believe if you do this, you'll have to dump the page from the bfcache, so on Forward it would be a reload (and potentially lost state, though we're not required to hold state there). It would also bypass onunload handlers, etc. We'd also need to cancel running JIT compiles/etc, of course. Since we would have interrupted things in mid-execution, the state of the page/DOM/etc and resources touched by it could be in inconsistent/"Impossible" states.

Chris Peterson [:cpeterson]

Comment 15

•

6 years ago

64=wontfix because FxR 1.1 is using GV 65 and this issue doesn't block Focus 8.0 from using GV 64.

status-firefox64: affected → wontfix

Chris Peterson [:cpeterson]

Comment 16

•

6 years ago

Andreas, this bug will be a big problem for responsiveness. Is this problem reproducible on the Moto G5? rbarker says e10s doesn't help on FxR.

Flags: needinfo?(abovens)

Priority: P2 → P1

Olli Pettay [:smaug][bugs@pettay.fi]

Comment 17

•

6 years ago

Pasting my email about this here. "It is not really high prio IPC task (in vsync sense), but checking whether we have pending "goBack". Similar to what billm did in https://bugzilla.mozilla.org/show_bug.cgi?id=1279086, but simpler and safer, I'd say, since we'd just cancel the current JS execution so that we could handle pending tasks. https://bug1279086.bmoattachments.org/attachment.cgi?id=8773987 modified PProcessHangMonitor.ipdl. Not sure who has time, but this shouldn't be too hard. Some new message similar to PaintWhileInterruptingJS. Perhaps "CancelContentJSExecutionIfRunning()" and then using high prio messages for goBack/Forward in PBrowser. Something like that. Would need to test a bit different setups. Like, do we need to ignore beforeunload and unload and pagehide event listeners. And bfcache should be disabled. "

WIP patch 6 years ago Jim Porter (:squib) 7.44 KB, patch	smaug : feedback+	Details \| Diff \| Splinter Review
WIP patch v2 6 years ago Jim Porter (:squib) 7.77 KB, patch	smaug : feedback+ tcampbell : feedback+	Details \| Diff \| Splinter Review
Mostly-working patch v3 6 years ago Jim Porter (:squib) 11.76 KB, patch		Details \| Diff \| Splinter Review
Cleaned up patch v4 6 years ago Jim Porter (:squib) 11.74 KB, patch		Details \| Diff \| Splinter Review
Patch v5 6 years ago Jim Porter (:squib) 11.91 KB, patch	smaug : feedback+	Details \| Diff \| Splinter Review
Patch v6: disable BF cache and add a pref to disable the patch 6 years ago Jim Porter (:squib) 12.68 KB, patch		Details \| Diff \| Splinter Review
Part 1: Unconditionally cancel content JS 6 years ago Jim Porter (:squib) 12.93 KB, patch		Details \| Diff \| Splinter Review
Part 2: Pass navigation operation along HangMonitor channel to selectively cancel content JS 6 years ago Jim Porter (:squib) 12.93 KB, patch		Details \| Diff \| Splinter Review
Part 2: Pass navigation operation along HangMonitor channel to selectively cancel content JS (fixed) 6 years ago Jim Porter (:squib) 12.75 KB, patch		Details \| Diff \| Splinter Review
Part 2.5: Non-working patch to use enums 6 years ago Jim Porter (:squib) 9.22 KB, patch	smaug : feedback+	Details \| Diff \| Splinter Review
Part 1: Unconditionally cancel content JS (v2) 6 years ago Jim Porter (:squib) 12.92 KB, patch		Details \| Diff \| Splinter Review
Part 1: Unconditionally cancel content JS (v3) 6 years ago Jim Porter (:squib) 12.33 KB, patch		Details \| Diff \| Splinter Review
Part 2: Pass navigation operation along HangMonitor channel to selectively cancel content JS (v2) 6 years ago Jim Porter (:squib) 13.67 KB, patch	smaug : feedback+	Details \| Diff \| Splinter Review
Part 2: Pass navigation operation along HangMonitor channel to selectively cancel content JS (v3) 6 years ago Jim Porter (:squib) 20.76 KB, patch		Details \| Diff \| Splinter Review
Part 1: Unconditionally cancel content JS (v4) 6 years ago Jim Porter (:squib) 12.99 KB, patch		Details \| Diff \| Splinter Review
Part 2: Pass navigation operation along HangMonitor channel to selectively cancel content JS (v4) 6 years ago Jim Porter (:squib) 22.43 KB, patch		Details \| Diff \| Splinter Review
Bug 1493225, part 1 - Cancel content JS when navigating through history to prevent hangs 6 years ago Jim Porter (:squib) 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493225, part 2 - Cancel content JS when navigating through history to prevent hangs 6 years ago Jim Porter (:squib) 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493225, part 3 - Cancel content JS when navigating through history to prevent hangs 6 years ago Jim Porter (:squib) 47 bytes, text/x-phabricator-request		Details \| Review
Bug 1493225, part 4 - Cancel content JS when navigating through history to prevent hangs 6 years ago Jim Porter (:squib) 47 bytes, text/x-phabricator-request		Details \| Review