Closed Bug 1179401 Opened 10 years ago Closed 10 years ago

Rewrite fetch-event-respond-with-stops-propagation.https.html to make it pass by not postMessaging after closing the window

Status:

RESOLVED FIXED

Milestone:

mozilla44

Tracking Flags:

            Tracking  Status
firefox42   ---       affected
firefox43   ---       fixed
firefox44   ---       fixed

People

(Reporter: noemi, Assigned: ehsan.akhgari)

Attachments

(3 files)

Nightly_Crash_wpt_sw_fetch-event-respond-with-stops-propagation.https.rtf, 10 years ago, Noemí Freire (:noemi), 112.94 KB, text/rtf
Restart a service worker from ServiceWorker::PostMessage() if it has died, 10 years ago, (no longer active), 6.62 KB, patch
Call stopImmediatePropagation() on the Event object in respondWith(), 10 years ago, (no longer active), 2.85 KB, patch, jdm: review+, lizzard: approval-mozilla-aurora+

Noemí Freire (:noemi)

Reporter

Description

•

10 years ago

Attached file Nightly_Crash_wpt_sw_fetch-event-respond-with-stops-propagation.https.rtf

A Nightly crash occurs when running the "fetch-event-respond-with-stops-propagation.https.html" wpt test, e.g. |./mach web-platform-tests _mozilla/service-workers/service-worker/fetch-event-respond-with-stops-propagation.https.html|, with today's (7/1) master build.

The assertion failure shown is as follows: "Assertion failure: workerPrivate, at /Users/noef/Documents/mozilla-central/dom/workers/ServiceWorker.cpp:93"

Please find attached the crash report corresponding to this

(no longer active)

Assignee

Updated

•

10 years ago

Assignee: nobody → ehsan

(no longer active)

Assignee

Comment 1

•

10 years ago

The interesting thing about this test is that it tries to call ServiceWorker::PostMessage after closing the window of the ServiceWorker object: <http://mxr.mozilla.org/mozilla-central/source/testing/web-platform/mozilla/tests/service-workers/service-worker/fetch-event-respond-with-stops-propagation.https.html?force=1#26>.

While debugging this, I found two independent issues, one of which I will file separately.

The first issue is that RuntimeService::CancelWorkersForWindow() calls Close() on the shared/serviceworker SharedWorker objects associated with the window. That in turn calls SharedWorker::NoteDeadWorker(), which sets its mWorkerPrivate to null. Given that CancelWorkersForWindow is called from FreeInnerObjects during docshell destruction (which can be triggered by calling nsINode::Remove from script, for example!), it seems like clearing mWorkerPrivate at that point is premature. Is this behavior intentional, Kyle?

It seems to me that the right fix is to remove the call to CloseSharedWorkersForWindow() from RuntimeService::CancelWorkersForWindow() <https://dxr.mozilla.org/mozilla-central/source/dom/workers/RuntimeService.cpp#2312>.

(no longer active)

Assignee

Comment 2

•

10 years ago

Forgot to needinfo Kyle! Please see comment 1.

Flags: needinfo?(khuey)

(no longer active)

Assignee

Comment 3

•

10 years ago

(Note that the fix I suggested in comment 1 seems to pass our tests...)

Kyle Huey (Exited; not receiving bugmail, old account, do not use)

Comment 4

•

10 years ago

SharedWorkers do not have postMessage, they only have a MessagePort. Why would you expect methods on a DOM object to continue to work after the window it lives in is closed?

Flags: needinfo?(khuey)

(no longer active)

Assignee

Comment 5

•

10 years ago

(In reply to Kyle Huey [:khuey] (khuey@mozilla.com) from comment #4)
> SharedWorkers do not have postMessage, they only have a MessagePort.

Yes, but ServiceWorkers have postMessage(), and internally, ServiceWorker objects have an underlying SharedWorker. See <https://dxr.allizom.org/mozilla-central/source/dom/workers/ServiceWorker.cpp?offset=0#88>. The GetWorkerPrivate() call there is failing here.

> Why would you expect methods on a DOM object to continue to work after the
> window it lives in is closed?

Because in this case the DOM object represents a service worker that can easily survive the lifetime of the window. I think that is quite reasonable.

(no longer active)

Assignee

Updated

•

10 years ago

Depends on: 1179567

Kyle Huey (Exited; not receiving bugmail, old account, do not use)

Comment 7

•

10 years ago

(In reply to Ehsan Akhgari (not reviewing patches, not reading bugmail, needinfo? me!) from comment #5)
> (In reply to Kyle Huey [:khuey] (khuey@mozilla.com) from comment #4)
> > SharedWorkers do not have postMessage, they only have a MessagePort.
> > Why would you expect methods on a DOM object to continue to work after the
> > window it lives in is closed?
>
> Because in this case the DOM object represents a service worker that can
> easily survive the lifetime of the window. I think that is quite reasonable.

We're getting into edge cases of the web platform here, but I don't think that is a reasonable assumption.

Boris Zbarsky [:bzbarsky]

Comment 8

•

10 years ago

Fwiw, behavior in different browsers differs radically for objects that outlive their window. It can even differ radically for different sorts of objects in the same browser.... That said, for something new like SharedWorker I would hope the spec would define the behavior.

Kyle Huey (Exited; not receiving bugmail, old account, do not use)

•

10 years ago

https://html.spec.whatwg.org/multipage/workers.html#the-worker%27s-lifetime

"A worker is said to be a permissible worker if its list of the worker's Documents is not empty, or if its list has been empty for no more than a short user-agent-defined timeout value, its WorkerGlobalScope is actually a SharedWorkerGlobalScope object (i.e. the worker is a shared worker), and the user agent has a browsing context whose Document is not completely loaded."

and

"Closing orphan workers: Start monitoring the worker such that no sooner than it stops being a protected worker, and no later than it stops being a permissible worker, worker global scope's closing flag is set to true."

Flags: needinfo?(khuey)

(no longer active)

Assignee

Comment 12

•

10 years ago

I think we are talking about different things. I am not talking about the lifetime of the underlying worker, but about the SharedWorker object that the script accesses. I don't see anything in either spec that says the SharedWorker or the ServiceWorker _objects_ should stop working.

That being said, the service worker spec actually leaves the decision of when to kill the worker to the UA. But more interestingly, it says that when a postMessage() is being executed, we need to run the service worker <http://slightlyoff.github.io/ServiceWorker/spec/service_worker/index.html#service-worker-postmessage-method>.

Should we rerun the service worker in ServiceWorker::PostMessage()? That seems to be a better fix here. Needinfoing some people who might have opinions here.

Flags: needinfo?(nsm.nikhil)

Flags: needinfo?(bkelly)

Flags: needinfo?(amarchesini)

Nikhil Marathe [:nsm] (No longer reading bugmail, please needinfo?)

Comment 13

•

10 years ago

(In reply to Ehsan Akhgari (not reviewing patches, not reading bugmail, needinfo? me!) from comment #12)
> That being said, the service worker spec actually leaves the decision to
> when it should kill the worker to the UA. But more interestingly, it says
> that when a postMessage() is being executed, we need to run the service
> worker
> <http://slightlyoff.github.io/ServiceWorker/spec/service_worker/index.html#service-worker-postmessage-method>.
>
> Should we rerun the service worker in ServiceWorker::PostMessage()? That
> seems to be a better fix here. Needinfoing some people who might have
> opinions here.

Yes, we should try to re-run. This means ServiceWorker::PostMessage will have to request a new SharedWorker from the SWM.

Flags: needinfo?(nsm.nikhil)

Ben Kelly [:bkelly, not reviewing]

Comment 14

•

10 years ago

I also agree that restarting the service worker is closer to the intent of the spec design.
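A minimal sketch of the restart-on-postMessage idea discussed in comments 12 to 14, written as a toy JavaScript model rather than Gecko code. `FakeManager`, `ensureRunning()`, `kill()` and `FakeServiceWorkerHandle` are hypothetical names invented for illustration; they only stand in for the SWM and the ServiceWorker object mentioned above.

```javascript
// Hypothetical stand-in for the service worker manager (the "SWM" in the thread).
class FakeManager {
  constructor() {
    this.starts = 0;      // how many times a worker has been started
    this.running = null;  // the currently running worker, if any
  }

  // Return the running worker, starting a fresh one if it has died.
  ensureRunning() {
    if (this.running === null) {
      this.starts += 1;
      this.running = { inbox: [] };
    }
    return this.running;
  }

  // Simulate the user agent killing the worker.
  kill() {
    this.running = null;
  }
}

// Hypothetical stand-in for the script-visible ServiceWorker object.
class FakeServiceWorkerHandle {
  constructor(manager) {
    this.manager = manager;
  }

  postMessage(message) {
    // Ask the manager for a live worker instead of assuming one still exists.
    const worker = this.manager.ensureRunning();
    worker.inbox.push(message);
  }
}
```

In this model the handle never caches a worker reference, so a message posted after `kill()` simply triggers a second start. Whether the real fix should look like this is exactly what the later comments debate.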

Flags: needinfo?(bkelly)

Noemí Freire (:noemi)

Reporter

Updated

•

10 years ago

Status: NEW → ASSIGNED

(no longer active)

Assignee

Comment 15

•

10 years ago

Attached patch Restart a service worker from ServiceWorker::PostMessage() if it has died

I tried this, but there is an issue that I'm not sure how to solve. By the time that this code kicks in in the WPT test case, the nsGlobalWindow::mContext is nulled out. You cannot create more than one context for the object because the second time that EnsureScriptEnvironment is called, this check prevents the creation of the global context: <https://dxr.mozilla.org/mozilla-central/source/dom/base/nsGlobalWindow.cpp#1938>. Because of this, the following check fails <https://dxr.mozilla.org/mozilla-central/source/dom/workers/WorkerPrivate.cpp#5061> and because of that, CreateServiceWorkerForWindow fails: <https://dxr.mozilla.org/mozilla-central/source/dom/workers/ServiceWorkerManager.cpp#2708>.

Kyle Huey (Exited; not receiving bugmail, old account, do not use)

•

10 years ago

(In reply to Ehsan Akhgari (not reviewing patches, not reading bugmail, needinfo? me!) from comment #19)
> Catalin, Nikhil mentioned that you have been looking at this, and it was
> also discussed in the F2F on Monday. Do you have any updates for what we
> should do here?

Yes, the conclusion we arrived at was that dom objects that outlive their global shouldn't be usable. Jake opened a sw issue until the appropriate spec change is decided: https://github.com/slightlyoff/ServiceWorker/issues/722.

> My personal preference is to take my patch here (since it is mandated by the
> spec) but WONTFIX this bug about this specific test, and perhaps say
> something in the spec about ServiceWorker objects not being functional after
> the Window they have been created from is navigated away or some such...

Yes, postMessage should spin up the service worker if it's not running. However, except for the orphan dom object case, this can never happen in gecko (I think). This change would be justified once we have limited lifetime for service workers. It's up to you.

I think postMessage should throw when |GetParentObject()| is null. Also, we should remove the |mWindow| and |mdDocument| references from ServiceWorker.

Flags: needinfo?(catalin.badea392)

(no longer active)

Assignee

Comment 21

•

10 years ago

Hmm, how can we ensure that this situation cannot happen without the orphan object?

Flags: needinfo?(catalin.badea392)

Cătălin Badea (:catalinb)

Comment 22

•

10 years ago

(In reply to Ehsan Akhgari (not reviewing patches, not reading bugmail, needinfo? me!) from comment #21)
> Hmm, how can we ensure that this situation cannot happen without the orphan
> object?

I'm working on a patch that will stop service workers if they're idle for a given amount of time in bug 1188545. The first patch that's under review refactors event dispatching code and ensures the worker is restarted if needed.

Flags: needinfo?(catalin.badea392)

(no longer active)

Assignee

Updated

•

10 years ago

Depends on: 1188545

Cătălin Badea (:catalinb)

•

10 years ago

Comment on attachment 8667680 [details] [diff] [review]
Call stopImmediatePropagation() on the Event object in respondWith()

Review of attachment 8667680 [details] [diff] [review]:
-----------------------------------------------------------------

::: testing/web-platform/mozilla/tests/service-workers/service-worker/fetch-event-respond-with-stops-propagation.https.html
@@ +26,2 @@
> worker.postMessage({port: channel.port2}, [channel.port2]);
> + frame.remove();

What's the reason for this change?

Attachment #8667680 - Flags: review?(josh) → review+

(no longer active)

Assignee

Comment 25

•

10 years ago

(In reply to Josh Matthews [:jdm] from comment #24)
> Comment on attachment 8667680 [details] [diff] [review]
> Call stopImmediatePropagation() on the Event object in respondWith()
>
> Review of attachment 8667680 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> :::
> testing/web-platform/mozilla/tests/service-workers/service-worker/fetch-event-respond-with-stops-propagation.https.html
> @@ +26,2 @@
> > worker.postMessage({port: channel.port2}, [channel.port2]);
> > + frame.remove();
>
> What's the reason for this change?

To bypass the crash in comment 0.
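The reordering in the reviewed hunk can be illustrated with a small stand-alone model. This is only a sketch under stated assumptions: `FakeFrame` and `FakeWorkerHandle` are hypothetical classes, and the thrown `Error` merely stands in for the assertion failure from comment 0 (the real failure was a crash, not an exception).

```javascript
// Hypothetical frame: only tracks whether it is still attached.
class FakeFrame {
  constructor() {
    this.attached = true;
  }
  remove() {
    this.attached = false;
  }
}

// Hypothetical worker handle that stops working once its frame is removed.
class FakeWorkerHandle {
  constructor(frame) {
    this.frame = frame;
    this.received = [];
  }
  postMessage(message) {
    if (!this.frame.attached) {
      // Stand-in for the failure seen when the owning window is gone.
      throw new Error("owning window is gone");
    }
    this.received.push(message);
  }
}

// Order used before the patch: close the window, then post.
function originalOrder() {
  const frame = new FakeFrame();
  const worker = new FakeWorkerHandle(frame);
  frame.remove();
  worker.postMessage("ping");
  return worker.received;
}

// Order after the patch: post first, then remove the frame.
function patchedOrder() {
  const frame = new FakeFrame();
  const worker = new FakeWorkerHandle(frame);
  worker.postMessage("ping");
  frame.remove();
  return worker.received;
}
```

In this model `originalOrder()` fails while `patchedOrder()` delivers the message, which matches the bug summary's wording of "not postMessaging after closing the window".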

(no longer active)

Assignee

Comment 26

•

10 years ago

Comment on attachment 8667680 [details] [diff] [review]
Call stopImmediatePropagation() on the Event object in respondWith()

Approval Request Comment
[Feature/regressing bug #]: Service workers
[User impact if declined]: This is a SW blocker.
[Describe test coverage new/current, TreeHerder]: Has a test.
[Risks and why]: This change is limited to SW related code and has no risk towards other code.
[String/UUID change made/needed]: None.

Attachment #8667680 - Flags: approval-mozilla-aurora?

Pulsebot

Comment 27

•

10 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/ece14c314ca0

Carsten Book [:Tomcat]

Comment 28

•

10 years ago

https://hg.mozilla.org/mozilla-central/rev/ece14c314ca0

Status: ASSIGNED → RESOLVED

Closed: 10 years ago

status-firefox44: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla44

Noemí Freire (:noemi)

Reporter

Comment 29

•

10 years ago

Hi, just checked on m-c (89732fcdb0ba revision) and the test successfully runs. Thanks for fixing it!

Summary
Harness status: OK
Found 1 tests
1 Pass

Details
Result  Test Name
Pass    respondWith() invokes stopImmediatePropagation()
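For readers unfamiliar with what the passing test checks, here is a rough stand-alone sketch of the behaviour named in the patch title. It assumes a runtime with the standard `Event` and `EventTarget` globals (for example a recent Node.js or a browser); `FakeFetchEvent` and `runListeners()` are hypothetical and are not taken from the actual test file.

```javascript
// Toy stand-in for FetchEvent; the real class exists only in a service worker scope.
class FakeFetchEvent extends Event {
  constructor() {
    super("fetch", { cancelable: true });
    this.response = null;
  }

  respondWith(response) {
    // Behaviour named in the patch title: later listeners must not run.
    this.stopImmediatePropagation();
    this.response = response;
  }
}

// Register two listeners and record which of them actually ran.
function runListeners() {
  const scope = new EventTarget();
  const calls = [];

  scope.addEventListener("fetch", (event) => {
    calls.push("first");
    event.respondWith("from first listener");
  });
  scope.addEventListener("fetch", () => {
    calls.push("second");
  });

  const event = new FakeFetchEvent();
  scope.dispatchEvent(event);
  return { calls, response: event.response };
}
```

With this model only the first listener is recorded, because `stopImmediatePropagation()` prevents the remaining listeners on the same target from being invoked.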

Liz Henry (:lizzard) (relman/hg->git project)

Updated

•

10 years ago

status-firefox43: --- → affected

Liz Henry (:lizzard) (relman/hg->git project)

Comment 30

•

10 years ago

Comment on attachment 8667680 [details] [diff] [review] Call stopImmediatePropagation() on the Event object in respondWith() Fixes a crash and perf issue, includes tests. Let's uplift this to aurora.

Attachment #8667680 - Flags: approval-mozilla-aurora? → approval-mozilla-aurora+

Carsten Book [:Tomcat]

Comment 31

•

10 years ago

https://hg.mozilla.org/releases/mozilla-aurora/rev/1501dc3ad3c3

status-firefox43: affected → fixed
