Closed Bug 1350359 Opened 8 years ago Closed 7 years ago

Expose alternate data (ex: JS Bytecode) in http cache via fetch() InternalResponse object

Tracking

Status:

RESOLVED FIXED

Milestone:

mozilla59

Tracking Flags:

Tracking

Status

firefox59

---

fixed

People

(Reporter: bkelly, Assigned: edenchuang)

References

(Blocks 1 open bug)

Details

Attachments

(6 files, 23 obsolete files)

P1: Introduce InternalResponse::mAlternativeBody | 7 years ago | Ben Hsu [:HoPang] | 1.18 KB, patch
P2: Retrieve the alternative data before the original data if applicale | 7 years ago | Ben Hsu [:HoPang] | 7.54 KB, patch
P1: Set alternative data type from InterceptedChannel to InternalRequest | 7 years ago | Ben Hsu [:HoPang] | 4.56 KB, patch
P2: Fetch and save alterntative data to InternalResponse | 7 years ago | Ben Hsu [:HoPang] | 8.56 KB, patch
P3.1: Place the alternative data to the InterceptedChannel | 7 years ago | Ben Hsu [:HoPang] | 17.43 KB, patch
P3.2. Fix a crash caused by off-main-thread destruction of a HttpChannelChild | 7 years ago | Ben Hsu [:HoPang] | 2.56 KB, patch
P4.1: Steal test_script_loader_js_cache.html as the base of the | 7 years ago | Ben Hsu [:HoPang] | 14.05 KB, patch
P4.2: Interpolate the service worker which performs pass-through fetch | 7 years ago | Ben Hsu [:HoPang] | 4.34 KB, patch
P4.3: Place all the tests into a single promise_test() clause | 7 years ago | Ben Hsu [:HoPang] | 4.19 KB, patch
P1: Set alternative data type from InterceptedChannel to InternalRequest | 7 years ago | Eden Chuang[:edenchuang] | 7.32 KB, patch | edenchuang : review+
P1: Set alternative data type from InterceptedChannel to InternalRequest. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 7.32 KB, patch | bkelly : review+
P2: Fetch and save alternative data to InternalResponse. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 13.18 KB, patch | bkelly : feedback+
P3.1: Place the alternative data to the InterceptedChannel. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 24.11 KB, patch | bkelly : feedback+
P3.2: Fix a crash caused by off-main-thread destruction of a HttpChannelChild. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 2.91 KB, patch
P4: mochitest for exposing alternate data in http cache via fetch() InternalResponse object. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 15.03 KB, patch | bkelly : review+
P1: Set alternative data type from InterceptedChannel to InternalRequest. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 7.32 KB, patch | edenchuang : review+
P2: Fetch and save alternative data to InternalResponse. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 23.69 KB, patch | bkelly : review-
P3.1: Place the alternative data to the InterceptedChannel. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 30.23 KB, patch | bkelly : review+
P4: mochitest for exposing alternate data in http cache via fetch() InternalResponse object. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 19.72 KB, patch | edenchuang : review+
P2: Fetch and save alternative data to InternalResponse. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 23.94 KB, patch | bkelly : review+
P1: Set alternative data type from InterceptedChannel to InternalRequest. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 7.32 KB, patch | edenchuang : review+
P2: Fetch and save alternative data to InternalResponse. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 25.50 KB, patch | edenchuang : review+
P3: Place the alternative data to the InterceptedChannel. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 30.81 KB, patch | edenchuang : review+
P4: Fix a crash caused by off-main-thread destruction of a HttpChannelChild. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 9.13 KB, patch | bkelly : review+
P5: mochitest for exposing alternate data in http cache via fetch() InternalResponse object. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 21.02 KB, patch | edenchuang : review+
P4: Fix a crash caused by off-main-thread destruction of a HttpChannelChild. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 9.13 KB, patch | edenchuang : review+
P6: Make sure releasing nsICacheInfoChannel of InternalResponse on the main thread. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 2.34 KB, patch
P6: Make sure releasing nsICacheInfoChannel of InternalResponse on the main thread. r?bkelly | 7 years ago | Eden Chuang[:edenchuang] | 4.05 KB, patch | bkelly : review+
P6: Make sure releasing nsICacheInfoChannel of InternalResponse on the main thread. r=bkelly | 7 years ago | Eden Chuang[:edenchuang] | 6.43 KB, patch | edenchuang : review+

Ben Kelly [:bkelly, not reviewing]

Reporter

Description

•

8 years ago

Bug 1231565 added a side-store for alternate cached data like compiled JS bytecode. We should allow C++ consumers of Response to access this via the InternalResponse. As a first step, this would simply connect back to the nsIChannel used in the FetchDriver. Bug 1336199 would be a later bug to allow storing this same data in the Cache API; it would build on the APIs introduced in this bug.

Luke Wagner [:luke]

Comment 1

•

8 years ago

Additionally, this bug would allow wasm compilation APIs (which compile a Response object) to implicitly cache wasm in alternate data.

Ben Kelly [:bkelly, not reviewing]

Reporter

Updated

•

8 years ago

Blocks: 1350364

Andrew Overholt [:overholt]

Updated

•

8 years ago

Priority: -- → P2

Ben Hsu [:HoPang]

Updated

•

7 years ago

Assignee: nobody → bhsu

Ben Hsu [:HoPang]

Comment 2

•

7 years ago

Attached patch P1: Introduce InternalResponse::mAlternativeBody (obsolete) — Details — Splinter Review

Attachment #8902644 - Flags: feedback?(bkelly)

Ben Hsu [:HoPang]

Comment 3

•

7 years ago

Attached patch P2: Retrieve the alternative data before the original data if applicale (obsolete) — Details — Splinter Review

Attachment #8902646 - Flags: feedback?(bkelly)

Ben Hsu [:HoPang]

Comment 4

•

7 years ago

Hi Ben,

After working on the alt-data caching for a while, I think I can start making some actual progress bit by bit.

In this bug, I tried hard to keep the original data and the alternative data matching each other. However, since we must send two requests to Necko to retrieve both the original and the alternative data, IMHO we can only reduce the chance of a mismatch, not eliminate it.

With those patches applied, a FetchDriver first makes a request for the alternative data, because when there is no alternative data it still gets the original data and can then resolve the fetch. On the other hand, if the alternative data does exist, the FetchDriver sends another request with (LOAD_FROM_CACHE | LOAD_ONLY_FROM_CACHE) for the original data. However, the original data and the alternative data can still mismatch if someone modifies or updates the HTTP cache in the window between the two requests.

Similarly, in order to acquire matched pairs of original and alternative data, I save an nsIInputStream in InternalResponse instead of the optional nsICacheInfoChannel mentioned in 900784#c66, since I think the longer the window between the two requests sent to Necko, the higher the chance of a mismatch.

If you think this plan is acceptable, I will check FetchDriver::HttpFetch() more carefully (I am deeply afraid of undesired side effects, since this method would be entered twice) and write a testcase for it.

Another important question is whether there is a way to verify that the original data matches the alternative data. If so, the implementation will be more stable.

And most important of all, thanks as always.
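Editorial sketch of the two-request ordering described in this comment. It is a minimal, self-contained model and not the patch itself: `MockCache`, `MockFetchResult`, and `FetchBoth` are hypothetical names, and the in-memory cache only stands in for Necko. It shows why one request suffices when no alternative data exists, and where the mismatch window sits when it does.

```cpp
#include <cassert>
#include <optional>
#include <string>

// Hypothetical in-memory stand-in for one HTTP cache entry (not a Gecko type).
struct MockCache {
  std::string mainData;
  std::optional<std::string> altData;  // empty when no alternative data is stored
  int requests = 0;                    // counts requests issued against the cache
};

// Hypothetical result holding both bodies, loosely modeled on the idea of
// keeping an alternative body next to the main body.
struct MockFetchResult {
  std::string mainBody;
  std::optional<std::string> alternativeBody;
};

inline MockFetchResult FetchBoth(MockCache& aCache) {
  // Request 1: ask for the alternative data. If none is stored, the request
  // falls back to the original data, so the fetch resolves after one request.
  ++aCache.requests;
  if (!aCache.altData) {
    return {aCache.mainData, std::nullopt};
  }
  std::string alt = *aCache.altData;

  // Request 2: cache-only request for the original data. A real cache could
  // be modified between request 1 and request 2; this mock cannot model that
  // race, it only marks where the window is.
  ++aCache.requests;
  return {aCache.mainData, alt};
}
```

In this model a cache miss for the alternative data costs one request and a hit costs two, which is the trade-off the comment describes.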

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 5

•

7 years ago

Hmm, this is pretty different than I was thinking. I was originally thinking we would have something like:

  nsICacheInfoChannel*
  InternalResponse::GetCacheInfoChannel() const
  {
    return mCacheInfoChannel;
  }

The fetch code itself would not try to read from the nsICacheInfoChannel at all. It would simply expose the interface if a gecko consumer of InternalResponse wants to use it.

I guess I see now that nsICacheInfoChannel does not work the way I expected it to. I thought it provided its own stream of data, but it does not. It just changes how the main nsIChannel works.

What if we did something like this:

1. Add a string attribute to InternalRequest like mPreferAlternateDataType.
2. Script created Request objects would get mPreferAlternateDataType set to the empty string.
3. Gecko internal code could set the InternalRequest mPreferAlternateDataType to some value with a C++ API.
4. FetchDriver simply calls PreferAlternateDataType() if mPreferAlternateDataType is not empty.

Then in a later bug we can add:

5. Add a way to ask an nsIChannel if it is preferring an alternate data type and what its string is.
6. When the FetchEvent.request is created from an intercepted channel we would set mPreferAlternateDataType based on whether the channel is preferring an alternate data type.

I guess the problem with this is the ServiceWorker script could pass the FetchEvent.request through to fetch(), get the alternate data stream, and then inspect it instead of passing it through to respondWith().

I'm sorry, but I think I need to think about this some more.
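Editorial sketch of steps 1 through 4 of this proposal, assuming simplified stand-ins rather than the real Gecko classes. `MockInternalRequest`, `MockCacheInfoChannel`, `ConfigureChannel`, and the example MIME string are all hypothetical; only the idea (an empty string means "main data only") comes from the comment.

```cpp
#include <cassert>
#include <string>
#include <utility>

// Hypothetical stand-in for a channel exposing a "prefer alternative data"
// call. This is not the real nsICacheInfoChannel interface.
struct MockCacheInfoChannel {
  std::string preferredType;  // empty means "main data only"
  void PreferAlternativeDataType(const std::string& aType) {
    preferredType = aType;
  }
};

// Hypothetical stand-in for InternalRequest carrying the preferred type.
class MockInternalRequest {
 public:
  // Step 3: internal (non-script) code may set a preferred type.
  void SetPreferredAlternativeDataType(std::string aType) {
    mPreferredAlternativeDataType = std::move(aType);
  }
  const std::string& GetPreferredAlternativeDataType() const {
    return mPreferredAlternativeDataType;
  }

 private:
  // Steps 1 and 2: a string attribute that stays empty by default, which is
  // what a script-created request would carry.
  std::string mPreferredAlternativeDataType;
};

// Step 4: the fetch code only touches the channel when a type was requested.
// Returns true if the channel was asked to prefer alternative data.
inline bool ConfigureChannel(const MockInternalRequest& aRequest,
                             MockCacheInfoChannel& aChannel) {
  const std::string& type = aRequest.GetPreferredAlternativeDataType();
  if (type.empty()) {
    return false;
  }
  aChannel.PreferAlternativeDataType(type);
  return true;
}
```

Keeping the type on the request means the common case, where nothing was requested, skips the alternative-data path entirely.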

Flags: needinfo?(bkelly)

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 6

•

7 years ago

I guess maybe we could make FetchDriver:

a. By default only request the main data.
b. If mPreferredDataType is set then we would request both the main data and the alternate data.

This would allow alternate data use for pass-through SW requests. If a SW created a completely separate Request, though, then it would not get the alternate data source optimization.

If mPreferredDataType is set we would provide access to the output stream to write alternate data.

To deal with the issue where the http cache entry is changed, maybe there is some way to determine if the resulting main data source is from the same cache entry as the alternate source. If they don't match, then we just throw away the alternate data and keep the main data. Maybe this could be done by comparing the cacheKey and expiration time in nsICacheInfoChannel?

What do you think?

Flags: needinfo?(bkelly) → needinfo?(bhsu)

Ben Kelly [:bkelly, not reviewing]

Reporter

Updated

•

7 years ago

Depends on: 1395202

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 7

•

7 years ago

Honza offered to add an integer identifier to nsICacheInfoChannel so we can tell if two nsIChannel objects are pulling from the same cache entry. This would let us discard the alternate data if we detect the main data is from a different entry.
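Editorial sketch of the check this identifier would enable, under the assumption that each load reports the id of the cache entry it was read from. `MockCachedLoad`, `MockResponseBodies`, and `CombineLoads` are hypothetical names for illustration; the actual attribute is the subject of bug 1395202 and is not reproduced here.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <string>

// Hypothetical record of one load: the data plus the id of the cache entry
// it came from.
struct MockCachedLoad {
  uint64_t cacheEntryId;
  std::string data;
};

// Hypothetical pair of bodies kept on a response.
struct MockResponseBodies {
  std::string mainBody;
  std::optional<std::string> alternativeBody;
};

// Keep the alternative data only when both loads came from the same cache
// entry; otherwise discard it and fall back to the main data alone.
inline MockResponseBodies CombineLoads(
    const MockCachedLoad& aMain, const std::optional<MockCachedLoad>& aAlt) {
  MockResponseBodies result{aMain.data, std::nullopt};
  if (aAlt && aAlt->cacheEntryId == aMain.cacheEntryId) {
    result.alternativeBody = aAlt->data;
  }
  return result;
}
```

Dropping mismatched alternative data is always safe in this model, because the main data alone is enough for the consumer to proceed.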

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 8

•

7 years ago

Actually, Honza is leaving for PTO shortly. HoPang, do you think you could do bug 1395202 as well? Valentin or Michal could probably help there if you have questions.

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 9

•

7 years ago

Comment on attachment 8902646 [details] [diff] [review]
P2: Retrieve the alternative data before the original data if applicale

Review of attachment 8902646 [details] [diff] [review]:
-----------------------------------------------------------------

::: dom/fetch/FetchDriver.cpp
@@ +402,5 @@
>
> +  if (mFetchingDataType == eAlternativeData) {
> +    nsCOMPtr<nsICacheInfoChannel> cic = do_QueryInterface(chan);
> +    if (cic && nsContentUtils::IsBytecodeCacheEnabled()) {
> +      cic->PreferAlternativeDataType(nsContentUtils::JSBytecodeMimeType());

Ideally I think we should let the consumer of InternalResponse pass in the type string instead of hard coding this. I believe we are already considering storing wasm alternate data streams, etc. We will want to support that.

Attachment #8902646 - Flags: feedback?(bkelly)

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 10

•

7 years ago

Comment on attachment 8902644 [details] [diff] [review]
P1: Introduce InternalResponse::mAlternativeBody

Dropping the flags here since I commented separately in the bug. Overall I agree we need to read both streams, but I think we need to pass the type through the InternalRequest.

Attachment #8902644 - Flags: feedback?(bkelly)

Ben Hsu [:HoPang]

Comment 11

•

7 years ago

(In reply to Ben Kelly [:bkelly] from comment #6)
> I guess maybe we could make FetchDriver:
>
> a. By default only request the main data.
> b. If mPreferredDataType is set then we would request both the main data and
> the alternate data.

Totally agreed, but we should be careful when deciding whether to request the alternative data. At this moment, I can only think of using the source URL as the decider, which should be treated with extra care in certain cases, such as ".py" being used for script resources in wpt tests.

> This would allow alternate data use for pass-through SW requests. If a SW
> created a completely separate Request, though, then it would not get the
> alternate data source optimization.

Agreed.

> If mPreferredDataType is set we would provide access to the output stream to
> write alternate data.

Sorry, I don't quite get where and why we have to write anything. I thought the thing to do in this bug was to create a pipe for the data pumped out from the nsIChannel for the alternative data.

> To deal with the issue where the http cache entry is changed, maybe there is
> some way to determine if the resulting main data source is from the same
> cache entry as the alternate source. If they don't match, then we just
> throw away the alternate data and keep the main data. Maybe this could be
> done by comparing the cacheKey and expiration time in nsICacheInfoChannel?

Yes, I think bug 1395202 is absolutely worth doing. Let me start from it ;P

Flags: needinfo?(bhsu)

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 12

•

7 years ago

(In reply to Ben Hsu [:HoPang] from comment #11)
> (In reply to Ben Kelly [:bkelly] from comment #6)
> > a. By default only request the main data.
> > b. If mPreferredDataType is set then we would request both the main data and
> > the alternate data.
>
> Totally agreed, but we should be careful when deciding whether to request
> the alternative data. At this moment, I can only think of using the source
> URL as the decider, which should be treated with extra care under certain
> cases such as ".py" is used for script resources in wpt tests.

I don't think the URL is a good discriminator here. The place in the code that creates the nsIChannel or InternalRequest is the best place. So for example:

1. Main thread ScriptLoader would set the JS bytecode mimetype as the preferred alternate data source because it knows it can consume it.
2. We propagate that to the FetchEvent.request's InternalRequest.
3. If the SW script does a pass-through fetch(evt.request) or a cache.match(evt.request) then we try to load both the alternate and main streams in the InternalResponse.
4. If script accesses the body via Response API then it gets the main data stream. If it's synthesized on nsIChannel then it gets the alternate data stream. If it goes to Cache API then we write both streams to disk.

> > If mPreferredDataType is set we would provide access to the output stream to
> > write alternate data.
>
> Sorry, I don't quite get where and why we have to write anything. I thought
> the thing needed to do in this bug is creating a pipe for the data pumped
> out from the nsIChannel for the alternative data.

So, reading the alternate data stream is only half the problem. The other side of it is populating the alternate data in http cache, Cache API, etc. This works today like:

1. Main thread ScriptLoader creates an nsIChannel and marks that it prefers the JS bytecode mimetype.
2. Let's say there is no bytecode and the nsIChannel provides the main data instead.
3. ScriptLoader compiles the main data stream as JS.
4. ScriptLoader then writes the bytecode back to the nsIOutputStream provided by nsICacheInfoChannel.openAlternativeOutputStream.

So we need to be able to hold on to the nsICacheInfoChannel object in our InternalResponse and proxy GetOpenAlternativeOutputStream() to get the output stream to write to the http cache. Or provide a different implementation that writes to the Cache API.

Does that help clarify anything?
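Editorial sketch of the read-then-populate flow listed in this comment, using a toy in-memory cache entry. Everything here is a hypothetical stand-in: `MockCacheEntry`, `OpenChannel`, `FakeCompile`, and `LoadScript` are not Gecko APIs, the entry is assumed to hold at most one alternative data type, and assigning to the entry only stands in for writing to the stream returned by openAlternativeOutputStream.

```cpp
#include <cassert>
#include <string>

// Hypothetical in-memory stand-in for one HTTP cache entry.
struct MockCacheEntry {
  std::string mainData;
  std::string altType;  // empty when no alternative data is stored
  std::string altData;
};

// What a load hands back: either the alternative data or the main data.
struct MockLoad {
  bool isAlternative;
  std::string data;
};

// Steps 1 and 2: the consumer states a preferred type; the load returns the
// alternative data when it exists for that type, otherwise the main data.
inline MockLoad OpenChannel(const MockCacheEntry& aEntry,
                            const std::string& aPreferredType) {
  if (!aPreferredType.empty() && aEntry.altType == aPreferredType) {
    return {true, aEntry.altData};
  }
  return {false, aEntry.mainData};
}

// Placeholder for compilation; a real loader would produce bytecode.
inline std::string FakeCompile(const std::string& aSource) {
  return "bytecode(" + aSource + ")";
}

// Steps 3 and 4: compile the main data when no alternative data came back,
// then store the result so the next load can skip compilation.
inline std::string LoadScript(MockCacheEntry& aEntry,
                              const std::string& aType) {
  MockLoad load = OpenChannel(aEntry, aType);
  if (load.isAlternative) {
    return load.data;
  }
  std::string bytecode = FakeCompile(load.data);
  aEntry.altType = aType;
  aEntry.altData = bytecode;
  return bytecode;
}
```

The second call to `LoadScript` on the same entry returns the stored data without compiling, which is the optimization that is lost if an intercepting service worker cannot pass the alternative data through.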

Ben Hsu [:HoPang]

Comment 13

•

7 years ago

Sorry for the late reply. I have not been feeling well recently, and I need some more time to understand how things work and how they should work in the future :(

The comment does help a lot. It seems that I was overly focused on scenarios like a service worker trying to fetch and store all the resources, including scripts, on events like `activate`. To make sure that I am on the right track, I'd like to summarize the plan here, which should include all the content in the following table.

| TODO                                                   | ScriptLoader Requests | Other Requests |
| ------------------------------------------------------ | --------------------- | -------------- |
| Retrieve the alternative data from HTTP Cache          | 1350359               | Follow-up      |
| Update the alternative data to HTTP Cache              | Follow-up             | Follow-up      |
| Save the alternative data from HTTP Cache to DOM Cache | 1336199               | Follow-up      |
| Retrieve the alternative data from DOM Cache           | 1336199               | Follow-up      |
| Update the alternative data to DOM Cache               | Follow-up             | Follow-up      |

Note: When updating the DOM cache, we shouldn't update the HTTP cache, since the main data in the HTTP cache might no longer remain the same as the one cached in the DOM cache.

At this moment, I think I should work on 1350359 and 1336199 first, since they can stop the performance regression from ScriptLoaders not using alternative data when being intercepted by a service worker, and I currently have WIP patches for both of them. Besides, we can do the updating work together.

To tackle this bug, I'd like to address your comment, which includes (1) creating InternalRequest::mPreferredAlternativeDataType and (2) making FetchDrivers fetch both kinds of data if needed (depending on whether the request is coming from ScriptLoader). For (2), `HttpFetch` could still be entered twice, as in the WIP patch. However, I suggest focusing on the "retrieving the alternative data from HTTP Cache" part in this bug, and thus I think making InternalRequests hold the nsICacheInfoChannel could be done in a follow-up bug.

There is one more thing to note: FetchDrivers would only fetch one type of alternative data, which means that the code would need to be updated if we do want to fetch both JIT code and WASM.

Ben Kelly [:bkelly, not reviewing]

Reporter

Comment 14

•

7 years ago

(In reply to Ben Hsu [:HoPang] from comment #13)
> To tackle this bug, I'd like to address your comment, which includes (1)
> creating InternalRequest::mPreferredAlternativeDataType and (2) making
> FetchDrivers fetch both the data if needed (Depends on whether the request
> is coming from ScriptLoader). For (2), `HTTPFetch` will still could be
> entered twice as in the WIP patch. However, I suggest focusing "retrieving
> the alternative data from HTTP Cache" part in this bug, and thus I think
> making InternalRequests holding the nsICacheInfoChannel could be done in a
> following bug. There is one more thing to be noted, FetchDrivers would only
> fetch one type of alternative data, which means that the code should be
> updated if we do want to fetch both JIT code and WASM.

Hmm, I wasn't suggesting that InternalRequest should hold a ref to the nsICacheInfoChannel. Only that it should have a string containing the preferred alternate data mime type. Keeping this on the request lets us:

1. Know what type of alternate data to try to load.
2. Avoid the work of trying to load any alternate data if the type is not set. (Which will be often.)

I don't think we should put this off to a follow-up bug. I think it's an essential part of reading the alternate data via the fetch API primitives.

Ben Hsu [:HoPang]

Comment 15

•

7 years ago

> Hmm, I wasn't suggesting that InternalRequest should hold a ref to the
> nsICacheInfoChannel. Only that it should have a string containing the
> prefered alternate data mime type. Keeping this on the request lets us:

My bad, I meant making InternalResponse hold a ref to the nsICacheInfoChannel here.

Ben Hsu [:HoPang]

Comment 16

•

7 years ago

Just a status update: previously, I was stuck on various issues, such as crashes in other testcases and intermittent failures. At this moment, I think I've worked around or fixed them, and I am polishing the patches now.

Ben Hsu [:HoPang]

Comment 17

•

7 years ago

Attached patch P1: Set alternative data type from InterceptedChannel to InternalRequest (obsolete) — Details — Splinter Review

Ben Hsu [:HoPang]

Comment 18

•

7 years ago

Attached patch P2: Fetch and save alterntative data to InternalResponse (obsolete) — Details — Splinter Review

Ben Hsu [:HoPang]

Comment 19

•

7 years ago

Attached patch P3.1: Place the alternative data to the InterceptedChannel (obsolete) — Details — Splinter Review

Ben Hsu [:HoPang]

Comment 20

•

7 years ago

Attached patch P3.2. Fix a crash caused by off-main-thread destruction of a HttpChannelChild (obsolete) — Details — Splinter Review

Ben Hsu [:HoPang]

Comment 21

•

7 years ago

Attached patch P4.1: Steal test_script_loader_js_cache.html as the base of the (obsolete) — Details — Splinter Review

testcase. r=bkelly

Ben Hsu [:HoPang]

Comment 22

•

7 years ago

Attached patch P4.2: Interpolate the service worker which performs pass-through fetch (obsolete) — Details — Splinter Review

r=bkelly

Ben Hsu [:HoPang]

Comment 23

•

7 years ago

Attached patch P4.3: Place all the tests into a single promise_test() clause (obsolete) — Details — Splinter Review

Nicolas B. Pierron [:nbp]

Comment 24

•

7 years ago

Comment on attachment 8918150 [details] [diff] [review]
P4.1: Steal test_script_loader_js_cache.html as the base of the

Review of attachment 8918150 [details] [diff] [review]:
-----------------------------------------------------------------

::: dom/workers/test/serviceworkers/test_script_loader_intercepted_js_cache.html
@@ +43,5 @@
> +        "scriptloader_execute": "bytecode_exec"
> +      }
> +    };
> +
> +    function flushNeckoCache() {

nit: Note, this function got removed recently, as well as all the calls to it from the dom/base test case, because it was somehow leaking the CacheIOThread.

Ben Hsu [:HoPang]

Updated

•

7 years ago

Attachment #8902644 - Attachment is obsolete: true

Ben Hsu [:HoPang]

Updated

•

7 years ago

Attachment #8902646 - Attachment is obsolete: true

Ben Hsu [:HoPang]

Comment 25

•

7 years ago

(In reply to Nicolas B. Pierron [:nbp] from comment #24)
> Comment on attachment 8918150 [details] [diff] [review]
> P4.1: Steal test_script_loader_js_cache.html as the base of the
>
> Review of attachment 8918150 [details] [diff] [review]:
> -----------------------------------------------------------------
>
> :::
> dom/workers/test/serviceworkers/test_script_loader_intercepted_js_cache.html
> @@ +43,5 @@
> > +        "scriptloader_execute": "bytecode_exec"
> > +      }
> > +    };
> > +
> > +    function flushNeckoCache() {
>
> nit: Note, this function got removed recently, as well as all the calls to
> it from the dom/base test case, because it was somehow leaking the
> CacheIOThread.

Thanks!

Ben Hsu [:HoPang]

Comment 26

•

7 years ago

Per offline discussion, I'd like to hand these patches over to Eden :)

Assignee: bhsu → echuang

Flags: needinfo?(echuang)