881804 - add interface for predictive actions

I'm not particularly worried about clock skew for this feature; it's used to "decay" our certainty in a predictive action based on how old our data is. If we're a little too aggressive because the clock was skewed backwards, it's not the end of the world. Same if we're too timid because the clock was skewed forward.

Patrick McManus [:mcmanus]

•

11 years ago

(In reply to Honza Bambas (:mayhemer) from comment #18) > There is a relatively complex (at least appearing so) logic and ABSOLUTELY > NO comments. Please add them. Explain all what you can. Indeed. Commenting is the last thing I tend to do, as it makes it so I have to go back, identify the most confusing parts on my own, and then I'm more likely to have commented the parts that will confuse other people, as well :) > ::: netwerk/base/public/nsINetworkSeer.idl > @@ +10,5 @@ > Shouldn't these be rather two interfaces? It seems like predict() will have > very different consumers then learn(). They may have different consumers, but they're rather intimately connected. You can't predict() without having already learn()ed something, so I think it makes more sense to have them together. > @@ +256,5 @@ > Oh god!! Another "Init with I/O" on the main thread. NO! I was under the impression that DB connections had to be opened on the main thread, is that not the case? I could have used the fully-async connection, but that would have resulted in an extra round trip to the main thread per prediction or learn, which I thought sounded worse. It's quite possible that I'm wrong, though :) > Also, when the service is disabled, you shouldn't open the database at all. Good point. > @@ +462,5 @@ > Just a note that this may fail code that does > do_GetService(); > NS_ENSURE_SUCCESS(); > > But since this is just an optimization thing, we might turn it to just a > no-op service when something goes wrong and only log a warning. Makes sense to me. > ::: netwerk/base/src/nsNetworkSeer.h > @@ +77,5 @@ > Please use cache for statements. See e.g DOMStorageDBThread.cpp as an > example. Good tip, will look into it. PB mode is handled just fine so far, but I think I'm going to change how it's handled based on Patrick's feedback above (it makes more sense).

Honza Bambas (:mayhemer)

Comment 21

•

11 years ago

(In reply to Nick Hurley [:hurley] from comment #20) > (In reply to Honza Bambas (:mayhemer) from comment #18) > > There is a relatively complex (at least appearing so) logic and ABSOLUTELY > > NO comments. Please add them. Explain all what you can. > > Indeed. Commenting is the last thing I tend to do, as it makes it so I have > to go back, identify the most confusing parts on my own, and then I'm more > likely to have commented the parts that will confuse other people, as well :) Lessen for next time ;) Please add the comments for the next patch version. > > > ::: netwerk/base/public/nsINetworkSeer.idl > > @@ +10,5 @@ > > Shouldn't these be rather two interfaces? It seems like predict() will have > > very different consumers then learn(). > > They may have different consumers, but they're rather intimately connected. > You can't predict() without having already learn()ed something, so I think > it makes more sense to have them together. Agree. > > > @@ +256,5 @@ > > Oh god!! Another "Init with I/O" on the main thread. NO! > > I was under the impression that DB connections had to be opened on the main > thread, is that not the case? No. You can open on any thread. Just have one connection per a thread (i.e. use a single connection object strictly on a single thread), best only one r/w and arbitrary number of r/o used on other threads. > > I could have used the fully-async connection, but that would have resulted > in an extra round trip to the main thread per prediction or learn, which I > thought sounded worse. It's quite possible that I'm wrong, though :) You don't need to use the async mozstorage api. Just look at how the dom storage sqlite code works, it's quite tuned.

u408661

Assignee

Comment 22

•

11 years ago

Attached patch Latest implementation patch (obsolete) — Details — Splinter Review

Jason - as promised (though a couple days later than promised), here's an updated implementation patch for your feedback. This is pretty much what I'm going to r? when the time comes (modulo any feedback from you), but a couple of the other patches aren't ready for that yet, so no r? quite yet.

Attachment #761693 - Attachment is obsolete: true

Attachment #761693 - Flags: feedback?(jduell.mcbugs)

Attachment #781114 - Flags: feedback?(jduell.mcbugs)

u408661

Assignee

Comment 23

•

11 years ago

Attached patch Latest integration patch (obsolete) — Details — Splinter Review

This is just an update of the previous integration patch to work with the new interface in the latest implementation patch. More integrations still to come.

Attachment #761696 - Attachment is obsolete: true

u408661

Assignee

Comment 24

•

11 years ago

Attached patch Firefox UI hook-up patch (obsolete) — Details — Splinter Review

This is the patch that hooks the seer up to the firefox UI (specifically for "Clear Private Data" functionality).

u408661

Assignee

Comment 25

•

11 years ago

(In reply to Nick Hurley [:hurley] from comment #22) > Created attachment 781114 [details] [diff] [review] > Latest implementation patch I should note that I've addressed all of Patrick's and Honza's feedback on the previous patch in this version (modulo the addition of functionality Patrick mentioned that isn't part of the current first-run functionality on the goals wiki)

u408661

Assignee

Comment 26

•

11 years ago

Attached patch Part 1 - seer implementation (obsolete) — Details — Splinter Review

Here we go, the big r?. This is the first in the patch series, containing the IDL, implementation, and associated support bits for the seer. There are still a couple failures on try that I'm working out (latest run at https://tbpl.mozilla.org/?tree=Try&rev=c4bccbbe3de9), but I don't expect those to wildly change anything, so I'm going to do review and try fixes in parallel. This includes predictive capabilities for startup, pageload, and link hover. There are also tests facilitated by nsINetworkSeerVerifier (the only purpose of that IDL is to support unit tests for this feature).

Attachment #781114 - Attachment is obsolete: true

Attachment #781114 - Flags: feedback?(jduell.mcbugs)

Attachment #790283 - Flags: superreview?(cbiesinger)

Attachment #790283 - Flags: review?(mcmanus)

u408661

Assignee

Comment 27

•

11 years ago

Attached patch Part 2 - Docshell integration (obsolete) — Details — Splinter Review

Herein lies the docshell integration, which is where we currently drive all our predictive actions from (with the exception of startup). This is the same patch that bz gave f+ to a while back, along with a whitespace fix I came across in the code I touched (one part was indented differently from the rest of the function).

Attachment #781117 - Attachment is obsolete: true

Attachment #790287 - Flags: review?(benjamin)

u408661

Assignee

Comment 28

•

11 years ago

Attached patch Part 3 - ScriptLoader integration (obsolete) — Details — Splinter Review

And here we have a way for the seer to learn about scripts that are loaded as part of a page. It's really quite simple, the interface being used is in part 1.

Attachment #790288 - Flags: review?(jst)

u408661

Assignee

Comment 29

•

11 years ago

Attached patch 04_layout.patch (obsolete) — Details — Splinter Review

Similar to part 3, some code so the seer can know about stylesheets and fonts being loaded.

Attachment #790291 - Flags: review?(dbaron)

u408661

Assignee

Comment 30

•

11 years ago

Attached patch Part 5 - image loading integration (obsolete) — Details — Splinter Review

Just like parts 3 and 4, except for images that we load (noticing a pattern here?)

Attachment #790293 - Flags: review?(joe)

u408661

Assignee

Comment 31

•

11 years ago

Attached patch Part 6 - browser hookups for clear private data (obsolete) — Details — Splinter Review

Here we go, the final bits (for now). This hooks up the seer to the clear private data functionality in the Firefox UI. It's quite possible that we don't need another "bucket" for this, and can just group it in with the existing "History" bucket, but this was simple enough to do, and changing it around to be under "History" will be simple enough, too.

Attachment #781119 - Attachment is obsolete: true

Attachment #790296 - Flags: review?(gavin.sharp)

Honza Bambas (:mayhemer)

Comment 32

•

11 years ago

Why is this still called "seer"? I thought "connect predictor" or so would be better.

Joe Drew (not getting mail)

Updated

•

11 years ago

Attachment #790293 - Flags: review?(joe) → review?(seth)

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 33

•

•

11 years ago

Comment on attachment 790283 [details] [diff] [review] Part 1 - seer implementation Note, I've focused on the interface changes and skimmed the rest. sr=biesi with some nits +/** + * nsINetworkSeerVerifier - used for testing the network seer to ensure it + * does what we expect it to do. + */ fyi, those comments typically go directly before the [scriptable] line +++ b/netwerk/base/src/Seer.cpp +// the result (since we're just trying to warn the cache) warn -> warm, I think, though I like your version :) + nsDependentCString("SELECT * FROM moz_startups;\n"), NS_LITERAL_CSTRING, here and in other places + ioThread->Dispatch(new SeerDBShutdownRunner(), NS_DISPATCH_NORMAL); This isn't a safe way to call an xpcom function, you need to keep a reference: nsCOMPtr<nsIRunnable> runner(new SeerDBShutdownRunner()); ioThread->Dispatch(runner, NS_DISPATCH_NORMAL); Otherwise, if dispatch addrefs and releases the pointer, it suddenly becomes invalid, which is bad. That said, I see this is a pretty common pattern, so maybe we decided not to care... +++ b/netwerk/base/src/Seer.h +struct uriInfo { Uppercase U

Attachment #790283 - Flags: superreview?(cbiesinger) → superreview+

u408661

Assignee

Comment 40

•

11 years ago

Attached patch Part 1 - seer implementation (v2) (obsolete) — Details — Splinter Review

Here's an update to the seer patch that addresses Seth's comments from comment #35, as well as the last failures I've seen so far on try (green as of https://tbpl.mozilla.org/?tree=Try&rev=a8e350619808). This does not (yet) address Biesi's sr+ comments, but I'm carrying forward his sr+ with the caveat that I need to address them in a later version of the patch (they're pretty much all mechanical, anyway).

Attachment #790283 - Attachment is obsolete: true

Attachment #790283 - Flags: review?(mcmanus)

Attachment #793054 - Flags: superreview+

Attachment #793054 - Flags: review?(mcmanus)

u408661

Assignee

Comment 41

•

11 years ago

Attached patch Part 4 - Layout integration (v2) (obsolete) — Details — Splinter Review

This is a minor update to the layout integration patch that addresses Seth's comments from comment #35 in conjunction with v2 of part 1.

Attachment #790291 - Attachment is obsolete: true

Attachment #790291 - Flags: review?(dbaron)

Attachment #793059 - Flags: review?(dbaron)

u408661

Assignee

Comment 42

•

11 years ago

Attached patch Part 5 - image loading integration (v2) (obsolete) — Details — Splinter Review

And here's an update to the image loading integration that addresses Seth's comments in conjunction with v2 of part 1.

Attachment #790293 - Attachment is obsolete: true

Attachment #793060 - Flags: review?(seth)

u408661

Assignee

Comment 43

•

11 years ago

Just to make this explicit, still TODO as of the time I write this comment: - Address Biesi's sr+ from comment #39 - Address Gavin's comments from comment #33 - Any more comments incoming from future reviews :)

Seth Fowler [:seth] [:s2h]

Comment 44

•

11 years ago

Comment on attachment 793060 [details] [diff] [review] Part 5 - image loading integration (v2) Review of attachment 793060 [details] [diff] [review]: ----------------------------------------------------------------- I love it! Thanks Nicholas!

Attachment #793060 - Flags: review?(seth) → review+

Benjamin Smedberg

•

11 years ago

(In reply to Patrick McManus [:mcmanus] [pto until aug 30] from comment #46) > Comment on attachment 793054 [details] [diff] [review] > Part 1 - seer implementation (v2) > > What telemetry are we going to gather here, and more broadly - how do we > assess if this is useful? I talked to Nick about this a couple of days ago. It seems like a good place to start would be to measure how many of the connections we opened speculatively have actually been used, and maybe check whether we get an improvement in DNS cache hit rate as well.

Patrick McManus [:mcmanus]

Comment 48

•

11 years ago

I think the cache hit rate we track right now is overall hit rate, including for speculatives - so we ought to also track non-speculative hit rate (which is what we would expect this to improve) Ideally I'd like to see some time metric be improved.. that's the bottom line, right? Time to first byte of mumble?

u408661

Assignee

Comment 49

•

11 years ago

(In reply to Patrick McManus [:mcmanus] [pto until aug 30] from comment #46) > let's deal with # of connections somewhere.. speculativeconnect() is pretty > conservative right now - but if we know a domain needs 6 then let's open 6! Patrick - I'm working through your review comments, and have come down to this last one (everything else is taken care of, and try still appears to be green!) Do you have a recommendation on where to do this? Should I modify nsISpeculativeConnect (or rather, its HTTP implementation), or should I add a new code path for doing speculative connects from the seer? I'm leaning towards the latter (since it seems like the existing speculative semantics are useful), but would like your expert opinion on the matter.

•

11 years ago

Attached patch Part 1 - seer implementation (v3) (obsolete) — Details — Splinter Review

Here's a patch with all of Patrick's comments to now addressed, with the exception of opening more connections when needed (see comment #49 for more about that). Carrying forward sr+, but not requesting review until the connections issue is addressed.

Attachment #793054 - Attachment is obsolete: true

Attachment #796699 - Flags: superreview+

u408661

Assignee

Comment 54

•

11 years ago

Attached patch Part 2 - Docshell integration (v2) — Details — Splinter Review

Smaug - here's a version of the patch updated to use helper functions, as requested, keeping the docshell logic as clean as it is now :)

Attachment #790287 - Attachment is obsolete: true

Attachment #796700 - Flags: review?(bugs)

u408661

Assignee

Comment 55

•

11 years ago

Attached patch Part 3 - ScriptLoader integration (v2) — Details — Splinter Review

jst - This is exactly the same patch, but with all the logic pushed down into helper functions that live inside netwerk/, just like I did for docshell, layout, and imagelib. I'm re-requesting review since the code has changed, but the semantics are 100% the same.

Attachment #790288 - Attachment is obsolete: true

Attachment #796701 - Flags: review?(jst)

u408661

Assignee

Comment 56

•

11 years ago

Attached patch Part 6 - browser hookups (v2) — Details — Splinter Review

Gavin - here's a new patch with the changes discussed above. Seer data has been lumped in with history for "clear private data". For "forget about site", I went ahead and just cleared all the seer data (as we do with the cache). Given that this is a speculative (with high confidence) optimization, I'm not too concerned about losing everything.

Attachment #790296 - Attachment is obsolete: true

Attachment #796702 - Flags: review?(gavin.sharp)

Boris Zbarsky [:bzbarsky]

Comment 57

•

11 years ago

Comment on attachment 793059 [details] [diff] [review] Part 4 - Layout integration (v2) OK. I think that the Seer interface should be clearly documented to say that what it wants is the document URI, not the HTTP referrer. And maybe it should be named something different so people don't get the two confused... In the CSS loader, you then just want to use the document URI, not the referrer URI, since the referrer in my example would be Y, not X. Similar in the font-face loader: the "referrer" there is always the stylesheet URI.

Attachment #793059 - Flags: review?(bzbarsky) → review-

Boris Zbarsky [:bzbarsky]

Comment 58

•

11 years ago

And similar for the image loads: those would use stylesheet URIs for image loads coming from style rules, with the patch in attachment 793060 [details] [diff] [review].

u408661

Assignee

•

11 years ago

Attached patch Part 4 - Layout integration (v3) — Details — Splinter Review

New patch for layout/, using the document URI instead of referrer (along with renamed variables to match accordingly). I've also changed the interface and added comments to make it clear that users should use document URI instead of a referrer, but I'm not going to upload that patch until I have a better, larger reason to update part 1 (since it's all just mechanical renames).

Attachment #793059 - Attachment is obsolete: true

Attachment #797003 - Flags: review?(bzbarsky)

u408661

Assignee

Updated

•

11 years ago

Attachment #797003 - Attachment description: 0004-Bug-881804-part-4-Plumb-layout-into-predictive-network-actions.-r-bz.patch → Part 4 - Layout integration (v3)

u408661

Assignee

Comment 63

•

11 years ago

Attached patch Part 5 - image loading integration (v3) — Details — Splinter Review

Seth - this patch is 100% the same as previous, but changed to use document URI instead of straight up referrer, as bz suggested in comment #58. Re-requesting review just to make sure.

Attachment #793060 - Attachment is obsolete: true

Attachment #797004 - Flags: review?(seth)

u408661

Assignee

•

11 years ago

Comment on attachment 796702 [details] [diff] [review] Part 6 - browser hookups (v2) Looks good to me, but over to Tim to take a closer look.

Attachment #796702 - Flags: review?(ttaubert)

Attachment #796702 - Flags: review?(gavin.sharp)

Attachment #796702 - Flags: feedback+

•

11 years ago

Attached patch Part 1 - seer implementation (v4) (obsolete) — Details — Splinter Review

Patrick - new patch with your review comments to this point addressed. The one thing that got a bit tricky was overriding restrictConnections, et. al. to allow us to open more speculative connections. I ended up having to get the data somewhere I was certain was on the main thread because of js objects that may have been passed as callbacks to SpeculativeConnect, and then pass that data to OnMsgSpeculativeConnect manually (which runs on the socket thread), so please take an extra-special look at that to make sure I'm not entirely crazy with my approach there :)

Attachment #796699 - Attachment is obsolete: true

•

11 years ago

Comment on attachment 806043 [details] [diff] [review] Part 1 - seer implementation (v4.1) Review of attachment 806043 [details] [diff] [review]: ----------------------------------------------------------------- I can almost smell success here! I only skimmed the sql.. honza I would appreciate it if you could read that - but don't feel the need to review more than that unless you want to. nick, the only must-fix I have left is dealing with RestrictConnections() - see below. ::: netwerk/base/public/nsISpeculativeConnect.idl @@ +29,5 @@ > in nsIInterfaceRequestor aCallbacks); > > }; > > +[builtinclass, uuid(2b6d6fb6-ab28-4f4c-af84-bfdbb7866d72)] definitely need some documentation here @@ +32,5 @@ > > +[builtinclass, uuid(2b6d6fb6-ab28-4f4c-af84-bfdbb7866d72)] > +interface nsISpeculativeConnectionOverrider : nsISupports > +{ > + readonly attribute unsigned long parallelSpeculativeConnectLimit; as long as its a builtinclass you can define these as infallible ::: netwerk/base/src/Seer.cpp @@ +664,5 @@ > + rv = svc->Init(); > + if (NS_FAILED(rv)) { > + SEER_LOG(("Failed to initialize seer, seer will be a noop")); > + } > + rv = svc->QueryInterface(aIID, aResult); should this be in an else branch? @@ +672,5 @@ > + > +// Get the full origin (scheme, host, port) out of a URI (maybe should be part > +// of nsIURI instead?) > +static void > +ExtractOrigin(nsIURI *uri, nsAutoCString &s) I'm thinking s should be Truncated() when you do an early return from an error @@ +1233,5 @@ > + LearnForStartup(uri); > + } > +} > + > +const int MAX_PAGELOAD_DEPTH = 10; good idea :) ::: netwerk/base/src/Seer.h @@ +64,5 @@ > + friend class SeerDBShutdownRunner; > + > + nsresult EnsureInitStorage(); > + > + // This is a proxy for the information we need from an nsIURI // because it is needed off main thread ::: netwerk/protocol/http/nsHttpConnectionMgr.cpp @@ +321,5 @@ > connInfo.forget(); > return rv; > } > > +class SpecConnectArgs : public nsISupports SpeculativeConnectArgs please can we nix nsISupports and not pretend this is com'ified? (yech) you can just implement AddRef() and Release() and still use nsRefPtr @@ +360,5 @@ > > caps |= ci->GetAnonymous() ? NS_HTTP_LOAD_ANONYMOUS : 0; > + args->mTrans = new NullHttpTransaction(ci, wrappedCallbacks, caps); > + > + nsCOMPtr<nsISpeculativeConnectionOverrider> o = do_GetInterface(callbacks); one letter variable names sounds like something I would get dinged for in a review :) @@ +2575,5 @@ > + bool ignoreIdle = false; > + > + if (args->mOverridesOK) { > + parallelSpeculativeConnectLimit = args->mParallelSpeculativeConnectLimit; > + restrictConnections = args->mRestrictConnections; among other things, RestrictConnections() will prevent a new speculative connection to a SPDY host with a full active (muxxable) connection to it. this code basically overrides that and would create multiple TCP sockets to a known (and connected) spdy host - we don't want that. Maybe we could soften the cases where the spdy-sate of the host is unknown and the handshake is in progress - but do we want to be more drastic than that?

Attachment #806043 - Flags: review?(mcmanus)

u408661

Assignee

Comment 79

•

11 years ago

Attached patch Part 1 - seer implementation (v5) (obsolete) — Details — Splinter Review

Yet another patch, with updates based on your latest set of comments. The (intermittent then almost perma) failure on try I mentioned in person today is also fixed in this patch.

Attachment #806043 - Attachment is obsolete: true

Attachment #806043 - Flags: review?(honzab.moz)

Attachment #807027 - Flags: superreview+

Attachment #807027 - Flags: review?(mcmanus)

Honza Bambas (:mayhemer)

Comment 80

•

11 years ago

Nick, don't want a review from me any more?

u408661

Assignee

•

11 years ago

Finally, once I got home to a tree, backed out in https://hg.mozilla.org/integration/mozilla-inbound/rev/f2b5dbc01325. No idea what it means that it was only in reftest and jsreftest that "application ran for longer than allowed maximum time".

Matt Brubeck (:mbrubeck)

Comment 87

•

11 years ago

It looks like these patches also perf caused regressions in the Tp5 and Tp4 page load benchmarks. (The performance improved again when the patches were backed out.)

u408661

Assignee

Comment 88

•

11 years ago

Further inspection indicates that this *only* happens on android 2.2 (android 4.0 is reliably green). Not 100% sure what this means (other than that older android versions are crap), but it may at least provide the opportunity for a workaround to get this landed & enabled on most platforms. Running a test patch through try right now, and we'll see how it goes.

Daniel Veditz [:dveditz]

Updated

•

11 years ago

Depends on: 917682

u408661

Assignee

Comment 89

•

11 years ago

Attached patch Android 2.2 workaround hack (obsolete) — Details — Splinter Review

So here's a thing. As mentioned in my previous comment, the tests *only* time out on Android 2.2 (Armv6 and Armv7). This patch is a hack to disable the seer on android versions less than 4 (determined based on the API level available) until the specific problems with android 2.2 is determined. I don't even know if this is a route we want to go, but if we do, it will at least get us 100% of most platforms, and around 2/3 of android (based on http://developer.android.com/about/dashboards/). I will note here that I think a large part of the problem with android 2.2 is related to the infrastructure or tests themselves. Since I've been digging in on this, I decided to look at how long green test runs (specifically Android 2.2 Opt Armv7 J3) took to complete successfully. My totally unscientific sample of 5 recent pushes on m-c and m-i show a range between 12 and 45 minutes (the timeout occurs after 60 minutes). Looking at my backed-out push to m-i, the green runs of the same test took between 18 and 35 minutes (so well within "normal" range), while (of course) the timed out tests were killed after 60 min. I will also note that I can even effectively make the seer a no-op service (by never dispatching any actual work to the seer i/o thread, but doing all the setup for it) and the reftests will *still* time out. Obviously, the "no-op" seer is still doing a little work (validation of inputs mostly), but it's really not much, and the fact that even that little bit of overhead can break tests is disconcerting, to say the least.

Attachment #810195 - Flags: review?(mcmanus)

u408661

Assignee

Comment 90

•

11 years ago

One final thing: the effect of the hack workaround is to ensure mInitialized == false, which makes the seer do even less work than just the setup for dispatching things to the i/o thread. Eleven runs of the same test (Android 2.2 Opt Armv7 J3) with the hack patch applied all completed in between 12 and 35 minutes, which is well within "normal" range for that test.

Jason Duell

Comment 91

•

11 years ago

Comment on attachment 810195 [details] [diff] [review] Android 2.2 workaround hack Review of attachment 810195 [details] [diff] [review]: ----------------------------------------------------------------- I'm totally fine with skipping earlier android version coverage. As pointed out it looks like we're just tipping already-bloated testruns just past the timeout, so no indications of serious problem anyway.

Attachment #810195 - Flags: review?(mcmanus) → review+

Patrick McManus [:mcmanus]

Updated

•

11 years ago

Attachment #810195 - Flags: review?(mcmanus)

Patrick McManus [:mcmanus]

Comment 92

•

11 years ago

it doesn't make a lot of sense to me that we're adding enough work to extend the runtime of the test to fail. This is probably related the tp4 and tp5 regressions as well. nick? jason - I don't understand why you canceled my review.

Patrick McManus [:mcmanus]

Comment 93

•

11 years ago

for instance - dnsprefetch was able to minimize tp4/5 pain by building a queue of stuff to submit to the prefetcher, batching that all together, and doing it after the parse was complete. it got bounced a bunch of times before that was complete. maybe that applies here? If the tp5 regressions are unavoidable I think we need to be able to characterize them (i.e. how bad they are) and at least have some measured anecdotal wins to compare them against using networks with actual latency.

u408661

Assignee

•

11 years ago

(In reply to Patrick McManus [:mcmanus] from comment #95) > maybe start with the tp4/tp5 issues matt points out in comment 87? > > I would feel more comfortable just calling this a scary test if it were the > only issue - but they feel like they are making the same complaint. Fair enough.

Matt Brubeck (:mbrubeck)

Comment 98

•

11 years ago

For reference, here's a graph that shows the Tp5 regression when this landed on 2013-09-20 and the improvement when it was backed out a day later: http://graphs.mozilla.org/graph.html#tests=[[255,63,21]]&sel=1379614356683,1380219156683 That shows a regression of about 6% on Mac OS X 10.6. Other platforms also regressed, by varying amounts. Tp4 on Android 4.0 regressed by about 8%.

u408661

Assignee

Comment 99

•

11 years ago

OK, for my reference (as much as anything else) https://tbpl.mozilla.org/?tree=Try&rev=cbd4698230ad is a try run, same code as the one mentioned in comment 96, but running talos as well, to see if we get the regression on tp4/tp5, or if that's a problem independent of the reftest failures.

u408661

Assignee

Comment 100

•

11 years ago

So I had a random theory that some of the issues may be related to less-than-great servers running on the tests, and perhaps the timeouts are caused by the same stuff Steve has been working on to eliminate speculative connections to rfc1918 IP blocks. As such, I pushed an experimental run to try with my patches applied on top of his latest patch. There's a 99% chance this is worthless, as I'm tired and not feeling well, but I figured it's worth a shot. https://tbpl.mozilla.org/?tree=Try&rev=4d2dcd19a2bb

Patrick McManus [:mcmanus]

Updated

•

11 years ago

Attachment #810195 - Flags: review?(mcmanus)

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 101

•

11 years ago

Comment on attachment 810195 [details] [diff] [review] Android 2.2 workaround hack It sounds like you've already decided against landing this, but I thought it important to note that I think it's a bad idea.

Attachment #810195 - Flags: feedback-

u408661

Assignee

Comment 102

•

11 years ago

My idea in comment 100 didn't pan out (jsreftests on android still time out, and at least Tp5 still regresses), however, my push from comment 99 (which does not involve dispatching any events to the seer thread, making the seer an effective no-op) seems to have a normal Tp5 time (this code is already known to not fix the jsreftest issue), so they seem to be two separate issues. Not that this knowledge will make anyone happier (it certainly doesn't make me happier), but it's progress of some sort.

u408661

Assignee

Comment 103

•

11 years ago

Attached patch Part 1 - seer implementation (v6) — Details — Splinter Review

OK, so here's what should be the final version of part 1. The only changes from the previous version are (1) The addition of code to disable the seer on versions of android < 2.3, and (2) turning off the seer xpcshell test on all versions of android. The reason we turn off the test on all versions (instead of just 2.2) is that the code from (1) doesn't work properly on our test infrastructure (it gets the wrong value) while it works perfectly on a real device. Fun! As discussed in email, here's some perf numbers from WPT with the seer: - On a modern internet connection like most of us in the US have, up to 10% better SpeedIndex for repeat views of a page - On a 3G internet connection, up to 23% better SpeedIndex for repeat views So (as I will also email to dev-tree-management), the tp5 regressions caused by these patches do not reflect the gains that will be seen on real networks. Win!

Attachment #807027 - Attachment is obsolete: true

Attachment #821311 - Flags: superreview+

Attachment #821311 - Flags: review?(mcmanus)

u408661

Assignee

Updated

•

11 years ago

Attachment #810195 - Attachment is obsolete: true

u408661

Assignee

Comment 104

•

11 years ago

Try run for the newest patch at https://tbpl.mozilla.org/?tree=Try&rev=5786b873e4c9 (I will keep an eye out, though I'm 99% certain I've done a full try run with this patch, and I just lost the link, hence the new run)

u408661

Assignee

•

11 years ago

Let's try this again: remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/4bcbb58917c9 remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/85ebad9a27c9 remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/21813034cb0e remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/4967a9b78382 remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/dbad5acdc1c7 remote: https://hg.mozilla.org/integration/mozilla-inbound/rev/17dbbb898b80

Phil Ringnalda (:philor)

Comment 110

•

11 years ago

https://hg.mozilla.org/mozilla-central/rev/4bcbb58917c9 https://hg.mozilla.org/mozilla-central/rev/85ebad9a27c9 https://hg.mozilla.org/mozilla-central/rev/21813034cb0e https://hg.mozilla.org/mozilla-central/rev/4967a9b78382 https://hg.mozilla.org/mozilla-central/rev/dbad5acdc1c7 https://hg.mozilla.org/mozilla-central/rev/17dbbb898b80

Status: NEW → RESOLVED

Comment 115

•

11 years ago

Erin, there isn't a whole lot about this that can be specifically tested, unless you want to start looking at packet traces. Certainly, basic things like "do pages load?" should work just like before, and in general, repeat views of a page should "feel" faster (this is what SpeedIndex measures), but that can be pretty subjective. For the power and data usage concerns... if QA wants to test that, then all they need to do is load some pages (with repeat views, to bring the seer into play) and see how things fare compared to older builds.

Flags: needinfo?(hurley)

Patrick McManus [:mcmanus]

Comment 116

•

11 years ago

nick has an awesome summary here. I just want to underscore that this feature is particularly advantageous for mobile because of mobile's poor bandwidth/latency ratio. And it is really exciting to see speedindex be able to validate that! in general - the best thing for mobile performance is to attack incidences of scaling-performance-by-rtt, because mobile rtt is so abysmal. Sometimes that means trading something you have an excess of for a more constrained restraint. In this case that means trading a bit of bandwidth in order to pickup an optimization of something that is latency based - and the result is a big win. We often think of mobile networks as slow and therefore bandwidth constrained - that's certainly true some of the time, but at the early stages of a page load they are actually very poor at using the available bandwidth - and that's the exact moment of time this technique is targetted at. In general costs of that bandwidth are something that need to be considered as part of the tradeoff, but in this particular case I think its a slam dunk because: 1] we expect a very high hit rate (much of the complexity of the feature is about trying to acheive that) - so we aren't as much using more bandwidth as shifting the bandwidth forward in time. However, we need to acknowledge that there will be some waste 2] wasted bandwidth here is comprised of just handhsakes and dns queries. They're both absolutely tiny - comprising well less than 0.1% of the bandwidth of even a mobile page that is being loaded.. so even a total miss would be lost in the noise. So its a no brainer imo The more interesting question will be when it is viable to prefetch resources (not just the setup stages). I think we will want to go there - but its clearly a harder problem.

Marco Bonardo [:mak]

Updated

•

11 years ago

Depends on: 935413

Tracy Walker [:tracy]

Comment 117

•

•

11 years ago

No longer blocks: 947745

Depends on: 947745

Mark Finkle (:mfinkle) (use needinfo?)

•

11 years ago

(In reply to :Ehsan Akhgari (needinfo? me!) from comment #124) > Great, that seems to address my concern. Seer::Learn and Seer::Predict are > the only API entry points from outside of netwerk/, right? The only ones that would cause data to be written, rather than erased (Seer::Reset), yes :)

(no longer active)

Comment 126

•

11 years ago

(In reply to comment #125) > (In reply to :Ehsan Akhgari (needinfo? me!) from comment #124) > > Great, that seems to address my concern. Seer::Learn and Seer::Predict are > > the only API entry points from outside of netwerk/, right? > > The only ones that would cause data to be written, rather than erased > (Seer::Reset), yes :) What's different about erasure? That could give away what you were doing in PB mode just as well as a regular write can...

(no longer active)

Comment 127

•

11 years ago

Oh, Reset clears *everything*, so yeah it's safe. Thanks!

Ioana (away)

Updated

•

11 years ago

Whiteboard: [qa-]

Olli Pettay [:smaug][bugs@pettay.fi]

Updated

•

11 years ago

Blocks: 997166

Virtual_ManPL [:Virtual] 🇵🇱 - (please needinfo? me - so I will see your comment/reply/question/etc.)

Updated

•

11 years ago

Comment 128

•

11 years ago

What kind of prediction failure rate can we expect ? A concern would be that, if Firefox sends too many DNS requests by mistake, an attacker listening to DNS servers could figure out which page the user is visiting within a website. That's why I disabled DNS prefetching. But if Seer is clever enough that it only sends DNS requests that would really occur anyway, I can keep it enabled. So how many wrong guesses can we expect with the current and future algorithms ? Thanks

u408661

Assignee

Comment 129

•

11 years ago

Hopefully the failure rate is pretty low, though we don't currently have a way to track that (especially not for DNS requests). The seer is designed to be clever, though, only doing a DNS prefetch or a TCP preconnect when it's confident that it would be a useful thing to do. There is a slight risk of unnecessary prefetches or preconnects, but it's specifically designed to be a very low risk.

Steph

Comment 130

•

11 years ago

Ok, I'll keep an eye on how things evolve then :) Uh... Is there a master bug or some place where discussion on Seer development occurs, similar to this bug ? I haven't completely grasped how you guys are organised and using Bugzilla. Thanks again. Since you're the "Product Champion" (love that name :D) I won't be double posting the concern on wrong DNS requests in this thread: https://groups.google.com/forum/#!topic/mozilla.dev.planning/aiV8k4XqvJs But tell me if you prefer that I do.

Virtual_ManPL [:Virtual] 🇵🇱 - (please needinfo? me - so I will see your comment/reply/question/etc.)

Updated

•

11 years ago

Blocks: 1009122

Virtual_ManPL [:Virtual] 🇵🇱 - (please needinfo? me - so I will see your comment/reply/question/etc.)

Updated

•

10 years ago

Depends on: 1016622

seer implementation patch 12 years ago u408661 42.64 KB, patch	mcmanus : feedback+ mayhemer : feedback-	Details \| Diff \| Splinter Review
example integrations 12 years ago u408661 4.02 KB, patch	bzbarsky : feedback+	Details \| Diff \| Splinter Review
Latest implementation patch 11 years ago u408661 55.25 KB, patch		Details \| Diff \| Splinter Review
Latest integration patch 11 years ago u408661 5.43 KB, patch		Details \| Diff \| Splinter Review
Firefox UI hook-up patch 11 years ago u408661 8.19 KB, patch		Details \| Diff \| Splinter Review
Part 1 - seer implementation 11 years ago u408661 72.00 KB, patch	Biesinger : superreview+	Details \| Diff \| Splinter Review
Part 2 - Docshell integration 11 years ago u408661 3.86 KB, patch	smaug : review-	Details \| Diff \| Splinter Review
Part 3 - ScriptLoader integration 11 years ago u408661 1.62 KB, patch	jst : review+	Details \| Diff \| Splinter Review
04_layout.patch 11 years ago u408661 4.25 KB, patch		Details \| Diff \| Splinter Review
Part 5 - image loading integration 11 years ago u408661 2.68 KB, patch	seth : review-	Details \| Diff \| Splinter Review
Part 6 - browser hookups for clear private data 11 years ago u408661 8.19 KB, patch	Gavin : feedback-	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v2) 11 years ago u408661 73.34 KB, patch	u408661 : superreview+	Details \| Diff \| Splinter Review
Part 4 - Layout integration (v2) 11 years ago u408661 3.47 KB, patch	bzbarsky : review-	Details \| Diff \| Splinter Review
Part 5 - image loading integration (v2) 11 years ago u408661 1.90 KB, patch	seth : review+	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v3) 11 years ago u408661 94.96 KB, patch	mayhemer : review- u408661 : superreview+	Details \| Diff \| Splinter Review
Part 2 - Docshell integration (v2) 11 years ago u408661 3.17 KB, patch	smaug : review+	Details \| Diff \| Splinter Review
Part 3 - ScriptLoader integration (v2) 11 years ago u408661 1.64 KB, patch	jst : review+	Details \| Diff \| Splinter Review
Part 6 - browser hookups (v2) 11 years ago u408661 1.89 KB, patch	ttaubert : review+ mossop : review+ Gavin : feedback+	Details \| Diff \| Splinter Review
Part 4 - Layout integration (v3) 11 years ago u408661 3.59 KB, patch	bzbarsky : review+	Details \| Diff \| Splinter Review
Part 5 - image loading integration (v3) 11 years ago u408661 2.01 KB, patch	seth : review+	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v4) 11 years ago u408661 107.88 KB, patch	u408661 : superreview+	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v4.1) 11 years ago u408661 107.18 KB, patch	u408661 : superreview+	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v5) 11 years ago u408661 111.62 KB, patch	mcmanus : review+ u408661 : superreview+	Details \| Diff \| Splinter Review
Android 2.2 workaround hack 11 years ago u408661 1.70 KB, patch	jduell.mcbugs : review+ Gavin : feedback-	Details \| Diff \| Splinter Review
Part 1 - seer implementation (v6) 11 years ago u408661 113.45 KB, patch	mcmanus : review+ u408661 : superreview+	Details \| Diff \| Splinter Review