1472110 - 3.77 - 5.8% remote-blank / remote-nytimes (android-4-2-armv7-api16, android-6-0-armv8-api16) regression on push d6120c2bb51e2057df51f4d52510bb5f4e8b4ca5 (Fri Jun 29 2018)

Reporter

Description

•

6 years ago

We have detected an autophone (Android) regression from push:

https://hg.mozilla.org/integration/mozilla-inbound/pushloghtml?changeset=d6120c2bb51e2057df51f4d52510bb5f4e8b4ca5

As author of one of the patches included in that push, we need your help to address this regression.

Regressions:

  6%  remote-blank android-6-0-armv8-api16 opt      385.48 -> 407.86
  5%  remote-nytimes android-4-2-armv7-api16 opt    2,964.92 -> 3,119.07
  4%  remote-nytimes android-6-0-armv8-api16 opt    1,002.64 -> 1,040.42


You can find links to graphs and comparison views for each of the above tests at: https://treeherder.mozilla.org/perf.html#/alerts?id=14082

On the page above you can see an alert for each affected platform as well as a link to a graph showing the history of scores for this test. There is also a link to a treeherder page showing the jobs in a pushlog format.

To learn more about the regressing test(s), please see: https://wiki.mozilla.org/EngineeringProductivity/Autophone

Ionuț Goldan [:igoldan]

Reporter

Updated

•

6 years ago

Product: Testing → Firefox Build System

Version: Version 3 → unspecified

Ionuț Goldan [:igoldan]

Reporter

Updated

•

6 years ago

Flags: needinfo?(kmaglione+bmo)

Kris Maglione [:kmag]

Comment 1

•

6 years ago

I believe this is actually from bug 1453691.

Blocks: 1453691
No longer blocks: 1459004

Flags: needinfo?(kmaglione+bmo) → needinfo?(wisniewskit)

Bob Clary [:bc] (inactive)

Comment 2

•

6 years ago

The patch for bug 1453691 landed on autoland 2018-05-31 and mozilla-central 2018-06-01 so it can't be the cause.

<http://phonedash.mozilla.org/#/2018-06-28/2018-06-29/binning=repo-phonetype-phoneid-test_name-cached_label&rejected=norejected&errorbars=noerrorbars&errorbartype=standarderror&valuetype=median&remote-blank=on&remote-nytimes=on&throbberstart=on&first=on&autoland=on&mozilla-inbound=on&nexus-4=on&nexus-4-09=on&nexus-5=on&nexus-5-06=on&nexus-6p=on&nexus-6p-08=on&nexus-6p-10=on&nexus-6p-12=on>

Looks like Bug 1459004 to me.

Mike Taylor [:miketaylr]

Comment 3

•

6 years ago

Without 1459004, we can't ship system addons in Fennec. We need it for 2 webcompat-related addons: the fb/google experiment one (which is nightly only, and we'll back out in ~4 weeks), and a mobile equivalent of the gofaster webcompat addon that ships in Dekstop.

Nathan Froyd [:froydnj]

Comment 4

•

6 years ago

Bug 1459004 seems unlikely; how does the change to how the file get generated impact performance at runtime?  Different ordering of the file, maaaybe?  Is the generated file somehow larger than the previous file?

Thomas Wisniewski [:twisniewski]

Comment 5

•

6 years ago

Note that bug 1459004 caused this add-on to start working, so it could very well be the reason. Indeed, the add-on *is* intermittently kicking in when I visit nytimes.com on today's Fennec nightly, as its informational console message appears during some page-loads: "The user agent string has been overridden to get the Chrome experience on this site."

I've investigated during one of those times, and it seems that there are indeed resources being loaded from www.google.com and www.facebook.com, like a bunch of single-pixel tracking gifs and 0-byte html files.

>https://www.google.com/ads/user-lists/1008590664/?random=1530283404354&cv=9&fst=1530280800000&num=1&guid=ON&u_h=640&u_w=360&u_ah=640&u_aw=360&u_cd=24&u_his=2&u_tz=-240&u_java=false&u_nplug=0&u_nmime=0&gtm=G6c&sendb=1&frm=0&url=https://mobile.nytimes.com/&tiba=The New York Times - Breaking News, World News & Multimedia&async=1&fmt=3&cdct=2&is_vtc=1&random=3782160405&resp=GooglemKTybQhCsO&rmt_tld=0&ipr=y

>https://www.facebook.com/tr/?id=100468016962764&ev=PageView&dl=https%3A%2F%2Fmobile.nytimes.com%2F&rl=&if=false&ts=1530283386272&sw=360&sh=640&v=2.8.18&r=stable&ec=0&o=28&it=1530283386198

However, those same resources are sometimes loaded without my add-on enabled, where the normal Firefox UA is being sent. So I can't be sure if the add-on is actually the culprit. It's possible that the add-on is triggering perf regression, perhaps due to a cause like one of these:

- the ad-loading scripts behave differently when given a Chrome UA, which somehow makes them a bit slower on Firefox.
- the ads being loaded are simply not the same ones in all Talos runs, and some of them happen to load more slowly than expected.

It's possible that tweaking the add-on to not kick in on www.google.com/ads and www.facebook.com/tr might be enough to make this regression go away. But then again this all could just be a red herring.

Flags: needinfo?(wisniewskit)

Kris Maglione [:kmag]

Comment 6

•

6 years ago

Unfortunately, remote-blank and remote-nytimes regressions usually just mean startup time regressions. That would be my best guess here.

Bob Clary [:bc] (inactive)

Comment 7

•

6 years ago

These were all on first visit measurements not the second visit. Autophone S1S2 does start the browser to load a blank page and then shuts down before beginning the real tests fwiw.

Ionuț Goldan [:igoldan]

Reporter

Comment 8

•

6 years ago

(In reply to twisniewski from comment #5)
> It's possible that the add-on is
> triggering perf regression, perhaps due to a cause like one of these:
> 
> - the ad-loading scripts behave differently when given a Chrome UA, which
> somehow makes them a bit slower on Firefox.
> - the ads being loaded are simply not the same ones in all Talos runs, and
> some of them happen to load more slowly than expected.

Can we check these scenarios?

Flags: needinfo?(wisniewskit)

Thomas Wisniewski [:twisniewski]

Comment 9

•

6 years ago

I'm unaware of how the Talos test operates, so I can't be sure. If it could be loading different content, then perhaps we could log the network requests being made by a series of runs of that test, to confirm which ad/etc is being loaded, and which UA string is being sent to each request.

In addition, the fix in bug 1473181 could impact Talos runs as well, so it might be worth waiting for that to land.

Flags: needinfo?(wisniewskit)

Mike Taylor [:miketaylr]

Comment 10

•

6 years ago

(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #8)
> (In reply to twisniewski from comment #5)
> > It's possible that the add-on is
> > triggering perf regression, perhaps due to a cause like one of these:
> > 
> > - the ad-loading scripts behave differently when given a Chrome UA, which
> > somehow makes them a bit slower on Firefox.
> > - the ads being loaded are simply not the same ones in all Talos runs, and
> > some of them happen to load more slowly than expected.
> 
> Can we check these scenarios?

Joel, we don't run ads in Talos runs, right? My understanding is we have pages stripped of such things so as to be predictable.

Flags: needinfo?(jmaher)

Joel Maher ( :jmaher ) (UTC -8)

Comment 11

•

6 years ago

remote-blank is just a blank page load, there is no ads or javascript there.

remote-nytimes could have ads in it- there is debates over should we have ads to be more realistic or not to be more stable.  We have found that ads do surface noise- and sometimes a test is stable and a small change will tickle the ordering or timing and result in a regression or bi-modal distribution because our mozAfterPaint could be before or after the ad is displayed.

Given that we see a regression on remote-blank, I would say that this regression isn't so dependent on the content being loaded.

Flags: needinfo?(jmaher)

Bob Clary [:bc] (inactive)

Comment 12

•

6 years ago

No remote content is (or at least should) be loaded from the test pages. wireshark is a little problematic for me considering the other traffic on my network but using the developer tools to load the urls doesn't show any outside network requests that I can see.

If you have access to the vpn you can load the urls from:

http://10.252.73.230:8100/files/ep1/nytimes/nytimes.com/index.html
http://10.252.73.230:8100/files/s1s2/blank.html

Ionuț Goldan [:igoldan]

Reporter

Comment 13

•

6 years ago

How should we proceed on this matter?

Ionuț Goldan [:igoldan]

Reporter

Updated

•

6 years ago

Status: NEW → RESOLVED

Closed: 6 years ago

Resolution: --- → WONTFIX

Bugzilla

Quick Search

3.77 - 5.8% remote-blank / remote-nytimes (android-4-2-armv7-api16, android-6-0-armv8-api16) regression on push d6120c2bb51e2057df51f4d52510bb5f4e8b4ca5 (Fri Jun 29 2018)

Categories

(Firefox Build System :: General, defect)

Tracking

(Not tracked)

People

(Reporter: igoldan, Unassigned)

References

Details

(Keywords: perf, regression)

Crash Data

Security

(public)

User Story

Description

Updated

Updated

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Comment 13

Updated