Restore daily Chromium updates for Windows
Categories
(Testing :: Raptor, task, P1)
Tracking
(firefox75 fixed)
Tracking | Status | |
---|---|---|
firefox75 | --- | fixed |
People
(Reporter: igoldan, Assigned: onegru)
References
Details
Attachments
(2 files)
Description copy/pasted from Jira:
Investigate why the Chromium updates affected Raptor & fix the issue.
These are the first 2 patches which showed intermittens. Use the associated (unstable) Chromium revisions to work on the fix.
What happened:
Since May 23 we pinned hardcoded Chromium revisions for Windows & OSX platforms.
It’s because on May 19 the new daily Chromium revisions caused lots & lots of Raptor intermittents on these 2 platforms. Very likely a breaking change landed on this browser, which affected the Raptor test runner.
We didn’t touch the Linux platforms, as the Chromium revisions there are stable.
Reporter | ||
Comment 1•5 years ago
|
||
Reporter | ||
Comment 2•5 years ago
•
|
||
I managed to reproduce the timeout twice on Windows 10, using Chromium 661171 (one of the 1st unstable revisions). You can see from the devtools logs that from a point on we're not getting the fcp metrics. This is what's causing the timeouts.
I know we disabled metrics like these in the past. Should we consider disabling fcp also on Windows & removed the pinned Chromium versions?
Reporter | ||
Comment 3•5 years ago
•
|
||
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #2)
I know we disabled metrics like these in the past. Should we consider disabling fcp also on Windows & removed the pinned Chromium versions?
I say Windows only, because I noticed that Chromium for OSX restabilized. I made a try push in which I removed the pinned versions.
Reporter | ||
Comment 4•5 years ago
•
|
||
Hmmm... Now that I think about it, we don't have a way of specifying metrics per platform.
Reporter | ||
Comment 5•5 years ago
•
|
||
Anyway, I pushed to Try the removal of the fcp from Rap-Cr tasks, to consolidate my theory.
Reporter | ||
Comment 6•5 years ago
•
|
||
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #5)
Anyway, I pushed to Try the removal of the fcp from Rap-Cr tasks, to consolidate my theory.
I see that all pageload tests failed because of this, while all benchmarks ran successfully. Have I missed something?
Still, the successfulness of the benchmarks (which previously failed) confirms there's something wrong when recording the fcp.
Reporter | ||
Comment 7•5 years ago
•
|
||
Reproducing & debugging this locally didn't bring something new, expect that at some point window.performance.getEntriesByType("paint")
returns an empty array.
Once this happens, the next 10 reattempts become useless. They produce the same output and then the test officially times out & notifies the control server.
Reporter | ||
Comment 8•5 years ago
|
||
Given this, I'm not sure how to proceed further. Feels to me like one of the official APIs got broken while under active development; probably something that will be addressed in the following weeks at most.
Dave, thoughts?
Reporter | ||
Comment 9•5 years ago
|
||
Digging through the Chromium's commit history, I found that the unstable Windows revision consisted of this patch.
Reporter | ||
Comment 10•5 years ago
|
||
Digging more, I think these are all the patches which landed after the last stable revision.
Reporter | ||
Comment 11•5 years ago
|
||
Made another Try push, where I removed the pinned Chromium revisions. If OSX platform stabilized, I'll remove the pin from there.
Comment 12•5 years ago
|
||
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #6)
I see that all pageload tests failed because of this, while all benchmarks ran successfully. Have I missed something?
Still, the successfulness of the benchmarks (which previously failed) confirms there's something wrong when recording the fcp.
Why would the benchmarks fail when they're not measuring fcp?
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #7)
Once this happens, the next 10 reattempts become useless. They produce the same output and then the test officially times out & notifies the control server.
Is this just warm page load or does it also affect cold page load?
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #8)
Given this, I'm not sure how to proceed further. Feels to me like one of the official APIs got broken while under active development; probably something that will be addressed in the following weeks at most.
I would suggest reaching out to the Chromium developers to see if this is something they're aware of. I wonder if it's also affecting their page load tests, or if this is even something that may ultimately affect their users. Let's test new builds on a weekly basis to see if this gets fixed in the meantime.
Reporter | ||
Comment 13•5 years ago
•
|
||
(In reply to Dave Hunt [:davehunt] [he/him] ⌚️UTC from comment #12)
(In reply to Ionuț Goldan [:igoldan], Performance Sheriffing from comment #6)
I see that all pageload tests failed because of this, while all benchmarks ran successfully. Have I missed something?
Still, the successfulness of the benchmarks (which previously failed) confirms there's something wrong when recording the fcp.Why would the benchmarks fail when they're not measuring fcp?
That's what I was wondering also. I filed bug 1555654 to look into this. Seems like my patch was wrong. I should have removed fcp from alert_on =
also.
Reporter | ||
Comment 14•5 years ago
|
||
(In reply to Dave Hunt [:davehunt] [he/him] ⌚️UTC from comment #12)
Is this just warm page load or does it also affect cold page load?
I only tested warm pageload. I could attempt to reproduce on cold pageloads.
Updated•5 years ago
|
Reporter | ||
Comment 15•5 years ago
|
||
I filed issue 969614, so the Chromium team will take a look over this problem
Reporter | ||
Comment 16•5 years ago
|
||
Did a Try push yesterday and noticed that Raptor Chromium remained stable on OSX. Thus, I filed bug 1556695 to remove the pinned version.
Comment 17•5 years ago
|
||
:igoldan any updates here?
Updated•5 years ago
|
Updated•5 years ago
|
Comment 20•5 years ago
|
||
According to this try the failures seems to be intermittent: https://treeherder.mozilla.org/#/jobs?repo=try&revision=2a15f0f8bbfaa5583ee16d8052824d0a21ffee19
Pushed a try with screenshots enabled for a deeper look: https://treeherder.mozilla.org/#/jobs?repo=try&revision=3273e0378d2846b5ab554af0570d13cf5a8931b6
Comment 21•5 years ago
|
||
chromium responded to the issue:
I've just skimmed through performance.cc and haven't found any suspicious change happening around the date. It may also relate to extension.
Comment 22•5 years ago
|
||
(In reply to Alexandru Ionescu :alexandrui from comment #21)
chromium responded to the issue:
I've just skimmed through performance.cc and haven't found any suspicious change happening around the date. It may also relate to extension.
I had seen that response. I don't have anything to add here, did you mean to needinfo me? I understand we've been able to replicate the issue locally on Windows? Has this helped with identifying the cause?
Comment 23•5 years ago
|
||
You were asking above the state of this and I was just letting you know. Today I'm going to replicate it locally on the machine we received, hope there will be some progress in one way or another.
Updated•5 years ago
|
Reporter | ||
Comment 24•5 years ago
•
|
||
Did a new Try push to check the Windows platforms.
Looks like Raptor Chromium visibly got more stable on Windows. We have less intermittents than previously.
But I don't think we're got enough to disable the pinned versions.
Reporter | ||
Updated•5 years ago
|
Comment 25•5 years ago
|
||
(In reply to Ionuț Goldan [:igoldan], Performance Sheriff from comment #24)
Did a new Try push to check the Windows platforms.
Looks like Raptor Chromium visibly got more stable on Windows. We have less intermittents than previously.
But I don't think we're got enough to disable the pinned versions.
Do the failures appear similar to what we were seeing before? Can you retrigger the failing jobs to see how frequent the failures are? Perhaps also see if the cold page load jobs are intermittent, as these don't appear to have failed in your push.
Reporter | ||
Comment 26•5 years ago
|
||
(In reply to Dave Hunt [:davehunt] [he/him] ⌚️UTC from comment #25)
(In reply to Ionuț Goldan [:igoldan], Performance Sheriff from comment #24)
Did a new Try push to check the Windows platforms.
Looks like Raptor Chromium visibly got more stable on Windows. We have less intermittents than previously.
But I don't think we're got enough to disable the pinned versions.Do the failures appear similar to what we were seeing before?
Yes, at least for the warm page loads.
Can you retrigger the failing jobs to see how frequent the failures are?
Just did that.
Perhaps also see if the cold page load jobs are intermittent, as these don't appear to have failed in your push.
Just did that.
Reporter | ||
Comment 27•5 years ago
|
||
Following up: for each of the failed jobs I made 10 retriggers. Good thing is none of the retriggers failed. So the Chromium is even more stable :)
Also did lots of retriggers for the cold page loads & none failed.
Comment 28•5 years ago
|
||
It sounds like maybe we're stable enough to re-enable the Chromium updates for Windows. Could you take care of this?
Reporter | ||
Updated•5 years ago
|
Updated•5 years ago
|
Reporter | ||
Comment 29•5 years ago
|
||
A month has passed. Before unpinning, I should recheck if things are still stable on Windows.
Did another Try push.
Reporter | ||
Comment 30•5 years ago
|
||
I'm afraid Raptor's Chromium jobs on Windows are once more highly unstable.
Updated•5 years ago
|
Comment 31•5 years ago
|
||
We now have latest Chrome release running on Windows, and Chromium will be running less frequently than before.
Reporter | ||
Comment 32•5 years ago
|
||
Should I reattempt to enable latest Chromium revisions?
Comment 33•5 years ago
|
||
(In reply to Ionuț Goldan [:igoldan] from comment #32)
Should I reattempt to enable latest Chromium revisions?
Yes, let's give it a go. Also, I suspect we don't report the Chromium version in the Perfherder data. If not, could you file a bug for that?
Reporter | ||
Comment 35•5 years ago
|
||
Not currently, but this is indeed for 2020/Q1.
Reporter | ||
Updated•5 years ago
|
Updated•5 years ago
|
Assignee | ||
Updated•5 years ago
|
Assignee | ||
Updated•5 years ago
|
Assignee | ||
Updated•5 years ago
|
Assignee | ||
Comment 36•5 years ago
|
||
Comment 37•5 years ago
|
||
Comment 38•5 years ago
|
||
bugherder |
Description
•