Closed Bug 1740225 Opened 3 years ago Closed 2 years ago

Windows debug Wdspec headless 1 tasks have too large logs above the threshold

Categories

(Testing :: geckodriver, defect)

Firefox 96
defect

Tracking

(firefox-esr91 unaffected, firefox94 unaffected, firefox95 unaffected, firefox96 fixed)

RESOLVED FIXED
96 Branch
Tracking Status
firefox-esr91 --- unaffected
firefox94 --- unaffected
firefox95 --- unaffected
firefox96 --- fixed

People

(Reporter: aryx, Assigned: whimboo)

Details

Attachments

(1 file, 1 obsolete file)

Many of the following tasks on autoland and mozilla-central

  • test-windows10-64-2004-qr/debug-web-platform-tests-wdspec-headless-fis-e10s-1
  • test-windows10-64-2004-qr/debug-web-platform-tests-wdspec-headless-e10s-1
    have logs too big to be parsed (threshold is 5MB in gzipped state if I remember correct).

The first affected is this one. There is a big JSON blob at task 2021-11-08T15:24:50.896Z in the log which is not very compressible.

Is this a change from bug 1739536?

Flags: needinfo?(hskupin)

(In reply to Sebastian Hengst [:aryx] (needinfo on intermittent or backout) from comment #0)

The first affected is this one. There is a big JSON blob at task 2021-11-08T15:24:50.896Z in the log which is not very compressible.

Is this a change from bug 1739536?

No it's not because that landed yesterday and should have no affect here because it's Android only.

But as you referred to the given revision on Treeherder the problematic regression here is the landing of bug 1713920 which changes some color handling. Since then I can see the following output in the log files:

[task 2021-11-08T15:28:16.787Z] 15:28:16 INFO - PID 6800 | [Parent 7784, Main Thread] WARNING: Unknown color for nsLookAndFeel: file /builds/worker/checkouts/gecko/widget/windows/nsLookAndFeel.cpp:327

And that line is listed a lot!! Emilio can you please have a look?

Flags: needinfo?(hskupin) → needinfo?(emilio)
Keywords: regression
Regressed by: 1713920
Summary: Windows debug Wdspec headless 1 tasks often have logs above the threshold to be parsed by Treeherder → Windows debug Wdspec headless 1 tasks have too large logs above the threshold due to WARNING: Unknown color for nsLookAndFeel: file /builds/worker/checkouts/gecko/widget/windows/nsLookAndFeel.cpp:327

Interesting that this affects windows jobs because the patch / bug is titled for GTK. As such I'm going to move this bug to Core : Widget for now.

Component: geckodriver → Widget
Product: Testing → Core

This is a expected situation specially after RemoteLookAndFeel was
enabled (which queries the whole int range).

No point in spamming the console.

Assignee: nobody → emilio
Status: NEW → ASSIGNED

If you think that warning will make a dent happy to remove it. It's an expected condition nowadays so we shouldn't warn to begin with.

Flags: needinfo?(emilio)

But pretty sure this is not a regression from bug 1713920.

No longer regressed by: 1713920

Yes, sorry, I was actually wrong here. The warning also shows up in builds before that patch landed. So it has to be something else. I might have to deeply compare the logs and why they are larger now.

Summary: Windows debug Wdspec headless 1 tasks have too large logs above the threshold due to WARNING: Unknown color for nsLookAndFeel: file /builds/worker/checkouts/gecko/widget/windows/nsLookAndFeel.cpp:327 → Windows debug Wdspec headless 1 tasks have too large logs above the threshold
Assignee: emilio → nobody
Status: ASSIGNED → NEW
No longer depends on: 1740245
Keywords: regression

Comment on attachment 9249959 [details]
Bug 1740225 - Don't warn when nsLookAndFeel doesn't know about a color. r=mstange

Revision D130739 was moved to bug 1740245. Setting attachment 9249959 [details] to obsolete.

Attachment #9249959 - Attachment is obsolete: true

There are even problems for builds from late October like:

https://treeherder.mozilla.org/jobs?repo=mozilla-central&searchStr=wdh%2Cdebug&revision=28da4138ddff87ff0684b274db911cd4b602ac92&selectedTaskRun=Q_5V7gSdTXair3LP4H-o2Q.0

I'm not getting anything shown on the failure summary tab when having any WdH job selected. Is that the same issue or was there a different failure with Treeherder at that time?

Flags: needinfo?(aryx.bugmail)
Component: Widget → geckodriver
Product: Core → Testing

Successful task && nothing in "Failure summary" tab == parsing successful && no failure lines
The affected jobs should read "Log parsing was skipped since the log exceeds the size limit." in the "Failure Summary" tab.

Flags: needinfo?(aryx.bugmail)

Ah ok. So I checked some logs and as it looks like we are always running at the limit for quite a while. As such one thing we might want to do now is to remove the logging of the async shutdown messages means that we should stop setting toolkit.asyncshutdown.log=true.

With Fission and WebRender done the shutdown behavior is much better now and removing that can save us quite a number of lines.

Reduction is about 400kB in gzip format and around 4-5 MB unpacked:

curl -I https://firefoxci.taskcluster-artifacts.net/JUHPDxVOTaWE7cvT94BBDg/0/public/logs/live_backing.log
HTTP/2 200
content-type: text/plain; charset=utf-8
content-length: 4643372
date: Thu, 11 Nov 2021 12:52:23 GMT
last-modified: Thu, 11 Nov 2021 12:46:41 GMT
etag: "b239025d64f3d3dfa0fb5b94f8b58f4e"
content-encoding: gzip

It's still not a huge reduction but maybe that's enough for now?

I've also seen that the patch for bug 1740245 has been landed and that actually helped a lot:

➜ curl -I https://firefoxci.taskcluster-artifacts.net/FRFMs959QuicW0N6O2zscA/0/public/logs/live_backing.log
HTTP/2 200
content-type: text/plain; charset=utf-8
content-length: 4239984
date: Thu, 11 Nov 2021 13:00:50 GMT
last-modified: Thu, 11 Nov 2021 06:40:15 GMT
etag: "453d511070aa56d82c39396f54e49577"
content-encoding: gzip

It's around 700kB and maybe we can leave it with that for now and do not remove the async shutdown logging as in my try build.

Aryx, what do you think?

Flags: needinfo?(aryx.bugmail)

Sorry my failure. I accidentally picked a shippable and not debug build. For a debug build it's not a big difference still:

content-length: 5094227

So we indeed should take the approach and disable async shutdown logs.

Flags: needinfo?(aryx.bugmail)
Assignee: nobody → hskupin
Status: NEW → ASSIGNED

Further tweaks might be needed. So we will keep this bug open for now.

Keywords: leave-open
Pushed by hskupin@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/89df8f25b8df
[wdspec] Don't print async shutdown logs for CI jobs. r=webdriver-reviewers,jdescottes

Aryx, please let me know if that works now. We seem to be down to a gzip size of about 4.5 MB. I hope that's enough. We can keep the bug open if you want and re-check next week. If it's not getting better we might want to also split up into 3 chunks.

Flags: needinfo?(aryx.bugmail)

Last log for windows wdspec which was above the threshold is from Monday.

Flags: needinfo?(aryx.bugmail)

And still no more failures since you have replied. So I'm going ahead and mark this bug as fixed.

Status: ASSIGNED → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 96 Branch
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: