Closed Bug 594145 Opened 14 years ago Closed 14 years ago

Intermittent failures in browser_HUDServiceTestsAll.js | Four children in output - Got 5, expected 4 and html page is logged - Didn't expect -1, but got it and javascript is logged - Didn't expect -1, but got it and log() is logged

Tracking

(blocking2.0 betaN+)

Status:

RESOLVED FIXED

Tracking Flags:

Tracking

Status

blocking2.0

---

betaN+

People

(Reporter: pcwalton, Assigned: msucan)

References

Details

(Keywords: intermittent-failure)

Attachments

(1 file, 3 obsolete files)

Proposed patch. 14 years ago, Patrick Walton (:pcwalton), 2.40 KB, patch. Flags: dietrich: review+, ddahl: feedback+, Gavin: feedback-
Proposed patch, version 2. 14 years ago, Patrick Walton (:pcwalton), 2.05 KB, patch. Flags: Gavin: review+
Proposed patch, version 3. 14 years ago, Patrick Walton (:pcwalton), 1.99 KB, patch
Proposed patch, version 4. 14 years ago, Patrick Walton (:pcwalton), 2.10 KB, patch

Patrick Walton (:pcwalton)

Reporter

Description

•

14 years ago

Occasionally, in the basic network logging test in the Web Console, the loading of the HTML page itself fails to trigger a log message, while the resources do trigger log messages. This is causing random test failures, so I will comment it out for the time being.

Patrick Walton (:pcwalton)

Reporter

Comment 1

•

14 years ago

This looks like the following:

INFO | runtests.py | Running tests: end.
mochitest-browser-chrome failed:
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_webconsole_basic_net_logging.js | Four children in output - Got 2, expected 5
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_webconsole_basic_net_logging.js | html page is logged - Didn't expect -1, but got it
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_webconsole_basic_net_logging.js | Test timed out
make: *** [mochitest-browser-chrome] Error 1

Patrick Walton (:pcwalton)

Reporter

Comment 2

•

14 years ago

Baking a temporary fix for this on try as changeset 028c7eccd0d1.

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Depends on: 581069

Kevin Dangoor

Updated

•

14 years ago

Blocks: devtools4b7

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Summary: Random test failure in Web Console: basic network logging fails to pick up the HTML page → In the basic network logging test of the Web Console, the HTML page often isn't picked up

Whiteboard: randomorange

Patrick Walton (:pcwalton)

Reporter

Comment 3

•

14 years ago

Removing "randomorange" from the whiteboard as this hasn't been seen in mozilla-central, only in the split-up test in bug 581069.

Patrick Walton (:pcwalton)

Reporter

Comment 4

•

14 years ago

Adding randomorange back; this has been seen in the wild.

Assignee: nobody → pwalton

Status: NEW → ASSIGNED

Whiteboard: randomorange

Patrick Walton (:pcwalton)

Reporter

Comment 5

•

14 years ago

The problem here is that our observer picks up the network event, but it finds no console to deliver it to. (To deliver the message, it uses getHudIdByWindow().)

David Dahl :ddahl

Comment 6

•

14 years ago

(In reply to comment #5)
> The problem here is that our observer picks up the network event, but it finds
> no console to deliver it to. (To deliver the message, it uses
> getHudIdByWindow().)

So it is possible that we still have network events that cannot be traced back to the window via the Channel given to us. If this is the case - and the window we try to use here is null, or the console UI is null and the network traffic is not an image, we may be able to discern the console UI properly via the loadGroup. My css error trapping patch in bug 567165 is reintroducing loadGroup storage to the HUDService.

If it is an image that does this we may be in trouble, as all of the image logging code is not perfect. Again, we need platform support going forward to make this kind of discovery rock solid.

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Blocks: 438871

Whiteboard: randomorange → [orange]

Phil Ringnalda (:philor)

Comment 8

•

14 years ago

For [orange] to work, the filename of the failing test has to be in the bug summary (plus enough to tell that the bug is the same as the fails in the log), so if you want bug 596255 to be a duplicate of this, you pretty much need to change the summary to the summary of that.

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Summary: In the basic network logging test of the Web Console, the HTML page often isn't picked up → Intermittent failures in browser_HUDServiceTestsAll.js | Four children in output - Got 5, expected 4 and html page is logged - Didn't expect -1, but got it and javascript is logged - Didn't expect -1, but got it and log() is logged

Patrick Walton (:pcwalton)

Reporter

Comment 9

•

14 years ago

(In reply to comment #8)
> For [orange] to work, the filename of the failing test has to be in the bug
> summary (plus enough to tell that the bug is the same as the fails in the log),
> so if you want bug 596255 to be a duplicate of this, you pretty much need to
> change the summary to the summary of that.

Done, thanks.

Patrick Walton (:pcwalton)

Reporter

Comment 10

•

14 years ago

Attached patch: Proposed patch. (obsolete)

So the problem here is that the HUD Service's window registry is only updated when its observer receives the "content-document-global-created" event. That event may or may not be sent after the network observer detects the HTML page. The fix is to make the network observer's helper method, getHudIdByWindow(), query the DOM instead of the window registry to find the Web Console.

The proposed patch implements this fix. Feedback requested.
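
The ordering problem described above can be modelled in a few lines of plain JavaScript. This is only a rough simulation and not the HUDService code: `windowRegistry`, `consolesInDom`, `lookupViaRegistry` and `lookupViaDom` are hypothetical names, and the DOM query is stood in for by a second map that is already populated when the console UI exists.

```javascript
// Simplified model of the race; none of this is the real HUDService code.
// Every name below is hypothetical and exists only for illustration.
const windowRegistry = new Map(); // window -> hudId, filled on "content-document-global-created"
const consolesInDom = new Map();  // window -> hudId, stands in for the console node in the DOM

function lookupViaRegistry(win) {
  return windowRegistry.has(win) ? windowRegistry.get(win) : null;
}

function lookupViaDom(win) {
  return consolesInDom.has(win) ? consolesInDom.get(win) : null;
}

const contentWindow = { url: "http://example.com/page.html" };

// The console UI already exists for this tab.
consolesInDom.set(contentWindow, "hud_1");

// 1. The network observer sees the request for the HTML page first.
const registryResult = lookupViaRegistry(contentWindow); // registry not updated yet
const domResult = lookupViaDom(contentWindow);           // console is found anyway

// 2. Only afterwards does the registry get updated.
windowRegistry.set(contentWindow, "hud_1");
const lateRegistryResult = lookupViaRegistry(contentWindow);

console.log(registryResult, domResult, lateRegistryResult); // null hud_1 hud_1
```

In this model the registry lookup misses the first request while the DOM-style lookup does not, which is the behaviour the patch is aiming for.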

Attachment #475373 - Flags: feedback?(ddahl)

Patrick Walton (:pcwalton)

Reporter

Comment 11

•

14 years ago

Follow-up note: the wiki [1] says the following about "content-document-global-created":

> Sent immediately after a web content document window has been set up, but before any script code has been executed. This lets extensions inject API into content windows as needed. The data is a string indicating the URL of the page that will be loaded in the document window.

So it seems reasonable that this event could be fired after the HTML page is loaded. The only guarantee seems to be that it's fired before any scripts are executed, but that may be too late to catch the loading of content.

Patrick Walton (:pcwalton)

Reporter

Comment 12

•

14 years ago

Forgot to mention: Once the global console object lands, this patch may obviate the need for the window registry entirely. The only reason I kept it around is that it still needs to exist to clean up after console objects attached to iframes. If the global console component can handle this case without having to defer to the HUD Service, then the window registry can be removed.

David Dahl :ddahl

Comment 13

•

14 years ago

(In reply to comment #12)
> Forgot to mention: Once the global console object lands, this patch may obviate
> the need for the window registry entirely.

That is the plan I think, except we will just use the outer Window's ID instead of the window's URI or weakref - and the consoles will know exactly what window they are part of via the window's ID.
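
A small sketch of why keying on a window ID is less ambiguous than keying on a URI. It assumes, as the comment above suggests, that the current registry can be keyed by URI; the objects, IDs and hudId strings are made up, and `outerWindowId` is just a plain property here rather than a platform API.

```javascript
// Hypothetical model: windows are plain objects with a numeric ID and a URI.
const tabA = { outerWindowId: 11, uri: "http://example.com/" };
const tabB = { outerWindowId: 12, uri: "http://example.com/" }; // same page in a second tab

// Keyed by URI: the second registration overwrites the first.
const byUri = new Map();
byUri.set(tabA.uri, "hud_A");
byUri.set(tabB.uri, "hud_B");

// Keyed by window ID: each window keeps its own console.
const byWindowId = new Map();
byWindowId.set(tabA.outerWindowId, "hud_A");
byWindowId.set(tabB.outerWindowId, "hud_B");

const uriLookup = byUri.get(tabA.uri);                // "hud_B", the wrong console for tabA
const idLookup = byWindowId.get(tabA.outerWindowId);  // "hud_A"
console.log(uriLookup, idLookup);
```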

David Dahl :ddahl

Comment 14

•

14 years ago

Comment on attachment 475373 [details] [diff] [review]
Proposed patch.

I do not mind if functions return string or object, but some reviewers may not dig it. Looks good.

Attachment #475373 - Flags: feedback?(ddahl) → feedback+

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Attachment #475373 - Flags: review?(dietrich)

David Dahl :ddahl

Comment 15

•

14 years ago

(In reply to comment #11)
> So it seems reasonable that this event could be fired after the HTML page is
> loaded. The only guarantee seems to be that it's fired before any scripts are
> executed, but that may be too late to catch the loading of content.

Except that the observer that is started is aware of all HTTP connections, as it is started by the HUDService before any consoles are active, and is a global singleton.

When we cannot find the correct Web Console to log the message to, we may have to hold those "orphaned messages" in a cache to be processed later (maybe onload or ondocumentready).
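
One possible shape for such a cache, as a sketch only: messages that arrive before a console is known are held per window and flushed in order once the console registers. `OrphanedMessageCache`, its methods and the string window keys are all hypothetical and do not correspond to anything in HUDService.jsm.

```javascript
// Hypothetical sketch of an "orphaned message" cache; not the HUDService API.
class OrphanedMessageCache {
  constructor() {
    this.pending = new Map();  // window key -> messages waiting for a console
    this.consoles = new Map(); // window key -> array acting as that console's output
  }

  // Deliver immediately if a console is known, otherwise hold the message.
  log(windowKey, message) {
    const output = this.consoles.get(windowKey);
    if (output) {
      output.push(message);
      return;
    }
    if (!this.pending.has(windowKey)) {
      this.pending.set(windowKey, []);
    }
    this.pending.get(windowKey).push(message);
  }

  // Call once a console exists (for example on load); flushes held messages in order.
  registerConsole(windowKey) {
    const output = [];
    this.consoles.set(windowKey, output);
    const held = this.pending.get(windowKey) || [];
    this.pending.delete(windowKey);
    output.push(...held);
    return output;
  }
}

const cache = new OrphanedMessageCache();
cache.log("win-1", "GET http://example.com/page.html"); // no console yet, so it is held
const output = cache.registerConsole("win-1");           // flushes the held message
cache.log("win-1", "GET http://example.com/script.js");  // delivered directly
console.log(output.length); // 2
```

A real implementation would also need to decide when to drop held messages for windows that never get a console; this sketch leaves that out.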

Comment hidden (Legacy TBPL/Treeherder Robot)

philringnalda%gmail.com
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1284529794.1284530722.12779.gz
Rev3 Fedora 12x64 mozilla-central opt test mochitest-other on 2010/09/14 22:49:54

s: talos-r3-fed64-011
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_HUDServiceTestsAll.js | Four children in output - Got 5, expected 4
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_HUDServiceTestsAll.js | html page is logged - Didn't expect -1, but got it
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_HUDServiceTestsAll.js | javascript is logged - Didn't expect -1, but got it
TEST-UNEXPECTED-FAIL | chrome://mochikit/content/browser/toolkit/components/console/hudservice/tests/browser/browser_HUDServiceTestsAll.js | log() is logged

Dietrich Ayala (:dietrich)

Comment 17

•

14 years ago

Comment on attachment 475373 [details] [diff] [review]
Proposed patch.

r=me. though, is it valid for getHudIdByWindow to be called w/ a non-content window? should we throw in those cases?

Attachment #475373 - Flags: review?(dietrich) → review+

Serge Gautherie (:sgautherie)

Updated

•

14 years ago

Version: unspecified → Trunk

Patrick Walton (:pcwalton)

Reporter

Comment 18

•

14 years ago

(In reply to comment #17)
> Comment on attachment 475373 [details] [diff] [review]
> Proposed patch.
> 
> r=me. though, is it valid for getHudIdByWindow to be called w/ a non-content
> window? should we throw in those cases?

Sometimes, chrome performs network operations and triggers the network observer (examples are the favicon and the view source window). In those cases, getHudIdByWindow will return null to tell our observer to bail out.
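
Roughly, the bail-out looks like the following. This is a stand-alone model, not the actual observer: `findHudId` and `observeRequest` are hypothetical stand-ins, and windows are plain objects.

```javascript
// Hypothetical stand-ins; the real lookup lives in HUDService.jsm and differs.
const hudIdsByWindow = new Map();

function findHudId(win) {
  // null means "no Web Console is associated with this window".
  return hudIdsByWindow.has(win) ? hudIdsByWindow.get(win) : null;
}

const logged = [];

function observeRequest(win, url) {
  const hudId = findHudId(win);
  if (hudId === null) {
    return false; // chrome-initiated request (favicon, view source): nothing to log
  }
  logged.push({ hudId, url });
  return true;
}

const contentWin = { name: "content" };
const viewSourceWin = { name: "view-source" };
hudIdsByWindow.set(contentWin, "hud_1");

const handledContent = observeRequest(contentWin, "http://example.com/page.html");
const handledChrome = observeRequest(viewSourceWin, "http://example.com/favicon.ico");
console.log(logged.length); // 1
```

Returning null instead of throwing keeps chrome-initiated traffic from turning into errors in the observer, at the cost of silently ignoring requests that genuinely have no console.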

Patrick Walton (:pcwalton)

Reporter

Updated

•

14 years ago

Attachment #475373 - Flags: approval2.0?

Patrick Walton (:pcwalton)

Reporter

Comment 19

•

14 years ago

Nominating for blocking status, on the theory that we need our trees green to ship Firefox 4. Additionally, this affects the reliability of the Web Console by making it common for network requests to be dropped when the page is reloaded.

blocking2.0: --- → ?

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 20

•

14 years ago

Comment on attachment 475373 [details] [diff] [review]
Proposed patch.

>diff --git a/toolkit/components/console/hudservice/HUDService.jsm b/toolkit/components/console/hudservice/HUDService.jsm

>   getHudIdByWindow: function HS_getHudIdByWindow(aContentWindow)
>   {

>+    let webNavigation = aContentWindow.QueryInterface(Ci.nsIInterfaceRequestor).
>+                        getInterface(Ci.nsIWebNavigation);
>+    let docShellTreeItem = webNavigation.QueryInterface(Ci.nsIDocShellTreeItem);
>+    let rootTreeItem = docShellTreeItem.rootTreeItem;
>+    if (!rootTreeItem) {

Doesn't look to me like this can actually happen... Seems like you actually want to check for (docShellTreeItem == rootTreeItem)...

... but I've actually been encouraging the use of chromeEventHandler for this kind of stuff. You could do:

let chromeEventHandler = aContentWindow.QueryInterface(Ci.nsIInterfaceRequestor)
                                       .getInterface(Ci.nsIWebNavigation)
                                       .QueryInterface(Ci.nsIDocShell)
                                       .chromeEventHandler;
if (!chromeEventHandler) {
  // chrome window
}

let xulWindow = chromeEventHandler.ownerDocument.defaultView;
// check window type

>+    let browser = gBrowser.getBrowserForDocument(contentDocument);

This can then just be:

let browser = chromeEventHandler;

Attachment #475373 - Flags: approval2.0? → feedback-

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 21

•

14 years ago

I think that the suggested code in comment 20 may also make the unwrapping unnecessary, but I'm not sure.

:Gavin Sharp [email: gavin@gavinsharp.com]

Updated

•

14 years ago

blocking2.0: ? → betaN+

Patrick Walton (:pcwalton)

Reporter

Comment 22

•

14 years ago

Attached patch: Proposed patch, version 2. (obsolete)

New patch addresses Gavin's comments. The incantation to acquire the notification box is much shorter now.

Attachment #475731 - Flags: review?(gavin.sharp)

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 23

•

14 years ago

Comment on attachment 475731 [details] [diff] [review]
Proposed patch, version 2.

Actually, I just realized we can make use of the chromeWin.getNotificationBox() pseudo-API that's used for other notifications:

let xulWindow = chromeEventHandler.ownerDocument.defaultView;
if (!xulWindow.getNotificationBox)
  return null; // window doesn't implement the getNotificationBox pseudo-API

let notificationBox = xulWindow.getNotificationBox(aContentWindow.top);

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 24

•

14 years ago

Comment on attachment 475731 [details] [diff] [review]
Proposed patch, version 2.

r=me if you use the suggestion from comment 23, and use node.id rather than node.getAttribute("id").

Attachment #475731 - Flags: review?(gavin.sharp) → review+

Kevin Dangoor

Updated

•

14 years ago

Blocks: devtools4b8
No longer blocks: devtools4b7

Patrick Walton (:pcwalton)

Reporter

Comment 25

•

14 years ago

Attached patch: Proposed patch, version 3. (obsolete)

New version addresses reviewer comments.

Attachment #475373 - Attachment is obsolete: true

Attachment #475731 - Attachment is obsolete: true

:Gavin Sharp [email: gavin@gavinsharp.com]

Comment 26

•

14 years ago

Comment on attachment 480778 [details] [diff] [review]
Proposed patch, version 3.

>diff --git a/toolkit/components/console/hudservice/HUDService.jsm b/toolkit/components/console/hudservice/HUDService.jsm

>+    if (!xulWindow.getNotificationBox) {
>+      // The window isn't a content window.
>+      return null;

My comment in comment 23 is more accurate - this check doesn't tell you that aContentWindow isn't a content window, it just tells you that its chrome parent doesn't support this particular API (e.g., it's a view-source window).

Patrick Walton (:pcwalton)

Reporter

Comment 27

•

14 years ago

This patch doesn't seem to fix the issue. Sigh.

Patrick Walton (:pcwalton)

Reporter

Comment 28

•

14 years ago

Attached patch: Proposed patch, version 4.

New patch uses wrappedJSObject and fixes the comment as suggested by gavin.

Attachment #480778 - Attachment is obsolete: true

David Dahl :ddahl

Comment 29

•

14 years ago

This test was tweaked in the split tests bug landing (bug 581069) - I disabled 2 checks. I did try to simplify the DOM query to just get ".hud-msg-node" nodes, but that did not help too much.

Kevin Dangoor

Comment 30

•

14 years ago

Reassigning to Mihai to see if this is still relevant.

Assignee: pwalton → mihai.sucan

Mihai Sucan [:msucan]

Assignee

Comment 31

•

14 years ago

I looked through the old mochitest code, and the new one, and also read the proposed patches, and the entire discussion in this bug. Thoughts:

What Patrick found is interesting, and seems to be reasonable and correct. The httpObserver does observe network activity *before* any Web Consoles are open.

I believe the intermittent failures were caused by a simpler issue: the code expected that *exactly* four network requests would occur. However, that was not always true, because the browser also sometimes performed a request to load the favicon.

The real issue was with the test itself: it should only check for the network requests it knows about. That's what the test does today, and it runs fine without intermittent failures.
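
To illustrate the difference between the two styles of check, here is a sketch in plain JavaScript. It is not the mochitest code: `findMissingRequests`, the URLs and the log entry strings are all made up for the example.

```javascript
// Hypothetical helper; the entry strings and URLs are invented for illustration.
function findMissingRequests(loggedEntries, expectedUrls) {
  // Return the expected URLs that no logged entry mentions.
  return expectedUrls.filter(
    (url) => !loggedEntries.some((entry) => entry.includes(url))
  );
}

const expectedUrls = ["page.html", "script.js", "image.png", "style.css"];

const loggedEntries = [
  "GET http://example.com/page.html",
  "GET http://example.com/favicon.ico", // extra request the test did not plan for
  "GET http://example.com/script.js",
  "GET http://example.com/image.png",
  "GET http://example.com/style.css",
];

// Brittle check: fails as soon as the browser adds a request of its own.
const exactCountMatches = loggedEntries.length === expectedUrls.length;

// Robust check: only asks whether the requests the test knows about were logged.
const missing = findMissingRequests(loggedEntries, expectedUrls);

console.log(exactCountMatches, missing.length); // false 0
```

With the favicon request present, the exact-count check fails even though every request the test cares about was logged, which matches the "Got 5, expected 4" failure in the summary.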

The latest patch here is fine, except it's no longer relevant for that specific mochitest code.

The only relevancy for this patch is ... if we want to remove the windowRegistry. Do we want that? It helps with mapping windows to hudIds faster than without.

Patrick Walton (:pcwalton)

Reporter

Comment 32

•

14 years ago

(In reply to comment #31)
> The only relevancy for this patch is ... if we want to remove the
> windowRegistry. Do we want that? It helps with mapping windows to hudIds faster
> than without.

Having the window registry is fine. I suppose this bug is fixed then?

Mihai Sucan [:msucan]

Assignee

Comment 33

•

14 years ago

Yes.

Kevin Dangoor

Updated

•

14 years ago

Status: ASSIGNED → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Nobody; OK to take it and work on it

Updated

•

12 years ago

Keywords: intermittent-failure

Nobody; OK to take it and work on it

Updated

•

12 years ago

Whiteboard: [orange]

BMO Automation

Updated

•

6 years ago

Product: Firefox → DevTools
