634564 - intermittent /tests/content/media/test/test_closing_connections.html | Test timed out.

Reporter

Description

•

13 years ago

http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1297854616.1297855700.26199.gz

Rev3 MacOSX Snow Leopard 10.6.2 mozilla-central opt test mochitests-1/5 on 2011/02/16 03:10:16

56987 INFO TEST-START | /tests/content/media/test/test_closing_connections.html
56988 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.
56989 INFO TEST-END | /tests/content/media/test/test_closing_connections.html | finished in 325848ms

Comment hidden (Legacy TBPL/Treeherder Robot)

mak77%bonardo.net
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1297864788.1297865940.10259.gz
Rev3 WINNT 6.1 mozilla-central opt test mochitests-1/5 on 2011/02/16 05:59:48

s: talos-r3-w7-011
56993 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Comment hidden (Legacy TBPL/Treeherder Robot)

mounir.lamouri%gmail.com
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1297865829.1297866901.14106.gz
Rev3 MacOSX Snow Leopard 10.6.2 mozilla-central opt test mochitests-1/5 on 2011/02/16 06:17:09

s: talos-r3-snow-028
56976 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Chris Pearce [:cpearce (Not reading bugmail)]

Comment 3

•

13 years ago

I wonder if this was caused by bug 633051 landing? We didn't have this failure before that landed (at 2011-02-15 12:00:48 PST), and that bug's patch messed with our load group code, which is what this test is testing.

Blocks: 633051

Comment hidden (Legacy TBPL/Treeherder Robot)

dao%mozilla.com
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1297889144.1297890410.10121.gz
Rev3 MacOSX Leopard 10.5.8 mozilla-central opt test mochitests-1/5 on 2011/02/16 12:45:44

s: talos-r3-leopard-048
56892 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Comment hidden (Legacy TBPL/Treeherder Robot)

philringnalda%gmail.com
http://tinderbox.mozilla.org/showlog.cgi?log=Firefox/1297911193.1297912254.29877.gz
Rev3 MacOSX Snow Leopard 10.6.2 mozilla-central opt test mochitests-1/5 on 2011/02/16 18:53:13

s: talos-r3-snow-054
56979 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Matthew Gregan [:kinetik]

Assignee

Comment 6

•

13 years ago

I can't reproduce this locally, and there haven't been any reports today, so I'm going to assume this was fixed by the backout of bug 631058.  Please reopen this bug if it recurs.

Status: NEW → RESOLVED

Closed: 13 years ago

Resolution: --- → WORKSFORME

u279076

Updated

•

13 years ago

Status: RESOLVED → VERIFIED

Comment hidden (Legacy TBPL/Treeherder Robot)

Olli.Pettay%gmail.com
https://tbpl.mozilla.org/php/getParsedLog.php?id=7686148&tree=Try
Rev3 Fedora 12 try opt test mochitests-1/5 on 2011-12-01 10:07:10

74897 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Comment hidden (Legacy TBPL/Treeherder Robot)

Olli.Pettay%gmail.com
https://tbpl.mozilla.org/php/getParsedLog.php?id=8376029&tree=Try
Rev3 MacOSX Leopard 10.5.8 try debug test mochitests-1/5 on 2012-01-06 07:01:58

74449 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Matthew Gregan [:kinetik]

Assignee

Comment 9

•

12 years ago

This still exists, it just seems to be very rare on m-c.  It started happening on the incremental GC branch very frequently, which I'm assuming is caused by the timing changing just enough to reveal the bug regularly.

Status: VERIFIED → REOPENED

Resolution: WORKSFORME → ---

Matthew Gregan [:kinetik]

Assignee

Updated

•

12 years ago

Assignee: nobody → kinetik

Status: REOPENED → ASSIGNED

Bill McCloskey [inactive unless it's an emergency] (:billm)

Updated

•

12 years ago

Status: ASSIGNED → UNCONFIRMED

Ever confirmed: false

Bill McCloskey [inactive unless it's an emergency] (:billm)

Updated

•

12 years ago

Status: UNCONFIRMED → ASSIGNED

Ever confirmed: true

Matthew Gregan [:kinetik]

Assignee

Comment 10

•

12 years ago

The change in bug 633051 is implicated here.  I'm still tracking down all of the details, but if you look at https://tbpl.mozilla.org/php/getParsedLog.php?id=9372854&tree=Try&full=1#error0

...you see seek.webm used during test_bug686942 (which completes successfully), and then is added to the loadgroup during test_closing_connections because the media cache seeks the stream back near the start (for reasons that need to be investigated).  With the change from bug 633051, this causes recreated streams to be created in the foreground if they hit that code, which is why the seek ends up blocking the document load for test_closing_connections.

Matthew Gregan [:kinetik]

Assignee

Comment 11

•

12 years ago

Attached patch patch v0 — Details — Splinter Review

Don't clear mLoadInBackground when a load completes--it doesn't make sense that future loads should happen in the foreground. To simplify the code, I've removed the mLoadInBackground test completely and rely directly on the presence of LOAD_BACKGROUND when moving the completed load back into the foreground. Patch also includes two comment fixes, and adds a warning to nsHTMLMediaElement::GetDocumentLoadGroup. It's possible that it makes sense to return nsnull from GetDocumentLoadGroup if the document is inactive, but I'm not sure--at least the warning will make it easier to spot bugs like this in the future.

The sequence of events that was causing this test failure (as observed on larch with billm's incremental GC patches) was:

0. tests use a very small media cache by default (automation.py sets it to 100kB).

1. test_bug686942 runs, including a set of tests against seek.webm. The initial load of seek.webm passes through the code altered in this patch, setting mLoadInBackground to true. seek.webm's element happens to stay alive long enough for the following events to occur.

2. test_bug686942 completes and NotifyOwnerDocumentActivityChanged suspends the decoder/stream.

3. Other media cache activity causes some of seek.webm's blocks to be evicted. The stream moves from "ended" to "throttling" state, waiting for cache space to allow the blocks to be replaced from the network.

4. test_closing_connections starts loading 20 copies of seek.ogv, then waits for page load event to fire. This test uses use_large_cache.js.

5. seek.webm's element from test_bug686942 is still alive (in an inactive document). The cache notices that there's now enough space to fetch more data, so it creates a new channel, and since mLoadInBackground is false, the LOAD_BACKGROUND flag is not set. The new request is added to the element's load group, which is the same load group used by test_closing_connections because both tests were loaded in the same docshell.

6. seek.webm's foreground load is now blocking the document load for test_closing_connections.

7. seek.webm's OnStartRequest fires, and since the suspend count is 1 (from the suspend in step 2), the channel is immediately suspended.

Since seek.webm will never be unsuspended (e.g. by another call to NotifyOwnerDocumentActivityChanged as a result of being added to a docshell), the load never completes, and the document load for test_closing_connections is never unblocked, restuling in a test timeout.

Attachment #600251 - Flags: review?(cpearce)

Chris Pearce [:cpearce (Not reading bugmail)]

Updated

•

12 years ago

Attachment #600251 - Flags: review?(cpearce) → review+

Matthew Gregan [:kinetik]

Assignee

Comment 12

•

12 years ago

http://hg.mozilla.org/integration/mozilla-inbound/rev/f75b898fcef7

Status: ASSIGNED → RESOLVED

Closed: 13 years ago → 12 years ago

Resolution: --- → FIXED

Matthew Gregan [:kinetik]

Assignee

Comment 13

•

12 years ago

And re-enable the test:
http://hg.mozilla.org/integration/mozilla-inbound/rev/7af360ee95fe

Matthew Gregan [:kinetik]

Assignee

Comment 14

•

12 years ago

Oops, bug shouldn't be closed until the fix merges to m-c.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Target Milestone: --- → mozilla13

Marco Bonardo [:mak]

Reporter

Comment 15

•

12 years ago

https://hg.mozilla.org/mozilla-central/rev/f75b898fcef7
https://hg.mozilla.org/mozilla-central/rev/7af360ee95fe

Status: REOPENED → RESOLVED

Closed: 12 years ago → 12 years ago

Resolution: --- → FIXED

Comment hidden (Legacy TBPL/Treeherder Robot)

philor
https://tbpl.mozilla.org/php/getParsedLog.php?id=15284675&tree=Mozilla-Inbound
Android Tegra 250 mozilla-inbound opt test mochitest-2 on 2012-09-17 14:53:30
slave: tegra-267

18072 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Comment hidden (Legacy TBPL/Treeherder Robot)

philor
https://tbpl.mozilla.org/php/getParsedLog.php?id=15375459&tree=Fx-Team
Android Tegra 250 fx-team opt test mochitest-2 on 2012-09-20 04:25:20
slave: tegra-192

18072 ERROR TEST-UNEXPECTED-FAIL | /tests/content/media/test/test_closing_connections.html | Test timed out.

Nobody; OK to take it and work on it

Updated

•

12 years ago

Keywords: intermittent-failure

Nobody; OK to take it and work on it

Updated

•

12 years ago

Whiteboard: [orange]