Closed Bug 522416 Opened 15 years ago Closed 15 years ago

Tab Previews must not do sync http requests

Tracking

()

Status:

RESOLVED FIXED

Milestone:

Firefox 3.7a1

Tracking Flags:

Tracking

Status

status1.9.2

---

beta1-fixed

People

(Reporter: vlad, Assigned: sdwilsh)

References

(Blocks 1 open bug)

Details

(Keywords: dev-doc-complete)

Attachments

(6 files, 4 obsolete files)

xpcom and netwerk changes v1.0 15 years ago Shawn Wilsher :sdwilsh 7.42 KB, patch		Details \| Diff \| Splinter Review
Taskbar changes v1.0 15 years ago Shawn Wilsher :sdwilsh 4.06 KB, patch		Details \| Diff \| Splinter Review
Taskbar changes v1.1 15 years ago Shawn Wilsher :sdwilsh 4.26 KB, patch		Details \| Diff \| Splinter Review
Taskbar changes v1.2 15 years ago Shawn Wilsher :sdwilsh 4.46 KB, patch	vlad : review+	Details \| Diff \| Splinter Review
Widget and test fixes v1.0 15 years ago Rob Arnold [:robarnold] 4.10 KB, patch	vlad : review+	Details \| Diff \| Splinter Review
netwerk changes v2.0 15 years ago Shawn Wilsher :sdwilsh 5.86 KB, patch	bzbarsky : review+ vlad : superreview+	Details \| Diff \| Splinter Review
Taskbar changes v1.3 15 years ago Shawn Wilsher :sdwilsh 3.55 KB, patch		Details \| Diff \| Splinter Review
netwerk changes v2.0 for 1.9.2 15 years ago Shawn Wilsher :sdwilsh 5.12 KB, patch		Details \| Diff \| Splinter Review
hang fix v1.0 15 years ago Shawn Wilsher :sdwilsh 1.15 KB, patch	bzbarsky : review+	Details \| Diff \| Splinter Review
tests for hang vix v1.0 15 years ago Shawn Wilsher :sdwilsh 1.27 KB, patch	bzbarsky : review+	Details \| Diff \| Splinter Review

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Description

•

15 years ago

This made the problem in bug 521668 much much easier to hit, but that's a separate thing.  Regardless, tab preview code must never do sync network requests;  WindowsPreviewPerTab.jsm's _imageFromURI needs to become async.

Flags: blocking-firefox3.6+

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Updated

•

15 years ago

Priority: -- → P1

Justin Dolske [:Dolske]

Comment 1

•

15 years ago

Use nsIFaviconService?

Rob Arnold [:robarnold]

Comment 2

•

15 years ago

That would probably be better. For some reason I thought that I was only getting favicon:// urls which were resolved to some local location.

Dão Gottwald [:dao]

Updated

•

15 years ago

Component: Tabbed Browser → Shell Integration

QA Contact: tabbed.browser → shell.integration

Marco Bonardo [:mak]

Comment 3

•

15 years ago

i think using moz-anno:favicon:uri should be the way to go

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 4

•

15 years ago

Hm, it's not a matter of just changing the URI, right?  The code has to be async, regardless of what URI is being used.

Mike Beltzner [:beltzner, not reading bugmail]

Comment 5

•

15 years ago

Who can take this bug - it's blocking the beta.

Component: Shell Integration → Tabbed Browser

Marco Bonardo [:mak]

Comment 6

•

15 years ago

moz-anno:favicon uses an async channel

Mike Beltzner [:beltzner, not reading bugmail]

Comment 7

•

15 years ago

I think Vlad meant that it needs to be read asynchronously, so the required fix is a little more than just pulling the information from a different channel - though that will certainly help!

Mike Beltzner [:beltzner, not reading bugmail]

Comment 8

•

15 years ago

Shawn said he'd take this

Assignee: nobody → sdwilsh

Shawn Wilsher :sdwilsh

Assignee

Comment 9

•

15 years ago

Attached patch xpcom and netwerk changes v1.0 (obsolete) — Details — Splinter Review

This was a lot harder than expected.  Step one - lets make a convenience method for JavaScript to easily open channels asynchronously.  Required a change in nsStorageStream.

Attachment #406568 - Flags: superreview?(benjamin)

Attachment #406568 - Flags: review?(bzbarsky)

Shawn Wilsher :sdwilsh

Assignee

Comment 10

•

15 years ago

Attached patch Taskbar changes v1.0 (obsolete) — Details — Splinter Review

And the jsm changes needed with the new API.

Attachment #406570 - Flags: review?(vladimir)

Shawn Wilsher :sdwilsh

Assignee

Comment 11

•

15 years ago

Attached patch Taskbar changes v1.1 (obsolete) — Details — Splinter Review

per comments from robarnold on irc

Attachment #406570 - Attachment is obsolete: true

Attachment #406583 - Flags: review?(vladimir)

Attachment #406570 - Flags: review?(vladimir)

Justin Dolske [:Dolske]

Comment 12

•

15 years ago

So this is still reading from the network (albeit async now)? Seems like that's worth a followup bug to move this to use the favicon service (or Places' moz-anno stuff), to avoid the network requests entirely? [Along with the followup you mentioned for favicons often not loading at all.]

Shawn Wilsher :sdwilsh

Assignee

Comment 13

•

15 years ago

(In reply to comment #12)
> So this is still reading from the network (albeit async now)? Seems like that's
> worth a followup bug to move this to use the favicon service (or Places'
> moz-anno stuff), to avoid the network requests entirely? [Along with the
> followup you mentioned for favicons often not loading at all.]
The favicon service has no asynchronous way to read data from the database with the exception of using the moz-anno: protocol for favicons, which is what this does.  The moz-anno: protocol has to go through necko as a result, but it's not actually going over the network.  With these patches, this bit of code no longer does any synchronous IO on the main thread.

Whiteboard: [needs review vlad][needs review bz][needs sr bsmedberg]

Justin Dolske [:Dolske]

Comment 14

•

15 years ago

(In reply to comment #13)
> With these patches, this bit of code no
> longer does any synchronous IO on the main thread.

Sure, but the point is a DB lookup should complete faster than an HTTP request, sync/async has nothing to do with it. We can change the favicon service to offer an async method if that's a win.

Shawn Wilsher :sdwilsh

Assignee

Comment 15

•

15 years ago

Attached patch Taskbar changes v1.2 (obsolete) — Details — Splinter Review

Fix default icon race and a comment nit from robarnold.

Attachment #406583 - Attachment is obsolete: true

Attachment #406588 - Flags: review?(vladimir)

Attachment #406583 - Flags: review?(vladimir)

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Attachment #406568 - Flags: superreview?(benjamin) → superreview?(vladimir)

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Whiteboard: [needs review vlad][needs review bz][needs sr bsmedberg] → [needs review vlad][needs review bz][needs sr vlad]

Marco Bonardo [:mak]

Comment 16

•

15 years ago

(In reply to comment #14)
> We can change the favicon service to
> offer an async method if that's a win.

no reason, if you want to get an async channel from favicon service, as i said, and as i implemented apart, you can open an async channel to a moz-anno:favicon: address (Well now you could even use the netUtil method Shwan added). i tested that working, but Shawn's solution looks more polite than this http://mozilla.pastebin.com/m365fe0b4.

Shawn Wilsher :sdwilsh

Assignee

Comment 17

•

15 years ago

(In reply to comment #16)
> no reason, if you want to get an async channel from favicon service, as i said,
> and as i implemented apart, you can open an async channel to a
> moz-anno:favicon: address (Well now you could even use the netUtil method Shwan
> added). i tested that working, but Shawn's solution looks more polite than this
> http://mozilla.pastebin.com/m365fe0b4.
we also have to worry about a delayed load with the favicon service, right?
http://mxr.mozilla.org/mozilla-central/source/toolkit/components/places/src/nsFaviconService.cpp#503

Boris Zbarsky [:bzbarsky]

Comment 18

•

15 years ago

A few quick questions:

1)  Is there a reason to use a storage stream here instead of a pipe?  The latter would allow data to be discarded as it's read, which might be desirable.

2)  If you use nsISimpleStreamListener you don't have to reimplement it in js (or rather just need to implement an nsIRequestObserver with an OnStopRequest that calles the callback), right?  What are the drawbacks?

Rob Arnold [:robarnold]

Comment 19

•

15 years ago

Attached patch Widget and test fixes v1.0 — Details — Splinter Review

Ran the regression tests on Shawn's patch. Found a crash-near-startup bug (see widget fixes; test included to catch a regression on it) and all the browser tests for AeroPeek were broken in bug 520724 so those are fixed too.

Attachment #406615 - Flags: review?(vladimir)

Rob Arnold [:robarnold]

Updated

•

15 years ago

Status: NEW → ASSIGNED

Shawn Wilsher :sdwilsh

Assignee

Comment 20

•

15 years ago

(In reply to comment #18)
> 1)  Is there a reason to use a storage stream here instead of a pipe?  The
> latter would allow data to be discarded as it's read, which might be desirable.
biesi and I were talking on irc and thought that this would be simpler than using a pipe.  I don't fully recall the reasoning (we looked at three or four options all together).  For a more general case, a pipe might be better.

> 2)  If you use nsISimpleStreamListener you don't have to reimplement it in js
> (or rather just need to implement an nsIRequestObserver with an OnStopRequest
> that calles the callback), right?  What are the drawbacks?
Do I just pass the nsISimpleStreamListener to asyncOpen on the channel, and then OnStopRequest, flush init it with the either the nsIStorageStream's output stream or a pipe's output stream?

Marco Bonardo [:mak]

Comment 21

•

15 years ago

(In reply to comment #17)
> (In reply to comment #16)
> we also have to worry about a delayed load with the favicon service, right?
> http://mxr.mozilla.org/mozilla-central/source/toolkit/components/places/src/nsFaviconService.cpp#503
that's why i added a second icon refresh after 5s. but yeah that's the main problem of the favicon service approach.

Phil Ringnalda (:philor)

Updated

•

15 years ago

Component: Tabbed Browser → Shell Integration

Boris Zbarsky [:bzbarsky]

Comment 22

•

15 years ago

OK.  No strong opinions on storage stream vs pipe, I guess, other than pipe already supporting writefrom and such.

> Do I just pass the nsISimpleStreamListener to asyncOpen on the channel, and
> then OnStopRequest, flush init it with the either the nsIStorageStream's output
> stream or a pipe's output stream?

In your asyncOpen, you create the nsISimpleStreamListener, init it with the output end of a pipe or storage stream and an observer of your choice that watches for onStopRequest, and pass the nsISimpleStreamListener to the channel's asyncOpen.

Boris Zbarsky [:bzbarsky]

Comment 23

•

15 years ago

Comment on attachment 406568 [details] [diff] [review]
xpcom and netwerk changes v1.0

>+    return Write(data.get(), aCount, _bytesWritten);

s/aCount/data.Length()/, right?

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Updated

•

15 years ago

Attachment #406615 - Flags: review?(vladimir) → review+

Shawn Wilsher :sdwilsh

Assignee

Comment 24

•

15 years ago

(In reply to comment #23)
> s/aCount/data.Length()/, right?
Those should be the same, right?  I could add an assertion there I suppose...

Whiteboard: [needs review vlad][needs review bz][needs sr vlad] → [needs review bz][needs sr vlad]

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Updated

•

15 years ago

Attachment #406588 - Flags: review?(vladimir) → review+

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 25

•

15 years ago

Maybe call asyncOpen something like asyncFetch?  "asyncOpen" implies just opening to me, whereas the important part of the API (and this particular usage of it) is that it will open and fully read from the stream, ensuring that reading from the input stream will never block.

Boris Zbarsky [:bzbarsky]

Comment 26

•

15 years ago

> Those should be the same, right? 

Not if the stream hit EOF before aCount bytes got read from it.  Which can happen, no problem.

Shawn Wilsher :sdwilsh

Assignee

Comment 27

•

15 years ago

Attached patch netwerk changes v2.0 — Details — Splinter Review

s/asyncOpen/asyncFetch/ per comments by vlad on irc
Use nsIPipe and nsISimpleStreamListener

Attachment #406568 - Attachment is obsolete: true

Attachment #406744 - Flags: superreview?(vladimir)

Attachment #406744 - Flags: review?(bzbarsky)

Attachment #406568 - Flags: superreview?(vladimir)

Attachment #406568 - Flags: review?(bzbarsky)

Shawn Wilsher :sdwilsh

Assignee

Comment 28

•

15 years ago

Attached patch Taskbar changes v1.3 — Details — Splinter Review

Updated to use the new API name.  Ready to land.

Attachment #406588 - Attachment is obsolete: true

Shawn Wilsher :sdwilsh

Assignee

Comment 29

•

15 years ago

(In reply to comment #26)
> Not if the stream hit EOF before aCount bytes got read from it.  Which can
> happen, no problem.
Ah.  That code has all been removed from the latest patch anyway since I no longer needed to make that change.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Updated

•

15 years ago

Attachment #406744 - Flags: superreview?(vladimir) → superreview+

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Whiteboard: [needs review bz][needs sr vlad] → [needs review bz]

Boris Zbarsky [:bzbarsky]

Comment 30

•

15 years ago

Comment on attachment 406744 [details] [diff] [review]
netwerk changes v2.0

Looks good.

Attachment #406744 - Flags: review?(bzbarsky) → review+

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Whiteboard: [needs review bz]

Shawn Wilsher :sdwilsh

Assignee

Comment 31

•

15 years ago

http://hg.mozilla.org/mozilla-central/rev/d3ce844996ee
http://hg.mozilla.org/mozilla-central/rev/a9e961601e20
http://hg.mozilla.org/mozilla-central/rev/3124e42df5a6

Status: ASSIGNED → RESOLVED

Closed: 15 years ago

Keywords: dev-doc-needed

Resolution: --- → FIXED

Target Milestone: --- → Firefox 3.7a1

Shawn Wilsher :sdwilsh

Assignee

Comment 32

•

15 years ago

Attached patch netwerk changes v2.0 for 1.9.2 — Details — Splinter Review

This didn't apply perfectly on branch - one minor conflict in the test file.  This does.

Marco Bonardo [:mak]

Comment 33

•

15 years ago

Shawn, do you think makes sense to file a bug to use the favicon service when we fail fetching through http? (offline mode or network errors)

Shawn Wilsher :sdwilsh

Assignee

Comment 34

•

15 years ago

I think it makes sense to always use the favicon service, and we should file a bug for that.

Shawn Wilsher :sdwilsh

Assignee

Comment 35

•

15 years ago

http://hg.mozilla.org/releases/mozilla-1.9.2/rev/d6cce120cff7
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/56c0fa3e56ba
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/429e7f7ba34d

relbranch soon...

status1.9.2: --- → final-fixed

Shawn Wilsher :sdwilsh

Assignee

Comment 36

•

15 years ago

release branch!  I had to take the changesets for bug 521216 since it would require non-trivial changes for this work to apply to just the release branch.  That bug had already landed on 1.9.2, so I figure it's probably just fine.

http://hg.mozilla.org/releases/mozilla-1.9.2/rev/c307c9951835
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/5d6d66d2e92d
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/28f2f49b61fe

status1.9.2: final-fixed → beta1-fixed

Marco Bonardo [:mak]

Updated

•

15 years ago

Blocks: 522855

Marco Bonardo [:mak]

Comment 37

•

15 years ago

i have filed Bug 522855 to implement this through the favicon service.

Mike Beltzner [:beltzner, not reading bugmail]

Comment 38

•

15 years ago

Awesome, Shawn. Thanks so much for getting this taken care of on 1.9.2 and the releasebranch.

status1.9.2: beta1-fixed → ---

Mike Beltzner [:beltzner, not reading bugmail]

Updated

•

15 years ago

status1.9.2: --- → beta1-fixed

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 39

•

15 years ago

Ugh... with today's nightly (Minefield, haven't tried Namaroka), I see deadlocks that look like this:

ntdll_77330000!NtWaitForSingleObject
KERNELBASE!WaitForSingleObjectEx
KERNEL32!WaitForSingleObjectExImplementation
KERNEL32!WaitForSingleObject
nspr4!_PR_MD_WAIT_CV
nspr4!_PR_WaitCondVar
nspr4!PR_Wait
xul!nsAutoMonitor::Wait
xul!nsPipeInputStream::Wait
xul!nsPipeInputStream::ReadSegments
xul!NS_InputStreamIsBuffered
xul!imgTools::DecodeImageData
xul!NS_InvokeByIndex_P
xul!XPCWrappedNative::CallMethod
xul!XPCWrappedNative::GetLock
xul!XPC_WN_OnlyIWrite_PropertyStub
MOZCRT19!arena_malloc_small
MOZCRT19!malloc
mozjs!JS_HashTableRawAdd
mozjs!JS_HashTableAdd

There are only two users of DecodeImageData, and the other one is in C++ code; so it has to be the one here.   This stack is bogus after the XPCWrappedNative bits due to xpconnect, but the top part is telling me that the PipeInputStream is blocking on DecodeImageData -- I'm guessing that DecodeImageData consumed all the data, but the pipe somehow didn't get notified that the base stream got closed.  So, it's getting a wouldblock and is calling Wait(), leading to badness.

Is a simple fix here to set the pipe to non-blocking via SetNonBlocking?  Or is there some deeper problem?

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Brian Carpenter [:geeknik]

Comment 40

•

15 years ago

Is it possible that these deadlocks are behind the random 'not responding' issues I am having with the 17 October build of Minefield?

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 41

•

15 years ago

I just hit this again despite having the taskbar previews pref set to false -- does that pref actually work?  It does seem to turn off the previews, but this code still seems to be executed.

(In reply to comment #40)
> Is it possible that these deadlocks are behind the random 'not responding'
> issues I am having with the 17 October build of Minefield?

If you're running Windows 7, then quite possibly, yes.

Jim Jeffery not reading bug-mail 1/2/11

Comment 42

•

15 years ago

I can confirm that the build with cset:
http://hg.mozilla.org/mozilla-central/rev/8976e1704153 is OK, no hangs on Acid3 test

The first build with cset: 
http://hg.mozilla.org/mozilla-central/rev/3124e42df5a6 which contains this patch locks hard, and have to kill the process. 

Win7RC build 7100 
AMD Athlon CPU Phenom II Quad X4, 8 gig RAM HD3200 onboard video 512meg

Brian Carpenter [:geeknik]

Updated

•

15 years ago

Blocks: 522892

Rob Arnold [:robarnold]

Comment 43

•

15 years ago

(In reply to comment #41)
> I just hit this again despite having the taskbar previews pref set to false --
> does that pref actually work?  It does seem to turn off the previews, but this
> code still seems to be executed.

Yes, the previews' visibility is just set to false. That's the only difference.

Sylvain Pasche

Comment 44

•

15 years ago

Apparently, pages with empty favicons trigger the hang:

data:text/html,<link rel="icon" href="data:image/x-icon,">

Shawn Wilsher :sdwilsh

Assignee

Comment 45

•

15 years ago

(In reply to comment #39)
> Is a simple fix here to set the pipe to non-blocking via SetNonBlocking?  Or is
> there some deeper problem?
hmm.  bz - did we want this line:
pipe.init(false, false, 0, PR_UINT32_MAX, null);
to be this instead:
pipe.init(true, true, 0, PR_UINT32_MAX, null);

Boris Zbarsky [:bzbarsky]

Comment 46

•

15 years ago

Er... we want non-blocking streams, so yes.  I double-checked that when I read the patch originally, and clearly got confused by all the negatives...

That said, would just calling close() on the output end of the pipe in the onStopRequest there before calling the callback address the issue?

Boris Zbarsky [:bzbarsky]

Comment 47

•

15 years ago

Ah, so the stack in comment 39 makes sense: we're trying to read 1 byte from a blocking stream.

I think we might want to both close() and make the streams non-blocking to be safe, though the former should be enough (changes the return from GetReadSegment from NS_BASE_STREAM_WOULD_BLOCK to NS_BASE_STREAM_CLOSED when the input end of the pipe is empty).

Mike Beltzner [:beltzner, not reading bugmail]

Comment 48

•

15 years ago

Vlad, I'm resetting this as status1.9.2:--- on the assumption that you meant to have this continue to block the beta. (If that's not right, we should close this bug again and file a follow up.)

status1.9.2: beta1-fixed → ---

Shawn Wilsher :sdwilsh

Assignee

Comment 49

•

15 years ago

(In reply to comment #48)
> Vlad, I'm resetting this as status1.9.2:--- on the assumption that you meant to
> have this continue to block the beta. (If that's not right, we should close
> this bug again and file a follow up.)
It needs to continue to block the beta.  I need to head out to dinner, but I'll make a patch tonight for bz to review.  I'll try to land it tomorrow.

bz - ideas for a test that would fail and won't in the "correct" situation?  Preferably xpcshell.

Shawn Wilsher :sdwilsh

Assignee

Comment 50

•

15 years ago

Attached patch hang fix v1.0 — Details — Splinter Review

I'll make a test for this on Monday when I get back to the office.  I'm having issues with building on this machine.

Attachment #406902 - Flags: review?(bzbarsky)

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Whiteboard: [needs review bz]

Boris Zbarsky [:bzbarsky]

Comment 51

•

15 years ago

Comment on attachment 406902 [details] [diff] [review]
hang fix v1.0

r=bzbarsky

If we're talking unit test for the fetch method here, you could fetch "data:text/plain," then in the caller try to read one byte from the input stream returned.  If it hangs, the test failed.  ;)

Attachment #406902 - Flags: review?(bzbarsky) → review+

Boris Zbarsky [:bzbarsky]

Comment 52

•

15 years ago

Oh, and presumably do the read in a try/catch, since it'll throw BASE_STREAM_CLOSED when passing.

Rob Arnold [:robarnold]

Comment 53

•

15 years ago

This morning I foolishly updated to today's nightly and immediately encountered this hang on startup. I can now verify that this patch fixes the hang and the favicons still do show up as expected in the tab previews.

Whiteboard: [needs review bz]

Brian Carpenter [:geeknik]

Comment 54

•

15 years ago

It would seem that a temporary workaround would be to toggle browser.chrome.favicons to false.  After I did this, I was able to get the Acid3 test to run w/o hanging.

Shawn Wilsher :sdwilsh

Assignee

Comment 55

•

15 years ago

If somebody has the ability to land this, please do.  I'm not going to be able to do so for several hours.

As stated in comment 50, I'll write the test on Monday.  Comment 53 indicates that the patch fixes the hang.

Keywords: checkin-needed

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 56

•

15 years ago

Checked in:

http://hg.mozilla.org/mozilla-central/rev/001a1805c476
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/5336e6da58ab

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 57

•

15 years ago

And on the relbranch:

http://hg.mozilla.org/releases/mozilla-1.9.2/rev/984d02abebf0

status1.9.2: --- → beta1-fixed

Nick Thomas [:nthomas] (UTC+12)

Comment 58

•

15 years ago

(In reply to comment #35)
> http://hg.mozilla.org/releases/mozilla-1.9.2/rev/d6cce120cff7
> http://hg.mozilla.org/releases/mozilla-1.9.2/rev/56c0fa3e56ba
> http://hg.mozilla.org/releases/mozilla-1.9.2/rev/429e7f7ba34d

xpcshell tests have been hanging mozilla-1.9.2 since this set of changes landed there, and the hang fix has not changed that. Eg http://tinderbox.mozilla.org/showlog.cgi?log=Firefox3.6-Unittest/1255907641.1255910818.28959.gz
It passes xpcshell/tests/test_intl_strres/unit/test_bug397093.js and then hangs. xpcshell/tests/test_necko/unit/test_NetUtil.js is the next test in logs when it there isn't a hang.

The regression range is only the three changes above, which points the finger at d6cce120cff7. Could someone please check asap if this is a app regression or a test regression.

Shawn Wilsher :sdwilsh

Assignee

Comment 59

•

15 years ago

I'm not sure how that's passing on mozilla-central either...

There should be a run_next_test() call here I think.  
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/d6cce120cff7#l2.76

I'm in the process of rebuilding to test this fix, but I suspect it's it.

Keywords: checkin-needed

Brian Carpenter [:geeknik]

Comment 60

•

15 years ago

I just downloaded the latest hourly with this patch in it and when I toggle browser.chrome.favicons back to true, I don't get a hang on the Acid3 site or other sites that were hanging.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 62

•

15 years ago

Pushed the run_next_test() call to all 3 branches.

Shawn Wilsher :sdwilsh

Assignee

Comment 63

•

15 years ago

(In reply to comment #62)
> Pushed the run_next_test() call to all 3 branches.
I presume that worked then?  My build just finally finished...

Nick Thomas [:nthomas] (UTC+12)

Comment 64

•

15 years ago

Apparently it didn't: http://tinderbox.mozilla.org/showlog.cgi?tree=Firefox3.6-Unittest&errorparser=unittest&logfile=1255917030.1255919856.28064.gz&buildtime=1255917030&buildname=OS%20X%2010.5.2%20mozilla-1.9.2%20test%20everythingelse&fulltext=1
is http://hg.mozilla.org/releases/mozilla-1.9.2/rev/35804d6d3262

TEST-PASS | /builds/slave/mozilla-1.9.2-macosx-unittest-everythingelse/build/xpcshell/tests/test_intl_strres/unit/test_bug397093.js | test passed

command timed out: 1200 seconds without output, killing pid 372

Shawn Wilsher :sdwilsh

Assignee

Comment 65

•

15 years ago

Silly.  NetUtil.ioService doesn't exist on 1.9.2, so the test fails there.  We never report the error because it's outside of the normal run_test() function (asynchronous testing FTL I guess).

Fixed in:
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/8217722e6af4
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/5879c8980e69

Mike Beltzner [:beltzner, not reading bugmail]

Comment 66

•

15 years ago

So is this now resolved? Looks like the patch fixes the issue (comment 60) and the test failures were resolved (comment 65).

status1.9.2: beta1-fixed → ---

Mike Beltzner [:beltzner, not reading bugmail]

Comment 67

•

15 years ago

Marking FIXED and status1.9.2beta1-fixed.

Status: REOPENED → RESOLVED

Closed: 15 years ago → 15 years ago

status1.9.2: --- → beta1-fixed

Resolution: --- → FIXED

Shawn Wilsher :sdwilsh

Assignee

Comment 68

•

15 years ago

(In reply to comment #66)
> So is this now resolved? Looks like the patch fixes the issue (comment 60) and
> the test failures were resolved (comment 65).
Left it open because I still need to write a test as stated in comment 50.  If we close it, I'm more likely to lose track of it.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Shawn Wilsher :sdwilsh

Assignee

Comment 69

•

15 years ago

(In reply to comment #59)
> I'm not sure how that's passing on mozilla-central either...
> 
> There should be a run_next_test() call here I think.  
> http://hg.mozilla.org/releases/mozilla-1.9.2/rev/d6cce120cff7#l2.76
Ugh - this was actually wrong.  The call that stops the server takes a callback, where we pass that method in anyway.  My new patch with the test will fix this.

Mike Beltzner [:beltzner, not reading bugmail]

Comment 70

•

15 years ago

This is a P1 blocker, so if you think we should resolve the test issue before we ship the beta, let's leave it open. If not, though, I'd suggest filing a follow-up for the tests, mark it blocking? and I'll + it as a P2 blocker.

Status: REOPENED → RESOLVED

Closed: 15 years ago → 15 years ago

Resolution: --- → FIXED

Shawn Wilsher :sdwilsh

Assignee

Comment 71

•

15 years ago

Attached patch tests for hang vix v1.0 — Details — Splinter Review

The tests, as promised in comment 50

Attachment #407080 - Flags: review?(bzbarsky)

Shawn Wilsher :sdwilsh

Assignee

Updated

•

15 years ago

Attachment #407080 - Flags: review?(bzbarsky)

Shawn Wilsher :sdwilsh

Assignee

Comment 72

•

15 years ago

I'll land on branch in a bit.  Patch doesn't apply completely cleanly, so being extra careful this time.

http://hg.mozilla.org/mozilla-central/rev/3314dc670ff4

Boris Zbarsky [:bzbarsky]

Comment 73

•

15 years ago

Comment on attachment 407080 [details] [diff] [review]
tests for hang vix v1.0

r+, fwiw.

Attachment #407080 - Flags: review+

Shawn Wilsher :sdwilsh

Assignee

Comment 74

•

15 years ago

Tests on branch:
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/826d70847afc (release branch)
http://hg.mozilla.org/releases/mozilla-1.9.2/rev/1fb7fbbb6c36

Shawn Wilsher :sdwilsh

Assignee

Comment 75

•

15 years ago

And documentation:
https://developer.mozilla.org/en/JavaScript_code_modules/NetUtil.jsm#asyncFetch

Leaving dev-doc-needed so sheppy can look it over.

Eric Shepherd [:sheppy]

Comment 76

•

15 years ago

Looks good - nice job!

Keywords: dev-doc-needed → dev-doc-complete

Marcia Knous [:marcia]

Comment 77

•

15 years ago

Shawn: What is the easiest way for QA verify this bug? Am seeing some weirdness while testing the candidate builds.  Thanks.

Shawn Wilsher :sdwilsh

Assignee

Comment 78

•

15 years ago

(In reply to comment #77)
> Shawn: What is the easiest way for QA verify this bug? Am seeing some weirdness
> while testing the candidate builds.  Thanks.
That's probably best identified by vlad since he noticed the issue in the first place.  I can't think of an easy way to verify it.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 79

•

15 years ago

"Doesn't crash during browsing" is really the only way I'm verifying it.  I was crashing trivially every 15 min or so with more than a handful of tabs open, especially upon session restore.  Bugzilla tabs in particular seemed highly likely to trigger the problem.

Vladimir Vukicevic [:vlad] [:vladv] (needinfo me, slow to respond)

Reporter

Comment 80

•

15 years ago

FWIW, I've been browsing and reviewing bugs with yesterday's nightly all day yesterday and today with no crashes; I'd be comfortable calling this VERIFIED based on that.

You need to log in before you can comment on or make changes to this bug.