Closed Bug 1152046 Opened 5 years ago Closed 4 years ago

[Win] close() and connect() on a socket may block for a very long time.

Categories

(Core :: Networking, defect)

All
Windows
defect
Not set

Tracking


RESOLVED FIXED
mozilla44
Tracking Status
firefox40 + wontfix
firefox41 + wontfix
firefox42 + wontfix
firefox43 --- wontfix
firefox44 --- affected

People

(Reporter: mayhemer, Assigned: dragana)

References

Details

(Whiteboard: [necko-active])

Attachments

(5 files, 11 obsolete files)

1.66 KB, patch
dragana
: review+
Details | Diff | Splinter Review
21.21 KB, patch
dragana
: review+
Details | Diff | Splinter Review
39.65 KB, patch
dragana
: review+
Details | Diff | Splinter Review
3.27 KB, patch
dragana
: review+
Details | Diff | Splinter Review
5.14 KB, patch
dragana
: review+
Details | Diff | Splinter Review
Honza - what do you think of the theory that this is being caused by a locally installed LSP? Perhaps the LSP is blocking on a non-blocking socket.

Here are three reports that look like that is what is happening:

https://crash-stats.mozilla.com/report/index/cd6dc70f-1afc-4596-94b6-8d26c2150404#allthreads
https://crash-stats.mozilla.com/report/index/77bf1520-1759-437b-a209-885922150401#allthreads
https://crash-stats.mozilla.com/report/index/9adc4168-b377-4a7c-801d-546d32150402#allthreads

In each case the Winsock LSP list is considerably more complicated than the Winsock LSP list you find in a crash due to something in Gecko (e.g. the dispatch deadlock). I took all the samples from XP SP3 just because I thought that might make the LSP lists more comparable - but it happens on multiple versions of Windows.
If you mean the stack:

KiFastSystemCallRet
NtDeviceIoControlFile
WSPBind
WSPAddressToString
WSPConnect
connect
_PR_MD_CONNECT
SocketConnect

then it looks to me like a pure native call (no LSP involved).  Being in bind() sounds like it is trying to find a local interface to bind to.  It may be the same cause as in the UDP close() case, waiting for an ARP or (more likely) a DHCP response - but those are guesses, not supported by any research right now.

It could also be OEM-specific code that misbehaves here...  something very hard to find out :(

There is, though, 'rsvpsp.dll' in the LSP list.  I don't have that file on my Win7 machine.
Blocks: 1158189
No longer blocks: 1124880
Assignee: nobody → dd.mozilla
rsvpsp.dll is also from Windows (used for some priorities, QoS stuff). (related to comment #2)
Attached file .bug_1152046_v1.patch.swp (obsolete) —
For most of these crashes the socket thread is doing PR_Connect.

This patch prevents connect during shutdown.

Let's try it and see if it helps.
Attachment #8610603 - Flags: review?(honzab.moz)
What's this 'swp' thing?
Attached patch bug_1152046_v1.patch (obsolete) — Splinter Review
sorry
Attachment #8610603 - Attachment is obsolete: true
Attachment #8610603 - Flags: review?(honzab.moz)
Attachment #8611372 - Flags: review?(honzab.moz)
Comment on attachment 8611372 [details] [diff] [review]
bug_1152046_v1.patch

Review of attachment 8611372 [details] [diff] [review]:
-----------------------------------------------------------------

This is good to have but I don't think it's going to help much with the root cause.  The delays/blocks in connect()/close() are IMO happening not just after shutdown.  The thing is that we try to shut down while the socket thread has already been hanging inside connect()/close() for a while.  Maybe that is even a reason people restart the browser - since pages are not loading, the socket thread is deadlocked.

There are two reasons this could happen IMO: A/V software interference or an LSP malware.

Some code that detects the time spent in connect()/close() would be good to have.  At least collect telemetry: time spent in the call, how often it spends more than some 2 seconds, does it happen after a network change, does it happen after shutdown only, whatever would be useful.

If long delays happen all the time (based on telemetry), we could do a test at the start or after any network change we detect and try to do something like manual local binding, or even let connect()/close() happen on new threads (similar to what I did for UDP close), and maybe warn the user that "your OS performs slowly when doing network access" or something...

::: netwerk/base/nsIOService.h
@@ +77,5 @@
>                                      uint32_t flags,
>                                      nsAsyncRedirectVerifyHelper *helper);
>  
>      bool IsOffline() { return mOffline; }
> +    bool IsShuttingDown() { return mShutdown; }

Maybe call this IsShutdown()?  To make it conform to the name of the member.

::: netwerk/base/nsSocketTransport2.cpp
@@ +1217,5 @@
>      bool isLocal;
>      IsLocal(&isLocal);
>  
> +    if (gIOService->IsShuttingDown()) {
> +      return NS_ERROR_ABORT;

nit: the file has 4-space indent
Attachment #8611372 - Flags: review?(honzab.moz) → review+
(In reply to Honza Bambas (:mayhemer) from comment #7)
> Comment on attachment 8611372 [details] [diff] [review]
> bug_1152046_v1.patch
> 
> Review of attachment 8611372 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> This is good to have but I don't think it's gonna much help with the root
> cause.  The delays/blocks in connect()/close() are IMO happening not just
> after shutdown.  The thing is that we try to shutdown while the socket
> thread has already been hanging inside connect()/close() for a while.  Maybe
> that is even a reason people restart the browser - since pages are not
> loading, socket thread is deadlocked.
> 
> There are two reasons this could happen IMO: A/V software interference or an
> LSP malware.
> 
> Some code that detects the time spent in connect()/close() would be good to
> have.  At least collect telemetry, like time spent in, how often it happens
> to spent more than some 2 seconds, does it happen after network change, does
> it happen after shutdown only, whatever would be useful.
> 


I intended to add telemetry as well; it is coming soon...
Attachment #8611372 - Attachment is obsolete: true
Attachment #8612760 - Flags: review+
Attached patch bug_1152046_telemetry_v1.patch (obsolete) — Splinter Review
Attachment #8613916 - Flags: review?(honzab.moz)
Comment on attachment 8613916 [details] [diff] [review]
bug_1152046_telemetry_v1.patch

Review of attachment 8613916 [details] [diff] [review]:
-----------------------------------------------------------------

::: netwerk/base/nsSocketTransport2.cpp
@@ +53,5 @@
>  #endif
>  /* End keepalive config inclusions. */
>  
> +// If PR_Connect, PR_Close and PR_ContinueConnect are longer than
> +// STS_HANG_DURATION ms, consired it a hang and collect data about the state we

consired (typo)

@@ +1410,5 @@
>  
>      NetAddrToPRNetAddr(&mNetAddr, &prAddr);
>  
>      MOZ_EVENT_TRACER_EXEC(this, "net::tcp::connect");
> +    PRIntervalTime connectStarted;

please explain why you prefer PR intervals here (atomicity)

@@ +3100,5 @@
> +                                      STS_STATE_SHUTDOWN);
> +
> +            } else if ((gIOService->LastConnectivityChange() >= closeStarted)
> +                       &&
> +                       (gIOService->LastOfflineStateChange() >= closeStarted)){

I think we should include some window after the change.  Like "has been there a change in the net config in, say, last 60 seconds".  If so, add a 'state-changed' telemetry.  Otherwise, add 'normal-state' telemetry.

Then, I'm not sure change in the 'connectivity yes/no' state is what you want... Maybe check with :badger on this first.  There can be a wire connection, then for awhile both wired and wifi, then wire goes away (and that will not be seen as a change in connectivity..)

@@ +3116,5 @@
> +            } else {
> +                Telemetry::Accumulate(Telemetry::STS_LONG_PRCLOSE_STATE,
> +                                      STS_STATE_NORMAL);
> +            }
> +        }

please share the code, it's easy

::: toolkit/components/telemetry/Histograms.json
@@ +2462,5 @@
>      "high": "60000",
>      "n_buckets": 1000,
>      "description": "Time spend blocked on poll (ms)."
>    },
> +  "STS_PRCONNECT_TIME": {

The name should be "PRCONNECT_BLOCKING_TIME"

@@ +2500,5 @@
> +    "expires_in_version": "never",
> +    "kind": "enumerated",
> +    "n_values": 5,
> +    "description": "If it is in PR_Close more than 2000 ms, collect telemetry about the state: 4: shutdown, 3: offline and connectivity state changed 2: offline state changed, 1: connectivity change and 0: none."
> +  },

I'd rather have telemetry like:
PRCONNECT_NORMAL
PRCONNECT_AFTER_NETWORK_CHANGE
PRCONNECT_AFTER_SHUTDOWN

etc.. collecting time (exponential)
Attachment #8613916 - Flags: review?(honzab.moz) → feedback-
(In reply to Honza Bambas (:mayhemer) from comment #14)
> Comment on attachment 8613916 [details] [diff] [review]
> bug_1152046_telemetry_v1.patch
> 
> Review of attachment 8613916 [details] [diff] [review]:
> -----------------------------------------------------------------
> @@ +3100,5 @@
> > +                                      STS_STATE_SHUTDOWN);
> > +
> > +            } else if ((gIOService->LastConnectivityChange() >= closeStarted)
> > +                       &&
> > +                       (gIOService->LastOfflineStateChange() >= closeStarted)){
> 
> I think we should include some window after the change.  Like "has been
> there a change in the net config in, say, last 60 seconds".  If so, add a
> 'state-changed' telemetry.  Otherwise, add 'normal-state' telemetry.
> 
> Then, I'm not sure change in the 'connectivity yes/no' state is what you
> want... Maybe check with :badger on this first.  There can be a wire
> connection, then for awhile both wired and wifi, then wire goes away (and
> that will not be seen as a change in connectivity..)
> 

Looking at the code I am sure that connectivity means we have some connection or we do not have any. I will check this with :badger.
I want to separate the connectivity and offline states, which are 2 separate things. Looking at the code, a connectivity state change will change the offline state if the offline state has not been forced for some other reason.

So I want to have:
PRCONNECT_BLOCKING_NORMAL - no shutdown, no network change, no offline state change
PRCONNECT_BLOCKING_SHUTDOWN - we are in shutdown
PRCONNECT_BLOCKING_NETWORK_CHANGE - connectivity changed
PRCONNECT_BLOCKING_OFFLINE - set offline (this will probably be the same as normal, except when we have some error in necko code)
Attached patch bug_1152046_telemetry_v2.patch (obsolete) — Splinter Review
Can you take a look at the connectivity bit?

I wanted to save the time of the last connectivity change - connected/disconnected.
Attachment #8613916 - Attachment is obsolete: true
Attachment #8614624 - Flags: feedback?(daniel)
Tracking for 40. Are other versions affected?
Daniel, can you help here?
Flags: needinfo?(daniel)
I see no problems in this!
Flags: needinfo?(daniel)
Attachment #8614624 - Flags: feedback?(daniel) → feedback+
(In reply to Ryan VanderMeulen [:RyanVM UTC-4] from comment #12)
> https://hg.mozilla.org/mozilla-central/rev/1badbdeca325

Landed on Aurora a=sledru to fix uplift bustage from bug 1169554.
https://hg.mozilla.org/releases/mozilla-aurora/rev/6b9bf28f9b8a
Attached patch bug_1152046_telemetry_v2.patch (obsolete) — Splinter Review
Attachment #8614624 - Attachment is obsolete: true
Attachment #8628777 - Flags: review?(honzab.moz)
Comment on attachment 8628777 [details] [diff] [review]
bug_1152046_telemetry_v2.patch

Review of attachment 8628777 [details] [diff] [review]:
-----------------------------------------------------------------

Please fix a few of the nits and if there are no major changes you want to do, just land it!  Good work.

::: netwerk/base/nsIOService.cpp
@@ +1521,5 @@
>      }
>  
>      bool isUp = true;
>      if (!strcmp(data, NS_NETWORK_LINK_DATA_CHANGED)) {
> +        mLastNetworkLinkChange = PR_IntervalNow();

please double check this.  I'm not sure this fires every time..

::: netwerk/base/nsIOService.h
@@ +173,5 @@
>      static bool                          sTelemetryEnabled;
> +
> +    // These are needed for the telemetry about PR_Connect anf PR_Close
> +    // duration. If PRConnect or PR_Close last to long check whether
> +    // connectivity or offline state has change in between.

Rather:

"These timestamps are needed for collecting telemetry on PR_Connect, PR_ConnectContinue and PR_Close blocking time.  On too long time spent in any of these functions we want to know what network change happened shortly before."

BTW, it might be smarter to collect only times that take more than, say, 1 second?  I am really not sure on this, but I don't want to see a graph of 0 - 0.1 ms -> 99% and then the rest (that we are interested in!) hidden in the 1 percent of all the captures.

::: netwerk/base/nsSocketTransport2.cpp
@@ +1406,3 @@
>      status = PR_Connect(fd, &prAddr, NS_SOCKET_CONNECT_TIMEOUT);
> +
> +    if (gSocketTransportService->IsTelemetryEnabled() && connectStarted) {

if you want the "&& connectStarted" part then you must initialize connectStarted to 0.  maybe have the condition just as "if (connectStarted) {" ?

@@ +1881,3 @@
>          PRStatus status = PR_ConnectContinue(fd, outFlags);
> +
> +        if (gSocketTransportService->IsTelemetryEnabled() && connectStarted) {

same here

::: netwerk/base/nsSocketTransportService2.h
@@ +109,5 @@
>  
>      // Returns true if keepalives are enabled in prefs.
>      bool IsKeepaliveEnabled() { return mKeepaliveEnabledPref; }
> +
> +    bool IsTelemetryEnabled() { return mTelemetryEnabledPref; }

some small comment would be nice
Attachment #8628777 - Flags: review?(honzab.moz) → review+
Attachment #8628777 - Attachment is obsolete: true
Attachment #8630588 - Flags: review+
Check-in needed only for patch bug_1152046_telemetry_v2.patch
Keywords: checkin-needed
Status: NEW → ASSIGNED
Honza, just wanted to ping you on whether we have a fix in the works on this one or a patch ready for uplift to beta. bug 1158189 is a top crash on beta channel and blocked by this bug. Thanks!
Flags: needinfo?(honzab.moz)
I will answer and Honza can correct me.

We are currently collecting telemetry to see when it is happening. We assume it is after a network connection change or maybe just when there is no connectivity.

For Honza:
From the telemetry we have collected so far I see this happening in all cases sometimes. Some values are pretty high on Windows, and on OS X too, but not as high as on Windows. I could do a fix to split connect off to a separate thread. (I need to look closely into the data to see if we can skip it in some cases.)
(In reply to Dragana Damjanovic [:dragana] from comment #29)
> I will answer and Honza can correct me.
> 
> We currently collecting telemetry to see when it is happening. We assume it
> is after network connection change or maybe just when there is no
> connectivity. 
> 
> For Honza:
> Form telemetry that we have collected so far I see this happening for all
> cases some times. some are pretty high on windows and on osx but not as high
> as in windows. I could do a fix to split connect to a separate thread. (I
> need to look closely into the data to see if we can skip in some cases)

As Dragana says, the only fix here is to either do more investigation, like contacting the people this happens to (not a usual way we work at Mozilla, though), or have separate thread(s) handling these function calls.

I think having a thread pool (limited to, say, 5 or so threads) is the way to go here as a general workaround.

I personally believe there is some antivirus or virus (how ironic :)) involved here, but that is more of a guess.
Flags: needinfo?(honzab.moz)
A peek into telemetry between 8 and 20 July 2015.

The numbers next to the platform names show the maximal value seen.

Linux values are very low in comparison to OS X and Windows. And Windows has much higher numbers than OS X, which is why most of the crashes are on Windows.

PRCONNECT_BLOCKING_TIME_CONNECTIVITY_CHANGE:
  linux 26ms
  osx   23s
  windows even 2 samples above 60s

PRCONNECT_BLOCKING_TIME_LINK_CHANGE:
  linux -
  osx 700ms
  windows 9s

PRCONNECT_BLOCKING_TIME_NORMAL
  linux 116ms
  osx   9s 
  windows 19 samples above 60s

PRCONNECT_BLOCKING_TIME_OFFLINE
  (small sample size)
  linux (very small sample size: 40 samples, 1-2ms)
  osx (very small sample size: 200 samples, 1-2ms)
  windows 43s


PRCONNECTCONTINUE_BLOCKING_TIME_CONNECTIVITY_CHANGE
  linux 23ms
  osx 97ms
  windows 11s

PRCONNECTCONTINUE_BLOCKING_TIME_LINK_CHANGE
  linux -
  osx 46ms
  windows 10s

PRCONNECTCONTINUE_BLOCKING_TIME_NORMAL
  linux 73ms
  osx 133ms
  windows 2 samples above 60s

PRCONNECTCONTINUE_BLOCKING_TIME_OFFLINE:
 (small sample size)
 windows 16ms

PRCLOSE_BLOCKING_TIME_CONNECTIVITY_CHANGE
  linux 53ms
  osx 16s
  windows 1 sample above 60s

PRCLOSE_BLOCKING_TIME_LINK_CHANGE
  linux -
  osx 1s
  windows 48s

PRCLOSE_BLOCKING_TIME_NORMAL
  linux 11s
  osx 27s
  windows 9 samples above 60s

PRCLOSE_BLOCKING_TIME_OFFLINE:
  (small sample size)
  linux 6s
  osx 86ms
  windows 16s
damn. those are crazy numbers. Dragana, do these numbers look different at the 98th or 99th percentile rather than at the max? If not, then this is pretty nuts. (If so, even simple things like some kind of massive memory thrashing can explain .0001 outliers, which would be irrelevant to us)

I think it is interesting that the close_blocking_time_normal is so bad cross platform.

Honza, do you think there is any reason to have this done synchronously (to the rest of gecko)? 

can we just implement pr_close at the bottom of the stack to return true without closing anything, but pass the OS fd to a thread that queues them up and closes them as fast as possible? I suggest this rather than a thread pool because the thread pool still creates a slow sync point for the outlier cases (though it improves HoL blocking).

at least we have a good metric we could track now. Thanks Dragana.
Flags: needinfo?(honzab.moz)
Flags: needinfo?(dd.mozilla)
(In reply to Patrick McManus [:mcmanus] from comment #32)
> damn. those are crazy numbers. Dragana do these numbers look different at
> the 98th or 99th percentile rather than max? If not, then this is pretty
> nuts. (If so, even simple things like some kind of massive memory thrashing
> can explain .0001 outliers which would be irrelevant to us) 

Yep, it would be good to check what percentile of blocks takes more than, say, 4 seconds.  Or any other reasonable number of seconds.

> 
> I think it is interesting that the close_blocking_time_normal is so bad
> cross platform.
> 
> Honza, do you think there is any reason to have this done synchronously (to
> the rest of gecko)? 
> 
> can we just implement pr_close at the bottom of the stack to return true
> without closing anything, but pass the OS fd to a thread that queues them up
> and closes them as fast as possible. I suggest this rather than a thread
> pool as a possibility 


"because the thread pool still creates a slow sync point for the outlier cases" 

I don't follow this argument.  Are you suggesting we have one thread per close (connect)?  I did the same for UDP socket close - no pool, spawn a new thread for any invocation of nsUDPSocket::Close() ending up at PR_Close().  But we use UDP more rarely than TCP, so having a ton of threads for each connect we call seems to me like a waste of resources.  But I can be persuaded otherwise.  Both solutions are good.  The only problem is that the impact of a raised number of threads is hard to measure; it's kind of a slow death...

> (though improves HoL blocking).
> 
> at least we have a good metric we could track now. Thanks Dragana.

Yep, thanks!
Flags: needinfo?(honzab.moz)
roughly:


1] put a nspr layer where layer->lower == tcp/udp

2] implement pr_close(aFD) {
int fd = GetOsFD(aFD); // I forget what this is really called
aFD->lower->dtor(aFD->lower); // or maybe set fd to -1 and call lower->close()
postmessage(closingThread, fd);
return 0;
}

on closingThread
while(!shutdown) {
recvmessage(&fd);
close(fd);
}
(In reply to Patrick McManus [:mcmanus] from comment #32)
> damn. those are crazy numbers. Dragana do these numbers look different at
> the 98th or 99th percentile rather than max? If not, then this is pretty
> nuts. (If so, even simple things like some kind of massive memory thrashing
> can explain .0001 outliers which would be irrelevant to us) 
> 



I should have added these stats too; I was just surprised by the max values, they are crazy.
The 95th and even the 99th percentile are only 1ms.
There are 3365 samples larger than 1 second and 1453 larger than 4s.

Dispatching this to a pool of threads, or just separate threads, is going to make this worse for everybody else.  I am not sure we should do this.

> I think it is interesting that the close_blocking_time_normal is so bad
> cross platform.
> 
> Honza, do you think there is any reason to have this done synchronously (to
> the rest of gecko)? 
> 
> can we just implement pr_close at the bottom of the stack to return true
> without closing anything, but pass the OS fd to a thread that queues them up
> and closes them as fast as possible. I suggest this rather than a thread
> pool as a possibility because the thread pool still creates a slow sync
> point for the outlier cases (though improves HoL blocking).
> 
> at least we have a good metric we could track now. Thanks Dragana.
Flags: needinfo?(dd.mozilla)
My message got in conflict with yours.
The suggestion from comment 34 could work.
(In reply to Patrick McManus [:mcmanus] from comment #34)
> roughly:
> 
> 
> 1] put a nspr layer where layer->lower == tcp/udp
> 
> 2] implement pr_close(aFD) {
> int fd = GetOsFD(aFD); // I forget what this is really called
> aFD->lower->dtor(aFD->lower); // or maybe set fd to -1 and call
> lower->close()
> postmessage(closingThread, fd);
> return 0;
> }
> 
> on closingThread
> while(!shutdown) {
> recvmessage(&fd);
> close(fd);
> }

If we just leak any queued (not yet closed) sockets after shutdown, then this is a good solution.  It should also replace my patch for UDP close.

How about connect/cont-connect, though?
Attached patch bug_1152046_prclose_v1.patch (obsolete) — Splinter Review
Here is one version. Please take a look...
Attachment #8641199 - Flags: feedback?(mcmanus)
Comment on attachment 8641199 [details] [diff] [review]
bug_1152046_prclose_v1.patch

Review of attachment 8641199 [details] [diff] [review]:
-----------------------------------------------------------------

Nice!  Just a few internals to fix.

::: netwerk/base/nsClosingService.cpp
@@ +16,5 @@
> +TcpUdpPRCloseLayerClose(PRFileDesc *aFd)
> +{
> +  PRFileDesc *fd = aFd->lower;
> +  nsClosingService::PostCloseRequest(fd);
> +  aFd = 0;

you probably don't need to do this

@@ +36,5 @@
> +{
> +  PR_DestroyLock(mLock);
> +  PR_DestroyCondVar(mQueueCondVar);
> +  gInstance = nullptr;
> +  mQueue.Clear();

hmm.. the queue must be accessed only on a single thread (which I think it cannot be) or always be accessed unconditionally only under the lock

@@ +40,5 @@
> +  mQueue.Clear();
> +}
> +
> +void
> +nsClosingService::Start()

put // static before the method definition (applies to all static method here)

@@ +105,5 @@
> +void
> +nsClosingService::PostCloseRequest(PRFileDesc *aFd)
> +{
> +  // Check if shutdown is called.
> +  if (!sShutdown) {

nicer would be:

if (sShutdown) {
  // Let the socket leak, we want the system take care... bla bla bla
  return;
}

if (..)..

@@ +110,5 @@
> +    // Check if gInstance is present, if not call in the standard way.
> +    // Theoretically this can happen before nsIOService initializes
> +    // nsClosingService.
> +    if (gInstance) {
> +      gInstance->PostRequest(aFd);

so, I'm missing one major thing here: you are not correctly handling the service lifetime

each layer instance you create must have its 'secret' set to a struct that has an nsRefPtr<nsClosingService> that refers to the service.

here you will pick the refptr from your secret struct and use it instead of gInstance and then just nullify it.  this will make sure the queue and the locks exist.  the code you have may nicely race.

other option is to StaticMutex or StaticMonitor over gInstance

@@ +115,5 @@
> +    } else {
> +      PR_Close(aFd);
> +    }
> +  }
> +  aFd = nullptr;

why?

@@ +179,5 @@
> +  PR_Unlock(mLock);
> +}
> +
> +void
> +nsClosingService::ResolveRequest()

maybe ProcessRequest would be better (if you really want to have a method for this, inlining into ThreadFunc would work well too)

@@ +186,5 @@
> +  if (mQueue.Length() == 0) {
> +    PR_WaitCondVar(mQueueCondVar, PR_INTERVAL_NO_TIMEOUT);
> +  }
> +  if (sShutdown) {
> +    PR_Unlock(mLock);

grrr....

::: netwerk/base/nsClosingService.h
@@ +62,5 @@
> +                            mozilla::Telemetry::ID aIDLinkChange,
> +                            mozilla::Telemetry::ID aIDOffline);
> +
> +  static bool sShutdown;
> +  static nsClosingService *gInstance;

sInstance (or preferably sSelf) since it's an object's static not a global static

@@ +65,5 @@
> +  static bool sShutdown;
> +  static nsClosingService *gInstance;
> +  nsTArray<PRFileDesc *> mQueue;
> +  PRLock *mLock;
> +  PRCondVar *mQueueCondVar;

use mozilla::Monitor please.  if there is some reason not to please add a comment why

::: netwerk/base/nsIncrementalDownload.cpp
@@ +17,5 @@
>  #include "nsIFile.h"
>  #include "nsITimer.h"
> +#include "nsIStreamListener.h"
> +#include "nsIURI.h"
> +#include "nsIInputStream.h"

hmm?

::: netwerk/base/nsSimpleNestedURI.cpp
@@ +7,5 @@
>  
>  #include "nsSimpleNestedURI.h"
>  #include "nsIObjectInputStream.h"
>  #include "nsIObjectOutputStream.h"
> +#include "nsNetUtil.h"

hmm2?

::: netwerk/base/nsURLHelper.cpp
@@ +12,5 @@
>  #include "nsCOMPtr.h"
>  #include "nsCRT.h"
>  #include "nsNetCID.h"
>  #include "prnetdb.h"
> +#include "mozilla/Preferences.h"

hmm3?
Attachment #8641199 - Flags: feedback+
Attachment #8641199 - Flags: feedback?(mcmanus)
We're at the end of the 40 cycle and this bug is now wontfix for 40.

As this bug is visible in 40 Betas and 41 Aurora, I'm tracking for 41+.
Attached patch bug_1152046_prclose_v1.patch (obsolete) — Splinter Review
Attachment #8641199 - Attachment is obsolete: true
https://treeherder.mozilla.org/#/jobs?repo=try&revision=873fff73ca52


Honza, can you take a look please?

There is a socket leak that is expected to happen. A possibility is to allow PR_Close for, say, 200-500 ms and then let the remaining sockets leak?
Flags: needinfo?(honzab.moz)
I'm not sure what you exactly want from me, can you restate please?
Flags: needinfo?(honzab.moz) → needinfo?(dd.mozilla)
Attached patch bug_1152046_prclose_v1.patch (obsolete) — Splinter Review
May I ask you for a review? Thanks.
Attachment #8650560 - Attachment is obsolete: true
Flags: needinfo?(dd.mozilla) → needinfo?(honzab.moz)
Jason, would you be able to review this? I noticed Honza's Bugzilla email says "not reviewing". This bug is tracked for FF41, so if there is a fix that is safe, we would want to uplift to Beta 41 soon.
Flags: needinfo?(jduell.mcbugs)
Honza has been reviewing previous patches here so I'm going to let him keep doing that.  Bounce it back to me if he hasn't gotten to it in a few days.
Flags: needinfo?(jduell.mcbugs)
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #44)
> I'm not sure what you exactly want from me, can you restate please?

I was actually asking about:

"There is a leak on socket that is expected to happend. A possibilty is to allow PR_Close for let's say 200-500 ms and then let the rest sockets leak?"

what does that mean?
Flags: needinfo?(dd.mozilla)
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #48)
> (In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #44)
> > I'm not sure what you exactly want from me, can you restate please?
> 
> I was actually asking about:
> 
> "There is a leak on socket that is expected to happend. A possibilty is to
> allow PR_Close for let's say 200-500 ms and then let the rest sockets leak?"
> 
> what does that mean?

Sorry, I have fixed that - I added PR_Free(fd), which I had forgotten before, and I have done some other fixes, so there is no leak anymore.
Flags: needinfo?(dd.mozilla)
Comment on attachment 8654487 [details] [diff] [review]
bug_1152046_prclose_v1.patch

Review of attachment 8654487 [details] [diff] [review]:
-----------------------------------------------------------------

nsClosingService -> mozilla::net::ClosingService

::: netwerk/base/nsClosingService.cpp
@@ +34,5 @@
> +  if (!aFd) {
> +    return PR_FAILURE;
> +  }
> +
> +  ClosingLayerSecret *closingService =

s/closingService/closingLayerSecret/

@@ +35,5 @@
> +    return PR_FAILURE;
> +  }
> +
> +  ClosingLayerSecret *closingService =
> +    reinterpret_cast<ClosingLayerSecret *>(aFd->secret);

you must do this on your layer!  how can you be sure aFd is what you expect?

@@ +39,5 @@
> +    reinterpret_cast<ClosingLayerSecret *>(aFd->secret);
> +  aFd->secret = nullptr;
> +
> +  PRFileDesc* layer = PR_PopIOLayer(aFd, PR_TOP_IO_LAYER);
> +  NS_ASSERTION(layer &&

MOZ_ASSERT, maybe actually MOZ_RELEASE_ASSERT

@@ +42,5 @@
> +  PRFileDesc* layer = PR_PopIOLayer(aFd, PR_TOP_IO_LAYER);
> +  NS_ASSERTION(layer &&
> +               layer->identity == sTcpUdpPRCloseLayerId,
> +               "Closing Layer not on top of stack");
> +  layer->dtor(layer);

are you sure about this?  doesn't PR_Close on the layer do that?  I think this just delegates to the layer below.

@@ +46,5 @@
> +  layer->dtor(layer);
> +
> +  PRStatus status = PR_SUCCESS;
> +
> +  if (aFd) {

will always be true

@@ +60,5 @@
> +      }
> +    } else {
> +      // If the ClosingService layer is the first layer above PR_NSPR_IO_LAYER
> +      // we are not going to leak anything, but the PR_Close will not be called.
> +      PR_Free(aFd);

shouldn't you use dtor() here?  I'm no expert (it's been too long since the last time I played with this stuff) so rather double check.  but that would make more sense to me.  same in the shutdown case in ThreadFunc

@@ +69,5 @@
> +  return status;
> +}
> +
> +mozilla::Atomic<bool> nsClosingService::sShutdown(false);
> +nsRefPtr<nsClosingService> nsClosingService::sInstance = nullptr;

these both are static initializers, something we don't want

you can have a raw sInstance ptr and NS_ADDREF it in the ctor and NS_RELEASE it in the dtor.  be careful with multiple threads though..

@@ +158,5 @@
> +    PR_Free(aFd);
> +    return;
> +  }
> +
> +  bool notify = (mQueue.Length() == 0)? true : false;

)? 

..missing space

@@ +161,5 @@
> +
> +  bool notify = (mQueue.Length() == 0)? true : false;
> +
> +  mQueue.AppendElement(aFd);
> +  if (notify) {

if (mQueue.Length() == 1) ?

@@ +174,5 @@
> +  MOZ_ASSERT(NS_IsMainThread());
> +
> +  if (sInstance) {
> +    sInstance->ShutdownInternal();
> +    sInstance = nullptr;

so, another thread may race and take a ref to sInstance after it has been shut down?  Is that OK?  Maybe it is, just checking..

@@ +200,5 @@
> +
> +void
> +nsClosingService::ThreadFunc()
> +{
> +  while (1) {

for (;;) or while (true) are preferred

@@ +217,5 @@
> +          // PR_NSPR_IO_LAYER we are not going to leak anything, but PR_Close
> +          // will not be called.
> +          PR_Free(fd);
> +        }
> +        mQueue.Clear(); 

ws

::: netwerk/base/nsClosingService.h
@@ +28,5 @@
> +
> +namespace mozilla {
> +namespace net {
> +
> +class nsClosingService final : public nsISupports

to make nsRefPtr work with this you just need to define AddRef and Release.  There is a macro in nsISupportsImpl.h, NS_INLINE_DECL_THREADSAFE_REFCOUNTING, for inline declaration of those two for you.  Don't derive from nsISupports when not needed (which it is not in this case)

@@ +58,5 @@
> +                            mozilla::Telemetry::ID aIDConnectivityChange,
> +                            mozilla::Telemetry::ID aIDLinkChange,
> +                            mozilla::Telemetry::ID aIDOffline);
> +
> +  static nsRefPtr<nsClosingService> sInstance;

hm... this is a static initializer.  not good..

::: netwerk/base/nsIOService.cpp
@@ +43,5 @@
>  #include "nsIWidget.h"
>  #include "nsThreadUtils.h"
>  #include "mozilla/LoadInfo.h"
>  #include "mozilla/net/NeckoCommon.h"
> +#include "mozilla/Services.h"

why are all the includes here?

::: netwerk/base/nsSocketTransportService2.cpp
@@ +557,5 @@
>  
> +#if !defined(MOZILLA_XPCOMRT_API)
> +    // Start closing service. The PRClose functions will be carried out on
> +    // the closeService thread. Started here so that it is started just
> +    // once.

Start the closing service.  Actual PR_Close() will be carried out on a separate "closing" thread.  Start the closing service here since this point is executed only once per session.

::: netwerk/base/nsURLHelper.cpp
@@ +11,5 @@
>  #include "nsIURLParser.h"
>  #include "nsCOMPtr.h"
>  #include "nsCRT.h"
>  #include "nsNetCID.h"
> +#include "mozilla/Preferences.h"

why this include?
Attachment #8654487 - Flags: review-
Flags: needinfo?(honzab.moz)
I am not sure if the fix will come in time for 41. We have lived one release (40) with this issue and we may have to live with it for another release 41. Updating status-firefox41 to wontfix.

If the fix is ready in the next 3-4 days, and safe to uplift to Beta, please nominate.
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #51)
> Comment on attachment 8654487 [details] [diff] [review]
> bug_1152046_prclose_v1.patch
> 
> Review of attachment 8654487 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> @@ +35,5 @@
> > +    return PR_FAILURE;
> > +  }
> > +
> > +  ClosingLayerSecret *closingService =
> > +    reinterpret_cast<ClosingLayerSecret *>(aFd->secret);
> 
> you must do this on your layer!  how can you be sure aFd is what you expect?


If we are not on the right layer, something is really wrong, but you are right: it should be done in the correct order.


> 
> @@ +42,5 @@
> > +  PRFileDesc* layer = PR_PopIOLayer(aFd, PR_TOP_IO_LAYER);
> > +  NS_ASSERTION(layer &&
> > +               layer->identity == sTcpUdpPRCloseLayerId,
> > +               "Closing Layer not on top of stack");
> > +  layer->dtor(layer);
> 
> are you sure about this?  doesn't PR_Close on the layer do that?  I think
> this just delegates to the layer below.
> 


We do not want to call PR_Close. PR_Close calls the close function of this layer, which is the function we are in :)
layer->dtor and PR_Free(layer) will end up doing the same thing, because the layer has been popped and the stack connections are cut (lower and higher are set to null).


> @@ +46,5 @@
> > +  layer->dtor(layer);
> > +
> > +  PRStatus status = PR_SUCCESS;
> > +
> > +  if (aFd) {
> 
> will always be true
> 

aFd does not have the same value as at the beginning of the function. Call to PR_PopIOLayer has set aFd to the next layer in the chain.
aFd should be non-null, but I have added this check just to be sure.


> @@ +60,5 @@
> > +      }
> > +    } else {
> > +      // If the ClosingService layer is the first layer above PR_NSPR_IO_LAYER
> > +      // we are not going to leak anything, but the PR_Close will not be called.
> > +      PR_Free(aFd);
> 
> shouldn't you use dtor() here?  I'm no expert (it's been too long since the
> last time I played with this stuff) so rather double check.  but that would
> make more sense to me.  same in the shutdown case in ThreadFunc
> 

The only difference between PR_Free and dtor is that dtor cuts the connections in the layer chain (sets lower and higher to null), but in this case it does not matter which one we use.

> 
> @@ +174,5 @@
> > +  MOZ_ASSERT(NS_IsMainThread());
> > +
> > +  if (sInstance) {
> > +    sInstance->ShutdownInternal();
> > +    sInstance = nullptr;
> 
> so, other thread may race and take ref to sInstance after is has been shut
> down?  Is that OK?  Maybe it is, just checking..
> 

It is OK because mShutdown is checked before putting a request in the queue.



> ::: netwerk/base/nsIOService.cpp
> @@ +43,5 @@
> >  #include "nsIWidget.h"
> >  #include "nsThreadUtils.h"
> >  #include "mozilla/LoadInfo.h"
> >  #include "mozilla/net/NeckoCommon.h"
> > +#include "mozilla/Services.h"
> 
> why are all the includes here?
> 
> ::: netwerk/base/nsURLHelper.cpp
> @@ +11,5 @@
> >  #include "nsIURLParser.h"
> >  #include "nsCOMPtr.h"
> >  #include "nsCRT.h"
> >  #include "nsNetCID.h"
> > +#include "mozilla/Preferences.h"
> 
> why this include?

This fixes compile errors (caused by adding a new file to moz.build, which probably caused a file to move from one unified compilation file to another).
Attached patch bug_1152046_prclose_v2.patch (obsolete) — Splinter Review
Attachment #8654487 - Attachment is obsolete: true
Flags: needinfo?(honzab.moz)
(In reply to Dragana Damjanovic [:dragana] from comment #54)
> (In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #51)
> > Comment on attachment 8654487 [details] [diff] [review]
> > bug_1152046_prclose_v1.patch
> > 
> > Review of attachment 8654487 [details] [diff] [review]:
> > -----------------------------------------------------------------
> > 
> > @@ +35,5 @@
> > > +    return PR_FAILURE;
> > > +  }
> > > +
> > > +  ClosingLayerSecret *closingService =
> > > +    reinterpret_cast<ClosingLayerSecret *>(aFd->secret);
> > 
> > you must do this on your layer!  how can you be sure aFd is what you expect?
> 
> 
> If we are not on the right layer something is really wrong, but you I right
> it should be one in the correct order.

If the code ensures you are on the right level then OK.  But some kind of assertion at least would be at place.

> 
> 
> > 
> > @@ +42,5 @@
> > > +  PRFileDesc* layer = PR_PopIOLayer(aFd, PR_TOP_IO_LAYER);
> > > +  NS_ASSERTION(layer &&
> > > +               layer->identity == sTcpUdpPRCloseLayerId,
> > > +               "Closing Layer not on top of stack");
> > > +  layer->dtor(layer);
> > 
> > are you sure about this?  doesn't PR_Close on the layer do that?  I think
> > this just delegates to the layer below.
> > 
> 
> 
> We do not want to call PR_Close. PR_Close calls te close function of this
> layer with is the function that we are in :)
> layer->dtor and PR_Free(layer) will end up doing the same thing because the
> layer is popped up and the stack connections are cut (lower and higher are
> set to null)

Aha :)

> 
> 
> > @@ +46,5 @@
> > > +  layer->dtor(layer);
> > > +
> > > +  PRStatus status = PR_SUCCESS;
> > > +
> > > +  if (aFd) {
> > 
> > will always be true
> > 
> 
> aFd does not have the same value as at the beginning of the function. Call
> to PR_PopIOLayer has set aFd to the next layer in the chain.
> aFd should be true but I have added this check just to be sure. 

OK, leave it as is.

> 
> 
> > @@ +60,5 @@
> > > +      }
> > > +    } else {
> > > +      // If the ClosingService layer is the first layer above PR_NSPR_IO_LAYER
> > > +      // we are not going to leak anything, but the PR_Close will not be called.
> > > +      PR_Free(aFd);
> > 
> > shouldn't you use dtor() here?  I'm no expert (it's been too long since the
> > last time I played with this stuff) so rather double check.  but that would
> > make more sense to me.  same in the shutdown case in ThreadFunc
> > 
> 
> the only difference between PR_Free and dtor is that dtor cuts connection in
> the layer chain (sets lower and higher to null), but in this case this does
> not matter which one we are using.

If that is the only difference then I would pref calling dtor here.  It may also release any secret structs PR_Free has no chance to ever release.



I'll do the review of the new version now.
Comment on attachment 8657801 [details] [diff] [review]
bug_1152046_prclose_v2.patch

Review of attachment 8657801 [details] [diff] [review]:
-----------------------------------------------------------------

::: netwerk/base/ClosingService.cpp
@@ +6,5 @@
> +
> +#include "ClosingService.h"
> +#include "nsIOService.h"
> +
> +using namespace mozilla::net;

instead of a using-directive, put all of this into the mozilla::net namespace.

@@ +12,5 @@
> +static PRIOMethods sTcpUdpPRCloseLayerMethods;
> +static PRIOMethods *sTcpUdpPRCloseLayerMethodsPtr = nullptr;
> +static PRDescIdentity sTcpUdpPRCloseLayerId;
> +
> +class ClosingLayerSecret

this must be in an anonymous namespace

@@ +38,5 @@
> +
> +  PRFileDesc* layer = PR_PopIOLayer(aFd, PR_TOP_IO_LAYER);
> +  MOZ_RELEASE_ASSERT(layer &&
> +                     layer->identity == sTcpUdpPRCloseLayerId,
> +                     "Closing Layer not on top of stack");

aha, it's here!  good.

@@ +51,5 @@
> +    // let it leak.
> +    if (gIOService->IsShutdown()) {
> +      // If the ClosingService layer is the first layer above PR_NSPR_IO_LAYER
> +      // we are not going to leak anything, but the PR_Close will not be called.
> +      PR_Free(aFd);

I'd prefer using dtor here, but if there is a problem with that, leave PR_Free

@@ +57,5 @@
> +      closingLayerSecret->mClosingService->PostRequest(aFd);
> +    } else {
> +      // Socket is created before closing service has been started or there was
> +      // a problem with starting it.
> +      status = aFd->methods->close(aFd);

PR_Close(aFd)?

@@ +73,5 @@
> +ClosingService::ClosingService()
> +  : mShutdown(false)
> +  , mMonitor("ClosingService.mMonitor")
> +{
> +  NS_ASSERTION(sInstance==nullptr,

MOZ_ASSERT(!sInstance) - preferred - or sInstance == nullptr

@@ +87,5 @@
> +    PR_ASSERT(PR_INVALID_IO_LAYER != sTcpUdpPRCloseLayerId);
> +
> +    sTcpUdpPRCloseLayerMethods = *PR_GetDefaultIOMethods();
> +    sTcpUdpPRCloseLayerMethods.close = TcpUdpPRCloseLayerClose;
> +    sTcpUdpPRCloseLayerMethodsPtr = &sTcpUdpPRCloseLayerMethods;

nit: why is sTcpUdpPRCloseLayerMethodsPtr needed?

@@ +91,5 @@
> +    sTcpUdpPRCloseLayerMethodsPtr = &sTcpUdpPRCloseLayerMethods;
> +  }
> +
> +  if (!sInstance) {
> +    ClosingService* service = new ClosingService();

it was better in the previous version (to use nsRefPtr here)

@@ +128,5 @@
> +  layer = PR_CreateIOLayerStub(sTcpUdpPRCloseLayerId,
> +                               sTcpUdpPRCloseLayerMethodsPtr);
> +
> +  if (!layer) {
> +    return NS_ERROR_FAILURE;

this may IMO not be fatal.

@@ +139,5 @@
> +
> +  if (status == PR_FAILURE) {
> +    delete secret;
> +    PR_DELETE(layer);
> +    return NS_ERROR_FAILURE;

I would not make this fatal either... but it depends. Your call.

@@ +156,5 @@
> +    // time. To prevent shutdown crash (bug 1152046) do not accept sockets any
> +    // more.
> +    // If the ClosingService layer is the first layer above PR_NSPR_IO_LAYER
> +    // we are not going to leak anything, but PR_Close will not be called.
> +    PR_Free(aFd);

dtor here as well?

@@ +176,5 @@
> +  if (sInstance) {
> +    sInstance->ShutdownInternal();
> +    ClosingService* service = sInstance;
> +    sInstance = nullptr;
> +    NS_RELEASE(service);

I think you can just NS_RELEASE(sInstance).  It will nullify sInstance for you.

::: netwerk/base/nsSocketTransportService2.cpp
@@ +555,5 @@
>          obsSvc->AddObserver(this, "last-pb-context-exited", false);
>      }
>  
> +#if !defined(MOZILLA_XPCOMRT_API)
> +    // Start the closing service. Actual PRClose() will be carried out on

PR_Close()
Attachment #8657801 - Flags: review+
Flags: needinfo?(honzab.moz)
->dtor does not work; it must be PR_Free
Attachment #8657801 - Attachment is obsolete: true
Attachment #8659673 - Flags: review+
Keywords: checkin-needed
hi, the first patch didn't apply cleanly:

renamed 1152046 -> bug_1152046_v1.patch
applying bug_1152046_v1.patch
patching file netwerk/base/nsIOService.h
Hunk #1 FAILED at 72
1 out of 1 hunks FAILED -- saving rejects to file netwerk/base/nsIOService.h.rej
patching file netwerk/base/nsSocketTransport2.cpp
Hunk #1 FAILED at 1211
1 out of 1 hunks FAILED -- saving rejects to file netwerk/base/nsSocketTransport2.cpp.rej
patch failed, unable to continue (try -v)
patch failed, rejects left in working directory

is this checkin-needed request for all patches?
Flags: needinfo?(dd.mozilla)
Keywords: checkin-needed
Comment on attachment 8659673 [details] [diff] [review]
bug_1152046_prclose_v2.patch

Only for patch bug_1152046_prclose_v2.patch.

Thanks.
Flags: needinfo?(dd.mozilla)
Attachment #8659673 - Flags: checkin+
Keywords: checkin-needed
backed out comment 63 due to crashes in 1205016
Attachment #8659673 - Flags: checkin+
Something that landed on top of the original landing isn't playing well with the backout, so I've had to back out the backout. We were seeing b2g debug shutdown hangs.
This bug is causing a performance regression on b2g. We need a forward fix in bug 1204837 with this patch on m-c.

Since this bug is specific to Windows, will the fix be limited to Windows in the next revision? Or will we still have this on all platforms even though neither Linux nor MacOS has this problem?
Flags: needinfo?(dd.mozilla)
(In reply to Cervantes Yu [:cyu] [:cervantes] from comment #69)
> This bug is causing performance regression on b2g. We need a forward fix in
> bug 1204837 with this patch on m-c.
> 
> Since this bug is specific to Windows, will the fix be limited to Windows in
> the next revision? Or we'll still have this on all platforms even neither
> Linux nor MacOS has this problem?

Sometimes PR_Close blocks for a long time; it is rare, but it still happens. The blocking times on Windows are longer than on the other OSes (Linux, OS X), but long blocking times happen on all of them. So we will keep this change for all OSes.

Sorry I did not know about bug 1204837.
Flags: needinfo?(dd.mozilla)
I moved ClosingService starting/stopping into nsIOService to be sure that it happens only once.
I think this is not going to solve the crash, but I want to make this change.

Honza, may I ask you for a review, please.
Flags: needinfo?(honzab.moz)
So this landed, and broke Linux stuff.
This was reverted, and broke b2g debug stuff.
That revert was then reverted (so back to this landed), so b2g debug stuff is working, but Linux stuff is broken again.


So what's the plan here? We can't leave both Linux stuff or b2g debug stuff broken like this.
Flags: needinfo?(mcmanus)
Flags: needinfo?(dd.mozilla)
dragana is actively working on it. no idea why the b2g stuff came to depend on it in 24 hrs.
Flags: needinfo?(mcmanus)
Note: I backed out bug 1152046 (again) locally, since I was insta-crashing with it in.  That lets me run, but testing https://mozilla.github.com/webrtc-landing/pc_test.html crashes in the same place (jemalloc/internal/bitmap.h) when in e10s - but not in a non-e10s profile.  So it's very touchy and machine/perf/allocation/? dependent.

This also implies to me the actual bug is elsewhere (jemalloc?)
I am working on the Linux problem.

I do not know why b2g is broken. I checked: the files that this patch changes have not been modified since the patch landed.

This patch does not call PR_Close after we enter shutdown; it just cleans up but does not close the socket. b2g hits a shutdown hang; in any case, maybe we should figure out why it hangs when this patch gets backed out.

Nigel, do you know what has changed in b2g, probably something related to sockets?
Flags: needinfo?(dd.mozilla) → needinfo?(nigelbabu)
I have no idea. I'm probably not the best person to answer this question.
Flags: needinfo?(nigelbabu)
No longer blocks: 1001691
Attachment #8664191 - Flags: review?(mcmanus)
Attachment #8664191 - Flags: review?(mcmanus) → review+
Attachment #8664191 - Flags: checkin+
Keywords: checkin-needed
checkin only for patch bug_1152046_disable_on_ffos.patch
fix bustage. Sorry.

https://treeherder.mozilla.org/#/jobs?repo=try&revision=1a34a9dbaf39
Attachment #8664191 - Attachment is obsolete: true
Flags: needinfo?(dd.mozilla)
Attachment #8664957 - Flags: review+
Attachment #8664957 - Flags: checkin+
checkin needed only for patch  bug_1152046_disable_on_ffos.patch
Keywords: checkin-needed
sorry had to back this out again for another bustage, this time b2g https://treeherder.mozilla.org/logviewer.html#?job_id=14625952&repo=mozilla-inbound
Flags: needinfo?(dd.mozilla)
(In reply to Carsten Book [:Tomcat] from comment #87)
> sorry had to back this out again for another bustage, this time b2g
> https://treeherder.mozilla.org/logviewer.html#?job_id=14625952&repo=mozilla-
> inbound

This looks like a shutdown deadlock on the CompositorThread as in https://bugzilla.mozilla.org/show_bug.cgi?id=1204837#c14.
(In reply to Cervantes Yu [:cyu] [:cervantes] from comment #89)
> (In reply to Carsten Book [:Tomcat] from comment #87)
> > sorry had to back this out again for another bustage, this time b2g
> > https://treeherder.mozilla.org/logviewer.html#?job_id=14625952&repo=mozilla-
> > inbound
> 
> This looks like a shutdown deadlock on the CompositorThread as in
> https://bugzilla.mozilla.org/show_bug.cgi?id=1204837#c14.


Patch bug_1152046_disable_on_ffos.patch only disables use of the ClosingService thread on b2g (until bug 1204837).

I think there is another bug, a deadlock, and enabling/disabling the ClosingService only exposes it.
Currently all calls to PR_Close are dispatched to the ClosingService thread, and patch bug_1152046_disable_on_ffos.patch disables the ClosingService so PR_Close is called on the socket thread (as it used to be before bug_1152046_prclose_v2.patch landed, around September 15th). It is probably a timing issue in that other bug which this patch exposes. I do not see any other connection between the CompositorThread and this bug.
What do you think?
Flags: needinfo?(dd.mozilla) → needinfo?(cyu)
(In reply to Dragana Damjanovic [:dragana] from comment #90)
> 
> Patch bug_1152046_disable_on_ffos.patch only disables use of ClosingService
> thread on b2g (until bug 1204837). 
> 
> I thing there is another bug which is a deadlock and this enabling/disabling
> ClosingService only exposes it.
> Currently all calls to PR_Close are dispatch to a ClosingService thread. and
> patch bug_1152046_disable_on_ffos.patch disables ClosingService so PR_Close
> are called on SocketThread (this use to be before 
> bug_1152046_prclose_v2.patch landed, around 15.September). It is probably a
> timing issue that this patch exposes another bug. I do not see any other
> connection between CompositorThread and this bug.
> What do you think?

Exactly. Some patch introduced this regression after ClosingService landed, and it was hidden. Now it's revealed. I don't think ClosingService has anything to do with the deadlock on mochitest shutdown.
Flags: needinfo?(cyu)
(In reply to Dragana Damjanovic [:dragana] from comment #90)
> (In reply to Cervantes Yu [:cyu] [:cervantes] from comment #89)
> > (In reply to Carsten Book [:Tomcat] from comment #87)
> > > sorry had to back this out again for another bustage, this time b2g
> > > https://treeherder.mozilla.org/logviewer.html#?job_id=14625952&repo=mozilla-
> > > inbound
> > 
> > This looks like a shutdown deadlock on the CompositorThread as in
> > https://bugzilla.mozilla.org/show_bug.cgi?id=1204837#c14.
> 
> 
> Patch bug_1152046_disable_on_ffos.patch only disables use of ClosingService
> thread on b2g (until bug 1204837). 
> 
> I thing there is another bug which is a deadlock and this enabling/disabling
> ClosingService only exposes it.
> Currently all calls to PR_Close are dispatch to a ClosingService thread. and
> patch bug_1152046_disable_on_ffos.patch disables ClosingService so PR_Close
> are called on SocketThread (this use to be before 
> bug_1152046_prclose_v2.patch landed, around 15.September). It is probably a
> timing issue that this patch exposes another bug. I do not see any other
> connection between CompositorThread and this bug.
> What do you think?

If you are looking for a quick fix, just go with disabling the code on the specific platforms where it causes issues. We can investigate later what's actually wrong.

Personally I think your service shutdown code and some of my 'concurrent access' concerns might be worth checking on.
Flags: needinfo?(honzab.moz)
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #92)
> (In reply to Dragana Damjanovic [:dragana] from comment #90)
> > (In reply to Cervantes Yu [:cyu] [:cervantes] from comment #89)
> > > (In reply to Carsten Book [:Tomcat] from comment #87)
> > > > sorry had to back this out again for another bustage, this time b2g
> > > > https://treeherder.mozilla.org/logviewer.html#?job_id=14625952&repo=mozilla-
> > > > inbound
> > > 
> > > This looks like a shutdown deadlock on the CompositorThread as in
> > > https://bugzilla.mozilla.org/show_bug.cgi?id=1204837#c14.
> > 
> > 
> > Patch bug_1152046_disable_on_ffos.patch only disables use of ClosingService
> > thread on b2g (until bug 1204837). 
> > 
> > I thing there is another bug which is a deadlock and this enabling/disabling
> > ClosingService only exposes it.
> > Currently all calls to PR_Close are dispatch to a ClosingService thread. and
> > patch bug_1152046_disable_on_ffos.patch disables ClosingService so PR_Close
> > are called on SocketThread (this use to be before 
> > bug_1152046_prclose_v2.patch landed, around 15.September). It is probably a
> > timing issue that this patch exposes another bug. I do not see any other
> > connection between CompositorThread and this bug.
> > What do you think?
> 
> If you are looking for a quick fix, just go with disabling the code
> specifically per platforms it causes issues on.  We can investigate later
> what's actually wrong.
> 
> Personally I think your service shutdown code and some of my 'concurrent
> access' concerns might be worth checking on.

I needinfo'd you for a review of the patch bug_1152046_move_CLosingService_start_toioservice.patch


and the issue you responded to got complications: see bugs 1207979 and 1208019
Flags: needinfo?(honzab.moz)
Please note bug 1204837 lands a fix for ClosingService on b2g so we don't need to disable it on b2g anymore.
Dragana, it wasn't clear what you needed.

I'll review the patch.

I don't follow what you need from me on "the issues".

- according comment 94 it seems we no longer need bug 1208019, right?
- the same may apply to bug 1207979, or not?

Please state more clearly how I can help here.
Flags: needinfo?(honzab.moz) → needinfo?(dd.mozilla)
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #95)
> Dragana, it wasn't clear what you needed.
> 
> I'll review the patch.
> 
> I don't follow what you need from me on "the issues".
> 
> - according comment 94 it seems we no longer need bug 1208019, right?
> - the same may apply to bug 1207979, or not?
> 
> Please state more clearly how I can help here.

Yes, we do not need them, but there is another bug that I do not want to be missed, because it is going to show up at some point.
Explanation:

Patch bug_1152046_prclose_v2.patch (the implementation of ClosingService) was pushed on September 14th. On the same day, a couple of hours later, they tried to back it out, but it did not work because of:
https://bugzilla.mozilla.org/show_bug.cgi?id=1204837#c14

The thing is that a patch from bug 1127270 was pushed on the same day. If you disable the ClosingService or back it out, that patch causes problems; if the ClosingService is used, everything is fine. This bug and bug 1127270 do not have anything in common, so I believe there is a bug in the patch from bug 1127270 that shows up only if the ClosingService is not used.

So we do not need bug 1208019, but I want someone to look at bug 1127270 (ImageBridgeParent) and find an explanation for why it fails when the ClosingService is not used.
Flags: needinfo?(dd.mozilla)
Comment on attachment 8661965 [details] [diff] [review]
bug_1152046_move_CLosingService_start_toioservice.patch

Review of attachment 8661965 [details] [diff] [review]:
-----------------------------------------------------------------

Hmm.. are you sure that nsSocketTransportService::Init is not called more than once?

If there are continuing issues around this code I can take a look deeper next week I'm back at the office.

::: netwerk/base/ClosingService.cpp
@@ +184,5 @@
>  {
>    {
>      mozilla::MonitorAutoLock mon(mMonitor);
> +    if (mShutdown) {
> +      // This should not happened.

"This should never happen."

OTOH, I think it may and having a check like this is good.

Worth adding a similar "only-once" check to the start method?

::: netwerk/base/nsIOService.cpp
@@ +1064,5 @@
>              NS_ASSERTION(NS_SUCCEEDED(rv), "socket transport service shutdown failed");
>          }
> +        if (mShutdown) {
> +            ClosingService::Shutdown();
> +        }

maybe rather put this to the NS_XPCOM_SHUTDOWN_OBSERVER_ID topic handling code directly?

if there is some reason not to (and have it here rather) please add a comment.
Attachment #8661965 - Flags: review+
(In reply to Honza Bambas (not reviewing) (:mayhemer) from comment #97)
> Comment on attachment 8661965 [details] [diff] [review]
> bug_1152046_move_CLosingService_start_toioservice.patch
> 
> Review of attachment 8661965 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> Hmm.. are you sure that nsSocketTransportService::Init is not called just
> once?

I noticed that nsSocketTransportService::Shutdown can be called more than once; it is called on profile change as well, and that is a problem :). (Init is called just once.)


> 
> ::: netwerk/base/nsIOService.cpp
> @@ +1064,5 @@
> >              NS_ASSERTION(NS_SUCCEEDED(rv), "socket transport service shutdown failed");
> >          }
> > +        if (mShutdown) {
> > +            ClosingService::Shutdown();
> > +        }
> 
> maybe rather put this to the NS_XPCOM_SHUTDOWN_OBSERVER_ID topic handling
> code directly?
> 
I have put it there, after the SocketTransportService shutdown, to let more PR_Close calls run, because most of the time they are not the ones causing the problem.
check in only for 

bug_1152046_move_CLosingService_start_toioservice.patch
Keywords: checkin-needed
Attachment #8664957 - Flags: checkin+
Dragana, do you have plans wrt this bug for 42? Thanks
Flags: needinfo?(dd.mozilla)
(In reply to Sylvestre Ledru [:sylvestre] from comment #103)
> Dragana, do you have plans wrt to this bug for 42? Thanks

I was not planning to uplift the last two patches, although there has been no problem with the patch (since September 17th or so).
Flags: needinfo?(dd.mozilla)
OK, I will mark 42 & 43 as wontfix then. Thanks
I was looking at some crash logs again:
I have not seen any PR_Close hangs.
I have seen PR_Connect hangs, even though we check gIOService->IsShutdown() before trying to connect, so it hangs there for a long time.
There is also an nsAutodial hang (we also do not try to reconnect in case of shutdown).
And there are some poll hangs.

Things that can have an influence on these hangs:
avcuf32.dll
rlls.dll
winhadnt.dll
(but not all reports have one of these)

Win sockets are made nonblocking using rv = _md_MakeNonblock((HANDLE)osfd); just before calling connect():
http://mxr.mozilla.org/mozilla-central/source/nsprpub/pr/src/md/windows/ntio.c#1160
I could check whether we set the nonblocking flag correctly, but it is hard to check whether it is really set on the OS socket.

I have also read some of the comments people left: there were 2-3 saying that Firefox is really slow. I could assume Firefox was slow, so they tried to shut it down and this crash happened; probably there was already a problem before they clicked shutdown.
See Also: → 1215970
Depends on: 1233237
(In reply to Dragana Damjanovic [:dragana] from comment #106)
> win sockets are made nonoblocking using rv = _md_MakeNonblock((HANDLE)osfd);
> just before calling connect(); 
> http://mxr.mozilla.org/mozilla-central/source/nsprpub/pr/src/md/windows/ntio.
> c#1160
> I could check whether we set nonblocking flag correctly but it is hard to
> check if it is really set to the os socket.

Heads up: ntio.c isn't used by our codebase, as NSPR is configured with the WIN95 configuration. See https://developer.mozilla.org/en-US/docs/Mozilla/Projects/NSPR/NSPR_build_instructions#--enable-win32-target.3DWIN95

It looks like the WIN95 configuration does not support this at all.
(In reply to Aaron Klotz [:aklotz] (please use needinfo) from comment #107)
> (In reply to Dragana Damjanovic [:dragana] from comment #106)
> > win sockets are made nonoblocking using rv = _md_MakeNonblock((HANDLE)osfd);
> > just before calling connect(); 
> > http://mxr.mozilla.org/mozilla-central/source/nsprpub/pr/src/md/windows/ntio.
> > c#1160
> > I could check whether we set nonblocking flag correctly but it is hard to
> > check if it is really set to the os socket.
> 
> Heads up: ntio.c isn't used by our codebase, as NSPR is configured with the
> WIN95 configuration. See
> https://developer.mozilla.org/en-US/docs/Mozilla/Projects/NSPR/
> NSPR_build_instructions#--enable-win32-target.3DWIN95
> 
> It looks like the WIN95 configuration does not support this at all.

We do set PR_SockOpt_Nonblocking = true on the socket in nsSocketTransport::InitiateSocket(), but looking into how that is handled, I cannot find that we actually set any option directly on the system socket. The only thing we do is set the secret->nonblocking = true flag on our FD in prmapopt.c!_PR_SocketSetSocketOption. This flag, however, doesn't lead to any other setting being made on the socket (w95sock.c).

Dragana, we could have a private patch for NSPR that copies that particular piece of code from ntio.c to w95sock.c, to try. Submitting it directly to NSPR should not be that hard if it is found to work.
This is a short update.

w95sock.c always makes sockets nonblocking:
http://mxr.mozilla.org/mozilla-central/source/nsprpub/pr/src/md/windows/w95sock.c#63

and blocking mode is emulated inside NSPR: for each read/write on a blocking socket, it loops until the operation completes.

I will dig into this further.
(In reply to Dragana Damjanovic [:dragana] from comment #109)
> This is short update.
> 
> w95sock.c always makes sockets nonblocking:
> http://mxr.mozilla.org/mozilla-central/source/nsprpub/pr/src/md/windows/
> w95sock.c#63

I knew it was somewhere!  Thanks Dragana.
Depends on: 1238910
Depends on: 1239655
(In reply to Dragana Damjanovic [:dragana] from comment #111)
> I notice couple of crashes on PR_ConnectContinue:

In these three cases it seems to have stopped here: http://hg.mozilla.org/releases/mozilla-beta/annotate/7ccdff6c2b81/nsprpub/pr/src/io/prsocket.c#l294 .. which is a getsockopt() call on the actual underlying socket descriptor.
(In reply to Daniel Stenberg [:bagder] from comment #112)
> (In reply to Dragana Damjanovic [:dragana] from comment #111)
> > I notice couple of crashes on PR_ConnectContinue:
> 
> In these three cases it seems to have stopped here:
> http://hg.mozilla.org/releases/mozilla-beta/annotate/7ccdff6c2b81/nsprpub/pr/
> src/io/prsocket.c#l294 .. which is a getsockopt() call on the actual
> underlying socket descriptor.

I know, but when we are in shutdown we can just make it fail here.
I noticed that we are doing some reads on the socket when we are closing an nsHttpConnection:
http://mxr.mozilla.org/mozilla-central/source/netwerk/protocol/http/nsHttpConnection.cpp#599

If we need to reduce the number of operations we do when we are in shutdown, we can skip this. I have not seen any crash on this, but I am writing it down here just to keep it in mind in case we need it.
Depends on: 1239961
Depends on: 1240269
Depends on: 1240481
Depends on: 1241295
Whiteboard: [necko-active]
I am going to close this bug. Bug 1158189 is used as the meta bug. For simplicity, we are going to open separate bugs for similar issues, since this one has become flooded.
Status: ASSIGNED → RESOLVED
Closed: 4 years ago
Keywords: leave-open
Resolution: --- → FIXED
Target Milestone: --- → mozilla44
You need to log in before you can comment on or make changes to this bug.