588698 - SSL deadlock (seen in Thunderbird)

Reporter

Description

•

15 years ago

Attached file Shark sample — Details

I noticed wanted to go back to Thunderbird (which was a background task), I couldn't because it was beach-balling like hell. David from the shark sample this might be imap related, what do you think ? Then I asked nicely to gdb to give me a stack and I got : (gdb) bt #0 0x941742ae in semaphore_wait_signal_trap () #1 0x9417bd85 in pthread_mutex_lock () #2 0x02793032 in PR_Lock () #3 0x027931a4 in PR_EnterMonitor () #4 0x028028c1 in ssl3_GatherCompleteHandshake () #5 0x0280295b in ssl3_GatherAppDataRecord () #6 0x0280aa51 in ssl_SecureRecv () #7 0x02810506 in ssl_Recv () #8 0x00ab0eb7 in nsSSLThread::requestRecvMsgPeek () #9 0x00ac1145 in PSMRecv () #10 0x0005ccf3 in nsSocketTransport::IsAlive () #11 0x00c9c6b1 in nsImapProtocol::CanHandleUrl () #12 0x00c71958 in nsImapIncomingServer::GetImapConnection () #13 0x00c71ff8 in nsImapIncomingServer::GetImapConnectionAndLoadUrl () #14 0x00cb7e13 in nsImapService::GetImapConnectionAndLoadUrl () #15 0x00cbb66e in nsImapService::SelectFolder () #16 0x00c7aea1 in nsImapMailFolder::UpdateFolderWithListener () #17 0x00c76a83 in nsImapMailFolder::UpdateFolder () #18 0x00c7bd54 in nsImapMailFolder::OnNewIdleMessages () #19 0x0268bb78 in NS_InvokeByIndex_P () #20 0x0267fe6d in nsProxyObjectCallInfo::Run () #21 0x0267a63c in nsThread::ProcessNextEvent () #22 0x02637307 in NS_ProcessPendingEvents_P () #23 0x00203a61 in nsBaseAppShell::NativeEventCallback () #24 0x001cb757 in nsAppShell::ProcessGeckoEvents () #25 0x906c53c5 in CFRunLoopRunSpecific () #26 0x906c5aa8 in CFRunLoopRunInMode () #27 0x968bf2ac in RunCurrentEventLoopInMode () #28 0x968bf0c5 in ReceiveNextEventCommon () #29 0x968bef39 in BlockUntilNextEventMatchingListInMode () #30 0x95d8b6d5 in _DPSNextEvent () #31 0x95d8af88 in -[NSApplication nextEventMatchingMask:untilDate:inMode:dequeue:] () #32 0x95d83f9f in -[NSApplication run] () #33 0x001cae18 in nsAppShell::Run () #34 0x00a86377 in nsAppStartup::Run () #35 0x00006fd4 in XRE_main () #36 0x00002f50 in main ()

David :Bienvenu

Comment 1

•

15 years ago

Are there any other stacks holding onto an ssl monitor?

Ludovic Hirlimann [:Usul]

Reporter

Comment 2

•

15 years ago

(In reply to comment #1) > Are there any other stacks holding onto an ssl monitor? How would I get that (ain't sure I get what you are asking for ) ?

David :Bienvenu

Comment 3

•

15 years ago

gdb lets you look at other threads (I assume), and I'd like to see any stacks of those threads that look like they might be holding onto the monitor.

Ludovic Hirlimann [:Usul]

Reporter

Comment 4

•

15 years ago

So got the hang again and here are the Threads that had ssl or monitor in them : Thread 3 (process 2012 thread 0x2007): #0 0x95eb12ae in semaphore_wait_signal_trap () #1 0x95eb8d85 in pthread_mutex_lock () #2 0x027ae032 in PR_Lock () #3 0x027ae1a4 in PR_EnterMonitor () #4 0x02824a19 in SSL_DataPending () #5 0x0282a839 in ssl_Poll () #6 0x00ab715f in nsSSLThread::requestPoll () #7 0x00ac73d7 in nsSSLIOLayerPoll () #8 0x027afb80 in PR_Poll () #9 0x0005f4d4 in nsSocketTransportService::Poll () #10 0x000601b2 in nsSocketTransportService::DoPollIteration () #11 0x00060740 in nsSocketTransportService::OnProcessNextEvent () #12 0x026965e1 in nsThread::ProcessNextEvent () #13 0x02653307 in NS_ProcessPendingEvents_P () #14 0x0005fdc8 in nsSocketTransportService::Run () #15 0x0269663c in nsThread::ProcessNextEvent () #16 0x026533aa in NS_ProcessNextEvent_P () #17 0x02696810 in nsThread::ThreadFunc () #18 0x027b3892 in _pt_root () #19 0x95ee2155 in _pthread_start () #20 0x95ee2012 in thread_start () and Thread 1 hread 1 (process 2012 thread 0x20b): #0 0x95eb12ae in semaphore_wait_signal_trap () #1 0x95eb8d85 in pthread_mutex_lock () #2 0x027ae032 in PR_Lock () #3 0x027ae1a4 in PR_EnterMonitor () #4 0x0281d8c1 in ssl3_GatherCompleteHandshake () #5 0x0281d95b in ssl3_GatherAppDataRecord () #6 0x02825a51 in ssl_SecureRecv () #7 0x0282b506 in ssl_Recv () #8 0x00ab7377 in nsSSLThread::requestRecvMsgPeek () #9 0x00ac7655 in PSMRecv () #10 0x0005d773 in nsSocketTransport::IsAlive () #11 0x00ca33a1 in nsImapProtocol::CanHandleUrl () #12 0x00c78628 in nsImapIncomingServer::GetImapConnection () #13 0x00c78cc8 in nsImapIncomingServer::GetImapConnectionAndLoadUrl () #14 0x00cbeb43 in nsImapService::GetImapConnectionAndLoadUrl () #15 0x00cc239e in nsImapService::SelectFolder () #16 0x00c89fc1 in nsImapMailFolder::UpdateFolderWithListener () #17 0x00c7d753 in nsImapMailFolder::UpdateFolder () #18 0x00c82834 in nsImapMailFolder::OnNewIdleMessages () #19 0x026a7b78 in NS_InvokeByIndex_P () #20 0x0269be6d in nsProxyObjectCallInfo::Run () #21 0x0269663c in nsThread::ProcessNextEvent () #22 0x02653307 in NS_ProcessPendingEvents_P () #23 0x00204bf1 in nsBaseAppShell::NativeEventCallback () #24 0x001cc787 in nsAppShell::ProcessGeckoEvents () #25 0x96eab3c5 in CFRunLoopRunSpecific () #26 0x96eabaa8 in CFRunLoopRunInMode () #27 0x905d52ac in RunCurrentEventLoopInMode () #28 0x905d50c5 in ReceiveNextEventCommon () #29 0x905d4f39 in BlockUntilNextEventMatchingListInMode () #30 0x9389c6d5 in _DPSNextEvent () #31 0x9389bf88 in -[NSApplication nextEventMatchingMask:untilDate:inMode:dequeue:] () #32 0x93894f9f in -[NSApplication run] () #33 0x001cbe48 in nsAppShell::Run () #34 0x00a8caf7 in nsAppStartup::Run () #35 0x00007584 in XRE_main () #36 0x000034b0 in main () So that makes it two threads Anything else you need david ?

David :Bienvenu

Comment 5

•

15 years ago

I think I need someone who knows about PSM to say if this is a PSM bug or not...If I can't call nsSocketTransport::IsAlive, that's going to break a lot of things.

Kai Engert [:KaiE:]

Comment 6

•

15 years ago

Can you please say which Thunderbird version you're using? This will tell us which NSPR and NSS versions are used.

Assignee: nobody → nobody

Component: General → Libraries

Product: Thunderbird → NSS

QA Contact: general → libraries

Version: Trunk → trunk

Kai Engert [:KaiE:]

Updated

•

15 years ago

Summary: Hanged while in the background → Thunderbird SSL deadlock while in the background

Ludovic Hirlimann [:Usul]

Reporter

Comment 7

•

15 years ago

Mozilla/5.0 (Macintosh; Intel Mac OS X 10.5; rv:2.0b5pre) Gecko/20100824 Shredder/3.2a1pre

Kai Engert [:KaiE:]

Comment 8

•

15 years ago

If you can reproduce again, it would be interesting whether both stacks try to lock the same or different monitors, and same or different locks. (They'll probably be both the same, but would be good to confirm). I propose, in gdb, use "up" to go to PR_EnterMonitor (up 3), then use the following commands to print some values: print mon print mon->lock print mon->entryCount Do this for both threads, please Also, it would be interesting whether both refer to the same socket, or different sockets. In "thread 3" (or your future equivalent) please go to the frame with SSL_DataPending (use "up") and print ss In "thread 1" (or your future equivalent) please go to the frame with ssl3_GatherCompleteHandshake (use "up") and print ss

Kai Engert [:KaiE:]

Comment 9

•

15 years ago

(In reply to comment #7) > Mozilla/5.0 (Macintosh; Intel Mac OS X 10.5; rv:2.0b5pre) Gecko/20100824 > Shredder/3.2a1pre Thanks. I believe Shredder 3.2 uses mozilla-central. You're using the latest nightly build, so you're probably using NSS 3.12.8 beta 2.

Version: trunk → 3.12.8

Nelson Bolyard (seldom reads bugmail)

Updated

•

15 years ago

Summary: Thunderbird SSL deadlock while in the background → Speculation: Thunderbird SSL deadlock while in the background

Shark sample 15 years ago Ludovic Hirlimann [:Usul] 417.76 KB, application/octet-stream		Details
SSL_DataPending only needs to get recvBufLock (checked in) 15 years ago Wan-Teh Chang 887 bytes, patch	nelson : superreview+	Details \| Diff \| Splinter Review
Patch to discover and document current locking order, by Adam Langley 15 years ago Wan-Teh Chang 10.36 KB, patch	agl : review+ nelson : superreview+	Details \| Diff \| Splinter Review
Patch to discover and document current locking order, v2, by Adam Langley (checked in) 15 years ago Wan-Teh Chang 10.70 KB, patch		Details \| Diff \| Splinter Review
Comment out locking order assertion in ssl_Get1stHandshakeLock (checked in) 15 years ago Wan-Teh Chang 1.38 KB, patch		Details \| Diff \| Splinter Review