237934 - nss_InitLock not atomic

Assignee

Description

•

21 years ago

We ran into a problem on a multi-processor machine in the SSL code in ssl3con.c where a thread tries to PZ_Unlock(symWrapKeyLock) and asserts because another thread owns the lock . There is only one place in NSS where that lock is locked - earlier in the same function. The lock is created by the following sequence : /* atomically initialize the lock */ if (!symWrapKeysLock) nss_InitLock(&symWrapKeysLock, nssILockOther); I think what happened is that multiple threads created the lock, and that lock creation is not truly atomic. Here is the code for nss_InitLock in util, which has not changed in current versions of NSS : /* Given the address of a (global) pointer to a PZLock, * atomicly create the lock and initialize the (global) pointer, * if it is not already created/initialized. */ SECStatus __nss_InitLock( PZLock **ppLock, nssILockType ltype ) { static PRInt32 initializers; PORT_Assert( ppLock != NULL); /* atomically initialize the lock */ while (!*ppLock) { PRInt32 myAttempt = PR_AtomicIncrement(&initializers); if (myAttempt == 1) { *ppLock = PZ_NewLock(ltype); (void) PR_AtomicDecrement(&initializers); break; } PR_Sleep(PR_INTERVAL_NO_WAIT); /* PR_Yield() */ (void) PR_AtomicDecrement(&initializers); } return (*ppLock != NULL) ? SECSuccess : SECFailure; } I believe this code is missing a test, which I added below : if (myAttempt == 1) { if (!*ppLock) { *ppLock = PZ_NewLock(ltype); } (void) PR_AtomicDecrement(&initializers); break; } The reason for adding this test is the following case, which could occur even on a single processor machine : 1) thread 1 gets suspended inside the loop, right after *ppLock is tested for NULL, but before PR_AtomicIncrement executes 2) thread 2 executes, increments the counter to 1, creates the lock, decrements it to 0, and exits the function 3) thread 1 resumes execution, increments the counter to 1, overwrites the lock, and decrements the counter Besides the atomicity bug, I think this function has performance issues. It uses a single initializers static variable to serialize creation of all locks in the process, regardless of the lock address . This also brings the following question : why are we creating this global lock dynamically in SSL, rather than at initialization time ? There aren't many places in NSS fortunately where nss_InitLock is called so the atomicity and performance problems hopefully should be limited, but we have actually run into the atomicity issue several times in tests.

Proposed patch (checked in) 21 years ago Wan-Teh Chang 1.60 KB, patch	nelson : review+ julien.pierre : superreview+ chofmann : approval1.7+	Details \| Diff \| Splinter Review
work around issue for SSL server applications 21 years ago Julien Pierre 4.52 KB, patch	nelson : review+	Details \| Diff \| Splinter Review
updated patch 21 years ago Julien Pierre 3.71 KB, patch	nelson : review+	Details \| Diff \| Splinter Review