164512 - CERT_FindCertIssuer in >3 threads stalls under OS/2 SMP - VACPP bug

Reporter

Description

23 years ago

While running a multi-threaded NSS CRL cache stress test on my dual Athlon OS/2 SMP machine, I encountered what seemed to be a deadlock in this function. The process was stalled, and nothing was running on either processor anymore.

My test was running with 11 threads - one main thread waiting for 10 test threads in a loop during certificate verification. I attached to the process to find out what was going on. 3 threads were stuck in SemRequest486, while the 7 others were in _SemRequest of the C library. Somehow, all 10 test threads were stuck and nothing got to run anymore.

Here is the stack of a thread in SemRequest486 :

Function                                  | Part
------------------------------------------+-----------------
SemRequest486                             | OS2VACPP.OBJ
nssSession_EnterMonitor                   | DEVSLOT.OBJ
find_objects                              | DEVTOKEN.OBJ
find_objects_by_template                  | DEVTOKEN.OBJ
nssToken_FindCertificatesBySubject        | DEVTOKEN.OBJ
nssTrustDomain_FindCertificatesBySubject  | TRUSTDOMAIN.OBJ
find_cert_issuer                          | CERTIFICATE.OBJ
nssCertificate_BuildChain                 | CERTIFICATE.OBJ
NSSCertificate_BuildChain                 | CERTIFICATE.OBJ
CERT_FindCertIssuer                       | CERTVFY.OBJ
cert_VerifyCertChain                      | CERTVFY.OBJ
CERT_VerifyCertificate                    | CERTVFY.OBJ
VerifyCert                                | CERTUTIL.OBJ
_PR_NativeRunThread                       | PRUTHR.OBJ
open__7filebufFPCciT2                     | cpprmi36.dll:2
0x1FFECE33                                | DOSCALL1.DLL:4

Another one in the heap lock :

Function                             | Part
-------------------------------------+-----------------
0x1FFDE361                           | DOSCALL1.DLL:3
_SemRequest                          | cpprmi36.dll:2
free                                 | cpprmi36.dll:2
PR_Free                              | PRMEM.OBJ
PR_DestroyLock                       | PRULOCK.OBJ
PORT_FreeArena                       | SECPORT.OBJ
nss3certificate_getIssuerIdentifier  | PKI3HACK.OBJ
find_cert_issuer                     | CERTIFICATE.OBJ
nssCertificate_BuildChain            | CERTIFICATE.OBJ
NSSCertificate_BuildChain            | CERTIFICATE.OBJ
CERT_FindCertIssuer                  | CERTVFY.OBJ
cert_VerifyCertChain                 | CERTVFY.OBJ
CERT_VerifyCertificate               | CERTVFY.OBJ
VerifyCert                           | CERTUTIL.OBJ
_PR_NativeRunThread                  | PRUTHR.OBJ
open__7filebufFPCciT2                | cpprmi36.dll:2
0x1FFECE33                           | DOSCALL1.DLL:4

After about 20s of inactivity, the process resumed execution and eventually completed.
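For readers who want to picture the contention pattern described above, here is a minimal sketch of that kind of test. It is not the attached test program and does not use the OS/2 or NSPR APIs: it assumes a POSIX threads environment, and the names run_stress, worker and worker_args are hypothetical. Each worker repeatedly takes one shared lock and also calls malloc/free, so both an application lock and the C library heap lock are exercised while the main thread waits for all workers.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical names throughout; this is not the attached test program. */
typedef struct {
    pthread_mutex_t *lock;   /* shared lock all workers contend on */
    long *counter;           /* shared counter protected by lock */
    int iterations;          /* lock/unlock cycles per worker */
} worker_args;

static void *worker(void *p)
{
    worker_args *a = (worker_args *)p;
    for (int i = 0; i < a->iterations; i++) {
        /* Allocate and free in the loop so the C library heap lock
           is exercised alongside the application lock. */
        void *block = malloc(64);
        pthread_mutex_lock(a->lock);
        (*a->counter)++;
        pthread_mutex_unlock(a->lock);
        free(block);
    }
    return NULL;
}

/* Start nthreads workers, wait for all of them, return the final count. */
long run_stress(int nthreads, int iterations)
{
    pthread_mutex_t lock;
    long counter = 0;
    pthread_t *tids = malloc(sizeof(pthread_t) * (size_t)nthreads);
    assert(tids != NULL);

    int rc = pthread_mutex_init(&lock, NULL);
    assert(rc == 0);

    worker_args args = { &lock, &counter, iterations };
    for (int i = 0; i < nthreads; i++) {
        rc = pthread_create(&tids[i], NULL, worker, &args);
        assert(rc == 0);
    }
    /* The main thread blocks here, like the 11th thread in the report. */
    for (int i = 0; i < nthreads; i++) {
        rc = pthread_join(tids[i], NULL);
        assert(rc == 0);
    }

    pthread_mutex_destroy(&lock);
    free(tids);
    return counter;
}
```

Calling run_stress(10, 1000) from a main function mirrors the 10 test threads plus one waiting main thread. If every lock request is eventually granted, the call returns and the count equals threads times iterations; a stall like the one reported would show up as the joins not returning for a long time.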

source for an OS/2 semaphore stress test. Please run on an SMP box (23 years ago, Julien Pierre, 2.71 KB, text/plain)
batch file to run test the same way I did (23 years ago, Julien Pierre, 217 bytes, text/plain)
source code to a program reproducing stalling problem (23 years ago, Julien Pierre, 3.57 KB, text/plain)
patches I had to make to build NSPR & NSS with EMX (I still have a problem with SMIME3.DLL) (23 years ago, Julien Pierre, 1.39 KB, patch)