Closed Bug 555760 Opened 15 years ago Closed 14 years ago

Extend VMPI to cover thread creation, mutexes, condition vars and atomics

Tracking

(Not tracked)

Status:

RESOLVED FIXED

Milestone:

Q3 11 - Serrano

People

(Reporter: siwilkin, Assigned: siwilkin)

References

Details

Attachments

(2 files, 16 obsolete files)

Initial patch 15 years ago Simon Wilkinson 64.19 KB, patch		Details \| Diff \| Splinter Review
Rebased, added unix atomics, added missing volatiles 15 years ago Simon Wilkinson 23.43 KB, patch		Details \| Diff \| Splinter Review
latest 15 years ago Simon Wilkinson 24.66 KB, patch		Details \| Diff \| Splinter Review
Latest (with full WinCE support) 15 years ago Simon Wilkinson 25.48 KB, patch		Details \| Diff \| Splinter Review
Added thread startup attributes 15 years ago Simon Wilkinson 32.84 KB, patch		Details \| Diff \| Splinter Review
Added further atomic primitives 14 years ago Simon Wilkinson 38.85 KB, patch		Details \| Diff \| Splinter Review
Added 14 years ago Simon Wilkinson 41.88 KB, patch		Details \| Diff \| Splinter Review
Latest. All platforms implemented. 14 years ago Simon Wilkinson 41.88 KB, patch	edwsmith : review+ kpalacz : review+ lhansen : review+ skekki : review+ dansmith : feedback+	Details \| Diff \| Splinter Review
Selftest for VMPI threads 14 years ago Simon Wilkinson 28.05 KB, patch	kpalacz : review+	Details \| Diff \| Splinter Review
Manifests for WinCE with win32cv included 14 years ago Simon Wilkinson 2.49 KB, patch		Details \| Diff \| Splinter Review
Post-review changes 14 years ago Simon Wilkinson 43.45 KB, patch		Details \| Diff \| Splinter Review
Latest selftests. API fully covered apart from MemoryBarriers 14 years ago Simon Wilkinson 57.60 KB, patch		Details \| Diff \| Splinter Review
Main patch. Post-review changes 14 years ago Simon Wilkinson 43.45 KB, patch		Details \| Diff \| Splinter Review
Latest selftests 14 years ago Simon Wilkinson 57.60 KB, patch		Details \| Diff \| Splinter Review
Main patch. Post-review changes and win32 fixes 14 years ago Simon Wilkinson 43.58 KB, patch		Details \| Diff \| Splinter Review
Latest selftests 14 years ago Simon Wilkinson 57.03 KB, patch		Details \| Diff \| Splinter Review
Latest selftests. Added conservative memory barrier. 14 years ago Simon Wilkinson 56.75 KB, patch	kpalacz : review+ edwsmith : superreview+	Details \| Diff \| Splinter Review
Latest selftests. Added docs. Fixed spacing. 14 years ago Simon Wilkinson 58.12 KB, patch		Details \| Diff \| Splinter Review

Simon Wilkinson

Assignee

Description

•

15 years ago

User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2) Gecko/20100115 Firefox/3.6 Build Identifier: The small amount of code that is currently multi-threaded is written in a platform-dependent manner. This will become unmanageable as the VM becomes more multi-threaded. A more attractive solution is to have a low-level thread porting layer (which looks similar to pthreads/OSAtomic), from which higher-level C++ abstractions can be built (monitors, barriers etc.) The attached patch is an extension to VMPI that defines a common interface and platform implementations for thread creation, mutexes, condition variables and atomics: void VMPI_memoryBarrier(); int32_t VMPI_atomicIncAndGet32(int32_t* address); int32_t VMPI_atomicIncAndGet32WithBarrier(int32_t* address); bool VMPI_threadCreate(vmpi_thread_t* thread, vmpi_thread_start_t start_fn, vmpi_thread_arg_t arg); bool VMPI_threadDetach(vmpi_thread_t thread); void VMPI_threadSleep(int32_t timeout_millis); void VMPI_threadJoin(vmpi_thread_t thread); bool VMPI_recursiveMutexInit(vmpi_mutex_t* mutex); bool VMPI_recursiveMutexDestroy(vmpi_mutex_t* mutex); void VMPI_recursiveMutexLock(vmpi_mutex_t* mutex); void VMPI_recursiveMutexUnlock(vmpi_mutex_t* mutex); bool VMPI_condVarInit(vmpi_condvar_t* condvar); bool VMPI_condVarDestroy(vmpi_condvar_t* condvar); void VMPI_condVarWait(vmpi_condvar_t* condvar, vmpi_mutex_t* mutex); bool VMPI_condVarTimedWait(vmpi_condvar_t* condvar, vmpi_mutex_t* mutex, int32_t timeout_millis); void VMPI_condVarBroadcast(vmpi_condvar_t* condvar); void VMPI_condVarSignal(vmpi_condvar_t* condvar); There are a few options for implementing Condition variables on Windows. We can discuss this. Symbian atomics are left unimplemented. Can someone advise? Reproducible: Always

Simon Wilkinson

Assignee

Comment 1

•

15 years ago

Attached patch Initial patch (obsolete) — Details — Splinter Review

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 538185

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 555765

Dan Smith

Updated

•

15 years ago

Flags: flashplayer-qrb+

Priority: -- → P3

Target Milestone: --- → flash10.2

Dan Smith

Updated

•

15 years ago

Assignee: nobody → siwilkin

Stan Switzer

Comment 2

•

15 years ago

See FP win32utils.cpp for a lightweight, unobvious but well-worn-in condition variable implementation based on QueueUserAPC(). You might also want to look at the FP Monitor.h for a Java-like monitor abstraction that hides locked variables except when locked. Contract enforcement FTW! And also it's a very good idea to build deadlock detection logic into the lock primitives in debug builds. Look at the FP impls as well.

Simon Wilkinson

Assignee

Comment 3

•

15 years ago

(In reply to comment #2) > See FP win32utils.cpp for a lightweight, unobvious but well-worn-in condition > variable implementation based on QueueUserAPC(). I actually based my implementation on this. Initially I was going to go for a boost-like algorithm (Alexander Terekhov's). But the win32utils implementation pointed the way to not being overly-concerned about absolutely fair scheduling to notified threads. > You might also want to look at the FP Monitor.h for a Java-like monitor > abstraction that hides locked variables except when locked. Contract > enforcement FTW! I have WaitNotifyMonitor in bug 555765, which builds C++ abstraction on top of the VMPI extensions. I'll take a look at the Monitor.h version. Thanks! > And also it's a very good idea to build deadlock detection logic into the lock > primitives in debug builds. Look at the FP impls as well. This is something I wanted to add to bug 555765 at some point, but as some others have mentioned, it would be a good thing to have only a single implementation of all this stuff.

Stan Switzer

Comment 4

•

15 years ago

(In reply to comment #3) > > You might also want to look at the FP Monitor.h for a Java-like monitor > > abstraction that hides locked variables except when locked. Contract > > enforcement FTW! > > I have WaitNotifyMonitor in bug 555765, which builds C++ abstraction on top of > the VMPI extensions. I'll take a look at the Monitor.h version. Thanks! > > > And also it's a very good idea to build deadlock detection logic into the lock > > primitives in debug builds. Look at the FP impls as well. > > This is something I wanted to add to bug 555765 at some point, but as some > others have mentioned, it would be a good thing to have only a single > implementation of all this stuff. Making more of that common would be very good; I remember there was a complication but probably not a serious one. I want to point out that _sharing_ the deadlock detection between the AvmCore and its clients will catch a broader class of problems than having parallel but separate ones. So let's be sure we find a way to share it if/when you do this.

Simon Wilkinson

Assignee

Comment 5

•

15 years ago

Attached patch Rebased, added unix atomics, added missing volatiles (obsolete) — Details — Splinter Review

- Rebased to latest version. - Added inline asm implementations of atomic primitives for Unix. - Added various missing volatile qualifiers

Attachment #435656 - Attachment is obsolete: true

Stan Switzer

Comment 6

•

15 years ago

I was looking at (possibly not the latest) version of windows spinlocks and a couple of comments: VMPI_lockAcquire should probably should use SwitchToThread rather than Sleep (but maybe not on CE). Also, typically a spin-lock would check if it's on an MP machine and if so briefly spin in a loop before yielding to the OS. When spin locks are being used correctly (for very brief spans of code) this will be much much faster. You needn't worry overmuch about pipeline stalls from the tests; after all wasting time is pretty much the whole point.

Simon Wilkinson

Assignee

Comment 7

•

15 years ago

(In reply to comment #6) > I was looking at (possibly not the latest) version of windows spinlocks and a > couple of comments: > > VMPI_lockAcquire should probably should use SwitchToThread rather than Sleep > (but maybe not on CE). > > Also, typically a spin-lock would check if it's on an MP machine and if so > briefly spin in a loop before yielding to the OS. When spin locks are being > used correctly (for very brief spans of code) this will be much much faster. > You needn't worry overmuch about pipeline stalls from the tests; after all > wasting time is pretty much the whole point. Opened up bug 562687 for this one.

Simon Wilkinson

Assignee

Updated

•

15 years ago

Comment 8

•

15 years ago

Attached patch latest (obsolete) — Details — Splinter Review

Added VMPI_recursiveMutexTryLock. Fixed atomics for Solaris, Android and WinCE.

Attachment #442335 - Attachment is obsolete: true

Stan Switzer

Comment 9

•

15 years ago

I'm looking at some of our thread and sync primitives to see what it would take to consolidate on these primitives. In addition to what you show, we use the following: - thread create has low/normal/high priority - there is a way to get and compare thread IDs (so that you can assert you are on the intended thread) (People tend to assume that you can compare pthread_t with ==, but then what is pthread_equal() for? Standards can be so aggravating that way!) Presumably VMPI_threadCreate reserves "big enough" stack and guard areas. But really... how does it know? You might want a "yield" abstraction to address the problem that win32 SwitchToThread addresses. I'm wondering what VMPI_threadSleep is intended for, but something like a "yield" would be better than sleep() in a spin-lock implementation. Finally, to the extent these primitives are clearly modelled on POSIX (yay!), shouldn't time be represented in nanoseconds? A millisecond is, like, _forever_.

Simon Wilkinson

Assignee

Comment 10

•

15 years ago

(In reply to comment #9) > I'm looking at some of our thread and sync primitives to see what it would take > to consolidate on these primitives. In addition to what you show, we use the > following: > > - thread create has low/normal/high priority This would actually be quite useful for the VM (e.g. lower-priority compiler threads). A pthread_attr_ type of interface is the natural choice. This can cover your requirement for stack/guard sizing too. > - there is a way to get and compare thread IDs (so that you can assert you are > on the intended thread) > > (People tend to assume that you can compare pthread_t with ==, but then what is > pthread_equal() for? Standards can be so aggravating that way!) Is the current vmpi_thread_t ID not sufficient? > Presumably VMPI_threadCreate reserves "big enough" stack and guard areas. But > really... how does it know? See the pthread_attr_ comment above. > You might want a "yield" abstraction to address the problem that win32 > SwitchToThread addresses. I'm wondering what VMPI_threadSleep is intended for, > but something like a "yield" would be better than sleep() in a spin-lock > implementation. A efficient yield is on my to-do list. > Finally, to the extent these primitives are clearly modelled on POSIX (yay!), > shouldn't time be represented in nanoseconds? A millisecond is, like, > _forever_. I'd say it's about 90% POSIX flavored. That was one of my win32 compromises. May be a bad one!

Simon Wilkinson

Assignee

Updated

•

15 years ago

Depends on: 574695

Simon Wilkinson

Assignee

Comment 11

•

15 years ago

Attached patch Latest (with full WinCE support) (obsolete) — Details — Splinter Review

WinCE builds now use the condition variable implementation from bug 574695

Attachment #453186 - Attachment is obsolete: true

Stan Switzer

Comment 12

•

15 years ago

> > - thread create has low/normal/high priority > ... > A pthread_attr_ type of interface is the natural choice. This can cover your > requirement for stack/guard sizing too. Yeah, I guess. It's sorta chatty but nicely extensible. The pthread init/set/destroy drill is tedious and also somewhat dangerous since some of those calls can be no-ops on some platforms. That is: it could work perfectly fine on some platforms but have subtle problems on others. > > - there is a way to get and compare thread IDs (so that you can assert you are > > on the intended thread) > > > > (People tend to assume that you can compare pthread_t with ==, but then what is > > pthread_equal() for? Standards can be so aggravating that way!) > > > Is the current vmpi_thread_t ID not sufficient? Not sure; you may need a thread-id-compare primitive. As I was saying, == seems to work just fine in practice, but POSIX apparently doesn't guarantee it. I suppose that in a pinch a platform could define vmpi_thread_t as a class with an overload ==, if it ever came to that. > > Finally, to the extent these primitives are clearly modelled on POSIX (yay!), > > shouldn't time be represented in nanoseconds? A millisecond is, like, > > _forever_. > > I'd say it's about 90% POSIX flavored. That was one of my win32 compromises. > May be a bad one! Maybe define it in nanoseconds with the caveat that precision and accuracy may be wildly different. But then you might need to define a constant for the granularity or the smallest value that isn't rounded down to zero. BTW, how would you round it? POSIX is apparently mute on this but a fair argument can be made that you'd have to round up (at least for values that would truncate to zero): otherwise there is no safe small wait constant that couldn't always result in ETIMEDOUT, and that seems wrong.

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: asymmGC

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 582772

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 582776

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 582782

Simon Wilkinson

Assignee

Updated

•

15 years ago

Blocks: 582817

Simon Wilkinson

Assignee

Comment 13

•

15 years ago

Attached patch Added thread startup attributes (obsolete) — Details — Splinter Review

This new revision addresses some of Stan's requirements in aligning this work with the current FP implementation. Added: - Specify thread priority (low/normal/high) at thread startup. - Specify stack size at thread startup. - Specify guard area size at thread startup. The API looks like pthread_attr_*, but is much smaller and doesn't include the unnecessary get* functions. The idea is that higher-level C++ thread abstractions will hide the opaque pointer handling (e.g. see VMThread::start(ThreadPriority priority) in bug 555765) Clearly all of our platforms don't support all of the options. Summarized below: pthread win32 priority no yes stack size yes yes guard size yes no (possible starting with Vista) One unresolved question: can you find out the default stack size of a win32 process programmatically? (i.e. the value set by the linker) At the moment I just guess it's going to be the 1MB default.

Attachment #454057 - Attachment is obsolete: true

Lars T Hansen

Updated

•

14 years ago

Status: UNCONFIRMED → NEW

Ever confirmed: true

Lars T Hansen

Comment 14

•

14 years ago

Re Symbian, adding Samuli for possible comment on the issue. Re Win32 stack size, I have a vague memory from somewhere that the default stack size is 512KB but that seems laughably small. We do have VMPI functionality to query what it actually is (or at least that code can be adapted to suit your needs), will that not work properly for you?

Samuli Kekki

Comment 15

•

14 years ago

On Symbian you set the stack size at compile time. We can return the same hard coded value from a VMPI method as needed. 80k is the max size on Symbian OS as of today. Whether all the other proposed APIs are possible to implement on Symbian would require some investigation.

Simon Wilkinson

Assignee

Comment 16

•

14 years ago

(In reply to comment #14) > Re Win32 stack size, I have a vague memory from somewhere that the default > stack size is 512KB but that seems laughably small. We do have VMPI > functionality to query what it actually is (or at least that code can be > adapted to suit your needs), will that not work properly for you? The default stack is 1MB: http://msdn.microsoft.com/en-us/library/8cxs58a6%28vs.71%29.aspx. But it can be changed at link time per application. However, I can't find out what the linker set the default to programmatically from within the application. I.e. I can't find an equivalent of: size_t getThisApplicationsDefaultStackSize() { pthread_attr_t attr; size_t size; pthread_attr_init(&attr); pthread_attr_getstacksize(&attr, &size); return size; } Do you mean that there is somewhere in VMPI that has solved this for win32?

Simon Wilkinson

Assignee

Comment 17

•

14 years ago

(In reply to comment #15) > On Symbian you set the stack size at compile time. We can return the same hard > coded value from a VMPI method as needed. 80k is the max size on Symbian OS as > of today. > > Whether all the other proposed APIs are possible to implement on Symbian would > require some investigation. I've done some investigations. I think the only difficult part it support for atomic primitives. Although a compare-and-swap is all we need to emulate everything else.

Simon Wilkinson

Assignee

Comment 18

•

14 years ago

Attached patch Added further atomic primitives (obsolete) — Details — Splinter Review

Added addition atomic operations. Implemented for all platforms apart from Symbian and Android. Added: int32_t VMPI_atomicDecAndGet32(volatile int32_t* value); int32_t VMPI_atomicDecAndGet32WithBarrier(volatile int32_t* value); bool VMPI_compareAndSwap32(int32_t oldValue, int32_t newValue, volatile int32_t* address); bool VMPI_compareAndSwap32WithBarrier(int32_t oldValue, int32_t newValue, volatile int32_t* address); void VMPI_memoryBarrier();

Attachment #461634 - Attachment is obsolete: true

Stan Switzer

Comment 19

•

14 years ago

(In reply to comment #13) > > Clearly all of our platforms don't support all of the options. Summarized > below: > > pthread win32 > priority no yes > stack size yes yes > guard size yes no (possible starting with Vista) pthread_attr_setschedparam appears to allow you to control priorities but I gather that is for realtime scheduling modes we wouldn't typically use. A fair case can be made that we don't actually need those priorities and can just get rid of them. OS X platform code (and perhaps others) ignores the parameter anyway. Of the uses of anything other than normal priority, only the microphone code seems to be of any significance. No current use of low priority seems particularly well-motivated. Probably should ask turo. My suggestion would be something like this: the VMPI thread creation primitive has NO priority param priority and then we provide a separate primitive (perhaps NOT in VMPI) to allow a spawned thread to raise its own priority (if possible). That way VMPI can have a sufficient, well-defined mechanism for its purposes, and client code can goose the priority of threads when necessary and possible (microphone case).

Stan Switzer

Comment 20

•

14 years ago

(In reply to comment #19) > My suggestion would be something like... Or not. I suppose having a priority param that's honored if possible is a fine solution as well. I was looking for an alternative that allowed VMPI to push the vague stuff to the client layer, but you don't have to care about that if you don't want to. :)

Lars T Hansen

Comment 21

•

14 years ago

(In reply to comment #16) > Do you mean that there is somewhere in VMPI that has solved this for win32? Follow the calls from ShellCoreImpl::setStackLimit - I believe this is correct for both Win32 and pthreads, though it may be more limited than you need. I misremembered its location - it's in the shell code, and not in VMPI, because the Flash Player has its own code for this functionality.

Simon Wilkinson

Assignee

Comment 22

•

14 years ago

(In reply to comment #20) > My suggestion would be something like this: the VMPI thread creation primitive > has NO priority param priority and then we provide a separate primitive > (perhaps NOT in VMPI) to allow a spawned thread to raise its own priority (if > possible). That way VMPI can have a sufficient, well-defined mechanism for its > purposes, and client code can goose the priority of threads when necessary and > possible (microphone case). My original plan was to not have any platform-specific threading implementation above VMPI. So VMPI must provide the priority API if one is going to exist at all, even if it is eventually wrapped by higher-level thread abstractions. I think it's totally acceptable to have the priorities ignored if the platform cannot honor them.

Simon Wilkinson

Assignee

Comment 23

•

14 years ago

Attached patch Added (obsolete) — Details — Splinter Review

Attachment #470037 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 24

•

14 years ago

Attached patch Latest. All platforms implemented. (obsolete) — Details — Splinter Review

Added in this version: - Atomic operations for Solaris - Atomic operations for Unix (a GCC intrinsics version if available, an x86 asm version, and a fallback mutex-based emulation for ARM) - Atomic operations for Symbian (currently uses the same mutex-based emulation as Unix/ARM) - WinCE cmdline build links win32cv from bug 574695 - Lots of trivial platform-specific fixes to satisfy the buildbot

Attachment #473962 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 25

•

14 years ago

Attached patch Selftest for VMPI threads (obsolete) — Details — Splinter Review

Added a set of selftests for VMPI threads, mutexes, conditions and atomics. These are available via -Dselftest=vmpi,threads. Updated the list of smokes to run the above selftests.

Simon Wilkinson

Assignee

Comment 26

•

14 years ago

Attached patch Manifests for WinCE with win32cv included (obsolete) — Details — Splinter Review

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #473967 - Flags: review?(skekki)

Attachment #473967 - Flags: review?(lhansen)

Attachment #473967 - Flags: review?(kpalacz)

Attachment #473967 - Flags: review?(edwsmith)

Attachment #473967 - Flags: feedback?(stan)

Simon Wilkinson

Assignee

Comment 27

•

14 years ago

Here's some outstanding issues that merit discussion: - Do we want wait/sleep times to be nanos, not millis. - Possible addition of a VMPI_threadYield(), who's goal is to minimize priority inversion. - An API for comparing the identity of threads. Currently, vmpi_thread_t's are just compared with ==, which looks to be ok. - Efficient atomics implementation for Symbian and Android (some API's are available). Also fallback asm implementation for ARM. Currently this is emulated with pthread mutexes.

Simon Wilkinson

Assignee

Comment 28

•

14 years ago

(In reply to comment #27) > Here's some outstanding issues that merit discussion: One more thing: - Should the atomics section of the API be maximal (i.e. have lots primitives like OSX's OSAtomics), or minimal (i.e. just have CAS and barrier, and build everything else in higher-level libraries, e.g. bug 555765's AtomicOps::or32)

Samuli Kekki

Updated

•

14 years ago

Attachment #473967 - Flags: review?(skekki) → review+

Krzysztof Palacz

Comment 29

•

14 years ago

Comment on attachment 473967 [details] [diff] [review] Latest. All platforms implemented. Nothing seems obviously wrong to me, as the saying goes. We have GC memory barriers, so we could refer to the ordering instructions as fences, but the probability of confusion is probably very low.

Attachment #473967 - Flags: review?(kpalacz) → review+

Lars T Hansen

Comment 30

•

14 years ago

Comment on attachment 473967 [details] [diff] [review] Latest. All platforms implemented. Overall this seems nice. (Not a subject where I have deep expertise.) Nits: I don't understand what 'rnt' stands for in 'vmpi_thread_rnt_t'; a comment or (almost always better anyway) a non-abbreviated name might constitute an improvement. TAB characters in ThreadsWin.cpp. Outstanding FIXME in ThreadsWin.cpp. Do you want dummyAPC do be static to hide it from view? In all instances where a comment says "Note that on some platforms ..." I would strongly encourage you to write down (in VMPI.h) /which/ platforms that you know about where this is an issue; it aids debugging when somebody trips over the bug, and is valuable information to keep around in any case. Yes, the information can go stale and yes it may need to be updated, but something is much better than nothing. I assume this warrants a FIXME (in unix-platform.h), given the imporance of ARM: +#ifdef AVMSYSTEM_ARM + +#define EMULATE_ATOMICS_WITH_PTHREAD_MUTEX

Attachment #473967 - Flags: review?(lhansen) → review+

Stan Switzer

Comment 31

•

14 years ago

(In reply to comment #28) > (In reply to comment #27) > > Here's some outstanding issues that merit discussion: > > One more thing: > > - Should the atomics section of the API be maximal (i.e. have lots primitives > like OSX's OSAtomics), or minimal (i.e. just have CAS and barrier, and build > everything else in higher-level libraries, e.g. bug 555765's AtomicOps::or32) Well IMO it's probably better to restrict ourselves to the things that can be implemented efficiently "everywhere." Because if, for instance, people assume that AtomicOps::or32 is primitive-efficient, people might use that in preference to a CAS-based solution which might be nearly as fast as a primitive or32-based solution but a lot faster than a software atomic or32 implementation. Or alternatively, they exist only when they're genuinely primitive and there's an ifdef telling the implementation whether to try to use them. Anyway, it seems to me that software emulation of atomic primitives has the potential to do more harm than good.

Stan Switzer

Comment 32

•

14 years ago

(In reply to comment #31) > (In reply to comment #28) > > (In reply to comment #27) > > > Here's some outstanding issues that merit discussion: > > > > One more thing: > > > > - Should the atomics section of the API be maximal (i.e. have lots primitives > > like OSX's OSAtomics), or minimal (i.e. just have CAS and barrier, and build > > everything else in higher-level libraries, e.g. bug 555765's AtomicOps::or32) > > Well IMO it's probably better to restrict ourselves to the things that can be > implemented efficiently "everywhere." Because if, for instance, people assume > that AtomicOps::or32 is primitive-efficient, people might use that in > preference to a CAS-based solution which might be nearly as fast as a primitive > or32-based solution but a lot faster than a software atomic or32 > implementation. > > Or alternatively, they exist only when they're genuinely primitive and there's > an ifdef telling the implementation whether to try to use them. > > Anyway, it seems to me that software emulation of atomic primitives has the > potential to do more harm than good. Meh, never mind, shouldn't reply before coffee kicks in. :) A CAS-based emulation would be trivial so the point is moot. Still, all things being equal, minimal primitives reduce the porting and testing burdens.

Dan Smith

Updated

•

14 years ago

Attachment #473967 - Flags: feedback?(stan) → feedback+

Dan Smith

Updated

•

14 years ago

Attachment #473967 - Flags: feedback?(stan)

Stan Switzer

Comment 33

•

14 years ago

Comment on attachment 473967 [details] [diff] [review] Latest. All platforms implemented. stan: feedback+

Dan Smith

Updated

•

14 years ago

Attachment #473967 - Flags: feedback?(stan)

Dan Smith

Comment 34

•

14 years ago

Removed Stan from the feedback list as he can't r+ directly there without higher privs. See comment 33 for his r+.

Edwin Smith

Updated

•

14 years ago

Attachment #473972 - Flags: review?(kpalacz)

Edwin Smith

Comment 35

•

14 years ago

Comment on attachment 473967 [details] [diff] [review] Latest. All platforms implemented. Nothing major to report, just a few questions for Simon/Krzystof to resolve. patch applied cleanly and rebased cleanly, even though it is old. pinging K. for selftest review. Without a rationale for less, it should get to 100% line coverage for the new code. if this cant be done before landing then followup bugs should be created. VMPI.h > Note that it is platform dependent if non-default attributes are honored or completely ignored. presumably this is okay if all attributes are hints, but if any are counted on by a client for correctness, and known to be non-portable, then should we assert if they are set and run on an unsupported client? > After a thread has been detached, any attempt to join the > thread has undefined behavior. > Multiple detachments of the same thread results in undefined behavior. can we detect these and assert? (similar for other undefined behaviors as I keep reading on... i wont list them all. essentially, I'm talking about asserts for all api invariants that are reasonably detectable). > // FIXME: Where do we define _WIN32_WINNT as _WIN32_WINNT_WINXP so that VC++ picks up the correct THREAD_ALL_ACCESS? Resolve, or create a bug and put bug# in comment, for example: // FIXME: bug XXX Where do we ... In ThreadsWin.cpp: size_t VMPI_threadAttrDefaultStackSize() { // Stack size is set in the linker. The default is 1MB: // http://msdn.microsoft.com/en-us/library/8cxs58a6%28vs.71%29.aspx // But can we find this out programmatically? There doesn't appear to be an API: // http://msdn.microsoft.com/en-us/library/ms686774%28v=VS.85%29.aspx return 1024 * 1024; } This probably needs a FIXME and bug#, depending on the likelihood and penalty for being wrong (crash!). I didn't look at other platforms but one bug to track the work is enough. (nit) ThreadsWin.cpp - stray #endifs should have a // WHATEVER saying what they match with. inconsistent blank lines between functions. maybe elsewhere, but this is where I noticed it. win32-platform.h, in VMPI_memoryBarrier(): > // This is a temp solution until we have a x86 asm version. worth a fixme bug?

Attachment #473967 - Flags: review?(edwsmith) → review+

Krzysztof Palacz

Comment 36

•

14 years ago

Comment on attachment 473972 [details] [diff] [review] Selftest for VMPI threads Looks good to me, minor nits: Don't we need to update the vcproj files as well? m_sharedCounter is public, I think it shouldn't have the m_ prefix. Ifdefing that 100000 would help understanding the code. There's no need to typedef structs in C++ (ThreadRecord).

Attachment #473972 - Flags: review?(kpalacz) → review+

Simon Wilkinson

Assignee

Comment 37

•

14 years ago

Attached patch Post-review changes (obsolete) — Details — Splinter Review

Attachment #473967 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 38

•

14 years ago

Attached patch Latest selftests. API fully covered apart from MemoryBarriers (obsolete) — Details — Splinter Review

Attachment #473972 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 39

•

14 years ago

Attached patch Main patch. Post-review changes (obsolete) — Details — Splinter Review

Attachment #473973 - Attachment is obsolete: true

Attachment #488527 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 40

•

14 years ago

(In reply to comment #30) > Comment on attachment 473967 [details] [diff] [review] > Latest. All platforms implemented. > > Overall this seems nice. (Not a subject where I have deep expertise.) > > Nits: > > I don't understand what 'rnt' stands for in 'vmpi_thread_rnt_t'; a comment or > (almost always better anyway) a non-abbreviated name might constitute an > improvement. Changed to be rtn, i.e. return. I've made a comment as to what its intended meaning is. > TAB characters in ThreadsWin.cpp. > > Outstanding FIXME in ThreadsWin.cpp. Fixed. > Do you want dummyAPC do be static to hide it from view? Yes. Commented. > In all instances where a comment says "Note that on some platforms ..." I would > strongly encourage you to write down (in VMPI.h) /which/ platforms that you > know about where this is an issue; it aids debugging when somebody trips over > the bug, and is valuable information to keep around in any case. Yes, the > information can go stale and yes it may need to be updated, but something is > much better than nothing. I've added comments where appropriate. > I assume this warrants a FIXME (in unix-platform.h), given the imporance of > ARM: > > +#ifdef AVMSYSTEM_ARM > + > +#define EMULATE_ATOMICS_WITH_PTHREAD_MUTEX See bug 609809 (Android) and bug 609810 (other ARM)

Simon Wilkinson

Assignee

Comment 41

•

14 years ago

(In reply to comment #35) > Comment on attachment 473967 [details] [diff] [review] > Latest. All platforms implemented. > > Nothing major to report, just a few questions for Simon/Krzystof to resolve. > patch applied cleanly and rebased cleanly, even though it is old. The new patches are rebased to the tr tip. > pinging K. for selftest review. Without a rationale for less, it should get to > 100% line coverage for the new code. if this cant be done before landing then > followup bugs should be created. I've added more coverage to the selftests. They should now cover the whole API apart from memory barrier functions. I've actually included a memory barrier selftest but turned it off until I'm more confident in its results. See bug 609943. > VMPI.h > > Note that it is platform dependent if non-default attributes are honored or completely ignored. > > presumably this is okay if all attributes are hints, but if any are counted on > by a client for correctness, and known to be non-portable, then should we > assert if they are set and run on an unsupported client? I've documented which attributes are supported per platform: pthread win32 (XP version APIs) priority no yes stack size yes yes guard size yes no Priority will only ever be a hint, there are no correctness issues. Should we just remove guard size from the attributes? > > After a thread has been detached, any attempt to join the > > thread has undefined behavior. > > Multiple detachments of the same thread results in undefined behavior. > > can we detect these and assert? (similar for other undefined behaviors as I > keep reading on... i wont list them all. essentially, I'm talking about asserts > for all api invariants that are reasonably detectable). I was going to put all the debug code within the C++ abstractions, one layer up. Clients should not be using these APIs directly. > > // FIXME: Where do we define _WIN32_WINNT as _WIN32_WINNT_WINXP so that VC++ picks up the correct THREAD_ALL_ACCESS? > > Resolve, or create a bug and put bug# in comment, for example: > > // FIXME: bug XXX Where do we ... Fixed. Should not matter for shell builds. Player builds will set the value correctly. > In ThreadsWin.cpp: > > size_t VMPI_threadAttrDefaultStackSize() > { > // Stack size is set in the linker. The default is 1MB: > // http://msdn.microsoft.com/en-us/library/8cxs58a6%28vs.71%29.aspx > // But can we find this out programmatically? There doesn't appear to be an > API: > // http://msdn.microsoft.com/en-us/library/ms686774%28v=VS.85%29.aspx > return 1024 * 1024; > } > > This probably needs a FIXME and bug#, depending on the likelihood and > penalty for being wrong (crash!). I didn't look at other platforms but one bug > to track the work is enough. Opened bug 609822 > (nit) ThreadsWin.cpp - stray #endifs should have a // WHATEVER saying what they > match with. inconsistent blank lines between functions. maybe elsewhere, but > this is where I noticed it. Fixed. > win32-platform.h, in VMPI_memoryBarrier(): > > // This is a temp solution until we have a x86 asm version. > > worth a fixme bug? Yes. See bug 609820

Simon Wilkinson

Assignee

Comment 42

•

14 years ago

(In reply to comment #36) > Comment on attachment 473972 [details] [diff] [review] > Selftest for VMPI threads > > Looks good to me, minor nits: > Don't we need to update the vcproj files as well? Done in the latest patch. > m_sharedCounter is public, I think it shouldn't have the m_ prefix. Fixed. > Ifdefing that 100000 would help understanding the code. #defined as ITERATIONS > There's no need to typedef structs in C++ (ThreadRecord). Oops. Found this in few other places too.

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #488528 - Attachment description: Latest. API fully covered apart from MemoryBarriers → Latest selftests. API fully covered apart from MemoryBarriers

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #488530 - Attachment description: Post-review changes → Main patch. Post-review changes

Simon Wilkinson

Assignee

Comment 43

•

14 years ago

Attached patch Latest selftests (obsolete) — Details — Splinter Review

Attachment #488528 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 44

•

14 years ago

Attached patch Main patch. Post-review changes and win32 fixes — Details — Splinter Review

Attachment #488530 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Comment 45

•

14 years ago

Attached patch Latest selftests (obsolete) — Details — Splinter Review

Attachment #488681 - Attachment is obsolete: true

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #488682 - Flags: review?(kpalacz)

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #488882 - Flags: review?(kpalacz)

Felix S. Klock II [:pnkfelix, :fklock]

Comment 46

•

14 years ago

Comment on attachment 488682 [details] [diff] [review] Main patch. Post-review changes and win32 fixes typo in VMPI.h: "no memory barrier we be applied." (this typo appears at least twice in the patch.). (I got distracted before I could finish going over the patch.)

Krzysztof Palacz

Comment 47

•

14 years ago

Fixed typos and pushed: http://hg.mozilla.org/tamarin-redux/rev/8983e7687b85

Krzysztof Palacz

Updated

•

14 years ago

Attachment #488682 - Flags: review?(kpalacz)

Felix S. Klock II [:pnkfelix, :fklock]

Comment 48

•

14 years ago

Augmented previous push by throwing in the missing ThreadsPosix-inlines.h file. TR 5526:3748b03ad186 http://hg.mozilla.org/tamarin-redux/rev/3748b03ad186

Simon Wilkinson

Assignee

Comment 49

•

14 years ago

Attached patch Latest selftests. Added conservative memory barrier. (obsolete) — Details — Splinter Review

Attachment #488882 - Attachment is obsolete: true

Attachment #489422 - Flags: review?(kpalacz)

Attachment #488882 - Flags: review?(kpalacz)

Krzysztof Palacz

Comment 50

•

14 years ago

Comment on attachment 489422 [details] [diff] [review] Latest selftests. Added conservative memory barrier. minor nit: I'd move the explantation of what MemoryBarrierTest does next to its implementation.

Attachment #489422 - Flags: review?(kpalacz) → review+

Simon Wilkinson

Assignee

Updated

•

14 years ago

Updated

•

14 years ago

Updated

•

14 years ago

Attachment #488682 - Flags: superreview?(edwsmith)

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #488682 - Flags: superreview?(edwsmith)

Simon Wilkinson

Assignee

Updated

•

14 years ago

Attachment #489422 - Flags: superreview?(edwsmith)

Simon Wilkinson

Assignee

Comment 51

•

14 years ago

(In reply to comment #50) > Comment on attachment 489422 [details] [diff] [review] > Latest selftests. Added conservative memory barrier. > > minor nit: > I'd move the explantation of what MemoryBarrierTest does next to its > implementation. I'll note this here, as well as in the code: The memory barrier test is turned off pending bug 609943

Edwin Smith

Comment 52

•

14 years ago

Comment on attachment 489422 [details] [diff] [review] Latest selftests. Added conservative memory barrier. It looks like something weird happened with line endings in class MemoryBarrierTest

Attachment #489422 - Flags: superreview?(edwsmith) → superreview+

Simon Wilkinson

Assignee

Comment 53

•

14 years ago

Attached patch Latest selftests. Added docs. Fixed spacing. — Details — Splinter Review

Attachment #489422 - Attachment is obsolete: true

Krzysztof Palacz

Comment 54

•

14 years ago

Pushed selftests http://hg.mozilla.org/tamarin-redux/rev/b2dbe070c5bc

Felix S. Klock II [:pnkfelix, :fklock]

Comment 55

•

14 years ago

Simon: can this be closed?

Simon Wilkinson

Assignee

Comment 56

•

14 years ago

(In reply to comment #55) > Simon: can this be closed? Yes it can be closed. Question on protocol: is it me or my landing buddy who should set this status? Previously it has been my landing buddy...

Simon Wilkinson

Assignee

Updated

•

14 years ago

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

Trevor Baker

Updated

•

14 years ago

Flags: flashplayer-bug-

You need to log in before you can comment on or make changes to this bug.