720949 - Need API for "transferring" ArrayBuffer data between runtimes (via shared memory)

Ben Turner (not reading bugmail, use the needinfo flag!)

Reporter

Description

•

14 years ago

Web Workers are about to standardize a transfer extension to postMessage that will allow quick passing of ArrayBuffer data to/from workers. As far as I can tell there is no way to create an array buffer that is backed by non-runtime-specific heap memory, and we will need something like that in order to make this fast.

Jonas Sicking (:sicking) No longer reading bugmail consistently

Comment 1

•

14 years ago

And while we're at it, it'd be great to be able to construct an ArrayBuffer which is backed by a malloc'ed buffer. This is needed by for example XMLHttpRequest which streams data into a buffer, and once all data has been downloaded wants to construct an ArrayBuffer containing that data. Since the XHR doesn't know the size of the buffer until the download is done, it can't create an ArrayBuffer instance and download the data directly into that. So if we had a way to back an ArrayBuffer by a malloc'ed buffer, it seems like the use cases in comment 0 as well as the XHR one could be solved?

Luke Wagner [:luke]

Comment 2

•

14 years ago

What does the spec say happens to the original object? Does it become invalidated in some way?

Jeff Walden [:Waldo]

Comment 3

•

14 years ago

It says it's neutered. (Term of art, honest!) That said, based on my dim memory of skimming the spec and happening to see this a week or so ago, it may be ECMAScript-semantically ill-conceived. Neutering an ArrayBuffer causes the byteLength of that buffer to become 0. But byteLength is a non-writable property, and for that to have any teeth, it must also be non-configurable. If that's the case, ECMAScript forbids the observable behavior of an immutable property changing its value or disappearing. Making byteLength a getter without a setter might be one fix for that. The individual indexed properties of the ArrayBuffer are another concern, at least if someone attempted to make one or more such properties non-configurable. But that has interactions with how WebIDL says the descriptions in the spec map to concrete things, and I don't know that well enough to say with much confidence that there is or is not a concern.

David Mandelin [:dmandelin]

Comment 4

•

14 years ago

Oddly enough, just a few days ago I added an internal API that does roughly that in order to implement ArrayBuffer.slice: struct JS_FRIEND_API(ArrayBuffer) { static JSObject *create(JSContext *cx, int32_t nbytes, uint8_t *contents = NULL); Does it get you what you want if I just add a new variant of JS_NewArrayBuffer that takes a |contents| parameter? My only question is about the original buffer: (a) it has to be a standard malloc'd buffer (cx->malloc_ is what the basic version uses, but I think the only thing special about that is some GC memory accounting) so that the finalizer does the right thing, and (b) it had better not get modified later via the original pointer. Are both of those going to work for you?

Jonas Sicking (:sicking) No longer reading bugmail consistently

Comment 5

•

14 years ago

What does "it had better not get modified later via the original pointer" mean? Surely we're allowed to modify the contents of the malloc'ed buffer, right? Not being able to modify the pointer sounds ok. We do still need two more things though: 1. The ability to "truncate" an ArrayBuffer. I.e. set it's size to 0. This is what happens to an ArrayBuffer when it's "transferred" to a Worker. We need to be able to do this on all types of ArrayBuffers, including ones that use an internal buffer. 2. For ArrayBuffers which are backed by a malloc'ed buffer, we need to be able to take ownership of the malloced buffer as we truncate it. This way when a malloced arraybuffer is transferred, we can transfer the data to the worker without doing any memory copying. Ownership of the malloc'ed buffer is simply transferred to a newly constructed ArrayBuffer which lives inside the worker. If someone transfers a non-malloc'ed ArrayBuffer we'll have to copy the data, but we can still maintain the same behavior outwards.

David Mandelin [:dmandelin]

Comment 6

•

14 years ago

(In reply to Jonas Sicking (:sicking) On leave waiting for visa from comment #5) > What does "it had better not get modified later via the original pointer" > mean? Surely we're allowed to modify the contents of the malloc'ed buffer, > right? Well, thinking about it more, it seems that in a single-threaded setup it would fine to modify the contents, because anything you could do that way you could do by creating a Uint8Array view and setting bytes. But if it's being passed from thread T1 to T2, then T1 shouldn't touch the buffer until T2 stops. > We do still need two more things though: > > 1. The ability to "truncate" an ArrayBuffer. I.e. set it's size to 0. This is > what happens to an ArrayBuffer when it's "transferred" to a Worker. We > need to > be able to do this on all types of ArrayBuffers, including ones that use > an > internal buffer. > 2. For ArrayBuffers which are backed by a malloc'ed buffer, we need to be > able to > take ownership of the malloced buffer as we truncate it. > > This way when a malloced arraybuffer is transferred, we can transfer the > data to the worker without doing any memory copying. Ownership of the > malloc'ed buffer is simply transferred to a newly constructed ArrayBuffer > which lives inside the worker. If someone transfers a non-malloc'ed > ArrayBuffer we'll have to copy the data, but we can still maintain the same > behavior outwards. How about: /* * Create a new array buffer with the given contents, which must be valid * memory of at least |nbytes|. The new array buffer takes ownership of * the contents array. After calling this function, do not free |contents| or * use |contents| from another thread. */ JSObject * JS_NewArrayBufferWithContents(JSContext *cx, uint32_t nbytes, uint8_t *contents); /* * Steal the contents of the given array buffer. The array buffer has its * length set to 0 and its contents array cleared. The contents array and * its length are returned via |nbytes| and |contents|. The caller takes * ownership of |contents| and must free it or transfer ownership when done * using it. */ void JS_StealArrayBufferContents(JSContext *cx, JSObject *obj, uint8_t *nbytes, uint8_t **contents);

Jonas Sicking (:sicking) No longer reading bugmail consistently

Comment 7

•

14 years ago

(In reply to David Mandelin from comment #6) > (In reply to Jonas Sicking (:sicking) On leave waiting for visa from comment > #5) > > What does "it had better not get modified later via the original pointer" > > mean? Surely we're allowed to modify the contents of the malloc'ed buffer, > > right? > > Well, thinking about it more, it seems that in a single-threaded setup it > would fine to modify the contents, because anything you could do that way > you could do by creating a Uint8Array view and setting bytes. But if it's > being passed from thread T1 to T2, then T1 shouldn't touch the buffer until > T2 stops. Yes. In practice I don't think we'll be touching the contents of the buffer ever. But if we do, we'll definitely only do it on the thread where we have an ArrayBuffer owning the malloc'ed buffer. > JSObject * > JS_NewArrayBufferWithContents(JSContext *cx, uint32_t nbytes, uint8_t > *contents); ... > void > JS_StealArrayBufferContents(JSContext *cx, JSObject *obj, uint8_t *nbytes, > uint8_t **contents); I assume that JS_StealArrayBufferContents only works on ArrayBuffers which hold a malloc'ed buffer? If so, this doesn't yet completely solve requirement 1 from comment 5. I.e. we need the ability to truncate a non-malloc'ed ArrayBuffer as well as check if an ArrayBuffer is backed by a malloc'ed buffer. Here's the code that I'm imagining we'll write: On thread 1: JSObject* arraybuffer = ...; uint32 size; uint8_t* contents = nsnull; if (JS_ArrayBufferHasStealableContents(cx, arraybuffer)) { // Still missing JS_StealArrayBufferContents(cx, arraybuffer, &size, &contents); } else { size = JS_GetArrayBufferByteLength(arraybuffer); contents = malloc(size); <OOM check contents>; memcpy(contents, JS_GetArrayBufferData(arraybuffer), size); JS_TruncateArrayBuffer(cx, arraybuffer); // Still missing } sendToOtherThread(contents, size); On thread 2: JSObject* newArraybuffer = JS_NewArrayBufferWithContents(cx, aSize, aContents);

David Mandelin [:dmandelin]

Comment 8

•

14 years ago

(In reply to Jonas Sicking (:sicking) from comment #7) > (In reply to David Mandelin from comment #6) > > (In reply to Jonas Sicking (:sicking) On leave waiting for visa from comment > > #5) > > JSObject * > > JS_NewArrayBufferWithContents(JSContext *cx, uint32_t nbytes, uint8_t > > *contents); > ... > > void > > JS_StealArrayBufferContents(JSContext *cx, JSObject *obj, uint8_t *nbytes, > > uint8_t **contents); > > I assume that JS_StealArrayBufferContents only works on ArrayBuffers which > hold a malloc'ed buffer? IIUC, all ArrayBuffers have their internal elements array allocated via malloc (whether created internally or supplied to the creation function), so once the ArrayBuffer is created there is no distinction and they are all stealable. If that for some reason turned out not to be true, then I would probably want to keep the same API as above, but let it alloc-copy-and-truncate internally (i.e., put the code you gave into the API implementation). Does that sound right?

Jonas Sicking (:sicking) No longer reading bugmail consistently

Comment 9

•

14 years ago

That sounds great!

David Mandelin [:dmandelin]

Updated

•

14 years ago

Assignee: general → sphink

Add JSAPI for transferring ArrayBuffer contents 14 years ago Steve Fink [:sfink] [:s:] 13.27 KB, patch		Details \| Diff \| Splinter Review
Add JSAPI for tranferring ArrayBuffer contents 13 years ago Steve Fink [:sfink] [:s:] 29.91 KB, patch		Details \| Diff \| Splinter Review
Add JSAPI for transferring ArrayBuffer contents 13 years ago Steve Fink [:sfink] [:s:] 38.77 KB, patch		Details \| Diff \| Splinter Review
Add JSAPI for transferring ArrayBuffer contents 13 years ago Steve Fink [:sfink] [:s:] 36.39 KB, patch	luke : review+	Details \| Diff \| Splinter Review