1058340 - [meta] Minimize typed object memory usage

Reporter

Description

•

10 years ago

Typed objects should use less memory than plain objects: the layout of typed values in memory is more efficient than an array of plain values, and there is no need to support extensibility. Currently, however, due to overhead smaller typed objects will use more memory than a comparable plain object. There are two sources of overhead: 1) typed objects always have an associated ArrayBuffer object with their backing data, and 2) typed objects have four fixed slots (byteOffset, length, owner, next view) plus a private pointer for the data itself (which unfortunately gets padded so that we use eight slots in total). In many common use cases all this overhead should be able to be eliminated. To see what the target layout should be and how it will work, here is a basic example: var T = TypedObject; var Pair = new T.StructType({y:T.int32, z:T.int32}); var Triple = new T.StructType({x:T.int32, f:Pair}); #1 var v1 = new Triple(); #2 var v2 = v1.f; #3 var buf = T.storage(v1).buffer; #4 neuter(buf); I'm working under the assumption that operations like #1 and #2 are common, but that operations like #3 and #4 are rare. So, after #1 we've created a typed object with no explicit associated array buffer. As we do for typed arrays, the array buffer should be implicit, and to use as little storage as feasible we'd end up with the following layout for the object in memory: p0 p1 Object OA: [shape, type, x, f.y, f.z] Some notes: - The object's byte offset and length are common to all Triples and can be stored in its type information or descriptor (not sure yet how that stuff fits together). - The object's owner is implicitly null, since there's no array buffer. I think the view list should be removed entirely, which will have some nice complexity and memory benefits. - The object's data follows the shape/type inline so the data pointer is stored implicitly. - There are no slots/elements pointers, which we currently create for all objects. Typed objects are not extensible so these pointers will never be used. The same holds true for other non-native objects (i.e. proxies; after bug 966518 typed objects should be able to become proxies fwiw), so I think it would be a good idea to remove all notion of slots and elements in non-native objects entirely. - The shape and type (i.e. the TI type) should stay unchanged; these are accessed all the time in hot jitcode. After #2 we've created another typed object for the inner Pair structure. This one can't use the same inline layout since it needs to alias the outer object. The best layout for this object in memory is: Object OB: [shape, type, owner:p0, data:p1] Note that now there are two different layouts which can be used for a typed object, depending on how it is created or whether it aliases the contents of another typed object (or array buffer). We'd need to distinguish which representation is in use, by varying either the shape or the type of the object. After #3 we need to lazily create the array buffer for object OA. There are two typed objects now with pointers to the data, and no way to obtain a pointer to OB from OA (since there's no view list). So the array buffer needs to point to the data in OA, and OA needs to point to that array buffer. The latter is somewhat tricky since there's no space in OA for a new pointer, but this could be done by reshaping OA to hold a pointer to the array buffer. This would be expensive but not horribly so. After #4 the array buffer has been neutered so no accesses can be performed on it or OA or OB. Not having a view list means that instances are not explicitly neutered so accesses on them need to always check for neutered buffers, which is some additional VM cost but will normally be eliminated or minimized in JIT code.

Nicholas Nethercote [inactive]

Comment 1

•

10 years ago

Excellent! This could help pdf.js quite a bit. I've encountered exactly this issue a few times, where I would like to convert a vanilla array to a Uint8Array but it's not a memory win unless the array gets to a certain length.

Whiteboard: [MemShrink]