1160971 - SIMD: remove signMask and implement Bool vectors in the interpreter and in the JITs

The linked commit only implements it for the int{8x16,16x8,32x4} types (and still removes it for the float{32x4,64x2} types), but the simd-mandelbrot.js test still needs it because it uses signmask of float32x4 at the moment. Does a patch for this bug have to include allTrue/anyTrue for the float types?

Flags: needinfo?(benj)

Dan Gohman [:sunfish]

Comment 2

•

9 years ago

In the spec, allTrue/anyTrue are removed from the integer types as well, and are only present on the Bool vector types. The most straightforward translation of float32x4 signMask is to do a SIMD.Float32x4.lessThan and then call allTrue/anyTrue on the result.
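
A minimal sketch of that translation, assuming lanes are modelled as plain arrays because SIMD.js itself is not available in current engines. The helper names (lessThanLanes, anyTrueLanes, allTrueLanes) are hypothetical stand-ins for SIMD.Float32x4.lessThan and the Bool32x4 anyTrue/allTrue operations, not real API:

```javascript
// Illustrative emulation only: lanes are plain arrays, not SIMD.js values.
// All helper names are hypothetical stand-ins for the SIMD.js operations.
function lessThanLanes(a, b) {
  // Lane-wise comparison producing a boolean "vector".
  return a.map((x, i) => x < b[i]);
}
function anyTrueLanes(mask) {
  return mask.some(Boolean);
}
function allTrueLanes(mask) {
  return mask.every(Boolean);
}

const v = [1.5, -2.0, 3.0, -0.5];
const zero = [0, 0, 0, 0];

// Old style: test v.signMask against 0 or 0xf.
// New style: compare against zero, then reduce the boolean vector.
const isNegative = lessThanLanes(v, zero);
const someNegative = anyTrueLanes(isNegative);
const everyNegative = allTrueLanes(isNegative);
console.log(isNegative, someNegative, everyNegative);
```

Note that the two forms are not strictly equivalent: signMask reads the sign bit, so a -0 lane counts as negative there, while a lessThan comparison against 0 reports false for -0 (and for NaN lanes).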

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 3

•

9 years ago

What Dan said. Let's morph this bug into implementing Bool vectors, as these are non-trivial to do and deserve their own bug.

Blocks: 1173722

Flags: needinfo?(benj)

Summary: SIMD: remove signMask and implement AllTrue/AnyTrue → SIMD: remove signMask and implement Bool vectors in the interpreter and in the JITs

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Updated

•

9 years ago

Blocks: 1176375

sajjadt

Comment 4

•

9 years ago

Attached patch Implementing bool vector for the interpreter (obsolete) — Details — Splinter Review

Attachment #8640012 - Flags: feedback?(bbouvier)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Updated

•

9 years ago

Attachment #8640012 - Flags: feedback?(bbouvier) → feedback?(benj)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 5

•

9 years ago

Comment on attachment 8640012 [details] [diff] [review]
Implementing bool vector for the interpreter

Review of attachment 8640012 [details] [diff] [review]:
-----------------------------------------------------------------

Looks good overall! A few style nits, easy wins and discussions, so I would like to see another version of this patch.

About the implementation strategy, this is how I would recommend doing things (one patch per bullet):
1. let signMask live and implement bool vectors in the interpreter
2. add support for the JITs
3. update tests to remove signMask and use bool vectors instead
4. remove support for signMask

By moving patches 3. and 4. in your patch queue, you can make sure that all instances of signMask have been replaced. This also makes landing and testing things easier! How does that sound?

::: js/src/builtin/SIMD.cpp
@@ +164,5 @@
> +// SIGN_MASK(Float64x2);
> +// SIGN_MASK(Int8x16);
> +// SIGN_MASK(Int16x8);
> +// SIGN_MASK(Int32x4);
> +//#undef SIGN_MASK

This code can be deleted.

@@ +259,5 @@
> JS_FS_END
> };
>
> const JSPropertySpec Float32x4Defn::TypedObjectProperties[] = {
> +// JS_PSG("signMask", Float32x4SignMask, JSPROP_PERMANENT),

ditto, here and below

@@ +905,5 @@
> + Elem* vec = TypedObjectMemory<Elem*>(args[0]);
> + bool allTrue = true;
> + // enumerate the lanes
> + for (unsigned i = 0; i < V::lanes; i++) {
> + allTrue &= vec[i];

We can make this slightly faster by changing the middle condition in the for loop to: allTrue && i < V::lanes

@@ +917,5 @@
> +static bool
> +AnyTrue(JSContext* cx, unsigned argc, Value* vp)
> +{
> + typedef typename V::Elem Elem;
> +

nit: a few trailing spaces, here and below

@@ +925,5 @@
> +
> + Elem* vec = TypedObjectMemory<Elem*>(args[0]);
> + bool anyTrue = false;
> + // enumerate the lanes
> + for (unsigned i = 0; i < V::lanes; i++) {

ditto, !anyTrue && i < V::lanes

@@ +1266,2 @@
> for (unsigned i = 0; i < V::lanes; i++)
> + result[i] = mask[i] > 0 ? tv[i] : fv[i];

Probably just testing mask[i] would do the trick?

@@ +1266,3 @@
> for (unsigned i = 0; i < V::lanes; i++)
> + result[i] = mask[i] > 0 ? tv[i] : fv[i];
> +

nit: trailing ws

::: js/src/builtin/SIMD.h
@@ +50,5 @@
> + V(notEqual, (CompareFunc<Bool32x4, NotEqual, Bool32x4>), 2)
> +
> +#define BOOL8X16_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool8x16>), 3) \
> + V(select, (Select<Bool32x4, Bool8x16>), 3)

should be Select<Bool8x16, Bool8x16>

@@ +54,5 @@
> + V(select, (Select<Bool32x4, Bool8x16>), 3)
> +
> +#define BOOL16X8_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool16x8>), 3) \
> + V(select, (Select<Bool32x4, Bool16x8>), 3)

ditto with Bool16x8

@@ +310,5 @@
> V(neg, (UnaryFunc<Int32x4, Neg, Int32x4>), 1) \
> V(not, (UnaryFunc<Int32x4, Not, Int32x4>), 1) \
> V(splat, (FuncSplat<Int32x4>), 0)
>
> +

nit: blank line

@@ +551,5 @@
> + return a;
> + }
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + *out = ToBoolean(v);
> + return true;

Isn't there a fallible ToBoolean function, as for the other types?

::: js/src/builtin/TypedObject.h
@@ +365,5 @@
> macro_(SimdTypeDescr::Int16x8, int16_t, int16, 8) \
> macro_(SimdTypeDescr::Int32x4, int32_t, int32, 4) \
> + macro_(SimdTypeDescr::Bool8x16, int8_t, int8, 16) \
> + macro_(SimdTypeDescr::Bool16x8, int16_t, int16, 8) \
> + macro_(SimdTypeDescr::Bool32x4, int32_t, int32, 4) \

The 3 types should be able to use int8_t for their storage, right? Having 32 bits to represent a bool seems overkill...

::: js/src/builtin/TypedObject.js
@@ +157,5 @@
> var y = Load_float64(typedObj, offset + 8);
> return GetFloat64x2TypeDescr()(x, y);
>
> case JS_SIMDTYPEREPR_INT8:
> + case JS_SIMDTYPEREPR_BOOL8:

According to the specification, bool vectors store booleans, not integers, so reusing the integer path won't work.

@@ +196,5 @@
> var y = Load_int32(typedObj, offset + 4);
> var z = Load_int32(typedObj, offset + 8);
> var w = Load_int32(typedObj, offset + 12);
> return GetInt32x4TypeDescr()(x, y, z, w);
> +

nit: extra spaces

@@ +625,5 @@
> + var s5 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 4);
> + var s6 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 5);
> + var s7 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 6);
> + var s8 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 7);
> + var s9 = callFunction(std_SIMD_Boolx16_extractLane, null, this, 0);

nit: this should be std_SIMD_Bool8x16_extractLane, not std_SIMD_Boolx16_extractLane

Also, copy-pasto error: the last indexes restart from 0, while they should continue after 7 (maybe a pre-existing issue with int8x16?)

::: js/src/vm/SelfHosting.cpp
@@ +1300,5 @@
>
> JS_FN("std_SIMD_Int32x4_extractLane", simd_int32x4_extractLane, 2,0),
> JS_FN("std_SIMD_Float32x4_extractLane", simd_float32x4_extractLane, 2,0),
> JS_FN("std_SIMD_Float64x2_extractLane", simd_float64x2_extractLane, 2,0),
> + JS_FN("std_SIMD_Bool32x4_extractLane", simd_bool32x4_extractLane, 2,0),

Other Bool types would need their intrinsics too
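
To illustrate the early-exit loop condition suggested in this review (allTrue && i < V::lanes, and !anyTrue && i < V::lanes), here is a hedged JavaScript transliteration. It is not the patch's C++ code: lanes are plain arrays, the function names are hypothetical, and the visited counter exists only to make the early exit observable.

```javascript
// Plain-array emulation of the reviewed C++ loops; names are hypothetical.
function allTrueEarlyExit(lanes) {
  let allTrue = true;
  let visited = 0;
  // The loop stops as soon as one false lane has been seen.
  for (let i = 0; allTrue && i < lanes.length; i++) {
    allTrue = Boolean(lanes[i]);
    visited++;
  }
  return { result: allTrue, visited };
}

function anyTrueEarlyExit(lanes) {
  let anyTrue = false;
  let visited = 0;
  // The loop stops as soon as one true lane has been seen.
  for (let i = 0; !anyTrue && i < lanes.length; i++) {
    anyTrue = Boolean(lanes[i]);
    visited++;
  }
  return { result: anyTrue, visited };
}

const allCase = allTrueEarlyExit([true, false, true, true]);
const anyCase = anyTrueEarlyExit([false, false, true, false]);
console.log(allCase, anyCase);
```

In this sketch the first call inspects two lanes and the second inspects three, instead of always walking all four.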

Attachment #8640012 - Flags: feedback?(benj) → feedback+

sajjadt

Comment 6

•

9 years ago

Thanks for the review; that flow makes perfect sense to me. Now, I have bool32x4 working in JIT using SSE intrinsic. The question is, is it okay to use int32 type to represent boolean vectors(Bool32x4, Bool16x8, Bool8x16) and then apply non-vectorized bitwise operations on them?

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 7

•

9 years ago

(In reply to staheri from comment #6)
> Thanks for the review; that flow makes perfect sense to me.
>
> Now, I have bool32x4 working in JIT using SSE intrinsic. The question is, is
> it okay to use int32 type to represent boolean vectors(Bool32x4, Bool16x8,
> Bool8x16) and then apply non-vectorized bitwise operations on them?

Yes, it sounds fine. The specification is deliberately loose about the implementation details of the boolean vectors. The only places where it matters are those where we need to rematerialize the SIMD values (bailouts, recover instructions), but this should just be about specializing MSimdBox, hopefully.

sajjadt

Comment 8

•

9 years ago

Attached patch Tests for SIMD boolean vectors (obsolete) — Details — Splinter Review

Attachment #8662166 - Flags: review?(benj)

sajjadt

Comment 9

•

9 years ago

Attached patch WIP patch for supporting SIMD boolean vectors in interpreter and JIT (obsolete) — Details — Splinter Review

Attachment #8640012 - Attachment is obsolete: true

Attachment #8662331 - Flags: feedback?(benj)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 10

•

9 years ago

Comment on attachment 8662166 [details] [diff] [review]
Tests for SIMD boolean vectors

Review of attachment 8662166 [details] [diff] [review]:
-----------------------------------------------------------------

Looks good for bool32x4! I wonder why you had to raise the iteration threshold in most loops? I guess we should be fine just having 50 iterations with the special line at the top. Can you change the value back to what it was before, in all tests, please?

::: js/src/jit-test/tests/SIMD/bool32x4-arith.js
@@ +6,5 @@
> + var b1 = SIMD.Bool32x4(true, false, true, false);
> + var b2 = SIMD.Bool32x4(true, true, true, true);
> + var ret = false;
> + for (var i = 0; i < 3500; i++) {
> + assertEqX4(SIMD.Bool32x4.and(b1, b2), booleanBinaryX4((x, y) => x && y, b1, b2));

Where is booleanBinaryX4 defined? (it doesn't seem to be present in this patch)

::: js/src/jit-test/tests/SIMD/check.js
@@ +9,3 @@
> var i = 0;
> try {
> + for (; i < 3500; i++) {

Any reason why you needed to raise the thresholds? The warmup trigger is supposed to make this compile after around 50 uses.

::: js/src/jit-test/tests/SIMD/compare.js
@@ +13,5 @@
> var i1 = SIMD.Int32x4(1, 2, -3, 4);
> var i2 = SIMD.Int32x4(1, -2, 3, 0);
>
> + for (var i = 0; i < 3500; i++) {
> + assertEqX4(SIMD.Int32x4.lessThan(i1, i2), [false, false, true, false]);

I think you can simply change the function bool, instead of changing all expected results, to something like function bool(x) { return !!x; }

::: js/src/jit-test/tests/SIMD/getters.js
@@ +29,5 @@
> + assertEq(SIMD.Bool32x4.extractLane(b4, 2), false);
> + assertEq(SIMD.Bool32x4.extractLane(b4, 3), true);
> +
> + assertEq(SIMD.Bool32x4.anyTrue(b4), true);
> + assertEq(SIMD.Bool32x4.allTrue(b4), false);

Can you add some tests for anyTrue/allTrue?
- anyTrue for a vector containing all falses
- allTrue for a vector containing all trues

::: js/src/jit-test/tests/SIMD/replacelane.js
@@ +169,5 @@
> + try {
> + let x = SIMD.Bool32x4.replaceLane(b4, i < 3499 ? 0 : 1.1, true);
> + } catch(e) {
> + assertEq(e instanceof TypeError, true);
> + //assertEq(i, 3499);

nit: please un-comment this line and make sure the test passes

::: js/src/tests/ecma_7/SIMD/binary-operations.js
@@ +389,5 @@
> +}
> +
> +function testBool8x16xor() {
> + function xorb(a, b) {
> + return !!(a ^ b);

Out of curiosity, why do we need the explicit boolean coercion here?

::: js/src/tests/ecma_7/SIMD/select-bitselect.js
@@ +16,5 @@
>
> function getMask(i, maskLength) {
> var args = [];
> for (var j = 0; j < maskLength; j++) args.push(!!((i >> j) & 1));
> + console.log(args);

nit: please remove this call

@@ +30,5 @@
> +
> +function getSelectBitsMask(i, maskLength) {
> + var args = [];
> + for (var j = 0; j < maskLength; j++) args.push(!!((i >> j) & 1));
> + console.log(args);

nit: ditto

@@ +88,3 @@
> var mask = getMask(i, maskLength);
> for ([x, y] of inputs)
> + assertEqVec(type.selectBits(bitsMask, x, y), selectBits(type, bitsMask, x, y));

pre-existing, but as you're around: please add {} around these two lines, it's actually not doing what it ought to :/

::: js/src/tests/ecma_7/SIMD/shell.js
@@ +151,5 @@
> }
>
> function simdLength(v) {
> var pt = Object.getPrototypeOf(v);
> + if (pt == SIMD.Int8x16.prototype || pt === SIMD.Bool8x16.prototype)

pre-existing, but here and below, can you replace all of these by ===?

@@ +163,5 @@
> throw new TypeError("Unknown SIMD kind.");
> }
>
> function simdLengthType(t) {
> + if (t == SIMD.Int8x16 || t == SIMD.Bool8x16)

ditto

::: js/src/tests/ecma_7/SIMD/typedobjects.js
@@ +661,2 @@
> function test() {
> +

nit: blank line
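
One plausible reading of the "please add {} around these two lines" remark is the classic unbraced-loop pitfall: only the first statement after a for header belongs to the loop body. The sketch below shows that behaviour in isolation; the arrays and values are made up for illustration and do not come from the test file.

```javascript
// Hypothetical data, only used to show which statements run per iteration.
const inputs = [[1, 2], [3, 4]];

const unbraced = [];
for (const [x, y] of inputs)
  unbraced.push(x + y);
  unbraced.push("checked"); // misleading indentation: runs once, after the loop

const braced = [];
for (const [x, y] of inputs) {
  braced.push(x + y);
  braced.push("checked"); // runs on every iteration
}

console.log(unbraced); // the second statement appears once
console.log(braced);   // the second statement appears once per input
```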

Attachment #8662166 - Flags: review?(benj) → feedback+

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 11

•

9 years ago

Comment on attachment 8662331 [details] [diff] [review] WIP patch for supporting SIMD boolean vectors in interpreter and JIT Review of attachment 8662331 [details] [diff] [review]: ----------------------------------------------------------------- Any chance you'd be willing to split this patch into two parts (with hg histedit/hg record if you're using mercurial, git rebase -i/git add -p if you're using git): the interpreter parts on one hand, the JIT parts on the other hand?

sajjadt

Comment 12

•

9 years ago

Thanks for the review. I'll provide you with new patches. To answer the questions that you just raised:

> I wonder why you had to raise the iteration threshold in most loops?

I noticed that 50 iterations are not enough for Ion to generate machine code (only the baseline compiler does).

> Where is booleanBinaryX4 defined?

You're right, my bad. It is supposed to be in jit-test/lib/simd.js:

function booleanBinaryX4(op, v, w) {
    var arr = [];
    var [varr, warr] = [simdToArray(v), simdToArray(w)];
    for (var i = 0; i < 4; i++)
        arr[i] = op(varr[i], warr[i]);
    return arr;
}

> i think you can simply change the function bool...

That function is from the previous implementation and is not used anymore. I guess we can remove it.

> Out of curiosity, why do we need the explicit boolean coercion here?

One way to compute an xor is to use the "^" operator. But the result of "^" applied to two boolean values is a numeric value, i.e. 0 or 1. Since we're using "===" in assertions, we need to make sure that both sides have the same type as well.
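
The coercion point can be checked in any JavaScript engine. The short sketch below reuses the xorb helper quoted in the review; the other variable names are illustrative.

```javascript
// "^" converts its boolean operands to numbers, so the result is 0 or 1.
const raw = true ^ false;

// "!!" converts the number back to a boolean, as in the test helper.
const coerced = !!(true ^ false);

function xorb(a, b) {
  return !!(a ^ b);
}

// A strict comparison distinguishes the number 1 from the boolean true.
console.log(typeof raw, raw === true);         // number type, strict comparison fails
console.log(typeof coerced, coerced === true); // boolean type, strict comparison holds
console.log(xorb(true, true), xorb(false, true));
```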

sajjadt

Comment 13

•

9 years ago

Attached patch WIP patch for supporting SIMD boolean vectors in interpreter (obsolete) — Details — Splinter Review

Attachment #8662331 - Attachment is obsolete: true

Attachment #8662331 - Flags: feedback?(benj)

Attachment #8670460 - Flags: feedback?(benj)

sajjadt

Comment 14

•

9 years ago

Attached patch WIP patch for supporting SIMD boolean vectors in JIT (obsolete) — Details — Splinter Review

Attachment #8670462 - Flags: feedback?(benj)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 15

•

9 years ago

Comment on attachment 8670460 [details] [diff] [review]
WIP patch for supporting SIMD boolean vectors in interpreter

Review of attachment 8670460 [details] [diff] [review]:
-----------------------------------------------------------------

Thank you for splitting the patch! I'd like to see another version before r+-ing it, but it looks good. I suppose from this patch that your plan is to remove signMask later? That sounds great.

::: js/src/builtin/SIMD.cpp
@@ +225,5 @@
> static const JSFunctionSpec TypedObjectMethods[];
> static const JSFunctionSpec Methods[];
> };
> +class Bool8x16Defn {
> +public:

nit: before "public", half indent please (2 spaces)

@@ +228,5 @@
> +class Bool8x16Defn {
> +public:
> + static const SimdTypeDescr::Type type = SimdTypeDescr::Bool8x16;
> + static const JSFunctionSpec TypeDescriptorMethods[];
> + static const JSPropertySpec TypedObjectProperties[];

You probably don't need this here (see below), and in other types as well.

@@ +233,5 @@
> + static const JSFunctionSpec TypedObjectMethods[];
> + static const JSFunctionSpec Methods[];
> +};
> +class Bool16x8Defn {
> +public:

ditto

@@ +241,5 @@
> + static const JSFunctionSpec TypedObjectMethods[];
> + static const JSFunctionSpec Methods[];
> +};
> +class Bool32x4Defn {
> +public:

ditto

@@ +384,5 @@
> + JS_FS_END,
> +};
> +
> +const JSPropertySpec Bool8x16Defn::TypedObjectProperties[] = {
> + JS_PS_END

You probably don't need this TypedObjectProperties here

@@ +899,5 @@
> + return ErrorBadArgs(cx);
> +
> + Elem* vec = TypedObjectMemory<Elem*>(args[0]);
> + bool allTrue = true;
> + // enumerate the lanes

nit: I would remove this comment, it's not adding much information

@@ +902,5 @@
> + bool allTrue = true;
> + // enumerate the lanes
> + for (unsigned i = 0; allTrue && i < V::lanes; i++) {
> + allTrue &= vec[i];
> + }

nit: you don't need {} for single line for-bodies

@@ +920,5 @@
> + return ErrorBadArgs(cx);
> +
> + Elem* vec = TypedObjectMemory<Elem*>(args[0]);
> + bool anyTrue = false;
> + // enumerate the lanes

ditto for this comment

@@ +923,5 @@
> + bool anyTrue = false;
> + // enumerate the lanes
> + for (unsigned i = 0; !anyTrue && i < V::lanes; i++) {
> + anyTrue |= vec[i];
> + }

ditto for {}

@@ +1255,5 @@
> Elem* tv = TypedObjectMemory<Elem*>(args[1]);
> Elem* fv = TypedObjectMemory<Elem*>(args[2]);
>
> Elem result[V::lanes];
> + //Boolean mask items are represented with -1/0 for true/false

Well well well, couldn't we just have them be 1 for true, 0 for false, instead? That seems more natural.

::: js/src/builtin/SIMD.h
@@ +50,5 @@
> + V(notEqual, (CompareFunc<Bool32x4, NotEqual, Bool32x4>), 2)
> +
> +#define BOOL8X16_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool8x16>), 3) \
> + V(select, (Select<Bool32x4, Bool8x16>), 3)

there's no select, per spec

@@ +54,5 @@
> + V(select, (Select<Bool32x4, Bool8x16>), 3)
> +
> +#define BOOL16X8_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool16x8>), 3) \
> + V(select, (Select<Bool32x4, Bool16x8>), 3)

there's no select, per spec

@@ +58,5 @@
> + V(select, (Select<Bool32x4, Bool16x8>), 3)
> +
> +#define BOOL32X4_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool32x4>), 3) \
> + V(select, (Select<Bool32x4, Bool32x4>), 3)

there's no select, per spec

@@ +60,5 @@
> +#define BOOL32X4_TERNARY_FUNCTION_LIST(V) \
> + V(replaceLane, (ReplaceLane<Bool32x4>), 3) \
> + V(select, (Select<Bool32x4, Bool32x4>), 3)
> +
> +#define BOOL16X8_BINARY_FUNCTION_LIST(V) \

Please add some consistency to the grouping of the new functions: the BINARY macros stay together, the TERNARY macros stay together, etc.

@@ +208,5 @@
> V(fromInt32x4Bits, (FuncConvertBits<Int32x4, Int8x16>), 1) \
> V(neg, (UnaryFunc<Int8x16, Neg, Int8x16>), 1) \
> V(not, (UnaryFunc<Int8x16, Not, Int8x16>), 1) \
> + V(splat, (FuncSplat<Int8x16>), 1) \
> +

nit: extra line here

@@ +392,5 @@
> #define BITWISE_COMMONX4_SIMD_OP(_) \
> _(and) \
> _(or) \
> _(xor)
> +

nit: extra line

@@ +431,5 @@
> _(store2) \
> _(store3) \
> _(check)
> +
> +

nit: blank lines

@@ +567,5 @@
> + return JS::ToInt8(a);
> + }
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + bool ret = ToInt8(cx, v, out);
> + *out = *out * -1;

Why do you need to do this? If *out is false, it's 0, and multiplying it by -1 won't change it; if it's true, it will change the sign of the result (I guess passing from -1 to 1, in that case). Can you either add a comment explaining what's going on here, or just do: *out = int8_t(bool(*out));

@@ +568,5 @@
> + }
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + bool ret = ToInt8(cx, v, out);
> + *out = *out * -1;
> + return ret;

nit: the convention, in that case, is to do it this way:

    if (!ToInt8(cx, v, out))
        return false;
    *out = *out * -1;
    return true;

These two remarks apply to the other types, as well :)

::: js/src/builtin/TypedObject.js
@@ +212,5 @@
> + var s13 = Load_int8(typedObj, offset + 13);
> + var s14 = Load_int8(typedObj, offset + 14);
> + var s15 = Load_int8(typedObj, offset + 15);
> + return GetBool8x16TypeDescr()(s0, s1, s2, s3, s4, s5, s6, s7,
> + s8, s9, s10, s11, s12, s13, s14, s15);

nit: align s8 with the opening parenthesis above, please

@@ +226,5 @@
> + var s7 = Load_int8(typedObj, offset + 7);
> + return GetBool16x8TypeDescr()(s0, s1, s2, s3, s4, s5, s6, s7);
> +
> + case JS_SIMDTYPEREPR_BOOL32:
> + var x = Load_int8(typedObj, offset + 0);

uber-nit: this should be aligned 2 spaces after the "case" keyword above, for consistency:

    case X:
      var x = Load_int8(etc);

@@ +593,5 @@
> // SIMD
>
> function SimdProtoString(type) {
> switch (type) {
> + case JS_SIMDTYPEREPR_BOOL8:

nit: alignment is inconsistent with below

@@ +692,5 @@
> + var s5 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 4);
> + var s6 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 5);
> + var s7 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 6);
> + var s8 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 7);
> + var s9 = callFunction(std_SIMD_Boolx16_extractLane, null, this, 0);

nit: that should be std_SIMD_Bool8x16_extractLane here (the 8 is missing)

Please make sure this is tested (calling toSource() or toString() on a SIMD instance)

Attachment #8670460 - Flags: feedback?(benj) → feedback+

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 16

•

9 years ago

Comment on attachment 8670462 [details] [diff] [review]
WIP patch for supporting SIMD boolean vectors in JIT

Review of attachment 8670462 [details] [diff] [review]:
-----------------------------------------------------------------

One major non-nit: it seems that at some places, an int32x4 register is used as the backend for a bool32x4 vector, but not at others. This will blow up all runtime assertions, especially with respect to memory alignment constraints. Ideally, MIRType_Bool32x4 would exist, but would lower to an LInt32x4, so you don't need LBool32x4 at all. Does that sound feasible?

::: js/src/jit/BaselineIC.cpp
@@ +8475,5 @@
> + FOREACH_BOOL32X4_SIMD_OP(ADD_BOOL32X4_SIMD_OP_NAME_))
> + {
> + Rooted<SimdTypeDescr*> descr(cx, &cx->global()->bool32x4TypeDescr().as<SimdTypeDescr>());
> + res.set(cx->compartment()->jitCompartment()->getSimdTemplateObjectFor(cx, descr));
> + return !!res;

Can you merge the two if conditions, so as to have only one block?

::: js/src/jit/IonTypes.h
@@ +312,5 @@
> SimdConstant cst;
> cst.fillInt32x4(v, v, v, v);
> return cst;
> }
> + static SimdConstant CreateX4(int8_t x, int8_t y, int8_t z, int8_t w) {

There's an implicit type risk here: CreateX4 has a variant that takes all int32, so we could create an instance of the wrong type. Maybe call it CreateBoolX4 instead? Or make sure all arguments are bools?

@@ +317,5 @@
> + SimdConstant cst;
> + cst.fillBool32x4(x, y, z, w);
> + return cst;
> + }
> + static SimdConstant CreateX4(int8_t* array) {

ditto

@@ +322,5 @@
> + SimdConstant cst;
> + cst.fillBool32x4(array[0], array[1], array[2], array[3]);
> + return cst;
> + }
> + static SimdConstant SplatX4(int8_t v) {

ditto

@@ +423,5 @@
> MIRType_ObjectGroup, // An ObjectGroup pointer.
> MIRType_Last = MIRType_ObjectGroup,
> MIRType_Float32x4 = MIRType_Float32 | (2 << VECTOR_SCALE_SHIFT),
> MIRType_Int32x4 = MIRType_Int32 | (2 << VECTOR_SCALE_SHIFT),
> + MIRType_Bool32x4 = MIRType_Boolean | (2 << VECTOR_SCALE_SHIFT),

nit: remove spaces so as to keep alignment

::: js/src/jit/LIR.cpp
@@ +547,5 @@
> MOZ_ASSERT(from != to);
> for (size_t i = 0; i < moves_.length(); i++)
> MOZ_ASSERT(to != moves_[i].to());
>
> + int memoryAlignment = (type == LDefinition::BOOL32X4) ? SimdBoolMemoryAlignment:SimdMemoryAlignment;

I'm not sure I follow: I thought the implementation was using xmm registers (namely, int32x4 registers) for storing the content of bool32x4. Correct me if that's wrong but it seems to be the case. If that's the case, the same memory alignment constraints apply to bool32x4 as to int32x4. If you haven't hit any runtime failure during your testing, it might mean there's something really bad happening, like none of the functions being inlined. This is something you can check by running the shell with the env variable IONFLAGS=logs, and using iongraph to check that the bool32x4 appear there.

iongraph: https://github.com/sstangl/iongraph

::: js/src/jit/Lowering.cpp
@@ +4016,5 @@
> else if (ins->type() == MIRType_Float32x4)
> define(new(alloc()) LFloat32x4(), ins);
> + else if (ins->type() == MIRType_Bool32x4){
> + MOZ_CRASH("Visit SIMD Bool Const is NYI");
> + }

nit: no {} for single line if body. Also, please do implement this :D

@@ +4107,5 @@
> }
> }
>
> void
> +LIRGenerator::visitSimdAllTrue(MSimdAllTrue* ins)

Is there any chance this could be part of SimdUnaryX4 instead?

@@ +4114,5 @@
> + MOZ_ASSERT(IsSimdType(input->type()));
> + MOZ_ASSERT(input->type() == MIRType_Bool32x4);
> +
> + LUse use = useRegisterAtStart(input);
> + define(new(alloc()) LSimdAnyTrue(use), ins);

nit: LSimdAllTrue

@@ +4118,5 @@
> + define(new(alloc()) LSimdAnyTrue(use), ins);
> +}
> +
> +void
> +LIRGenerator::visitSimdAnyTrue(MSimdAnyTrue* ins)

ditto

@@ +4210,5 @@
> if (ins->type() == MIRType_Int32x4) {
> LSimdUnaryArithIx4* lir = new(alloc()) LSimdUnaryArithIx4(in);
> define(lir, ins);
> + } else if (ins->type() == MIRType_Bool32x4) {
> + LSimdUnaryArithIx4* lir = new(alloc()) LSimdUnaryArithIx4(in);

Looks like there's something wrong here: it should probably be LSimdUnaryArithBx4, as the methods

@@ +4257,5 @@
> LSimdBinaryBitwiseX4* lir = new(alloc()) LSimdBinaryBitwiseX4;
> lowerForFPU(lir, ins, lhs, rhs);
> + } else if (ins->type() == MIRType_Bool32x4) {
> + LSimdBinaryBitwiseX4* lir = new(alloc()) LSimdBinaryBitwiseX4;
> + lowerForFPU(lir, ins, lhs, rhs);

seems you can just add "|| ins->type() == MIRType_Bool32x4" in the above branch

::: js/src/jit/MCallOptimize.cpp
@@ +3507,5 @@
> return boxSimd(callInfo, ins, templateObj);
> }
>
> +
> + IonBuilder::InliningStatus

nit: all of this code should be less indented by one tab

@@ +3546,5 @@
> {
> switch (type) {
> case SimdTypeDescr::Float32x4: return Scalar::Float32x4;
> case SimdTypeDescr::Int32x4: return Scalar::Int32x4;
> + case SimdTypeDescr::Bool32x4: return Scalar::Int32x4;

I don't think you need to return a value here: this function is just used for load/store, which aren't defined for Bool vectors. That being said, there's a pre-existing naming issue here, and I wonder how the current code can compile. There's already a function called SimdTypeToScalarType (returning MIRType) which is called way above this function, and it doesn't seem to be an issue...

::: js/src/jit/MIR.cpp
@@ +919,5 @@
> switch (type()) {
> + case MIRType_Bool32x4:{
> + int8_t a[4];
> + for (size_t i = 0; i < 4; ++i)
> + a[i] = ((int8_t) getOperand(i)->constantValue().toBoolean() ? -1:0);

nits: no need for the wrapping parentheses, space before and after the :

::: js/src/jit/MIR.h
@@ +1754,5 @@
>
> +// Returns true if all lanes are true.
> +class MSimdAllTrue
> +: public MUnaryInstruction,
> +public SimdPolicy<0>::Data

nit: spacing on these 2 lines is wrong (look at the rest of this file)

@@ +1756,5 @@
> +class MSimdAllTrue
> +: public MUnaryInstruction,
> +public SimdPolicy<0>::Data
> +{
> +protected:

nit: half indent (2 spaces) before public/protected/private keywords

@@ +1758,5 @@
> +public SimdPolicy<0>::Data
> +{
> +protected:
> + explicit MSimdAllTrue(MDefinition* obj, MIRType type)
> + : MUnaryInstruction(obj)

nit: 2 spaces before the :

@@ +1765,5 @@
> + specialization_ = type;
> + setMovable();
> + }
> +
> +public:

ditto

@@ +1768,5 @@
> +
> +public:
> + INSTRUCTION_HEADER(SimdAllTrue)
> +
> + static MSimdAllTrue* NewAsmJS(TempAllocator& alloc, MDefinition* obj)

If it's not used yet, don't add it yet :)

@@ +1787,5 @@
> + if (!ins->isSimdAllTrue())
> + return false;
> + return congruentIfOperandsEqual(ins);
> + }
> + ALLOW_CLONE(MSimdAllTrue)

I guess you could also implement foldsTo, but that can be done as a follow up bug

@@ +1793,5 @@
> +
> +// Returns true if any lane is true.
> +class MSimdAnyTrue
> +: public MUnaryInstruction,
> +public SimdPolicy<0>::Data

All comments from the previous class apply here as well, and in the next class as well

@@ -3208,5 @@
> : MUnaryInstruction(op),
> templateObject_(templateObject),
> initialHeap_(initialHeap)
> {
> - MOZ_ASSERT(IsSimdType(op->type()));

Please don't remove this assertion.

::: js/src/jit/StackSlotAllocator.h
@@ +98,5 @@
> #endif
> case LDefinition::DOUBLE: return 8;
> case LDefinition::SINCOS:
> case LDefinition::FLOAT32X4:
> +

nit: no newline here

::: js/src/jit/shared/LIR-shared.h
@@ +248,5 @@
> }
> return "unknown lane";
> }
> };
> +// Extracts an element from a given SIMD bool32x4 lane.

nit: Please add a line before and after this class.

@@ +251,5 @@
> };
> +// Extracts an element from a given SIMD bool32x4 lane.
> +class LSimdExtractElementB : public LSimdExtractElementBase
> +{
> +public:

see nits in other files, like MIR.h

@@ +254,5 @@
> +{
> +public:
> + LIR_HEADER(SimdExtractElementB);
> + explicit LSimdExtractElementB(const LAllocation& base)
> + : LSimdExtractElementBase(base)

here too

@@ +741,5 @@
> return f_;
> }
> };
>
> +

nit: blank line

@@ +763,5 @@
> const SimdConstant& getValue() const { return mir_->toSimdConstant()->value(); }
> };
>
> +// Constant SIMD Bool32x4
> +class LBool32x4 : public LInstructionHelper<1, 0, 0>//

nit: remove // at the end

::: js/src/jit/x64/Assembler-x64.h
@@ +201,5 @@
> // here such that it is accessible from the entire codebase. Once full support
> // for SIMD is reached on all tier-1 platforms, this constant can be deleted.
> static MOZ_CONSTEXPR_VAR bool SupportsSimd = true;
> static MOZ_CONSTEXPR_VAR uint32_t SimdMemoryAlignment = 16;
> +static MOZ_CONSTEXPR_VAR uint32_t SimdBoolMemoryAlignment = 4;

Why do we need this? (see also LIR.cpp)

::: js/src/jit/x86-shared/CodeGenerator-x86-shared.cpp
@@ +2396,5 @@
> }
> }
>
> void
> +CodeGeneratorX86Shared::visitSimdExtractElementB(LSimdExtractElementB* ins)

It's the same code as visitSimdExtractElementI, which makes sense if bool32x4 are represented by int32x4. At a higher level, can you use LSimdExtractElementI instead?

@@ +2460,5 @@
> masm.canonicalizeFloat(output);
> }
>
> void
> +CodeGeneratorX86Shared::visitSimdInsertElementB(LSimdInsertElementB* ins)

ditto

::: js/src/jit/x86-shared/Encoding-x86-shared.h
@@ +175,5 @@
> OP2_RCPPS_VpsWps = 0x53,
> OP2_ANDPD_VpdWpd = 0x54,
> OP2_ORPD_VpdWpd = 0x56,
> OP2_XORPD_VpdWpd = 0x57,
> + OP2_PUNPCKLBW_VdqWdq= 0x60,

nit: space before the =

::: js/src/jit/x86-shared/MacroAssembler-x86-shared.h
@@ +1015,5 @@
> + vmovdqu(Operand(src), dest);
> + // Do a low byte unpacking
> + vpunpcklbw(dest, dest, dest);
> + // Do a low half-word unpack
> + vpunpcklwd(dest, dest, dest);

Why do you need the unpacking? It seems like it's just supposed to read the value from an address (stack or memory), which should already be in the right format.

@@ +1062,5 @@
> }
> void storeUnalignedInt32x4(FloatRegister src, const Operand& dest) {
> vmovdqu(src, dest);
> }
> + void storeBool32x4(FloatRegister src, const Address& dest) {

ditto

@@ -1357,5 @@
> return true;
> }
> return false;
> }
> -

nit: keep this line please

Attachment #8670462 - Flags: feedback?(benj)

sajjadt

Comment 17

•

9 years ago

Attached patch Implementing bool vector for the interpreter (obsolete) — Details — Splinter Review

Attachment #8662166 - Attachment is obsolete: true

Attachment #8670460 - Attachment is obsolete: true

Attachment #8670462 - Attachment is obsolete: true

Attachment #8686450 - Flags: review?(benj)

sajjadt

Comment 18

•

9 years ago

Attached patch Tests for SIMD boolean vectors (obsolete) — Details — Splinter Review

Attachment #8686451 - Flags: review?(benj)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 19

•

9 years ago

Comment on attachment 8686450 [details] [diff] [review]
Implementing bool vector for the interpreter

Review of attachment 8686450 [details] [diff] [review]:
-----------------------------------------------------------------

A few comments below. I'll add them on top of your patch. Thanks for the patch!

::: js/src/builtin/SIMD.cpp
@@ +382,5 @@
> + JS_SELF_HOSTED_FN("equivalent", "TypeDescrEquivalent", 1, 0),
> + JS_FS_END,
> +};
> +
> +const JSPropertySpec Bool8x16Defn::TypedObjectProperties[] = {

Do we really need ::TypedObjectProperties arrays?

@@ +901,5 @@
> + bool allTrue = true;
> + for (unsigned i = 0; allTrue && i < V::lanes; i++)
> + allTrue &= vec[i];
> +
> + V::setReturn(args, allTrue);

I'd just set args.rval() here, to avoid the implicit coercion bool -> int8/int16/int32 -> bool when calling setReturn

@@ +920,5 @@
> + bool anyTrue = false;
> + for (unsigned i = 0; !anyTrue && i < V::lanes; i++)
> + anyTrue |= vec[i];
> +
> + V::setReturn(args, anyTrue);

ditto

@@ +1250,5 @@
> Elem* tv = TypedObjectMemory<Elem*>(args[1]);
> Elem* fv = TypedObjectMemory<Elem*>(args[2]);
>
> Elem result[V::lanes];
> + //Boolean mask items are represented with -1/0 for true/false

Why not just true/false? I really don't see a reason not to do this.

::: js/src/builtin/SIMD.h
@@ +32,5 @@
> + V(and, (BinaryFunc<Bool8x16, And, Bool8x16>), 2) \
> + V(or, (BinaryFunc<Bool8x16, Or, Bool8x16>), 2) \
> + V(xor, (BinaryFunc<Bool8x16, Xor, Bool8x16>), 2) \
> + V(equal, (CompareFunc<Bool8x16, Equal, Bool8x16>), 2) \
> + V(notEqual, (CompareFunc<Bool8x16, NotEqual, Bool8x16>), 2)

bool8x16 doesn't have equal/notEqual in the spec

@@ +55,5 @@
> + V(and, (BinaryFunc<Bool16x8, And, Bool16x8>), 2) \
> + V(or, (BinaryFunc<Bool16x8, Or, Bool16x8>), 2) \
> + V(xor, (BinaryFunc<Bool16x8, Xor, Bool16x8>), 2) \
> + V(equal, (CompareFunc<Bool16x8, Equal, Bool16x8>), 2) \
> + V(notEqual, (CompareFunc<Bool16x8, NotEqual, Bool16x8>), 2)

ditto

@@ +78,5 @@
> + V(and, (BinaryFunc<Bool32x4, And, Bool32x4>), 2) \
> + V(or, (BinaryFunc<Bool32x4, Or, Bool32x4>), 2) \
> + V(xor, (BinaryFunc<Bool32x4, Xor, Bool32x4>), 2) \
> + V(equal, (CompareFunc<Bool32x4, Equal, Bool32x4>), 2) \
> + V(notEqual, (CompareFunc<Bool32x4, NotEqual, Bool32x4>), 2)

ditto

@@ +351,5 @@
> INT32X4_TERNARY_FUNCTION_LIST(V) \
> INT32X4_QUARTERNARY_FUNCTION_LIST(V) \
> INT32X4_SHUFFLE_FUNCTION_LIST(V)
>
> +#define FOREACH_BOOL32X4_SIMD_OP(_) \

This will be needed for when we compile these operators in the JITs, let's not add them before.

@@ +559,5 @@
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + if (!ToInt8(cx, v, out))
> + return false;
> + /* Convert true value (i.e. 1) to 0xff which makes SIMD select
> + implemetation easier*/

nit: implementation + use a simple comment //

@@ +583,5 @@
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + if (!ToInt8(cx, v, out))
> + return false;
> + /* Convert true value (i.e. 1) to 0xff which makes SIMD select
> + implemetation easier*/

ditto

@@ +607,5 @@
> + static bool toType(JSContext* cx, JS::HandleValue v, Elem* out) {
> + if (!ToInt8(cx, v, out))
> + return false;
> + /* Convert true value (i.e. 1) to 0xff which makes SIMD select
> + implemetation easier*/

ditto

::: js/src/builtin/TypedObject.js
@@ +692,5 @@
> + var s5 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 4);
> + var s6 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 5);
> + var s7 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 6);
> + var s8 = callFunction(std_SIMD_Bool8x16_extractLane, null, this, 7);
> + var s9 = callFunction(std_SIMD_Boolx16_extractLane, null, this, 8);

The 8 of Bool8x16 is still missing on this line, this will break. Is this tested somewhere?
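
The lane reductions and the mask representation discussed in this review can be sketched outside SpiderMonkey as plain C++. This is a minimal, standalone illustration and not the patch's code: `toLane`, `allTrue`, `anyTrue`, `selectLanes` and `demoReductions` are hypothetical helpers over `std::array`, and the -1/0 lane encoding is taken from the quoted comment in the patch. The reductions return a plain `bool`, which mirrors the suggestion to set the return value directly instead of going through an integer lane type.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper: encode a scalar boolean as a lane value, using the
// -1/0 (all bits set / all bits clear) representation quoted in the review.
template <typename Elem>
Elem toLane(bool b) {
    return b ? Elem(-1) : Elem(0);
}

// True only if every lane is non-zero; the loop stops at the first false lane.
template <typename Elem, std::size_t Lanes>
bool allTrue(const std::array<Elem, Lanes>& vec) {
    bool result = true;
    for (std::size_t i = 0; result && i < Lanes; i++)
        result = vec[i] != 0;
    return result;
}

// True if at least one lane is non-zero; the loop stops at the first true lane.
template <typename Elem, std::size_t Lanes>
bool anyTrue(const std::array<Elem, Lanes>& vec) {
    bool result = false;
    for (std::size_t i = 0; !result && i < Lanes; i++)
        result = vec[i] != 0;
    return result;
}

// Lane-wise select: take tv[i] where the mask lane is true, fv[i] otherwise.
template <typename Elem, std::size_t Lanes>
std::array<Elem, Lanes> selectLanes(const std::array<Elem, Lanes>& mask,
                                    const std::array<Elem, Lanes>& tv,
                                    const std::array<Elem, Lanes>& fv) {
    std::array<Elem, Lanes> result{};
    for (std::size_t i = 0; i < Lanes; i++)
        result[i] = mask[i] != 0 ? tv[i] : fv[i];
    return result;
}

// Small usage check on a four-lane mask.
void demoReductions() {
    std::array<std::int32_t, 4> mask{toLane<std::int32_t>(true), 0, -1, 0};
    assert(anyTrue(mask));
    assert(!allTrue(mask));
}
```

Because the sketch tests lanes with `!= 0`, it behaves the same whether true is stored as 1 or as -1, so it does not settle the "why not just true/false?" question either way.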

Attachment #8686450 - Flags: review?(benj)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 20

•

9 years ago

Comment on attachment 8686451 [details] [diff] [review]
Tests for SIMD boolean vectors

Review of attachment 8686451 [details] [diff] [review]:
-----------------------------------------------------------------

Thank you for this patch! I think Bool16x8 and Bool8x16 will need full proper testing as well, but that's going to happen in a different patch set.

::: js/src/jit-test/tests/SIMD/check.js
@@ +9,4 @@
> var i = 0;
> try {
> + for (; i < 3500; i++) {
> + if (i > 3048)

No need to change these thresholds (when the test is run with --no-threads, it gets ion-compiled. Indeed it wasn't the case a few months ago, so we might need something more to set in the jit compiler options).

::: js/src/jit-test/tests/SIMD/getters.js
@@ +5,5 @@
> function f() {
> var i4 = SIMD.Int32x4(1, -2, 3, -4);
> + var b4 = SIMD.Bool32x4(true, true, false, true);
> +
> +

nit: blank line

::: js/src/jit-test/tests/asm.js/testBug1099216.js
@@ +25,4 @@
> var g = m.SIMD.Int32x4
> var h = g.select
> function f() {
> + var x = k(0, 0, 0, 0)

This won't validate as asm.js, it should rather stay as is.

::: js/src/tests/ecma_7/SIMD/typedobjects.js
@@ +253,5 @@
> var f = array[1];
> +
> + var sj1 = Int8x16.extractLane(f, 3);
> +
> + assertEq(sj1, 20);

Could stay as it was before.

@@ +661,2 @@
> function test() {
> +

nit: blank line

Attachment #8686451 - Flags: review?(benj) → review+

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 21

•

9 years ago

Attached patch 1. Interpreter changes (obsolete) — Details — Splinter Review

Updated sajjad's patch.

Attachment #8686450 - Attachment is obsolete: true

Attachment #8688571 - Flags: review?(jolesen)

Benjamin Bouvier [:bbouvier] (inactive)

Reporter

Comment 22

•

9 years ago

Attached patch 2. Tests (obsolete) — Details — Splinter Review

Attachment #8686451 - Attachment is obsolete: true

Attachment #8688572 - Flags: review?(jolesen)

Implementing bool vector for the interpreter 9 years ago sajjadt 67.31 KB, patch	bbouvier : feedback+	Details \| Diff \| Splinter Review
Tests for SIMD boolean vectors 9 years ago sajjadt 73.70 KB, patch	bbouvier : feedback+	Details \| Diff \| Splinter Review
WIP patch for supporting SIMD boolean vectors in interpreter and JIT 9 years ago sajjadt 141.84 KB, patch		Details \| Diff \| Splinter Review
WIP patch for supporting SIMD boolean vectors in interpreter 9 years ago sajjadt 65.32 KB, patch	bbouvier : feedback+	Details \| Diff \| Splinter Review
WIP patch for supporting SIMD boolean vectors in JIT 9 years ago sajjadt 76.69 KB, patch		Details \| Diff \| Splinter Review
Implementing bool vector for the interpreter 9 years ago sajjadt 66.10 KB, patch		Details \| Diff \| Splinter Review
Tests for SIMD boolean vectors 9 years ago sajjadt 77.75 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
1. Interpreter changes 9 years ago Benjamin Bouvier [:bbouvier] (inactive) 71.61 KB, patch	jolesen : review+	Details \| Diff \| Splinter Review
2. Tests 9 years ago Benjamin Bouvier [:bbouvier] (inactive) 77.81 KB, patch	jolesen : review+	Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter 9 years ago Jakob Stoklund Olesen [:jolesen] 66.45 KB, patch		Details \| Diff \| Splinter Review
Part 2: JSAPI/JIT tests for SIMD bool vector implementation. 9 years ago Jakob Stoklund Olesen [:jolesen] 78.43 KB, patch		Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 68.55 KB, patch		Details \| Diff \| Splinter Review
Part 4: Convert boolean scalars to int32. 9 years ago Jakob Stoklund Olesen [:jolesen] 12.37 KB, patch		Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter 9 years ago Jakob Stoklund Olesen [:jolesen] 71.82 KB, patch		Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter 9 years ago Jakob Stoklund Olesen [:jolesen] 71.82 KB, patch		Details \| Diff \| Splinter Review
Part 2: JSAPI/JIT tests for SIMD bool vector implementation. 9 years ago Jakob Stoklund Olesen [:jolesen] 78.43 KB, patch		Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 76.02 KB, patch		Details \| Diff \| Splinter Review
Part 4: Delete signMask and selectBits. 9 years ago Jakob Stoklund Olesen [:jolesen] 78.00 KB, patch		Details \| Diff \| Splinter Review
Part 5: Reconcile asm.js SIMD opcodes 9 years ago Jakob Stoklund Olesen [:jolesen] 37.87 KB, patch		Details \| Diff \| Splinter Review
Part 6: ASM.js boolean vectors. 9 years ago Jakob Stoklund Olesen [:jolesen] 103.11 KB, patch		Details \| Diff \| Splinter Review
Part 5: ASM.js boolean vectors. 9 years ago Jakob Stoklund Olesen [:jolesen] 115.29 KB, patch	luke : feedback+	Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter. 9 years ago Jakob Stoklund Olesen [:jolesen] 77.08 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
Part 2: JSAPI/JIT tests for SIMD bool vector implementation. 9 years ago Jakob Stoklund Olesen [:jolesen] 83.79 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 90.92 KB, patch		Details \| Diff \| Splinter Review
Part 4: Delete signMask and selectBits. 9 years ago Jakob Stoklund Olesen [:jolesen] 81.46 KB, patch		Details \| Diff \| Splinter Review
Part 5: ASM.js boolean vectors. 9 years ago Jakob Stoklund Olesen [:jolesen] 136.21 KB, patch		Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter. 9 years ago Jakob Stoklund Olesen [:jolesen] 77.02 KB, patch		Details \| Diff \| Splinter Review
Part 2: JSAPI/JIT tests for SIMD bool vector implementation. 9 years ago Jakob Stoklund Olesen [:jolesen] 81.53 KB, patch		Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 85.38 KB, patch		Details \| Diff \| Splinter Review
Part 4: Delete signMask and selectBits. 9 years ago Jakob Stoklund Olesen [:jolesen] 93.30 KB, patch		Details \| Diff \| Splinter Review
Part 5: ASM.js boolean vectors. 9 years ago Jakob Stoklund Olesen [:jolesen] 138.44 KB, patch		Details \| Diff \| Splinter Review
Part 1: SIMD bool vector implementation for the interpreter. 9 years ago Jakob Stoklund Olesen [:jolesen] 77.02 KB, patch		Details \| Diff \| Splinter Review
Part 2: JSAPI/JIT tests for SIMD bool vector implementation. 9 years ago Jakob Stoklund Olesen [:jolesen] 88.52 KB, patch		Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 85.38 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
Part 4: Delete signMask and selectBits. 9 years ago Jakob Stoklund Olesen [:jolesen] 105.24 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
Part 5: ASM.js boolean vectors. 9 years ago Jakob Stoklund Olesen [:jolesen] 140.38 KB, patch	bbouvier : review+	Details \| Diff \| Splinter Review
Part 3: SIMD boolean vector support for JIT. 9 years ago Jakob Stoklund Olesen [:jolesen] 83.09 KB, patch		Details \| Diff \| Splinter Review