Closed Bug 1073064 Opened 10 years ago Closed 10 years ago

SIMD: Implement logical operators (lsh, rhs and ursh) and add support in Odin

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla35

People

(Reporter: bbouvier, Assigned: bbouvier)

References

Details

Attachments

(2 files, 1 obsolete file)

SIMD x86-x64 backend: Implement MSimdBinaryLogical 10 years ago Benjamin Bouvier [:bbouvier] (inactive) 15.63 KB, patch		Details \| Diff \| Splinter Review
SIMD: Add int32x4.shift{left,right,rightLogical} to asm.js 10 years ago Benjamin Bouvier [:bbouvier] (inactive) 8.18 KB, patch	luke : review+	Details \| Diff \| Splinter Review
1) SimdShift 10 years ago Benjamin Bouvier [:bbouvier] (inactive) 18.89 KB, patch	sunfish : review+	Details \| Diff \| Splinter Review

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Description

•

10 years ago

SIMD.int32x4.shiftLeft(vec, val) => [vec.x << val, vec.y << val, vec.z << val, vec.w << val]

SIMD.int32x4.shiftRight ~rsh
SIMD.int32x4.shiftRightLogical ~ursh

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Comment 1

•

10 years ago

Attached patch SIMD x86-x64 backend: Implement MSimdBinaryLogical (obsolete) — Details — Splinter Review

I also wanted to implement float32x4.scale in the same MIR node, as they relate
by having a first vector arg and second scalar arg. Actually, scale has the
same issue as withFlags and zero(): it doesn't map to an x86 instruction and it
can be emulated with other operators (I've added it in the github discussion).

Attachment #8495392 - Flags: review?(sunfish)

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Comment 2

•

10 years ago

Attached patch SIMD: Add int32x4.shift{left,right,rightLogical} to asm.js — Details — Splinter Review

With the last refactorings, adding an operation to odin feels even easier.

Attachment #8495393 - Flags: review?(luke)

Luke Wagner [:luke]

Comment 3

•

10 years ago

Comment on attachment 8495393 [details] [diff] [review]
SIMD: Add int32x4.shift{left,right,rightLogical} to asm.js

Review of attachment 8495393 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/asmjs/AsmJSValidate.cpp
@@ +2463,5 @@
>          curBlock_->add(ins);
>          return ins;
>      }
>  
> +    MDefinition *binarySimd(MDefinition *lhs, MDefinition *rhs, MSimdBinaryLogical::Operation op)

I think you could have a:
  template <class T>
  MDefinition *binarySimd(MDefininition *lhs, MDefinition *rhs, T::Operation op, MIRType type);
that collapsed this with the Arith and Bitwise overloads.

Attachment #8495393 - Flags: review?(luke) → review+

Dan Gohman [:sunfish]

Comment 4

•

10 years ago

Comment on attachment 8495392 [details] [diff] [review]
SIMD x86-x64 backend: Implement MSimdBinaryLogical

Review of attachment 8495392 [details] [diff] [review]:
-----------------------------------------------------------------

::: js/src/jit/LIR-Common.h
@@ +325,5 @@
>          return mir_->toSimdBinaryBitwise()->operation();
>      }
>  };
>  
> +class LSimdBinaryLogical : public LInstructionHelper<1, 2, 0>

Since this class expects its right operand to be an int32, rather than a SIMD value, it's pretty specific to shifting (and maybe also rotating). How about LSimdShift? Or maybe LSimdShiftUniform?

::: js/src/jit/MIR.h
@@ +1717,5 @@
>  
>      ALLOW_CLONE(MSimdBinaryBitwise)
>  };
>  
> +class MSimdBinaryLogical : public MBinaryInstruction

If you rename LSimdBinaryLogical, keep this analogous.

::: js/src/jit/shared/CodeGenerator-x86-shared.cpp
@@ +2538,5 @@
> +    FloatRegister out = ToFloatRegister(ins->output());
> +    MOZ_ASSERT(vec == out); // defineReuseInput(0);
> +
> +    FloatRegister tmp = ScratchFloat32Reg;
> +    masm.movd(val, tmp);

Shifting by constant is likely to be our main use case, since ARM appears to lack a non-immediate shift. On Intel, shift-by-immediate is 1 clock, while shifting by a dynamic value is 2 clocks plus 1 for the movd plus 1 to put the count in a gpr for the movd. Consequently, I think it's important to optimize the case where the shift count is a constant now rather than later. It should just be a useRegisterOrConstant in the lowering code, assembler support for the immediate variants, and code here to handle the immediate case separate from the dynamic case.

There's also the question here of what to do when the shift is out of range. I filed this:
https://github.com/johnmccutchan/ecmascript_simd/issues/72
and I think just keeping the x86 behavior here for now is fine.

::: js/src/jit/shared/MacroAssembler-x86-shared.h
@@ +524,5 @@
>      void packedSubInt32(const Operand &src, FloatRegister dest) {
>          psubd(src, dest);
>      }
>  
> +    void packedLeftShift(FloatRegister src, FloatRegister dest) {

These names are a little ambiguous given that they shift all elements of dest by the count in the *first* element of src, in contrast to e.g. vpsllvd which shifts elements of dest by the counts in the *corresponding* elements of src (and is the more intuitive "vectorized" version of a shift, at least to me). How about packedLeftShiftUniform?

Attachment #8495392 - Flags: review?(sunfish)

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Comment 5

•

10 years ago

(In reply to Luke Wagner [:luke] from comment #3)
> Comment on attachment 8495393 [details] [diff] [review]
> SIMD: Add int32x4.shift{left,right,rightLogical} to asm.js
> 
> Review of attachment 8495393 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> ::: js/src/asmjs/AsmJSValidate.cpp
> @@ +2463,5 @@
> >          curBlock_->add(ins);
> >          return ins;
> >      }
> >  
> > +    MDefinition *binarySimd(MDefinition *lhs, MDefinition *rhs, MSimdBinaryLogical::Operation op)
> 
> I think you could have a:
>   template <class T>
>   MDefinition *binarySimd(MDefininition *lhs, MDefinition *rhs, T::Operation
> op, MIRType type);
> that collapsed this with the Arith and Bitwise overloads.

Thanks for the review! Unfortunately, T::Operation is a dependent type so the compiler cannot infer which type is that. If you're okay with explicit instanciation of the template, I can do it. Also, this could be collapsed with the SimdBinaryComp overload only, as the signatures with Arith and Bitwise are different (they include a MIRType).

Flags: needinfo?(luke)

Luke Wagner [:luke]

Comment 6

•

10 years ago

Yeah, explicit is fine and we do that in many other FunctionCompiler method calls.

Flags: needinfo?(luke)

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Updated

•

10 years ago

Blocks: 1068028

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Comment 7

•

10 years ago

Attached patch 1) SimdShift — Details — Splinter Review

With comments addressed. As discussed on IRC, i've used packed*ShiftByScalar rather than Uniform.

Attachment #8495392 - Attachment is obsolete: true

Attachment #8495992 - Flags: review?(sunfish)

Dan Gohman [:sunfish]

Comment 8

•

10 years ago

Comment on attachment 8495992 [details] [diff] [review]
1) SimdShift

Review of attachment 8495992 [details] [diff] [review]:
-----------------------------------------------------------------

Looks good, thanks!

Attachment #8495992 - Flags: review?(sunfish) → review+

Benjamin Bouvier [:bbouvier] (inactive)

Assignee

Comment 9

•

10 years ago

remote:   https://hg.mozilla.org/integration/mozilla-inbound/rev/e6e7586b3e02
remote:   https://hg.mozilla.org/integration/mozilla-inbound/rev/4baa041973f8

Wes Kocher (:KWierso) (Not reading bugmail; email directly if needed)

Comment 10

•

10 years ago

https://hg.mozilla.org/mozilla-central/rev/e6e7586b3e02
https://hg.mozilla.org/mozilla-central/rev/4baa041973f8

Status: ASSIGNED → RESOLVED

Closed: 10 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla35

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

SIMD: Implement logical operators (lsh, rhs and ursh) and add support in Odin

Categories

(Core :: JavaScript Engine: JIT, defect)

Tracking

()

People

(Reporter: bbouvier, Assigned: bbouvier)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(2 files, 1 obsolete file)

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Updated

Comment 7

Comment 8

Comment 9

Comment 10

Attachment

General

Description

File Name

Content Type