Bug 1694191 Comment 0 Edit History


The codegen for extractLaneInt64x2 is specialized to x86 and x64. It lowers to a single vpextrq. But a couple of optimizations that the other extract-lane operations have are missing here, *if the lane is zero*: a move can be used instead (if we believe that will be faster), and, more importantly, if the src and dest registers are the same then this operation should generate no code at all.
The codegen for extractLaneInt64x2 is specialized to x86 and x64. It lowers to a single vpextrq. But a couple of optimizations that the other extract-lane operations have are missing here, *if the lane is zero*: a move can be used instead to copy the value from src to dest (extract-lane is pretty slow, according to the manual), and if the src and dest registers are the same then this operation should generate no code at all.
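
A rough sketch of the selection logic being asked for, not the actual MacroAssembler code. The names `RegId`, `ExtractKind`, `chooseExtractLane64` and `mnemonic` are made up for illustration, registers are modeled as plain integers, and `vmovq` is only an assumed choice for the move. The sketch does not model register classes, so whether src and dest can really coincide for this operation is left open.

```cpp
#include <cassert>
#include <string>

// Hypothetical register id. Real codegen distinguishes SIMD and integer
// register classes; this sketch deliberately does not.
using RegId = int;

// What the lowering would emit for a 64-bit lane extraction.
enum class ExtractKind {
    None,     // src == dest and lane 0: the value is already in place
    Move,     // lane 0, different registers: a plain move is enough
    Extract   // any other lane: fall back to the extract instruction
};

// Pick the cheapest lowering for "extract 64-bit lane `lane` of src into dest".
ExtractKind chooseExtractLane64(unsigned lane, RegId src, RegId dest) {
    assert(lane < 2 && "an int64x2 vector has exactly two lanes");
    if (lane == 0) {
        return src == dest ? ExtractKind::None : ExtractKind::Move;
    }
    return ExtractKind::Extract;
}

// Assumed mnemonic for each choice; the empty string means "emit nothing".
std::string mnemonic(ExtractKind kind) {
    switch (kind) {
        case ExtractKind::None:    return "";
        case ExtractKind::Move:    return "vmovq";    // assumption, see lead-in
        case ExtractKind::Extract: return "vpextrq";
    }
    return "";
}
```

With this shape, lane 1 always takes the existing vpextrq path, so only the lane-zero case changes behavior.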
