Closed Bug 1639464 Opened 5 years ago Closed 5 years ago

Optimize SIMD v8x16.shuffle in Ion x86_64

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla78

Tracking Flags:

Tracking

Status

firefox78

---

fixed

People

(Reporter: lth, Assigned: lth)

References

Details

Attachments

(1 file)

Bug 1639464 - wasm ion simd: optimize v8x16.shuffle. r=jseward 5 years ago Lars T Hansen [:lth] 47 bytes, text/x-phabricator-request		Details \| Review

Lars T Hansen [:lth]

Assignee

Description

•

5 years ago

The v8x16.shuffle opcode is a very general workhorse for wasm SIMD, performing byte shuffle and blend. The straightforward implementation is expensive, equivalent to at least a dozen simple instructions (CONST + PSHUFB + CONST + PSHUFB + POR on x86). In many cases, the patterns are simple and can be lowered to a small number of instructions. We should recognize a number of these patterns and lower to better code.

Lars T Hansen [:lth]

Assignee

Comment 1

•

5 years ago

Attached file Bug 1639464 - wasm ion simd: optimize v8x16.shuffle. r=jseward — Details

Implement some shuffle specializations in the MacroAssembler interface
(permutations, interleaves, concat-and-shift) and then add code to the
Ion x64 back-end to pattern match the shuffle masks and map as many
cases as we can to these specializations.

The pattern matcher is simple: it sorts instructions into buckets of
single-operand, single-operand-with-zero, and dual-operand, and then
matches patterns on the shuffle mask in a fixed order from what is
perceived as least expensive to most expensive. The matcher is
optimized for clarity, not for speed, since it will run very rarely.

The patterns I've chosen are inspired by the SSE instruction set, the
v8 code, and the SIMD.js code. More can be added; some TODO remarks
are left in the code to indicate this.

A simple test infrastructure is added and used to ensure that
optimizations are triggered (and not triggered) as expected.

Currently the pattern matcher is in x64-specific code, since we only
support x64. But it will move without any substantive changes into
x86-shared code when we add x86 support (bug 1637332), and it is
mostly platform-independent and can eventually move into shared code,
possibly with some platform hooks and some extensions, when we add
arm64 support.

The matcher can also be used to optimize baseline code, should we wish
to do that.

Lars T Hansen [:lth]

Assignee

Comment 2

•

5 years ago

Memo to self: we don't have to use PALIGNR for byte shifting the vector; we have PSLLDQ and PSRLDQ for that case and can avoid generating a zero or futzing with operand order.

Lars T Hansen [:lth]

Assignee

Updated

•

5 years ago

Blocks: 1639517

Phabricator Automation

Updated

•

5 years ago

Attachment #9150404 - Attachment description: Bug 1639464 - wasm ion simd: optimize v8x16.shuffle. r?jseward → Bug 1639464 - wasm ion simd: optimize v8x16.shuffle. r=jseward

Pulsebot

Comment 3

•

5 years ago

Pushed by lhansen@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/2d1bf65618ad wasm ion simd: optimize v8x16.shuffle. r=jseward

Cristina Coroiu [:ccoroiu]

Comment 4

•

5 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/2d1bf65618ad

Status: ASSIGNED → RESOLVED

Closed: 5 years ago

status-firefox78: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → mozilla78

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Optimize SIMD v8x16.shuffle in Ion x86_64

Categories

(Core :: JavaScript: WebAssembly, enhancement, P2)

Tracking

()

People

(Reporter: lth, Assigned: lth)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Comment 2

Updated

Updated

Comment 3

Comment 4

Attachment

General

Description

File Name

Content Type