Closed Bug 1690483 Opened 5 years ago Closed 4 years ago

SIMD optimization x64/x86: Better code for variable swizzle

Tracking

Status: RESOLVED FIXED

Milestone: 95 Branch

Tracking Flags: firefox95 (tracking: ---, status: fixed)

People

(Reporter: lth, Assigned: yury)

References

(Blocks 1 open bug)

Details

Attachments

(1 file)

Bug 1690483 - Use saturating add for mask of SIMD swizzle. r?lth (4 years ago, Yury Delendik (:yury), 48 bytes, text/x-phabricator-request)

Lars T Hansen [:lth] (Reporter) • Description • 5 years ago • Edited

The variable swizzle on Intel can use PSHUFB to shuffle the bytes, but the mask vector must first be sanitized so that out-of-range lanes in the mask have the high bit set. Currently we use a compare-with-constant followed by a POR to do this (and we don't even inline the constant load in the compare, sigh), but it's possible to do better by saturating-adding a constant into the mask: https://github.com/WebAssembly/simd/issues/68#issuecomment-470825324
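To make the trick concrete, here is a minimal scalar sketch in Python of why a saturating add sanitizes the mask. It is a model, not the SpiderMonkey code: the helper names `pshufb`, `paddusb`, `swizzle`, and `swizzle_reference` are hypothetical, and the constant 0x70 is an assumption not stated in this bug (it is the value for which in-range indices 0..15 keep their low four bits with the high bit clear, while every index of 16 or more ends up with the high bit set).

```python
LANES = 16

def pshufb(src, mask):
    """Scalar model of PSHUFB: a lane is zeroed if the mask byte has its
    high bit set, otherwise the low 4 bits of the mask byte select a source byte."""
    return [0 if m & 0x80 else src[m & 0x0F] for m in mask]

def paddusb(a, b):
    """Scalar model of PADDUSB: per-byte unsigned add, saturating at 0xFF."""
    return [min(x + y, 0xFF) for x, y in zip(a, b)]

def swizzle(src, indices):
    """Variable swizzle via saturating add plus PSHUFB.
    The constant 0x70 is an assumption (see the lead-in)."""
    sanitized = paddusb(indices, [0x70] * LANES)
    return pshufb(src, sanitized)

def swizzle_reference(src, indices):
    """Intended swizzle semantics: out-of-range indices select zero."""
    return [src[i] if i < LANES else 0 for i in indices]

if __name__ == "__main__":
    src = list(range(100, 116))
    idx = [0, 15, 16, 17, 31, 127, 128, 255, 1, 2, 3, 4, 5, 6, 7, 8]
    print(swizzle(src, idx))
    print(swizzle_reference(src, idx))
```

In this model, indices 0..15 map to 0x70..0x7F, so PSHUFB sees the original low nibble with the high bit clear, and indices 16..255 map to 0x80..0xFF, so PSHUFB zeroes the lane. That matches the reference semantics for every possible index byte without a separate compare and OR.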

For specific code generation, I'm not sure whether it's better to (a) splat a byte value into scratch (or load the constant into scratch) and then add the mask to the scratch, or (b) move the mask to scratch and then add a constant from memory into scratch. Either way the mask register is not volatile.

Also see https://github.com/WebAssembly/simd/issues/93 for more discussion, probably worth reading although it ranges across a bunch of topics.

Yury Delendik (:yury) (Assignee) • Comment 1 • 4 years ago

Attached file: Bug 1690483 - Use saturating add for mask of SIMD swizzle. r?lth

Phabricator Automation • Updated • 4 years ago

Assignee: nobody → ydelendik

Status: NEW → ASSIGNED

Yury Delendik (:yury) (Assignee) • Comment 2 • 4 years ago

There is a gain of roughly 3-4% in a local microbenchmark test.

Pulsebot • Comment 3 • 4 years ago

Pushed by ydelendik@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/ee2bc38e681e Use saturating add for mask of SIMD swizzle. r=lth

Noemi Erli[:noemi_erli] • Comment 4 • 4 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/ee2bc38e681e

Status: ASSIGNED → RESOLVED

Closed: 4 years ago

status-firefox95: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 95 Branch


Categories

(Core :: JavaScript: WebAssembly, enhancement, P3)
