Closed Bug 1699192 Opened 4 years ago Closed 3 years ago

[exploration] Experiment with AVX encoding and (maybe) assumed-aligned loads in simd wormhole

Tracking

()

Status:

RESOLVED WONTFIX

People

(Reporter: lth, Unassigned)

References

(Blocks 2 open bugs)

Details

Lars T Hansen [:lth]

Reporter

Description

•

4 years ago

Since it's the workhorse of inner loops in the machine learning codes, the WHPMADDUBSW operation could usefully use an AVX encoding (to avoid clobbering a register whose value is needed, thus necessitating an additional move to preserve that value). This will be a little tricky, because we do not want to enable AVX for any other instructions at all, yet the encoding is chosen fairly deep down in the pipeline. Probably this means changing the AVX test in the encoder from if (AVXPresent(...)) { ... } else { ... } to if (AVXPresent(...) || op == WHPMADDUBSW && AVXReallyPresent(...) { ... } else { ... } since the AVXPresent predicate is subject to various switches that are off (and shall remain off).

Another issue here is that we're not able to fuse a v128.load into a WHPMADDUBSW. I'm not sure how valuable this is - if the code preloads a bunch of registers and then operates on them then there's no sense in trying to fuse anything, but if it consists of load-and-operate pairs then the matter is different. But the problem is that fusing only works if the load is aligned, and we have no guarantee of that. We could do an exception handler fixup of unaligned loads but this is basically going to be a mess. But for starters we could look at the code to see if it would match the pattern, and if it does then we could experimentally try for a fusing, and then we could measure the result to see if there's an improvement.

Related discussion here: https://github.com/mozilla-extensions/bergamot-browser-extension/issues/75

Lars T Hansen [:lth]

Reporter

Updated

•

4 years ago

Comment 1

•

4 years ago

We may solve this differently and it's not a priority right now to investigate this.

Assignee: lhansen → nobody

Status: ASSIGNED → NEW

Lars T Hansen [:lth]

Reporter

Updated

•

4 years ago

Blocks: 1713056

Lars T Hansen [:lth]

Reporter

Updated

•

3 years ago

Type: enhancement → task

Summary: Experiment with AVX encoding and (maybe) assumed-aligned loads in simd wormhole → [exploration] Experiment with AVX encoding and (maybe) assumed-aligned loads in simd wormhole

Yury Delendik (:yury)

Comment 2

•

3 years ago

This optimization is too narrow. Also, looking at intgemm multiply code, it is rarely direct memory operands for pmaddubsw.

Ryan Hunt [:rhunt]

Comment 3

•

3 years ago

We're intending to phase out the wormhole.

Status: NEW → RESOLVED

Closed: 3 years ago

Resolution: --- → WONTFIX

You need to log in before you can comment on or make changes to this bug.

Bugzilla

[exploration] Experiment with AVX encoding and (maybe) assumed-aligned loads in simd wormhole

Categories

(Core :: JavaScript: WebAssembly, task, P3)

Tracking

()

People

(Reporter: lth, Unassigned)

References

(Blocks 2 open bugs)

Details

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Updated

Updated

Comment 2

Comment 3