Closed Bug 1712321 Opened 3 years ago Closed 3 years ago

64-bit imul by small constant yields unnecessary constant setup

Tracking

()

Status:

RESOLVED FIXED

Milestone:

91 Branch

Tracking Flags:

Tracking

Status

firefox91

---

fixed

People

(Reporter: lth, Assigned: lukas.bernhard, Mentored)

References

(Blocks 1 open bug)

Details

(Keywords: good-first-bug)

Attachments

(1 file)

Bug 1712321 - Remove unnecessary constant setup for 64-bit imul. r=lth 3 years ago lukas.bernhard 48 bytes, text/x-phabricator-request		Details \| Review

Lars T Hansen [:lth]

Reporter

Description

•

3 years ago

A wasm i64 multiply by 5 yields this code:

0000002A  41 bb 05 00 00 00         mov $0x05, %r11d
00000030  49 0f af c3               imul %r11, %rax

which is because of this code in MacroAssembler-x64-inl.h:

void MacroAssembler::mul64(Imm64 imm, const Register64& dest) {
  movq(ImmWord(uintptr_t(imm.value)), ScratchReg);
  imulq(ScratchReg, dest.reg);
}

which seems to be an artifact of there not being an imulq variant in the assembler that takes an immediate operand, though there is a variant that takes an imm32 in the instruction set.

Lars T Hansen [:lth]

Reporter

Updated

•

3 years ago

Mentor: lhansen

Keywords: good-first-bug

Nicolas B. Pierron [:nbp]

Comment 1

•

3 years ago

If we do not need overflow checks, we should implement it as:

  lea [%rax + %rax * 4], %r11

Mayank Bansal

Updated

•

3 years ago

Comment 2

•

3 years ago

Indeed, better strength reduction would be good too! Strength reduction is pretty ad-hoc at this point: -1, 0, 1, 2, and other powers of 2, but no work on (2^n)+1 (becomes LEA for small n), reduction to a sequence of adds, etc.

Comment 3

•

3 years ago

I would like to work on this.

Lars T Hansen [:lth]

Reporter

Comment 4

•

3 years ago

(In reply to Mang Yau from comment #3)

I would like to work on this.

Sure thing. I think you should disregard the discussion in comment 1 and later for now, and only try to fix the problem outlined in comment 0.

This bug may be a little challenging, not because it's hard but because there are a number of concepts and a fair amount of code that you may need to understand to fix it.

lukas.bernhard

Assignee

Comment 5

•

3 years ago

Mang Yau are you still on this issue or can I give it a try?

Mang Yau

Comment 6

•

3 years ago

(In reply to lukas.bernhard from comment #5)

Mang Yau are you still on this issue or can I give it a try?

Hey Lukas, I have been a bit busy and haven't had much time to dedicate to this issue. You can give it a try :)

lukas.bernhard

Assignee

Comment 7

•

3 years ago

Attached file Bug 1712321 - Remove unnecessary constant setup for 64-bit imul. r=lth — Details

Phabricator Automation

Updated

•

3 years ago

Assignee: nobody → lukas.bernhard

Status: NEW → ASSIGNED

lukas.bernhard

Assignee

Comment 8

•

3 years ago

While working on the bug and reading surrounding multiplication code I noticed pop2xI64ForMulI64 (on x64) makes (seemingly) unnecessary assumptions about rdx being clobbered; this shouldn't be the case for a reg*reg multiplication so a pop2xI64(r0, r1); should be sufficient.
Furthermore: while the constant setup has been removed as described in the ticket the generated code remains suboptimal. In particular, multiplication now emits code such as:

movq rcx, rax
imulq imm8/32, rax, rax

Instead, a imulq imm8/32, rcx, rax would be sufficient, saving one reg->reg move (imul with immediate can select source register and dest register independently).
inline void mul64(Imm64 src1, const Register64& src2, const Register64& dest) could be made available on x64, with a special case if src2==dest + src1 cannot be encoded as immediate.
Generating this code seems to require changes to CodeGenerator::visitMulI64 in CodeGenerator-x86-shared.cpp (due to some code being shared with x86-32); if this change is desirable I can give it a try (or work on something else if optimizing a mov reg->reg is deemed too effortful).

Lars T Hansen [:lth]

Reporter

Comment 9

•

3 years ago

Patches are in general welcome (do file new bugs for new issues). pop2xI64ForMulI64 is in the baseline compiler so we may not care all that much about micro-optimizations like these, but anything that affects code generation in ion (the optimizing compiler) is interesting in principle.

Pulsebot

Comment 10

•

3 years ago

Pushed by archaeopteryx@coole-files.de:
https://hg.mozilla.org/integration/autoland/rev/574004fd100a
Remove unnecessary constant setup for 64-bit imul. r=lth

Atila Butkovits

Comment 11

•

3 years ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/574004fd100a

Status: ASSIGNED → RESOLVED

Closed: 3 years ago

status-firefox91: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 91 Branch

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

64-bit imul by small constant yields unnecessary constant setup

Categories

(Core :: JavaScript Engine: JIT, enhancement, P3)

Tracking

()

People

(Reporter: lth, Assigned: lukas.bernhard, Mentored)

References

(Blocks 1 open bug)

Details

(Keywords: good-first-bug)

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Updated

Comment 1

Updated

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Updated

Comment 8

Comment 9

Comment 10

Comment 11

Attachment

General

Description

File Name

Content Type