Closed Bug 900756 Opened 11 years ago Closed 11 years ago

Ionmonkey (ARM): add float32 support

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla27

People

(Reporter: dougc, Assigned: jonco)

References

(Blocks 1 open bug)

Details

(Whiteboard: [games])

Attachments

(3 files, 3 obsolete files)

float32-ARM-1 11 years ago Jon Coppeard (:jonco) 36.67 KB, patch	mjrosenb : review+	Details \| Diff \| Splinter Review
float32-ARM-2 11 years ago Jon Coppeard (:jonco) 12.15 KB, patch	mjrosenb : review+	Details \| Diff \| Splinter Review
float32-ARM-1 11 years ago Jon Coppeard (:jonco) 36.31 KB, patch	jonco : review+	Details \| Diff \| Splinter Review
float32-ARM-2 11 years ago Jon Coppeard (:jonco) 12.27 KB, patch	jonco : review+	Details \| Diff \| Splinter Review
Combined rebased patch. 11 years ago Douglas Crosher [:dougc] 44.97 KB, patch		Details \| Diff \| Splinter Review
Rebased combined patch 11 years ago Douglas Crosher [:dougc] 45.11 KB, patch		Details \| Diff \| Splinter Review

Douglas Crosher [:dougc]

Reporter

Description

•

11 years ago

Bug 888109 adds general float32 support. ARM support just requires a little backend support. It might be simplest for now to just use half of a double register pair on the ARM rather than trying to pack float32s into all available registers.
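
For reference, the "half of a double register pair" idea relies on how the VFP register file aliases: architecturally, each of d0 to d15 overlaps two single-precision registers, s(2n) and s(2n+1). A minimal sketch of that index mapping is below. The function names are hypothetical, chosen for illustration, and are not the SpiderMonkey VFPRegister API.

```cpp
// Hypothetical helpers illustrating VFP register aliasing; not SpiderMonkey code.
// Architecturally, d<n> (n in 0..15) overlaps s<2n> (low half) and s<2n+1> (high half).

// Index of the single-precision register occupying the low half of d<n>,
// or -1 for d16..d31, which have no single-precision aliases.
int lowSingleOfDouble(int d) {
    return (d >= 0 && d < 16) ? 2 * d : -1;
}

// Index of the double register that contains s<n> (n in 0..31), or -1 if out of range.
int doubleContainingSingle(int s) {
    return (s >= 0 && s < 32) ? s / 2 : -1;
}
```

Using only the low half of each pair leaves the odd-numbered single registers unused, which is the simplification the description proposes.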

Jon Coppeard (:jonco)

Assignee

Updated

•

11 years ago

Assignee: general → jcoppeard

Jon Coppeard (:jonco)

Assignee

Comment 1

•

11 years ago

Attached patch float32-ARM-1 (obsolete) — Details — Splinter Review

This patch provides assembler / macro assembler support for float32 operations. I added Float32Encoder for float32 immediate constants and added a bit in PoolHintData to signify whether the destination register is double or float32 for floating point constant loads. Currently the patch adds float32 constants to the double pool, wasting 4 bytes each time. Another option would be for them to have their own pool, but I don't know enough about how pools work to know whether this is worth it.
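
To make the 4-byte cost concrete, here is a toy model of a pool whose slots are all 8 bytes wide, as if every constant were a double. It is only a sketch under that assumption: `ToyDoublePool` and its members are made up for illustration and do not reflect the real assembler buffer or PoolHintData.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Toy model only: a pool of 8-byte slots. Not the real assembler buffer.
struct ToyDoublePool {
    std::vector<uint64_t> slots;
    std::size_t paddingBytes = 0;

    // Returns the byte offset of the new constant within the pool.
    std::size_t addDouble(double d) {
        uint64_t bits;
        std::memcpy(&bits, &d, sizeof bits);
        slots.push_back(bits);
        return (slots.size() - 1) * sizeof(uint64_t);
    }

    // A float32 takes a whole slot: 4 bytes of payload, 4 bytes of padding.
    std::size_t addFloat32(float f) {
        uint32_t bits;
        std::memcpy(&bits, &f, sizeof bits);
        slots.push_back(bits);  // the other 32 bits of the slot stay zero
        paddingBytes += sizeof(uint64_t) - sizeof(uint32_t);
        return (slots.size() - 1) * sizeof(uint64_t);
    }
};
```

In this model, every float32 constant adds 4 to `paddingBytes`; a separate pool of 4-byte slots would avoid that at the cost of managing a second pool.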

Attachment #796708 - Flags: review?(mrosenberg)

Jon Coppeard (:jonco)

Assignee

Comment 2

•

11 years ago

Attached patch float32-ARM-2 (obsolete) — Details — Splinter Review

Code generator and lowering support.

Attachment #796710 - Flags: review?(mrosenberg)

Marty Rosenberg [:mjrosenb]

Comment 3

•

11 years ago

Comment on attachment 796708
float32-ARM-1

Review of attachment 796708:
-----------------------------------------------------------------

::: js/src/jit/arm/Assembler-arm.cpp
@@ +1781,5 @@
> +    /*
> +     * Insert floats into the double pool as they have the same limitations on
> +     * immediate offset. This wastes 4 bytes padding per float. An alternative
> +     * would be to have a separate pool for floats.
> +     */

Quite sad. The Assembler Buffer re-write should fix this.

::: js/src/jit/arm/MacroAssembler-arm.cpp
@@ +131,5 @@
> +}
> +
> +void
> +MacroAssemblerARM::convertInt32ToFloat32(const Register &src, const FloatRegister &dest_) {
> +    // direct conversions aren't possible.

I assume that "direct conversions" means doing an int32 -> float32 conversion and the gpr -> vfp transfer in one instruction?

@@ +140,5 @@
> +}
> +
> +void
> +MacroAssemblerARM::convertInt32ToFloat32(const Address &src, FloatRegister dest) {
> +    ma_ldr(Operand(src), ScratchRegister);

You should be able to load the int32 directly into the dest register, then do the int32 -> float32 conversion in that register. You should probably use the scratch, since I've heard that immediately overwriting a register is bad for perf. Since the vfp's offsets are more limited, this method may not save any instructions, but it should at the very least save a synchronization point.
Now that I think about it, this can almost certainly be applied to convertInt32ToFloat64 (or whatever it is actually called).

@@ +1323,5 @@
> +{
> +    as_vadd(VFPRegister(dst).singleOverlay(), VFPRegister(src1).singleOverlay(),
> +            VFPRegister(src2).singleOverlay());
> +}
> +

*idly wonders if we can have these functions take VFPRegisters*

@@ +1436,5 @@
> +    VFPRegister vd = VFPRegister(dest).singleOverlay();
> +    uint32_t spun = *reinterpret_cast<uint32_t*>(&value);
> +    if (hasVFPv3()) {
> +        if (spun == 0) {
> +            // To zero a register, load 1.0, then execute dN <- dN - dN

Minor nit: those should be sN.

@@ +1443,5 @@
> +            as_vsub(vd, vd, vd, cc);
> +            return;
> +        }
> +
> +        VFPImm floatEnc = VFPImm::forFloat32(spun);

Fwiw, the set of encodings for float32 and float64 is identical, so you should be able to just cast the float value to a double, and call the preexisting routine.
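
The last point can be sketched as follows. In the VFPv3 VMOV immediate form, an 8-bit immediate `abcdefgh` expands to a sign bit `a`, an exponent of the form `NOT(b)`, repeated `b`, then `cd`, and a fraction `efgh` followed by zeros, for both single and double precision, so both widths encode the same set of values. The helper names below are hypothetical and are not the VFPImm API from the patch; returning -1 for "not encodable" is also just a choice made for this sketch.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper, not the SpiderMonkey VFPImm API.
// Returns the 8-bit VFPv3 VMOV immediate (bits abcdefgh) for a double,
// or -1 if the value cannot be encoded.
int encodeVfpImm8(double value) {
    uint64_t bits;
    std::memcpy(&bits, &value, sizeof bits);

    // Only the top four fraction bits (efgh) may be non-zero.
    if (bits & ((uint64_t(1) << 48) - 1))
        return -1;

    uint32_t sign = uint32_t(bits >> 63);          // a
    uint32_t exp  = uint32_t(bits >> 52) & 0x7ff;  // 11-bit biased exponent
    uint32_t frac = uint32_t(bits >> 48) & 0xf;    // efgh

    // The exponent must have the form NOT(b) : bbbbbbbb : cd.
    uint32_t top    = exp >> 10;
    uint32_t middle = (exp >> 2) & 0xff;
    if (middle != 0x00 && middle != 0xff)
        return -1;
    uint32_t b = middle & 1;
    if (top == b)
        return -1;

    return int((sign << 7) | (b << 6) | ((exp & 3) << 4) | frac);
}

// Widening float32 to double is exact, and the encodable value sets coincide,
// so the float32 case can reuse the double routine.
int encodeVfpImm8ForFloat32(float value) {
    return encodeVfpImm8(double(value));
}
```

Note that 0.0 is not encodable in this scheme, which is consistent with the patch zeroing a register by loading 1.0 and subtracting the register from itself.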

Attachment #796708 - Flags: review?(mrosenberg) → review+

Marty Rosenberg [:mjrosenb]

Updated

•

11 years ago

Attachment #796710 - Flags: review?(mrosenberg) → review+

Jon Coppeard (:jonco)

Assignee

Comment 4

•

11 years ago

(In reply to Marty Rosenberg [:mjrosenb] from comment #3)

Thanks for the comments.

> > +void
> > +MacroAssemblerARM::convertInt32ToFloat32(const Register &src, const FloatRegister &dest_) {
> > +    // direct conversions aren't possible.
>
> I assume that "direct conversions" means doing an int32 -> float32 conversion
> and doing the gpr -> vfp transfer in one instruction?

Yes, it seems you need to transfer the integer value to the vfp register first, and then convert it.

> > +MacroAssemblerARM::convertInt32ToFloat32(const Address &src, FloatRegister dest) {
> > +    ma_ldr(Operand(src), ScratchRegister);
>
> You should be able to load the int32 directly into the dest register, then
> do the int32 -> float32 conversion in that register. You should probably
> use the scratch, since I've heard that immediately overwriting a register is
> bad for perf. Since the vfp's offsets are more limited, this method may not
> save any instructions, but it should at the very least save a
> synchronization point. Now that I think about it, this can almost certainly
> be applied to convertInt32ToFloat64 (or whatever it is actually called)

Good idea, done.

> @@ +1323,5 @@
> > +{
> > +    as_vadd(VFPRegister(dst).singleOverlay(), VFPRegister(src1).singleOverlay(),
> > +            VFPRegister(src2).singleOverlay());
> > +}
> > +
>
> *idly wonders if we can have these functions take VFPRegisters*

Sounds good, I might look at that as a followup bug.

> > +            // To zero a register, load 1.0, then execute dN <- dN - dN
>
> minor nit: those should be sN

Fixed.

> @@ +1443,5 @@
> > +            as_vsub(vd, vd, vd, cc);
> > +            return;
> > +        }
> > +
> > +        VFPImm floatEnc = VFPImm::forFloat32(spun);
>
> Fwiw, the set of encodings for float32 and float64 is identical, so you
> should be able to just cast the float value to a double, and call the
> preexisting routine.

I didn't realise that. I'll remove the separate float32 encoder.
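
The two conversion paths discussed above can be modelled roughly as below. This is a toy simulation, not the patch: `SReg` and the helper functions are hypothetical stand-ins for a single-precision VFP register and for instructions along the lines of `vmov`, `vldr` and `vcvt.f32.s32`. It only shows that the integer bits must first land in a VFP register and are converted there in a second step.

```cpp
#include <cstdint>
#include <cstring>

// Toy model of a single-precision VFP register as raw 32 bits; not SpiderMonkey code.
struct SReg { uint32_t bits = 0; };

// Models a raw bit transfer from a core register (like vmov sN, rM): no conversion.
void moveGprToVfp(SReg& dst, uint32_t gpr) { dst.bits = gpr; }

// Models a load straight from memory into a VFP register (like vldr): no conversion.
void loadToVfp(SReg& dst, const int32_t* addr) {
    std::memcpy(&dst.bits, addr, sizeof dst.bits);
}

// Models the in-register int32 -> float32 conversion (like vcvt.f32.s32).
void convertS32ToF32(SReg& dst, const SReg& src) {
    int32_t i;
    std::memcpy(&i, &src.bits, sizeof i);
    float f = static_cast<float>(i);
    std::memcpy(&dst.bits, &f, sizeof f);
}

float readFloat(const SReg& reg) {
    float f;
    std::memcpy(&f, &reg.bits, sizeof f);
    return f;
}

// Register source: transfer the integer bits to the VFP side, then convert.
float convertFromRegister(int32_t src) {
    SReg dest;
    moveGprToVfp(dest, static_cast<uint32_t>(src));
    convertS32ToF32(dest, dest);
    return readFloat(dest);
}

// Memory source: load into a scratch VFP register without going through a
// core register, then convert into the destination.
float convertFromMemory(const int32_t* addr) {
    SReg scratch, dest;
    loadToVfp(scratch, addr);
    convertS32ToF32(dest, scratch);
    return readFloat(dest);
}
```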

Jon Coppeard (:jonco)

Assignee

Comment 5

•

11 years ago

Attached patch float32-ARM-1 — Details — Splinter Review

Updated patch following review comments.

Attachment #796708 - Attachment is obsolete: true

Attachment #800658 - Flags: review+

Jon Coppeard (:jonco)

Assignee

Comment 6

•

11 years ago

Attached patch float32-ARM-2 — Details — Splinter Review

Updated patch following review comments.

Attachment #796710 - Attachment is obsolete: true

Attachment #800659 - Flags: review+

Douglas Crosher [:dougc]

Reporter

Comment 7

•

11 years ago

Attached patch Combined rebased patch. (obsolete) — Details — Splinter Review

An attempt to rebase these patches. Tested in combination with the patches for bug 915495, the ARM backend passes the standard jit tests (I shall try --tbpl next).

Douglas Crosher [:dougc]

Reporter

Comment 8

•

11 years ago

Attached patch Rebased combined patch — Details — Splinter Review

Attachment #804894 - Attachment is obsolete: true

Jon Coppeard (:jonco)

Assignee

Comment 9

•

11 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/e38bff7fe9c0

Ryan VanderMeulen [:RyanVM]

Comment 10

•

11 years ago

https://hg.mozilla.org/mozilla-central/rev/e38bff7fe9c0

Status: NEW → RESOLVED

Closed: 11 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla27

Douglas Crosher [:dougc]

Reporter

Updated

•

11 years ago

Depends on: 918206

Douglas Crosher [:dougc]

Reporter

Updated

•

11 years ago

Whiteboard: [games]

Marco Mucci [:MarcoM]

Updated

•

11 years ago

Blocks: gecko-games


Categories

(Core :: JavaScript Engine, defect)
