Closed Bug 513514 Opened 15 years ago Closed 15 years ago

nanojit: make hint() faster

Tracking

(Not tracked)

Status:

RESOLVED FIXED

Milestone:

Future

People

(Reporter: gal, Assigned: n.nethercote)

Details

(Whiteboard: PACMAN, fixed-in-nanojit, fixed-in-tracemonkey, fixed-in-tamarin)

Attachments

(4 files, 3 obsolete files)

- instrumentation patch (15 years ago, Nicholas Nethercote [inactive], 2.25 KB, patch)
- better patch (15 years ago, Nicholas Nethercote [inactive], 1.98 KB, patch)
- only hint when use and def are nearby (15 years ago, Andreas Gal :gal, 6.84 KB, patch)
- mildy hackish patch (15 years ago, Andreas Gal :gal, 6.84 KB, patch)
- and the right patch this time (15 years ago, Andreas Gal :gal, 8.31 KB, patch)
- Results on TR x86 with hinting disabled showed no changes outside "noise" (15 years ago, Edwin Smith, 34.36 KB, text/plain)
- patch implementing table-based hinting (15 years ago, Nicholas Nethercote [inactive], 17.04 KB, patch, edwsmith: review+)

Andreas Gal :gal

Reporter

Description

15 years ago

hint() shows up in some profiles, clocking in at about 1% of total time for aes. The long series of if statements is painful, especially the isCmp part. We probably want to put this in some kind of table, since most branches depend only on op.
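The table idea can be sketched roughly as follows. This is a minimal illustration, not nanojit's actual code: the opcode subset, the register bit values, and the names `kPreferTable` and `hintFromTable` are all hypothetical. The preferred sets for `int`, `icall` and `eq` follow the 'prefer' values reported in comment 5 of this bug, and the fallback to 'allow' follows the behaviour described in comment 2.

```cpp
#include <cstdint>

typedef uint32_t RegisterMask;

// Hypothetical register bits, loosely modelled on the x86 names in this bug.
constexpr RegisterMask EAX = 1u << 0, ECX = 1u << 1, EDX = 1u << 2,
                       EBX = 1u << 3, ESI = 1u << 4, EDI = 1u << 5;
constexpr RegisterMask ALL_GP = EAX | ECX | EDX | EBX | ESI | EDI;

// Hypothetical opcode subset; the real opcodes come from LIRopcode.tbl.
enum Opcode { OP_int, OP_icall, OP_eq, OP_add, OP_COUNT };

// One preferred-register mask per opcode, indexed by opcode value.
// ALL_GP means "no preference for this opcode".
static const RegisterMask kPreferTable[OP_COUNT] = {
    /* OP_int   */ EAX | ECX | EDX,
    /* OP_icall */ EAX,
    /* OP_eq    */ EAX | ECX | EDX | EBX,
    /* OP_add   */ ALL_GP,
};

// Replaces an if-then-else chain with a single table lookup.
// If no preferred register is currently free, fall back to 'allow'.
RegisterMask hintFromTable(Opcode op, RegisterMask allow, RegisterMask freeRegs) {
    RegisterMask prefer = allow & kPreferTable[op];
    return (prefer & freeRegs) ? prefer : allow;
}
```

A table like this only covers hints that depend on the opcode alone. Cases that depend on other values would still need separate handling outside the table, and each back-end would need its own table.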

Nicholas Nethercote [inactive]

Assignee

Comment 1

15 years ago

The if-then-else chains are painful, as you say, but the actions for each alternative are different -- usually an assignment or masking, but sometimes (eg. the LIR_param case) depending on other values. They also vary across back-ends so doing it via the usual LIRopcode.tbl mechanism wouldn't work. There's also the issue that some of the back-ends mask the result with regs.free in hint() but not all of them do. That's easier to make consistent, though.

Nicholas Nethercote [inactive]

Assignee

Comment 2

15 years ago

Here's another suggestion: get rid of hint() altogether. I turned it off and it barely made a difference; it may have even caused a 1--2ms speedup.

Why? On x86, at least, it's because hint() almost never makes any useful contribution. There are three cases:

- 'prefer' ends up the same as 'allow'. This can happen because the 'allow' set passed to findRegFor() often already has been chosen carefully and hint() is effectively repeating that choosing. Eg. for calls we do prepResultReg(ins, rmask(EAX)) and then hint() sees that it's a LIR_call and so suggests rmask(EAX). Alternatively, the opcode is one that hint() doesn't treat specially. Either way, I call this the "I second your fine decision!" case.

- In the remaining cases, 'prefer' usually ends up having no overlap with 'free', so we just return 'allow' anyway. I call this the "I give up!" case.

- In the remaining cases, 'prefer' overlaps with 'free' and is more specific than 'allow'. I call this the "I have something useful to add!" case.

For 3d-raytrace (for which bug 516042 tells us hint() accounts for 0.9% of run-time) here are the proportions of the above cases:

- "I second your fine decision!" 65%
- "I give up!" 33%
- "I have something useful to add!" 2%
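The three cases can be expressed as a small classifier over the register masks. This is only a sketch of the kind of instrumentation that could produce such counts; the enum, the function name `classifyHint`, and the assumption that 'prefer' is always a subset of 'allow' are mine, not taken from the instrumentation patch.

```cpp
#include <cstdint>

typedef uint32_t RegisterMask;

// Hypothetical labels for the three outcomes described in this comment.
enum HintOutcome {
    SecondsDecision,  // "I second your fine decision!"
    GivesUp,          // "I give up!"
    AddsSomething     // "I have something useful to add!"
};

// Assumes 'prefer' is a subset of 'allow' (the hint only narrows the set).
HintOutcome classifyHint(RegisterMask allow, RegisterMask prefer,
                         RegisterMask freeRegs) {
    if (prefer == allow)
        return SecondsDecision;   // hint added nothing to 'allow'
    if ((prefer & freeRegs) == 0)
        return GivesUp;           // no preferred register is free
    return AddsSomething;         // narrower than 'allow' and usable
}
```

Counting each outcome per call would give proportions of the kind listed above.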

Andreas Gal :gal

Reporter

Comment 3

15 years ago

Great statistics. Thanks. If you focus on the "I have something useful to add" part, is there any specific op that we contribute something useful for? Maybe there are one or two cases where we do something useful and we can drop the rest (or we just drop them all if there are no easy, cheap hints we might want to keep).

Nicholas Nethercote [inactive]

Assignee

Comment 4

15 years ago

Looking some more, the main cause of the "I second your fine decision!" case is findSpecificRegFor() -- ie. when we've already narrowed 'allow' down to a single register. There's no point trying to apply hints in that case. Nb: it's interesting to see that in the NativeX64 backend hint() is a no-op.

Nicholas Nethercote [inactive]

Assignee

Comment 5

15 years ago

Here's the stats for 3d-raytrace. On each line is:

- the number of occurrences for this pattern
- the opcode
- the value of 'allow' at the end of hint()
- the value of 'prefer' at the end of hint()

1102 eq     allow(eax ecx edx ebx esi edi) prefer(eax ecx edx ebx)
 346 int    allow(eax edx ebx esi edi) prefer(eax edx)
 236 icall  allow(eax edx ebx esi edi) prefer(eax)
 216 icall  allow(eax ecx edx ebx esi edi) prefer(eax)
 114 iparam allow(eax ecx edx ebx esi edi) prefer(eax)
  87 iparam allow(eax ecx edx esi edi) prefer(eax)
  68 icall  allow(eax ecx edx ebx edi) prefer(eax)
  53 fcall  allow(xmm0 xmm1 xmm2 xmm3 xmm4 xmm5 xmm6 xmm7 f0) prefer(f0)
  26 flt    allow(eax ecx edx ebx esi edi) prefer(eax ecx edx ebx)
  13 int    allow(eax ecx edx esi edi) prefer(eax ecx edx)
  11 fgt    allow(eax ecx edx ebx esi edi) prefer(eax ecx edx ebx)
   3 flt    allow(eax ecx edx ebx esi) prefer(eax ecx edx ebx)
   3 flt    allow(ecx edx ebx esi edi) prefer(ecx edx ebx)
   1 int    allow(eax ecx edx ebx esi edi) prefer(eax ecx edx)

In other words, it's spread across all the hint() cases. I guess these cases mostly correspond to times when findRegFor() is called not on the instruction we're looking at, but one of its operands, and so we haven't already narrowed 'allow' down sensibly?

Andreas Gal :gal

Reporter

Comment 6

15 years ago

Can we narrow down when hint is called? findRegForOperand()?

Nicholas Nethercote [inactive]

Assignee

Comment 7

15 years ago

We already have findSpecificRegFor(), but it just calls findRegFor(). If we clone findRegFor() and specialise for the single-register-allowed case we can skip the hint calls. We can also then specialise registerAlloc() for the single-register-allowed case. But I just tried all that and didn't get a noticeable speedup. Perhaps not surprising -- we're looking at minor speedups to functions that account for less than 1% of SS time.
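For illustration, skipping the hint in the single-register case might look something like the sketch below. The helper names `isSingleRegister` and `candidateRegs` and the callback signature are hypothetical and do not reflect the real findRegFor() interface; the point is only that a one-bit 'allow' mask cannot be narrowed further, so the hint call can be bypassed.

```cpp
#include <cstdint>

typedef uint32_t RegisterMask;

// True when exactly one bit is set, i.e. only one register is allowed.
inline bool isSingleRegister(RegisterMask allow) {
    return allow != 0 && (allow & (allow - 1)) == 0;
}

// Hypothetical wrapper: 'hintFn' stands in for a back-end's hint routine.
RegisterMask candidateRegs(RegisterMask allow, RegisterMask freeRegs,
                           RegisterMask (*hintFn)(RegisterMask, RegisterMask)) {
    if (isSingleRegister(allow))
        return allow;                 // nothing to narrow; skip the hint
    return hintFn(allow, freeRegs);   // general case: let the hint narrow it
}
```

As the comment above reports, a specialisation along these lines did not produce a noticeable speedup in practice.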