Closed Bug 589528 Opened 14 years ago Closed 9 months ago

many misaligned 32-bit loads from jitted regexps

Status:

RESOLVED INCOMPLETE

People

(Reporter: luke, Unassigned)

Details

Attachments

(1 file)

Annotated asm found during debug. (14 years ago, Chris Leary [:cdleary] (not checking bugmail), 3.07 KB, text/plain)

Luke Wagner [:luke]

Reporter

Description

14 years ago

With njn's misalignment patch applied (bug 476122), Valgrind reports ~2 million misaligned 32-bit loads from jitted regexp code when running regexp-dna.js.

For the following microbenchmark:

s = "abcdabcdabcd";
for (var i = 0; i < 10000; ++i) {
    /cdab/.test(s);
}

Valgrind reports 10000 misaligned 32-bit loads.

Chris Leary [:cdleary] (not checking bugmail)

Comment 1

14 years ago

Will investigate. Have we seen what kind of wins we get from any other misaligned load bugs so I know where to prioritize?

Assignee: general → cdleary

Status: NEW → ASSIGNED

David Mandelin [:dmandelin]

Comment 2

14 years ago

(In reply to comment #1)
> Will investigate. Have we seen what kind of wins we get from any other
> misaligned load bugs so I know where to prioritize?

So far, the wins have mostly come from misaligned doubles. It should be possible to find out whether this matters by making a quick-and-dirty tweak that reduces the unaligned loads in the microbenchmark and seeing whether that has any effect.

Luke Wagner [:luke]

Reporter

Comment 3

14 years ago

(In reply to comment #1)
In bug 589526 comment 0, I seemed to get a 1-2ms speedup (on a way-old 1.8GHz laptop) from a hack to remove around 200K misaligned double-loads/stores.  YMMV.

Julian Seward [:jseward]

Comment 4

14 years ago

(In reply to comment #1)
I think it's worth investigating, although I suspect it might end
up feeling like a trip into the microarchitectural Twilight Zone.

One thing to bear in mind is that there may be a (big?) cost difference
between misaligned accesses that straddle a D1 or L2 line and those
that don't.  In the former case the processor has to fish out both
cache lines and glue the result together, which sounds slow.  See
(e.g.) the 2nd para of the "Introduction" of
http://software.intel.com/en-us/articles/reducing-the-impact-of-misaligned-memory-accesses

Chris Leary [:cdleary] (not checking bugmail)

Comment 5

14 years ago

(In reply to comment #4)
> misaligned accesses that straddle a D1 or L2 line

Or worse, a page boundary!
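
Editorial sketch: the distinction drawn in comments 4 and 5 can be checked with simple arithmetic. The helper below is illustrative only and not taken from the bug; the name `straddles`, the 64-byte cache line, and the 4096-byte page are all assumptions (real sizes depend on the CPU and OS). It reports whether an access of a given size crosses a multiple of a given boundary.

```javascript
// Illustrative helper (hypothetical, not from the bug): does an access of
// `size` bytes starting at `address` cross a multiple of `boundary`?
function straddles(address, size, boundary) {
  const firstBlock = Math.floor(address / boundary);
  const lastBlock = Math.floor((address + size - 1) / boundary);
  return firstBlock !== lastBlock;
}

const CACHE_LINE = 64; // assumed line size; varies by CPU
const PAGE = 4096;     // assumed page size; varies by OS and configuration

console.log(straddles(62, 4, CACHE_LINE)); // 4-byte load crossing a line
console.log(straddles(50, 4, CACHE_LINE)); // misaligned, but inside one line
console.log(straddles(4094, 4, PAGE));     // 4-byte load crossing a page
```

A misaligned 32-bit load is only split across two lines when it starts in the last three bytes of a line, so under these assumptions most misaligned loads would fall in the cheaper case.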

Chris Leary [:cdleary] (not checking bugmail)

Comment 6

14 years ago

Attached file Annotated asm found during debug. — Details

My debugging session showed only aligned accesses to the string with our malloc.

The assembled regexp program always fails to match on the first two (coalesced and aligned) characters of the string in my debugging session, but Valgrind is showing the error at an address of 0x7d254f2.

It says, "Address 0x7d254f2 is 2 bytes inside a block of size 26 alloc'd" -- I'm guessing that means the access is at _an offset of two bytes_ within a block sized 26 bytes? If so, I can't repro that behavior under debug at the moment. Will ponder a bit.

Julian Seward [:jseward]

Comment 7

14 years ago

> It says, "Address 0x7d254f2 is 2 bytes inside a block of size 26 alloc'd" --
> I'm guessing that means the access is at _an offset of two bytes_ within a
> block sized 26 bytes?

Yes.

> If so, I can't repro that behavior under debug ATM. Will
> ponder a bit.

Rerun with --db-attach=yes.  This lets you optionally attach GDB
to the process at any error Valgrind reports, so you can look at the
registers exactly at the point where the alleged misalignment
occurred.  (Be warned: --db-attach only works on Linux.)

Chris Leary [:cdleary] (not checking bugmail)

Comment 8

14 years ago

Never mind, this makes sense to me now. The increment after the first char test is only one char, so our misaligned dword load starts at the "b" char, two bytes in, as Valgrind is reporting. I'm now thinking about how to get a dword-sized increment in the most general case we can.

Thanks for the tip Julian! Will definitely try that out.
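
Editorial sketch: the explanation in comment 8 can be modelled in a few lines of JavaScript. This is a simplified, hypothetical model and not the actual JIT output. It assumes 2-byte characters, a string buffer that starts on a 4-byte boundary (consistent with the reported address being 2 bytes into its block), an even-length literal pattern compared two characters per 32-bit load, and a start position that advances by one character after a failed attempt. The function name `countMisalignedLoads` is made up for illustration.

```javascript
// Hypothetical model of the matcher described in comment 8 (assumptions:
// 2-byte chars, 4-byte-aligned buffer, even-length literal pattern,
// two chars compared per 32-bit load, start advances by one char on failure).
function countMisalignedLoads(text, pattern) {
  const CHAR_SIZE = 2; // bytes per character (assumed)
  const LOAD_SIZE = 4; // bytes per dword load
  let misaligned = 0;
  for (let start = 0; start + pattern.length <= text.length; start++) {
    let matched = true;
    for (let k = 0; k + 1 < pattern.length; k += 2) {
      const byteOffset = (start + k) * CHAR_SIZE;
      if (byteOffset % LOAD_SIZE !== 0) misaligned++; // load not dword-aligned
      const pairMatches =
        text[start + k] === pattern[k] &&
        text[start + k + 1] === pattern[k + 1];
      if (!pairMatches) {
        matched = false;
        break; // give up on this start position after the first failing pair
      }
    }
    if (matched) return misaligned; // stop at the first match, like test()
  }
  return misaligned;
}

console.log(countMisalignedLoads("abcdabcdabcd", "cdab"));
```

Under this model the attempt at character 0 loads "ab" from byte offset 0 (aligned), the attempt at character 1 loads "bc" from byte offset 2 (misaligned), and the attempt at character 2 matches using loads at offsets 4 and 8 (both aligned). That gives one misaligned load per call, which lines up with the 10000 reports for 10000 iterations in the description.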

Till Schneidereit [:till]

Comment 9

11 years ago

Mass-reassigning cdleary's bugs to default. He won't work on any of them anymore, I guess.

@cdleary: shout if you take issue with this.

Assignee: cdleary → general

Status: ASSIGNED → NEW

Nobody; OK to take it and work on it

Assignee

Updated

10 years ago

Assignee: general → nobody

BMO Automation

Updated

2 years ago

Severity: normal → S3

Matthew Gaudet (he/him) [:mgaudet]

Updated

9 months ago

Status: NEW → RESOLVED

Closed: 9 months ago

Resolution: --- → INCOMPLETE
