Closed Bug 720695 Opened 14 years ago Closed 14 years ago

Lexicographic sort on integers broken on array of integers that includes -0x80000000

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla12

People

(Reporter: Ms2ger, Assigned: luke)

References

Details

(Keywords: regression, testcase)

Attachments

(2 files)

Fixes sorting when integer digits equals 10 14 years ago Santiago Gimeno 13.84 KB, patch		Details \| Diff \| Splinter Review
fix and test 14 years ago Luke Wagner [:luke] 4.10 KB, patch	santiago.gimeno : review+ Waldo : review+	Details \| Diff \| Splinter Review

:Ms2ger (he/him; ⌚ UTC+1/+2)

Reporter

Description

•

14 years ago

Test case: [-0x80000000, -3].sort() Expected: [-2147483648,-3] Got: [-3,-2147483648]

:Ms2ger (he/him; ⌚ UTC+1/+2)

Reporter

Updated

•

14 years ago

Keywords: regression, testcase

Santiago Gimeno

Comment 1

•

14 years ago

(In reply to Ms2ger from comment #0) > Test case: > > [-0x80000000, -3].sort() > > Expected: [-2147483648,-3] > Got: [-3,-2147483648] The problem happens when one of the integers being has 10 digits, and the other when multiplied for a power of 10 to perform the comparison overflows. For example to compare these 2 numbers, we were multiplying 3*10e9 > INT32_MAX. For example it also fails with: [2147483647, 3] I'll try to look into it

Santiago Gimeno

Comment 2

•

14 years ago

Attached patch Fixes sorting when integer digits equals 10 — Details — Splinter Review

Attachment #591141 - Flags: review?(luke)

Luke Wagner [:luke]

Assignee

Comment 3

•

14 years ago

Santiago: could you create a patch that applies to mozilla-inbound tip?

Luke Wagner [:luke]

Assignee

Comment 4

•

14 years ago

Attached patch fix and test — Details — Splinter Review

I ended up thinking about this a bit. The cause of comment 0 is actually that special-casing INT32_MIN is wrong for lexicographic ordering since, even if a = INT32_MIN, if b = -1, a < b. What is great is that if we just lift the comparison to uint32_t, no special-casing is necessary at all (b/c of the commented algebraic identity)! But also comment 1 is right in that there is a *separate* bug that the multiplication can overflow. I think the simplest fix is to just carry out the multiplication and comparison in the uint64_t domain. I was also remiss as a reviewer in not asking for a good test of CompareLexicographicInt32, so I attached one here (that would have found both the above bugs). I was doubly remiss by not comparing our implementation with v8s: http://code.google.com/p/v8/source/browse/trunk/src/runtime.cc#7191. They found the same INT32_MIN trick but it would seem they chose to go with a more-complicated/slower strategy involving integer division to deal with overflow. Perhaps there is a case I'm missing that makes their strategy necessary? Adding Waldo as a second reviewer just to be safe.

Assignee: general → luke

Status: NEW → ASSIGNED

Attachment #591215 - Flags: review?(santiago.gimeno)

Attachment #591215 - Flags: review?(jwalden+bmo)

Luke Wagner [:luke]

Assignee

Updated

•

14 years ago

Attachment #591141 - Flags: review?(luke)

Santiago Gimeno

Comment 5

•

14 years ago

Comment on attachment 591215 [details] [diff] [review] fix and test Review of attachment 591215 [details] [diff] [review]: ----------------------------------------------------------------- Hi, I like your implementation better than mine :D. I have one question though (sorry if it does not make any sense): Is there any penalty on having to cast to uint64_t for every comparison compared to having two different paths one for "could-overflow" comparisons(uint64_t) and other for "normal" comparisons(uint32_t)?

Santiago Gimeno

Comment 6

•

14 years ago

(In reply to Luke Wagner [:luke] from comment #3) > Santiago: could you create a patch that applies to mozilla-inbound tip? Sorry about it. I'm newbie on this. What exactly is mozilla-inbound? Is there any documentation I can refer to?

:Ms2ger (he/him; ⌚ UTC+1/+2)

Reporter

Comment 7

•

14 years ago

https://wiki.mozilla.org/Tree_Rules/Inbound

Luke Wagner [:luke]

Assignee

Comment 8

•

14 years ago

(In reply to Santiago Gimeno from comment #5) > I have one question though (sorry if it does not make any sense): Is there > any penalty on having to cast to uint64_t for every comparison compared to > having two different paths one for "could-overflow" comparisons(uint64_t) > and other for "normal" comparisons(uint32_t)? Good question. I compared before/after this patch for a int-sorting microbench and found practically no difference. Thus, I think we are free to do the simpler thing.

Santiago Gimeno

Updated

•

14 years ago

Attachment #591215 - Flags: review?(santiago.gimeno) → review+

Jeff Walden [:Waldo]

Comment 9

•

14 years ago

Comment on attachment 591215 [details] [diff] [review] fix and test Review of attachment 591215 [details] [diff] [review]: ----------------------------------------------------------------- r=me with the requested changes. ::: js/src/jit-test/tests/basic/testBug720695.js @@ +1,2 @@ > +var v = [ -0x80000003, -0x80000002, -0x80000001, -0x80000000, -0x7fffffff, -0x7ffffffe, -0x7ffffffd, > + -9, -8, -7, -6, -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, Add a few other numbers that aren't single-digit magnitudes for sanity, something like this to catch last-digit differences, extra-digit differences, and so on: 10, 11, 20, 21, 100, 101, 110, 111, 500 Add both negatives and positives for those while you're at it. ::: js/src/jsarray.cpp @@ +2046,5 @@ > + buint = bint; > + } else { > + /* uint32_t(-INT32_MIN) = uint32_t(INT32_MIN) = uint32_t(-int64_t(INT32_MIN)) */ > + auint = (uint32_t) -aint; > + buint = (uint32_t) -bint; -INT32_MIN has undefined behavior, 2**31 being greater than INT32_MAX. Whee, fun. Do this instead (with a comment if you can think of one that gives a good, pithy explanation of why - isn't used): auint = ~uint32_t(aint) + 1; buint = ~uint32_t(bint) + 1; @@ +2056,5 @@ > * If digits_a > digits_b: a < b*10e(digits_a - digits_b). > * If digits_b > digits_a: a*10e(digits_b - digits_a) <= b. > */ > + int digitsa = NumDigitsBase10(auint); > + int digitsb = NumDigitsBase10(buint); Shurely unsigned if you're changing the type the method returns...

Attachment #591215 - Flags: review?(jwalden+bmo) → review+

Luke Wagner [:luke]

Assignee

Comment 10

•

14 years ago

(In reply to Jeff Walden (remove +bmo to email) from comment #9) > Shurely unsigned if you're changing the type the method returns... You're right, and don't call me Shurley.

Luke Wagner [:luke]

Assignee

Comment 11

•

14 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/40b1a7a5e630

Target Milestone: --- → mozilla12

Joe Drew (not getting mail)

Comment 12

•

14 years ago

https://hg.mozilla.org/mozilla-central/rev/40b1a7a5e630

Status: ASSIGNED → RESOLVED

Closed: 14 years ago

Resolution: --- → FIXED

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Lexicographic sort on integers broken on array of integers that includes -0x80000000

Categories

(Core :: JavaScript Engine, defect)

Tracking

()

People

(Reporter: Ms2ger, Assigned: luke)

References

Details

(Keywords: regression, testcase)

Crash Data

Security

(public)

User Story

Attachments

(2 files)

Description

Updated

Comment 1

Comment 2

Comment 3

Comment 4

Updated

Comment 5

Comment 6

Comment 7

Comment 8

Updated

Comment 9

Comment 10

Comment 11

Comment 12

Attachment

General

Description

File Name

Content Type