Closed Bug 344778 Opened 18 years ago Closed 18 years ago

Optimize ASCII line breaking

Tracking

()

Status:

RESOLVED FIXED

People

(Reporter: masayuki, Assigned: masayuki)

References

Details

(Keywords: perf)

Attachments

(1 file, 1 obsolete file)

Patch rv1.0 18 years ago Masayuki Nakano [:masayuki] (he/him)(JST, +0900) 2.24 KB, patch	masayuki : review-	Details \| Diff \| Splinter Review
Patch rv2.0 18 years ago Masayuki Nakano [:masayuki] (he/him)(JST, +0900) 4.83 KB, patch		Details \| Diff \| Splinter Review

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Description

•

18 years ago

By bug 255990. The tp value of tinderbox was damaged. We should optimize the line breaking algorithm.

Simple testcase is here:
http://www.d-toybox.com/mozilla/testpages/testcasesforbenchmark.zip

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Updated

•

18 years ago

Status: NEW → ASSIGNED

Summary: Optimze ASCII line breaking → Optimize ASCII line breaking

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 1

•

18 years ago

Attached patch Patch rv1.0 (obsolete) — Details — Splinter Review

Attachment #229354 - Flags: superreview?(roc)

Attachment #229354 - Flags: review?(roc)

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Updated

•

18 years ago

Keywords: perf

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 2

•

18 years ago

I'll post new result of the page loading score with the test tool of mozilla.org.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 3

•

18 years ago

this looks good...

would be interesting to see how many times we end up calling CanBreakBetweenLatin1 on English pages vs the number of times we avoid calling CanBreakBetweenlatin1 (i.e., how often this optimization succeeds).

I'd also like to see the microbenchmark results for English Web page text with this optimization.

Another thing that'd be interesting is to run the microbenchmark on English text of spaces and letters only (so CanBreakBetweenLatin1 is never called), and compare two versions of the code: this patch, and this patch but with chIsSymbol set to just PR_TRUE. That would tell us whether it's worth optimizing those conditionals (which may not be predicted very well by the CPU).

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 4

•

18 years ago

BTW it would be nice if we could automatically test that our optimization isn't changing anything. Perhaps you could add an NS_ASSERTION to check that if the optimization is used, then CanBreakBetweenLatin1 would have returned PR_FALSE.

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 5

•

18 years ago

> Another thing that'd be interesting is to run the microbenchmark on English
> text of spaces and letters only (so CanBreakBetweenLatin1 is never called), and
> compare two versions of the code: this patch, and this patch but with
> chIsSymbol set to just PR_TRUE. That would tell us whether it's worth
> optimizing those conditionals (which may not be predicted very well by the
> CPU).

     patched build  patched but set to TRUE
AVG.       19028.0    19243.9
MED.       18953.0    19188.0

the test case is the English test case of comment 0. the score is 'ms'.

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 6

•

18 years ago

The current patch has many lossy cases. I'll add more checking.

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Updated

•

18 years ago

Attachment #229354 - Flags: superreview?(roc)

Attachment #229354 - Flags: review?(roc)

Attachment #229354 - Flags: review-

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 7

•

18 years ago

Attached patch Patch rv2.0 — Details — Splinter Review

This is using static table for breaking check. This is faster than previous patch.

Bechmark score (simple English)(ms):
       normal  patched
AVG.  19356.3  18890.5
MED.  19461.0  18828.0

Attachment #229354 - Attachment is obsolete: true

Attachment #229645 - Flags: superreview?(roc)

Attachment #229645 - Flags: review?(roc)

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 8

•

18 years ago

The result before bug 255990 was fixed was about 10,000, right?

What exactly does this benchmark do? Can you put the code somewhere I can see it?

Jungshik Shin

Comment 9

•

18 years ago

(In reply to comment #8)
> The result before bug 255990 was fixed was about 10,000, right?

No, it's around 18700. See bug 255990 comment #76. Masayuki reported there that he got 18804 without this optimization. Here, it's now 18953 with this optimization(all in median). Apparently, his numbers given in bug 255990 can't be compared directly with numbers given here. All three cases (with no patch at all, with bug 255990 patch alone and with optimization) need to be compared under identical conditions. Moreover, just giving median/average doesn't tell much. Masayuki, why don't you do t-test (every decent spreadsheet offers an easy way to do that) and give p-number?  

BTW, using our (mozilla.org) test tools (bug 255990 comment #59) is better than relying on his synthetic test cases, I believe. Note that our test web pages are virtually all in English.

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 10

•

18 years ago

(In reply to comment #9)
> Masayuki, why don't you do t-test (every decent spreadsheet offers an
> easy way to do that) and give p-number?  

I cannot understand here. What means 't-test' and 'p-number'?

> BTW, using our (mozilla.org) test tools (bug 255990 comment #59) is better than
> relying on his synthetic test cases, I believe. Note that our test web pages
> are virtually all in English. 

Yeah, of course. I'll test by it ASAP.

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Updated

•

18 years ago

Attachment #229645 - Flags: superreview?(roc)

Attachment #229645 - Flags: review?(roc)

Masayuki Nakano [:masayuki] (he/him)(JST, +0900)

Assignee

Comment 11

•

18 years ago

backed-out the original patch. I'll retry to fix it with UAX#14. If you have some ideas and suggestions, please tell me.

-> FIXED(by backed-out)

Status: ASSIGNED → RESOLVED

Closed: 18 years ago

Resolution: --- → FIXED

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Optimize ASCII line breaking

Categories

(Core :: Layout: Text and Fonts, defect)

Tracking

()

People

(Reporter: masayuki, Assigned: masayuki)

References

Details

(Keywords: perf)

Crash Data

Security

(public)

User Story

Attachments

(1 file, 1 obsolete file)

Description

Updated

Comment 1

Updated

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Updated

Comment 7

Comment 8

Comment 9

Comment 10

Updated

Comment 11

Attachment

General

Description

File Name

Content Type