Closed Bug 98118 Opened 23 years ago Closed 23 years ago

nsJISx4501LineBreaker::Next is worse than O(N^2) in number of elements in line

Tracking

()

Status:

VERIFIED FIXED

People

(Reporter: dbaron, Assigned: smontagu)

References

Details

(Keywords: perf)

Attachments

(2 files, 1 obsolete file)

Something to try 23 years ago Boris Zbarsky [:bzbarsky] 6.84 KB, patch	shanjian : review+	Details \| Diff \| Splinter Review
testcase 23 years ago Shanjian Li 107.31 KB, text/html		Details
Same as first patch but -u10 so the callers are obvious and with fprintfs removed 23 years ago Boris Zbarsky [:bzbarsky] 11.65 KB, patch	roc : superreview+ asa : approval+	Details \| Diff \| Splinter Review

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Description

•

23 years ago

nsJISx4501LineBreaker::Next takes time proportional to worse than the square of the number of elements in a line of text. This is demonstrated by the testcases in bug 97229. It causes reflow to take *extremely* long times when there are large numbers of elements in a line. See bug 97229 for more details (see the information on the Cn testcases). With N elements on a line, it takes the following numbers of seconds for a one-line testcase to display on my machine: N time 8192 13 16384 62 32768 400

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Updated

•

23 years ago

Blocks: 97229

Keywords: perf

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Comment 1

•

23 years ago

jprof shows that over 80% of the time loading the 32768-element testcase was spentin nsJISx4501LineBreaker::Next itself. About 6% was spent in the memmove called from nsVoidArray::RemoveElementsAt called from nsLineLayout::ForgetWordFrame. The testcase can be constructed by doing the following: 1st line: "<pre>" 2d line: "01" ... "f" repeated so as to give the correct number of elements. 3d line: "</pre>"

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Comment 2

•

23 years ago

To clarify, I haven't yet investigated how the overall rate of increase in time spent in this function is split between the number of times the function is called and the amount of time a single call takes.

Roy Yokoyama

Comment 3

•

23 years ago

assigning to bstell: and cc'ing self

Assignee: yokoyama → bstell

Frank Tang

Comment 4

•

23 years ago

dave: please work with shanjian. Shanjian- can you take a look at the performance there? I don't believe the Next routines will spend a lot of time....

Assignee: bstell → shanjian

Shanjian Li

Comment 5

•

23 years ago

I made 3 testcase, each with 330, 3300 and 33000 elements. The approximate number of call to nsJISx4501LineBreaker::Next is: 330 -> 329, -> 1 call per element 3300 -> 9464 -> 3 calls per element 33000 -> 895043 -> 27 calls per element The performance of a single call of nsJISx4501LineBreaker::Next is O(n), but as the number of call increase exponentially, the mentioned behavior is within explaination.

Status: NEW → ASSIGNED

Shanjian Li

Comment 6

•

23 years ago

Now the problem is clear. In the testcase proposed by dbaron, there are a lot of elements, each has a frame. When it's time to decide line break, frame is added one by one and sent to nsJISx4501LineBreaker::Next. Since all those element are in a single word, no break can be found and this procedure repeat again and again, with the text buffer sent to nsJISx4501LineBreaker::Next increasing as well. I believe this is a very special scenario. Normally it is breakable between frame and frame, and only in very few situation several frames need to be treated as one unbreakable word. IMO we probably don't need to do anything here.

Status: ASSIGNED → RESOLVED

Closed: 23 years ago

Resolution: --- → WONTFIX

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Comment 7

•

23 years ago

Wontfix isn't appropriate here -- we should fix this. It may or may not be your bug, but it should be fixed -- we shouldn't have quadratic growth on input size.

Status: RESOLVED → REOPENED

Resolution: WONTFIX → ---

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Comment 8

•

23 years ago

(The fix may be as simple as not calling the linebreaker at all for preformatted output.)

Shanjian Li

Comment 9

•

23 years ago

<pre> is not necessary to reproduce this problem. I still don't think this is something we should sweat on. But if people want to improve this block of code, that is always welcome. So let me reassign this bug to dbaron.

Assignee: shanjian → dbaron

Status: REOPENED → NEW

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Updated

•

23 years ago

Status: NEW → ASSIGNED

Priority: -- → P2

Target Milestone: --- → mozilla0.9.7

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Updated

•

23 years ago

Target Milestone: mozilla0.9.7 → mozilla0.9.8

Boris Zbarsky [:bzbarsky]

Comment 10

•

23 years ago

*** Bug 114420 has been marked as a duplicate of this bug. ***

Boris Zbarsky [:bzbarsky]

Comment 11

•

23 years ago

http://www.toronto.com is a real-world page; when we call view-source on it it takes forever and 92% of that time is in (not under) nsJISx4501LineBreaker::Next Of course there are no newline characters in the source, so this is to be expected...

Markus Hübner

Updated

•

23 years ago

Blocks: 114584

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Updated

•

23 years ago

Target Milestone: mozilla0.9.8 → mozilla0.9.9

Boris Zbarsky [:bzbarsky]

Comment 12

•

23 years ago

*** Bug 122810 has been marked as a duplicate of this bug. ***

Darin Fisher

Comment 13

•

23 years ago

*** Bug 111784 has been marked as a duplicate of this bug. ***

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Updated

•

23 years ago

Target Milestone: mozilla0.9.9 → Future

Simon Montagu :smontagu

Assignee

Comment 14

•

23 years ago

dbaron, are you planning to work on this? If not, please assign it to me.

Keywords: nsbeta1

David Baron :dbaron: (⌚️UTC-4, no longer working on Mozilla)

Reporter

Comment 15

•

23 years ago

->smontagu, per comment 14. (But see comment 8, although I'm not sure what's easier.)

Assignee: dbaron → smontagu

Status: ASSIGNED → NEW

Target Milestone: Future → ---

Andreas Becker

Updated

•

23 years ago

QA Contact: andreasb → ylong

Boris Zbarsky [:bzbarsky]

Updated

•

23 years ago

Blocks: 87787

Boris Zbarsky [:bzbarsky]

Comment 16

•

23 years ago

Attached patch Something to try (obsolete) — Details — Splinter Review

So... this patch tries to not call the linebreaker when aTextData.mIsBreakable is false in MeasureText(). Two things that could be wrong with the patch: 1) I may misunderstand how the whole thing works.... I assumed that nsJISx4501LineBreaker::Next can't return anything useful if the line is not breakable 2) I'm not sure whether the if((*aStop) && (wordLen == 0)) return dimensions; // 0; lines in ComputeWordFragmentDimensions() should always be executed or only executed when the text is breakable.... Are there any wrapping regression tests this could be tested on? I'm not quite sure under what circumstances the block of code I'm skipping will actually produce usable output. I've been browsing with this patch for a bit with no ill effects.

Boris Zbarsky [:bzbarsky]

Comment 17

•

23 years ago

I just tested that patch on the testcase in bug 104517 (attachment 71611 [details]) and the patched and unpatched builds act exactly alike (and yes, there _are_ places in that testcase where the changes I made change whether code runs or not). Shanjian? Frank? What do you think?

Keywords: review

Shanjian Li

Comment 18

•

23 years ago

Attached file testcase — Details

Simon Montagu :smontagu

Assignee

Updated

•

23 years ago

Attachment #71698 - Attachment mime type: text/plain → text/html

Shanjian Li

Comment 19

•

23 years ago

Your patch make sense to me. I tried several testcases, and they all looks fine. >>1) I may misunderstand how the whole thing works.... I assumed that >> nsJISx4501LineBreaker::Next can't return anything useful if the line is not >> breakable This seems truth, I could not find a testcase to defeat it. >>2) I'm not sure whether the >> if((*aStop) && (wordLen == 0)) >> return dimensions; // 0; This statement should always be called in either case. There is another thing that I am not absolutely sure. In case "aIsBreakable" is not true, will we ever suffer in performance in certain situation? If there does exists such a case, "aLineBreaker->Next" will be called again and again with expanding buffer. The following patch should fix the problem if it does happen. +#define MAX_LOOK_BACK_FOR_WORDBREAK 80^M if(wordLen > 0)^M {^M memcpy((void*)&(aWordBuf[aRunningWordLen]), bp, sizeof(PRUnichar)*wordLen); ^M ^M PRUint32 breakP=0;^M PRBool needMore=PR_TRUE;^M + PRUint32 startPos = 0;^M + if (aRunningWordLen > MAX_LOOK_BACK_FOR_WORDBREAK)^M + startPos = aRunningWordLen - MAX_LOOK_BACK_FOR_WORDBREAK;^M nsresult lres = aLineBreaker->Next(aWordBuf, aRunningWordLen+wordLen,^M - 0, &breakP, &needMore);^M + startPos, &breakP, &needMore);^M if(NS_SUCCEEDED(lres)) ^M

Boris Zbarsky [:bzbarsky]

Comment 20

•

23 years ago

> There is another thing that I am not absolutely sure. In case "aIsBreakable" is > not true You mean in case it _is_ true but there are no actual places where one can break? How does that situation arise? Would it be triggered by a long run of letter with no spaces? Note that I _really_ don't know this code...

Asa Dotzler [:asa]

Updated

•

23 years ago

Keywords: mozilla1.0+

Shanjian Li

Comment 21

•

23 years ago

>>You mean in case it _is_ true but there are no actual places where one can >>break? Yes. Sorry for my typo. >>How does that situation arise? Would it be triggered by a long run of >>letter with no spaces? I don't know if such situation exist or not. But clearly, a long run of letters with no spaces within the same frame will not trigger such a problem. For the time being, probably your patch is enough. When we do meet such a testcase in future, we can put my additional patch in.

Shanjian Li

Comment 22

•

23 years ago

Comment on attachment 71533 [details] [diff] [review] Something to try This patch improve the performance greatly in my testing. But we are still sufferring in such a situation, so further optimization is needed. Please open another bug when this bug is closed. Future optimization probably will have nothing to do with linebreaker.

Attachment #71533 - Flags: review+

Bernd

Comment 23

•

23 years ago

I runned the layout regression tests with the patch, they passed, only the usual suspects (aka false postives) did appear.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 24

•

23 years ago

I have some questions about the patch.... part of it seems to be missing. Where are the callers to these functions that you have added aIsBreakable to? Plus you have some fprintfs in there that probably shouldn't be there.

Boris Zbarsky [:bzbarsky]

Comment 25

•

23 years ago

Attached patch Same as first patch but -u10 so the callers are obvious and with fprintfs removed — Details — Splinter Review

Attachment #71533 - Attachment is obsolete: true

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 26

•

23 years ago

Comment on attachment 72653 [details] [diff] [review] Same as first patch but -u10 so the callers are obvious and with fprintfs removed sr=roc+moz

Attachment #72653 - Flags: superreview+

Frank Tang

Comment 27

•

23 years ago

nsbeta1+, please work with IQA to test the line breaking test suite

Keywords: nsbeta1 → nsbeta1+

Boris Zbarsky [:bzbarsky]

Comment 28

•

23 years ago

Is the suite available outside the firewall? ylong?

Yuying Long

Comment 29

•

23 years ago

The test cases in comment #18, #12, #13 not helping?

Asa Dotzler [:asa]

Comment 30

•

23 years ago

Comment on attachment 72653 [details] [diff] [review] Same as first patch but -u10 so the callers are obvious and with fprintfs removed a=asa (on behalf of drivers) for checkin to the 1.0 trunk

Attachment #72653 - Flags: approval+

Boris Zbarsky [:bzbarsky]

Comment 31

•

23 years ago

Patch checked in. Leaving bug open per shanjian's comments, since memmove is now the biggest offender in the various testcases.... Simon, Shanjian, should we open a new bug on that?

Shanjian Li

Comment 32

•

23 years ago

Yes, I suggest to open a new bug and close this one. One bug should only focusing on one issue for ease of regression and management control.

Boris Zbarsky [:bzbarsky]

Comment 33

•

23 years ago

Bug 129167 and bug 129166 filed. Marking this one fixed.

Status: NEW → RESOLVED

Closed: 23 years ago → 23 years ago

Resolution: --- → FIXED

Yuying Long

Comment 34

•

23 years ago

Mark as verified for this one cause we have seperate bugs filed. Should the new bugs be nsbeta1? cause "nsbeta1+" for this one.

Status: RESOLVED → VERIFIED

You need to log in before you can comment on or make changes to this bug.