Closed Bug 394751 Opened 17 years ago Closed 17 years ago

Hang with long line of text and page break character

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla1.9beta2

People

(Reporter: martijn.martijn, Assigned: cpearce)

References

Details

(Keywords: hang, regression, testcase)

Attachments

(2 files, 7 obsolete files)

testcase 17 years ago Martijn Wargers (dead) 42.87 KB, application/xhtml+xml		Details
Patch v1 - split oversize items in Uniscribe.Itemize() 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 8.55 KB, patch		Details \| Diff \| Splinter Review
Patch v2 - wrap InitTextRunUniscribe() 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 8.17 KB, patch		Details \| Diff \| Splinter Review
Patch v3 - Patch v1 cleaned up a bit 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 9.76 KB, patch		Details \| Diff \| Splinter Review
Patch v4 - patch_v3++ 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 9.87 KB, patch		Details \| Diff \| Splinter Review
Patch v5 - patch_v4++ 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 8.97 KB, patch		Details \| Diff \| Splinter Review
Patch v6 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 9.20 KB, patch	pavlov : review-	Details \| Diff \| Splinter Review
Patch v7 - With Stuart's suggested changes. 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 9.29 KB, patch		Details \| Diff \| Splinter Review
Patch v8 - v7 with return value fixed. 17 years ago Chris Pearce [:cpearce (Not reading bugmail)] 9.30 KB, patch	pavlov : review+	Details \| Diff \| Splinter Review

Martijn Wargers (dead)

Reporter

Description

•

17 years ago

Attached file testcase — Details

See testcase, which seems to hang in current trunk builds. This seems to have regressed between 2007-06-20 and 2007-06-21, this is when the new textframe was turned on, bug 367177. So I guess it's a regression from that. Note that the testcase has right at the end, just before the </sourcetext> tag, a page-break character. That seems to be necessary for the hang, apart from the long line of text.

Robert O'Callahan (:roc) (email my personal email if necessary)

Updated

•

17 years ago

Depends on: 388367

-fullmetaljacket-

Updated

•

17 years ago

Flags: blocking1.9?

Martijn Wargers (dead)

Reporter

Comment 1

•

17 years ago

The testcase is still hanging, using: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a8pre) Gecko/2007090405 Minefield/3.0a8pre This is after bug 388367 has been fixed.

Robert O'Callahan (:roc) (email my personal email if necessary)

Updated

•

17 years ago

Assignee: nobody → roc

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 2

•

17 years ago

Doesn't hang on Mac, Windows only?

Martijn Wargers (dead)

Reporter

Comment 3

•

17 years ago

I guess so, it's still hanging with current trunk build on windowsXP.

Robert O'Callahan (:roc) (email my personal email if necessary)

Updated

•

17 years ago

Flags: blocking1.9? → blocking1.9+

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 4

•

17 years ago

I can reproduce this on Vista. We're stuck in an infinite loop in gfxWindowsFontGroup::InitTextRunUniscribe(), gfxWindowsFonts.cpp:1594: while (FAILED(item->Shape())) { PR_LOG(gFontLog, PR_LOG_DEBUG, ("shaping failed")); // we know we have the glyphs to display this font already // so Uniscribe just doesn't know how to shape the script. // Render the glyphs without shaping. item->DisableShaping(); } item->Shape() is always failing, and we're stuck in a loop. [TAKING]

Status: NEW → ASSIGNED

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 5

•

17 years ago

We should figure out why that's failing. But I think we should also get rid of this loop here. Instead of looping, we should just disable shaping, then retry item->Shape() once. If the retry fails, we should warn and fill in the textrun with missing glyph boxes or something like that.

Component: Layout → GFX: Thebes

:Gavin Sharp [email: gavin@gavinsharp.com]

Updated

•

17 years ago

QA Contact: layout → thebes

Robert O'Callahan (:roc) (email my personal email if necessary)

Updated

•

17 years ago

Assignee: roc → chris

Status: ASSIGNED → NEW

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 6

•

17 years ago

Ok, the reason that item->Shape() is failing, is that it's calling the Win32 Function ScriptShape(), which is returning E_INVALIDARG. It does this when the value of mMaxGlyphs which is passed into it is greater than 65535. So the solution is to change Itemize() such that it breaks up items which are longer than 65535 characters.

Status: NEW → ASSIGNED

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 7

•

17 years ago

Attached patch Patch v1 - split oversize items in Uniscribe.Itemize() (obsolete) — Details — Splinter Review

Patch changes Itemize() to detect uniscribe items which are too long and which would cause ScriptShape() to fail. It breaks items on cluster boundaries.

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 8

•

17 years ago

> NUM_MAX_GLYPHS ESTIMATE_MAX_GLYPHS? Anyway, I think there's a simpler way to do all this. Look at what gfxAtsuiFonts does. The basic idea would be to make InitTextRunUniscribe able to fill in just part of a textrun, i.e., you pass in an extra aStart parameter that tells you where to start filling in the textrun. Then you write a wrapper InitTextRun method that breaks the string up into chunks that aren't too long, and pass each chunk to InitTextRunUniscribe. gfxAtsuiFonts even has a FindTextRunSegmentLength function that tries to break the string up at whitespace boundaries, because that won't damage kerning or ligatures; you probably want to do the same. I think this approach would be simpler than messing with the itemization. See http://mxr.mozilla.org/seamonkey/source/gfx/thebes/src/gfxAtsuiFonts.cpp#512

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 9

•

17 years ago

Attached patch Patch v2 - wrap InitTextRunUniscribe() (obsolete) — Details — Splinter Review

This patch follows ROC's suggestion...

Attachment #285024 - Attachment is obsolete: true

Stuart Parmenter

Comment 10

•

17 years ago

Comment on attachment 285038 [details] [diff] [review] Patch v2 - wrap InitTextRunUniscribe() I'm not sure I follow your ESTIMATE_MAX_GLYPHS code -- I measured a lot of content to come up with 40 as the right number for the average item length -- your code will result in those being 76 You also have mismatched SaveDC/RestoreDC calls here which could cause all sorts of problems. I don't think you want to move the RestoreDC

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 11

•

17 years ago

Patch v2 has a problem --- the code that finds where to break up the textrun depends on cluster information, which we don't have yet (we do have it by then on Mac where this code was copied from). So we have do things differently. I think in the end patch v1 was closer to the mark, although it can be simplified a bit. Sorry for leading you down a blind alley Chris...

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 12

•

17 years ago

(In reply to comment #10) > (From update of attachment 285038 [details] [diff] [review]) > I'm not sure I follow your ESTIMATE_MAX_GLYPHS code -- I measured a lot of > content to come up with 40 as the right number for the average item length -- > your code will result in those being 76 Thanks Stuart, the ESTIMATE_MAX_GLYPHS is just me wrapping up a lot of (1.5*x)+16 into a macro, so I'm not changing any behaviour there. Remember 3*x/2 = 1.5*x. It's recommended here: http://www.microsoft.com/typography/developers/uniscribe/uniscribe.htm > You also have mismatched SaveDC/RestoreDC calls here which could cause all > sorts of problems. I don't think you want to move the RestoreDC Yikes, you're right, thanks for pointing that out!

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 13

•

17 years ago

Attached patch Patch v3 - Patch v1 cleaned up a bit (obsolete) — Details — Splinter Review

Patch v1 cleaned up a bit.

Attachment #285038 - Attachment is obsolete: true

Robert O'Callahan (:roc) (email my personal email if necessary)

Comment 14

•

17 years ago

+ if (mLength > 43680) { ESTIMATE_MAX_GLYPHS(mLength) > 65535? + // Count number of items that are too long. + int count = TotalOversizeItems(); + if (count > 0) { I wouldn't bother with this check and TotalOversizeItems. Just go straight into the splitting-up loop. Don't need newNumItems either, or the SetCapacity call, it's OK to just add items to 'items' incrementally. CopyItemSplitOversize can be simplified: + if (ESTIMATE_MAX_GLYPHS(itemLength) > 65535) { I wouldn't bother with this. Instead of computing the number of new items up front, do what gfxAtsuiFontGroup does; just repeatedly find the longest string that makes an acceptable item, add an item to aDest for that string, and then advance to the end of that string. Make sense? You don't need to bother with midpoints that way. You should do what FindTextRunSegmentLength does to prefer whitespace boundaries. In fact you can copy FindTextRunSegmentLength here, call it FindItemSegmentLength or something.

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 15

•

17 years ago

Attached patch Patch v4 - patch_v3++ (obsolete) — Details — Splinter Review

Attachment #285168 - Attachment is obsolete: true

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 16

•

17 years ago

Attached patch Patch v5 - patch_v4++ (obsolete) — Details — Splinter Review

Attachment #285194 - Attachment is obsolete: true

Stuart Parmenter

Updated

•

17 years ago

Priority: -- → P3

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Updated

•

17 years ago

Attachment #285196 - Flags: review?(pavlov)

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 17

•

17 years ago

Comment on attachment 285196 [details] [diff] [review] Patch v5 - patch_v4++ Review cancelled... Roc pointed out an issue...

Attachment #285196 - Flags: review?(pavlov)

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 18

•

17 years ago

Attached patch Patch v6 (obsolete) — Details — Splinter Review

Patch made more fail safe.

Attachment #285196 - Attachment is obsolete: true

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Updated

•

17 years ago

Attachment #285788 - Flags: review?(pavlov)

Stuart Parmenter

Updated

•

17 years ago

Target Milestone: --- → mozilla1.9 M10

Stuart Parmenter

Comment 19

•

17 years ago

Comment on attachment 285788 [details] [diff] [review] Patch v6 + +static PRUint32 FindNextItemStart(int aOffset, int aLimit, + nsTArray<SCRIPT_LOGATTR> &aLogAttr, const PRUnichar *aString) nit: please align the params on the second line with the ones above. + // Try to start the next item before or after a space, since spaces + // don't kern or ligate. + PRUint32 off; + for (off = MAX_ITEM_LENGTH; off > 1; --off) { + if (aLogAttr[off].fCharStop && + (aString[aOffset+off] == ' ' + || aString[aOffset+off - 1] == ' ')) + return aOffset+off; + } + + // Try to start the next item at the last cluster boundary in the range. + for (off = MAX_ITEM_LENGTH; off > 1; --off) { + if (aLogAttr[off].fCharStop) + return aOffset+off; + } + you probably want to just cache the offset of the first fCharStop you encounter and get rid of the second loop.

Attachment #285788 - Flags: review?(pavlov) → review-

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 20

•

17 years ago

Attached patch Patch v7 - With Stuart's suggested changes. (obsolete) — Details — Splinter Review

Attachment #285788 - Attachment is obsolete: true

Attachment #286739 - Flags: review?(pavlov)

Chris Pearce [:cpearce (Not reading bugmail)]

Assignee

Comment 21

•

17 years ago

Attached patch Patch v8 - v7 with return value fixed. — Details — Splinter Review

This is patch v7 with the return value of the (boundary > 0) condition corrected.

Attachment #286739 - Attachment is obsolete: true

Attachment #286756 - Flags: review?(pavlov)

Attachment #286739 - Flags: review?(pavlov)

Stuart Parmenter

Updated

•

17 years ago

Attachment #286756 - Flags: review?(pavlov) → review+

Stuart Parmenter

Updated

•

17 years ago

Keywords: checkin-needed

Stuart Parmenter

Comment 22

•

17 years ago

checked in

Status: ASSIGNED → RESOLVED

Closed: 17 years ago

Resolution: --- → FIXED

Reed Loden [:reed]

Updated

•

17 years ago

Keywords: checkin-needed

Jeff Walden [:Waldo]

Updated

•

17 years ago

Flags: in-testsuite?

Martijn Wargers (dead)

Reporter

Comment 23

•

17 years ago

Wonderful, no hang anymore with current trunk build, thanks Chris. Verified fixed, using: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a9pre) Gecko/2007110705 Minefield/3.0a9pre

Status: RESOLVED → VERIFIED

Karl Tomlinson (:karlt back Feb 10)

Comment 24

•

14 years ago

(In reply to comment #12) > ESTIMATE_MAX_GLYPHS is just me wrapping up a lot of > (1.5*x)+16 into a macro, [...]. It's recommended here: > http://www.microsoft.com/typography/developers/uniscribe/uniscribe.htm Looks like that documentation has moved. It seems the formula is recommended for an initial guess for the cMaxGlyphs parameter of Uniscribe's ScriptShape: http://msdn.microsoft.com/en-us/library/dd368564%28v=VS.85%29.aspx "Maximum number of glyphs to generate, and the length of pwOutGlyphs. A reasonable value is (1.5 * cChars + 16), but this value might be insufficient in some circumstances. For more information, see the Remarks section." "If this function returns E_OUTOFMEMORY, the application might call ScriptShape repeatedly, with successively larger output buffers, until a large enough buffer is provided. The number of glyphs generated by a code point varies according to the script and the font. For a simple script, a Unicode code point might generate a single glyph. However, a complex script font might construct characters from components, and thus generate several times as many glyphs as characters. Also, there are special cases, such as invalid character representations, in which extra glyphs are added to represent the invalid sequence. Therefore, a reasonable guess for the size of the buffer indicated by pwOutGlyphs is 1.5 times the length of the character buffer, plus an additional 16 glyphs for rare cases, for example, invalid sequence representation."

Mats Palmgren (inactive)

Comment 25

•

12 years ago

crashtest: https://hg.mozilla.org/integration/mozilla-inbound/rev/b3276034cc07

Flags: in-testsuite? → in-testsuite+

Ryan VanderMeulen [:RyanVM]

Comment 26

•

12 years ago

https://hg.mozilla.org/mozilla-central/rev/b3276034cc07

You need to log in before you can comment on or make changes to this bug.