Closed
Bug 165881
Opened 22 years ago
Closed 16 years ago
Combining diacritics don't work
Categories
(Core :: Graphics, defect)
RESOLVED
WORKSFORME
People
(Reporter: hsivonen, Unassigned)
References
Details
(Keywords: intl)
Attachments
(5 files, 1 obsolete file)
Build ID: 2002083017 (on OS X 10.2)
Reproducible: Always
Steps to reproduce:
1) Load the test case attachment (upcoming)
Actual results: Mozilla displays an 'R' followed by an ogonek.
Expected results: Expected Mozilla to display the 'R' and ogonek combined the way OmniWeb and TextEdit do.
Additional info: Since there is no precomposed character LATIN CAPITAL LETTER R WITH OGONEK, it is appropriate to send this particular combination as non-precomposed. See the CharMod draft: http://www.w3.org/TR/charmod/
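The claim in the description that 'R' plus ogonek has no precomposed form can be checked with a small sketch. It uses Python's standard `unicodedata` module, which is not part of the original report; the helper name `describe` is hypothetical and only for illustration.

```python
import unicodedata

def describe(text):
    """List each code point in text with its Unicode name."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch)}" for ch in text]

# 'R' followed by U+0328 COMBINING OGONEK, as in the test case described above.
r_ogonek = "R\u0328"
# NFC composes a pair only when Unicode defines a precomposed character for it.
r_nfc = unicodedata.normalize("NFC", r_ogonek)

# For comparison: 'A' + ogonek does have a precomposed form (U+0104).
a_nfc = unicodedata.normalize("NFC", "A\u0328")

print(describe(r_nfc))
print(describe(a_nfc))
```

If the R sequence is still two code points after NFC normalization, a renderer cannot fall back on a single precomposed glyph and has to position the mark itself.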
Reporter
Comment 1•22 years ago
Reporter
Comment 2•22 years ago
Supporting combining diacritics requires the base character and the combining marks to be passed to ATSUI together, so the one-char-at-a-time ATSUI fallback won't do.
Depends on: atsui
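The requirement stated in Comment 2, that a base character and its combining marks be handed to the layout engine together rather than one character at a time, can be illustrated with a rough sketch. This is not Mozilla or ATSUI code: the function name `split_into_clusters` is hypothetical, and the grouping rule (attach every character in a Unicode mark category to the preceding character) is a simplification of full grapheme cluster segmentation.

```python
import unicodedata

def split_into_clusters(text):
    """Group each base character with the combining marks that follow it.

    Assumption: any character whose general category starts with 'M'
    (Mn, Mc, Me) belongs to the preceding cluster. Real text engines
    apply more elaborate segmentation rules.
    """
    clusters = []
    for ch in text:
        is_mark = unicodedata.category(ch).startswith("M")
        if clusters and is_mark:
            clusters[-1] += ch   # keep the mark with its base character
        else:
            clusters.append(ch)  # start a new cluster
    return clusters

# 'R' + COMBINING OGONEK stays together; 'x' is its own cluster.
print([len(c) for c in split_into_clusters("R\u0328x")])
```

A fallback path that iterates over `text` character by character would hand the ogonek to the layout engine on its own, which matches the "R followed by an ogonek" result in the description.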
It seems to work if you explicitly specify a font such as TITUS Cyberbit Basic or Arial Unicode MS. It is strange that it does not work without an explicit font, since those fonts are set as my default Unicode fonts.
Comment 4•22 years ago
>Supporting combining diacritics requires the base character and the combining
>marks to be passed to ATSUI together
I'd better assign this to nhotta. CC ftang.
Assignee: yokoyama → nhotta
Comment 7•20 years ago
This bug is quite old, but the problem still seems to persist. Frank, did you have any luck investigating? I will attach a file that shows a Greek nu with a diacritic line above it (which should be the sign for an antineutrino). In Mozilla 1.7 on Fedora Core 2 (Xorg), the line appears to the right of the nu. The same Mozilla version on Windows shows the line above the nu.
Comment 8•20 years ago
Comment 9•19 years ago
-> to default owner (rather than ftang's WONTFIX)
Assignee: ftang → smontagu
Status: ASSIGNED → NEW
Comment 10•18 years ago
I ran into this bug myself just now. Here is a small HTML file that exemplifies the problem. Firefox 2.0 (rv:1.8.1), Mac OS X 10.4.8.
Comment 11•17 years ago
Comment 12•17 years ago
Comment on attachment 265593 [details]
Displays all of the combining diacritical mark codes for testing
0300: à<br />
0301: á<br />
0302: â<br />
0303: ã<br />
0304: ā<br />
0305: a̅<br />
0306: ă<br />
0307: ȧ<br />
0308: ä<br />
0309: ả<br />
030A: å<br />
030B: a̋<br />
030C: ǎ<br />
030D: a̍<br />
030E: a̎<br />
030F: ȁ<br />
0310: a̐<br />
0311: ȃ<br />
0312: a̒<br />
0313: a̓<br />
0314: a̔<br />
0315: a̕<br />
0316: a̖<br />
0317: a̗<br />
0318: a̘<br />
0319: a̙<br />
031A: a̚<br />
031B: a̛<br />
031C: a̜<br />
031D: a̝<br />
031E: a̞<br />
031F: a̟<br />
0320: a̠<br />
0321: a̡<br />
0322: a̢<br />
0323: ạ<br />
0324: a̤<br />
0325: ḁ<br />
0326: a̦<br />
0327: a̧<br />
0328: ą<br />
0329: a̩<br />
032A: a̪<br />
032B: a̫<br />
032C: a̬<br />
032D: a̭<br />
032E: a̮<br />
032F: a̯<br />
0330: a̰<br />
0331: a̱<br />
0332: a̲<br />
0333: a̳<br />
0334: a̴<br />
0335: a̵<br />
0336: a̶<br />
0337: a̷<br />
0338: a̸<br />
0339: a̹<br />
033A: a̺<br />
033B: a̻<br />
033C: a̼<br />
033D: a̽<br />
033E: a̾<br />
033F: a̿<br />
0340: à<br />
0341: á<br />
0342: a͂<br />
0343: a̓<br />
0344: ä́<br />
0345: aͅ<br />
0346: a͆<br />
0347: a͇<br />
0348: a͈<br />
0349: a͉<br />
034A: a͊<br />
034B: a͋<br />
034C: a͌<br />
034D: a͍<br />
034E: a͎<br />
034F (non-visible): a͏<br />
0350: a͐<br />
0351: a͑<br />
0352: a͒<br />
0353: a͓<br />
0354: a͔<br />
0355: a͕<br />
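A listing like the one above can be generated rather than typed by hand. The sketch below is an assumption about how such a test file might be produced, not the attachment's actual source: the function name `diacritic_test_lines` is hypothetical, the marks are written as numeric character references, and the default range covers the whole Combining Diacritical Marks block (U+0300 to U+036F), whereas the listing above stops at 0355.

```python
def diacritic_test_lines(first=0x0300, last=0x036F, base="a"):
    """Build one HTML line per combining mark: label, base letter, mark."""
    lines = []
    for cp in range(first, last + 1):
        # Emit the mark as a numeric character reference so the file stays ASCII.
        lines.append(f"{cp:04X}: {base}&#x{cp:04X};<br />")
    return lines

lines = diacritic_test_lines()
print(lines[0])
print(len(lines), "lines")
```

Passing a different `base` (for example `"R"`) makes it easy to test pairs that have no precomposed equivalent.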
Comment 13•17 years ago
Comment on attachment 265593 [details]
Displays all of the combining diacritical mark codes for testing
Attachment #265593 - Attachment is obsolete: true
Comment 14•17 years ago
Comment 15•17 years ago
This is the official Unicode listing of the combining diacritical marks (posting is permitted by http://www.unicode.org/copyright.html ). As comparison with the updated test file should show:
1) the diacritical marks are frequently a bit off kilter (e.g., placed to the side when they should be centered)
2) some of the marks are missing entirely: 332, 333
3) 350-355 show up as question marks (though I'm not sure whether that is due to requiring combination with the Uralic alphabet or not)
4) others appear to have a wrong mark (31D, 31F, 325, 33B)
5) others seem as though they may be too small (326 and 329)
6) and this one seems just slightly off (32C).
This seems much better on the trunk, but there are still a number of problems and a few that are rendering as boxes-after where they at least rendered as characters-after on the 1.8 branch.
Comment 18•17 years ago
The only issue that I see in attachment 265595 [details] is that many marks are shifted to the right of the base character.
Repeating my comment from bug 386573: I think the broken cases are those where there is no precomposed character equivalent to the base+diacritic pair. For example, in
data:text/html,<p>a̋ e̋ i̋ ő ű</p>
the double acute is correctly placed only on the o and the u, which match U+0151 LATIN SMALL LETTER O WITH DOUBLE ACUTE and U+0171 LATIN SMALL LETTER U WITH DOUBLE ACUTE.
Assignee: smontagu → nobody
Component: Internationalization → GFX: Thebes
Flags: blocking1.9?
QA Contact: amyy → thebes
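The hypothesis in comment 18 suggests a way to predict which pairs are at risk: split the base letters by whether NFC normalization can compose them with the mark. The sketch below uses Python's standard `unicodedata` module; the function name `split_by_precomposed` is hypothetical, and the code only reports what Unicode defines, not what any particular build renders.

```python
import unicodedata

DOUBLE_ACUTE = "\u030B"  # COMBINING DOUBLE ACUTE ACCENT

def split_by_precomposed(bases, mark):
    """Partition base letters by whether base+mark has a precomposed form."""
    with_precomposed, without_precomposed = [], []
    for base in bases:
        composed = unicodedata.normalize("NFC", base + mark)
        if len(composed) == 1:
            with_precomposed.append(base)     # a single code point exists
        else:
            without_precomposed.append(base)  # must be drawn as base + mark
    return with_precomposed, without_precomposed

has_form, lacks_form = split_by_precomposed("aeiou", DOUBLE_ACUTE)
print("precomposed form exists:", has_form)
print("no precomposed form:", lacks_form)
```

For the five vowels in the example, only o and u end up in the first list, which lines up with the two letters reported as correctly rendered.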
Comment 19•17 years ago
Minusing. Not a regression. It's a feature that's never worked on the Mac. Per discussion with Vlad and Stuart. Marking wanted-1.9.
Flags: blocking1.9? → blocking1.9-
Whiteboard: [wanted-1.9]
Updated•17 years ago
Blocks: supercombiner
Comment 20•17 years ago
Sorry for the very delayed reply, but in response to Simon... x332 and x333 (the first of which I happen to need) don't show their marks at all, and everything from x350 to x36F is showing up as question marks. Compare with the chart at http://www.unicode.org/charts/PDF/U0300.pdf
Comment 21•17 years ago
Oh, and the problem, at least in my case, is on Windows... (sorry I hadn't clarified)
Comment 22•17 years ago
Argh... sorry again, I do see it is working in Firefox 3, albeit with the characters being shifted (finally got around to figuring out how to run two profiles at once)... Nice work...
Updated•17 years ago
Flags: wanted1.9+
Whiteboard: [wanted-1.9]
The remaining problem here, which smontagu identified in comment 18, now has bug 425650 to cover it specifically. Closing this bug WFM on trunk thanks to all the rewrites.
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → WORKSFORME