Closed
Bug 106803
Opened 23 years ago
Closed 19 years ago
problem in rendering 0x80~0x9f in win1252 encoding
Categories
(Core :: Internationalization, defect)
Tracking
()
RESOLVED
DUPLICATE
of bug 134733
People
(Reporter: shanjian, Assigned: jshin1987)
Details
(Keywords: intl)
This problem was spinned from 97343. It is a different problem from 0xb7 and it
is more complicated to fix. Related comment below:
>>shanjian
With my test case, I tried all raster font on winNT and win98, here is a summary
of my findings:
. Winnt:
"MS Sans Serif" and "MS Serif" has problem with 0x82 to 0x8c, 0x93 to 0x9c
"Modern", "Courier", "Small Fonts", "Roman" and all truetype font are OK.
. On Win98 , non-A-version
"MS Sans Serif" and "MS Serif" has problem with 0x82 to 0x8c, 0x93 to 0x9c,
0xb7
"Modern", "Courier", "Small Fonts", "Roman" and all truetype font are OK.
. On Win98, A-version
All looks fine.
Since 0x80 to 0x9f are left for C1 in 8859-x, the problem is not serious. 0xb7
is something we have to deal with. In Erik's existing code, charmap for certain
charset is generated using system's conversion API. Only a character in certain
charset can be converted to wchar, do we declare the support for such char in
charmap.
>> ftang
>You might also have noticed that 0x82 to 0x8c, 0x93 to 0x9c are untouched.
>IMO that is not worth fixing.
Why ? these seems important characters for some language too.
Probably we need a different fix for it; I think we should not generate the
multibyte by for loop from 0 to 255 (0xff). The origional definitation of ANSI
charcter set from my Win3.1 and Win95 programming book show it does not contains
the 0x80-0x9F part. see the following
Page 130 (Figure 3-8. The ANSI character set arranged by character code.) of
Programming Windows 3.1 3rd Edition, Charles Petzold, Microsoft Press, ISBN
1-55615-395-3
also see
Page 281 (Figure 5-8. The ANSI character set arranged by character code.) of
Programming Windows 95, Charles Petzold, Microsoft Press, ISBN 1-55615-676-6
Page 162 (Figure 3-8. The ANSI character set) of Programming Windows 95 with
MFC, Jeff Prosise, Microsoft Press, ISBN 1-55615-902-1
I think your approach for B7 is needed. But I think we should also fix other
characters. They might be used somewhere else. The unicode character of these
char are:
0x80 0x20AC #EURO SIGN
0x82 0x201A #SINGLE LOW-9 QUOTATION MARK
0x83 0x0192 #LATIN SMALL LETTER F WITH HOOK
0x84 0x201E #DOUBLE LOW-9 QUOTATION MARK
0x85 0x2026 #HORIZONTAL ELLIPSIS
0x86 0x2020 #DAGGER
0x87 0x2021 #DOUBLE DAGGER
0x88 0x02C6 #MODIFIER LETTER CIRCUMFLEX ACCENT
0x89 0x2030 #PER MILLE SIGN
0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON
0x8B 0x2039 #SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0x8C 0x0152 #LATIN CAPITAL LIGATURE OE
0x8E 0x017D #LATIN CAPITAL LETTER Z WITH CARON
0x91 0x2018 #LEFT SINGLE QUOTATION MARK
0x92 0x2019 #RIGHT SINGLE QUOTATION MARK
0x93 0x201C #LEFT DOUBLE QUOTATION MARK
0x94 0x201D #RIGHT DOUBLE QUOTATION MARK
0x95 0x2022 #BULLET
0x96 0x2013 #EN DASH
0x97 0x2014 #EM DASH
0x98 0x02DC #SMALL TILDE
0x99 0x2122 #TRADE MARK SIGN
0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON
0x9B 0x203A #SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0x9C 0x0153 #LATIN SMALL LIGATURE OE
0x9E 0x017E #LATIN SMALL LETTER Z WITH CARON
0x9F 0x0178 #LATIN CAPITAL LETTER Y WITH DIAERESIS
you can see the following characters are wrong on my NT4J
0x82 0x201A #SINGLE LOW-9 QUOTATION MARK
0x83 0x0192 #LATIN SMALL LETTER F WITH HOOK
0x84 0x201E #DOUBLE LOW-9 QUOTATION MARK
0x85 0x2026 #HORIZONTAL ELLIPSIS
0x86 0x2020 #DAGGER
0x87 0x2021 #DOUBLE DAGGER
0x88 0x02C6 #MODIFIER LETTER CIRCUMFLEX ACCENT
0x89 0x2030 #PER MILLE SIGN
0x8A 0x0160 #LATIN CAPITAL LETTER S WITH CARON
0x8B 0x2039 #SINGLE LEFT-POINTING ANGLE QUOTATION MARK
0x8C 0x0152 #LATIN CAPITAL LIGATURE OE
0x8E 0x017D #LATIN CAPITAL LETTER Z WITH CARON
0x93 0x201C #LEFT DOUBLE QUOTATION MARK
0x94 0x201D #RIGHT DOUBLE QUOTATION MARK
0x95 0x2022 #BULLET
0x96 0x2013 #EN DASH
0x97 0x2014 #EM DASH
0x98 0x02DC #SMALL TILDE
0x99 0x2122 #TRADE MARK SIGN
0x9A 0x0161 #LATIN SMALL LETTER S WITH CARON
0x9B 0x203A #SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
0x9C 0x0153 #LATIN SMALL LIGATURE OE
0x9E 0x017E #LATIN SMALL LETTER Z WITH CARON
0x9F 0x0178 #LATIN CAPITAL LETTER Y WITH DIAERESIS
MS Sans Serif can display
0x80 0x20AC #EURO SIGN
but not "MS Serif"
B7, 2018 and 2019 have no problem on both "MS Sans Serif" and "MS Serif"
Reporter | ||
Comment 1•23 years ago
|
||
I need to testing this in various windows versions and localization builds. We
might need to escalated this bug if user begin to complain. Set it to 1.01 now.
Status: NEW → ASSIGNED
Target Milestone: --- → mozilla1.0.1
Comment 2•20 years ago
|
||
shanjian is no longer working on mozilla for 2 years and these bugs are still
here. Mark them won't fix. If you want to reopen it, find a good owner first.
Status: ASSIGNED → RESOLVED
Closed: 20 years ago
Resolution: --- → WONTFIX
Comment 4•20 years ago
|
||
Mass Re-opening Bugs Frank Tang Closed on Wensday March 02 for no reason, all
the spam is his fault feel free to tar and feather him
Status: RESOLVED → REOPENED
Resolution: WONTFIX → ---
Comment 5•20 years ago
|
||
Reassigning Franks old bugs to Jungshik Shin for triage - Sorry for spam
Assignee: nobody → jshin1987
Status: REOPENED → NEW
Comment 6•19 years ago
|
||
Can someone reproduce this bug on current Trunk?
Mr. Kimura said, this bug may be dup of bug 130617.
Updated•19 years ago
|
Target Milestone: mozilla1.0.1 → ---
Comment 7•19 years ago
|
||
Sorry. bug 130617 -> bug 136017.
Comment 8•19 years ago
|
||
Very sorry. I told a wrong bug number.
This bug may be dup of bug 134733.
Comment 9•19 years ago
|
||
*** This bug has been marked as a duplicate of 134733 ***
Status: NEW → RESOLVED
Closed: 20 years ago → 19 years ago
Resolution: --- → DUPLICATE
You need to log in
before you can comment on or make changes to this bug.
Description
•