Closed Bug 61108 Opened 25 years ago Closed 24 years ago

prob converting locale to language

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla0.9

People

(Reporter: masaki.katakai, Assigned: bstell)

References

Details

(Keywords: intl)

Attachments

(9 files)

snapshot 25 years ago Masaki Katakai 50.63 KB, image/jpeg		Details
patch for nsLanguageAtomService.cpp 25 years ago Masaki Katakai 1.74 KB, patch		Details \| Diff \| Splinter Review
patch to nsPosixLocale.cpp to fix encoding parsing 25 years ago kill this account 4.08 KB, patch		Details \| Diff \| Splinter Review
patch to nsLocaleService.cpp and nsPosixLocale.cpp 25 years ago kill this account 6.41 KB, patch		Details \| Diff \| Splinter Review
patches to nsLocaleService.cpp, nsCollationUnix.cpp, nsDateTimeFormatUnix.cpp, and nsPosixLocale.cpp 25 years ago kill this account 7.99 KB, patch		Details \| Diff \| Splinter Review
Adding zn-CN entry for zh 24 years ago Masaki Katakai 463 bytes, patch		Details \| Diff \| Splinter Review
patches to langGroups.properties, nsLocaleService.cpp, nsCollationUnix.cpp, nsDateTimeFormatUnix.cpp, nsPosixLocale.cpp 24 years ago kill this account 10.81 KB, patch		Details \| Diff \| Splinter Review
revised patches to nsIPosixLocale.h, nsLocaleService.cpp, nsCollationUnix.cpp, nsDateTimeFormatUnix.cpp, nsPosixLocale.cpp, and langGroups.properties 24 years ago kill this account 13.09 KB, patch		Details \| Diff \| Splinter Review
revised patch with the "MAX_LOCALE_LEN+1" change 24 years ago kill this account 13.40 KB, patch		Details \| Diff \| Splinter Review

Masaki Katakai

Reporter

Description

•

25 years ago

In Solaris chinese locales (e.g. zh_CN.EUC), the same UI font isn't used for UI characters. Some glyphs are small than the others. I'll attach the snapshot. We need a reasonable workaround or the exact fix for Sun'S OEM release of Netscape 6. It seems that unicode encoding is used for UI glyphs, which doesn't use the same font for whole glyphs of UI. I defined the following fonts for zh-CN, but which couldn't solve the problem, user_pref("font.name.monospace.zh-CN", "sun-song-gb2312.1980-0"); user_pref("font.name.sans-serif.zh-CN", "sun-song-gb2312.1980-0"); user_pref("font.name.serif.zh-CN", "sun-song-gb2312.1980-0"); However, when I defined gb2312 for Unicode encoding, the UI glyphs can be displayed with the same font I specified, like, user_pref("font.name.monospace.x-unicode", "sun-song-gb2312.1980-0"); user_pref("font.name.sans-serif.x-unicode", "sun-song-gb2312.1980-0"); user_pref("font.name.serif.x-unicode", "sun-song-gb2312.1980-0"); I don't think this can be a reasonable workaround. I'll investigate more detail, but if you have any suggestion, please le me know.

Masaki Katakai

Reporter

Comment 1

•

25 years ago

Attached image snapshot — Details

Masaki Katakai

Reporter

Updated

•

25 years ago

Blocks: 60916

Summary: font size problem for Chinese UI characters → font size problem for Chinese UI characters

nhottanscp

Comment 2

•

25 years ago

Reassign to erik.

Assignee: nhotta → erik

Erik van der Poel

Comment 3

•

25 years ago

I think Mozilla should be using the fonts set for the language group of the locale when the document is in a Unicode-based encoding. Maybe the locale name (zh_CN.EUC) is not being recognized as a Simplified Chinese locale? (I thought I saw a related bug report today...)

Masaki Katakai

Reporter

Comment 4

•

25 years ago

Thanks for evaluation, Erik. Yes, you're right. x-western seems to be passed to GFX. I found a problem that nsLanguageAtomService.cpp can not get 'zh-CN' langGroup by 'zh-cn' key. In the nsLanguageAtomService class, nsIPersistentProperties is used for mLangGroups but it seems that the nsIPersistentProperties can not handle '-' character. So even when zh-cn=zh-CN exists in langGroups.properties, zh-CN can not be returned. res = mLangGroups->GetStringProperty(lowered, langGroupStr); if (NS_FAILED(res)) { PRInt32 hyphen = lowered.FindChar('-'); When the first query fails, the key 'zh-cn' will be changed to 'zh', then trying to get langGroup again. zh= entry doesn't exist also this should contain cn or tw to distinguish the locales. So, I believe we should change the codes to handle '-'. What do you think? How about using nsURLProperties class of uconv/ for this purpose? For testing, I added 'zh=zh-CN' entry and changed font definitions of zh-CN for Solaris, it worked fine.

tao

Updated

•

25 years ago

Assignee: erik → katakai

tao

Comment 5

•

25 years ago

Hi, Katakai-san: Erik will be out for a while. Would you please provide a patch so we can proceed to the review phase? thx

Erik van der Poel

Comment 6

•

25 years ago

If PersistentProperties cannot handle the '-' character, then that should be fixed (instead of changing LanguageAtomService). Are you sure that zh-cn is being looked up? Maybe it is actually zh_cn because of a bug in the locale name parsing code.

Masaki Katakai

Reporter

Comment 7

•

25 years ago

Sorry for confusion. I found my comment is wrong. The langgroup is retrieved by the exact locale name, e.g. "zh", "zh-gb.gbk", "zh-tw.euc". Those are not defined in langGroups.properties file. When the first retrieval fails, LanguageAtomService will cut the word after "-" and will try again, e.g. zh-tw and zh-cn will be "zh". "zh" is not defined in langGroups.properties. For example, "ja_JP.PCK" will work because "ja-jp.pck" fails, but "ja" is defined. We can not define "zh=zh-CN" or "zh=zh-TW" in langGroups.properties because it's for zh-cn and zh-tw. How about the following scenario? Current implementation uses only '-' when it fails, but I believe we should check '.' first, then use '-' for retrieval. 0) add the entry below to langGroups.properties zh=zh-CN 1) Start Mozilla in zh_TW.EUC locale in Solaris 2) zh_TW.EUC -> zh-tw.euc 3) "zh-tw.euc" isn't defined, so it fails 4) cut after '.', try "zh-tw" => zh-tw=zh-TW is defined in langGroups.properties 1) Start Mozilla in zh locale 2) zh -> zh zh=zh-CN is defined in langGroups.properties 1) in zh.GBK locale 2) zh.gbk fails (Assumed 60954 is fixed) 3) try zh zh=zh-CN is defined This seems to work for me. I'll attach the patch if ready.

Masaki Katakai

Reporter

Comment 8

•

25 years ago

Attached patch patch for nsLanguageAtomService.cpp — Details — Splinter Review

Masaki Katakai

Reporter

Comment 9

•

25 years ago

I've attached the proposal patch. Please review. Should I ask Shanjian also?

Erik van der Poel

Comment 10

•

25 years ago

I do not approve of this patch. There should not be any dot (.) in the locale name returned by the locale module. It should return names like en, en-US, zh-CN, etc. The locale name parser is bad, and should be fixed. We should not change nsLanguageAtomService to work around the bug in the locale name parser.

Masaki Katakai

Reporter

Comment 11

•

25 years ago

Thanks for comments. Do you mean UNIX's nsPosixLocale::GetXPLocale() is wrong? Actually this method returns ".EUC" as extra, not only "zh-TW". However, I think this ".EUC" should be there in UNIX platform because we can't find zh-TW.BIG5 and zh-TW.EUC is different. For example, nsDateTimeFormatUnix.cpp uses "NSILOCALE_TIME" and it stores "zh-TW.EUC", and tries to get charset by using the key. (at GetDefaultCharsetForLocale()). However, when we change it to only "zh-TW", we will not be able to get the correct charset from unixcharset.properties file. For japanese, ja-JP.PCK (SJIS in Solaris) will be considered as "ja-JP", and unixcharset.properties will return EUC-JP by mistake. Thus, in UNIX platform, I'm thinking .??? field is needed in nsLocaleSerice.

nhottanscp

Comment 12

•

25 years ago

Adding bstell to cc.

Linda Baliman

Comment 13

•

25 years ago

Added intl and nsbeta1 keywords. This bug is a showstopper Sun's Chinese releases (see snapshot)

Keywords: intl, nsbeta1

Keyser Sose

Comment 14

•

25 years ago

*** Bug 60179 has been marked as a duplicate of this bug. ***