Open Bug 460422 Opened 17 years ago Updated 3 years ago

IDN NAMEPREP and Punycode conversion problem for Japanese combining characters

Categories

(Core :: Internationalization, defect)

x86
Windows XP
defect

Tracking

()

People

(Reporter: yone, Unassigned)

Details

User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; ja; rv:1.9.0.3) Gecko/2008092417 Firefox/3.0.3 Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; ja; rv:1.9.0.3) Gecko/2008092417 Firefox/3.0.3 Japanese has two combining characters named voiced-sound mark (U+309B) and semi-voiced-sound mark (U+309C). Those characters are rarely used independently, but may happen. When following sequences are typed on address bar, Firefox produces invalid Punycode. Input Output http://ラフ゛リー.jp/ http://xn-- -wdusizci2i.jp/ ^^^^^^^^^^ U+30E9 U+30D5 U+309B U+30EA U+30FC http://ハッヒ゜ー.jp/ http://xn-- -yduvfxbq8w.jp/ ^^^^^^^^^^ U+30CF U+30C3 U+30D2 U+309C U+30FC After NFKC, U+309B is normalized as U+0020 U+3099, and U+309C is normalized as U+0020 U+309A as well. Produced Punycode strings seem including U+0020 that are invalid. Reproducible: Always Steps to Reproduce: See details. Actual Results: See details. Expected Results: Producing invalid Punycode should be avoided. Such sequences should be warned to users as 'invalid input sequence'.
Assignee: nobody → smontagu
Component: General → Internationalization
Product: Firefox → Core
QA Contact: general → i18n
Status: UNCONFIRMED → ASSIGNED
Ever confirmed: true

The bug assignee didn't login in Bugzilla in the last 7 months, so the assignee is being reset.

Assignee: smontagu → nobody
Status: ASSIGNED → NEW
Severity: normal → S3
You need to log in before you can comment on or make changes to this bug.