Closed Bug 412415 (Persian-AddrBar) Opened 17 years ago Closed 11 years ago

Replace Space with ZWNJ in domain-label for some TLDs

Categories

(SeaMonkey :: Location Bar, defect)

x86
Linux
defect
Not set
normal

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: zwnj, Unassigned)

References

(Blocks 1 open bug)

Details

Dot-test localized TLDs (e.g. the Persian one, ".آزمایشی"
/dot-aazmaayeshi/) have been on the root servers for a while.  Currently
there's only one domain under this TLD, which is "مثال" /mesaal/. Now
consider that we have a domain name, which is two of that word in a
row: "مثال‌مثال" /mesaal-mesaal/.  As you can see, we separate words in
domain names with ZWNJ.  This is something like camel case in Latin
script.  We use the ASCII hyphen too, but it doesn't look good for more
than a couple of words (same as in English domain names, right?).

OTOH, Shift+Space, the old standard way to input ZWNJ in
Iranian/Persian applications, doesn't work on Windows.  So almost 99%
of Iranians type a Space instead of ZWNJ. (This is originally an MS
disease. More info at http://zwnj.org/labels/microsoft.html .)

BTW, when Iranian users want to type that address, "مثال‌مثال.آزمایشی"
/mesaal-mesaal-dot-aazmaayeshi/, they type a SPACE between the two
words, which makes Firefox start a search instead of handling the
space.  Since Firefox won't turn that "I'm Feeling Lucky" address-bar
search off by default, I'd like to propose running a regex on the
entered URL and making a replacement for some TLDs, i.e. replacing
SPACE with ZWNJ *iff* there's a ".آزمایشی" or ".ir" in the URL...

And here is what the IRI RFC says (p. 12):
"""
Systems accepting IRIs MAY also deal with the printable characters in
US-ASCII that are not allowed in URIs, namely "<", ">", '"', space,
"{", "}", "|", "\", "^", and "`", in step 2 above.  If these
characters are found but are not converted, then the conversion
SHOULD fail.
"""

So, I'm asking to handle space in another special way for
Iranian/Persian users (maybe useful for other languages using Arabic
script), based on the TLD.

The regex for checking this case and/or applying the change seems
easy enough, though.
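
To make the proposal concrete, here is a rough sketch of the check and replacement in Python. It is only an illustration, not Firefox code: the function name, the TLD list, and the assumption that the typed text carries no scheme (no "http://") are all mine.

```python
import re

ZWNJ = "\u200c"  # U+200C ZERO WIDTH NON-JOINER

# Assumption: only the two TLDs named in this report opt in.
ZWNJ_TLDS = ("آزمایشی", "ir")

# "labels" is everything before the final ".<tld>" and may not contain
# "/", "?" or "#", so input that carries a scheme never matches.
_ADDRESS_RE = re.compile(
    r"^(?P<labels>[^/?#]*?)\.(?P<tld>"
    + "|".join(re.escape(t) for t in ZWNJ_TLDS)
    + r")(?P<rest>[/?#].*)?$",
    re.IGNORECASE,
)


def replace_spaces_with_zwnj(typed: str) -> str:
    """Swap spaces for ZWNJ in the host part, only for opted-in TLDs."""
    typed = typed.strip()
    match = _ADDRESS_RE.match(typed)
    if match is None or " " not in match.group("labels"):
        return typed  # not one of our TLDs, or nothing to fix
    # Collapse each run of spaces in the labels into a single ZWNJ.
    labels = re.sub(r" +", ZWNJ, match.group("labels"))
    return labels + "." + match.group("tld") + (match.group("rest") or "")
```

Anything that doesn't end in one of the listed TLDs (plus an optional path, query, or fragment) comes back unchanged, so ordinary searches still work. One obvious limitation: a search phrase that happens to end in ".ir" would be rewritten too.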

IE7's behavior when address-bar search is:
ON: Starts searching if it cannot resolve the first label (before first SPACE)
OFF: Shows an alert that says """Address Bar: Windows cannot find
'مثال مثال.آزمایشی'. Check the spelling and try again. [ OK ]""",
which means it doesn't handle it at all, and fails.
Blocks: fx35-l10n-fa
No longer blocks: Persian-Fx3.5
No longer blocks: fx35-l10n-fa
Product: Core → SeaMonkey
I don't think that we want to detect the language of URL components and behave differently based on the language context in terms of how we display spaces etc.

WONTFIX for now, please reopen if you disagree.
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → WONTFIX