Closed Bug 131498 Opened 22 years ago Closed 22 years ago

Can't join a channel with Japanese name

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: kazhik, Assigned: rginda)

References

Details

Attachments

(3 files, 6 obsolete files)

patch 22 years ago Koike Kazuhiko 4.87 KB, patch		Details \| Diff \| Splinter Review
patch v2 22 years ago Koike Kazuhiko 4.82 KB, patch		Details \| Diff \| Splinter Review
patch v3 22 years ago Koike Kazuhiko 13.68 KB, patch		Details \| Diff \| Splinter Review
patch v4 22 years ago Koike Kazuhiko 8.44 KB, patch		Details \| Diff \| Splinter Review
patch v5 22 years ago Koike Kazuhiko 8.55 KB, patch		Details \| Diff \| Splinter Review
patch v6 22 years ago NAITOH Jun 15.62 KB, patch		Details \| Diff \| Splinter Review
patch v7 for ChatZilla 0.8.8 22 years ago NAITOH Jun 15.76 KB, patch		Details \| Diff \| Splinter Review
"Open at Startup" patch for ChatZilla 0.8.9b or 0.8.9c 22 years ago NAITOH Jun 401 bytes, patch		Details \| Diff \| Splinter Review
patch v2 for ChatZilla 0.8.9b 22 years ago NAITOH Jun 2.27 KB, patch		Details \| Diff \| Splinter Review

Koike Kazuhiko

Reporter

Description

•

22 years ago

Can't join a channel with Japanese name.
But RFC1459 allows Japanese channel name.

http://www.rfc.net/rfc1459.html
1.3 Channels
****************************************
Channels names are strings (beginning with a '&' or '#' character) of
length up to 200 characters.  Apart from the the requirement that the
first character being either '&' or '#'; the only restriction on a
channel name is that it may not contain any spaces (' '), a control G
(^G or ASCII 7), or a comma (',' which is used as a list item
separator by the protocol).
****************************************

Koike Kazuhiko

Reporter

Comment 1

•

22 years ago

Attached patch patch (obsolete) — Details — Splinter Review

This patch converts channel name string from Unicode before sending
"JOIN" and "MODE" and to Unicode after receiving the responses for them.

Koike Kazuhiko

Reporter

Comment 2

•

22 years ago

The patch has two problems.

(1) 
  name = fromUnicode(name.toLowerCase() + " ");
  e.channel = e.server.addChannel (name);

This blank is necessary to avoid bug 114923.


(2)
function ucConvertIncomingMessage (e)
{
  if (e.code != "JOIN")
  {
    e.meat = toUnicode(e.meat);
  }
  return true;
}

This conditional statement should be placed somewhere else.

OS: Linux → All

Hardware: PC → All

Koike Kazuhiko

Reporter

Comment 3

•

22 years ago

Attached patch patch v2 (obsolete) — Details — Splinter Review

Attachment #75897 - Attachment is obsolete: true

Koike Kazuhiko

Reporter

Comment 4

•

22 years ago

*** Bug 131888 has been marked as a duplicate of this bug. ***

Koike Kazuhiko

Reporter

Comment 5

•

22 years ago

Attached patch patch v3 (obsolete) — Details — Splinter Review

patch v2(attachment 76553 [details] [diff] [review]) isn't case insensitive.

Attachment #76553 - Attachment is obsolete: true

Koike Kazuhiko

Reporter

Comment 6

•

22 years ago

rginda, could you review the patch?

This bug is very important for Japanese IRC users.

mental

Updated

•

22 years ago

Keywords: patch, review

Koike Kazuhiko

Reporter

Comment 7

•

22 years ago

Attached patch patch v4 (obsolete) — Details — Splinter Review

Converts reply strings of /list command.

Attachment #79535 - Attachment is obsolete: true

Koike Kazuhiko

Reporter

Comment 8

•

22 years ago

Attached patch patch v5 — Details — Splinter Review

Attachment #83711 - Attachment is obsolete: true

NAITOH Jun

Comment 9

•

22 years ago

Attached patch patch v6 (obsolete) — Details — Splinter Review

patch v5 has a bug in handling irc:// multibyte language encoding.
This patch fixed it, so that makes patch v5 useless.
(and this patch is include attachment 84546 [details] [diff] [review].
by http://bugzilla.mozilla.org/show_bug.cgi?id=146058)

This bug is very, very important for Japanese IRC users.
Please review.
If this bug don't fixed, Japanese can't use chatzilla.

Robert Ginda

Assignee

Comment 10

•

22 years ago

+    var encodedMatchText = fromUnicode(matchText + " ");
+    /* bug 114923 */
+    encodedMatchText = encodedMatchText.substr(0,encodedMatchText.length -1);
+

fromUnicode() resets the .charset of the decoder, which is supposed to work
around bug 114923.  Do we still need to append the empty space for some reason?

-            unescape(ary[1]).toLowerCase().replace("\n", "\\n"): "";
+            unescape(ary[1]).replace("\n", "\\n"): "";

Why aren't we lowercasing anymore, won't this cause problems elsewhere?  It
seems to me this will cause irc://irc.mozilla.org/Mozilla and
irc://irc.mozilla.org/MOZILLA to create two distinct channel tabs.

+        case "IRCChannel":
+            name = source.unicodeName;

Is it possible to have an encoded nickname too?

-               sourceObj ? sourceObj.name : this.name]);
+               sourceObj ? sourceObj.unicodeName : this.unicodeName]);

sourceObj could be a network or user object, which don't have a unicodeName
property.    I'm worried here that you're making new assumptions that the
current object will have a "unicodeName" property.  In many places, sourceObj or
currentObject can be an irc network, server, user, channel, or even the client
object itself.

+    var encodedName = fromUnicode(name + " ");
+    /* bug 114923 */
+    encodedName = encodedName.substr(0,encodedName.length -1);

again, doesn't toUnicode take care of bug 114923?

NAITOH Jun

Comment 11

•

22 years ago

>+    var encodedMatchText = fromUnicode(matchText + " ");
>+    /* bug 114923 */
>+    encodedMatchText = encodedMatchText.substr(0,encodedMatchText.length -1);
>
>fromUnicode() resets the .charset of the decoder, which is supposed to work
>around bug 114923.  Do we still need to append the empty space for some reason?

Yes! If I don't append the empty space, fromUnicode() is useless on ISO-2022-JP
decode.
If I don't append the empty space, fromUnicode() isn't  append the ISO-2022-JP
end code "ESC ) B", now.
This is very important ploblem.
All ISO-2022-JP's channel name is a channel name including an end code.
If it doesn't include end code, it will become another channel name,
so I can not join in the target channel.
The empty space is added in order to avoid this bug, and one character
equivalent to an empty space is deleted from the changed character sequence.

But this is add-hoc. (Originally, we should correct bug 114923.)


>-            unescape(ary[1]).toLowerCase().replace("\n", "\\n"): "";
>+            unescape(ary[1]).replace("\n", "\\n"): "";
>
>Why aren't we lowercasing anymore, won't this cause problems elsewhere?  It
>seems to me this will cause irc://irc.mozilla.org/Mozilla and
>irc://irc.mozilla.org/MOZILLA to create two distinct channel tabs.

This is no ploblem.
Because It is because same processing is performed by CIRCChannel() function.
It doesn't seems to me two distinct channel tabs.

This patch is three information on "channel name", is hold.

"unicodeName" is for a display.
"encodedName" for communication of an IRC client and a server.
"name" for the judgment of being the same channel.

Please don't send lowercasing channel name to IRC server.
If you send lowercasing channel name and other IRC user type /list case, the
channel name can not decoded.
Because , the channel name is lowercasing.

>+        case "IRCChannel":
>+            name = source.unicodeName;
>
>Is it possible to have an encoded nickname too?

This isn't nickname, but this is channel name.
CIRCChannel object has unicodeName property.

>-               sourceObj ? sourceObj.name : this.name]);
>+               sourceObj ? sourceObj.unicodeName : this.unicodeName]);
>
>sourceObj could be a network or user object, which don't have a unicodeName
>property.    I'm worried here that you're making new assumptions that the
>current object will have a "unicodeName" property.  In many places, sourceObj or
>currentObject can be an irc network, server, user, channel, or even the client
>object itself.

Sorry, this is a mistake.
I should have processed as follows.
--------------------------------
    if (fromUser)
    {
        statusString =
            getMsg("cli_statusString", [d.getMonth() + 1, d.getDate(),
                                        d.getHours(), mins,
                                        sourceObj.nick + "!" + 
                                        sourceObj.name + "@" + sourceObj.host]);    
    }
    else if (this.TYPE.search(/IRCChannel/) != -1)
    {
        statusString =
            getMsg("cli_statusString", [d.getMonth() + 1, d.getDate(),
                      d.getHours(), mins,
                      sourceObj ? sourceObj.unicodeName : this.unicodeName]);
    }
    else
    {
        statusString =
            getMsg("cli_statusString", [d.getMonth() + 1, d.getDate(),
                      d.getHours(), mins,
                      sourceObj ? sourceObj.name : this.name]);
    }
--------------------------------

>+    var encodedName = fromUnicode(name + " ");
>+    /* bug 114923 */
>+    encodedName = encodedName.substr(0,encodedName.length -1);
>
>again, doesn't toUnicode take care of bug 114923?

The influence of bug 114923 is a problem very serious in Japanese (ISO-2022-JP).
If I don't append the empty space, fromUnicode() isn't  append the ISO-2022-JP
end code "ESC ) B", now.

Robert Ginda

Assignee

Comment 12

•

22 years ago

> If I don't append the empty space, fromUnicode() isn't  append the ISO-2022-JP
> end code "ESC ) B", now.

Oh, I think I understand.  The convention is that the end-escape code is part of
an encoded channel name?

>>Is it possible to have an encoded nickname too?
>
> This isn't nickname, but this is channel name.
> CIRCChannel object has unicodeName property.

Yes, I understand that.  My question was whether or not it is possible to have
an endcoded nickname too.  If so, perhaps we should file a new bug to the
ability to handle encoded nicknames.

>  else if (this.TYPE.search(/IRCChannel/) != -1)

I'd prefer |this.TYPE == "IRCChannel"|, because it is more specific, and easier
to read.

Please attach an updated patch with your latest change.  I will create an XPI
and post it to <http://www.hacksrus.com/~ginda/chatzilla/> for testing.

NAITOH Jun

Comment 13

•

22 years ago

Attached patch patch v7 for ChatZilla 0.8.8 — Details — Splinter Review

>> If I don't append the empty space, fromUnicode() isn't  append the
ISO-2022-JP
>> end code "ESC ) B", now.
>
>Oh, I think I understand.  The convention is that the end-escape code is part
of
>an encoded channel name?

Yes , this is not IRC rule, but this is ISO-2022-JP rule.
"At the end of the sentence, it returns to the original state."

>>>Is it possible to have an encoded nickname too?
>>
>> This isn't nickname, but this is channel name.
>> CIRCChannel object has unicodeName property.
>
>Yes, I understand that.  My question was whether or not it is possible to have

>an endcoded nickname too.  If so, perhaps we should file a new bug to the
>ability to handle encoded nicknames.

Oh, I was mistaken in the interpretation. Sorry.

Unfortunately I can not use Japanses (encoded) nickname.
The character which can be used for nickname is only as follows.

Please check RFC2812.
ftp://ftp.isi.edu/in-notes/rfc2812.txt

  nickname   =	( letter / special ) *8( letter / digit / special / "-" )
  letter     =	%x41-5A / %x61-7A	; A-Z / a-z
  digit      =	%x30-39 		; 0-9
  special    =	%x5B-60 / %x7B-7D
		   ; "[", "]", "\", "`", "_", "^", "{", "|", "}"

but, it will be glad if a capital letter small letter can be used.

>>  else if (this.TYPE.search(/IRCChannel/) != -1)
>
>I'd prefer |this.TYPE == "IRCChannel"|, because it is more specific, and
easier
>to read.

OK.

>Please attach an updated patch with your latest change.  I will create an XPI
>and post it to <http://www.hacksrus.com/~ginda/chatzilla/> for testing.

OK.
It was reflected with this patch.
I expect. Thank you.

(P.S. this patch is include attachment 84546 [details] [diff] [review].
by http://bugzilla.mozilla.org/show_bug.cgi?id=146058)

Attachment #89860 - Attachment is obsolete: true

Robert Ginda

Assignee

Comment 14

•

22 years ago

I've posted an XPI with your changes to <http://www.hacksrus.com/~ginda/chatzilla/>.

NAITOH Jun

Comment 15

•

22 years ago

Attached patch "Open at Startup" patch for ChatZilla 0.8.9b or 0.8.9c (obsolete) — Details — Splinter Review

Sorry, patch v7 is mistake "Open at Startup" handling.
patch v7 has "Open at Startup" bug in multibyte language encoding name channel.


If you join #"multibyte language encoding name channel",
and set "Open at Startup" case,
and reboot chatzilla, so you can't join IRC server.
This is critical bug, Sorry.

NAITOH Jun

Comment 16

•

22 years ago

Attached patch patch v2 for ChatZilla 0.8.9b — Details — Splinter Review

some fix for multibyte language encoding name channel probrem.
under case bug fixed.

/who
/invite <nick> <multibyte channnel>
/kick <nick> <reason>
I think that this kick explanation is right.
I think that chatzilla's kick explanation is wrong.

and , this patch include "Open at Startup" in  multibyte language encoding name
channel patch.
(attachment 90409 [details] [diff] [review] )

thanks

Attachment #90409 - Attachment is obsolete: true

Robert Ginda

Assignee

Updated

•

22 years ago

Depends on: 160462

Robert Ginda

Assignee

Comment 17

•

22 years ago

checked in to trunk

Status: NEW → RESOLVED

Closed: 22 years ago

Resolution: --- → FIXED

NAITOH Jun

Comment 18

•

22 years ago

I try Trunk nightly chatzilla.
(Mozilla/5.0 (Windows; U; Win98; en-US; rv:1.1b) Gecko/20020802)

I try to use (join) Japanese(ISO-2022-jp) channel name.
It is work fine ;)
Thanks!

Myk Melez [:myk] [@mykmelez]

Updated

•

20 years ago

Product: Core → Other Applications

You need to log in before you can comment on or make changes to this bug.