Closed Bug 162182 Opened 22 years ago Closed 22 years ago

Null characters embedded in HTML show up as weird kanji

Categories

(Core :: Internationalization, defect)

x86
Windows XP
defect
Not set
normal

Tracking

()

VERIFIED INVALID

People

(Reporter: nathans, Assigned: tetsuroy)

References

()

Details

(Keywords: intl)

Attachments

(4 files)

From Bugzilla Helper:
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.1b) Gecko/20020809
BuildID:    2002080908

This page appears to have 0x00 characters (null) embedded within it. Those
characters show up unexpectedly as kanji. My View » Character Coding is set to
"Western (ISO-8859-1)" and View » Character Coding » Autodetect is set to "(Off)".

Reproducible: Always
Steps to Reproduce:
1. Visit the above URL using Mozilla on Windows XP.

Actual Results:  Weird Japanese/Chinese characters are seen among the text.

Expected Results:  Those characters should not be rendered.

This rendering only happens for me on Windows (Mac OS X build 2002081103 just
ignores the 0x00 characters). I have also seen this, on Windows, when viewing
MacSlash, so it perhaps it is some Mac-based HTML editor inserting nulls into
the page.
Attached image Windows screenshot
Attached image Mac screenshot
Attached file Page source code
also see something rendered there on Linux, 2002080804.
-> Int
Assignee: Matti → yokoyama
Status: UNCONFIRMED → NEW
Component: Browser-General → Internationalization
Ever confirmed: true
QA Contact: asa → ruixu
Get squares instead of kanji in Build 2002081018 Win98
squares on linux trunk cvs 2002-08-11 (that's "wfm" I suppose?)
Keywords: intl
QA Contact: ruixu → ylong
Inside a html file, what's this "Null" unicode point 0000 supposedly represent?
and what's that web wants to show there?

For my point of view, it not a good idea to use it present space or blank
line...etc., because there are certain ways to do that.
Attached file Minimal test case
I think that null characters should simply be ignored. My reasoning is that:
(1) this is currently what Mozilla itself does on Mac OS X; and (2) this is
what MSIE does on both Windows XP and Mac OS X.

For those who are seeing squares instead of Chinese characters, maybe the font
you are using doesn't include that character, or the character mapping is
different on your system. You are still seeing the wrong thing (i.e.,
something) where there should be nothing.
this is a bug on those content. There are no html specification said the null
should be ignore.
On my window, I saw question mark on those spot. 
Status: NEW → RESOLVED
Closed: 22 years ago
Resolution: --- → INVALID
Mark as verified.
Status: RESOLVED → VERIFIED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: