Latin1 pages mis-displayed as Unicode despite preferences

RESOLVED WORKSFORME

Status

()

Core
Internationalization
RESOLVED WORKSFORME
15 years ago
12 years ago

People

(Reporter: Jamie Zawinski, Assigned: Jungshik Shin)

Tracking

({intl})

Trunk
x86
Linux
Points:
---

Firefox Tracking Flags

(Not tracked)

Details

(URL)

(Reporter)

Description

15 years ago
(This may well be a duplicate, but I don't understand
the other bug reports well enough to tell tell.)

Sites that have no charset specified in their HTTP
headers or HTML meta tags are being interpreted
in UTF-8 by default, instead of in the encoding
that I selected in Preferences.

When I load, e.g., http://www.jwz.org/ the 
"View / Character Coding" menu shows that the document
is being interpreted as "Unicode (UTF-8)", despite
the fact that "Preferences / Languages / Character Coding"
is set to "Western (ISO-8859-1)" (and has been forever,
so this isn't just a "you have to restart the browser"
problem.)

This means that pretty much every news site I ever look
at (e.g., http://www.nytimes.com/) show up with ? instead
of ', and that?s very distracting...  

The workaround is to select "Western" off of the
"Character Coding" menu, but I have to do that for
*each page*.

One set of totally vanilla headers causing this behavior:

    HTTP/1.1 200 OK
    Date: Mon, 07 Oct 2002 01:57:25 GMT
    Server: Apache/1.3.26 (Unix)
    Last-Modified: Mon, 07 Oct 2002 01:00:02 GMT
    ETag: "242920d-85bc-3da0dc92"
    Accept-Ranges: bytes
    Content-Length: 34236
    Keep-Alive: timeout=40, max=4096
    Connection: Keep-Alive
    Content-Type: text/html

    <HEAD><TITLE>jwz.org</TITLE></HEAD>
    <BODY ...


Linux 2.4.9-13smp #1 SMP Tue Oct 30 19:06:50 EST 2001 i686 unknown
Mozilla 1.1b
Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.1b) Gecko/20020820

No locale-related environment variables set.

Comment 1

15 years ago
Definitely a bug, but you do have an old beta.

Updated

15 years ago
Keywords: intl
QA Contact: ruixu → ylong

Comment 2

15 years ago
Works for me on latest build / linux RH7.2, the page is marked as western
iso-8859-1 which is the my browser default charset in Preferences.  Even I
select Universal, I still didn't see UTF-8 show up.

Jamie Zawinski:
How about you try change charset to western, then clear the Cache (Edit |
Advanced | Cache, both memory and Disk)?
If it won't work for you, could you please try to install a latest trunk build,
and create a new profile to see if there is any difference? thanks!
(Reporter)

Comment 3

15 years ago
Seems to be (rightly) defaulting to Latin1 since I upgraded to 1.2b...

Comment 4

13 years ago
I think both roy and me are off mozilla for more than 2 years. If these bugs are
still here now, I think the real stauts is 'won't fix'. If you want to reopen
it, please find a new owner for it first. 
Status: NEW → RESOLVED
Last Resolved: 13 years ago
Resolution: --- → WONTFIX

Comment 5

13 years ago
Mass Reassign Please excuse the spam
Assignee: tetsuroy → nobody

Comment 6

13 years ago
Mass Re-opening Bugs Frank Tang Closed on Wensday March 02 for no reason, all
the spam is his fault feel free to tar and feather him
Status: RESOLVED → REOPENED
Resolution: WONTFIX → ---

Comment 7

13 years ago
Reassigning Franks old bugs to Jungshik Shin for triage - Sorry for spam
Assignee: nobody → jshin1987
Status: REOPENED → NEW

Comment 8

12 years ago
WORKSFORME
Status: NEW → RESOLVED
Last Resolved: 13 years ago12 years ago
Resolution: --- → WORKSFORME
You need to log in before you can comment on or make changes to this bug.