Closed Bug 267582 Opened 20 years ago Closed 17 years ago

xinhuanet.com - UTF-8 content reported as iso-8859-1

Categories

(Tech Evangelism Graveyard :: French, defect)

x86
Windows XP
defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: micrond, Unassigned)

References

()

Details

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; fr-FR; rv:1.7.3) Gecko/20041027 Firefox/1.0RC1
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; fr-FR; rv:1.7.3) Gecko/20041027 Firefox/1.0RC1

French accented characters are improperly displayed on the page AND in source
code view. For example : where you would expect to see "i" with "^" accent
giving "î", the page display these two characters: "î".

Comparing both source code view and normal rendering of IE and Firefox show the
same error where IE is correct and Firefox wrong.

Reproducible: Always
Steps to Reproduce:
1. Display site the title should start with "L'île" where the character after
the apostrophe is an "i" with the accent "^" on top of it.

Actual Results:  
Some characters in title or on page improperly rendered both in title, core page
and source view.

Expected Results:  
Show proper characters.
The server, in the header "Content-Type: text/html; charset=ISO-8859-1", and the
HTML, in "<meta http-equiv="Content-Type" content="text/html;
charset=iso-8859-1" />", both tell us the page is in the ISO-8859-1 character
encoding. That's obviously false: when you are viewing source, you see garbage
characters before the <html> which are actually a UTF-8 byte order mark, and
when you manually change View - Character Encoding to UTF-8, the display is
correct. IE displays the correct characters by being utterly wrong: it is sure
it knows better than both the server and the HTML, so it silently corrects them.
This time, it's right; other times, it's utterly wrong.

So, this bug should be moved to the Tech Evangelism product, but that requires
choosing a language. Hrm. French, I suppose.
Assignee: firefox → french
Component: General → French
Product: Firefox → Tech Evangelism
QA Contact: firefox.general → french
Summary: French accented characters are improperly displayed → xinhuanet.com - UTF-8 content reported as iso-8859-1
The page http://french.xinhuanet.com/french seems to be regularly updated, the
content/type bug is still present. There is no resolution to
www.french.xinhuanet.com for which images and javascripts refers so many images
are broken and the script does not work, but over-riding the content-type to
utf8 works fine. 
This is not a tech evangelism imao but a lazy webmaster's bug
Tech evangelism is PRECISELY about convincing lazy programmers to fix their code
curl -I http://www.french.xinhuanet.com/french/2004-11/04/content_43051.htm
HTTP/1.1 200 OK
Server: Apache
Last-Modified: Thu, 04 Nov 2004 01:16:21 GMT
ETag: "31c022-56e3-818ac340"
Content-Type: text/html; charset=UTF-8
Date: Fri, 09 May 2008 03:18:09 GMT
Connection: keep-alive

No garbage characters in source, though page still has ISO-8859-1 as its charset in the META tag. Looks FIXED to me in both Camino trunk and Firefox 2.
Status: UNCONFIRMED → RESOLVED
Closed: 17 years ago
Resolution: --- → FIXED
Product: Tech Evangelism → Tech Evangelism Graveyard
You need to log in before you can comment on or make changes to this bug.