Closed Bug 277868 Opened 20 years ago Closed 16 years ago

cfilt.iitb.ac.in - for UTF-8 pages, http Content-Type header is set to ISO-8859-1

Categories

(Tech Evangelism Graveyard :: Other, defect)

defect
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: arvindvenkataramani, Assigned: arvindvenkataramani)

References

()

Details

Attachments

(1 file)

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8a6) Gecko/20041220
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8a6) Gecko/20041220

mozilla fails to detect the UTF-8 character encoding for this page:
http://www.cfilt.iitb.ac.in/wordnet/webhwn/lookup1.html, but does so correctly
for this page: http://geocities.com/hanu_man_ji/hindi_page.html

Reproducible: Always

Steps to Reproduce:
1. view http://www.cfilt.iitb.ac.in/wordnet/webhwn/lookup1.html

Actual Results:  
the page did not render correctly. after setting the character encoding
manually, it renders correctly

Expected Results:  
correctly set the character encoding to UTF-8

having looked at the source for these and other pages, i find that there is some
junk text right at the beginning of the pages, just before the <html> tag. this
may be the source of the problem. otherwise the pages seem to be identical. a
sample page that renders correctly is attached
this is a sample page that renders correctly
The server sends out (via HTTP) 'Content-Type: text/html; charset=ISO-8859-1'
which has a higher priority than the meta tag in the document (which declares
the document as in UTF-8). If the problem is site-wide, this is a tech
evangelism issue. 'Autodetect' (which is invoked ONLY when both HTTP and meta
tag info are missing) cannot do anything about this. Contact the site admin and
ask her/him to fix the server configuration. 
Status: UNCONFIRMED → NEW
Component: Internationalization → Other
Ever confirmed: true
Product: Core → Tech Evangelism
Version: Trunk → unspecified
Assignee: smontagu → arvindvenkataramani
OS: Windows XP → All
Hardware: PC → All
Conforming summary to TFM item 10 at 
http://www.mozilla.org/projects/tech-evangelism/site/procedures.html#file-new
Summary: utf-8 character encoding not auto detected → cfilt.iitb.ac.in - utf-8 character encoding not auto detected
Summary: cfilt.iitb.ac.in - utf-8 character encoding not auto detected → cfilt.iitb.ac.in - for UTF-8 pages, http Content-Type header is set to ISO-8859-1
QA Contact: amyy → other
url gone
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → FIXED
Product: Tech Evangelism → Tech Evangelism Graveyard
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: