Closed Bug 152814 Opened 24 years ago Closed 23 years ago

detecting BOM when loading script

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla1.2beta

People

(Reporter: swann, Assigned: shanjian)

References

Details

(Keywords: intl)

Attachments

(4 files)

html file 24 years ago Teruko Kobayashi 189 bytes, text/html		Details
UTF8 JS file 24 years ago Teruko Kobayashi 173 bytes, text/plain		Details
Javascript console 24 years ago Teruko Kobayashi 25.51 KB, image/jpeg		Details
patch 23 years ago Shanjian Li 1.73 KB, patch	ftang : review+ jst : superreview+	Details \| Diff \| Splinter Review

David

Reporter

Description

•

24 years ago

When a unicode javascript file is included, i have the following error in the javascript console : Error: illegal character Source File: /login.js Line: 1 Source Code: ��/

Christopher Hoess (gone)

Comment 1

•

24 years ago

Is the Javascript file being served with the right character set from the server?

Phil Schwartau

Comment 2

•

24 years ago

A unicode decoder problem? Not sure if this is Parser or International, but I don't think this is JS Engine. Reassigning to International. swann@cqs.dyndns.org: is there a URL we can go to that shows the problem? Without a testcase or a URL, we won't be able to work on this; thanks -

Assignee: rogerl → yokoyama

Component: JavaScript Engine → Internationalization

QA Contact: pschwartau → ruixu

Summary: include unicode javascript files don't work → Included Unicode JavaScript files don't work

Rui Xu

Updated

•

24 years ago

Keywords: intl

QA Contact: ruixu → teruko

David

Reporter

Comment 3

•

24 years ago

unfortunatly the file is inside the intranet . actually all the js files in my application are included and stored in unicode.. the main page which include them is encoded in ISO-8859-1... the error occurs for each file inclusion.. to reproduce the bug, one just have to make a simple html page encoded with ISO-8859-1 that include a javascript file (even empty) stored in unicode.

Teruko Kobayashi

Comment 4

•

24 years ago

I tested this in 6-21 branch Win32 build. I could reproduce this. The beggining of the UTF8 js file caused the problem.

Teruko Kobayashi

Comment 5

•

24 years ago

Attached file html file — Details

Teruko Kobayashi

Comment 6

•

24 years ago

Attached file UTF8 JS file — Details

Teruko Kobayashi

Comment 7

•

24 years ago

Attached image Javascript console — Details

Teruko Kobayashi

Comment 8

•

24 years ago

When I run the attached html file (id=88692) which included UTF8 file (id = 88693), I got the error message in Javascript console (id=88694). David, is this what you see?

Status: UNCONFIRMED → NEW

Ever confirmed: true

David

Reporter

Comment 9

•

24 years ago

yes teruko. it's exactly the same error i got, due to the fact that js file is stored in UTF-8

Roy Yokoyama

Updated

•

24 years ago

Status: NEW → ASSIGNED

Target Milestone: --- → mozilla1.2beta

Roy Yokoyama

Comment 10

•

24 years ago

If I manually change the attached html file from <ISO-8859-1> to <UTF-8> from browser menu; then JS works fine. I guess the decoder is respecting the doc charset.

Target Milestone: mozilla1.2beta → ---

Frank Tang

Updated

•

24 years ago

Blocks: 157673

Frank Tang

Comment 11

•

23 years ago

what happen is the html is in iso-8859-1 and the js file is UTF-8 file without any labeling. (http charset) So the browser assume the js file is in the same encoding of the html and try to load it with iso-8859-1 converter. Since this utf-8 js file is created by window notpad, it generate 3 bytes of BOM in utf8 in the beginning. This make the JS engine think this is not a valid file. What could we do? I think one thing we could do is in the code which convert JS into unicode. look at the first several bytes, like what we do in html parser. And detect UTF16 BOM or UTF-8 BOM Reassign this to shanjian nsbeta1+ for m1.2final

Assignee: yokoyama → shanjian

Status: ASSIGNED → NEW

Keywords: nsbeta1+

Summary: Included Unicode JavaScript files don't work → Included UTF8 JavaScript files from a non UTF8 html don't work

Target Milestone: --- → mozilla1.2beta

Daniel Wang

Comment 12

•

23 years ago

setting charset solves the problem: <script type="text/javascript" charset="UTF-8" src="test.js"></script> unfortunately the code will still produce error in msie5. can anyone check if this is also a prob with ns4.x ?

Daniel Wang

Comment 13

•

23 years ago

msie5 loads the file fine if the first 3 characters in the js file are removed. if the first 3 bytes are removed, but charset is not set, msie5 and mozilla will load the script but produce (document.write) jibberish chars in iso-8859-1

Shanjian Li

Assignee

Comment 14

•

23 years ago

Attached patch patch — Details — Splinter Review

Shanjian Li

Assignee

Comment 15

•

23 years ago

ftang, could you review?

Status: NEW → ASSIGNED

Summary: Included UTF8 JavaScript files from a non UTF8 html don't work → detecting BOM when loading script

Frank Tang

Comment 16

•

23 years ago

Comment on attachment 97079 [details] [diff] [review] patch r=ftang make sure there are space between if and (

Attachment #97079 - Flags: review+

Shanjian Li

Assignee

Comment 17

•

23 years ago

I will take care of the space format issue before checkin. dbaron, could you sr?

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Comment 18

•

23 years ago

Comment on attachment 97079 [details] [diff] [review] patch I could, although I'd much prefer if jst did since he's the module owner for this area of code. (Other than the bizarre indentation, which should be made consistent with the rest of the file, it seems fine to me, although the "should change to" comments seem to have been done already.)

Shanjian Li

Assignee

Comment 19

•

23 years ago

johny,I try to avoid you this time in order not to keep you too busy. But it seems like I have to do ask you about this one since you are the best candidate.

Johnny Stenback (:jst)

Comment 20

•

23 years ago

Comment on attachment 97079 [details] [diff] [review] patch + +// This function is copied from nsParser.cpp. It was simplied though, unnecessary part is removed. "simlified" is mis-spelled above. +static PRBool DetectByteOrderMark(const unsigned char* aBytes, PRInt32 aLen, nsString& oCharset) { Please put the static keyword and the return type on its own line, and opening brace on its own line as well. + if (aLen < 2) + return false; + + switch(aBytes[0]) + { ... Please clean up this indentation mess. No tabs, 2-space indentation. Other than that (and what dbaron pointed out about the comments), sr=jst

Attachment #97079 - Flags: superreview+

Shanjian Li

Assignee

Comment 21

•

23 years ago

fix checked in.

Status: ASSIGNED → RESOLVED

Closed: 23 years ago

Resolution: --- → FIXED

Christian :Biesinger (don't email me, ping me on IRC)

Comment 22

•

23 years ago

urg. this patch is bad. it uses AssignWithConversion, which should no longer be used, and uses nsString as function argument. It should use a more general type as the argument (nsAString seems appropriate). as for + oCharset.AssignWithConversion("UTF-8"); it would be better written as: oCharset.Assign(NS_LITERAL_STRING("UTF-8")); which can also be faster, if the compiler supports doing the conversion at compile time.

Christian :Biesinger (don't email me, ping me on IRC)

Comment 23

•

23 years ago

oh yeah, one more thing: + return oCharset.Length() > 0; can be written as: return !oCharset.IsEmpty(); which can be faster.

Roland Mainz

Comment 24

•

23 years ago

This checkin broke the OS/2 tinderbox on http://tinderbox.mozilla.org/showbuilds.cgi?tree=SeaMonkey-Ports ... is anyone looking at this ?

Katsuhiko Momoi

Comment 25

•

23 years ago

If you have a serious concern with the current fix, please re-open this bug or file another one.

Christian :Biesinger (don't email me, ping me on IRC)

Comment 26

•

23 years ago

reopening bug so that the issues from comment 22 and comment 23 can get addressed.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Shanjian Li

Assignee

Comment 27

•

23 years ago

bug 170339 filed to code cleanup. Close this bug.

Status: REOPENED → RESOLVED

Closed: 23 years ago → 23 years ago

Resolution: --- → FIXED

Teruko Kobayashi

Comment 28

•

23 years ago

Changed QA contact to ylong@netscape.com.

QA Contact: teruko → ylong

Yuying Long

Comment 29

•

23 years ago

Verified it's fixed in 10-31 trunk build. However it's still in 1.0.2 branch build. I'm marking this as verified, if any one think it's important for branch build, feel free to nominate it.

Status: RESOLVED → VERIFIED

Frank Tang

Updated

•

23 years ago

Depends on: 180372

Frank Tang

Updated

•

23 years ago

No longer blocks: 157673

You need to log in before you can comment on or make changes to this bug.