Closed Bug 106843 Opened 24 years ago Closed 23 years ago

add special character back for hwaara's name to folderPane.css

Tracking

()

Status:

VERIFIED FIXED

Milestone:

mozilla1.1beta

People

(Reporter: sspitzer, Assigned: bzbarsky)

References

Details

(Keywords: helpwanted, intl)

Attachments

(2 files, 2 obsolete files)

Add an assertion 24 years ago Boris Zbarsky [:bzbarsky] 1.57 KB, patch		Details \| Diff \| Splinter Review
Assertion patch using mURL 24 years ago Boris Zbarsky [:bzbarsky] 1.07 KB, patch	pierre : review+ attinasi : superreview+	Details \| Diff \| Splinter Review
Håkan's name in UTF-8 24 years ago Simon Montagu :smontagu 1.28 KB, patch		Details \| Diff \| Splinter Review
Revert the patch from bug 106674 23 years ago Boris Zbarsky [:bzbarsky] 1.15 KB, patch	dbaron : review+ sspitzer : superreview+	Details \| Diff \| Splinter Review

(not reading, please use seth@sspitzer.org instead)

Reporter

Description

•

24 years ago

the special character causes us to fail to convert folderPane.css to UTF-8 boris writes: "we now default to UTF8 for all the chrome sheets, since that's the charset of XML documents by default In ISO-8859-1 that char is apparently not a problem.... 1) take that char out 2) put an @charset rule in that sheet 3) fix the code that gets the charset" before you add @charset, I'd want to know what it costs us. if that means a performance hit, let's come up with something else. http://bugzilla.mozilla.org/show_bug.cgi?id=106674 sorry about that, hwaara

Boris Zbarsky [:bzbarsky]

Assignee

Comment 1

•

24 years ago

OK. "fix the code that gets the charset" means one of: a) "don't ask the document for its charset". I believe this would make the intl folks very unhappy. b) "don't ask the document for its charset for chrome:// urls". This seems silly and is a bigger perf hit than adding @charset, imo. adding @charset _would_ mean a bit of a perf hit, but very very little. Right now we check for @charset and if it's found we get the charset. So we would need to do the getting. On the other hand, we would not have to ask the document for a charset, which avoids a string copy. Once I fix bug 80106 the getting will be no slower than reading past an extra line or two of comments in the file and may well be faster than asking the document and copying that string. So I think my vote is for just adding an |@charset "ISO-8859-1";| rule to the sheet... Pierre? Thoughts?

Status: NEW → ASSIGNED

OS: Windows 2000 → All

Priority: -- → P1

Hardware: PC → All

Target Milestone: --- → mozilla0.9.6

(not reading, please use seth@sspitzer.org instead)

Reporter

Comment 2

•

24 years ago

in the mean time, can you add an assertion so that we'll know where to look if this happens again?

Boris Zbarsky [:bzbarsky]

Assignee

Comment 3

•

24 years ago

Yeah, no matter what solution we pick the patch I attach will include that assertion. Unless you want the assertion ASAP, in which case please let me know.

Pierre Saslawsky

Comment 4

•

24 years ago

It is a bit harsh to reject a stylesheet just because a character conversion failed. Is there any way we can ignore the error and go on with the conversion?

Håkan Waara

Comment 5

•

24 years ago

This is very similar to bug 91933 that I filed a while ago, that makes .properties and DTD files fail if I add my name there. Boris, do you think you could look into that next (if it's related) ?

Daniel Glazman (:glazou) (not active in Mozilla any more)

Comment 6

•

24 years ago

Can't we just throw away unrecognized chars if we are in a comment ? I agree with Pierre that it sounds weird to refuse a sheet because of a comment we don't deal with anyway. Håkan: be glad you live in an iso-8859-1 country ; another charset and you just could not add your name to a file w/o transliteration or utf-8 encoding.

Boris Zbarsky [:bzbarsky]

Assignee

Comment 7

•

24 years ago

OK. I don't know _that_ much about charsets. ccing some intl folks for insight. The problem with UTF8 in particular is that chars are encoded using a variable number of bytes. So picking up after an error may well have alignment issues. Looking at the nsIUnicodeConverter interface, it looks like on error the converter _does_ tell us how much data was actually consumed so we can "restart" it at some point further along the string. The wisdom of doing so I again leave to the intl folks. Daniel, with the current system we have no way of knowing we are in a comment. We convert the string to UCS2 before we ever give it to the CSS parser. Furthermore, because of the variable-width thing a single char can affect the way the rest of the document is decoded. I don't know how robust the various encodings are at permitting a unique interpretation of the remainder of the document given an error like this. The HTML parser does some handling of this sort of error at: http://lxr.mozilla.org/seamonkey/source/htmlparser/src/nsScanner.cpp#343 so apparently it's not unreasonable. If the intl folks have no issues with me doing something like that, I could do it. Adding dependency since depending on what we decide here the patch in bug 80106 may need to be reworked. Let's get a decision here ASAP, please. I'd really like to fix bug 80106. :) On a side note, why was the assertion on conversion failure in nsUTF8ToUnicode::Convert commented out? That would have made diagnosing this problem much simpler... :)

Blocks: 80106

Keywords: intl

Pierre Saslawsky

Comment 8

•

24 years ago

The i18n team should define how errors are processed. A solution could be to pass a boolean ('aIgnoreErrors') to Convert() and let the i18nn code replace the bad characters with U+FFFD or something else. This code shouldn't be in the HTML or CSS parsers, I think. If 'aIgnoreErrors' is true, Convert() would still return an error code but it would complete the character conversion. Returning the error code would leave the caller the choice to use a buffer that is known to contain conversion errors, or attempt a conversion with a different character set.

Boris Zbarsky [:bzbarsky]

Assignee

Updated

•

24 years ago

No longer blocks: 80106

Boris Zbarsky [:bzbarsky]

Assignee

Comment 9

•

24 years ago

Pushing out, but could we please come to a decision on this one? Especially if this'll involve changing the converter interface, we should do it as soon as we can.

Target Milestone: mozilla0.9.6 → mozilla0.9.7

nhottanscp

Comment 10

•

24 years ago

I filed a bug 107790 for nsIUnicodeDecoder enhancement request.

Pierre Saslawsky

•

24 years ago

Pushing out to match bug this depends on. I suggest that we do something here (add an assertion, add a workaround like the HTML parser, something) in the near future, however. I'll be unable to work on Mozilla starting a week from now for about a month, so if someone wants to do this in the meantime.... Pierre, would you be willing to add an assertion before the 0.9.7 freeze? If so, I'll attach a patch.

Keywords: helpwanted

Target Milestone: mozilla0.9.7 → mozilla0.9.9

Pierre Saslawsky

Comment 14

•

24 years ago

Agreed: an assertion is a good idea when a stylesheet is rejected because of a bad character. We had a couple of dups and they are not easy to find for those who are not already aware of the problem.

Boris Zbarsky [:bzbarsky]

•

24 years ago

Boris can you remove my name from those files, too please? (If it can't be my real first name, Håkan (as opposed to Hakan), then please just remove it.)

Boris Zbarsky [:bzbarsky]

Assignee

Comment 22

•

24 years ago

Simon is making progress on this over in bug 128896

Depends on: 128896

Boris Zbarsky [:bzbarsky]

Assignee

Comment 23

•

24 years ago

setting target milestone... :(

Target Milestone: mozilla0.9.9 → mozilla1.0

Hixie (not reading bugmail)

Comment 24

•

24 years ago

Why can't we put a native UTF8 a-ring in the file?

Simon Montagu :smontagu

Comment 25

•

Assignee

Comment 30

•

24 years ago

I'm going out of town in two days and not getting back till after 1.0 freeze. Setting realistic milestones for all 1.0 bugs that depend on other bugs or on possible interface work. Any help on getting these fixed would be much appreciated...

Target Milestone: mozilla1.0 → mozilla1.1alpha

Simon Montagu :smontagu

Comment 31

•

24 years ago

Now that bug 128896 is fixed, there are no less than 3 ways we could fix this: 1) Change Hakan to HÃ¥kan (attachment 73579 [details] [diff] [review]) 2) Change Hakan to Håkan, and add @charset ISO-8859-1 3) Change Hakan to Håkan without adding @charset If it was up to me I would choose option #2.

Boris Zbarsky [:bzbarsky]

Assignee

•

23 years ago

Comment on attachment 89992 [details] [diff] [review] Revert the patch from bug 106674 r=dbaron

Attachment #89992 - Flags: review+

(not reading, please use seth@sspitzer.org instead)

Reporter

Comment 38

•

23 years ago

Comment on attachment 89992 [details] [diff] [review] Revert the patch from bug 106674 assuming the original bug with these special characters in css files is fixed, sr=sspitzer

Attachment #89992 - Flags: superreview+

Boris Zbarsky [:bzbarsky]

Assignee

Comment 39

•

23 years ago

Yeah, I tested that. ;) Fixed.

Status: ASSIGNED → RESOLVED

Closed: 23 years ago

Resolution: --- → FIXED