Closed Bug 520406 Opened 16 years ago Closed 16 years ago

german umlauts are not decoded correctly

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: bugzilla, Unassigned)

References

(
URL
)

Details

Heiko Adams

Reporter

Description

•

16 years ago

User-Agent: Mozilla/5.0 (X11; U; Linux i686; de; rv:1.9.1.3) Gecko/20090909 Fedora/3.5.3-1.fc11 Firefox/3.5.3 Build Identifier: Snow doesn't decode the german umlauts in the feed provided by the attached url correctly. Other newsreader like brief decode them correctly. Reproducible: Always Steps to Reproduce: 1. Open http://handelsblatt6.blogg.de/rss.xml 2. Subscribe to the feed using snowl 3. Open snowl news list and select the feed 4. Browse through the articles Actual Results: German umlauts and some other special characters are not decoded correctly Expected Results: German umlauts and some other special characters should be displayed correctly

Heiko Adams

Reporter

Updated

•

16 years ago

Version: unspecified → 0.3

alta88

Comment 1

•

16 years ago

this particular feed is encoded in ISO-8859-1 but the server sends as US-ASCII. feeds with UTF8 encoding and <description> tags with CDATA show correctly, as does spiegel.de for example, which uses <content:encoding> tags for its content summary. do other german feeds also fail or just this one? more info can be found here: http://www.rssboard.org/rss-specification http://validator.w3.org/feed/

Heiko Adams

Reporter

Comment 2

•

16 years ago

This feed has the same problem: http://www.pro-linux.de/backend/pro-linux.rdf

Heiko Adams

Reporter

Updated

•

16 years ago

OS: Linux → All

alta88

Comment 3

•

16 years ago

fixed by http://hg.mozdev.org/snowl/rev/86a914981a5b. but i'm not sure what was intended by overriding mimetype, myk is this a proper fix?

Status: UNCONFIRMED → RESOLVED

Closed: 16 years ago

Resolution: --- → FIXED

Myk Melez [:myk] [@mykmelez]

Comment 4

•

16 years ago

The purpose was to prevent XMLHttpRequest from trying to parse the response, since the feed processor parses it itself, which makes the XMLHttpRequest parsing redundant and unnecessary. I'm ok with this change if it fixes the bug, although note that it might have a performance impact.

alta88

Comment 5

•

16 years ago

so it seems the request needs to be for non text/plain to return a non-encoded result, which triggers a XMLHttpRequest parse. if the request were of some rss mimetype, would this bypass XMLHttpRequest? wouldn't it be a core inefficiency, since Fx would be doing two parses as well, so perhaps there should be certain mimetypes exempted?

Myk Melez [:myk] [@mykmelez]

Comment 6

•

16 years ago

Hmm, I'm not sure what you're asking; can you explain further?

alta88

Comment 7

•

16 years ago

if a mimetype request of text/plain bypasses XMLHttpRequest parsing, then a feed specific or custom mimetype (since feedprocessor is going to parse anyway) should also bypass XMLHttpRequest. this would allow the data to retain non ascii values rather than have them returned hex-encoded as with text/plain. in other words, if asking for text/plain results in encoding, and asking for xhtml results in non encoding but parsing, then asking for rss/xhtml (eg) should result in a non encoded and non parsed result doc.

Myk Melez [:myk] [@mykmelez]

Comment 8

•

16 years ago

Ah, I see. Hmm, it's worth a shot, although perhaps the MIME type should be something like text/dont-encode-me.

alta88

Comment 9

•

16 years ago

sure, that would be fine. seems like there should be such a mimetype already; i don't know fx mime handling/code that deeply. anything for performance!

You need to log in before you can comment on or make changes to this bug.

Bugzilla

german umlauts are not decoded correctly

Categories

(Mozilla Labs :: Snowl, defect)

Tracking

(Not tracked)

People

(Reporter: bugzilla, Unassigned)

References

(
URL
)

Details

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Comment 2

Updated

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9