Closed Bug 177270 Opened 23 years ago Closed 23 years ago

Incorrect character encoding used for bookmarked pages

Categories

(Core :: Internationalization, defect)

x86
All
defect
Not set
major

Tracking

()

RESOLVED DUPLICATE of bug 174197

People

(Reporter: muhdv98, Assigned: smontagu)

Details

(Keywords: intl)

Attachments

(1 file)

A bug that has been keeping to reappear in a few versions of Mozilla now is that swedish characters ('å','ä','ö') are sometimes substituted by '?' signs in web-pages. If a swedish char appears in a word of a mail subject, the whole word is substituted by a '?' sign. This is the same for Linux and Windows versions of Mozilla. I hope that this bug hasn't been reported yet, eventhough I think it is strange if it hasn't. /Mattias
Please include an URL where the bug can be seen Also: Always include the mozilla build ID in bug-reports.
Since I'm currently not able to find a web-page that shows this bug, it is perhaps better to just concentrate on the mail subject issue. I am currently using Mozilla Build ID 2002101612.
едц (and other non-ASCII characters) might be displayed erroneously on web pages if 1) the document or server produces no character encoding but your default is set to something like UTF-8, or 2) if the document or server produces no character encoding and you have universal auto-detection on, or 3) if your profile/cache is corrupt, which will make Mozilla use UTF-8 when it shouldn't. I have yet to see something to that effect myself, though. Mail headers are usually corrupted by other software, such as servers and weird mail clients. What's your default encoding? Do you have universal auto-detect on?
A curiosity: when opening this very page with universal auto-detect on, Mozilla thinks this is Windows-1251, displaying едц as Cyrillic glyphs.
Keywords: intl
Ok, this is perhaps not a bug after all, since I can't reproduce the web issue anymore, and the mail subject issue was fixed by checking the "Apply default character coding to all messages in the folder" checkbox for my Inbox. Previously I had it unchecked and Western (ISO-8859-1) as Default Character Coding. I assumed it was a bug since I have never changed any language or character coding settings, and the mentioned issues did not exist in previous versions of mozilla (somewhere before v 1.0). (Or perhaps something has changed on the mailserver I use since then, if you think that might be the case.) Hope I haven't caused any unnecessary work and/or confusion.
per reporter's comments, WORKSFORME.
Status: UNCONFIRMED → RESOLVED
Closed: 23 years ago
Resolution: --- → WORKSFORME
Mark as verified.
Status: RESOLVED → VERIFIED
This still happends to me, with latest build on Windows XP. The weird thing is that it only happends sometimes. Will try to figure out why and hopefully attach a testcase.
Attached file Testcase
When I see this file locally it shows swedish וצה as ???, but if I make a copy of this file and rename it it shows the swedish וצה. Hopefully the ??? will show now when uploaded...
Now when viewed in bugzilla I see the цед, this is really driving me crazy.
When you see the problem on the local disk, what do you see if you enter javascript:void(alert(document.characterSet)) in the URL bar?
It alerts UTF-8. The (working) copy alerts ISO-8859-I, hm, same html, different files names.
What filename works and what filename doesn't work?
The non-working is named menu.html, if I make a copy and rename to whatever name the וצה will work. Should I mail you the file?
Does "whatever name" have a .html extension? There's no need to mail the file if it's attached to the bug. I'm just trying to get a full decription of the problem recorded in the bug so that it can be analyzed.
Yes, all the new copies also have .html extension.
Now I know how to reproduce this bug every time. 1. Load testcase. 2. Add the URL to your Personal Toolbar by dragging it. 3. Now activate the bookmark (click on it), you will notice that the page will reload but that the characters are replaced by "?". If you do this locally, after the characters are transformed to "?", it doenst matter how you open the file. The file will never show correct characters again.
Status: VERIFIED → UNCONFIRMED
Resolution: WORKSFORME → ---
Confirming (on my Linux opt build from yesterday), changing component to Internationalization, and raising severity to major. Steps to reproduce: 1. Load http://bugzilla.mozilla.org/attachment.cgi?id=105326&action=view 2. Drag the icon in the URL bar to the personal toolbar. 3. Hit the back button. 4. Click on the icon in the personal toolbar just created. Expected results: B O-umlaut R S E A-ring O-umlaut A-umlaut Actual results: B ?-diamond R S E ?-diamond ?-diamond ?-diamond
Assignee: font → smontagu
Severity: normal → major
Status: UNCONFIRMED → NEW
Component: Layout: Fonts and Text → Internationalization
Ever confirmed: true
Summary: Swedish characters substituted by '?' → Incorrect character encoding used for bookmarked pages
*** This bug has been marked as a duplicate of 174197 ***
Status: NEW → RESOLVED
Closed: 23 years ago23 years ago
Resolution: --- → DUPLICATE
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: