Closed Bug 323383 Opened 19 years ago Closed 19 years ago

UTF8 utf-8 characters rendered incorectly (as: <?> <?> <?>) in most windows from in example ISO-8859-2 emails

Categories

(Thunderbird :: General, defect)

x86
Linux
defect
Not set
major

Tracking

(Not tracked)

RESOLVED INVALID

People

(Reporter: mozilla-bug-ticket, Assigned: mscott)

Details

Attachments

(1 file)

User-Agent:       Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20050922 Firefox/1.0.7 (Debian package 1.0.7-1)
Build Identifier: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20050922 Firefox/1.0.7 (Debian package 1.0.7-1)

My settings are all to UTF-8.
UTF8 is default encoding of the mail box, all fonts in configruation (to all regions: Central Europe, and so on) are set to UTF8 encoding (and font is typical, like Serif)

When i hav email encoded in in example iso-8859-2 then the special characters (and all text after first special character) in all headers are incorectly decoded. 

I will attach a screenshot ilustrating the problem.

Example email:


From - Wed Jan 11 10:25:28 2006
X-Account-Key: account9
X-UIDL: 0134563102bbda1d
X-Mozilla-Status: 0001
X-Mozilla-Status2: 00000000
Received: from yan.interia.pl ([217.74.66.8]:44261 "EHLO taj.interia.pl")
	by kps8.test.onet.pl with ESMTP id <S1180324AbWAKJBs>;
	Wed, 11 Jan 2006 10:01:48 +0100
Received: from pup.interia.pl (pup.interia.pl [217.74.66.37])
	by mx-out.strefa.interia.pl (Postfix) with SMTP id 82EB833F17F;
	Wed, 11 Jan 2006 10:01:47 +0100 (CET)
To:	foo@bar.invalid
Subject: Zamówienie zegarków na kwotê xxx
From:	potwierdzenie <potwierdzenie@foo.bar.invalid>
User-Agent: script
X-Accept-Language: en-us, en, de, pl, it
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-2
Content-Transfer-Encoding: 7bit
[...]

List wys³any z sklepu Zegarki Chrono Online
List wys³any na ¿yczenie:
[...]

Reproducible: Always

Steps to Reproduce:
1. get email with UTF8 characters
2. follow my setup/configuration
3. look

Actual Results:  
Text in mailbox view (columns subjects, From) as well as in message view are renderd/decoded incorrectly: all characters from the first non ASCII character (including it) are replaced by a character that looks like a small question mark on a turned around rectangle, sort of like: <?>

Also attachments file names are broken.
Also attachments while Save As are broken, so it is impossible to save with a correct file extension, one have like: 
  Rafa??????????
instead of:
  Rafa³ Test.doc
and so on.


Expected Results:  
Correctly render the characters
Characters from iso-8859-2 email are encoded wrongly almost  everywhere
I was a bit incorrect, the bug is when one recives NON UTF8 email (and the settings are to default to try to use UTF8 encoding)
The screenshot doesn't show whether you have "Apply the default character encoding to all incoming messages" checked. If you do, it would explain this bug. Leaving it unchecked will apply UTF-8 to messages with no encoding specified, but allow messages that specify another encoding, like the one shown in the screenshot, to display correctly.
> Content-Type: text/plain; charset=ISO-8859-2
> Content-Transfer-Encoding: 7bit

In addition to what Simon mentioned, there's a small possibility that 'C-T-E: 7bit' tricked TB to misinterpret the message. However, it's not likely. 

Anyway, it's a BAD idea to set the default encoding for *incoming* emails to UTF-8 because it's very rare that UTF-8 encoded messages come without the correct label while it's still common for non-UTF-8 messages to be unlabelled (or mislabelled.)

 
 
For now, I think that perhaps just my settings where a bit messed up, and after all it do work correctly.

Sorry for trouble, Im closing this bug report then
Status: UNCONFIRMED → RESOLVED
Closed: 19 years ago
Resolution: --- → INVALID
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: