Open Bug 1866453 Opened 1 year ago Updated 8 months ago

Thunderbird could not decode raw ISO-2022-JP encoded headers

Categories

(MailNews Core :: MIME, defect)

defect

Tracking

(Not tracked)

REOPENED

People

(Reporter: emk, Unassigned)

References

(Regression)

Details

(Keywords: regression)

Attachments

(2 files)

Some older Japanese emails have subject/from fields encoded in ISO-2022-JP without using MIME "B"-encoding for historical reasons. The latest Thunderbird could not decode such subject/from fields. I think this is a regression by JSMime. (Yes, this is not a recent regression.)

Subjects need to be RFC-2047 encoded or raw UTF-8. Any other encoding is invalid, see RFC 6532: https://datatracker.ietf.org/doc/html/rfc6532. That subject fields are no longer repaired is bug 1739609.

So WONTFIX? My Mailbox has pre-RFC 6532 mails that Thunderbird can no longer show the subject correctly.

I'm just commenting on new bugs, not taking any decisions. Did any TB version ever display raw ISO-2022-JP in the subject? JS Mime landed in TB 31 (from memory, in 2014 or so). I believe the real bug is that you can no longer repair such subjects.

MS products could also handle raw ISO-8859-1/windows-1252 in the subject which TB never did.

Status: NEW → RESOLVED
Closed: 1 year ago
Duplicate of bug: 1829655
Resolution: --- → DUPLICATE

I'm reopening this bug because I found a STR and bisected. I had to import an mbox that was imported from Outlook Express and view the message list pane.
Last good: https://hg.mozilla.org/comm-central/rev/546a24b91d05
First bad: https://hg.mozilla.org/comm-central/rev/2608fae8b13b
Changeset: https://hg.mozilla.org/comm-central/pushloghtml?fromchange=546a24b91d05&tochange=2608fae8b13b
I guess bug 858337 or bug 790855 is the culprit.

Keywords: regression
Regressed by: 858337, 790855
Status: RESOLVED → REOPENED
No longer duplicate of bug: 1829655
Resolution: DUPLICATE → ---
Attached image Bad example
Attached image Good example

As per my comment #1, it's likely a WONTFIX. Subjects need to be RFC-2047 encoded or raw UTF-8. There are a few bugs like bug 1829655 where people complain about raw ISO-8859-1/windows-1252 not being handled "correctly".

As I said in comment #2, some old users (like me) may have pre-RFC 6532 mailbox.

convert8BitHeader already supports fallback to non-UTF-8 charset, but it does not consider the case for the 7-bit non-ASCII charset (i.e. ISO-2022-JP). So I don't think this is the same pattern as the ISO-8859-1/windows-1252 case.

You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: