Closed Bug 65702 Opened 25 years ago Closed 24 years ago

Can't decode subject if it contains ASCII and Japanese

Tracking

(Not tracked)

Status:

VERIFIED WORKSFORME

People

(Reporter: kazhik, Assigned: jgmyers)

References

Details

(Keywords: intl)

Attachments

(1 file)

Testcase (mail file), 25 years ago, Koike Kazuhiko, 732 bytes, text/plain

Koike Kazuhiko

Reporter

Description

•

25 years ago

If the subject of a message contains both an ASCII encoded-word and a Japanese encoded-word, Mozilla cannot decode it. For example, take this subject:

あいうえお(abcdefghijklmnopqrstuvwxyz)

Mew (a mailer for Emacs) encodes this subject as below:

Subject: =?iso-2022-jp?B?GyRCJCIkJCQmJCgkKhsoQihhYmNkZQ==?= =?us-ascii?Q?fghijklmnopqrstuvwxyz)?=

Mozilla can't decode it, but NC4.7 can.
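For reference, the header quoted in this report can be decoded with Python's standard `email.header.decode_header`, which decodes each encoded-word with its own charset. This is only a minimal sketch for checking the expected result, not Mozilla's decoder; the helper name `decode_subject` is made up for illustration.

```python
from email.header import decode_header

# Header value exactly as quoted in the report (two adjacent encoded-words).
RAW_SUBJECT = (
    "=?iso-2022-jp?B?GyRCJCIkJCQmJCgkKhsoQihhYmNkZQ==?= "
    "=?us-ascii?Q?fghijklmnopqrstuvwxyz)?="
)


def decode_subject(raw: str) -> str:
    """Decode every encoded-word with its own charset and join the results."""
    pieces = []
    for data, charset in decode_header(raw):
        if isinstance(data, bytes):
            # Unlabeled bytes are treated as ASCII (an assumption of this sketch).
            pieces.append(data.decode(charset or "ascii", errors="replace"))
        else:
            # decode_header returns the input as str when it finds no encoded-words.
            pieces.append(data)
    return "".join(pieces)


print(decode_subject(RAW_SUBJECT))
```

The whitespace between the two encoded-words is not part of the subject, so the ISO-2022-JP part and the US-ASCII part join without a gap.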

Koike Kazuhiko

Reporter

Comment 1

•

25 years ago

Attached file: Testcase (mail file)

Frank Tang

Comment 2

•

25 years ago

Changing platform to All and marking this as P3, mozilla0.9.1.

Keywords: intl, nsbeta1

OS: Windows 2000 → All

Priority: -- → P3

Hardware: PC → All

Target Milestone: --- → mozilla0.9.1

Katsuhiko Momoi

Updated

•

25 years ago

QA Contact: momoi → ji

Katsuhiko Momoi

Comment 3

•

25 years ago

Change QA contact to ji.

nhottanscp

Comment 4

•

25 years ago

The MIME decoder is being rewritten by jgmyers@netscape.com; reassigning to him.

Assignee: nhotta → jgmyers

Target Milestone: mozilla0.9.1 → ---

John G. Myers

Assignee

Comment 5

•

25 years ago

My rewrite will indeed handle this.

Status: NEW → ASSIGNED

John G. Myers

Assignee

Updated

•

25 years ago

Depends on: 58114

John G. Myers

Assignee

Comment 6

•

25 years ago

Fix checked in.

Status: ASSIGNED → RESOLVED

Closed: 25 years ago

Resolution: --- → FIXED

nhottanscp

Comment 7

•

25 years ago

The fix changed the MIME decoder to do a charset conversion and always return a UTF-8 string. That implementation raises a couple of issues:

* By always returning UTF-8, there is no way for the caller to correct mislabeled charset headers (e.g. ISO-8859-1 labeled Big5, US-ASCII labeled Shift_JIS).
* There are places in libmime that optimize charset conversions. By doing the charset conversion inside the MIME decoder, we cannot take advantage of them.

The first issue caused the regression in bug 65277. I am reopening this bug and propose a better implementation:

* Do the UTF-8 conversion only if the header contains multiple charsets. That can be done by pre-parsing the header to check which charsets it uses. This adds a little overhead, but avoiding the charset conversion inside the decoder improves performance.
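The proposal above (pre-parse the header, convert to UTF-8 only when more than one charset appears) could look roughly like the following. This is a hedged Python sketch of the idea, not the libmime code; `charsets_in_header` and `decode_keep_charset` are hypothetical names, and the sketch assumes ASCII-compatible charsets so that unencoded text can be concatenated with decoded bytes.

```python
import re
from email.header import decode_header

# RFC 2047 encoded-word shape: =?charset?encoding?text?=
ENCODED_WORD = re.compile(r"=\?([^?]+)\?[bBqQ]\?[^?]*\?=")

REPORT_SUBJECT = (
    "=?iso-2022-jp?B?GyRCJCIkJCQmJCgkKhsoQihhYmNkZQ==?= "
    "=?us-ascii?Q?fghijklmnopqrstuvwxyz)?="
)


def charsets_in_header(raw: str) -> set:
    """Pre-parse pass: collect charset labels without decoding any payload."""
    return {m.group(1).lower() for m in ENCODED_WORD.finditer(raw)}


def _as_bytes(data) -> bytes:
    # decode_header yields bytes when encoded-words are present; guard anyway.
    return data if isinstance(data, bytes) else data.encode("ascii", "replace")


def decode_keep_charset(raw: str):
    """Return (bytes, charset); convert to UTF-8 only for multi-charset headers."""
    labels = charsets_in_header(raw)
    if not labels:
        return raw.encode("utf-8"), "utf-8"
    parts = decode_header(raw)
    if len(labels) == 1:
        # Single charset: return the undecoded bytes plus the label, so the
        # caller can still reinterpret them if the label is wrong.
        (charset,) = labels
        return b"".join(_as_bytes(data) for data, _ in parts), charset
    # Several charsets: the only common representation is Unicode.
    text = "".join(
        _as_bytes(data).decode(label or "ascii", errors="replace")
        for data, label in parts
    )
    return text.encode("utf-8"), "utf-8"


print(decode_keep_charset("=?iso-8859-1?Q?caf=E9?="))
print(decode_keep_charset(REPORT_SUBJECT)[1])
```

The trade-off discussed in the following comments is visible here: the header is scanned twice, and in the multi-charset branch the caller no longer has a chance to override a label.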

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

nhottanscp

Updated

•

25 years ago

Blocks: 65277

John G. Myers

Assignee

Comment 8

•

25 years ago

A better approach would be to pass an override charset down to the encoded-word decoder. Converting to UTF-8 only in the multi-charset case requires a 2-pass decoder and prevents charset override in the multi-charset case. What charset conversion optimizations are you talking about? The only one I know of is the one which no-ops conversions between UTF-8 and US-ASCII. I believe this bug should remain closed fixed and override work be done on bug 65277.
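The alternative described in this comment, passing an override charset down to the encoded-word decoder, can be sketched as a single-pass decoder with one extra parameter. This is a hedged Python illustration, not the actual patch; `decode_with_override` and the `MISLABELED` example header are made up, and an unknown charset name would raise `LookupError` in this sketch.

```python
import base64
from email.header import decode_header


def decode_with_override(raw: str, override_charset=None) -> str:
    """Single-pass decode; override_charset, if given, replaces every label."""
    out = []
    for data, label in decode_header(raw):
        if isinstance(data, str):
            # No encoded-words in the header: nothing to convert.
            out.append(data)
            continue
        charset = override_charset or label or "ascii"
        out.append(data.decode(charset, errors="replace"))
    return "".join(out)


# Hypothetical mislabeled header: Big5 bytes labeled as ISO-8859-1.
_payload = base64.b64encode("中文".encode("big5")).decode("ascii")
MISLABELED = f"=?iso-8859-1?B?{_payload}?="

print(repr(decode_with_override(MISLABELED)))          # decoded per the wrong label
print(repr(decode_with_override(MISLABELED, "big5")))  # decoded with the override
```

Because the override is applied per encoded-word, it also works when a header mixes several labels, which is the case the two-pass proposal cannot cover.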

nhottanscp

Updated

•

25 years ago

Blocks: 68344

nhottanscp

Comment 9

•

25 years ago

I think my proposal has minimal impact on callers (and less chance of another regression) because it basically asks to go back to the old behavior. I am not sure anybody cares about overriding the charset in the multiple-charset case. In fact, a header with multiple charsets is rarely seen. So another option would be not to support multiple charsets at all, in which case the pre-parsing is not needed. Anyway, please try whatever you think is right to fix the problem, but please test to prevent another regression.

About the optimization: we cache the charset converters, which saves extra createInstance and getService calls.

No longer blocks: 68344

nhottanscp

Updated

•

25 years ago

Blocks: 68344

John G. Myers

Assignee

Comment 10

•

25 years ago

This code is still working. Override work is being done in bug 65277.

Status: REOPENED → RESOLVED

Closed: 25 years ago → 25 years ago

Resolution: --- → FIXED

Comment 11

•

24 years ago

The test mail that the original reporter attached doesn't show up in the folder. I'd like to reopen this bug.

John G. Myers

Assignee

Comment 12

•

24 years ago

I suggest you wait another day. You may be seeing bug 75390.

Comment 13

•

24 years ago

It doesn't show up with previous builds either, such as the 04/02 build. I'll wait until tomorrow anyway.

Comment 14

•

24 years ago

The attached testing mail doesn't show up either with today's trunk build (04/11). Reopened the bug.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

John G. Myers

Assignee

Comment 15

•

24 years ago

Once I added a "From " line to the test case, it worked for me.
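The `From ` line here is the mbox message separator (note the trailing space), not the `From:` header. Assuming the test case was copied directly into a local mail folder file in mbox format, a file with no `From ` separator contains no messages as far as an mbox reader is concerned. The sketch below shows this with Python's standard `mailbox` module; the address, the date on the separator line, and the helper name `count_messages` are made up for illustration.

```python
import mailbox
import os
import tempfile

# A message with a "From:" header but no mbox "From " separator line.
RAW_MESSAGE = (
    "From: reporter@example.invalid\n"
    "Subject: =?iso-2022-jp?B?GyRCJCIkJCQmJCgkKhsoQihhYmNkZQ==?=\n"
    " =?us-ascii?Q?fghijklmnopqrstuvwxyz)?=\n"
    "\n"
    "test body\n"
)

# Hypothetical separator line; an mbox reader only needs it to start with "From ".
MBOX_SEPARATOR = "From - Wed Apr 11 00:00:00 2001\n"


def count_messages(folder_text: str) -> int:
    """Write the text to a temporary folder file and count the messages in it."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "Inbox")
        with open(path, "w", encoding="ascii", newline="\n") as f:
            f.write(folder_text)
        box = mailbox.mbox(path, create=False)
        try:
            return len(box)
        finally:
            box.close()


print(count_messages(RAW_MESSAGE))                   # no separator line
print(count_messages(MBOX_SEPARATOR + RAW_MESSAGE))  # separator line added
```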

Status: REOPENED → RESOLVED

Closed: 25 years ago → 24 years ago

Resolution: --- → WORKSFORME

Comment 16

•

24 years ago

The original testcase does include the "From:" line. How did you edit it to get it to work?

John G. Myers

Assignee

Comment 17

•

24 years ago

"From ", not "From:".

Comment 18

•

24 years ago

Yes, it appears correctly. Marking it as verified.

Status: RESOLVED → VERIFIED

Myk Melez [:myk] [@mykmelez]

Updated

•

21 years ago

Product: MailNews → Core

Nobody; OK to take it and work on it

Updated

•

17 years ago

Product: Core → MailNews Core
