Closed Bug 8343 Opened 25 years ago Closed 25 years ago

UTF-7 optionally encoded characters need to be accommodated

Tracking

(Not tracked)

Status:

VERIFIED FIXED

Milestone:

M10

People

(Reporter: momoi, Assigned: nhottanscp)

References

(
URL
)

Details

Attachments

(1 file)

zipped mailbox file contains UTF-7 encoded Latin1 character 25 years ago nhottanscp 13.70 KB, application/octet-stream		Details

Katsuhiko Momoi

Reporter

Description

•

25 years ago

** Observed with 6/16/99 Win32 build **

Here's a part of a UTF-7 mail sent from OutlookExpress:

+ADwAIQ-DOCTYPE HTML PUBLIC +ACI--//W3C//DTD W3 HTML//EN+ACIAPg-

This is how it is displayed in Messenger 5.0 above.

Communicator 4.6 does not display this at all since it is part of
the HTML structure, abd that is correct.

Here, UTF-7 Set O (optional direct characters) are encoded by Outlook Express,
but we apprently expect all Set 0 characters to be directly represented.
We should be able to deal with optionally encoded characters also.

Katsuhiko Momoi

Reporter

Updated

•

25 years ago

URL: http://rocknroll/users/momoi/publish/...

Katsuhiko Momoi

Reporter

Comment 1

•

25 years ago

Here's additional info. I took the above UTF-7 string and put that into
an unlabeld .txt file and placed it at the above URL:

1. http://rocknroll/users/momoi/publish/seamonkey/tests/optcharutf7.txt

Also I replaced the optionally encoded characters with direct
representations and placed the string in a .txt file at:

2. http://rocknroll/users/momoi/publish/seamonkey/tests/optcharutf7b.txt

Under Communicator 4.6, I can see both of them correctly under UTF-7 as:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD W3 HTML//EN">

With 5.0 Browser, under Latin 1 encoding, I see

+ADwAIQ-DOCTYPE HTML PUBLIC +ACI--//W3C//DTD W3 HTML//EN+ACIAPg-

for URL 1 but NOTHING for URL 2.

Under UTF-7. I see NOTHING for either URL 1 or URL 2.

These are text files and we should be able to show the string
under either Latin 1 (incorrectly) or UTF-7 (correctly).

cata

Updated

•

25 years ago

Status: NEW → ASSIGNED

Target Milestone: M8

cata

Comment 2

•

25 years ago

I just checked. Actually the UTF-7 decoder is accepting anything base64-encoded,
including Set O. So, the converter seems to be ok.

About the two test URLs: they are txt files with HTML inside. But that HTML is
only a header, so of course we don't show anything in the page. If there is a
problem, that is that the txt is parsed as HTML. But the very fact that for the
URL 1 with encoding set to UTF7, we don't show anything, proves that the
converter is working right: the text got converted into that HTML header!!!!!!
Adding a single valid HTML tag in there puts something visible in the page,
proving once again that the converter is ok.

Now, about the original issue. Tested in browser (that piece of encoded text)
works. So I guess it is some other problem. Maybe the encoding is not setted
right in the mail? I do not know. But we should reassign this bug to the right
owner.

Katsuhiko Momoi

Reporter

Updated

•

25 years ago

Assignee: cata → nhotta

Status: ASSIGNED → NEW

Katsuhiko Momoi

Reporter

Comment 3

•

25 years ago

OK. Let's send this over to nhotta then.
What should we do about the .txt file interpreted like .html file in 5.0?
A separate bug?

nhottanscp

Assignee

Comment 4

•

25 years ago

Attached file zipped mailbox file contains UTF-7 encoded Latin1 character — Details

nhottanscp

Assignee

Updated

•

25 years ago

Status: NEW → ASSIGNED

Target Milestone: M8 → M10

nhottanscp

Assignee

Comment 5

•

25 years ago

I was able to see UTF-7 encoded Latin1 character mailed by OE5 (attachment
created).
We need more data for this (original bug was filed with OE4, we also need 4.x
data).
Viewing UTF-7 is not a not a major requirement for M8. Moving to M10.

nhottanscp

Assignee

Updated

•

25 years ago

Status: ASSIGNED → RESOLVED

Closed: 25 years ago

Resolution: --- → FIXED

nhottanscp

Assignee

Comment 6

•

25 years ago

Marking as FIXED. I saw 6983 was fixed and verified. With the converter's fix,
this should be resolved now.

Katsuhiko Momoi

Reporter

Updated

•

25 years ago

Status: RESOLVED → VERIFIED

Katsuhiko Momoi

Reporter

Comment 7

•

25 years ago

** Checked with 7/12/99 Win32 build **

The original problem, i.e. some HTML structures were
displayed rather than suppressed, is now gone.
So in this sense, the bug has been fixed though I don't
know what check-in has solved the problem.

The 2nd problem is that of mistaking .txt files as if it were
.html files. This is not directly relevant to the UTF-7 coverter
and will be filed as a separate bug.

Marking it verified/fixed.

Myk Melez [:myk] [@mykmelez]

Updated

•

20 years ago

Product: MailNews → Core

Nobody; OK to take it and work on it

Updated

•

16 years ago

Product: Core → MailNews Core

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

UTF-7 optionally encoded characters need to be accommodated

Categories

(MailNews Core :: Internationalization, defect, P3)

Tracking

(Not tracked)

People

(Reporter: momoi, Assigned: nhottanscp)

References

(
URL
)

Details

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Updated

Comment 1

Updated

Comment 2

Updated

Comment 3

Comment 4

Updated

Comment 5

Updated

Comment 6

Updated

Comment 7

Updated

Updated

Attachment

General

Description

File Name

Content Type