Closed Bug 32336 Opened 25 years ago Closed 25 years ago

Double formatting

Tracking

()

Status:

VERIFIED FIXED

Milestone:

M17

People

(Reporter: BenB, Assigned: BenB)

Details

Attachments

(2 files)

\|cd htmlparser/src;cvs diff -u\| 25 years ago Ben Bucksch (:BenB) 7.43 KB, patch		Details \| Diff \| Splinter Review
Fix, version 5 25 years ago Ben Bucksch (:BenB) 5.67 KB, patch		Details \| Diff \| Splinter Review

Ben Bucksch (:BenB)

Assignee

Description

•

25 years ago

<quote src="my mail to akk"> As you know, ScanHTML looks for URLs and structured phrases and *adds* HTML tags, if it thinks to have found one, leaving the original in. I worried about it, because it is basically a double formatting, but in all cases, I could think of, it was no problem in practice. In fact, it revealed several scary data flows, like quoting of plain text in plain text based on the displayed HTML, i.e. TXT->HTML->TXT. But there's one case, for which I don't have a solution: 1. The user types (or quotes) "*bla*" 2. It is sent out as HTML (only), so we run ScanHTML over the msg before sending. The result is "*bla*". 3. The recipient also uses Mozilla. 4a. The recipient decides to respond via plain text. At send, the msg, including the (HTML) quote, runs through nsHTMLToTXTSinkStream, which values "" as important information. The result is "**bla**". or 4b. The recipient decides to respond via HTML. At send, the msg, including the (HTML) quote, runs through ScanHTML, which values "*" as important information. The result is "*bla*". Similar for URLs. Now, imagine 2 Mozilla users discuss via email/news... I don't know, how to solve this without special-casing (i.e. making nsHTMLToTXTSinkStream and ScanHTML recognize our inserted HTML tags, which might not even possible for a-tags). Do you? </quote> This should be fixed in HTML->TXT converte, since the problem also appears, if mozTXTToHTMLConv is not called. E.g. we have the same problem, if a 4.x user writes an URL and sends it as HTML only, and I quote it in plain text. Proposed fix: 1. Add a class=txt_url to a elements generated by ScanTXT. 2. Do nothing in nsHTMLToTXTSinkStream for , , <code> and <a>, if class=txt_*. 3. Test in nsHTMLToTXTSinkStream::AddLeaf, if mURL.IsEmpty. If not, check, if mURL is equal the content. If yes, mURL.Truncate(). 1. and 2. fixed the double quoting we cause, and 3. fixes the double quoting cuased by other mailers like 4.x. But 3. has the problem, that it will do a if "(mURL.IsEmpty())" for *every* leaf node. Fortunately, nsString caches the length in a PRUint32, but still... Akk, do you think, that is a problem?

Ben Bucksch (:BenB)

Assignee

Updated

•

25 years ago

Status: NEW → ASSIGNED

Target Milestone: --- → M16

Ben Bucksch (:BenB)

Assignee

Updated

•

25 years ago

Target Milestone: M16 → M18

Ben Bucksch (:BenB)

Assignee

Comment 1

•

25 years ago

> it will do a if "(mURL.IsEmpty())" for *every* leaf node. Intelligent Send does the same, so I guess, that's OK. > 1. Add a class=txt_url to a elements generated by ScanTXT. Done in bug 32420 with "class=txt-link" > if class=txt_* class=txt-* > Proposed fix: 4. In mozTXTToHTMLConv::ScanHTML, skip content of tags with class=txt-*.

Ben Bucksch (:BenB)

Assignee

Updated

•

25 years ago

Whiteboard: Fixed except 4.

Target Milestone: M18 → M17

Ben Bucksch (:BenB)

Assignee

Comment 2

•

25 years ago

Well, it turns out, that I'm lucky and I don't need to special case in ScanHTML. - With my latest changes, I insert e.g. "*bla*". There's no leaf with *bla* anymore, so I don't "enhance" it. - I skip <a> elements incl. content anyway. - Glyph conversion removes the orignal content. So, there's "by accident" no case, where we enhance twice. I also added support for sub: "H2O" -> "H_2 O" (That's the convention according to Richard Zach). Akk, can you review and checkin, please? Patch follows.

Whiteboard: Fixed except 4. → Fixed. Waiting for review, approval and checkin.

Ben Bucksch (:BenB)

Assignee

Comment 3

•

25 years ago

Attached patch |cd htmlparser/src;cvs diff -u| — Details — Splinter Review

Ben Bucksch (:BenB)

Assignee

Updated

•

25 years ago

Keywords: patch

Ben Bucksch (:BenB)

Assignee

Comment 4

•

25 years ago

Attached patch Fix, version 5 — Details — Splinter Review

Ben Bucksch (:BenB)

Assignee

Comment 5

•

25 years ago

Fix fianlly checked in. My first checkin to Mozilla. WHOOOHOOO!

Status: ASSIGNED → RESOLVED

Closed: 25 years ago

Resolution: --- → FIXED

Whiteboard: Fixed. Waiting for review, approval and checkin.

sujay

Comment 6

•

25 years ago

verified in 7/25 build.

Status: RESOLVED → VERIFIED

You need to log in before you can comment on or make changes to this bug.

\|cd htmlparser/src;cvs diff -u\| 25 years ago Ben Bucksch (:BenB) 7.43 KB, patch		Details \| Diff \| Splinter Review
Fix, version 5 25 years ago Ben Bucksch (:BenB) 5.67 KB, patch		Details \| Diff \| Splinter Review

Bugzilla

Double formatting

Categories

(Core :: DOM: Serializers, defect, P3)

Tracking

()

People

(Reporter: BenB, Assigned: BenB)

References

Details

Crash Data

Security

(public)

User Story

Attachments

(2 files)

Description

Updated

Updated

Comment 1

Updated

Comment 2

Comment 3

Updated

Comment 4

Comment 5

Comment 6

Attachment

General

Description

File Name

Content Type