Closed Bug 270145 Opened 21 years ago Closed 21 years ago

Cannot copy/paste a portion of a XHTML page that is in a CDATA section

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla1.8alpha6

People

(Reporter: david.sporn, Assigned: bzbarsky)

References

(
URL
)

Details

(Keywords: dataloss)

Attachments

(2 files)

testcase 21 years ago Phil Ringnalda (:philor) 354 bytes, application/xhtml+xml		Details
Refix the "non-HTML documents are all plaintext and text controls" hacks 21 years ago Boris Zbarsky [:bzbarsky] 2.32 KB, patch	jst : review+ jst : superreview+	Details \| Diff \| Splinter Review

David Sporn

Reporter

Description

•

21 years ago

User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; fr-FR; rv:1.7.5) Gecko/20041108 Firefox/1.0 Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.0; fr-FR; rv:1.7.5) Gecko/20041108 Firefox/1.0 Any part of a page that is inside a CDATA section cannot be copy/pasted. Reproducible: Always Steps to Reproduce: 1.Go to a page containing CDATA (e.g. http://www.sporniket.com/page/blog/david/200411161202) 2.Select all the text 3.Copy the selection and paste it into a text editor. Actual Results: The text that is inside CDATA does not appear in the text editor. If you select text that is inside CDATA only, the paste operation add nothing in the text editor. Expected Results: The selected text should have been copied into the clipboard so that it could be pasted. Workaround : view the source, and copy/paste the CDATA content

Phil Ringnalda (:philor)

Comment 1

•

21 years ago

Attached file testcase — Details

:Gavin Sharp [email: gavin@gavinsharp.com]

Updated

•

21 years ago

Component: OS Integration → DOM to Text Conversion

Product: Firefox → Browser

Version: unspecified → Trunk

Boris Zbarsky [:bzbarsky]

Assignee

Comment 2

•

21 years ago

Hmm.. I thought this had been fixed in bug 234427 (and the testcase for that bug no longer seems to be available, so I can't retest). I traced through this code, though, and it looks like nsAutoCopyService::NotifySelectionChanged calls nsCopySupport::HTMLCopy no matter what the content is. This serializes out the DOM as HTML (with the <![CDATA[]]> in the string). Then when we want to get text out, nsHTMLFormatConverter::ConvertFromHTMLToUnicode creates a parser with nsPlainTextSerializer as the content sink and reparses the string... but the HTML parser doesn't actually pass CDATA sections to the content sink as CDATA but as comments (we have existing bugs on this, I'm pretty sure). So the text inside the CDATA gets lost. I'm not sure what the right thing to do is here, since I'm not sure whether we event support an XML clipboard flavor... Mike, do you happen to know anothing about that? I know you dealt with clipboard once upon a time.

Assignee: bugs → dom-to-text

Severity: minor → major

Status: UNCONFIRMED → NEW

Ever confirmed: true

Keywords: dataloss

OS: Windows 2000 → All

QA Contact: firefox.os-integration

Hardware: PC → All

Sebastian Redl

Comment 3

•

21 years ago

The testcase's file ending was changed to xhtml somewhere in the past. http://stud3.tuwien.ac.at/~e0226430/tutorial/functions.xhtml It amazes me how multi-layered this bug is - I already fixed two places, and still it doesn't work.

Boris Zbarsky [:bzbarsky]

Assignee

Comment 4

•

21 years ago

So here's another interesting thing. nsCopySupport::IsPlainTextContext returns true if the document is not an nsIHTMLDocument. Which is true for XML, but definitely not true for XHTML. I'm not sure how that helps, since nsHTMLCopyEncoder::Init ignores aMIMEType, and the HTMLCopy() code creates an HTMLCopyEncoder unconditionally... but if I view the same data as text/xml, I can copy the CDATA content.

Boris Zbarsky [:bzbarsky]

Assignee

Comment 5

•

21 years ago

And finally, the problem is that the code involved just uses the nsHTMLFormatConverter if bIsHTMLCopy is set. Sounds like the real problem is that sanely copying XML is simply unsupported. The right fix ought to be to disable HTML copy for XHTML documents....

Boris Zbarsky [:bzbarsky]

Assignee

Comment 6

•

21 years ago

Attached patch Refix the "non-HTML documents are all plaintext and text controls" hacks — Details — Splinter Review

Boris Zbarsky [:bzbarsky]

Assignee

Comment 7

•

21 years ago

Comment on attachment 167578 [details] [diff] [review] Refix the "non-HTML documents are all plaintext and text controls" hacks jst, would you review?

Attachment #167578 - Flags: superreview?(jst)

Attachment #167578 - Flags: review?(jst)

Boris Zbarsky [:bzbarsky]

Assignee

Comment 8

•

21 years ago

I've filed bug 272649 on the general issue of implementing copying of XML in a sane way.

Johnny Stenback (:jst)

Comment 9

•

21 years ago

Comment on attachment 167578 [details] [diff] [review] Refix the "non-HTML documents are all plaintext and text controls" hacks r+sr=jst, but I'd split those if-then's up on individual lines :)

Attachment #167578 - Flags: superreview?(jst)

Attachment #167578 - Flags: superreview+

Attachment #167578 - Flags: review?(jst)

Attachment #167578 - Flags: review+

Sebastian Redl

Comment 10

•

21 years ago

So that means my patch worked for text/xml and application/xml, but not application/xhtml+xml?

Boris Zbarsky [:bzbarsky]

Assignee

Comment 11

•

21 years ago

Sebastian, that's correct.

Assignee: dom-to-text → bzbarsky

Boris Zbarsky [:bzbarsky]

Assignee

Comment 12

•

21 years ago

Checked in on the trunk, with jst's requested change.

Status: NEW → RESOLVED

Closed: 21 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla1.8alpha6

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Cannot copy/paste a portion of a XHTML page that is in a CDATA section

Categories

(Core :: DOM: Serializers, defect)

Tracking

()

People

(Reporter: david.sporn, Assigned: bzbarsky)

References

(
URL
)

Details

(Keywords: dataloss)

Crash Data

Security

(public)

User Story

Attachments

(2 files)

Description

Comment 1

Updated

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Comment 11

Comment 12

Attachment

General

Description

File Name

Content Type