Closed Bug 121040 Opened 23 years ago Closed 14 years ago

Attribute values not normalized in XML

Status:

RESOLVED INVALID

Milestone:

Future

People

(Reporter: brant, Unassigned)

References

Details

(Keywords: testcase, xhtml)

Attachments

(2 files)

acronym title test case, 23 years ago, Brant Gurganus, 381 bytes, application/xhtml+xml
Example that shows CR/LFs as well, 20 years ago, Martijn Polder, 2.35 KB, text/html

Brant Gurganus

Reporter

Description

•

23 years ago

From Bugzilla Helper:
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:0.9.7) Gecko/20011221
BuildID:    2001122106

Mozilla is rendering multiple spaces when it should be rendering only one.  I
quote from the XHTML 1.0 specification:

4.7 Whitespace handling in attribute values

In attribute values, user agents will strip leading and trailing whitespace from
attribute values and map sequences of one or more whitespace characters
(including line breaks) to a single inter-word space (an ASCII space character
for western scripts). See Section 3.3.3 of [XML].

I could not find the section in the HTML 4.01 specs that addresses this.  It
seems to only address extra white space between characters.

Reproducible: Always
Steps to Reproduce:
1. Open example URL.
2. The first line of the third paragraph contains "WYSIWYG" as an acronym with
title attribute.

Actual Results:  I believe the title attribute is rendered incorrectly.  It is
rendered with multiple spaces when it should, to my knowledge, be rendered with
a single space in place of the multiple spaces.

Expected Results:  I believe it should be rendered with a single space in place
of the multiple spaces.

The suspect HTML code is:
<acronym
title="What You See Is What You (might)                                
Get">"WYSIWYG</acronym>

Notice, there are multiple spaces after (might).
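
For reference, the normalization that the quoted section 4.7 describes can be sketched as below. This is only an illustration of the expected result, not Mozilla code: the function name `collapse_whitespace` is made up for this example, and "whitespace" is assumed to mean space, tab, CR, and LF.

```python
import re


def collapse_whitespace(value: str) -> str:
    """Collapse whitespace runs to one space and trim both ends.

    Hypothetical helper illustrating the XHTML 1.0 section 4.7 wording;
    it assumes whitespace means space, tab, CR, and LF.
    """
    collapsed = re.sub(r"[ \t\r\n]+", " ", value)
    return collapsed.strip(" ")


# The title value from the markup above: a long run of spaces and a
# line break after "(might)".
title = "What You See Is What You (might)" + " " * 32 + "\nGet"
print(collapse_whitespace(title))  # What You See Is What You (might) Get
```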

Boris Zbarsky [:bzbarsky]

Comment 1

•

23 years ago

Over to parser.  This happens on Linux too.

Assignee: asa → harishd

Status: UNCONFIRMED → NEW

Component: Browser-General → Parser

Ever confirmed: true

OS: Windows XP → All

QA Contact: doronr → moied

Hardware: PC → All

Christopher Hoess (gone)

Comment 2

•

23 years ago

Similar to bug 47078.  There are some bugs floating out there about stripping spaces in non-CDATA attributes too (for instance), but I don't remember if there's anything on this specifically.

Brant Gurganus

Reporter

Comment 3

•

23 years ago

The people at the original URL removed the extra space.  It can no longer be
used as an example of the problem.

Brant Gurganus

Reporter

Comment 4

•

23 years ago

URL removed because it is no longer an example

URL: http://www.w3.org/Amaya/

Brant Gurganus

Reporter

Updated

•

23 years ago

Status: NEW → ASSIGNED

Brant Gurganus

Reporter

Comment 5

•

23 years ago

Attached file: acronym title test case

This is a test case for the white space in acronym title bug.  The rendered
tool tip will contain five spaces between "test" and "case" although it should
be rendered with only one space between the two.

Brant Gurganus

Reporter

Comment 6

•

23 years ago

Simplified test case has been added.

Keywords: testcase

Brant Gurganus

Reporter

Comment 7

•

23 years ago

All tested browsers get this wrong: IE6, Mozilla, W3C's Amaya, Opera.

harishd

Comment 8

•

23 years ago

Not a high priority. 

This bug has been marked "future" because the original Netscape engineer working
on this is over-burdened. If you feel this is an error, that you or another
known resource will be working on this bug, or if it blocks your work in some way
-- please attach your concern to the bug for reconsideration.

Target Milestone: --- → Future

Brant Gurganus

Reporter

Comment 9

•

22 years ago

It isn't uncommon for editors to word-wrap and pretty-print their HTML.  If this
happens to any title attribute, then this problem will occur.  I think it is
fairly likely to recur.

Christopher Hoess (gone)

Comment 10

•

22 years ago

Repurposing this bug for the XML case; the HTML case is a duplicate of bug 47078.
Maybe heikki's bug?

Summary: title attribute of acronym element rendered incorrectly → Attribute values not normalized in XML

Christopher Hoess (gone)

Updated

•

22 years ago

Attachment #69987 - Attachment mime type: text/html → application/xhtml+xml

Brant Gurganus

Reporter

Updated

•

22 years ago

Keywords: xhtml

Martijn Polder

Comment 11

•

20 years ago

Attached file: Example that shows CR/LFs as well

The relative seriousness of the bug shows up better if CR/LFs are inserted into
the title attributes. Instead of the CR/LFs, strange-looking characters are
displayed. An extra incentive should be that IE6 handles the CR/LFs correctly.

Jo Hermans

Comment 12

•

19 years ago

*** Bug 299365 has been marked as a duplicate of this bug. ***

Blake Kaplan (:mrbkap) (inactive)

Comment 13

•

19 years ago

Moving to XML based on comment 10.

Assignee: harishd → xml

Status: ASSIGNED → NEW

Component: HTML: Parser → XML

QA Contact: moied → ashshbhatt

Phil Ringnalda (:philor)

Updated

•

15 years ago

Assignee: xml → nobody

QA Contact: ashshbhatt → xml

:Ms2ger (he/him; ⌚ UTC+1/+2)

Comment 14

•

14 years ago

Marking invalid. XHTML1 misrepresents the XML specification here. XML states: [1]

> If the attribute type ***is not CDATA,*** then the XML processor MUST further
> process the normalized attribute value by discarding any leading and trailing
> space (#x20) characters, and by replacing sequences of space (#x20) characters
> by a single space (#x20) character.

(Emphasis mine.) This makes attachment 69987 incorrect, as @title *is* CDATA. [2]
However, we don't seem to normalize non-CDATA attributes either. Since we aren't a validator and don't actually read the full DTD, the following exception applies: [1]

> All attributes for which no declaration has been read SHOULD be treated by a
> non-validating processor as if declared CDATA.

... which makes our behavior for those attributes correct as well.

[1] http://www.w3.org/TR/REC-xml/#AVNormalize
[2] http://www.w3.org/TR/xhtml1/dtds.html#dtdentry_xhtml1-strict.dtd_coreattrs
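
To make the distinction concrete, here is a rough sketch of the two-step rule from [1]. It is a simplified illustration, not Gecko or expat code: `normalize_attribute` and its `is_cdata` flag are hypothetical names, and character and entity references (which the spec treats specially) are ignored.

```python
import re


def normalize_attribute(raw: str, is_cdata: bool = True) -> str:
    """Simplified sketch of XML 1.0 attribute-value normalization.

    Assumption: `raw` contains no character or entity references.
    """
    # End-of-line handling: CR LF pairs and lone CRs become a single LF.
    value = raw.replace("\r\n", "\n").replace("\r", "\n")
    # Step 1 (all attribute types): each tab or line feed becomes one
    # space, character for character. Runs are NOT collapsed here.
    value = re.sub(r"[\t\n]", " ", value)
    # Step 2 (non-CDATA types only): trim leading and trailing spaces
    # and collapse runs of spaces into a single space.
    if not is_cdata:
        value = re.sub(r" +", " ", value.strip(" "))
    return value


title = "test     case"  # five spaces, as in the attached test case
print(repr(normalize_attribute(title)))                  # 'test     case'
print(repr(normalize_attribute(title, is_cdata=False)))  # 'test case'
```

With `is_cdata=True`, which is how a non-validating processor should treat an undeclared attribute, the five spaces survive; only a declared non-CDATA type would collapse them.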

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → INVALID
