Closed Bug 64999 Opened 25 years ago Closed 24 years ago

Uppercase elements are being treated as XHTML elements in XML documents

Tracking

()

Status:

VERIFIED DUPLICATE of bug 29171

Milestone:

Future

People

(Reporter: chrispetersen, Assigned: pierre)

Details

(Keywords: css1, testcase, xhtml, Whiteboard: (py8ieh: evil tests needed))

Attachments

(4 files)

A xhtml document that contains upper case element names. 25 years ago Chris Petersen 319 bytes, text/xml		Details
A revised version of original xml file-please use 25 years ago Chris Petersen 319 bytes, text/xml		Details
A XML doc that uses HTML name space. 25 years ago Chris Petersen 462 bytes, text/xml		Details
A upper case P element with a lowercase attribute 25 years ago Chris Petersen 339 bytes, text/xml		Details

Chris Petersen

Reporter

Description

•

25 years ago

Build:2001010908 Platforms: All Expected Results: A XML parser error should be displayed because document is not well-formed. What I got: Document's content is displayed. Steps to reproduce: 1) Open xml test case 2) The test case contains elements that are in uppercase instead of lowercase. 3) The document's content is rendered. 4) According to the XHTML 1.O spec, XHTML documents must use lower case for all HTML element and attribute names. Our XML parser should trap for this.

Chris Petersen

Reporter

Comment 1

•

25 years ago

Attached file A xhtml document that contains upper case element names. — Details

Chris Petersen

Reporter

Comment 2

•

25 years ago

Attached file A revised version of original xml file-please use — Details

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 3

•

25 years ago

Please mark xhtml bugs with the xhtml keyword. Also, could you make a regular XML file that has just a couple of elements from the XHTML namespace to see if this problem appears in that case as well? I suspect that in this case we might actually be going through the HTML parser in which case we would not care about upper/lowe case... It depends at least on the file suffix (local file) and mime type (http). I am not completely sure about .xml (or text/xml mime type) document containing only XHTML tags...

Keywords: xhtml

Hixie (not reading bugmail)

Comment 4

•

25 years ago

We are not a validating parser, so we should not spit an error on this -- the file is well formed, just invalid. This is why we are not doing anything with the attribute or the <TITLE> element -- they are not valid HTML, so we treat them like generic, non-magical tags. INVALID?

Whiteboard: INVALID?

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 5

•

25 years ago

Wrong, the file is not well formed. For example "<p>Blah</P>" is missing an end tag "p" and is also missing a start tag "P" (notice the case).

Whiteboard: INVALID?

Chris Petersen

Reporter

Comment 6

•

25 years ago

Attached file A XML doc that uses HTML name space. — Details

Chris Petersen

Reporter

Comment 7

•

25 years ago

Ok, the new xml test case uses HTML namespace in it's root element. The document conatins six html elements ( 3 uppercase names, 3 lowercase names). The three lowercase names elements (P, H1, H3) are rendered correctly. The uppercase elements are rendered as inline text.

Hixie (not reading bugmail)

Comment 8

•

25 years ago

Heikki: None of the test cases have mismatched case start and end tags as far as I can tell. On which line of which attachment did you see that? ChrisP: The first testcase doesn't work because no namespace is given (since, as you obviously noticed and then corrected, the xmlns attribute is misspelt). The second testcase is showing an error that I had missed before -- somehow, one or both of the HEAD and TITLE elements are being wrongly recognised as XHTML elements and thus hidden. This is wrong, since XHTML is case-sensitive, and so upper case tags should not be recognised as XHTML elements. This third testcase is showing correct behaviour, since the "html" and "HTML" namespace prefixes are not the same, and so should not (and do not) both resolve to the same thing, resulting in the HTML:P, HTML:H1 and HTML:H3 elements being treated as generic XML elements. (BTW, the namespace you used in the third testcase is deprecated, you should use the one you used in the other two files.) Therefore, I am retitling this bug to address the issue given in the second testcase -- why is the Style System matching the elements in the HTML namespace in an XML document case insensitively?

Assignee: heikki → pierre

Component: XML → Style System

Keywords: css1

Summary: Uppercase elements and attributes are not being detected by XML parser → Uppercase elements are being treated as XHTML elements in XML documents

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 9

•

25 years ago

Am I missing something here? In the second testcase the head, title, body and p elements have start and end tags in different case. Also, I don't think this is a style system bug but rather the case that XHTML documents are treated differently based on the file suffix/mime type - in some cases they go through the HTML parser in which case it is treated as normal HTML (so case does not matter for example) and in other cases the document goes through the XML parser which has incomplete support for XHTML namespace (for example title and style tags don't work). suffix or mime type parser .htm* html .xml xml .xhtml ? text/html html text/xml xml and so on...

Hixie (not reading bugmail)

Comment 10

•

25 years ago

Heikki: Where do you see that they are different case??? They look like the same case to me: <?xml version="1.0"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "DTD/xhtml1- strict .dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <HEAD> <TITLE>Elements and attributes are uppercase</TITLE> </HEAD> <BODY> <P ALIGN="CENTER">This text uses the align attribute with a value of center</P> </BODY> </html> -- http://bugzilla.mozilla.org/showattachment.cgi?attach_id=22301 Also, yes, the MIME type is important; but in this case the MIME type is text/xml so it is definitely the XML parser that is being used. Or at least it should be.

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 11

•

25 years ago

AARGH! Ignore my comments and shoot me please. I was looking at the attachment in NS6, and View > Page Source really showed the starting tags of the aforementioned elements in lower case. When I looked at it in latest Mozilla, or saved the file in NS 6 I actually saw that the file was as you described. Ok, so now that I know the testcase is correct... I believe this is an XML bug still. Hmm... or maybe there are bugs both in XML and the style system... Specifically in XML, in nsXMLContentSink::OpenContainer() (maybe elsewhere as well, like attributes) we check for HTML namespace and if so, create an HTML element with NS_CreateHTMLElement. So we do not check for the element case, and NS_CreateHTMLElement is case insensitive because it has to be for normal HTML. So it looks like the fix in the XML side seems to be simple: just check that the element name (attributes?) does not contain uppercase characters used in the XHTML DTD (A-Z, for simplicity?). The fix in the style system side is probably needed as well, although it is not as important. Even though the fix in the content sink would fix normal paths, it would still be possible to create XHTML elements in wrong case using the DOM and XPCOM. I don't know how the style system works, but if it looks at content objects it can get a pointer to the content's document (sometimes null so this is not foolproof?), and if it is not HTML the element case should matter.

Hixie (not reading bugmail)

Updated

•

25 years ago

Whiteboard: (py8ieh: evil tests needed)

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 12

•

25 years ago

Nominating for nsbeta1 because of standards compliance, this could introduce bad habits for web developers. By the way, bug 29171 is related if not dupe (see my description there of the 2-3 bugs these bug reports describe).

Keywords: nsbeta1

Chris Petersen

Reporter

Comment 13

•

25 years ago

Attached file A upper case P element with a lowercase attribute — Details

Chris Petersen

Reporter

Comment 14

•

25 years ago

Heikki, How should empty elements (BR, HR, IMG) or elements like object ,applet or form be treated in this case ? Just not be rendered ?

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 15

•

25 years ago

Upper case empty HTML elements should be treated the same as any other empty XML element that has no formatting associated with it - I think it does not affect the formatting in any way. Byt the way, I have the fix for this on the XML side in the fix to the STYLE element.

Chris Petersen

Reporter

Comment 16

•

25 years ago

Two additional test cases with uppercase elements examples: http://mozilla.org/quality/browser/standards/xhtml/transitional_negative/ lowerc_block_element_attrib.xml http://mozilla.org/quality/browser/standards/xhtml/transitional_negative/ lowerc_empty_element_attrib.xml

Pierre Saslawsky

Assignee

Comment 17

•

25 years ago

Reassigned to heikki who's got a fix.

Assignee: pierre → heikki

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 18

•

25 years ago

No, this is about the fix to the style system. There is a bug on Nisheeth's list to fix things on the XML side (bug 29171). I suspect that once that bug is fixed this bug is hidden (so with luck we might never see this again). I am giving this back to pierre, but putting moving to future.

Assignee: heikki → pierre

Keywords: nsbeta1 → nsbeta1-

Target Milestone: --- → Future

Chris Petersen

Reporter

Comment 19

•

24 years ago

Works great in the 6/07 branch build. Uppercase elements are nolonger being rendered as HTML elements.

Christopher Hoess (gone)

Comment 20

•

24 years ago

WFM, yesterday's Linux CVS build.

Keywords: testcase

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 21

•

24 years ago

Please note that this bug will be hidden because bug 29171 was fixed. This is not fixed, AFAIK. But as long as this does not cause problems this can be futured, or if you really want to remove this from people's radar, mark it worksforme and if it some day raises its head try to remember to reopen this and not open a new one.

Pierre Saslawsky

Assignee

Comment 22

•

24 years ago

For the record, this bug is under Style System because of the question raised by Ian on [ 2001-01-13 04:09] which is... Why is the Style System matching the elements in the HTML namespace in an XML document case insensitively?

Status: NEW → ASSIGNED

Priority: -- → P4

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Comment 23

•

24 years ago

> Why is the Style System matching the > elements in the HTML namespace in an XML document case insensitively? Presumably because the old content model code was changing the case of those elements. I doubt there was ever a style system bug here.

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Comment 24

•

24 years ago

See previous comment. *** This bug has been marked as a duplicate of 29171 ***

Status: ASSIGNED → RESOLVED

Closed: 24 years ago

Resolution: --- → DUPLICATE

Hixie (not reading bugmail)

Comment 25

•

24 years ago

Status: RESOLVED → VERIFIED

You need to log in before you can comment on or make changes to this bug.