477455 - Parser does not wait for "?>" to close blocks that begin with "<?"

Reporter

Description

•

16 years ago

The markup parser knows that blocks that begin with "<?" are special (the source code viewer highlights them as pink), so it should know that those blocks can only end with "?>". It currently doesn't, meaning it ends the block whenever it encounters any old ">". This doesn't seem right to me. The XML declaration requires the question mark end tag, as does PHP. I can't think of any markup language (or any other language, for that matter) that opens with a question mark but doesn't close with one.

Blake Kaplan (:mrbkap) (inactive)

Comment 1

•

16 years ago

Well, HTML actually requires this. http://www.w3.org/TR/html4/appendix/notes.html#h-B.3.6 is the relevant specification.

Gordon P. Hemsley [:GPHemsley]

Reporter

Comment 2

•

16 years ago

Hmm... well the beginning of that Appendix B says that it is informative, not normative, but it says that everything listed there is already defined elsewhere. Could you point me to where that issue is defined in the normative part of the spec? Also, does anyone anywhere use these SGML "processing instructions" as they are defined in the spec? Seriously, just point me to one valid use that Firefox is expected to (and does) interpret correctly. (However, XSLT processing instructions might have to be looked into. I can't figure out if they end with ">" or "?>". And those would be necessary to support, right? [Even if I don't know what they're for....])

Blake Kaplan (:mrbkap) (inactive)

Comment 3

•

16 years ago

(In reply to comment #2) > Hmm... well the beginning of that Appendix B says that it is informative, not > normative, but it says that everything listed there is already defined > elsewhere. Could you point me to where that issue is defined in the normative > part of the spec? It isn't, except if you count that HTML is an SGML application. Currently HTML5 specifies <? ... >. I don't have strong feelings about this one way or the other, you might want to bring it up on the WhatWG list.

Hixie (not reading bugmail)

Comment 4

•

16 years ago

<? ... ?> is an XMLism. <? ... > is an SGMLism. HTML comes from an SGML heritage (it predates XML). However this is all rather academic, because HTML doesn't actually have any valid <? ... > constructs, and so there's not really anything to support here.

Status: NEW → RESOLVED

Closed: 16 years ago

Resolution: --- → INVALID

Gordon P. Hemsley [:GPHemsley]

Reporter

Comment 5

•

16 years ago

(In reply to comment #4) > <? ... ?> is an XMLism. > <? ... > is an SGMLism. > HTML comes from an SGML heritage (it predates XML). > > However this is all rather academic, because HTML doesn't actually have any > valid <? ... > constructs, and so there's not really anything to support here. Well, if HTML doesn't have valid <? ... > constructs to support, what's the harm in changing it to <? ... ?> parsing?

Hixie (not reading bugmail)

Comment 6

•

16 years ago

What's the benefit? Changing things usually has a cost associated with it, in terms of needed coding, testing, documenting, etc, and often causes compatibility problems. There are almost certainly pages that depend on the current error handling. Since all the browsers pretty much agree on this, why change it?

Gordon P. Hemsley [:GPHemsley]

Reporter

Comment 7

•

16 years ago

Well, I was really prompted to report this when I loaded a page that contained PHP code in it. Rather than match the <?php with ?>, the parser (and the view-source parser, as I recall) stopped a > within the code, and thus left the remaining PHP out in the open for the markup parser to mishandle. Plus, I like symmetry. :)

Hixie (not reading bugmail)

Comment 8

•

16 years ago

Don't send PHP code to the browser. It'll fail just like if you send C++ code to the browser. :-)

Bugzilla

Parser does not wait for "?>" to close blocks that begin with "<?"

Categories

(Core :: DOM: HTML Parser, defect)

Tracking

()

People

(Reporter: GPHemsley, Unassigned)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Comment 3

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8