Closed Bug 107904 Opened 23 years ago Closed 21 years ago

Problems with XHTML as text/html in HTML tokenizer

Status:

RESOLVED WONTFIX

Milestone:

Future

People

(Reporter: choess, Assigned: harishd)

Christopher Hoess (gone)

Reporter

Description

•

23 years ago

From a conversation with harishd on IRC tonight: CParserContext::SetMimeType() isn't setting the XML parsing flag for XHTML served as text/html, which will eventually be needed to parse XML-style empty tags properly. The solution is probably to modify the doctype-sniffing code upstream of the tokenizer so that XHTML DTDs set both the strict and XML flags. (Actually, layout is handling text/html XHTML as quirks right now, but that's another bug.)
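
The proposal above can be sketched roughly as follows. This is only an illustration, not the actual Mozilla sniffing code: `SniffedFlags`, `SniffDoctype`, and the prefix match on the public identifier are all hypothetical names and simplifications chosen for this example.

```cpp
#include <cassert>
#include <string>

// Hypothetical flag pair; the real parser context stores this differently.
struct SniffedFlags {
    bool strict = false;  // strict (standards) parse mode
    bool xml = false;     // XML-style syntax such as <br /> empty tags
};

// Assumption for this sketch: any public identifier that begins with
// "-//W3C//DTD XHTML" counts as an XHTML doctype. Other doctypes are left
// with both flags off here; real sniffing would still have to decide
// strict versus quirks for them.
SniffedFlags SniffDoctype(const std::string& publicId) {
    SniffedFlags flags;
    const std::string xhtmlPrefix = "-//W3C//DTD XHTML";
    if (publicId.compare(0, xhtmlPrefix.size(), xhtmlPrefix) == 0) {
        flags.strict = true;  // XHTML DTDs are strict...
        flags.xml = true;     // ...and also request XML-style tag handling
    }
    return flags;
}
```

With this shape, the tokenizer would only need to consult `flags.xml`, regardless of the MIME type the document was served with.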

Christopher Hoess (gone)

Reporter

Updated

•

23 years ago

Blocks: 94284

Christopher Hoess (gone)

Reporter

Comment 1

•

23 years ago

Looks like the most likely solution would be to hack nsParser.cpp around line 1105 so documents with an XHTML doctype are set to, say, aDocType = eXHTML. In "if (aDocType == eXML)", s/XML/XHTML/, and the tokenizer should work correctly. Parse Mode will still be set to strict; however, I'm not sure exactly how the parser decides to use HTML or XML tokenizer, so I don't know how this would affect that mechanism.
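
A minimal sketch of that substitution, under stated assumptions: the enum values, `TokenizerSetup`, and `SetupFor` below are hypothetical stand-ins, and what the `eXML` branch in nsParser.cpp really does is not shown in this bug, so the `xmlEmptyTags` flag is only a guess at its effect.

```cpp
#include <cassert>

// Hypothetical stand-ins for the parser's doctype enum. Only eXML and the
// proposed eXHTML are named in the comment above.
enum eDocTypeSketch { eHTML_Sketch, eXML_Sketch, eXHTML_Sketch };

struct TokenizerSetup {
    bool strictMode;    // parse mode; stays strict for XHTML
    bool xmlEmptyTags;  // assumed effect of the branch being edited
};

TokenizerSetup SetupFor(eDocTypeSketch aDocType) {
    TokenizerSetup setup{false, false};
    // Simplification: treat every non-HTML doctype as strict.
    setup.strictMode = (aDocType != eHTML_Sketch);
    // Before the proposed change this test would read "== eXML_Sketch".
    if (aDocType == eXHTML_Sketch) {
        setup.xmlEmptyTags = true;
    }
    return setup;
}
```

Note that after the substitution a plain `eXML_Sketch` document no longer reaches the branch, which is one way to picture the open question about how the HTML/XML tokenizer choice would be affected.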

harishd

Assignee

Updated

•

23 years ago

Status: NEW → ASSIGNED

Priority: -- → P3

Target Milestone: --- → mozilla0.9.8

harishd

Assignee

Updated

•

23 years ago

Target Milestone: mozilla0.9.8 → mozilla1.0

harishd

Assignee

Comment 2

•

23 years ago

mass move to 1.1

Target Milestone: mozilla1.0 → mozilla1.1

Jonas Jørgensen

Comment 3

•

23 years ago

Pardon my ignorance, but is this bug requesting that XHTML documents served as text/html be parsed as XML (as they would if they were served with an XML mime type)?

Christopher Hoess (gone)

Reporter

Comment 4

•

23 years ago

I should hope not! This is in reference to XML-style empty elements being incompatible with strict SGML parsing: see <URL:http://www.cs.tut.fi/~jkorpela/html/empty.html>.
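
To illustrate the incompatibility: under SGML's SHORTTAG rules, the `/` in `<br />` already closes the start tag, so the `>` that follows is character data. The toy reader below contrasts the two readings for a single attribute-free start tag of an empty element. `TagReading` and `ReadStartTag` are hypothetical names, and this is nowhere near a full SGML or XML tokenizer (for non-empty elements the SGML rule behaves differently, with content running up to the next `/`).

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

struct TagReading {
    std::string name;      // element name
    std::string trailing;  // source text left over after the tag
};

// Reads one attribute-free start tag such as "<br />" or "<br>".
// sgmlNet == true  : strict SGML reading, '/' alone ends the start tag.
// sgmlNet == false : XML-style reading, "/>" together ends the tag.
TagReading ReadStartTag(const std::string& src, bool sgmlNet) {
    TagReading r;
    std::size_t i = 0;
    if (i < src.size() && src[i] == '<') ++i;
    while (i < src.size() && std::isalnum(static_cast<unsigned char>(src[i]))) {
        r.name += src[i++];
    }
    while (i < src.size() && std::isspace(static_cast<unsigned char>(src[i]))) {
        ++i;
    }
    if (i < src.size() && src[i] == '/') {
        ++i;  // consume '/'
        // XML-style: the '>' belongs to the same delimiter.
        if (!sgmlNet && i < src.size() && src[i] == '>') ++i;
    } else if (i < src.size() && src[i] == '>') {
        ++i;  // ordinary end of start tag
    }
    r.trailing = src.substr(i);
    return r;
}
```

For `"<br />"`, the SGML reading leaves `">"` behind as text, while the XML-style reading consumes the whole tag.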

Heikki Toivonen (remove -bugzilla when emailing directly)

Comment 5

•

23 years ago

[was 1.1alpha]

Target Milestone: mozilla1.1alpha → Future

Hixie (not reading bugmail)

Comment 6

•

22 years ago

According to the HTMLWG, XHTML sent as text/html must not be parsed any differently than HTML sent as text/html. So this would be WONTFIX, no? I wish we implemented the NET feature...

Boris Zbarsky [:bzbarsky]

Comment 7

•

21 years ago

I'm going to go out on a limb here and wontfix this.

Status: ASSIGNED → RESOLVED

Closed: 21 years ago

Resolution: --- → WONTFIX

Categories

(Core :: DOM: HTML Parser, defect, P3)
