Closed
Bug 385776
Opened 17 years ago
Closed 14 years ago
make entity handling match IE7 / HTML5
Categories
(Core :: DOM: HTML Parser, defect)
Core
DOM: HTML Parser
Tracking
()
RESOLVED
FIXED
People
(Reporter: zcorpan, Unassigned)
References
()
Details
(Whiteboard: [fixed by the HTML5 parser])
Attachments
(6 files)
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre Semi-colonless entities in content aren't supported (001). ⟨ and ⟩ don't return the right characters (per HTML5) (001..003). ™ is not supported (001..003). U+002D (-), U+002E (.), and U+005F (_) don't terminate entities in attributes (004). Reproducible: Always Steps to Reproduce: 1. Run the test cases 001..004 2. 3. Relevant spec: http://www.whatwg.org/specs/web-apps/current-work/multipage/section-tokenisation.html#tokenising
Reporter | ||
Comment 1•17 years ago
|
||
Reporter | ||
Comment 2•17 years ago
|
||
Reporter | ||
Comment 3•17 years ago
|
||
Reporter | ||
Comment 4•17 years ago
|
||
Comment 5•17 years ago
|
||
Note to whoever decides to fix this: don't screw up view-source of XML.
Comment 6•16 years ago
|
||
(In reply to comment #1) > Created an attachment (id=269684) [details] > Entities in content Hi, I'm fixing bug 427240 which tries to coveer this tetstcase (entities in content). Please take a look to comment #15 https://bugzilla.mozilla.org/show_bug.cgi?id=427240#c15 Are lang and rang codes wrong here? And, should I add TRADE in uppercase too? Please let me know. Thanks! Daniel.
Comment 7•16 years ago
|
||
(In reply to comment #5) > Note to whoever decides to fix this: don't screw up view-source of XML. Hello Boris, could you please attach a test case, so I ensure I don't screw up anything? Thanks! Daniel.
Comment 8•16 years ago
|
||
When viewing source, the entire entity name should be an error: data:text/xml,<root>&ersand</root> When viewing source, the "amp" part should be an error: data:text/xml,<root>& ersand</root> There might be other situations that need testing too. I don't recall all of them offhand, but the point is that the HTML tokenizer is used for XML view-source, so any changes to it need to be carefully thought about in that context.
Comment 9•16 years ago
|
||
Sorry Boris, I didn't understand your example :( (please note I'm new here). I understand that the same parser is used both for HTML and XML. I beg you to create a minimalistic XML with your examples above, so I can open it with FF before and after my fix, and check that it is parsed in the same way. Thanks! Daniel.
Comment 10•16 years ago
|
||
With the newly added patch of bug 427240, all testcases of this bug report pass (except for lang and rang). I'm waiting for a response about this. If we agree on that, then this bug can be closed as well. Daniel.
Comment 11•16 years ago
|
||
Hi, this is how my patched FF renders the output of testcase #2. As you can see, the left angle and right angle brackets are properly rendered with the existing values. So, PLEASE CONFIRM that I should change them, since once I do it, they will be no longer properly rendered. (unless I should change them in the rendering engine as well). Thanks! Daniel.
Reporter | ||
Comment 12•16 years ago
|
||
(I think that U+3008 and U+3009 don't properly render for you because you don't have any glyphs for them in your fonts.) Please note that HTML 5 has changed wrt ⟨ and ⟩ since this bug was reported: http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named http://html5.org/tools/web-apps-tracker?from=1285&to=1286 lang; U+027E8 rang; U+027E9
Updated•16 years ago
|
Blocks: html5-parsing
Comment 13•15 years ago
|
||
(In reply to comment #12) > (I think that U+3008 and U+3009 don't properly render for you because you don't > have any glyphs for them in your fonts.) > > Please note that HTML 5 has changed wrt ⟨ and ⟩ since this bug was > reported: > > http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named > http://html5.org/tools/web-apps-tracker?from=1285&to=1286 > > lang; U+027E8 > rang; U+027E9 Fixed in the HTML5 parsing repo with this spec change. However, the default font for U+027E8 and U+027E9 looks terrible on Mac.
Status: UNCONFIRMED → NEW
Ever confirmed: true
Comment 14•15 years ago
|
||
Updated•15 years ago
|
No longer blocks: html5-parsing
Depends on: html5-parsing
Comment 15•14 years ago
|
||
The ugliness is bug 431559.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Whiteboard: [fixed by the HTML5 parser]
You need to log in
before you can comment on or make changes to this bug.
Description
•