Closed
Bug 385776
Opened 18 years ago
Closed 15 years ago
make entity handling match IE7 / HTML5
Categories
(Core :: DOM: HTML Parser, defect)
Core
DOM: HTML Parser
Tracking
()
RESOLVED
FIXED
People
(Reporter: zcorpan, Unassigned)
References
()
Details
(Whiteboard: [fixed by the HTML5 parser])
Attachments
(6 files)
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre
Semi-colonless entities in content aren't supported (001). ⟨ and ⟩ don't return the right characters (per HTML5) (001..003). ™ is not supported (001..003). U+002D (-), U+002E (.), and U+005F (_) don't terminate entities in attributes (004).
Reproducible: Always
Steps to Reproduce:
1. Run the test cases 001..004
2.
3.
Relevant spec: http://www.whatwg.org/specs/web-apps/current-work/multipage/section-tokenisation.html#tokenising
Reporter | ||
Comment 1•18 years ago
|
||
Reporter | ||
Comment 2•18 years ago
|
||
Reporter | ||
Comment 3•18 years ago
|
||
Reporter | ||
Comment 4•18 years ago
|
||
![]() |
||
Comment 5•17 years ago
|
||
Note to whoever decides to fix this: don't screw up view-source of XML.
Comment 6•17 years ago
|
||
(In reply to comment #1)
> Created an attachment (id=269684) [details]
> Entities in content
Hi, I'm fixing bug 427240 which tries to coveer this tetstcase (entities in content). Please take a look to comment #15 https://bugzilla.mozilla.org/show_bug.cgi?id=427240#c15
Are lang and rang codes wrong here? And, should I add TRADE in uppercase too?
Please let me know.
Thanks!
Daniel.
Comment 7•17 years ago
|
||
(In reply to comment #5)
> Note to whoever decides to fix this: don't screw up view-source of XML.
Hello Boris, could you please attach a test case, so I ensure I don't screw up anything?
Thanks!
Daniel.
![]() |
||
Comment 8•17 years ago
|
||
When viewing source, the entire entity name should be an error:
data:text/xml,<root>&ersand</root>
When viewing source, the "amp" part should be an error:
data:text/xml,<root>& ersand</root>
There might be other situations that need testing too. I don't recall all of them offhand, but the point is that the HTML tokenizer is used for XML view-source, so any changes to it need to be carefully thought about in that context.
Comment 9•17 years ago
|
||
Sorry Boris, I didn't understand your example :( (please note I'm new here).
I understand that the same parser is used both for HTML and XML.
I beg you to create a minimalistic XML with your examples above, so I can open it with FF before and after my fix, and check that it is parsed in the same way.
Thanks!
Daniel.
Comment 10•17 years ago
|
||
With the newly added patch of bug 427240, all testcases of this bug report pass (except for lang and rang). I'm waiting for a response about this. If we agree on that, then this bug can be closed as well.
Daniel.
Comment 11•17 years ago
|
||
Hi, this is how my patched FF renders the output of testcase #2.
As you can see, the left angle and right angle brackets are properly rendered with the existing values. So, PLEASE CONFIRM that I should change them, since once I do it, they will be no longer properly rendered.
(unless I should change them in the rendering engine as well).
Thanks!
Daniel.
Reporter | ||
Comment 12•17 years ago
|
||
(I think that U+3008 and U+3009 don't properly render for you because you don't have any glyphs for them in your fonts.)
Please note that HTML 5 has changed wrt ⟨ and ⟩ since this bug was reported:
http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named
http://html5.org/tools/web-apps-tracker?from=1285&to=1286
lang; U+027E8
rang; U+027E9
Updated•17 years ago
|
Blocks: html5-parsing
Comment 13•16 years ago
|
||
(In reply to comment #12)
> (I think that U+3008 and U+3009 don't properly render for you because you don't
> have any glyphs for them in your fonts.)
>
> Please note that HTML 5 has changed wrt ⟨ and ⟩ since this bug was
> reported:
>
> http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named
> http://html5.org/tools/web-apps-tracker?from=1285&to=1286
>
> lang; U+027E8
> rang; U+027E9
Fixed in the HTML5 parsing repo with this spec change. However, the default font for U+027E8 and U+027E9 looks terrible on Mac.
Status: UNCONFIRMED → NEW
Ever confirmed: true
Comment 14•16 years ago
|
||
Updated•16 years ago
|
No longer blocks: html5-parsing
Depends on: html5-parsing
Comment 15•15 years ago
|
||
The ugliness is bug 431559.
Status: NEW → RESOLVED
Closed: 15 years ago
Resolution: --- → FIXED
Whiteboard: [fixed by the HTML5 parser]
You need to log in
before you can comment on or make changes to this bug.
Description
•