Closed Bug 385776 Opened 17 years ago Closed 14 years ago

make entity handling match IE7 / HTML5

Categories

(Core :: DOM: HTML Parser, defect)

defect
Not set
normal

Tracking

()

RESOLVED FIXED

People

(Reporter: zcorpan, Unassigned)

References

()

Details

(Whiteboard: [fixed by the HTML5 parser])

Attachments

(6 files)

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9a6pre) Gecko/20070620 Minefield/3.0a6pre

Semicolon-less entities in content aren't supported (001). &lang; and &rang; don't return the right characters (per HTML5) (001..003). &TRADE; is not supported (001..003). U+002D (-), U+002E (.), and U+005F (_) don't terminate entities in attributes (004).
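For illustration only, the semicolon-less and trademark cases from the report can be tried outside the browser with Python's standard `html.unescape`, which decodes using the HTML5 named character reference table. This is a sketch and not Gecko's tokenizer; the sample strings are made up for the example, and the attribute-value case from testcase 004 is not modelled here.

```python
import html

# html.unescape follows the HTML5 named character reference table.
# It models text content only; it does not implement the
# attribute-value rules that testcase 004 exercises.
# The sample strings below are hypothetical, not taken from the attachments.
samples = ["&copy 2007", "AT&ampT", "&TRADE;", "&trade;"]
decoded = {s: html.unescape(s) for s in samples}

for source, result in decoded.items():
    print(f"{source!r} -> {result!r}")
```

Legacy names such as `copy` and `amp` decode without a trailing semicolon, and both `TRADE;` and `trade;` are present in the HTML5 table.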

Reproducible: Always

Steps to Reproduce:
1. Run the test cases 001..004



Relevant spec: http://www.whatwg.org/specs/web-apps/current-work/multipage/section-tokenisation.html#tokenising
Attached file Entities in content
Note to whoever decides to fix this: don't screw up view-source of XML.
(In reply to comment #1)
> Created an attachment (id=269684) [details]
> Entities in content

Hi, I'm fixing bug 427240, which tries to cover this testcase (entities in content). Please take a look at comment #15: https://bugzilla.mozilla.org/show_bug.cgi?id=427240#c15

Are the lang and rang code points wrong here? And should I add TRADE in uppercase too?

Please let me know.

Thanks!

  Daniel.
(In reply to comment #5)
> Note to whoever decides to fix this: don't screw up view-source of XML.

Hello Boris, could you please attach a test case, so I can make sure I don't screw up anything?

Thanks!
  Daniel.
When viewing source, the entire entity name should be an error:

  data:text/xml,<root>&ampersand</root>

When viewing source, the "amp" part should be an error:

  data:text/xml,<root>&amp ersand</root>

There might be other situations that need testing too.  I don't recall all of them offhand, but the point is that the HTML tokenizer is used for XML view-source, so any changes to it need to be carefully thought about in that context.
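As a rough way to check which inputs are errors at the XML level, the two examples above can be fed to a conforming XML parser. This sketch uses Python's `xml.etree.ElementTree` and only reports well-formedness; it says nothing about which part of the text Gecko's view-source highlights. The helper name `is_well_formed` is hypothetical.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses as well-formed XML."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# The first two documents are the examples from the comment above;
# the third is a correctly terminated reference for comparison.
results = {
    doc: is_well_formed(doc)
    for doc in (
        "<root>&ampersand</root>",
        "<root>&amp ersand</root>",
        "<root>&amp;ersand</root>",
    )
}

for doc, ok in results.items():
    print(f"{doc!r}: {'well-formed' if ok else 'error'}")
```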
Sorry Boris, I didn't understand your example :(   (please note I'm new here).
I understand that the same parser is used for both HTML and XML.

Could you please create a minimal XML file with your examples above, so I can open it in FF before and after my fix and check that it is parsed the same way?

Thanks!

  Daniel.
With the newly added patch of bug 427240, all testcases of this bug report pass (except for lang and rang). I'm waiting for a response about this. If we agree on that, then this bug can be closed as well.
  Daniel.
Hi, this is how my patched FF renders the output of testcase #2.
As you can see, the left angle and right angle brackets are properly rendered with the existing values. So, PLEASE CONFIRM that I should change them, since once I do, they will no longer be properly rendered
(unless I should change them in the rendering engine as well).

Thanks!
  Daniel.
(I think that U+3008 and U+3009 don't properly render for you because you don't have any glyphs for them in your fonts.)

Please note that HTML 5 has changed wrt &lang; and &rang; since this bug was reported:

http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named
http://html5.org/tools/web-apps-tracker?from=1285&to=1286

lang; 	U+027E8
rang; 	U+027E9
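The values listed above can be cross-checked against the tables shipped in Python's standard library, assuming those tables track the specs: `html.entities.name2codepoint` holds the HTML 4.01 definitions and `html.entities.html5` holds the HTML5 ones. This is only a lookup sketch; the variable name `mapping` is made up for the example.

```python
import unicodedata
from html.entities import html5, name2codepoint

# Compare the HTML 4.01 code point with the HTML5 character for each name.
mapping = {}
for name in ("lang", "rang"):
    html4_char = chr(name2codepoint[name])   # HTML 4.01 table
    html5_char = html5[name + ";"]           # HTML5 table, keyed with ";"
    mapping[name] = (html4_char, html5_char)
    print(
        f"&{name}; HTML4: U+{ord(html4_char):04X} {unicodedata.name(html4_char)}"
        f" | HTML5: U+{ord(html5_char):04X} {unicodedata.name(html5_char)}"
    )
```

Neither table uses U+3008 or U+3009, the values discussed earlier in this thread, so a font that lacks glyphs for one pair may still cover another.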
(In reply to comment #12)
> (I think that U+3008 and U+3009 don't properly render for you because you don't
> have any glyphs for them in your fonts.)
> 
> Please note that HTML 5 has changed wrt &lang; and &rang; since this bug was
> reported:
> 
> http://www.whatwg.org/specs/web-apps/current-work/multipage/named.html#named
> http://html5.org/tools/web-apps-tracker?from=1285&to=1286
> 
> lang;     U+027E8
> rang;     U+027E9

Fixed in the HTML5 parsing repo with this spec change. However, the default font for U+027E8 and U+027E9 looks terrible on Mac.
Status: UNCONFIRMED → NEW
Ever confirmed: true
No longer blocks: html5-parsing
Depends on: html5-parsing
The ugliness is bug 431559.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Whiteboard: [fixed by the HTML5 parser]