Open Bug 1029309 Opened 10 years ago Updated 4 months ago

Running make named_characters in parser/html/java removes some named characters.

Categories

(Core :: DOM: HTML Parser, defect, P3)

ASSIGNED

People

(Reporter: yuki.sekiguchi, Assigned: patgrizle)

Details

Attachments

(1 file)

User Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/35.0.1916.153 Safari/537.36

Steps to reproduce:

cd $MOZILLA_SRC/parser/html/java
make sync
make translate
make named_characters
cd $MOZILLA_SRC
mach build
mach mochitest-plain parser/htmlparser/tests/mochitest/test_html5_tree_construction.html
mach mochitest-plain parser/htmlparser/tests/mochitest/test_html5_tree_construction_part2.html


Actual results:

Some tests fail because named character references are not converted to the actual characters.


Expected results:

The tests pass.
OS: Mac OS X → All
Hardware: x86 → All
Component: Untriaged → HTML: Parser
Product: Firefox → Core

Bulk-downgrade of unassigned, >=5 years untouched DOM/Storage bugs' priority.

If you have reason to believe this is wrong (especially for the severity), please write a comment and ni :jstutte.

Severity: normal → S4
Priority: -- → P5

We should actually look at this sooner rather than later.

Priority: P5 → P3
Assignee: nobody → patgrizle
Status: UNCONFIRMED → ASSIGNED
Ever confirmed: true

Hopefully just in time. The outdated named character references caused this problem.

The table at https://html.spec.whatwg.org/multipage/named-characters.html#named-character-references needed some modifications before the htmlparser library could parse it. The spec website also provides this data in JSON format, so it would be best to use that instead of the HTML, but the list is static and won't be expanded or changed in the future, so I'm not sure it's worth the effort.
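
For reference, here is a rough sketch of what reading the JSON form could look like. It is not how the htmlparser generator works today. It assumes the JSON maps each reference name (with its leading "&") to an object with "codepoints" and "characters" fields, and it uses a small inline excerpt rather than the real file. The names SAMPLE_ENTITIES_JSON and load_named_characters are made up for illustration.

```python
import json

# Tiny inline excerpt in the assumed shape of the spec's JSON data:
# "&name;" -> {"codepoints": [...], "characters": "..."}.
# A real run would read the full file instead of this sample.
SAMPLE_ENTITIES_JSON = r'''
{
  "&AElig;": { "codepoints": [198], "characters": "\u00C6" },
  "&AElig": { "codepoints": [198], "characters": "\u00C6" },
  "&amp;": { "codepoints": [38], "characters": "&" },
  "&NotEqualTilde;": { "codepoints": [8770, 824], "characters": "\u2242\u0338" }
}
'''


def load_named_characters(text):
    """Return a dict mapping names (without the leading '&') to their expansion."""
    raw = json.loads(text)
    table = {}
    for name, entry in raw.items():
        # Build the replacement from the code points so multi-code-point
        # references (such as NotEqualTilde) are handled uniformly.
        value = "".join(chr(cp) for cp in entry["codepoints"])
        key = name[1:] if name.startswith("&") else name
        table[key] = value
    return table


if __name__ == "__main__":
    for name, value in sorted(load_named_characters(SAMPLE_ENTITIES_JSON).items()):
        print(name, [hex(ord(ch)) for ch in value])
```

Note that some names appear both with and without the trailing semicolon, so a generator working from this data would have to keep both forms as separate entries.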
