Bring htmlparser ampersand-error-reporting into conformance with HTML spec
Categories
(Core :: DOM: HTML Parser, defect)
Tracking
()
| Tracking | Status | |
|---|---|---|
| firefox93 | --- | fixed |
People
(Reporter: sideshowbarker, Assigned: hsivonen)
References
Details
Attachments
(1 file, 7 obsolete files)
| Reporter | ||
Comment 1•10 years ago
|
||
| Reporter | ||
Comment 2•10 years ago
|
||
| Reporter | ||
Comment 3•10 years ago
|
||
| Assignee | ||
Comment 4•10 years ago
|
||
| Reporter | ||
Comment 5•10 years ago
|
||
| Reporter | ||
Comment 6•10 years ago
|
||
| Reporter | ||
Comment 7•10 years ago
|
||
| Assignee | ||
Updated•10 years ago
|
| Assignee | ||
Comment 8•5 years ago
|
||
| Assignee | ||
Comment 9•5 years ago
|
||
| Assignee | ||
Updated•5 years ago
|
| Assignee | ||
Updated•5 years ago
|
| Assignee | ||
Comment 10•5 years ago
|
||
| Assignee | ||
Comment 11•5 years ago
|
||
This refines 9ce4bd4 by only buffering if we’re actually inside an
attribute value.
Updated•5 years ago
|
| Assignee | ||
Comment 12•5 years ago
|
||
This patch appears to change the DOM in an unwanted way:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=59a15640760bc1c1a4629c90b849cfa43a20c057&selectedTaskRun=GdCS5FrVRbCXm2jscZdZkw.0
| Reporter | ||
Comment 13•5 years ago
|
||
(In reply to Henri Sivonen (:hsivonen) (not reading bugmail until 2020-08-03) from comment #12)
This patch appears to change the DOM in an unwanted way:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=59a15640760bc1c1a4629c90b849cfa43a20c057&selectedTaskRun=GdCS5FrVRbCXm2jscZdZkw.0
Using the current Java sources for the parser — including the https://phabricator.services.mozilla.com/D81992 patch to the Java sources — I can’t reproduce the test failures shown at https://treeherder.mozilla.org/#/jobs?repo=try&revision=59a15640760bc1c1a4629c90b849cfa43a20c057&selectedTaskRun=GdCS5FrVRbCXm2jscZdZkw.0
For example, given this input:
<html><head><body>&ammmp;
…the Java parser produces the expected output — specifically, it outputs the &ammmp; as expected.
So I’m wondering if it’s possible that some difference could be getting introduced during the Java-to-C++ translation.
| Reporter | ||
Comment 14•5 years ago
|
||
The Java parser can also be tested by doing this:
curl -L -O https://github.com/validator/validator/releases/download/jar/vnu.jar \
&& echo "<html><head><body>&ammmp;" > TEST.html
&& java -cp vnu.jar nu.validator.htmlparser.test.TreePrinter TEST.html
That outputs this tree:
| <html>
| <head>
| <body>
| "&ammmp;
"
| Assignee | ||
Comment 15•5 years ago
|
||
For Bugzilla record: This is being discussed on GitHub. The difference is not from C++ but from where the test harness puts buffer boundaries.
| Assignee | ||
Comment 16•4 years ago
|
||
| Assignee | ||
Comment 17•4 years ago
|
||
Updated•4 years ago
|
Updated•4 years ago
|
| Assignee | ||
Comment 18•4 years ago
|
||
Updated•4 years ago
|
Updated•4 years ago
|
| Assignee | ||
Comment 19•4 years ago
|
||
| Assignee | ||
Comment 20•4 years ago
|
||
Comment 21•4 years ago
|
||
Comment 23•4 years ago
|
||
| bugherder | ||
Description
•