Closed Bug 47583 Opened 25 years ago Closed 24 years ago

Why is the &#x97; entity translated to U+2014 in an iso8859-1 document?

Categories

(Core :: DOM: HTML Parser, defect, P3)

x86
Linux
defect

Tracking

()

RESOLVED WORKSFORME

People

(Reporter: tenthumbs, Assigned: rickg)

Details

Attachments

(1 file)

See the attached HTML document. U+0097 is a well-defined Unicode character. The Unicode ISO8859-1 mapping uses an identity mapping for all the C1 control characters, so I would think that an iso8859-1 document would treat "&#x97;" as a control character. It doesn't.
Attached file U+0097 HTML test
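The mismatch described in the report can be reproduced outside the browser. The sketch below is illustrative only and is not taken from the attachment. It assumes the parser treats numeric references in the 0x80-0x9F range as Windows-1252 bytes, which is how the later HTML5 rules (and Python's `html.unescape`) handle them; the thread does not confirm that this is what the parser did at the time. The helper name `codepoint` is hypothetical.

```python
import html


def codepoint(ch: str) -> str:
    """Format a single character as a U+XXXX code point label."""
    return f"U+{ord(ch):04X}"


# Byte 0x97 under the ISO-8859-1 identity mapping: stays a C1 control character.
latin1_char = bytes([0x97]).decode("iso-8859-1")

# The same byte under Windows-1252: mapped to a printable dash.
cp1252_char = bytes([0x97]).decode("cp1252")

# The numeric character reference, resolved with HTML5-style replacement rules.
ref_char = html.unescape("&#x97;")

print("iso-8859-1 byte 0x97 ->", codepoint(latin1_char))
print("cp1252 byte 0x97     ->", codepoint(cp1252_char))
print("&#x97; unescaped     ->", codepoint(ref_char))
```

Under these assumptions the first line reports U+0097 and the other two report U+2014, which matches the behavior the reporter observed.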
Re-assigning bugs on Clayton's list opened between 7/29 and 8/4 to Kevin for further triage.
Assignee: clayton → kmcclusk
Do we support charset=iso-8859-1 on the meta tag? Changing component to Parser and reassigning.
Assignee: kmcclusk → rickg
Component: Layout → Parser
QA Contact: petersen → janc
This looks fine to me, and is equivalent to what gets displayed in Nav and IE.
Status: NEW → RESOLVED
Closed: 24 years ago
Resolution: --- → WORKSFORME
Updated QA contact.
QA Contact: janc → bsharma
QA Contact: bsharma → moied