Closed Bug 483158 Opened 12 years ago Closed 12 years ago

[HTML5] Make CRLF normalization work with stream rewinding

Categories

(Core :: DOM: HTML Parser, defect, P2)

Other Branch
defect

Tracking

()

RESOLVED FIXED

People

(Reporter: hsivonen, Assigned: hsivonen)

References

Details

Currently, CRLF normalization in the HTML5 tokenizer overwrites original data in an irreversible and later undetectable way.

Need to make it so that the stream can be tokenized twice correctly. (For speculative parsing.)

The best solution I have come up with so far involves reversing CRLF pairs to LFCR on the first run and skipping CRs on the second run.
Priority: -- → P2
Summary: Make CRLF normalization work with stream rewinding → [HTML5] Make CRLF normalization work with stream rewinding
Probably the right way to go is to make the tokenizer not write into its input buffer at all.
Fixed in rev 965664cbf73b in HTML5 repo.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.