Open Bug 489822 Opened 15 years ago Updated 2 years ago

[HTML5] Read input once and write to tree builder directly

Tracking

()

Status:

NEW

People

(Reporter: hsivonen, Unassigned)

References

(Depends on 1 open bug)

Details

Henri Sivonen (:hsivonen)

Reporter

Description

•

15 years ago

As an artifact of SAX, the HTML5 tokenizer tries to batch its character data transfers from the input buffer into the tree builder accumulation buffer by flushing runs of characters via memcpy.

However, the tokenizer will have read all those characters once by then. If the write operation to the accumulation buffer on a per character basis were more efficient than the per-character amortized memcpy (re-)read&write, it would be worthwhile to write characters one by one into the accumulation buffer.

This means the tokenizer/treebuilder boundary can't become fully virtual for sanitization layers or such, since the per character write should be inlineable.

This would probably require bug 489820 and bug 489821 as prerequisites.

Henri Sivonen (:hsivonen)

Reporter

Updated

•

15 years ago

Priority: -- → P5

BMO Automation

Updated

•

2 years ago

Severity: normal → S3

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

[HTML5] Read input once and write to tree builder directly

Categories

(Core :: DOM: HTML Parser, enhancement, P5)

Tracking

()

People

(Reporter: hsivonen, Unassigned)

References

(Depends on 1 open bug)

Details

Crash Data

Security

(public)

User Story

Description

Updated

Updated