Bug 1367471 (Open)

De-duplicate strings or other constant & common data during compaction

Categories

(Core :: JavaScript: GC, enhancement, P3)

People

(Reporter: pbone, Unassigned)

References

(Blocks 1 open bug)

Details

(Keywords: triage-deferred, Whiteboard: [MemShrink:P2])

erahm noticed that about:memory reported 10MB of copies of the string "#000000" (CSS black), so fitzgen suggested de-duplicating strings during compaction.
In bug 1338930 there are 56MB (2.5 million copies) of the string ":DIV".
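As a rough illustration of the suggested approach, here is a minimal sketch in plain C++ of hash-based de-duplication during a compaction-style pass. It is not SpiderMonkey's actual GC code; StringCell, its forwarded field, and dedupeShortStrings are hypothetical stand-ins for the real cell layout and pointer-forwarding machinery.

    // Minimal sketch (not SpiderMonkey's real API) of de-duplicating short
    // strings during a compaction-style pass. All names are hypothetical.
    #include <cstddef>
    #include <string>
    #include <unordered_map>
    #include <vector>

    struct StringCell {
      std::string chars;               // character data owned by this cell
      StringCell* forwarded = nullptr; // set when this cell is de-duplicated
    };

    // Walk the tenured string cells: the first cell seen with given contents
    // becomes canonical, and later duplicates are forwarded to it so the
    // compactor can update references and reclaim the duplicate's storage.
    void dedupeShortStrings(std::vector<StringCell*>& cells, size_t maxLen = 64) {
      std::unordered_map<std::string, StringCell*> canonical;
      for (StringCell* cell : cells) {
        if (cell->chars.size() > maxLen) {
          continue; // skip long strings; hashing them eagerly is too costly
        }
        auto [it, inserted] = canonical.try_emplace(cell->chars, cell);
        if (!inserted) {
          cell->forwarded = it->second; // duplicate: point at the canonical copy
        }
      }
    }

Restricting the pass to short strings keeps the hashing cost bounded; the "#000000" and ":DIV" cases above would be covered, while large strings are skipped.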
Blocks: 1424901
This sort of thing makes me sad... yes, it's a website leak, but perhaps we could do (some) deduping without a big perf hit to limit the pain of this sort of thing, or run a separate idle-time-scheduled de-duping pass. I'd suggest using some heuristics to decide whether there's any chance of a big win before throwing too many cycles at it; perhaps during compaction we could record a histogram of string sizes and see if there are hot spots.

1,248.01 MB (70.42%) -- string(length=648061, copies=624, "url("data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAABi" (truncated))
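A sketch of the histogram heuristic mentioned above, again in plain C++ with made-up names: record string lengths during the compaction walk, and only attempt de-duplication when some length bucket looks suspiciously hot.

    // Hypothetical helper for the "histogram of string sizes" heuristic.
    #include <cstddef>
    #include <map>

    struct LengthHistogram {
      std::map<size_t, size_t> countsByLength;

      // Called once per string while the compactor is walking cells anyway.
      void record(size_t length) { ++countsByLength[length]; }

      // A cheap signal that de-duplication might pay off: some single length
      // bucket holds at least `threshold` strings (e.g. 2.5 million ":DIV"s).
      bool hasHotSpot(size_t threshold) const {
        for (const auto& entry : countsByLength) {
          if (entry.second >= threshold) {
            return true;
          }
        }
        return false;
      }
    };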
We talked about this during the GC meeting in Orlando.

1) We should probably prioritize solving this for short strings like those in comment #0 and comment #1; given how short they are, the overhead of hashing them should be relatively small.
2) For huge strings like the one in comment #2 we'd need to do something a little more clever - maybe we could hash the first page and compare prefixes before committing to comparing the full strings (sketched after this comment).
3) I remember a bug where we had a large number of strings that shared a long prefix but each had a unique suffix - it would be great if we could turn these into ropes somehow.
4) Another bug I remember has us keeping long strings alive even though the JS only uses short substrings; it would be great to copy/inline/deduplicate these during compaction as well.

I wondered whether it would be a good idea to store a hash per page of a string (or rope) so we could deduplicate page-sized chunks. Of course, it won't work when comparing flattened strings with variable-length prefixes, but it would work on mostly identical strings with differing fixed-length prefixes.
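For point 2, a sketch of the "hash the first page, compare prefixes first" filter in standard C++; PAGE_SIZE, prefixHash, and groupLikelyDuplicates are placeholders rather than real SpiderMonkey names. Strings are bucketed by length and prefix hash, and only strings that land in the same bucket are ever compared byte-for-byte.

    // Hypothetical prefix-hash filter for de-duplicating huge strings.
    #include <algorithm>
    #include <cstddef>
    #include <functional>
    #include <string>
    #include <string_view>
    #include <unordered_map>
    #include <vector>

    constexpr size_t PAGE_SIZE = 4096;

    // Hash only the first page: cheap even for multi-megabyte data URLs.
    static size_t prefixHash(const std::string& s) {
      return std::hash<std::string_view>{}(
          std::string_view(s.data(), std::min(s.size(), PAGE_SIZE)));
    }

    // Group strings by (length, prefix hash). Full byte-for-byte comparison is
    // only needed within a group, so unrelated huge strings are never scanned.
    std::vector<std::vector<const std::string*>> groupLikelyDuplicates(
        const std::vector<const std::string*>& strings) {
      std::unordered_map<size_t, std::vector<const std::string*>> buckets;
      for (const std::string* s : strings) {
        size_t key = s->size() ^ (prefixHash(*s) << 1);
        buckets[key].push_back(s);
      }
      std::vector<std::vector<const std::string*>> groups;
      for (auto& entry : buckets) {
        if (entry.second.size() > 1) {
          groups.push_back(std::move(entry.second));
        }
      }
      return groups;
    }

The per-page-hash idea at the end of the comment would extend this: keep one hash per page so that page-sized chunks, rather than whole strings, can be shared, at the cost of maintaining that side table.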
Depends on: 1568923
Severity: S3
See Also: 1442516