Closed Bug 1014343 Opened 12 years ago Closed 11 years ago

DMD: add ability to do diffs of memory dumps

Status:

RESOLVED FIXED

Milestone:

mozilla36

People

(Reporter: n.nethercote, Assigned: n.nethercote)

Attachments

(1 file)

Add diff support to dmd.py (patch, 38.07 KB), 11 years ago, Nicholas Nethercote [inactive]; mccr8: review+

Nicholas Nethercote [inactive]

Assignee

Description

12 years ago

dbaron described on dev-platform one way he still uses trace-malloc:

> One is to check for leaks that involve caches (i.e., don't involve
> unreachable pointers). One can take two memory dumps at different
> times and build an allocation-stack-tree-diff of the two dumps. Thus
> one can get into a certain state, do things that should lead to the
> memory usage being the same as the first state, and see if anything
> interesting increased (with allocation stacks). This has been
> useful for finding caches that weren't being cleared properly.

Andrew McCreight [:mccr8]

Comment 1

12 years ago

I've dabbled in a tool to do this for CC logs. One thing to watch out for is "replay attacks": an object at address 0xFOO can get freed, then another object allocated at the same address. I'm not sure if you can deal with this problem without some kind of explicit support from the logging, like "Oh, here's a new log, and FYI, these objects that were in the old log have been freed." That sounds doable, but it will make it more annoying. Maybe trace-malloc suffers from this, and a feature like that isn't really critical.

Nicholas Nethercote [inactive]

Assignee

Comment 2

12 years ago

> I've dabbled in a tool to do this for CC logs. One thing to watch out for
> is "replay attacks": an object at address 0xFOO can get freed, then another
> object allocated at the same address.

Valgrind has a hack to work around this behaviour in a similar case: it holds onto freed blocks for a while so they don't get recycled too quickly. There's a flag that lets you control how long the blocks are held onto.
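For illustration only, here is a minimal Python sketch of that hold-onto-freed-blocks idea. It is not Valgrind's code: the `Quarantine` class, its `max_bytes` budget, and the `free` method are hypothetical names, and the sketch assumes the limit is expressed as a total number of held bytes.

```python
from collections import deque


class Quarantine:
    """Hypothetical model of a freed-block queue with a byte budget."""

    def __init__(self, max_bytes):
        self.max_bytes = max_bytes
        self.held = deque()      # (address, size) pairs, oldest first
        self.held_bytes = 0

    def free(self, addr, size):
        """Defer the release of a block.

        Returns the addresses that become reusable now because the
        budget was exceeded; the newest frees stay quarantined.
        """
        self.held.append((addr, size))
        self.held_bytes += size
        released = []
        while self.held_bytes > self.max_bytes:
            old_addr, old_size = self.held.popleft()
            self.held_bytes -= old_size
            released.append(old_addr)
        return released
```

With a larger budget an address stays out of circulation for longer, so two dumps taken close together are less likely to show the same address for two different objects.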

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Comment 3

12 years ago

The trace-malloc tool in question doesn't care about addresses; it only cares about the total amount of memory allocated at a given stack prefix.
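A rough Python sketch of that kind of address-free comparison follows. It is not the trace-malloc tool itself: the input format (a list of `(stack, size)` pairs with the outermost frame first) and the function names are assumptions made for the example.

```python
from collections import Counter


def bytes_by_prefix(blocks):
    """Sum block sizes under every prefix of each allocation stack.

    `blocks` is an assumed format: (stack, size) pairs, where `stack`
    is a sequence of frame names with the outermost frame first.
    """
    totals = Counter()
    for stack, size in blocks:
        for depth in range(1, len(stack) + 1):
            totals[tuple(stack[:depth])] += size
    return totals


def diff_prefix_totals(before, after):
    """Return {prefix: byte delta} for prefixes whose total changed."""
    t1 = bytes_by_prefix(before)
    t2 = bytes_by_prefix(after)
    return {p: t2[p] - t1[p] for p in set(t1) | set(t2) if t2[p] != t1[p]}
```

Because only per-prefix totals are compared, a block that is freed and reallocated at the same address between the two dumps does not affect the result.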

David Baron :dbaron: (⌚️UTC-5, no longer working on Mozilla)

Comment 4

12 years ago

FYI, the trace-malloc capability is documented in https://wiki.mozilla.org/Performance:Leak_Tools#trace-malloc_with_diffbloatdump

Nicholas Nethercote [inactive]

Assignee

Comment 5

11 years ago

Attached patch: Add diff support to dmd.py

This patch implements diffs. Changes of note in dmd.py:

- Previously the file reading and output printing was done in a single function (main). Now it's split into two, because we have to read two files, diff them, and then print the output for the diff. I've used the term "digest" for the intermediate data in this process. See main() to understand this better.

- There's now code for diffing digests.

- All the sorting code now uses absolute values.

- The way blocks are aggregated into records has changed. It used to be based on the base-32 frame keys (e.g. "AXd") of the |allocatedAt| and |reportedAt| stack traces: if two blocks had matching frame keys for all their traces, they'd be put in the same record. It's now based on the frame values (e.g. "#00: foo (X.cpp:99)") of those same stack traces. This is because you can't sensibly compare frame keys from two different DMD runs, because they're quasi-random, but you can compare frame values.

There are also some new tests, and test_dmd.js needed some refactoring to handle tests that take two filenames.
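To make the last point concrete, here is a small Python 3 sketch of aggregating blocks by frame values and then diffing the results. It is not the code in the patch: the block layout (a dict with `allocatedAt` and `usableSize`), the record fields, and the function names are simplified assumptions, and |reportedAt| traces are ignored.

```python
from collections import defaultdict

ZERO = {"numBlocks": 0, "usableSize": 0}


def aggregate(blocks):
    """Group blocks into records keyed by the frame values of their stack.

    Each block is assumed to look like
    {"allocatedAt": ["#00: foo (X.cpp:99)", ...], "usableSize": 64}.
    """
    records = defaultdict(lambda: dict(ZERO))
    for block in blocks:
        rec = records[tuple(block["allocatedAt"])]
        rec["numBlocks"] += 1
        rec["usableSize"] += block["usableSize"]
    return dict(records)


def diff_records(records1, records2):
    """Return records2 minus records1, dropping records that cancel out."""
    result = {}
    for key in records1.keys() | records2.keys():
        r1 = records1.get(key, ZERO)
        r2 = records2.get(key, ZERO)
        delta = {field: r2[field] - r1[field] for field in ZERO}
        if any(delta.values()):
            result[key] = delta
    return result
```

Keying on the frame strings is what makes records from two separate runs line up; keys such as "AXd" would only match by accident.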

Attachment #8506571 - Flags: review?(continuation)

Nicholas Nethercote [inactive]

Assignee

Updated

11 years ago

Assignee: nobody → n.nethercote

Status: NEW → ASSIGNED

Andrew McCreight [:mccr8]

Comment 6

11 years ago

Comment on attachment 8506571
Add diff support to dmd.py

Review of attachment 8506571:
-----------------------------------------------------------------

::: memory/replace/dmd/dmd.py
@@ +69,3 @@
> self.usableSizes = collections.defaultdict(int)
>
> + def isZero(self, args):

Please add a comment to the effect that Record can be used to store the difference of two DMD records as well as an individual record. Maybe also add a comment on these three methods that they are for diffs? Part of me thinks it would be good to have RecordDiff as a subclass of Record, but I guess that's overengineering.

@@ +128,5 @@
>
> @staticmethod
> def cmpByUsableSize(r1, r2):
> # Sort by usable size, then req size, then by isSampled.
> + return cmp(abs(r1.usableSize), abs(r2.usableSize)) or \

Maybe this is a matter of taste, but I'd think you'd want all of the positive diffs (from largest to smallest), then all of the negative diffs (from largest to smallest), not mixed all together like this. If I get the diff of two DMD logs, I'm usually interested in either what appeared or (sometimes) what went away, not both.

@@ +468,5 @@
> + return records3
> +
> +
> +def diffDigests(args, d1, d2):
> + d3 = {}

It isn't a big deal, but the way you create the digest here is inconsistent with how the digest is created above.

::: memory/replace/dmd/test/script-diff1.json
@@ +1,5 @@
> +{
> + "version": 1,
> + "invocation": {
> + "dmdEnvVar": "--sample-below=127",
> + "sampleBelowSize": 127

micro nit: trailing space

Attachment #8506571 - Flags: review?(continuation) → review+

Nicholas Nethercote [inactive]

Assignee

Comment 7

11 years ago

> Maybe this is a matter of taste, but I'd think you'd want all of the
> positive diffs (from largest to smallest), then all of the negative diffs
> (from largest to smallest), not mixed all together like this.

I strongly disagree. For one, about:memory used to behave like you suggest and I ended up changing it to the sort-on-absolute approach because it ends up being more what people want. Secondly, because dmd.py only shows the first 1000 records, doing it your way is likely to cut off all of the large negative diff records :(

Andrew McCreight [:mccr8]

Comment 8

11 years ago

Yeah, I guess you'd have to sort by absolute value, trim off the excess, then sort by non-absolute value. I can always add some additional sort options later if it bothers me.
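That three-step ordering could look roughly like the Python sketch below. It is not part of dmd.py: `top_records` is a hypothetical helper, records are reduced to plain integer size deltas, and "sort by non-absolute value" is assumed to mean signed descending order.

```python
def top_records(deltas, limit=1000):
    """Keep the `limit` largest changes by magnitude, then order them by sign.

    Positive deltas come first (largest growth first), followed by the
    negative ones, so big shrinkages survive the trim instead of being
    cut off at the end of a purely signed sort.
    """
    largest = sorted(deltas, key=abs, reverse=True)[:limit]
    return sorted(largest, reverse=True)
```

For example, `top_records([5, -100, 3, 80, -1], limit=3)` keeps -100, 80 and 5 and returns them as `[80, 5, -100]`.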

Nicholas Nethercote [inactive]

Assignee

Comment 9

11 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/94c5d968e7e8

Carsten Book [:Tomcat]

Comment 10

11 years ago

Sorry, had to back this out in https://treeherder.mozilla.org/ui/#/jobs?repo=mozilla-inbound&revision=de805196bbc4 since one of these changes caused a permanent failure on 10.8 debug tests, like https://treeherder.mozilla.org/ui/logviewer.html#?job_id=3275566&repo=mozilla-inbound

Nicholas Nethercote [inactive]

Assignee

Comment 11

11 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/9fe610b8aa4f

Carsten Book [:Tomcat]

Comment 12

11 years ago

https://hg.mozilla.org/mozilla-central/rev/9fe610b8aa4f

Status: ASSIGNED → RESOLVED

Closed: 11 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla36

Categories

(Core :: DMD, defect)
