Bug 1045316 (Open): opened 10 years ago, updated 2 years ago

Add end-to-end testing of our leak checking infrastructure

Categories: Testing :: General, defect
Tracking: Not tracked
Reporter: mccr8
Assignee: Unassigned
References: Blocks 1 open bug
Whiteboard: [MemShrink:P2]

The leak checking infrastructure requires many parts to work, from Gecko itself to the test harness, to buildbot, to TBPL. It would be nice if we had some testing that confirms it is all actually working.

This is not merely an academic concern, as currently both our regular leak checking (bug 1045255) and LSan (bug 1044206) appear to have silently broken.

The simplest possible approach would be to introduce an intentional intermittent leak that fails about once a day. Then you get some people to follow the bug, and if it goes silent for too long, you know something is wrong. Bug 1033464 sort of served this purpose, but it was a little too intermittent, so nobody noticed when it went silent. So this is not a great approach, but it might be better than nothing for the moment.

A more comprehensive approach would be to directly test the various pieces of the leak checking infrastructure, using some kind of intentional leak that the harness ignores. I'm not sure how that would work.

This is a general problem with all of our testing infrastructure; we've had similar problems with regressions in the reftest harness.

I think recent events have probably shown that we should be thinking about testing our tests.  We probably don't (yet) need the next level down.  Somehow we need to avoid having elephants all the way down, though.

I've long wanted to introduce something like Mercurial's "t tests" for testing shell output (like mach commands): http://mercurial.selenic.com/wiki/WritingTests

They even let you test exit codes, and you are running the actual commands that users run. For an example, see https://hg.mozilla.org/hgcustom/version-control-tools/file/7efa5dc02222/hgext/reviewboard/tests/test-commits-deleted-obsolescence.t
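
For anyone who hasn't seen the format: a .t file interleaves shell commands with their expected output, and the runner diffs the actual output (and exit codes, shown in brackets) against the file. A minimal made-up sketch:

  $ echo hello
  hello
  $ false
  [1]
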
The closest thing we have to this is the xpcshell selftests:
http://mxr.mozilla.org/mozilla-central/source/testing/xpcshell/selftest.py

I've found these really useful as they allow us to test that the harness is actually doing what we intend it to do, and we can also test failures. We could probably write an equivalent set of tests for any test harness.
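
The pattern is roughly: write out a test that is known to pass or fail, run the real harness on it as a subprocess, and assert on what the harness reports. A minimal sketch of that idea in plain unittest (the harness invocation and log strings here are illustrative, not the actual selftest.py API):

import subprocess
import tempfile
import unittest

# A test file that is guaranteed to fail, so we can check that the
# harness actually notices and reports the failure.
FAILING_TEST = """
function run_test() {
  do_check_true(false);  // deliberately fail
}
"""

class HarnessSelfTest(unittest.TestCase):
    def test_failure_is_reported(self):
        with tempfile.NamedTemporaryFile("w", suffix=".js", delete=False) as f:
            f.write(FAILING_TEST)
        # Illustrative invocation; the real harness takes more arguments.
        proc = subprocess.run(
            ["python", "runxpcshelltests.py", f.name],
            capture_output=True, text=True)
        self.assertNotEqual(proc.returncode, 0)
        self.assertIn("TEST-UNEXPECTED-FAIL", proc.stdout)

if __name__ == "__main__":
    unittest.main()
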
Whiteboard: [MemShrink] → [MemShrink:P1]
This is a little less fragile than it was before, as we now produce an error if a process creates a leak log but does not actually write any leak information into it.
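
The check amounts to something like this (the marker string and the error message approximate the real harness code rather than copying it):

import os

# BloatView leak logs end with a cumulative statistics block; if the
# log file exists but the summary never appeared, the instrumentation
# probably broke silently.
LEAK_SUMMARY_MARKER = "== BloatView: ALL (cumulative) LEAK AND BLOAT STATISTICS"

def check_leak_log(path):
    if not os.path.exists(path):
        return  # a missing log is handled elsewhere
    with open(path) as f:
        contents = f.read()
    if LEAK_SUMMARY_MARKER not in contents:
        print("TEST-UNEXPECTED-FAIL | leakcheck | no leak summary in %s" % path)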

Bug 1121670 is another example of why having end-to-end testing is important: I accidentally broke the Treeherder starring suggestions for leakcheck for four months, so we have no idea how often various intermittent leaks happened in that period. This isn't the end of the world, because something that leaked frequently would still get backed out by the sheriffs, and I think that if it were happening a lot, the sheriffs would have complained earlier. This problem was only noticed because when I turned on leak checking for content processes, there was a very frequent leak, and RyanVM complained to me about it. But it still isn't great.

Geoff, do you know if anyone can look into this? It has definitely bitten us in the past. mccr8 suggested that just having a task that checks whether any leaks have been starred in the past week might be good enough; a sketch of that idea follows below.
Flags: needinfo?(gbrown)
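
A minimal sketch of such a task, assuming it can piggyback on the bot comments that starring leaves on a tracked intermittent-leak bug (the bug ID and the one-week threshold are placeholders):

import json
import urllib.request
from datetime import datetime, timezone

BUG_ID = 1033464  # placeholder: a known intermittent leak bug to watch

def last_comment_age_days(bug_id):
    # The Bugzilla REST API returns each comment with an ISO-8601
    # creation_time; starring bots leave comments on the bug.
    url = "https://bugzilla.mozilla.org/rest/bug/%d/comment" % bug_id
    with urllib.request.urlopen(url) as resp:
        data = json.load(resp)
    comments = data["bugs"][str(bug_id)]["comments"]
    newest = max(
        datetime.fromisoformat(c["creation_time"].replace("Z", "+00:00"))
        for c in comments)
    return (datetime.now(timezone.utc) - newest).days

if last_comment_age_days(BUG_ID) > 7:
    print("WARNING: no starred leaks in over a week; "
          "leak checking may have silently broken")
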
Whiteboard: [MemShrink:P1] → [MemShrink:P2]
:ahal added mochitest/reftest harness "self-tests", including some tests for leak logging. Have those tests "missed" recent issues? :ahal -- do you have thoughts (plans!?) on next steps?

https://dxr.mozilla.org/mozilla-central/rev/c291143e24019097d087f9307e59b49facaf90cb/testing/mochitest/tests/python/test_basic_mochitest_plain.py#104
https://dxr.mozilla.org/mozilla-central/rev/c291143e24019097d087f9307e59b49facaf90cb/layout/tools/reftest/selftest/test_reftest_output.py#123
Flags: needinfo?(gbrown) → needinfo?(ahal)
I couldn't figure out how to reliably produce a leak for testing purposes, so those tests mock out a leaklog instead of actually getting Gecko to produce one. They basically just test that the log processing stuff in mochitest/reftest is working and nothing else.

If someone can write a mochitest/reftest (or devise some other mechanism) for me that causes a leaklog to get generated, we could start using that instead of mocking it out. But beyond that I don't have any thoughts or plans.
Flags: needinfo?(ahal)
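
For context, the mocking approach amounts to handing the harness's log-processing code a hand-written log and asserting on what it extracts. A rough sketch (the log text approximates the BloatView format, and the parser is illustrative rather than the harness's actual code):

# A hand-written leak log in (approximately) the BloatView format.
FAKE_LEAK_LOG = """\
== BloatView: ALL (cumulative) LEAK AND BLOAT STATISTICS, default process 1234
     |<----------------Class--------------->|<-----Bytes------>|<----Objects---->|
     |                                      | Per-Inst   Leaked|   Total      Rem|
   0 |TOTAL                                 |       20      120|     100        3|
  10 |nsFoo                                 |       40      120|      10        3|
"""

def parse_total_leaked(log_text):
    # Pull the "Leaked" bytes column out of the TOTAL row.
    for line in log_text.splitlines():
        if "|TOTAL" in line:
            fields = [f.strip() for f in line.split("|")]
            return int(fields[2].split()[1])  # [Per-Inst, Leaked]
    raise ValueError("no TOTAL line in leak log")

assert parse_total_leaked(FAKE_LEAK_LOG) == 120
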
Severity: normal → S3