Closed Bug 1754536 Opened 2 years ago Closed 2 years ago

Land various intermittent test fixes originally part of bug 1749775 and reenable verify

Categories

(Firefox :: Messaging System, task, P1)

task

Tracking

()

RESOLVED FIXED
101 Branch
Iteration:
101.2 - April 18 - April 29
Tracking Status
firefox101 --- fixed

People

(Reporter: dmosedale, Assigned: dmosedale)

References

(Blocks 1 open bug)

Details

Attachments

(1 file, 6 obsolete files)

To land bug 1749775 in a way that is upliftable, we had disabled a test and multiple asserts, that were intermittently failing on all platforms, as well as add a skip-if for verify. The tests need to be fixed and verify should be re-enabled.

Here's how I can reproduce the issues:

  • ./mach try fuzzy --no-artifact
  • type mochitest-browser-chrome to find all such tests
  • press Ctrl-A to select them all
  • hit enter to accept

There were lots of problems with the weird interactions and cleanup in the browser_private_browsing_about_nimbus.js file (all the other known ones have been fixed or worked around). It's conceivable that once we get everything migrated to messaging system and away from the bespoke Nimbus feature code (bug 1748124), and all the tests in that file should use setupMSExperimentWithMessage with its associated cleanup code, the problem will disappear.

However, the wiser thing to do may be to simply split that file apart into two files to cut down on the number of possible cross-test interactions.

Flags: needinfo?(andrei.br92)
Summary: Fix and re-enable test_experiment_messaging_system_dismiss test → Land various intermittent test fixes originally part of bug 1749775 and reenable verify
Assignee: nobody → dmosedale
Iteration: --- → 99.1 - Feb 7 - Feb 20
Priority: -- → P1

Depends on D138834

Depends on D138835

Depends on D138837

Depends on D138838

Depends on D138839

To land bug 1749775 in a way that is upliftable, we had disabled a test and multiple asserts, that were intermittently failing on all platforms, as well as add a skip-if for verify. The tests need to be fixed and verify should be re-enabled.

Here's how I can reproduce the issues:

./mach try fuzzy --no-artifact
type mochitest-browser-chrome to find all such tests
press Ctrl-A to select them all
hit enter to accept

There were lots of problems with the weird interactions and cleanup in the browser_private_browsing_about_nimbus.js file (all the other known ones have been fixed or worked around). It's conceivable that once we get everything migrated to messaging system and away from the bespoke Nimbus feature code (bug 1748124), and all the tests in that file should use setupMSExperimentWithMessage with its associated cleanup code, the problem will disappear.

However, the wiser thing to do may be to simply split that file apart into two files to cut down on the number of possible cross-test interactions.

Flags: needinfo?(andrei.br92)

I've attached the work-in-progress patch stack, which I expect to squish down to a smaller number of commits before landing.

This is currently using requestLongerTimeout; we probably want to split it into at least two files instead. That could be a separate bug, though.

We only have a small number of intermittents left (currently 2; there used to be 10-20 or more on this try run):

https://treeherder.mozilla.org/jobs?repo=try&revision=7219b7bd345f313a57b26eec182265c0d51e3cdc

Depends on: 1753917
Iteration: 99.1 - Feb 7 - Feb 20 → 99.2 - Feb 21 - Mar 6

Punam, Andrei and I chatted about this last week, and here are some useful links and a plan that came out of that meeting:

Punam’s debugging work: https://phabricator.services.mozilla.com/D138978

Useful links for telemetry testing and intermittent diagnosis:

Plan to fix intermittents

  • Use existing patch stack in this bug
  • Remove local branch commits making the category on WaitForEventsToClear be optional: always wait for “normandy”
  • Split out each new MS test (and anything with TelemetryTestUtils) into its own file
  • If we’re not there yet:
    switch aboutPrivateBrowsing.js to message that parent using RemotePageAccessManager so that the exposure telemetry is sent from AboutPrivateBrowsingParent rather than AboutPrivateBrowsingChild
  • change the tests to check “parent” rather than “content”.

Probably want to also have a patch (could be part of a different bug) to add these debugging links to the part of firefox-source-docs about messaging system debugging.

Depends on: 1749775
Priority: P1 → P2
Iteration: 99.2 - Feb 21 - Mar 6 → 100.1 - March 1 - March 18
Iteration: 100.1 - March 1 - March 18 → 100.2 - March 21 - April 1
Attachment #9264025 - Attachment description: WIP: Bug 1754536 - Initial test cleanup → Bug 1754536 - Fix intermittent failures in private browsing nimbus tests, r?Mardak
Attachment #9264030 - Attachment is obsolete: true
Attachment #9264029 - Attachment is obsolete: true
Attachment #9264028 - Attachment is obsolete: true
Attachment #9264027 - Attachment is obsolete: true
Attachment #9264026 - Attachment is obsolete: true
Attachment #9264031 - Attachment is obsolete: true
Iteration: 100.2 - March 21 - April 1 → 101.1 - April 4 - April 15
Iteration: 101.1 - April 4 - April 15 → 101.2 - April 18 - April 29
Attachment #9264025 - Attachment description: Bug 1754536 - Fix intermittent failures in private browsing nimbus tests, r?Mardak → Bug 1754536 - Fix intermittent failures in private browsing nimbus tests, r?pdahiya
Priority: P2 → P1
Blocks: 1766531
Pushed by dmosedale@mozilla.com:
https://hg.mozilla.org/integration/autoland/rev/0ab99a4d3c5c
Fix intermittent failures in private browsing nimbus tests, r=pdahiya
Status: NEW → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 101 Branch
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: