1894672 - Intermittent comm/mail/base/test/browser/browser_getMessages_certError.js | showAlert should not be called while an alert is showing

Reporter

Description

•

7 months ago

treeherder

Filed by: mkmelin [at] iki.fi
Parsed log: https://treeherder.mozilla.org/logviewer?job_id=456667166&repo=comm-central
Full log: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/G-rIbi6QQf6vI1hWi05V5A/runs/0/artifacts/public/logs/live_backing.log

Magnus Melin [:mkmelin]

Updated

•

7 months ago

Keywords: regression

Regressed by: 1893899

Geoff Lankow (:darktrojan)

Updated

•

7 months ago

Summary: Intermittent comm/mail/base/test/browser/browser_getMessages_certError.js | single tracking bug → Intermittent comm/mail/base/test/browser/browser_getMessages_certError.js | showAlert should not be called while an alert is showing

Comment hidden (Intermittent Failures Robot)

Geoff Lankow (:darktrojan)

Comment 6

•

5 months ago

Attached file Bug 1894672 - Wait for repeated alerts before moving on in connection error tests. r=#thunderbird-reviewers

Phabricator Automation

Updated

•

5 months ago

Assignee: nobody → geoff

Status: NEW → ASSIGNED

Geoff Lankow (:darktrojan)

Updated

•

5 months ago

Duplicate of this bug: 1894772

Comment hidden (Intermittent Failures Robot)

Geoff Lankow (:darktrojan)

Updated

•

5 months ago

Keywords: checkin-needed-tb

Target Milestone: --- → 129 Branch

Pulsebot

Comment 9

•

5 months ago

Pushed by kaie@kuix.de:
https://hg.mozilla.org/comm-central/rev/2543f8548643
Wait for repeated alerts before moving on in connection error tests. r=mkmelin

Status: ASSIGNED → RESOLVED

Closed: 5 months ago

Keywords: checkin-needed-tb

Resolution: --- → FIXED

Magnus Melin [:mkmelin]

Comment 10

•

5 months ago

Still happening :(

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Geoff Lankow (:darktrojan)

Comment 11

•

5 months ago

I think there must be an actual bug (or more than one) here. It seems all the failures are from the same point in the test now. At that point the code shouldn't be making two alerts in the first place, and the second one is within the 1000ms waiting period.

Unfortunately whenever I try to reproduce this on the try server it refuses to happen.

Geoff Lankow (:darktrojan)

Comment 12

•

5 months ago

I wonder if it's the first alert that's not supposed to happen. That could be the case, since the IMAP code might make more than one alert (the POP3 code shouldn't, but that's where the failure occurs) and they'd have the same text.

Comment hidden (Intermittent Failures Robot)

Kai Engert [:KaiE:]

Comment 14

•

5 months ago

I should review the changes that were made in bug 1893899.

Flags: needinfo?(kaie)

Kai Engert [:KaiE:]

Comment 15

•

5 months ago

I'm testing this on Fedora Linux with the Gnome Desktop.

When I get a bad cert error, I get an error popup.
I don't know what that type of popup is called. It's shown outside of Thunderbird, in the desktop environment's area with all notifications.

Is that the type of alert that's mentioned here?

It seems that alert isn't modal.

If it isn't modal, why would it be a problem to trigger another alert while the first one is still showing?

Geoff Lankow (:darktrojan)

Comment 16

•

5 months ago

Because it's pointless to show several identical notifications just because the code is badly written. That's what is happening; there aren't separate problems for the user to know about.

This test failure is (I think) due to the CI machines being resource constrained and things taking longer than expected. We should wait for an event instead of an arbitrary time, but that's very difficult in this case.

Kai Engert [:KaiE:]

Comment 17

•

5 months ago

It isn't clear to me which of the following happens. Can you clarify?

(a) We have logic to detect additional notifications, and that code should prevent the additional ones, but that code isn't working.

(b) We don't expect to be notified about a cert error twice, but that's what happens, and that triggers the duplicate notification?

Kai Engert [:KaiE:]

Comment 18

•

5 months ago

The test has this comment:
// There could be multiple alerts for the same problem. These are swallowed
// while the first alert is open, but we should wait a while for them.

I cannot find the code that is expected to swallow (suppress) alerts while the first one is open.

Kai Engert [:KaiE:]

Comment 19

•

5 months ago

(In reply to Kai Engert (:KaiE:) from comment #18)

> I cannot find the code that is expected to swallow (suppress) alerts while the first one is open.

I think I found it: activeAlerts in alertHook.sys.mjs.

I have a theory:

You trigger get messages.
The cert error is triggered and showAlert is called.
Then you call observe("alertclickcallback").
alertHook.observe opens an exceptionDialog with prefetchCert: true.
Might that trigger subsequent network activity and a new cert error?
Immediately after you request the dialog, you call activeAlerts.delete(),
so from then on the same cert error is no longer suppressed.
That means that if the dialog triggers the error, onCertError will call showAlert.
However, at this time, MockAlertsService._alert is still set.
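
The theory above can be sketched as a toy model. This is only an illustration under stated assumptions, not the real alertHook.sys.mjs or the real MockAlertsService: the Set-based tracking, the key format, `onAlertClicked`, and the `failures` array are all hypothetical stand-ins. Only the names activeAlerts, onCertError, showAlert and `_alert` come from the discussion.

```javascript
// Toy model only: NOT the real alertHook.sys.mjs or MockAlertsService.
// Assumption: open alerts are tracked by a key in a Set.
const activeAlerts = new Set();

// Hypothetical mock that can only track a single alert at a time.
const mockAlertsService = {
  _alert: null,
  failures: [],
  showAlert(key) {
    if (this._alert !== null) {
      // Mirrors the wording of the test failure in this bug.
      this.failures.push(
        "showAlert should not be called while an alert is showing"
      );
    }
    this._alert = key;
  },
};

// Hypothetical stand-in for the cert error handler.
// Returns true if an alert was shown, false if it was swallowed.
function onCertError(key) {
  if (activeAlerts.has(key)) {
    return false; // an alert for this problem is already tracked
  }
  activeAlerts.add(key);
  mockAlertsService.showAlert(key);
  return true;
}

// Hypothetical stand-in for the click handling: tracking is dropped
// while the mock still considers the first alert to be showing.
function onAlertClicked(key) {
  activeAlerts.delete(key);
}

const key = "notyetvalid.test.test:993";
const shownFirst = onCertError(key); // first alert is shown
const swallowed = onCertError(key); // duplicate is swallowed
onAlertClicked(key); // suppression entry removed
const shownAgain = onCertError(key); // same error, no longer suppressed
```

In this model the third call reaches `showAlert` while `_alert` is still set, which is the situation the theory describes. Whether the real code actually takes this path is exactly what the theory leaves open.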

Flags: needinfo?(kaie)

Kai Engert [:KaiE:]

Comment 20

•

5 months ago

Hmm, that exception dialog is modal. If the call to openDialog blocks until after the dialog gets closed, then activeAlerts still has the tracker entry and should cause suppression...

Kai Engert [:KaiE:]

Comment 21

•

5 months ago

I was able to reproduce this locally; it happened when I had lots of logging enabled.

we start the notyetvalid test
we get the IMAP cert error
we run through the exception dialog, the exception is present
we reach alertfinished
I see again "Requesting notyetvalid.test.test:993"
again we arrive in onCertError for IMAP :993
the alert is shown

However, before we reach observe("alertclickcallback"),
we reach onCertError for POP :995,
which also calls showAlert

So we are in fact showing different errors in parallel, not the same one, at least in the scenario that I see.
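
The observed sequence can be modelled roughly as follows. This is a sketch under assumptions, not the actual code: it assumes suppression is keyed per protocol and host:port (the real key is not stated here), it assumes the POP error uses the same host as the IMAP one, and `showing`, `overlap` and `calls` are hypothetical names for the state of a single global mock.

```javascript
// Toy model only; apart from onCertError and showAlert, all names are hypothetical.
const activeAlerts = new Set(); // assumed: one entry per distinct error
const calls = []; // every showAlert call that reached the mock
let showing = null; // the single alert a global mock can track
let overlap = false; // set when showAlert arrives while an alert is showing

function showAlert(key) {
  if (showing !== null) {
    overlap = true;
  }
  showing = key;
  calls.push(key);
}

function onCertError(protocol, hostPort) {
  const key = `${protocol}:${hostPort}`;
  if (activeAlerts.has(key)) {
    return; // same error again: swallowed
  }
  activeAlerts.add(key);
  showAlert(key);
}

// IMAP error: alert shown.
onCertError("imap", "notyetvalid.test.test:993");
// Same IMAP error again: swallowed by the suppression set.
onCertError("imap", "notyetvalid.test.test:993");
// POP error arrives before the IMAP alert is clicked: a different key,
// so it is not swallowed and reaches the mock while an alert is showing.
onCertError("pop3", "notyetvalid.test.test:995");
```

Under these assumptions, per-error suppression does nothing for the second protocol, so the single global mock sees two overlapping alerts even though no duplicate of the same error was shown.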

Kai Engert [:KaiE:]

Comment 22

•

5 months ago

Instead of one global MockAlertsService, would it work to create a separate instance of MockAlertsService for each port/protocol you're testing, and have a member variable instead of a static member?
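
A minimal sketch of that idea, assuming more than one mock could be in use at the same time (which is not established here). The class name, its constructor argument and `finish()` are hypothetical and do not come from the existing test code.

```javascript
// Hypothetical mock; not the real MockAlertsService from the test suite.
class PerConnectionMockAlerts {
  _alert = null; // instance member, so each connection has its own state

  constructor(label) {
    this.label = label;
  }

  showAlert(text) {
    if (this._alert !== null) {
      throw new Error(
        `${this.label}: showAlert should not be called while an alert is showing`
      );
    }
    this._alert = text;
  }

  finish() {
    this._alert = null;
  }
}

const imapAlerts = new PerConnectionMockAlerts("imap:993");
const popAlerts = new PerConnectionMockAlerts("pop3:995");

imapAlerts.showAlert("cert error");
popAlerts.showAlert("cert error"); // separate instance, so no collision

// A second alert on the same instance is still rejected.
let duplicateRejected = false;
try {
  imapAlerts.showAlert("cert error");
} catch (e) {
  duplicateRejected = true;
}
```

With instance state, alerts for different ports no longer trip over each other, while a repeated alert for the same port is still caught.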

Kai Engert [:KaiE:]

Comment 23

•

5 months ago

(In reply to Kai Engert (:KaiE:) from comment #22)

> Instead of one global MockAlertsService, would it work to create a separate instance of MockAlertsService for each port/protocol you're testing, and have a member variable instead of a static member?

Maybe that doesn't work, if only one global mock object may be registered at any time.

Comment hidden (Intermittent Failures Robot)

Geoff Lankow (:darktrojan)

Updated

•

4 months ago

Assignee: geoff → nobody

Comment hidden (Intermittent Failures Robot)