796444 - [email/activesync] ActiveSync does not automatically retry or report account problems.

Reporter

Description

•

12 years ago

[GitHub issue by nhirata on 2012-07-30T21:07:03Z, https://github.com/mozilla-b2g/gaia/issues/2974]
## Environment :
Otoro phone, build 2012-07-30
Taken from default.xml in b2g-distro: 
* "platform_build" revision= 2163d79af8c06ffcf7607a83e01dc5bf1107fd8e
* "gaia" revision= 759caf5aa151ce06ce580e0a9ddb6ffe24ff8334
* "mozilla-central" revision= 07226cfa1211b2f8fc7702345a7184fae7c3962b
* "gonk-misc" revision= c6a9a256cd4a1c53696488739d36778b9dbfc881

## Repro :
1. setup the wifi with a bad username/password for a wpa connection
2. launch email app
3. setup with hotmail account

## Expected :
* time out since there's no network connection; or an early warning, no network connection


## Actual :
* just waits...

## Note :
* logcat:

GH to BZ

Reporter

Comment 1

•

12 years ago

[GitHub comment by mozsquib on 2012-09-27T19:45:18Z]
I believe this will be fixed by PR #5325. Can you retest once that's merged?

GH to BZ

Reporter

Comment 2

•

12 years ago

[GitHub comment by mozsquib on 2012-09-27T22:39:14Z]
Ok, that PR is checked in now, so you should be able to test this out now.

Andrew Sutherland [:asuth] (he/him)

Updated

•

12 years ago

Component: Gaia → Gaia::E-Mail

Naoki Hirata :nhirata (please use needinfo instead of cc)

Updated

•

12 years ago

Whiteboard: [label:task][label:email][label:mentored][label:email-ui][LOE:S] → [label:task][label:email][label:mentored][label:email-ui][LOE:S][qa+]

Jim Porter (:squib)

Comment 3

•

12 years ago

asuth, what do you think about this:

Set an initial timeout for all XHRs to 10 seconds. Every time we receive a Progress event, update the timeout to be 5 seconds in the future. I think this should be pretty straightforward, since we just need to keep track of when the XHR started, and then use the Progress event's timestamp to figure out how much time has elapsed (the spec explicitly allows us to update the timeout in the middle of a request). This way, we can automatically adapt to large and small requests/responses coming through.

We might also want a cap on the timeout for sanity. Say 5 minutes?

Andrew Sutherland [:asuth] (he/him)

Comment 4

•

12 years ago

This sounds very reasonable to me.  I think I would avoid the maximum cap logic for now.  I'm not so concerned about the potential for 'jumpy NTP clock problems' since Date.now is not monotonically increasing, just the extra edge cases.  That's something we can do later.  And of course let's try and avoid this being a foot-gun when we adopt push notifications :)

Jim Porter (:squib)

Comment 5

•

12 years ago

Push notifications will naturally have their own logic for timeouts (probably just a constant, large value). Luckily, the responses for push notifications are basically guaranteed to be very small.

Jim Porter (:squib)

Comment 6

•

12 years ago

Oh, and for working around the fact that time is a world of pain, we could just make sure we start with a reasonably-long timeout (10-20 seconds is probably plenty) and make sure never to decrease the accumulated timeout value. Then, occasional hiccups with the timestamp won't immediately kill our connection.

Jim Porter (:squib)

Comment 7

•

12 years ago

I guess another option would be to use setTimeout and continually cancel and restart the setTimeout callback as we get progress notifications...

Andrew Sutherland [:asuth] (he/him)

Comment 8

•

12 years ago

Yeah, I thought you were proposing to just clearTimeout on every progress and do a new setTimeout.  Since that's all relative time, jumpy time is not our problem in that case (and I think the right NSPR stuff happens under the hood.)

Jim Porter (:squib)

Comment 9

•

12 years ago

I should clarify then: I was originally thinking about using the |timeout| attribute on the XHR, which lets you specify the timeout in milliseconds. We could just keep adding to that timeout every time we got a Progress event from the XHR, but that would (I think) involve using clock time.

Dylan Oliver [:doliver]

Comment 11

•

12 years ago

The title of this one is about Activesync but will the fix serve as a more general solution to handling server communication issues in the email app?

Jim Porter (:squib)

Comment 12

•

12 years ago

It'll probably fix some of the setup issues, but other than that, I have no idea.

Mike Connor [:mconnor]

Updated

•

12 years ago

Priority: P1 → --

Mike Connor [:mconnor]

Updated

•

12 years ago

Priority: -- → P3

Jim Porter (:squib)

Updated

•

12 years ago

Assignee: nobody → squibblyflabbetydoo

Status: NEW → ASSIGNED

Jim Porter (:squib)

Comment 13

•

12 years ago

Partially fixed in: https://github.com/mozilla-b2g/gaia/pull/6447
                    https://github.com/mozilla-b2g/gaia-email-libs-and-more/pull/76

Leaving this bug open so that we can also add some retry/backoff code to ActiveSync (like IMAP already has).

Jim Porter (:squib)

Comment 14

•

12 years ago

asuth: Do you think it makes sense to allow the user to abort the XHRs in the autoconfig process as a part of this bug? It broadly falls under the scope of what comment 0 is talking about, but it's not exactly the same. Mainly, this would have the benefit of saving network resources and preventing possible breakage if the user hit "back" when trying to autoconfig (e.g. if they realized they typed their email address in wrong).

Jim Porter (:squib)

Comment 15

•

12 years ago

Actually, I guess that needs to wait until I hear back from Andreas about what he'd like us to do with making setup non-blocking.

Dietrich Ayala (:dietrich)

Comment 16

•

12 years ago

Been almost 2 weeks - what needs to happen in this bug?

Jim Porter (:squib)

Comment 17

•

12 years ago

(In reply to Dietrich Ayala (:dietrich) from comment #16)
> Been almost 2 weeks - what needs to happen in this bug?

I'm probably going to allow canceling the requests during autoconfigure, since it's easy, and we also want to be able to retry failed attempts to talk to the server for ActiveSync. IMAP already supports this, so it should be straightforward to make ActiveSync do the same.

Alex Keybl [:akeybl]

Comment 18

•

12 years ago

The user impact here seems fairly minimal, and should be uncommon - marking as a soft blocker.

blocking-basecamp: + → -

Priority: P3 → P4

ayman maat :maat

Comment 19

•

12 years ago

no updates on this bug for some time. RFI to Jim to enquire what the current status of the bug is.

Flags: needinfo?(squibblyflabbetydoo)

Dylan Oliver [:doliver]

Comment 20

•

12 years ago

Ayman, this bug was marked blocking- on 11/30 so it's not getting any attention right now.

ayman maat :maat

Comment 21

•

12 years ago

(In reply to Dylan Oliver [:doliver] from comment #20)
> Ayman, this bug was marked blocking- on 11/30 so it's not getting any
> attention right now.

Thanks for the response Dylan. On what grounds could i ask was it defined as not blocking? From a UX perspective if the app enters an error state due to loss of connection and the user is not informed of the error, and therefore cannot identify the error and recover from it this is a blocking bug by definition.

RFI to josh to bring this bug back into triage.

Flags: needinfo?(jcarpenter)

Whiteboard: [label:task][label:email][label:mentored][label:email-ui][LOE:S][qa+] → [label:task][label:email][label:mentored][label:email-ui][LOE:S][qa+] interaction [UX-P2]

Andrew Sutherland [:asuth] (he/him)

Comment 22

•

12 years ago

I somewhat recently landed an additional error handling/feedback patch that built on Jim's error/timeout improvements in comment 13.  We should now display an error during synchronization with a status banner thing across the bottom that offers the ability to retry.

There is, however, no automatic retry capability or connection status tracking.  Because ActiveSync doesn't rely on a long-lived connection, the retries are somewhat less important since there's less chance of a connection 'going stale' since every request is a new XHR.  However, it would be useful in that it could help us more confidently report that we seem to be having trouble connecting to a server.  And then once we know that is happening, like IMAP, we can stop retrying until we have an indication the server is back.

Flags: needinfo?(squibblyflabbetydoo)

Summary: [email/activesync] email does not time out if there's no network connection or if the connection destination is unreachable. → [email/activesync] ActiveSync does not automatically retry or report account problems.

Josh Carpenter [:jcarpenter]

Comment 23

•

12 years ago

Changing needsinfo to Casey, who is the UX owner for Email and the right person to weigh in here.

Flags: needinfo?(jcarpenter) → needinfo?(kyee)

Casey Yee

Comment 24

•

12 years ago

Having the error feedback as per Andrews comment 22 is what is important here.   The user still has the ability to manually re-check email which in my opinion is sufficient for V1.

Flags: needinfo?(kyee)

Naoki Hirata :nhirata (please use needinfo instead of cc)

Updated

•

11 years ago

blocking-b2g: --- → koi?

Dylan Oliver [:doliver]

Comment 25

•

11 years ago

We haven't seen problems in this area for some time -- please renominate if this pops up again.

blocking-b2g: koi? → -

Jim Porter (:squib)

Comment 26

•

11 years ago

Yeah, I'd say this is one of those bugs which would be nice to have if someone had a little bit of free time, but otherwise it's not a big deal. Since ActiveSync doesn't keep connections open*, retrying happens at a different layer, and shouldn't cause problems. Fixing this might make things a little more efficient/friendly though.

* If we enable push mail, this might become more important, since that involves long-lived connections.

Jim Porter (:squib)

Comment 27

•

10 years ago

I'm not working on this, so resetting the assignee.

Assignee: squibblyflabbetydoo → nobody

Status: ASSIGNED → NEW

Mike Hoye [:mhoye]

Comment 28

•

10 years ago

This bug is looking for a mentor. Casey?

Flags: needinfo?(kyee)

Whiteboard: [label:task][label:email][label:mentored][label:email-ui][LOE:S][qa+] interaction [UX-P2] → [label:task][label:email][mentor needed][label:email-ui][LOE:S][qa+] interaction [UX-P2]

Jim Porter (:squib)

Comment 29

•

10 years ago

I'm probably the most obvious mentor for this, but I don't know if we even need this anymore. We could possibly just close it (but it'd be best to ask Andrew when he gets back from vacation).

Mike Hoye [:mhoye]

Updated

•

10 years ago

Whiteboard: [label:task][label:email][mentor needed][label:email-ui][LOE:S][qa+] interaction [UX-P2] → [label:task][label:email][mentor wanted][label:email-ui][LOE:S][qa+] interaction [UX-P2]

Casey Yee

Comment 30

•

10 years ago

redirecting to juwei since she is the ux owner for this.

Flags: needinfo?(kyee) → needinfo?(jhuang)

Juwei Huang[:juwei]

Comment 31

•

10 years ago

Hi Andrew, is this bug resolved or we need an alternate ux solution for it?

Flags: needinfo?(jhuang) → needinfo?(bugmail)

Andrew Sutherland [:asuth] (he/him)

Comment 32

•

10 years ago

I'm going to just resolve this as WFM since I think we ended up implementing all the ActiveSync fixes we thought we needed already on other bugs.  Things like us not telling the user we're offline are not ActiveSync specific and are handled by existing UI bugs.

The only thing we could do that we didn't do was do more exponential backoff stuff, but as noted in many places, it's not clear what win that gets us.  IMAP has (or at least we're more aware of) many transient failure modes that we understand and where our backoff is appropriate.  And we've seen servers generate those failures in our experience.  While ActiveSync has some state indication like that, we generally have not seen it used on Microsoft-authored implementations, suggesting it's not super-needed.  And on the other implementations, well, they tend to be bad and they probably have learned not to generate errors clients don't expect.

We'll file new bugs for any enhancements we can think of for ActiveSync or just address them on bugs as the need arises.

Status: NEW → RESOLVED

Closed: 10 years ago

Flags: needinfo?(bugmail)

Resolution: --- → WORKSFORME