Closed Bug 1120001 Opened 9 years ago Closed 9 years ago

direct calls incorrectly say "caller unavailable" instead of "something went wrong" sometimes

Tracking

(firefox36-)

Status:

RESOLVED FIXED

Milestone:

mozilla38

Iteration:

38.1 - 26 Jan

Tracking Flags:

Tracking

Status

firefox36

---

People

(Reporter: dmosedale, Assigned: jaws)

References

Details

(Keywords: ux-userfeedback)

Attachments

(3 files)

Server change [landed]: Split out USER_UNAVAILABLE from INVALID_PARAMETERS so clients can show better error messages 9 years ago Dan Mosedale (:dmosedale, :dmose) 3.02 KB, patch		Details \| Diff \| Splinter Review
client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose 9 years ago Dan Mosedale (:dmosedale, :dmose) 5.11 KB, patch	dmosedale : review+	Details \| Diff \| Splinter Review
Update documentation 9 years ago Rémy Hubscher (:natim) 48 bytes, text/x-github-pull-request	alexis+bugs : review+	Details \| Review

Dan Mosedale (:dmosedale, :dmose)

Reporter

Description

•

9 years ago

This is confusing to the user, since they may need to ask for debugging help if things are going wrong for them.  It also messes up our metrics.

The problem is that right now, if you try to call an email address or phone number that doesn't have an account, the server REST API returns errno 107, which is INVALID_PARAMETERS, which can include other problems as well.

We need to create a separate error code for at least this case (USER_UNAVAILABLE?), and then implement both client and server support for it.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 1

•

9 years ago

At the same time, I suspect it'll make sense to kill the "user-unknown" websocket termination reason documented at <https://docs.services.mozilla.com/loop/apis.html#termination-reasons>, both in the client validator and documention.  We know the problem in advance of setting up the websocket, and the server doesn't use that reason at all.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Updated

•

9 years ago

Depends on: 1120003

Mark Banner (:standard8)

Updated

•

9 years ago

Blocks: 1120003

No longer depends on: 1120003

Mark Banner (:standard8)

Updated

•

9 years ago

No longer blocks: 1119992

Depends on: 1119992

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 2

•

9 years ago

Adam, do these things seem like a reasonable approach to you?

* get rid of the unused websocket code
* split out a separate rest errno for user unavailable
* use the name USER_UNAVAILABLE rather than USER_UNKNOWN based on the speculation that it's since it's more opaque, at least unsophisticated spammers would be less likely to consider using it to test for the existence for accounts

Flags: needinfo?(adam)

Keywords: ux-userfeedback

Dan Mosedale (:dmosedale, :dmose)

Reporter

Updated

•

9 years ago

Assignee: nobody → jaws

Adam Roach [:abr]

Comment 3

•

9 years ago

(In reply to Dan Mosedale (:dmose) - needinfo? me for response from comment #2)
> Adam, do these things seem like a reasonable approach to you?
> 
> * get rid of the unused websocket code

I think that's okay.

> * split out a separate rest errno for user unavailable

Sure. Do you know where the authoritative database for these errnos is, by the way? The numbering on them seems to be skipping around as if to avoid codes used in other services.

> * use the name USER_UNAVAILABLE rather than USER_UNKNOWN based on the
> speculation that it's since it's more opaque, at least unsophisticated
> spammers would be less likely to consider using it to test for the existence
> for accounts

That seems to be of extremely marginal value, but I don't object.

Flags: needinfo?(adam)

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 4

•

9 years ago

(In reply to Adam Roach [:abr] from comment #3)

> > * split out a separate rest errno for user unavailable
> 
> Sure. Do you know where the authoritative database for these errnos is, by
> the way? The numbering on them seems to be skipping around as if to avoid
> codes used in other services.

AFAIK, the authoritative source is at

https://github.com/mozilla-services/loop-server/blob/master/loop/errno.json

I don't know why they skip around the way they do.  I was figuring I'd propose this be 122, based on not knowing any better.

> > * use the name USER_UNAVAILABLE rather than USER_UNKNOWN based on the
> > speculation that it's since it's more opaque, at least unsophisticated
> > spammers would be less likely to consider using it to test for the existence
> > for accounts
> 
> That seems to be of extremely marginal value, but I don't object.

Agreed.  It also makes debugging harder, by making the error message less specific.  Unless someone else has a strong opinion on it, jaws and I will discuss tomorrow and pick one of the two for our patch.  :-)

Adam Roach [:abr]

Comment 5

•

9 years ago

(In reply to Dan Mosedale (:dmose) - needinfo? me for response from comment #4)
> I don't know why they skip around the way they do.  I was figuring I'd
> propose this be 122, based on not knowing any better.

There appears to be semantic commonality with these, for the most part:

https://github.com/mozilla/fxa-auth-server/blob/master/docs/api.md#response-format

There's clearly an issue here to be resolved, and I've opened Bug 1120671 to deal with that.

In any case, I don't think we should block *this* bug on figuring out this question. I don't find any claims on 122, so that seems to be safe for this particular case. I'd just like to have a consistent allocation behavior for future codes.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 6

•

9 years ago

[Tracking Requested - why for this release]:

Since we're trying to get our metrics cleaner sooner rather than later, this may be worth uplifting to 36.

tracking-firefox36: --- → ?

Jared Wein [:jaws] (please needinfo? me)

Assignee

Comment 7

•

9 years ago

(In reply to Dan Mosedale (:dmose) - needinfo? me for response from comment #4)
> Agreed.  It also makes debugging harder, by making the error message less
> specific.  Unless someone else has a strong opinion on it, jaws and I will
> discuss tomorrow and pick one of the two for our patch.  :-)

We went with USER_UNKNOWN. We can use server-side rate limiting to prevent people from scraping the email addresses of all Loop users.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 8

•

9 years ago

Attached patch Server change [landed]: Split out USER_UNAVAILABLE from INVALID_PARAMETERS so clients can show better error messages — Details — Splinter Review

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 9

•

9 years ago

Here's a server patch that, as it turned out, went with USER_UNAVAILABLE, because that actually turns to be the easiest thing to correctly implement server side (ie represents both "user unknown" and "user not logged in" cases).

We'll put in a PR.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 10

•

9 years ago

Server-side pull-request: https://github.com/mozilla-services/loop-server/pull/287

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 11

•

9 years ago

Attached patch client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose — Details — Splinter Review

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 12

•

9 years ago

Comment on attachment 8548398 [details] [diff] [review]
client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose

Patch paired & reviewed by jaws & dmose.  Note that unless we land the client patch and deploy the server patch at the same, there will be some short period where Nightly 38 is slightly broken.

Our suggestion is to wait until the server deploy happens, and land this on Nightly that day.

Attachment #8548398 - Flags: review+

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 13

•

9 years ago

Furthermore, 37 will be broken for the same reason until the patch is uplifted.

Dan Mosedale (:dmosedale, :dmose)

Reporter

Updated

•

9 years ago

Attachment #8548398 - Attachment description: client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose → [wait for server deploy to land] client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose

Mark Banner (:standard8)

Comment 14

•

9 years ago

(In reply to Dan Mosedale (:dmose) - needinfo? me for response from comment #12)
> Patch paired & reviewed by jaws & dmose.  Note that unless we land the
> client patch and deploy the server patch at the same, there will be some
> short period where Nightly 38 is slightly broken.
> 
> Our suggestion is to wait until the server deploy happens, and land this on
> Nightly that day.

Why do we need to wait to land the client patch? It is a new error code we're detecting, and if we don't see it the old behavior is the same.

I would say that we only need to wait until the server patch is reviewed and landed in the server tree before we land this patch - we don't need to wait to deploy.

I think the same goes for 37 - we could uplift as soon as we're ready.

Flags: needinfo?(dmose)

Rémy Hubscher (:natim)

Comment 15

•

9 years ago

Maria can you verify that this server change is ok for the Firefox OS client?

https://github.com/mozilla-services/loop-server/pull/287

Flags: needinfo?(oteo)

Maria Angeles Oteo (:oteo)

Comment 16

•

9 years ago

In theory, the change should not affect how FxOS Loop client works as we are not checking the error code and just offering the user the option to share a URL in case the REST call fails. However, I'd like to give it a try, is this already deployed in Development?

Flags: needinfo?(oteo) → needinfo?(rhubscher)

Rémy Hubscher (:natim)

Comment 17

•

9 years ago

https://github.com/mozilla-services/loop-server/commit/fa04dcf1c0ddb2df006f71306e68d90644084652

Flags: needinfo?(rhubscher)

Rémy Hubscher (:natim)

Comment 18

•

9 years ago

Maria — I merged and updated the dev environment.

Maria Angeles Oteo (:oteo)

Comment 19

•

9 years ago

Remy, I've tested pointed to Development Server with 1.1 Loop Mobile version and the master branch (future 1.1.1 based on Rooms) and all is working as expected.

You can remove the patch from the Development Server and thanks for asking!

Jared Wein [:jaws] (please needinfo? me)

Assignee

Comment 20

•

9 years ago

(In reply to Mark Banner (:standard8) from comment #14)
> (In reply to Dan Mosedale (:dmose) - needinfo? me for response from comment
> #12)
> > Patch paired & reviewed by jaws & dmose.  Note that unless we land the
> > client patch and deploy the server patch at the same, there will be some
> > short period where Nightly 38 is slightly broken.
> > 
> > Our suggestion is to wait until the server deploy happens, and land this on
> > Nightly that day.
> 
> Why do we need to wait to land the client patch? It is a new error code
> we're detecting, and if we don't see it the old behavior is the same.

Yeah, it's not the biggest of issues. But if this lands before the server code is deployed, the behavior will regress back to the point of generic error messages until the server is deployed. Dan and I talked about the timing of landing this, and there really is no good time due to the separate deploy/release cycles between Firefox desktop and the server. At some point, one of the two will be out of sync with the other in relation to these patches.

The goal with trying to sync the landings is just to reduce that period.

> I would say that we only need to wait until the server patch is reviewed and
> landed in the server tree before we land this patch - we don't need to wait
> to deploy.
> 
> I think the same goes for 37 - we could uplift as soon as we're ready.

Yeah, that would be a fine approach as well.

Flags: needinfo?(dmose)

Dan Mosedale (:dmosedale, :dmose)

Reporter

Updated

•

9 years ago

Attachment #8548390 - Attachment description: Server change: Split out USER_UNAVAILABLE from INVALID_PARAMETERS so clients can show better error messages → Server change [landed]: Split out USER_UNAVAILABLE from INVALID_PARAMETERS so clients can show better error messages

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 21

•

9 years ago

Jared and I just discussed, and we're going to go ahead and land this now, and request uplift after it's been on m-c for a day or two...

Jared Wein [:jaws] (please needinfo? me)

Assignee

Comment 22

•

9 years ago

Comment on attachment 8548398 [details] [diff] [review]
client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose

https://hg.mozilla.org/integration/fx-team/rev/a23d5b6b7278

Attachment #8548398 - Attachment description: [wait for server deploy to land] client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose → client-patch: direct calls sometime say 'unavailable' rather than 'something went wrong'; patch=jaws,dmose

Jared Wein [:jaws] (please needinfo? me)

Assignee

Updated

•

9 years ago

Flags: in-testsuite+

Ryan VanderMeulen [:RyanVM]

Comment 23

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/a23d5b6b7278

Status: NEW → RESOLVED

Closed: 9 years ago

Resolution: --- → FIXED

Target Milestone: --- → mozilla38

Sylvestre Ledru [:Sylvestre]

Comment 24

•

9 years ago

Dan, do you want to uplift this patch to 36?

Flags: needinfo?(dmose)

Dan Mosedale (:dmosedale, :dmose)

Reporter

Comment 25

•

9 years ago

Sylvestre: yes, but it would require uplifting at least the two strings in 

https://bugzilla.mozilla.org/attachment.cgi?id=8546881&action=diff#a/browser/locales/en-US/chrome/browser/loop/loop.properties_sec2

in bug 1118061 (I actually would like to uplift that entire bug, because it helps users avoid perceiving Hello as buggy in ways that it is not).

What are your thoughts here?

Flags: needinfo?(dmose)

Sylvestre Ledru [:Sylvestre]

Comment 26

•

9 years ago

Pike, Francesco: could you help here? thanks

Flags: needinfo?(l10n)

Flags: needinfo?(francesco.lodolo)

Francesco Lodolo [:flod]

Comment 27

•

9 years ago

Bug 1118061 is already on mozilla-aurora, with the 2 strings
http://hg.mozilla.org/releases/mozilla-aurora/log/486b783c09b6/browser/locales/en-US/chrome/browser/loop/loop.properties

What am I missing?

Flags: needinfo?(l10n)

Flags: needinfo?(francesco.lodolo)

Francesco Lodolo [:flod]

Comment 28

•

9 years ago

OK, found out what I missed right after hitting submit: fx36=mozilla-beta. Reopening NI for pike too based on that.

To be honest I wouldn't add new strings in mozilla-beta. A lot of locales won't have the chance to translate these strings, which means displaying the error message in English. And displaying a localized generic error message seems preferable than displaying one specific but in a language users might not understand.

Flags: needinfo?(l10n)

Axel Hecht [:Pike]

Comment 29

•

9 years ago

I don't think we should uplift strings for this. We might want to create a branch specific patch to absorb the new error code in a backwards compatible way, though.

Not sure how users on release are going to experience this?

Flags: needinfo?(l10n)

Rémy Hubscher (:natim)

Comment 30

•

9 years ago

We forgot about updating the documentation.

Status: RESOLVED → REOPENED

Resolution: FIXED → ---

Rémy Hubscher (:natim)

Comment 31

•

9 years ago

Attached file Update documentation — Details

Attachment #8550897 - Flags: review?(alexis+bugs)

Sylvestre Ledru [:Sylvestre]

Comment 32

•

9 years ago

L10N folks are pretty clear on this. This will wait for 37!

tracking-firefox36: ? → -

Alexis Metaireau (:alexis)

Comment 33

•

9 years ago

Comment on attachment 8550897 [details] [review]
Update documentation

https://github.com/mozilla-services/docs/commit/46c69e17d94f1ca888c84129c4344314d40f2f54

Attachment #8550897 - Flags: review?(alexis+bugs) → review+

Alexis Metaireau (:alexis)

Comment 34

•

9 years ago

I just landed server code.

Jared Wein [:jaws] (please needinfo? me)

Assignee

Updated

•

9 years ago

Status: REOPENED → RESOLVED

Closed: 9 years ago → 9 years ago

Resolution: --- → FIXED

Dan Mosedale (:dmosedale, :dmose)

Reporter

Updated

•

9 years ago

Blocks: 1124365

:shell escalante

Updated

•

9 years ago

Iteration: --- → 38.1 - 26 Jan

Jared Wein [:jaws] (please needinfo? me)

Assignee

Updated

•

9 years ago

Points: --- → 3

You need to log in before you can comment on or make changes to this bug.