568156 - Use Sync client version as the User-Agent for Sync requests

Reporter

Description

•

15 years ago

Need this for critical security updates, to be able to offer new functionality only to new versions, and who knows what else, but we should know what version each user is on.

Ed Lee :Mardak

Comment 1

•

15 years ago

We do ping with the client version at most once a day when the client syncs. It'll show up in the info/collections?v=1.2.3 query.
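
As a rough illustration of the daily ping described above, the sketch below pulls the version out of an info/collections request URL. The `v` parameter and the `info/collections` path come from this comment; the function name and the `/1.0/alice/` prefix used in the usage note are assumptions made for illustration, not the actual Sync server code.

```python
from typing import Optional
from urllib.parse import parse_qs, urlsplit


def client_version_from_url(url: str) -> Optional[str]:
    """Return the `v` query parameter of an info/collections request, if any.

    Hypothetical helper: the real server may route and parse requests differently.
    """
    parts = urlsplit(url)
    # Only the info/collections call carries the version ping.
    if not parts.path.endswith("info/collections"):
        return None
    values = parse_qs(parts.query).get("v")
    # parse_qs returns a list per key; take the first value if present.
    return values[0] if values else None
```

Called on a path such as `/1.0/alice/info/collections?v=1.2.3` (an assumed URL shape), this would yield the version string; on any sync request without the ping it yields `None`, which is the case on all but one request per day.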

Justin Fitzhugh

Reporter

Comment 2

•

15 years ago

Perfect, didn't know we did this. Perhaps we can hack up a maintenance script to find this and stick it in the DB or LDAP?

Mike Connor [:mconnor]

Comment 3

•

15 years ago

Let's be careful here. I know why we want this information, but I'm also starting to wonder if we should go the other way, and override UA to be something very generic. One core pillar of the Weave project has been giving the user as much control over their own data as possible, and reducing what we know about users to the bare minimum. I'm not sure this qualifies.

The more information we provide about a user, the more the server operator knows about the client. Looking at tech like Panopticlick, we could (if we were evil) combine the fingerprint from the browser with the knowledge of what clients the user has to get a lot of detail about the user. We've done security design from the perspective of having hostile/compromised servers, and this feels like a step against that, so we should discuss this in detail, probably on the public dev list.

Ragavan S [:rags]

Comment 4

•

15 years ago

(In reply to comment #3)
> We've done security design from the perspective of having hostile/compromised servers, and this feels like a step against that, so we should discuss this in detail, probably on the public dev list.

I agree with Mike here. We specifically discussed this when we were implementing the client heartbeat for metrics tracking - in fact, I was still under the (mistaken) impression that we weren't sending anything after the ? in the query. That said, I also see the value of this particular piece of data. Talking about this on weave-dev sounds like a good next step.

Justin Fitzhugh

Reporter

Comment 5

•

15 years ago

How is this any different than when we send the FF UA string? Weave is a client, and the uses for having that data are in fact quite focused on keeping the user *more* secure if we needed to do something specific on the server side to a specific version of the client. I agree having a public discussion is good (isn't this bug public?), but we should frame it as something that will help us offer better support and a more secure service rather than a metric we are gathering to keep track of users (which it isn't, and we aren't).

Mike Connor [:mconnor]

Comment 6

•

15 years ago

(In reply to comment #5)
> How is this any different than when we send the FF UA string?

That's why I mentioned overriding the UA for Weave requests.

> Weave is a client, and the uses for having that data are in fact quite focused on keeping the user *more* secure if we needed to do something specific on the server side to a specific version of the client. I agree having a public discussion is good (isn't this bug public?), but we should frame it as something that will help us offer better support and a more secure service rather than a metric we are gathering to keep track of users (which it isn't, and we aren't).

That's one potential use-case. But if it were a hostile/compromised server, it would make users less secure, as it would be easier to target attacks. This type of data cuts both ways. The point of Weave has been to make it possible to use a service without needing to trust them with your data or your security. I'd rather go even further in the other direction, and figure out how to make collection names opaque to the server.

This is public, but few people watch these components.

Michael Coates [:mcoates] (acct no longer active)

Comment 8

•

15 years ago

I'd like to revisit this discussion, as you may notice from the duplicate bug in comment 7. Having an understanding of the client version will increase our security monitoring capabilities. We've had several situations where we've identified false positive security events that are due to bugs in the client software. Once these bugs are fixed, we have no way of differentiating older, unfixed clients from actual attacks. There is a trade-off here: we do expose a small amount of client version information (no more than a user agent string), but in return we are able to better protect the entire infrastructure against attackers.

Mike Connor [:mconnor]

Comment 9

•

15 years ago

a) As noted in comment 3, I would much rather obscure the UA for all requests than expose even more information to the server.

b) Why wouldn't attackers identify as old clients to look like false positives? Relying on what the client claims to be feels fragile at best.

I do understand the infra/infrasec perspective here, but I still feel like this runs directly counter to the direction we should go from a product perspective. From that perspective, the client should be as anonymous and generic as possible in how it syncs. We probably should go further than we do now, as I've said here.

Michael Coates [:mcoates] (acct no longer active)

Comment 10

•

15 years ago

Yes, in a one-off situation an attacker could pose as an older version. But that's not really how we plan to use the client version information. This information can be used to diagnose large quantities of events that we see from numerous clients of a particular version. This way we can slowly eliminate the bugs which cause security false positive events without continually bothering dev teams with issues that have already been fixed in recent releases (and which we keep seeing because some people haven't updated).

I personally don't think the version data is disclosing too much information. We may need to all get together in a room and hash this one out since we have differing opinions. There is a trade-off here and we need to make a decision on which is more important: protecting the client's version from being reported to the server, or enhanced security monitoring and bug detection.

I will schedule a meeting for next week (sorry, out the rest of this week) and we can finalize this issue with some discussion. Everyone on this bug will receive the invite.

Mike Connor [:mconnor]

Comment 11

•

15 years ago

I'm on an island next week, with no phone. Ultimately, Ragavan has to set the product direction, and we need to live within that space. I think our core goals of privacy, and user-centric data policies, have to win here, but he needs to make that call, not me.

Chris Lyon [:clyon]

Comment 12

•

15 years ago

(In reply to comment #11)
> I'm on an island next week, with no phone.
>
> Ultimately, Ragavan has to set the product direction, and we need to live within that space. I think our core goals of privacy, and user-centric data policies, have to win here, but he needs to make that call, not me.

Ragavan, do you want to make a call on this?

My points, which are in line with some of the other comments:

1. We are seeing some very odd issues with the client/server interaction beyond just a single use case. Having the version in the log entry sent to us will be very helpful and help us figure out what problems to address. This is beyond just security.

2. To Justin's point, we are running a service and this will only be used as part of this service. Since this is a service, what if we had a major issue with a client and didn't want it syncing with our environment, how could we enforce this? Again, we are responsible for this data and if there is a fundamental problem we might not want that data on our servers from a given client version. So we can't enforce this now but we should think about it.

I think we are all on the same page with regards to privacy, and the new log policy will no doubt be in place with this service. Metrics is secondary, not our main reason for having the client report version numbers.

Ragavan S [:rags]

Comment 13

•

15 years ago

This is on me - I'll update this bug with thoughts/comments before the end of this week.

Mike Connor [:mconnor]

Comment 14

•

15 years ago

(In reply to comment #12)
> (In reply to comment #11)
> > I'm on an island next week, with no phone.
> >
> > Ultimately, Ragavan has to set the product direction, and we need to live within that space. I think our core goals of privacy, and user-centric data policies, have to win here, but he needs to make that call, not me.
>
> Ragavan, do you want to make a call on this?
>
> My points, which are in line with some of the other comments:
>
> 1. We are seeing some very odd issues with the client/server interaction beyond just a single use case. Having the version in the log entry sent to us will be very helpful and help us figure out what problems to address. This is beyond just security.

I completely understand how useful this would be for debugging client changes. This would make my life easier, probably even more so than for infrasec. No one is questioning the potential benefits or use-cases. The question is whether they violate the basic product requirements of user privacy being paramount.

> 2. To Justin's point, we are running a service and this will only be used as part of this service. Since this is a service, what if we had a major issue with a client and didn't want it syncing with our environment, how could we enforce this? Again, we are responsible for this data and if there is a fundamental problem we might not want that data on our servers from a given client version. So we can't enforce this now but we should think about it.

What you're really talking about here is "require unique client ID/version strings as part of the API calls" for all clients. It's an open system, so we'd have to require this from all clients.

We're running a service, but the service operational limits cannot be the primary decision factor. The success of Sync, and of how we build out services in general, will be in changing how people run services, and making very different tradeoffs from a privacy perspective. This means we sometimes walk away from wins (like not encrypting data on the client, and not running a proxy server for decrypting data for mobile devices).

> I think we are all on the same page with regards to privacy, and the new log policy will no doubt be in place with this service. Metrics is secondary, not our main reason for having the client report version numbers.

Remember, we're not the only people running servers.

Chris Lyon [:clyon]

Comment 15

•

15 years ago

(In reply to comment #14)
> I completely understand how useful this would be for debugging client changes. This would make my life easier, probably even more so than for infrasec. No one is questioning the potential benefits or use-cases. The question is whether they violate the basic product requirements of user privacy being paramount.

What are the requirements for user privacy at this point? Per the FF Sync Privacy Policy definition:

"“Non-Personal Information” is information that cannot by itself be directly associated with a specific person or entity. Non-Personal Information includes but is not limited to your computer’s configuration and the *version* of Firefox Sync you use."

So what we are really saying is that the version should be considered personal information?

Ragavan S [:rags]

Comment 16

•

15 years ago

I promised a response by the end of the week, but I haven't had a chance to talk to Chris yet. I have a fairly good understanding of Mike's position on this, but want to talk to Chris as well. I'll sync up with Chris early next week and also talk to legal and resolve this.

Ragavan S [:rags]

Comment 17

•

15 years ago

I talked to Julie, she doesn't think it is an issue from the perspective of our Privacy Policy, but is going to think about it some more. https://bugzilla.mozilla.org/show_bug.cgi?id=582483 tracks that. Also, @clyon and @mcoates, as Mardak mentions in comment #1, we currently include the client version in the daily heartbeat ping. Is that not sufficient? Any reason this should be included in each sync request?

Michael Coates [:mcoates] (acct no longer active)

Comment 18

•

15 years ago

The daily heartbeat version information would not be easily available to correlate with each event that we receive into ArcSight. Any sort of automatic correlation between the two data sets would not be feasible considering the large number of events we receive. Having the version number available with each request (via header or URL argument) is the optimal solution for us.

Michael Coates [:mcoates] (acct no longer active)

Comment 19

•

15 years ago

I spoke with zandr today; we may already have access to this information via the HTTP request object. If that is the case, we could update our CEF log points to include this client data. Zandr, is it possible to get the client's version every time they send a request?

Zandr Milewski [:zandr]

Comment 20

•

15 years ago

(In reply to comment #19)
> Zandr, is it possible to get the client's version every time they send a request?

It would be possible to modify the client to do this in the future, yes, subject to, well, the rest of the conversation in this bug. There's no technical barrier, no.

Michael Coates [:mcoates] (acct no longer active)

Comment 21

•

15 years ago

Ah, OK. Perhaps I didn't understand earlier. I was thinking we already had the information available on the server and could just correlate the version info with each request that goes to CEF logging (i.e., not requiring a client change). If a client code change is required then we are back at the original discussion. I thought I had found something that worked without client modification.

Zandr Milewski [:zandr]

Comment 22

•

15 years ago

That's right. Once a day the client appends the ?v= query string, so you need to find that request in the log to get a particular user's version (and if the user has multiple clients, there's lots of opportunity for things to get messy). So we'll get that once/day/client for each user. Metrics counts total numbers of version pings for each version, but to my knowledge they don't tie it to usernames.
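
To make the "messy" case concrete, here is a minimal sketch of the log lookup described above. It assumes log entries have already been reduced to (username, request URL) pairs, which is a hypothetical shape and not the real access log or CEF format; only the `?v=` query string comes from this comment.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple
from urllib.parse import parse_qs, urlsplit


def versions_by_user(entries: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Collect every client version each user pinged with.

    `entries` is an assumed, simplified log: (username, request_url) pairs.
    """
    seen: Dict[str, Set[str]] = defaultdict(set)
    for username, url in entries:
        values = parse_qs(urlsplit(url).query).get("v")
        if values:
            # Most requests carry no ?v=, so only the daily ping lands here.
            seen[username].add(values[0])
    return dict(seen)
```

A user with two clients on different versions ends up with a two-element set, so the lookup alone cannot say which client produced a given request on that day.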

Mike Connor [:mconnor]

Comment 23

•

15 years ago

Spoke to cslyon about this, going to append this to info/collections, as this is the start of each sync pass, and possibly the only call.

OS: Mac OS X → All

Priority: -- → P2

Hardware: x86 → All

Summary: Need to capture the client version number and update on each sync → include client and client version in each info/collections call

Target Milestone: --- → 1.6
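
A minimal sketch of what appending the client and client version to each info/collections call could look like, assuming query parameters are used. The `v` parameter name appears earlier in this bug; the `client` parameter name, the function name, and the example host and client name in the usage note are hypothetical and not taken from any of the patches.

```python
from urllib.parse import urlencode, urlsplit, urlunsplit


def with_client_info(info_collections_url: str, client: str, version: str) -> str:
    """Return the URL with client name and version added to the query string.

    The `client` parameter name is an assumption made for illustration.
    """
    parts = urlsplit(info_collections_url)
    extra = urlencode({"client": client, "v": version})
    # Preserve any query string the caller already supplied.
    query = f"{parts.query}&{extra}" if parts.query else extra
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, parts.fragment))
```

For example, `with_client_info("https://sync.example.com/1.0/alice/info/collections", "ExampleClient", "1.6")` would tag the first request of a sync pass, which per the comment above may be the only request of that pass.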

WIP. v1; 14 years ago; Richard Newman [:rnewman]; 2.75 KB, patch; mconnor: feedback-
User-agent version. v1; 14 years ago; Richard Newman [:rnewman]; 6.60 KB, patch
User-agent version. v2; 14 years ago; Richard Newman [:rnewman]; 4.95 KB, patch; philikon: review+
User-agent version. v3; 14 years ago; Richard Newman [:rnewman]; 5.06 KB, patch
User-agent version. v4; 14 years ago; Richard Newman [:rnewman]; 5.76 KB, patch; philikon: review+
Minor update. v1; 14 years ago; Richard Newman [:rnewman]; 2.26 KB, patch; philikon: review+