Closed Bug 846489 Opened 11 years ago Closed 10 years ago

Create an SSL Error Reporting Mechanism

Tracking

()

Status:

RESOLVED FIXED

Milestone:

mozilla36

People

(Reporter: kathleen.a.wilson, Assigned: mgoodwin)

References

(
URL
)

Details

Attachments

(4 files, 21 obsolete files)

patch 10 years ago Dana Keeler (she/her) (use needinfo) [:keeler] (on leave) 24.22 KB, patch	jaws : feedback-	Details \| Diff \| Splinter Review
846489-ssl-error-reporting.patch 10 years ago Garrett Robinson [:grobinson] 5.82 KB, patch		Details \| Diff \| Splinter Review
846489-ssl-error-reporting.patch 10 years ago Garrett Robinson [:grobinson] 6.69 KB, patch		Details \| Diff \| Splinter Review
846489_ssl_error_reporting.patch 10 years ago Garrett Robinson [:grobinson] 5.34 KB, patch		Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Garrett Robinson [:grobinson] 6.96 KB, patch		Details \| Diff \| Splinter Review
846489_expose_error_code_on_transport_security_info.patch 10 years ago Garrett Robinson [:grobinson] 1.94 KB, patch	keeler : review+	Details \| Diff \| Splinter Review
846489_expose_error_code_on_transport_security_info.patch 10 years ago Garrett Robinson [:grobinson] 2.04 KB, patch	grobinson : review+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 16.46 KB, patch		Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 14.53 KB, patch	benjamin : feedback- Felipe : feedback+	Details \| Diff \| Splinter Review
03_846489_tests.patch 10 years ago Mark Goodwin [:mgoodwin] 103.98 KB, patch		Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 16.18 KB, patch		Details \| Diff \| Splinter Review
03_846489_tests.patch 10 years ago Mark Goodwin [:mgoodwin] 109.62 KB, patch	Felipe : feedback+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 21.51 KB, patch	Felipe : feedback+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.38 KB, patch		Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 23.59 KB, patch		Details \| Diff \| Splinter Review
03_846489_tests.patch 10 years ago Mark Goodwin [:mgoodwin] 113.88 KB, patch	Felipe : review+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.33 KB, patch	Felipe : review+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.59 KB, patch	mgoodwin : review+ benjamin : feedback+	Details \| Diff \| Splinter Review
03_846489_tests.patch 10 years ago Mark Goodwin [:mgoodwin] 113.88 KB, patch	mgoodwin : review+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.64 KB, patch	Felipe : review+	Details \| Diff \| Splinter Review
846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.64 KB, patch	mgoodwin : review+	Details \| Diff \| Splinter Review
01_bug846489_expose_error_code_on_transport_security_info.patch 10 years ago Mark Goodwin [:mgoodwin] 2.02 KB, patch	mgoodwin : review+	Details \| Diff \| Splinter Review
02_bug846489_neterror_ui.patch 10 years ago Mark Goodwin [:mgoodwin] 25.65 KB, patch	mgoodwin : review+	Details \| Diff \| Splinter Review
03_bug846489_tests.patch 10 years ago Mark Goodwin [:mgoodwin] 113.89 KB, patch	mgoodwin : review+	Details \| Diff \| Splinter Review
04_bug846489_fix_report_URL.patch 10 years ago Mark Goodwin [:mgoodwin] 917 bytes, patch	keeler : review+	Details \| Diff \| Splinter Review

Kathleen Wilson

Reporter

Description

•

11 years ago

Please create a certificate error reporting mechanism that will transmit and store the following information on a Mozilla server, allowing the data to be analyzed both automatically and manually.
- Domain of bad connection
- Error type (e.g. Pinning, domain mismatch, etc)
- Cert chain (at minimum, same data to distrust each cert in the chain)
- Request data (e.g. User Agent, IP, Timestamp)

Initially this reporting mechanism will be used to report, store, and analyze certificate pinning violations. In the future it could also be used for user-reported certificate errors, and other related concerns.

Certificate pinning is a mechanism by which site owners can specify a set of keys (actually fingerprints of the keys) such that in the next connection to the site, the set of keys in the certificate chain MUST intersect with the set of keys 'pinned' in the browser.
- https://bugzilla.mozilla.org/show_bug.cgi?id=744204
- https://wiki.mozilla.org/Security/Features/CA_pinning_functionality

When the set of keys in the certificate chain do not intersect with the set of keys 'pinned' in the browsers, then an alert will be generated and sent to Mozilla to be stored and analyzed. There may be some false alarms, but if a real issue (such as MITM) is identified, the security-group should be alerted for further action.

Curtis Koenig [:curtisk-use curtis.koenig+bzATgmail.com]]

Updated

•

11 years ago

Depends on: 846501

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Comment 1

•

11 years ago

I'm assuming this bug is where the development for this feature is going, since the other bug (bug 846501) seems to be a project review tracking bug.

Depends on: 875577

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Updated

•

11 years ago

Depends on: 875583

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Updated

•

11 years ago

Depends on: 731485

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Updated

•

11 years ago

Depends on: 940506

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Comment 2

•

10 years ago

Attached patch patch (obsolete) — Details — Splinter Review

Jared - this adds a window that comes up when a certificate error is encountered. I was wondering if you could have a look. Thanks! (It's not entirely finished, but I thought I could get a head-start on an initial pass.)
(Here's the design, from Larissa: http://people.mozilla.org/~lco/ProjectSPF/Certificate_Messages/131209%20Wireframes.pdf )

Assignee: nobody → dkeeler

Status: NEW → ASSIGNED

Attachment #8375900 - Flags: review?(jaws)

Jared Wein [:jaws] (please needinfo? me)

Comment 3

•

10 years ago

Comment on attachment 8375900 [details] [diff] [review]
patch

Review of attachment 8375900 [details] [diff] [review]:
-----------------------------------------------------------------

I really think we should re-consider using a popup window for this. We shouldn't be adding to the number of popup windows that users will see while using the browser.

I would prefer a checkbox within the current UI, unchecked by default, that when checked will send the information to Mozilla upon clicking on one of the buttons within the current UI. Larissa, what do you think about that?

Attachment #8375900 - Flags: review?(jaws) → feedback-

Jared Wein [:jaws] (please needinfo? me)

Updated

•

10 years ago

Flags: needinfo?(lco)

Jared Wein [:jaws] (please needinfo? me)

Comment 4

•

10 years ago

David, what do you think about implementing something that matches the interaction that I described in comment #3?

Jared Wein [:jaws] (please needinfo? me)

Updated

•

10 years ago

Flags: needinfo?(dkeeler)

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Comment 5

•

10 years ago

(In reply to Jared Wein [:jaws] (please needinfo? me) from comment #4)
> David, what do you think about implementing something that matches the
> interaction that I described in comment #3?

I'm fine with that. I would still appreciate Larissa's input, though.

Flags: needinfo?(dkeeler)

Larissa Co [:lco]

Comment 6

•

10 years ago

After talking with keeler, jaws, kathleen, and ibarlow, the solution we came up with was to use a doorhanger instead of the dialog box.

The doorhanger will appear to come from the site identity indicator (e.g. the globe/lock/etc). The user won't be able to access the notification again if he dismisses it; this is because the site identity panel is what usually shows up when you click on the indicator.

YES, this is somewhat of a hack. However, I think it's ok for the following reasons:
1. I would rather not create a new icon just for this special case, especially when there's no really value in allowing the user to revisit the doorhanger at a later point; this is a one-time prompt.
2. The number of users who will actually see this interface is quite low. Let's not go crazy optimizing for a corner case.
3. The doorhanger is a good compromise in drawing the user's attention without getting in their way.

I think that we should try this UI in Nightly and see how it goes in terms of getting feedback. We can adjust its visibility as necessary.

I don't have a great answer for where the doorhanger should be displayed (the Connection Untrusted page vs. in the redirect page). There are pros and cons to either case. However, my gut says that it should be displayed in the redirected page (i.e. either the Mozilla home page that the user sees when he clicks "get me out of here", or the page he's trying to visit when he adds an exception). It's a bit overwhelming for the user to encounter a page with two messages from the system: one warning him that the connection is untrusted, and another asking him to report the error. So I would rather that he see the option to report the error once he's redirected.

We'll have to tweak the message in the dialog a little though. How about this:

**Report the Untrusted Connection to Mozilla?**

Sharing the address and certificate information for the site you were trying to access (http://ulik2.accv.es) will help us identify and block malicious sites. Learn more...

Flags: needinfo?(lco)

Tanvi Vyas[:tanvi]

Comment 7

•

10 years ago

(In reply to Larissa Co [:lco] from comment #6)
> After talking with keeler, jaws, kathleen, and ibarlow, the solution we came
> up with was to use a doorhanger instead of the dialog box.
> 
> The doorhanger will appear to come from the site identity indicator (e.g.
> the globe/lock/etc). The user won't be able to access the notification again
> if he dismisses it; this is because the site identity panel is what usually
> shows up when you click on the indicator.
> 
> YES, this is somewhat of a hack. However, I think it's ok for the following
> reasons:
> 1. I would rather not create a new icon just for this special case,
> especially when there's no really value in allowing the user to revisit the
> doorhanger at a later point; this is a one-time prompt.
> 2. The number of users who will actually see this interface is quite low.
> Let's not go crazy optimizing for a corner case.
> 3. The doorhanger is a good compromise in drawing the user's attention
> without getting in their way.
> 
> I think that we should try this UI in Nightly and see how it goes in terms
> of getting feedback. We can adjust its visibility as necessary.
> 
> I don't have a great answer for where the doorhanger should be displayed
> (the Connection Untrusted page vs. in the redirect page). There are pros and
> cons to either case. However, my gut says that it should be displayed in the
> redirected page (i.e. either the Mozilla home page that the user sees when
> he clicks "get me out of here", or the page he's trying to visit when he
> adds an exception). It's a bit overwhelming for the user to encounter a page
> with two messages from the system: one warning him that the connection is
> untrusted, and another asking him to report the error. So I would rather
> that he see the option to report the error once he's redirected.
> 
> We'll have to tweak the message in the dialog a little though. How about
> this:
> 
> **Report the Untrusted Connection to Mozilla?**
> 
> Sharing the address and certificate information for the site you were trying
> to access (http://ulik2.accv.es) will help us identify and block malicious
> sites. Learn more...

Hi Larissa!  I took a look at your proposal and have some questions.  Perhaps a mock up would help answer some of them.

* Will the doorhanger automatically pop up or is it dismissed by default?  

* The code that pops up and shows doorhangers is separate from the code that shows the lock/globe icon.  From the above proposal, it sounds like we want to use the lock as a doorhanger.  Would this mean we don't use PopupNotifications.jsm?  If this is a onetime notification that is not encountered very often, it might be easier from a development stand point to just add another icon and use the doorhanger functionality.  It sounds like you have already discussed this with jaws.  If he is okay with adding doorhanger-like code to the site identity box, then that's fine.  The consequence is that everytime the look and appearance of doorhangers change, we will also have to make sure we change the code for this to match it (example bug 967349 / bug 864160).
* What would happen to the site identify information in the site identity box?  Would it be above or below the request to report the site?  Or will the site identify information be replaced by the reporting content the first time the user clicks on the lock?  And after dismissal, the user can see the site identity information again?

Thanks!

Jared Wein [:jaws] (please needinfo? me)

Comment 8

•

10 years ago

(In reply to Tanvi Vyas [:tanvi] from comment #7)
>

(Please use needinfo to make sure that someone is flagged and questions don't get lost in bugmail)

> * Will the doorhanger automatically pop up or is it dismissed by default?  

Let's make it automatically pop up.

> * The code that pops up and shows doorhangers is separate from the code that
> shows the lock/globe icon.  From the above proposal, it sounds like we want
> to use the lock as a doorhanger.  Would this mean we don't use
> PopupNotifications.jsm?  If this is a onetime notification that is not
> encountered very often, it might be easier from a development stand point to
> just add another icon and use the doorhanger functionality.  It sounds like
> you have already discussed this with jaws.  If he is okay with adding
> doorhanger-like code to the site identity box, then that's fine.  The
> consequence is that everytime the look and appearance of doorhangers change,
> we will also have to make sure we change the code for this to match it
> (example bug 967349 / bug 864160).

Is that true? Can't we show a doorhanger and just set the anchor of the doorhanger to be the site identity icon?

> * What would happen to the site identify information in the site identity
> box?  Would it be above or below the request to report the site?  Or will
> the site identify information be replaced by the reporting content the first
> time the user clicks on the lock?  And after dismissal, the user can see the
> site identity information again?

The reporting mechanism popup would be opened automatically. If it gets dismissed and the user clicks on the site identity icon, then the site identity doorhanger will open.

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Comment 9

•

10 years ago

I don't have time to work on this at the moment, so we should find a new owner.

Assignee: dkeeler → dougt

Jared Wein [:jaws] (please needinfo? me)

Comment 10

•

10 years ago

Tanvi, can you take this?

Flags: needinfo?(tanvi)

Tanvi Vyas[:tanvi]

Comment 11

•

10 years ago

Not unless I postpone my current project.  Will talk to my team and get back to you this week.  If I can't take it, I will mark the UI parts of this bug for firefox-backlog review (perhaps using a separate bug to track the front-end work).

(In reply to Jared Wein [:jaws] (please needinfo? me) from comment #10)
> Tanvi, can you take this?

Flags: needinfo?(tanvi) → firefox-backlog?

Tanvi Vyas[:tanvi]

Comment 12

•

10 years ago

Oops, looks like I did that already.  Removing the backlog flag for now.

Flags: firefox-backlog?

Tanvi Vyas[:tanvi]

Comment 13

•

10 years ago

A few more questions...

Does anyone have a copy of http://people.mozilla.org/~lco/ProjectSPF/Certificate_Messages/131209%20Wireframes.pdf ?  The link no longer works.

What buttons are available in the doorhanger?
- Is the "Learn More..." a link in the text like it is here - https://people.mozilla.org/~tvyas/FigureB.jpg
- What does the "report" button say?  Is it a drop down or a single 'report this page' button?

Does anyone have feedback on Larissa's proposed text -
**Report the Untrusted Connection to Mozilla?**

Sharing the address and certificate information for the site you were trying to access (http://ulik2.accv.es) will help us identify and block malicious sites. Learn more...

Jared Wein [:jaws] (please needinfo? me)

Updated

•

10 years ago

Flags: needinfo?(lco.mozilla)

Doug Turner (:dougt)

Updated

•

10 years ago

Assignee: dougt → grobinson

Tanvi Vyas[:tanvi]

Comment 14

•

10 years ago

Since lco is no longer at mozilla, maybe we should need-info someone else for the questions in comment 13.  Flagging Madhava to see if he can route this to someone.

Flags: needinfo?(madhava)

Garrett Robinson [:grobinson]

Comment 15

•

10 years ago

Unfortunately, recent changes to the cert error flow for certain cases may invalidate the proposed user flow from Comment 6.

The proposed flow assumes that every cert error page has two options: a "Get me out of here", which redirects to about:home, and some bypass mechanism ("I Understand the Risks"). The "report error" doorhanger is meant to be shown *after* one of these two choices is made.

However, pinning violations and revoked certificates do not allow the user to bypass the warning and access the site anyway. At the moment, they are shown a different UI (see bug 1011638), where the only choice they have is to reload the page. There is no "I Understand the Risks", nor a "Get me out of here". In the case of pinning, this makes sense if the pinning error is due to a captive portal. However, it makes it unclear as to at which point in the flow the "Report an Error to Mozilla" message should be shown.

IMO, the best place might be on the about:certerror page itself. It already directs users to access a non-existent Help menu item to report errors... (bug 1014282)

Sid Stamm [:geekboy or :sstamm]

Comment 16

•

10 years ago

Rerouting to Philipp... Philipp, can you take a look at this bug and help out with Tanvi's questions?

Flags: needinfo?(philipp)

Flags: needinfo?(madhava)

Flags: needinfo?(lco.mozilla)

Tanvi Vyas[:tanvi]

Comment 17

•

10 years ago

(In reply to Garrett Robinson [:grobinson] from comment #15)
> IMO, the best place might be on the about:certerror page itself.

Sounds like the proposal in comment 6 would work for non-pinning errors only.  Since this bug was intended to address pinning first, we need another flow.  What was the reason we tried to avoid adding a report button in the cert error page itself?

Garrett Robinson [:grobinson]

Comment 18

•

10 years ago

> Since this bug was intended to address pinning first, we need another flow.

Yup, exactly.

> What was the reason we tried to avoid adding a report button in the cert error page itself?

No idea, I wasn't involved with the first go-round on this bug. I think it's a reasonable thing to try.

(Currently slow to respond) Philipp Sackl [:phlsa] (Firefox UX) please use needinfo

Comment 19

•

10 years ago

Reading this thread, my first intuition is also to add the option to the error page itself.
Is this only spawned from about:certerror or also from other error pages?

Flags: needinfo?(philipp) → needinfo?(tanvi)

Jared Wein [:jaws] (please needinfo? me)

Comment 20

•

10 years ago

(In reply to Tanvi Vyas [:tanvi] from comment #17)
> What was the reason we tried to avoid adding a report button in the
> cert error page itself?

The goal was to increase responses, at the expense of making this an obtrusive alert. It started out as a proposal for a modal dialog, which I then negotiated to get moved to a doorhanger. I would be in favor of moving it to the page content, at the expense of lower responses but an improved user experience for people who don't want their task continuity broken.

Garrett Robinson [:grobinson]

Comment 21

•

10 years ago

Attached patch 846489-ssl-error-reporting.patch (obsolete) — Details — Splinter Review

I got started with an "on-page" (aboutCertError.xhtml) UI today. I got stuck on extracting the SSLStatus in browser.js::onAboutCertError in order to prepare the report. This is a long-standing problem with no easy solution - see comments in the patch for ideas on how to move forward.

Tanvi Vyas[:tanvi]

Comment 22

•

10 years ago

(In reply to Philipp Sackl [:phlsa] from comment #19)
> Reading this thread, my first intuition is also to add the option to the
> error page itself.
> Is this only spawned from about:certerror or also from other error pages?

Only the cert error page.

Flags: needinfo?(tanvi)

Garrett Robinson [:grobinson]

Comment 23

•

10 years ago

Attached patch 846489-ssl-error-reporting.patch (obsolete) — Details — Splinter Review

Uses nsIRecentBadCerts to get the certificate that caused the error. It's also used to get the cert when adding an exception, so it kind of makes sense in this context. We can revisit a "better" way to do this stuff (e.g. persist the certificate info on the docshell) later, in a different bug.

Attachment #8430427 - Attachment is obsolete: true

Brian Smith (:briansmith, :bsmith, use NEEDINFO?)

Comment 24

•

10 years ago

Comment on attachment 8433729 [details] [diff] [review]
846489-ssl-error-reporting.patch

Review of attachment 8433729 [details] [diff] [review]:
-----------------------------------------------------------------

::: browser/base/content/browser.js
@@ +2323,5 @@
> +         *
> +         * We have 3 options here:
> +         * 1. use recentBadCertService, which "has its own problems" according
> +         *    to keeler
> +         * 2. xhr to the domain in question, and pull the cert off of that.

The certificate that nsIRecentBadCertService returns may not be the same certificate that caused the connection to fail. That is, it is inherently racy and shouldn't be used for anything other than its intended purpose. I appreciate the attempt to expedite the solution for this bug, but it isn't good to add race conditions here and it isn't good to make it harder to remove nsIRecentBadCertService than it already is. Please try to find an alternative solution.

[:mmc] Monica Chew (no longer reading bugmail)

Comment 25

•

10 years ago

How does the certificate viewer get the cert to show?

Garrett Robinson [:grobinson]

Comment 26

•

10 years ago

(In reply to [:mmc] Monica Chew (please use needinfo) from comment #25)
> How does the certificate viewer get the cert to show?

It uses nsIRecentBadCertService, and falls back to sending an XHR and checking the channel if that doesn't work: http://dxr.mozilla.org/mozilla-central/source/security/manager/pki/resources/content/exceptionDialog.js#119

Brian Smith (:briansmith, :bsmith, use NEEDINFO?)

Comment 27

•

10 years ago

(In reply to Garrett Robinson [:grobinson] from comment #26)
> (In reply to [:mmc] Monica Chew (please use needinfo) from comment #25)
> > How does the certificate viewer get the cert to show?
> 
> It uses nsIRecentBadCertService, and falls back to sending an XHR and
> checking the channel if that doesn't work:
> http://dxr.mozilla.org/mozilla-central/source/security/manager/pki/resources/
> content/exceptionDialog.js#119

That's the exception dialog, not the cert viewer, which is also broken (many bugs filed against it). The cert exception dialog box hasn't been a high priority to fix because we (I) have decided that improvements to the cert error override mechanism should have very low priority. But, I think that this mechanism requires a higher level of correctness considering its intended purpose.

Perhaps describing some cases of this problem will lead to a solution. It is pretty obvious what to do if we're making a single connection to a load a page and that connection has a key pinning violation--we show the error page and we prompt in some manner to report the error to our reporting service using the entire certificate chain that was given in the TLS handshake. Right?

Even if nsIRecentBadCertService weren't racy and otherwise problematic, you *cannot* get the certificates other than the end-entity certificate from it. Thus, you cannot send back a complete report for even the simplest case.

Now, consider more complex cases: Let's say we try to load the same site in two tabs and we create TWO connections to do so; further, let's say one fails due to a key pinning violation and another one fails due to some other issue. IIUC, nsIRecentBadCertService will have trouble here because getRecentBadCert can only return *one* bad cert. But, what if the key pinning violation occurred before the other issue? Then the key pinning violation would be hidden and/or we'd send the wrong cert (chain) in the report.

Note also that we don't show the cert error page when the main page loads but a sub-resource fails to load due to a cert error--the cert error just causes the subresource to silently fail to load. And, note that especially with subresources we could have many connections happening simultaneously and/or in quick succession for a single hostname, some of which may have zero cert errors, some of which may have key pinning violations, and some of which may have other cert errors. nsIRecentBadCertService just doesn't handle this well at all, and it doesn't really need to, since its original intended purpose was solely to implement the Tools > Options > Advanged > Manage Certificates > Servers > Add Exception UI. Note that the addition of that "Add Exception" button is what made the certificate error exception dialog box so ugly and confusing. My opinion is that we should remove that button from Gecko along with all the associated badness, including in particular ns[I]RecentBadCert*.

Now, it seems to me that your reporting mechanism should try to report *all* the distinct cert chains that failed due to key pinning violation. I suggest that you change PublicKeyPinningService so that it saves all the violations in its own (in-memory) storage, and then implement a new GetAllPinningViolationsForHost(nsACString hostname) method that returns *all* the stored information, for all ports (since your UI prompt doesn't need to bother the user about ports), including in particular *all* of the relevant certificates instead of the last known bad end-entity certificate for the hostname. Note that you need to be aware of the potential for the accumulating key pinning violations to consume too much memory and deal with that by limiting the amount of info that you store. I believe this would be a better solution to the problem that you are trying to solve.

Brian Smith (:briansmith, :bsmith, use NEEDINFO?)

Comment 28

•

10 years ago

(In reply to Brian Smith (:briansmith, was :bsmith; NEEDINFO? for response) from comment #27)
> I believe this would be
> a better solution to the problem that you are trying to solve.

...while not being much harder to do than using nsIRecentBadCertService.

(Currently slow to respond) Philipp Sackl [:phlsa] (Firefox UX) please use needinfo

Comment 29

•

10 years ago

Hey, sorry for letting this slip. You still need some UI work done here, right?

Garrett Robinson [:grobinson]

Comment 30

•

10 years ago

(In reply to Brian Smith (:briansmith, was :bsmith; NEEDINFO? for response) from comment #27)
> Even if nsIRecentBadCertService weren't racy and otherwise problematic, you
> *cannot* get the certificates other than the end-entity certificate from it.
> Thus, you cannot send back a complete report for even the simplest case.

I assumed I would be able to use nsIX509Cert.getChain [0]. Is there something wrong with that?

[0] http://dxr.mozilla.org/mozilla-central/source/security/manager/ssl/public/nsIX509Cert.idl?from=nsIX509Cert.idl#181

> Now, consider more complex cases: Let's say we try to load the same site in
> two tabs and we create TWO connections to do so; further, let's say one
> fails due to a key pinning violation and another one fails due to some other
> issue. IIUC, nsIRecentBadCertService will have trouble here because
> getRecentBadCert can only return *one* bad cert. But, what if the key
> pinning violation occurred before the other issue? Then the key pinning
> violation would be hidden and/or we'd send the wrong cert (chain) in the
> report.

Can we collect multiple failures for the same hostname:port, and report them all (let the analysts sort it out)?

> Note also that we don't show the cert error page when the main page loads
> but a sub-resource fails to load due to a cert error--the cert error just
> causes the subresource to silently fail to load. And, note that especially
> with subresources we could have many connections happening simultaneously
> and/or in quick succession for a single hostname, some of which may have
> zero cert errors, some of which may have key pinning violations, and some of
> which may have other cert errors. nsIRecentBadCertService just doesn't
> handle this well at all, and it doesn't really need to, since its original
> intended purpose was solely to implement the Tools > Options > Advanged >
> Manage Certificates > Servers > Add Exception UI. Note that the addition of
> that "Add Exception" button is what made the certificate error exception
> dialog box so ugly and confusing. My opinion is that we should remove that
> button from Gecko along with all the associated badness, including in
> particular ns[I]RecentBadCert*.

I interpreted the focus of this bug as reporting validation errors for resources (not resources) and extending the existing UI to do so. Reporting validation errors for sub-resources would require entirely new UI and add significant complexity. I'd prefer to focus on reporting errors for documents and, if people want it, develop reporting UI for sub-resources in a follow-up.

> Now, it seems to me that your reporting mechanism should try to report *all*
> the distinct cert chains that failed due to key pinning violation. I suggest
> that you change PublicKeyPinningService so that it saves all the violations
> in its own (in-memory) storage, and then implement a new
> GetAllPinningViolationsForHost(nsACString hostname) method that returns
> *all* the stored information, for all ports (since your UI prompt doesn't
> need to bother the user about ports), including in particular *all* of the
> relevant certificates instead of the last known bad end-entity certificate
> for the hostname. Note that you need to be aware of the potential for the
> accumulating key pinning violations to consume too much memory and deal with
> that by limiting the amount of info that you store. I believe this would be
> a better solution to the problem that you are trying to solve.

keeler points out that doing this would probably lead to false positives, since our path building algorithm may build multiple unsuccessful chains before giving up (especially in the case of cross-signing). I think we only want to report the chain that was presented in the certificate that failed to validate (let me know if I am misunderstanding you).

Flags: needinfo?(brian)

Brian Smith (:briansmith, :bsmith, use NEEDINFO?)

Comment 31

•

10 years ago

(In reply to Garrett Robinson [:grobinson] from comment #30)
> (In reply to Brian Smith (:briansmith, was :bsmith; NEEDINFO? for response)
> from comment #27)
> > Even if nsIRecentBadCertService weren't racy and otherwise problematic, you
> > *cannot* get the certificates other than the end-entity certificate from it.
> > Thus, you cannot send back a complete report for even the simplest case.
> 
> I assumed I would be able to use nsIX509Cert.getChain [0]. Is there
> something wrong with that?
> 
> [0]
> http://dxr.mozilla.org/mozilla-central/source/security/manager/ssl/public/
> nsIX509Cert.idl?from=nsIX509Cert.idl#181

It is bad to use nsIX509Cert.getChain. First, nsIX509Cert.getChain has the same kind of race conditions that nsIRecentBadCertService has in the face of different concurrent (or even serial) connections that have different certificate chains. Secondly, nsIX509Cert.getCert is only "guaranteed" to see the relevant CA certificates for *valid* (accepted), not *invalid* (rejected) certificates, because it requires the CA certificates to be a CERTCertificate object to be in memory and/or it requires the CA certificate to be written in the certificate database, and we explicitly avoid doing those things for invalid (rejected) certificates. Also, nsXI50Cert.getChain returns a *constructed* certificate chain, but for error reporting you should be reporting the *received* certificates. Finally, nsXI50Cert.getChain can return chains with certificates received on connections to other hosts, so your permission prompt to ask the user's permission to send the certificates received for a connection to host example.org would be misleading in its privacy implications.

tl;dr: Never use nsIX509Cert.getChain for anything. See bug 867473 about the planned removal of this horrific code.

> Can we collect multiple failures for the same hostname:port, and report them
> all (let the analysts sort it out)?

That's what I was recommending. (And, like I said, I suggest not keying things on the port, but just the hostname.)

> I interpreted the focus of this bug as reporting validation errors for
> resources (not resources) and extending the existing UI to do so. Reporting
> validation errors for sub-resources would require entirely new UI and add
> significant complexity. I'd prefer to focus on reporting errors for
> documents and, if people want it, develop reporting UI for sub-resources in
> a follow-up.

I don't have an opinion about that. Note that my points about race conditions and reporting the wrong certificates and/or reporting incomplete information still stand even without considering subresources.

> > Now, it seems to me that your reporting mechanism should try to report *all*
> > the distinct cert chains that failed due to key pinning violation.

> keeler points out that doing this would probably lead to false positives,
> since our path building algorithm may build multiple unsuccessful chains
> before giving up (especially in the case of cross-signing). I think we only
> want to report the chain that was presented in the certificate that failed
> to validate (let me know if I am misunderstanding you).

Sorry, I was unclear. When I said "report *all* the distinct cert chains that failed due to key pinning violation" I meant "report the contents of the TLS Certificate message for all TLS connections that failed due to key pinning violation." The emphasis should be on reporting the *received* certificate chains, not *constructed* certificate chains.

(Actually, to report the most complete information, you should report the union of the received and constructed chains, but that is complicated by the requirement to ask the user permission on a domain-by-domain basis for sending the information. Eventually we should make it so that a connection to hostname X never affects the connection to hostname y as far as cert chain construction is concerned. FWIW, I am working on something that should enable that, eventually.)

Flags: needinfo?(brian)

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Updated

•

10 years ago

Depends on: 1025330

Garrett Robinson [:grobinson]

Updated

•

10 years ago

Depends on: 1029155

Garrett Robinson [:grobinson]

Comment 32

•

10 years ago

Attached patch 846489_ssl_error_reporting.patch (obsolete) — Details — Splinter Review

Now that we have the failed channel available on the docShell (thanks, keeler!), I'm returning to my original plan of getting the necessary info from the channel that caused the error page to appear.

Need to expose the peer certificate chain on nsISSLStatus (bug 1029155), as discussed with keeler and cviecco on IRC, to continue with this approach.

Attachment #8433729 - Attachment is obsolete: true

Garrett Robinson [:grobinson]

Comment 33

•

10 years ago

Now the URL is Philipp's sweet UI design, check it out!

URL: http://cl.ly/image/1f3k0Q1R0l2Q

Garrett Robinson [:grobinson]

Comment 34

•

10 years ago

Attached patch 846489_neterror_ui.patch (obsolete) — Details — Splinter Review

Implemented the UI (see URL) for certificate error reporting in netError.xhtml, so it can be shown for pinning errors. You can see how the "Report an error" link only appears for pinning errors by visiting https://pinningtest.appspot.com.

There are still a few minor tweaks needed.

* I wasn't able to get the "automatically report in the future" checkbox to autofocus, so it didn't make sense to add the blue glow shown in the mockup. Can revisit this later, but I think the glow is a UI hint that should only be present in the button is in some way special. If it were autofocused, you'd be able to toggle it with the spacebar.
* Showing the error reporting panel shifts the entire layout up. It should instead dropdown from the link without causing it to shift.

Garrett Robinson [:grobinson]

Comment 35

•

10 years ago

Attached patch 846489_expose_error_code_on_transport_security_info.patch (obsolete) — Details — Splinter Review

We need to expose the underlying error code from NSS on TransportSecurityInfo so it can be included in the error report.

Attachment #8472721 - Flags: review?(dkeeler)

Dana Keeler (she/her) (use needinfo) [:keeler] (on leave)

Comment 36

•

10 years ago

Comment on attachment 8472721 [details] [diff] [review]
846489_expose_error_code_on_transport_security_info.patch

Review of attachment 8472721 [details] [diff] [review]:
-----------------------------------------------------------------

LGTM.

::: netwerk/socket/nsITransportSecurityInfo.idl
@@ +12,4 @@
>  interface nsITransportSecurityInfo : nsISupports {
>      readonly attribute unsigned long    securityState;
>      readonly attribute wstring          errorMessage;
> +    readonly attribute long             errorCode;

Maybe add a comment that this is a PRErrorCode, just so we don't get confused later.

Attachment #8472721 - Flags: review?(dkeeler) → review+

Garrett Robinson [:grobinson]

Comment 37

•

10 years ago

Attached patch 846489_expose_error_code_on_transport_security_info.patch (obsolete) — Details — Splinter Review

Comment on error code type in .idl. Add commit message for landing.

Attachment #8472721 - Attachment is obsolete: true

Attachment #8473990 - Flags: review+

Garrett Robinson [:grobinson]

Updated

•

10 years ago

Blocks: 1056366

[:mmc] Monica Chew (no longer reading bugmail)

Comment 38

•

10 years ago

Hey Garrett,

Is there more WIP that's not reflected in the patches attached? mgoodwin was asking on irc.

Thanks,
Monica

Flags: needinfo?(garrett.f.robinson+mozilla)