1255570 - (CVE-2016-5251) HTTP(S) URL spoof in location bar

Reporter

Description

•

9 years ago

Attached file testcase.html — Details

User Agent: Mozilla/5.0 (Windows NT 6.3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.87 Safari/537.36 Steps to reproduce: Combination of data URI, Unicode characters and frames. Spoof is not perfect but good enough to easily fool your mom and dad. At least it worked with mine :) Follow link in testcase file for a simple demo. Actual results: Browser navigates to arbitrary website but URL bar shows https://secure.paypal.com/ Expected results: At the very least misleading Unicode characters should be detected/escaped and/or a big warning should be displayed. Note: I'm submitting a similar report to Chromium.

firace

Reporter

Updated

•

9 years ago

Component: Untriaged → Location Bar

firace

Reporter

Comment 1

•

9 years ago

Forgot to mention, tested with Firefox 45.0.

:Gijs (he/him)

Assignee

Updated

•

9 years ago

Flags: needinfo?(dveditz)

Flags: needinfo?(abillings)

Al Billings [:abillings - ex-MoCo]

Updated

•

9 years ago

Flags: needinfo?(abillings)

Daniel Veditz [:dveditz]

Updated

•

9 years ago

Status: UNCONFIRMED → NEW

Ever confirmed: true

Keywords: csectype-spoof

Daniel Veditz [:dveditz]

Comment 4

•

9 years ago

This is similar to (inspired by?) bug 1221444 fixed in Firefox 43, too lax handling of data: urls. That bug messed with the mediatype, this one messes with the optional ";base64" bit. If we encounter a ";" in the content type section and it's not followed by "base64" that should be an error. Instead it looks like (from symptoms, have not debugged code) we ignore anything not "base64" and go on our merry way. Since we display the URL that means this field can be used for spoofing arbitrary text. There are a few characters we'd percent-encode, but I bet you could get around that using Unicode look-alikes.

Group: firefox-core-security → network-core-security

Component: Location Bar → Networking

Flags: needinfo?(dveditz)

Keywords: sec-low

Product: Firefox → Core

Comment 6

•

9 years ago

looks like we treat it as an unknown, and thus ignored, content type parameter. What breaks if we stop ignoring those and instead treat them as invalid? Probably something -- see bug 781693 where sites are putting invalid parameters after the ";base64" bit. That means, by the way, that you could definitely do the same kind of spoof by putting this text after the base64 instead of before. URL in this bug (shortened): data:text/html;%E2%80%83%F0%9F%94%92%E2%80%83https://paypal.com/%E2%80%83etc.%E2%80%83charset=utf-8;base64,c3Bvb2Y= see: https://dxr.mozilla.org/mozilla-central/rev/dd1abe874252e507b825a0a4e1063b0e13578288/netwerk/protocol/data/nsDataHandler.cpp#197 It "works" just as well (but at a different point in the code) to put the base64 first: data:text/html;base64;%E2%80%83%F0%9F%94%92%E2%80%83https://paypal.com/%E2%80%83etc.%E2%80%83charset=utf-8,c3Bvb2Y= see: https://dxr.mozilla.org/mozilla-central/rev/dd1abe874252e507b825a0a4e1063b0e13578288/netwerk/protocol/data/nsDataHandler.cpp#182

Patrick McManus [:mcmanus]

Updated

•

9 years ago

Whiteboard: [necko-backlog]

Julian Reschke

Comment 7

•

9 years ago

Escaping non-ASCII characters where media type parameters are expected should be sufficient, no?

Valentin Gosu [:valentin] (he/him)

Comment 8

•

9 years ago

(In reply to Julian Reschke from comment #7) > Escaping non-ASCII characters where media type parameters are expected > should be sufficient, no? nsDataHandler doesn't have a say about what characters are escaped. The location bar is where the entire string is unescaped, and it does so regardless of what media type, etc. We could probably start ignoring certain characters, or rejecting them, but that's likely to break some odd use case for someone.

:Gijs (he/him)

Assignee

Comment 9

•

9 years ago

(In reply to Valentin Gosu [:valentin] from comment #8) > (In reply to Julian Reschke from comment #7) > > Escaping non-ASCII characters where media type parameters are expected > > should be sufficient, no? > > nsDataHandler doesn't have a say about what characters are escaped. > The location bar is where the entire string is unescaped, and it does so > regardless of what media type, etc. > We could probably start ignoring certain characters, or rejecting them, but > that's likely to break some odd use case for someone. The location bar could be taught not to unescape non-ascii for data: and javascript: URIs for display, without much loss of use for 99.99% of users, I think. I haven't looked into this bug in detail, so I'm not sure, but: 1) Marco, would you agree that my suggestion is feasible in principle; 2) Dan, would that address the issue here?

Flags: needinfo?(mak77)

Flags: needinfo?(dveditz)

Daniel Veditz [:dveditz]

Comment 10

•

9 years ago

I moved this to Core::Networking figuring we'd fix it in the data: URL handler like bug 1221444. The only real fix there, though, is either to reject such URLs as invalid or to strip out the unknown bits and re-write the URL--either of which is fairly guaranteed to break some site some where (cf bug 781693). We could only take that approach if we added telemetry to these and measured the use of these variants. That would take a while. Gijs' suggestion puts the ball back in the front-end court. I like it, so I'll move the bug back. I'd prefer a whitelist approach that says we WILL unescape http(s)/ftp/??? rather than a blacklist of data: and javascript:, but I'd accept either as a fix. NB: expect complaints from a small number of developers angry their bookmarklets are suddenly filled with ugly %20. I guess it depends on whether you skip the "prettify" step entirely for those schemes or just change it to treat non-ascii differently.

Group: network-core-security → core-security-release

Component: Networking → Keyboard Navigation

Flags: needinfo?(dveditz)

Product: Core → Firefox

:Gijs (he/him)

Assignee

Comment 11

•

9 years ago

(moving groups because this is unfixed; dveditz, let me know if you moved it to -release over fx-frontend intentionally...)

Group: core-security-release → firefox-core-security

Marco Bonardo [:mak]

Comment 12

•

9 years ago

I think it's feasible, I don't know if the devs complains would be so many. The bookmarks dialogs are already not unescaping the url, we have bugs filed but not so many complains. The awesomebar popup shows unescape bookmarklets and we could and should continue to do that since it's just a temporary visualization. So when the user selects the bookmarklet, it will be pretty-printed. The only thing that will change is the final url shown in the input field. So I assume the broken use-case is when one selects a bookmarklet from the popup, and then wants to quickly edit it? I don't know how common is as a use-case, I could argue it's less common than editing a bookmarklet stored as a bookmark, where we don't unescape. An intermediate solution to that use'case, could be to unescape the url only when it's being edited, but it may be more risky and regression-prone.

Flags: needinfo?(mak77)

:Gijs (he/him)

Assignee

Comment 13

•

9 years ago

Attached patch Patch v1.0 — Details — Splinter Review

Always reassuring if there is already test coverage for this. I looked where that test came from, and it seems to have been there from the start of bug 666964, but there's not much rationale in that bug as to why/how we're unescaping even non-ascii.

Assignee: nobody → gijskruitbosch+bugs

Status: NEW → ASSIGNED

Attachment #8731360 - Flags: review?(mak77)

:Gijs (he/him)

Assignee

Updated

•

9 years ago

Component: Keyboard Navigation → Location Bar

Marco Bonardo [:mak]

Comment 14

•

9 years ago

Comment on attachment 8731360 [details] [diff] [review] Patch v1.0 Review of attachment 8731360 [details] [diff] [review]: ----------------------------------------------------------------- ::: browser/base/content/browser.js @@ +2374,5 @@ > + // This only decodes ascii characters (hex) 20-7e, except 25 (%). > + // This avoids both cases stipulated below (%-related issues, and \r, \n > + // and \t, which would be %0d, %0a and %09, respectively) as well as any > + // non-US-ascii characters. > + value = value.replace(/%(2[0-4]|2[6-9a-f]|[3-6][0-9a-f]|7[0-9a-e])/g, decodeURI); Should I assume decoding %20 is not considered a problem regarding this bug? Cause in the end one could still isolate any wanted address from the rest in the input field through spaces. Or maybe I'm misunderstanding part of this.

:Gijs (he/him)

Assignee

Comment 15

•

9 years ago

(In reply to Marco Bonardo [::mak] from comment #14) > Comment on attachment 8731360 [details] [diff] [review] > Patch v1.0 > > Review of attachment 8731360 [details] [diff] [review]: > ----------------------------------------------------------------- > > ::: browser/base/content/browser.js > @@ +2374,5 @@ > > + // This only decodes ascii characters (hex) 20-7e, except 25 (%). > > + // This avoids both cases stipulated below (%-related issues, and \r, \n > > + // and \t, which would be %0d, %0a and %09, respectively) as well as any > > + // non-US-ascii characters. > > + value = value.replace(/%(2[0-4]|2[6-9a-f]|[3-6][0-9a-f]|7[0-9a-e])/g, decodeURI); > > Should I assume decoding %20 is not considered a problem regarding this bug? > Cause in the end one could still isolate any wanted address from the rest in > the input field through spaces. > Or maybe I'm misunderstanding part of this. I think your understanding is correct. The patch assumes that US-ascii is OK. The more I think about it though, the more this seems generically problematic - replacing the \u2003 from the testcase with ascii spaces seems to work, too? Why is ascii whitespace allowed in the media type here?

Flags: needinfo?(valentin.gosu)

Valentin Gosu [:valentin] (he/him)

Comment 16

•

9 years ago

(In reply to :Gijs Kruitbosch from comment #15) > > Should I assume decoding %20 is not considered a problem regarding this bug? > > Cause in the end one could still isolate any wanted address from the rest in > > the input field through spaces. > > Or maybe I'm misunderstanding part of this. > > I think your understanding is correct. The patch assumes that US-ascii is OK. > > The more I think about it though, the more this seems generically > problematic - replacing the \u2003 from the testcase with ascii spaces seems > to work, too? Why is ascii whitespace allowed in the media type here? From what I remember whitespace is stripped from a data URI, so maybe that's why that works.

Flags: needinfo?(valentin.gosu)

:Gijs (he/him)

Assignee

Comment 17

•

9 years ago

Can you provide a link to the Chromium issue?

Flags: needinfo?(firace)

firace

Reporter

Comment 18

•

9 years ago

Sure, here's a link to the Chromium issue. But note that meanwhile, it was marked as WontFix as the LOCK Unicode character is not decoded by Chromium. https://bugs.chromium.org/p/chromium/issues/detail?id=594057

Flags: needinfo?(firace)

firace

Reporter

Comment 19

•

9 years ago

(In reply to Valentin Gosu [:valentin] from comment #16) > (In reply to :Gijs Kruitbosch from comment #15) > > > Should I assume decoding %20 is not considered a problem regarding this bug? > > > Cause in the end one could still isolate any wanted address from the rest in > > > the input field through spaces. > > > Or maybe I'm misunderstanding part of this. > > > > I think your understanding is correct. The patch assumes that US-ascii is OK. > > > > The more I think about it though, the more this seems generically > > problematic - replacing the \u2003 from the testcase with ascii spaces seems > > to work, too? Why is ascii whitespace allowed in the media type here? > > From what I remember whitespace is stripped from a data URI, so maybe that's > why that works. Correct, ascii spaces are stripped, which is why I had to get a little creative. :)

:Gijs (he/him)

Assignee

Comment 20

•

9 years ago

Hm, so I can reproduce the spaces collapsing if I modify the testcase to navigate to that URI with spaces instead of the \u2003. However, if I myself type: "data:text/html;

:Gijs (he/him)

Assignee

Comment 21

•

9 years ago

Umm, wow, bugzilla, you don't like that comment, do you? Let's try again... ---- Hm, so I can reproduce the spaces collapsing if I modify the testcase to navigate to that URI with spaces instead of the \u2003. However, if I myself type: "data:text/html; [lock] https://secure.paypal.com/,foo" sans quotes in the URL bar, the spaces don't get collapsed. :-\ Anyway, it sounds like a good mitigation here will be to escape non-ascii, even if we leave spaces, as far as exploiting this from the web is concerned. One other thing that we could consider doing is mangling the data URI completely for display, but that would involve reimplementing some of the parsing stuff in frontend JS which I'm not keen on - feels like we'll just be introducing bugs there instead.

Marco Bonardo [:mak]

Comment 22

•

9 years ago

(In reply to :Gijs Kruitbosch from comment #21) > Anyway, it sounds like a good mitigation here will be to escape non-ascii, > even if we leave spaces, as far as exploiting this from the web is concerned. yeah, I think we only care about web exploitability here.

Marco Bonardo [:mak]

Comment 23

•

9 years ago

Attached file test.html — Details

to clarify, this is what I meant.

:Gijs (he/him)

Assignee

Comment 24

•

9 years ago

So I thought we might be able to allow "data:text/html,", but: data:text/html, | PayPal Inc. (US) | https://paypal.com/ <html><body><p>Hello, this is a test</p><script>var p = document.getElementsByTagName("p")[0]; while (p.previousSibling) p.previousSibling.remove();</script></body></html> would work just as well. So really, there is no way we can display this with the spaces intact, and not suffer spoofing. :-\ I think that means we can't take the patch as-is (I mean, we can, but it won't address all the issues here). (In reply to Marco Bonardo [::mak] from comment #12) > An intermediate solution to that use'case, could be to unescape the url only > when it's being edited, but it may be more risky and regression-prone. Where would we do this?

Flags: needinfo?(mak77)

Marco Bonardo [:mak]

Comment 25

•

9 years ago

(In reply to :Gijs Kruitbosch (away 24-29/3, incl.) from comment #24) > So really, there is no way we can display this with the spaces intact, and > not suffer spoofing. :-\ Right, that's why I was asking if we should decode %20 or not... As soon as we decode them, it is spoofable. Sure we won't show the lock, that is the worst part of this bug... We could decide to not decode %20, that is what may annoy devs using bookmarklets (comment 10). > (In reply to Marco Bonardo [::mak] from comment #12) > > An intermediate solution to that use'case, could be to unescape the url only > > when it's being edited, but it may be more risky and regression-prone. > > Where would we do this? it would be somewhere in the focus/blur event handlers in urlbarBindings.xml. Basically it would work similarly to the formaValue thing that currently hilights the domain. http://mxr.mozilla.org/mozilla-central/source/browser/base/content/urlbarBindings.xml#1015 Though, it's by far more regressions prone than the approach in the current patch.

Flags: needinfo?(mak77)

Marco Bonardo [:mak]

Comment 26

•

9 years ago

we could limit the decoding-on-edit to javascript urls though... that may be less scary.

Marco Bonardo [:mak]

Updated

•

9 years ago

Attachment #8731360 - Flags: review?(mak77)

:Gijs (he/him)

Assignee

Comment 27

•

9 years ago

(In reply to Marco Bonardo [::mak] from comment #26) > we could limit the decoding-on-edit to javascript urls though... that may be > less scary. I mean, selfishly, I use data: URIs fairly frequently for testing, and having spaces would annoy me. Just sticking: data:text/html, <body>Hello this is a test</body> in Google Chrome also leaves all the spacing alone and unescaped, even when not editing. Dan, what's the bar for spoofing that we want to be using here?

Flags: needinfo?(dveditz)

Daniel Veditz [:dveditz]

Comment 28

•

9 years ago

(In reply to :Gijs Kruitbosch from comment #11) > dveditz, let me know if you moved it to -release over fx-frontend intentionally...) I did. It's a low-severity bug and somewhat similar to a fixed one we've announced and unhidden. I figured better internal access was a win, and we didn't need the extra security of compartmentalization--especially since this kind of straddles front-end and networking. For example, we could decide the data: protocol handler should error out on any non-ascii encountered in the media type section. The various relevant RFCs would seem to give us permission to do that although it'll probably break something somewhere. But then, as you point out in comment 24, the spaces and spoofy stuff (like the lock icon) could be in the body and we can't reject that. All we can do is decide to unescape or not. We might have to leave this wontfix like Chrome.

Group: firefox-core-security → core-security-release

Flags: needinfo?(dveditz)

Marco Bonardo [:mak]

Comment 29

•

9 years ago

(In reply to Daniel Veditz [:dveditz] from comment #28) > We might have to leave this wontfix like Chrome. not decoding the lock could still be a good thing to do regardless.

:Gijs (he/him)

Assignee

Comment 30

•

9 years ago

Comment on attachment 8731360 [details] [diff] [review] Patch v1.0 Review of attachment 8731360 [details] [diff] [review]: ----------------------------------------------------------------- (In reply to Marco Bonardo [::mak] from comment #29) > (In reply to Daniel Veditz [:dveditz] from comment #28) > > We might have to leave this wontfix like Chrome. > > not decoding the lock could still be a good thing to do regardless. I agree. Re-requesting review based on the last few comments. :-)

Attachment #8731360 - Flags: review?(mak77)

Marco Bonardo [:mak]

Comment 31

•

9 years ago

Comment on attachment 8731360 [details] [diff] [review] Patch v1.0 Review of attachment 8731360 [details] [diff] [review]: ----------------------------------------------------------------- OK, let's try this. It's not perfect, but the important thing is that the spoofed url is not good enough to trick people, and if we can avoid graphics and (mostly) lock icons, we are close enough to that. On the other side if we'd be more strict we'd annoy any technical user by not decoding ascii, so that part will stay a wontfix. That means things like comment 23 can still happen, but we assume the different scheme, missing colors, missing lock and the security info panel, are good enough to avoid tricking the user.

Attachment #8731360 - Flags: review?(mak77) → review+

:Gijs (he/him)

Assignee

Comment 32

•

9 years ago

https://hg.mozilla.org/integration/fx-team/rev/995e7890dd613843c3914a1d9d46676f400152c1

Carsten Book [:Tomcat]

Comment 33

•

9 years ago

https://hg.mozilla.org/mozilla-central/rev/995e7890dd61

Status: ASSIGNED → RESOLVED

Closed: 9 years ago

status-firefox48: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → Firefox 48

firace

Reporter

Comment 34

•

9 years ago

Sorry, I'm not very familar with Firefox development. Is a nightly build with this fix already available somewhere?

:Gijs (he/him)

Assignee

Comment 35

•

9 years ago

(In reply to firace from comment #34) > Sorry, I'm not very familar with Firefox development. > Is a nightly build with this fix already available somewhere? Yes, https://nightly.mozilla.org .

firace

Reporter

Comment 36

•

9 years ago

Ah, that was easy! Thanks.

Philip Chee

Updated

•

9 years ago

Comment 37

•

9 years ago

Verified as fixed using Firefox 48 beta 10 under Win 10 64-bit, Ubuntu 14.04 64-bit and Mac OS X 10.11.

Status: RESOLVED → VERIFIED

status-firefox48: fixed → verified

Al Billings [:abillings - ex-MoCo]

Updated

•

9 years ago

Alias: CVE-2016-5251

Whiteboard: [necko-backlog] → [necko-backlog][adv-main48+]

Daniel Veditz [:dveditz]

Updated

•

8 years ago

Group: core-security-release

testcase.html 9 years ago firace 689 bytes, text/html		Details
Patch v1.0 9 years ago :Gijs (he/him) 4.79 KB, patch	mak : review+	Details \| Diff \| Splinter Review
test.html 9 years ago Marco Bonardo [:mak] 982 bytes, text/html		Details