Closed Bug 279099 (punycode) Opened 19 years ago Closed 16 years ago

Protect against homograph attacks (spoofing using punycode IDNs)

Categories

(Core :: Networking, defect, P3)

Tracking

RESOLVED FIXED
mozilla1.8beta3

People

(Reporter: ericj, Assigned: gerv)

References

(Blocks 1 open bug)

Details

(Whiteboard: [sg:spoof])

Attachments

(9 files, 3 obsolete files)

User-Agent:       Mozilla/5.0 (Windows; U; Windows NT 5.0; rv:1.7.3) Gecko/20040913 Firefox/0.10
Build Identifier: Mozilla/5.0 (Windows; U; Windows NT 5.0; rv:1.7.3) Gecko/20040913 Firefox/0.10

firefox (and other unnamed browsers) incorrectly handle punycode-encoded domain
names.   This allows attackers (namely phishers) to spoof urls of just about any
domain name, including ssl certs. 

Proof of concept url:

http://www.shmoo.com/testing_punycode/

The links are directed at "http://www.pаypal.com/", which the punycode
handlers render as "www.xn--pypal-4ve.com"

The domain was just registered, so the root servers may not have gotten it yet.
 Point your dns servers at '216.254.21.212' if you have problems. 

Here's what I think the bug is:

1.  firefox (and mozilla) should warn the user if punycode is in use at all
2.  You should consider validating the ssl cert with the non-decoded version of
the website

Just in case it's not clear, an attack case could be an ebayer/phisher who
includes links to paypal in their auction. When the auction ends, the buyer
clicks on the paypal link (which is a punycode/proxy to the real paypal), and
proceeds to steal all of their private green bits. 

I have not done any platform testing, or tested any other versions of
mozilla/firefox/etc.  I assume this bug is cross-platform.

The proof of concept urls are hosted on a personal server, and as such, I'd like
to have a chance to bring them down before this bug becomes public.  Please
email me at ericj@shmoo.com before marking this bug public.  

/me goes and reads up on the mozilla bounty program. 

Reproducible: Always
This bug impacts many other browsers, and I'm working on notifying them
right now.

Based on the critical nature of this bug, I believe it's best to:

1.  not notify the public until all vendors have been notified & have a
chance to release updates
2.  set a fixed date on which this vulnerability will become public (so
no one company releases details before others have a chance to release updates).

That date will be 2/5/05, unless folks convince me to delay this action.

Thanks,
Ericj
206.321.3411
Attached file more examples
from a spreadfirefox.com blog I found out this morning about
http://www.retrosynth.com/misc/phishing.html which plays with the same idea:
  www.xn--amazn-mye.com
  www.xn--micrsoft-qbh.com
  www.xn--papal-fze.com

These three were registered to Jesse C Lee (Witchita, KS) on Jan 8, 2005. The
retrosynth page was last updated (created?) Jan 16, 2005, presumably by the
site owner Cary Roberts in Mountain View, CA. What's the connection? What's the
connection between retrosynth and the spreadfirefox blogger? This may already
be widely known.
Darin: any ideas?
Assignee: firefox → darin
Status: UNCONFIRMED → NEW
Component: General → Networking
Ever confirmed: true
Product: Firefox → Core
QA Contact: general → benc
Whiteboard: [sg:fix]
Version: unspecified → Trunk
Opera has responded:

Date: Thu, 20 Jan 2005 18:06:30 +0100
From: bug-161715-s10@bugs.opera.com
To: ericj@shmoo.com
Subject: Your bug report


Hello Eric,

What you illustrate is an inherent problem with IDNA and the international
Unicode character set. On many systems success may depend on which fonts and
languages the user has installed (and what is included in the default installation).

There was a discussion about a similar issue in our forums a couple of days ago:
<URL:
http://groups.google.com/groups?threadm=tmgou051aaovjqh2isd5shkcel8rp4j96q%404ax.com
>

Unfortunately, I do not believe your suggestion of warning the user about IDNA
encoded names in the name of secure servers is practicable. It might look
that way when you are dealing with spoofsites such as your example, but it would
be maddening for Chinese and Japanese websurfers, in fact it would also
irritate many European (e.g. French, German and Scandinavian) surfers who are
using languages with characters that will generate punycode servernames.

The problem about spoofing websites using IDNA is IMO best solved by the
domainname registrars, by limiting on their side the character-combinations they
want to accept in a domainname. AFAIK such limitations are implemented in (e.g.)
the Norwegian zone, but Verisign has not yet implemented something
similar, which is understandable given the worldwide use of .com domains.

Please note that Wand or cookies will not be tricked by this kind of servername.

--
Sincerely,
Yngve N. Pettersen

********************************************************************
Senior Developer                             Email: yngve@opera.com
Opera Software ASA                   http://www.opera.com/
Phone:  +47 24 16 42 60              Fax:    +47 24 16 40 01
********************************************************************
Component: Networking → Bookmarks
Product: Core → Firefox
Version: Trunk → 1.0 Branch
We should consider adding opera to the CC list on this bug:

bug-161715-s10@bugs.opera.com

Cheers, 
Eric
It turns out this attack was talked about several years ago; it was called the
'homograph attack'

http://www.cs.technion.ac.il/~gabr/papers/homograph.html

The problem today is that several browsers support this right out of the box. 
This introduces a huge security risk for users. 

Filtering at the registrar level is possible, but VERY hard.  They should not
allow mixed-byte or multi-language encodings, and should consider blacklisting
some of the chars from the punycode encode process. 

However, as a user of firefox, I see no method for me to disable punycode support. 

This is not just a browser bug - - it's a standards bug.  But early adoption
means that firefox & CO needs to deal with it at some level (even if it means
disabling puny support, or ssl + puny support). 

I don't know what the right answer is.  I'm just saying: "TODAY THIS IS A HUGE
PROBLEM FOR FIREFOX SECURITY". 
That opera address is not registered in bugzilla and can't be CC'd, but we have
contacts at Opera and will work through those.
Summary: CRITICAL SECURITY VULN: punycode allows attackers to spoof urls/ssl certs → punycode allows attackers to spoof urls/ssl certs
After talking about this bug with a few other security folks, I have some ideas
I'd like to share. 

1.  Different validation of ssl certs.  Currently, the browser encodes the
unicode into punycode, loads the website, and validates that the puny encoded
domain matches the ssl cert.  I think this is a problem.  The browser should
validate the  cert name with the raw unicode text (you can generate ssl certs
with unicode CNs - - I tested this). 

2.  Filtering should happen at both the browser level and the registrar level. 
Example filtering should include:   

   A. not mixing double-byte & single byte punycode wrapped domain names.  This
makes it much harder to spoof domain names, as most other codepages don't have
standard latin in them.
   B. Validation of codepage.  Ensure that all chars in a domain are part of ONE
codepage set, not mixed.
   C. Don't allow bad unicode chars (see MS Press "Writing Secure Code, 2nd
Edition", page 379) such as non-shortest encodings of UTF8->punycode (see the
sketch after this list).
   D. Block some 'non-alpha' chars in other code pages.  An example is Unicode
05B4, which looks like the latin period '.'
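
A quick aside on item C: modern strict UTF-8 decoders already reject
non-shortest-form sequences, so one hedged first line of defense is simply to
decode any byte input strictly before it reaches the punycode conversion. A
minimal Python sketch:

# Item C: a strict UTF-8 decode rejects non-shortest-form ("overlong")
# byte sequences before they can reach the punycode encoder.
overlong_slash = b"\xc0\xaf"       # overlong 2-byte encoding of '/'
try:
    overlong_slash.decode("utf-8")
except UnicodeDecodeError as err:
    print("rejected:", err)        # strict decoders refuse overlong forms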

IDN Filtering is a complex subject, and is highly prone to errors.  

3.   Display a country flag next to the addressbar/domain name.  Display icon or
something showing the current language the domain is in. 

4.  Must have feature:  Disable/enable IDN in all mozilla products. 

Anyway, I hope some of these ideas get some traction or result in some better
solution...

If I can assist in any way (testing, providing evil ssl certs, whatever) please
let me know. 

Cheers, 
Eric
>3.   Display a country flag next to the addressbar/domain name.  Display icon or
>something showing the current language the domain is in. 

hm? how would mozilla know the language?

> 4.  Must have feature:  Disable/enable IDN in all mozilla products. 

network.enableIDN, this already exists
Both omniweb & konqueror correctly discover that the ssl cert isn't valid for
that domain.  That means they are not checking the puny encoded version of the
domain with the CN, they are checking the UTF-8 version of the domain with the
CN of the cert.  

This is what I expect firefox to do.  I'll attach a screenshot shortly. 

Also note that they display the alternate script with a different font - - -
making it (more) clear that something phishy is going on. 
If the behavior you describe means that IDN sites simply can't use SSL, then,
sure, it would fix this bug, but that would be a pretty serious bug in itself. 
If it doesn't mean that they can't use SSL, then it doesn't help this bug at all.
The two attachments demonstrate the 'other' behavior that browsers have when
validating CNs with IDN sites.

Omniweb & Konqueror validate the UTF8 domain with the CN
firefox/mozilla, safari, any gecko-powered browser validate the puny encoded
domain with the CN

At this point, I'm not sure which one is correct; but there should be a correct
method for using ssl with IDN.  Perhaps this is because the existing RFCs don't
really talk about ssl + IDN.
Flags: blocking-aviary1.0.1?
This bug really has two parts:
-- should we be expecting the domain names in SSL certs to be punycode-encoded,
or raw Unicode?
-- how do we deal with homograph attacks using punycode-encoded domain names?

The first question should be quite easy to resolve, and if necessary, fix. I've
filed it as bug 280839. Let's focus this bug on discussion of the second point,
which will be much harder to address.
OS: Windows 2000 → All
Summary: punycode allows attackers to spoof urls/ssl certs → Protect against homograph attacks (spoofing using punycode IDNs)
Alias: punycode
*** Bug 281381 has been marked as a duplicate of this bug. ***
Group: security
*** Bug 281428 has been marked as a duplicate of this bug. ***
See http://www.unicode.org/Public/4.0-Update/Scripts-4.0.0.txt for a Unicode
code-point to script mapping table. 

Now consider the following algorithm as a first hack:

We first divide the different Unicode script families into "potentially
confusable" equivalence sets: for example, LATIN, CYRILLIC and GREEK are
potentially confusable, as they each contain characters with lowercase glyphs
that look like 'c' or 'a'. However, LATIN and ARABIC do not contain any similar
characters, so they are not "potentially confusable". We put this information in
a (suitably compressed) look-up table. This now leads naturally to a simple
algorithm for spotting "stranger" characters in the context of another
"potentially confusable" script (ie different script, but same script
equivalence set). 

Note that there are still more things to look out for:
* we should canonicalise the string with NAMEPREP first, since we can't rely on
the registrar to do so
* font variant characters
* double-width and half-width characters
* expansion of ligatures, roman numerals etc.

Even then, some tricky but potentially dangerous cases are still left out, such
as the fact that the ANGSTROM SIGN is in the LATIN script family, even though it
is visually indistinguishable from LATIN CAPITAL LETTER A WITH RING ABOVE. This
makes it very difficult to put a solution in place without creating a false
sense of security.

On the other hand, the Unicode .pdf charts _do_ appear to contain a detailed
cross reference of visually confusable characters, as do the charts in the
Unicode book. However, I cannot find this information anywhere online. With the
scripts information, and the cross-reference information, we could probably
construct a serious character-level "confusion table" which would very
effectively catch spoofing attacks. Does anyone have any good contacts in the
Unicode Consortium who could release this information to us in machine-readable
form? (For example, letting us know the decrypt password for the existing
character chart .pdfs would enable us to extract this information; the original
data would be even better).
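
To make the idea concrete, here is a minimal Python sketch of the
equivalence-set check. The ranges and the confusable set below are a tiny
illustrative subset, not the real Scripts-4.0.0.txt data:

# Tiny illustrative subset of the Unicode script assignments.
SCRIPT_RANGES = [
    (0x0041, 0x024F, "LATIN"),      # Basic Latin through Latin Extended-B
    (0x0370, 0x03FF, "GREEK"),
    (0x0400, 0x04FF, "CYRILLIC"),
    (0x0600, 0x06FF, "ARABIC"),
]
CONFUSABLE_SETS = [{"LATIN", "GREEK", "CYRILLIC"}]   # mutually spoofable

def script_of(ch):
    cp = ord(ch)
    for lo, hi, name in SCRIPT_RANGES:
        if lo <= cp <= hi:
            return name
    return None                     # digits, hyphen, unmapped ranges

def has_confusable_mix(label):
    # Flag a label that mixes two scripts from one confusable set.
    scripts = {s for s in map(script_of, label) if s is not None}
    return any(len(scripts & cset) >= 2 for cset in CONFUSABLE_SETS)

print(has_confusable_mix("p\u0430ypal"))   # True: Latin + CYRILLIC SMALL A
print(has_confusable_mix("paypal"))        # False: pure Latin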
A spoofed domain name doesn't have to mix character sets. As an extreme example,
you could use simply letters from the MATHEMATICAL SANS-SERIF SMALL series.

Also, it's probably going to become quite common to mix sets, e.g. with mixed
English-Japanese site names.
(In reply to comment #18)
> * font variant characters
> * double-width and half-width characters
> * expansion of ligatures, roman numerals etc.

Aren't these all taken care of by NFKC normalization (which we already do before
display)?
OK, after some more grovelling around in the Unicode mailing list archive, I've
found the following file: http://www.unicode.org/Public/UNIDATA/NamesList.txt

This has the cross-reference data in it, giving both exact and approximate
visual similarities between the characters, and also code-point equivalents for
ligatures etc.  Together with the script-family data, this is probably a good
starting point for an anti-spoof algorithm.
After reading TR#15, yes, NFKC normalization won't hurt at all: we should do it
as a first step, before anything else. Indeed, we should do a full NAMEPREP.

A question; DNS is case-insensitive, but sometimes visual collisions may be
case-sensitive. For example, Greek capital Alpha collides with Latin capital A,
but not for the lowercase versions. NAMEPREP implies NFKC normalization and the
use of STRINGPREP tables B.1 (deletion of silly characters) and B.2 (case
folding; RFC 3454 implies folding to lowercase). Should we look for collisions
in either upper or lowercase, or is it safe to restrict to lowercase only?
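
For what it's worth, those two steps are easy to experiment with in Python.
This is only a rough stand-in for full NAMEPREP (which also applies the
STRINGPREP mapping and prohibition tables), but it shows why folding to
lowercase removes the Alpha/A collision:

import unicodedata

def nameprep_lite(label):
    # Rough stand-in for NAMEPREP: NFKC-normalize, then case-fold.
    return unicodedata.normalize("NFKC", label).casefold()

# GREEK CAPITAL LETTER ALPHA collides visually with Latin 'A', but after
# case folding it becomes lowercase alpha, which no longer looks like 'a':
print(nameprep_lite("\u0391BC"))   # 'αbc'
print(nameprep_lite("ABC"))        # 'abc'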
> 4.  Must have feature:  Disable/enable IDN in all mozilla products. 
network.enableIDN, this already exists

but does not appear to be working. I set that in prefs.js
user_pref("network.enableIDN", false);
restarted firefox, went to http://www.shmoo.com/idn/, clicked on the
URL and got 'meeow'.

about:config name/value shows: network.enableIDN false
This issue is being intensely discussed in the CAcert newsgroup. There *may* be
some useful insight there.

Subject:   Bug in Mozilla based browsers could cause big security problems...
Newsgroup: gmane.comp.security.cacert
Thread:    news://news.gmane.org:119/4207362A.2020208@cacert.org
-> core:networking
Component: Bookmarks → Networking
Product: Firefox → Core
Target Milestone: --- → mozilla1.8beta
Version: 1.0 Branch → Trunk
> At this point, I'm not sure which one is correct; but there should be a correct
> method for using ssl with IDN.  Perhaps this is because the existing RFCs don't
> really talk about ssl + IDN.

I think it makes more sense to compare the punycode value of the hostname to the
cert since that is the value of the hostname used with DNS to resolve the IP
address.  It seems like a bug to me that KHTML and Opera do otherwise.

As with many of the older internet specifications (DNS, HTTP, Cookies, etc.),
IDN names are intended to be converted to punycode before being used.  So, it is
an odd choice to treat certs as somehow different.
> but does not appear to be working.

See bug 261934.  The bug was fixed recently on the trunk.  The patch applies
cleanly on the 1.7 branch.
Status: NEW → ASSIGNED
*** Bug 281439 has been marked as a duplicate of this bug. ***
(In reply to comment #21)
> OK, after some more grovelling around in the Unicode mailing list archive, I've
> found the following file: http://www.unicode.org/Public/UNIDATA/NamesList.txt
> 
> This has the cross-reference data in it, giving both exact and approximate
> visual similarities between the characters, and also code-point equivalents for
> ligatures etc.  Together with the script-family data, this is probably a good
> starting point for an anti-spoof algorithm.

An algorithm which looks purely at specific character pairs will remain a point
of weakness. If a flaw leaves the user with no other protection, then each flaw,
big or small, will be announced with all the gravitas of a full security
vulnerability. The spreadfirefox people don't need this. 

Detection of potential problems needs to operate on several levels, and I think
we need a top down approach, with warnings on by default and user configurable,
so that the browser is safe `out of the box'. 

For example the warning could be displayed 
 1. the first time a new codeset is encountered in a URL
 2. the first time a particular pair of codesets are used together in a URL.
The user may disable this warning for future encounters with that character set
or combination of character sets, or may leave the warning enabled but create an
exception for that particular site.

This would catch almost all of the problem without getting into the detail of
similar appearing characters. Below this would be the more detailed algorithm
for flagging potentially ambiguous constructions. However with such broad
general protections in place, this could now be implemented on a per
codeset-pair basis.
With respect, confirmation alerts do not make you "safe out of the box";
they merely make you *annoyingly* unsafe, since people don't read them. If
mostly-reliable homograph attack detection turns out to be at all practical, 
I suggest a Thunderbird-style banner along the top of the page: 
"&brandShortName; thinks this site is a fraud. (Tell Me More) (Not a Fraud)" 
Disable form controls + applets + plug-ins unless "Not a Fraud" is clicked.
RFC 3490 section 10 (http://www.apps.ietf.org/rfc/rfc3490.html#sec-10)
apparently outlines some high-level suggestions for dealing with this problem.
I heard about this (after hearing the initial warning in 2000) and have followed
several sets of directions to disable it, from going to about:config and turning
the network.enableIDN off to going to the compreg.dat and editing out the lines
mentioning IDN (which was then overwritten by firefox), but have been unable to
turn it off. I have restarted (all copies) of my Firefox 1.0 browser so the
settings should have taken effect. I use Suse 9.1 and cannot tell the difference
between the two urls unless I watch the status bar while its loading, meaning I
would have to go out of my way to verify authenticity of some sites.... Is there
a way to turn this off as the other ways I have been told of dont seem to be
working? Spoofstick plugin doesnt help either. Any help would be appreciated.
> the network.enableIDN off to going to the compreg.dat and editing out the lines
> mentioning IDN (which was then overwritten by firefox), but have been unable 

The preference is indeed broken.  See bug 261934, which has the fix for the
preference.

You should be able to get around this problem by editing compreg.dat as
suggested, just make sure that you edit the compreg.dat that lives in your
Firefox profile directory.  Keep in mind that Firefox re-generates compreg.dat
whenever a new extension is installed, so this is not a great solution.
Here are some potentially interesting references on this issue:
*  "The Homograph Attack", Communications of the ACM, 45(2):128, February 2002
http://www.cs.technion.ac.il/~gabr/papers/homograph.html
* Method for detecting a homographic attack in a webpage by means of language
identification and comparison http://www.priorartdatabase.com/IPCOM/000010253/
*  Draft Unicode Technical Report #36, Security Considerations for the
Implementation of Unicode and Related Technology
http://www.unicode.org/reports/tr36/tr36-1.html
* IDN Language Table Registry http://www.iana.org/assignments/idn/ 
* IANA registered language table list:
http://www.iana.org/assignments/idn/registered.htm

Regarding the last link: note how the registered tables for Greek, Hebrew and
Arabic do not include any Latin letters. On the other hand, the tables for
Japanese, Thai and Korean _do_, but these scripts are sufficiently unlike Latin
script that no confusion is likely to occur between their native characters and
the Latin characters. As yet, there is no registered table for Cyrillic, but I
doubt that it would need Latin characters in it.

There is also quite a lot of activity on the Unicode mailing list about this
topic. http://www.unicode.org/consortium/distlist.html
Here are some more useful references:

* ICANN Briefing Paper on IDN Permissible Code Point Problems
http://www.icann.org/committees/idn/idn-codepoint-paper.htm
* ICANN Input to the IETF on Permissible Code Point Problems
http://www.icann.org/committees/idn/idn-codepoint-input.htm
*** Bug 281496 has been marked as a duplicate of this bug. ***
This is a proposed "blacklist" of valid Unicode character ranges which are
unlikely to ever be used in any valid domain name in any language. The names of

ranges are those given by the Unicode Consortium. Note that this blacklist will

not _of itself_ eliminate the homograph problem, but it will substantially
reduce the number of possible characters avaliable for homograph spoofing. At
the moment, I make no proposal as to how the blacklist should be used; it's
just
a collection of character ranges containing characters that make no sense being

included in any domain name, in any language. 

I would appreciate any comments regarding ranges that should be added to or
taken out of this list.

Not that the above assumes that NAMEPREP has been applied first to normalise
the string prior to scanning for blacklisted characters.
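
For anyone who wants to experiment, here is a sketch of how such a blacklist
could be applied. The ranges below are placeholders standing in for the real
list in the attachment:

from bisect import bisect_right

# Placeholder ranges; the real list is the attachment above.
BLACKLIST = [
    (0x2100, 0x214F),    # Letterlike Symbols (includes ANGSTROM SIGN)
    (0x2460, 0x24FF),    # Enclosed Alphanumerics
    (0x1D400, 0x1D7FF),  # Mathematical Alphanumeric Symbols
]
STARTS = [lo for lo, _ in BLACKLIST]

def is_blacklisted(ch):
    # Binary-search the sorted range list for ch's code point.
    cp = ord(ch)
    i = bisect_right(STARTS, cp) - 1
    return i >= 0 and cp <= BLACKLIST[i][1]

# MATHEMATICAL SANS-SERIF SMALL A, from the all-maths spoof mentioned earlier:
print(is_blacklisted("\U0001D5BA"))   # True
print(is_blacklisted("a"))            # False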
*** Bug 281507 has been marked as a duplicate of this bug. ***
A programmatic analysis of homographs from the Unicode data shows 2661 possible
cross-script one-way clashes.

Applying the blacklist in the attachment (but without the ideographic
description characters, which will probably be needed) reduces this to only 462
cross-script one-way clashes. 

More analysis to follow.
only for reference: Secunia Advisory SA14163 (Mozilla / Firefox / Camino IDN
Spoofing Security Issue)

http://secunia.com/advisories/14163/
http://secunia.com/multiple_browsers_idn_spoofing_test/
I think I fixed it on my Mac. I took a reference from
http://users.tns.net/~skingery/weblog/2005/02/permanent-fix-for-shmoo-group-exploit.html

They refer to a file called compreg.dat, but they locate it in the user profile
data. I located one in my main install. 

/Applications/Mozilla.app/Contents/MacOS/components

That is where I changed it. I even reset the about:config back to default. It
seems to work.

You change the one line of text in compreg.dat from ...

(Scroll down to the [CONTRACTIDS] section ...)

@mozilla.org/network/idn-service;1,{62b778a6-bce3-456b-8c31-2865fbb68c91}

Change the 1 to a 0 so the line reads:

@mozilla.org/network/idn-service;0,{62b778a6-bce3-456b-8c31-2865fbb68c91}

This really worked under Mozilla for Mac. The "paypal" spoof no longer works in
my Mozilla browser.
(In reply to comment #42)
It is easier to update to current branch builds (or trunk if you want) and use
the pref. See bug 281506 comment 1.
There are also ASCII characters that look very similar with some fonts: l
(lowercase L), 1 (digit), I (uppercase i).
FYI, Unicode.org has a proposed draft tech report:

 Proposed Draft Unicode Technical Report #36 (1.0 version dated 2004-10-12) 
 Security Considerations for the Implementation of Unicode and Related Technology

which includes a section on Visual Spoofing:
  http://www.unicode.org/reports/tr36/#visual_spoofing

which lists 2 recommendations:

(1) Cross-Script Spoofing: the user should be alerted to these cases by
displaying mixed scripts with some special formatting to alert the user to the
situation. For example, a different color and special boundary marks, are used
in Example 2c. A tool-tip can be displayed when the user moves the mouse over
the address to display more information about the situation.

(2) Inadequate Rendering Support: Browsers and similar programs should follow
the Unicode Standard guidelines to avoid spoofing problems. There is a technical
note, UTN #2: Rendering Combining Marks (http://www.unicode.org/notes/tn2/),
which provides information as to how this can be implemented even in the absence
of font support.
*** Bug 281474 has been marked as a duplicate of this bug. ***
Here's a workaround for linux, I'm sure there's something similar in other 
os's, but I don't have access to them to look.  This does disable all idn 
service lookups as far as I can tell.  This should help with the security 
issue at the moment until a more feasible solution can be found. 
 
 open a terminal and type... 
 $ cd ~/.mozilla/firefox/ 
  
 in that folder will be another folder where the name will depend on your
profile name; if you used the default, the folder will be
 foobar.default
  
 change to the *.default folder and type... 
 $ vim (or vi, kvim, gvim, scite, etc) compreg.dat 
  
 now use vi's search function by typing.... 
 /idn-service;1 
  
 You will find two locations that match it, highlight the 1 with the cursor, 
and use the 'r' key to replace the 1 with a 0.  
 Do this for both locations, then go back to www.shmoo.com/idn and test, and
it won't allow you to navigate to the page.  I've tried testing it on a few 
fake sites and it doesn't allow navigation to them. 
After more analysis of the Unicode cross-reference tables, I can see that an
attempt to enumerate 100% of all possible homograph sets is probably not
feasible without massive effort (although making equivalence classes from the
crossrefs has found a great many). However, it has given me a lot more insight
into the problem.

Homographs are generally unpopular within a single writing system. On the other
hand, many simple symbols have been either re-used or re-invented in many
alphabets. So the secret of homograph spoofing is mixing languages and/or symbol
sets. 

This proposal suggests a method for detecting language mixing.

However, there is not a 1:1 correspondence between writing systems and code
ranges. Some writing systems are split across a number of code ranges; others
use characters from other writing systems -- for example, both Cyrillic and
Japanese use the ASCII numerals. Nor is there a 1:1 mapping between writing
systems and languages; for example, Japanese uses four distinct writing systems.

However, we _should_ be able to map from _sets_ of code point ranges, some
per-character attributes, and one small set of special case characters, to the
plausibility of a DNS label.

So how about the following algorithm for a single label in a domain name:
1. Run the string through NAMEPREP.
2. If there are leading combining characters, reject as malformed.
3. Assign each character to a character range, according to the official Unicode
code point ranges; except that: characters 0123456789 and HYPHEN are special,
and go in a special range of their own.
4. If there are any characters from "blacklisted" code point ranges, reject the
string as suspicious. A blacklist is a powerful way of limiting spoofers' options. 
5. If there are any other Unicode punctuation characters apart from HYPHEN,
reject as suspicious.
6. If there are any Unicode whitespace characters, reject as suspicious.
7. Now look at the set of character ranges used; are they compatible with a
single writing system/language set? This would consist either of one range and
optional ASCII digits + HYPHEN, or any of a number of hard-coded sets dealing
with cases such as Japanese and Chinese. If the set of ranges is not compatible
with a single script, reject the string as suspicious.
8. If all the tests above pass, return OK. (A rough sketch of steps 1-8 follows below.)

This would certainly raise the bar for spoofers to jump over quite
substantially, and would not be very code intensive; the script-lookup code is
tiny, and the number of special cases rather small, even when considering
obscure languages. 

If this looks plausible, we can then use the test homographs I've discovered,
and the existing spoofing examples, to test the effectiveness of such an algorithm.

There are still other issues to look at, even if this is a possible solution:
* Forwards-compatibility and future Unicode allocation policy
* RFC compliance
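
Here is a rough, runnable Python sketch of steps 1-8. The range tables,
blacklist and compatible-set list are deliberately abbreviated placeholders:

import unicodedata

RANGES = {
    "LATIN": (0x0061, 0x024F),
    "CYRILLIC": (0x0400, 0x04FF),
    "HIRAGANA": (0x3040, 0x309F),
    "KATAKANA": (0x30A0, 0x30FF),
    "CJK": (0x4E00, 0x9FFF),
}
SPECIAL = set("0123456789-")                    # step 3's special range
BLACKLIST = [(0x2100, 0x214F)]                  # abbreviated
COMPATIBLE = [{"LATIN"}, {"CYRILLIC"},          # step 7's hard-coded sets
              {"HIRAGANA", "KATAKANA", "CJK", "LATIN"}]   # Japanese

def check_label(label):
    label = unicodedata.normalize("NFKC", label).casefold()   # ~step 1
    if label and unicodedata.combining(label[0]):             # step 2
        return "malformed: leading combining character"
    used = set()
    for ch in label:
        if ch in SPECIAL:                                     # step 3
            continue
        cp = ord(ch)
        if any(lo <= cp <= hi for lo, hi in BLACKLIST):       # step 4
            return "suspicious: blacklisted character"
        cat = unicodedata.category(ch)
        if cat.startswith("P"):                               # step 5
            return "suspicious: punctuation"
        if cat.startswith("Z"):                               # step 6
            return "suspicious: whitespace"
        for name, (lo, hi) in RANGES.items():
            if lo <= cp <= hi:
                used.add(name)
                break
    if used and not any(used <= ok for ok in COMPATIBLE):     # step 7
        return "suspicious: mixed scripts %s" % sorted(used)
    return "OK"                                               # step 8

print(check_label("p\u0430ypal"))   # flagged: Latin + Cyrillic mix
print(check_label("bücher"))        # OK: single (Latin) writing system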
*** Bug 281578 has been marked as a duplicate of this bug. ***
I have attached a small Python program that analyzes internationalized domain
names for cross-script homograph attacks, and tries to detect various possible
kinds of spoofing.
This is a slightly more paranoid version of the previous IDN checker. Doubtless
it still has many deficiencies. However, I would be interested in any comments
about how it might be improved, or interesting counterexamples that it
currently cannot detect.
Attachment #173811 - Attachment is obsolete: true
Blocks: 281496
After yet more thinking, I can identify three different categories of potential
problem:

1. Homographs that cross language/writing system boundaries. We should be able
to catch all or nearly all of these programmatically, eliminating a vast class of
attacks.

2. Homographs within a single language/writing system. Unfortunately, the
writing system most affected by this is the Latin writing system. For example,
consider the problem of detecting spoofing by using the many variants of the
letter 'i'.

3. Confusion generated by _semantic_ duplicates. This is a major problem for the
CJK writing system family.

TLDs which follow the single-language and label-filtration practices recommended
by IANA will be more resistant to problem 2: for example, readers of domains
within a Turkish-language domain will be sensitive to the difference between
dotted and dotless-i. 

This still fails to address problem 2 within multiple-script-system gTLDs, where
readers will not in general be sensitive to homographs outside their own
language subset of their local writing system. For example, French readers who
will easily notice cedillas on 'c's will probably not notice the i-variants
which are outside the scope of their own language. (English readers, with no
native accented letters, are of course worst off of all).

How can problems 2 and 3 be solved?

The current favourite solution is "bundling", where a single domain registration
also registers all possible variants. However, this can only occur at the domain
registrar end, or, at extra cost, by having domain registrants register all the
possible variants of their domains. 

There are several technical problems with this: 
* how do you know that you have _exhaustively_ enumerated all possible variants?
A spoofer only needs one missed variant, and they've won.
* backwards compatibility with existing registrations
* infrastructure problems; some names may potentially have thousands or even
millions of variants to be bundled  

And there are a number of business problems with this:
* the registrar incurs extra costs without extra revenue, providing a
disincentive to do it at all
* generating bundles means fewer possible strings available to sell
* potential legal liability issues; have they missed a homograph? Once they've
started along this route, should they include entries in the bundle to resolve
'0'-'o', 'l'-'I' and other possible near-misses? What about characters which are
homographs in one font, but not in another, or in one casing, but not in
another? Should they generate possible simple typos in the bundle? And so on.

Ways forward:
* we should consider a programmatic approach for problem 1, which will nip in
the bud a huge number of potential attacks in non-Latin writing systems
* problem 2 needs more thought, and it is also probably the most pressing
problem, given that almost all existing domains are registered within the Latin
writing system 
* problem 3 is a matter for oriental-language experts, and I believe this is
being discussed in the IDN community.
Regarding problem 2, homographs within the Latin writing system, here is a
heuristic that will probably catch a great many current spoofing attempts:

(This is a first hack at the logic, so please excuse any clunkiness).

* we first assume that NAMEPREP, and the checks for cross-script spoofing
(problem 1) have been applied, and passed.

* next, look at the length of the TLD name. If it is two characters long, I
believe that IANA policy requires it to be a country-code TLD (ccTLD). In this
case, assume that the ccTLD owner knows what they are doing with regard to
language and character set filtration, and return OK.

* if the TLD name is not a ccTLD, it's a gTLD. In this case, we are rather less
confident in the registrar doing the right thing. Now look at the whole FQDN. 

* Since most legacy domain names are all-ASCII, "ASCIIfy" the whole FQDN. This
means, for each character in the FQDN, changing it to the corresponding
unaccented ASCII character (a transform which is easily programmatically
computed from Unicode character names).  

* Is the ASCIIfied FQDN identical to the FQDN? Then each of its labels is
either ASCII, or belongs to just one non-Latin language (see the check for
Problem 1). In any case, if the ASCIIfied FQDN is identical to the FQDN, return OK. 

* Now look up the "ASCIIfied" FQDN. If this name lookup returns a value, then
the FQDN is _possibly_ spoofed. If the ASCIIfied name lookup fails, we return OK.

Note that this does not catch spoofing attempts for one non-ASCII Latin IDN
domain against another non-ASCII Latin IDN domain; however, since most legacy
domains, including most current high-value targets, are pre-IDN names, this will
go a long way to ameliorating problem 2.

Downside: an extra name lookup for non-ASCII Latin-script IDNs in gTLDs. Name
lookup caching will, of course, greatly reduce this overhead.

Another downside is that it means that if the domain owner registered both the
accented and non-accented variants (very common in Europe!) their accented
version will raise an alarm each time.
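
A hedged Python sketch of the lookup heuristic, with accent-stripping
approximated by NFKD decomposition plus dropping combining marks (and note the
caveats in the following comments; a positive result here is only a hint, not
proof of spoofing):

import socket
import unicodedata

def asciify(fqdn):
    # Approximate the "ASCIIfy" transform: NFKD-decompose, then drop the
    # combining marks, leaving the unaccented base characters.
    decomposed = unicodedata.normalize("NFKD", fqdn)
    return "".join(c for c in decomposed if not unicodedata.combining(c))

def possibly_spoofed(fqdn):
    # True if the accent-stripped twin of fqdn also resolves in DNS.
    stripped = asciify(fqdn)
    if stripped == fqdn:
        return False                 # already plain: return OK
    try:
        socket.gethostbyname(stripped)
        return True                  # the plain twin exists: possible spoof
    except (socket.gaierror, UnicodeError):
        return False                 # lookup failed: return OK

# e.g. possibly_spoofed("www.bücher.ch") asks DNS whether www.bucher.ch exists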
Ian, I was going to snap back with "yes, but if they resolve to the same IP
address, we know they're the same!", but then I thought about round-robin DNS,
HTTP redirect, Akamai, anycast, and so on... 

And then I thought about automated WHOIS queries, but only as a joke.

Back to the drawing board with that last bit, then.
Hmm. I'm having quite a hard time working out what might work in general.

* Reverse DNS won't work for virtual hosted domains
* Looking at the MX records won't work; spoofable, not always present
* Looking at SPF records won't work; spoofable, not always present (SPF works
because although the _records_ might be copyable to another domain, the machines
pointed to by those addresses remain under the control of the entity that
controls the real domain)
* Looking at the NS record chain won't work because both domains might be farmed
out to the same outsourced DNS provider
* Even matching A records won't work if you are truly paranoid, since the two
domains, one valid, one not, might both be hosted on the same outsourced virtual
host provider's machine
* And note that if you use an anonymizing DNS registrar who won't publish your
details on WHOIS, even that opens you up to your spoofer registering with the
same registrar

Any ideas for a _reliable_ way of mapping domain names to controlling entities?


Nor, for that matter, after a bit of experimenting, can we say "let's just fetch
and compare their PKI certificates", if present. Most commercial sites won't
allow TLS/SSL connections on most URL paths.

Nor would it be useful to check "do they serve the same content?", even if this
check was cheap to implement. The front page of a spoofed site might well be
identical to that of the real site, with the scam being located on an inside
page. Indeed, spoofed sites would be very likely to copy the exact content of
their target's entry page, to make them more likely to resist human scrutiny.

My current thinking is on the lines of heuristics again: although any one of the
DNS-based tests in the previous comment is possible for a spoofer to get around,
what is the likelihood of them being able to accomplish several of these attacks
at once? Also, most high-profile targets tend not to outsource their web-serving
to shared virtual servers, although of course, we are then back to Akamai, which
some of them _do_ use.
just keep in mind that if you do any UI fixes we'll need to do them for camino
too, so the more you can keep in the backend the better. We'd also like this on
the 1.7 branch so camino 0.8.x can take advantage.
OK, back to the case analysis. 

There are several layers to be considered here:
* The name level, where we consider a name as an identifier, a static entity
in a vacuum
* The DNS lookup level, where names _dynamically_ map to resources such as IP
addresses, MX records and so on
* The protocol level, where DNS names are only part of the overall name; for
example, one HTTP URL might redirect to another, perhaps with a different
version of the DNS name in. Similarly, hosting companies bind content to URLs at
this level.
* The authority level; who _controls_ the given resource. This is _not_
necessarily the entity that hosts it. This appears to be crucial. PKI
certificates supposedly bind this level to the DNS name level.

Ideally, we want a strong link through the entire chain, or at least a strongly
plausible set of links which make spoofing very difficult to do without forging
many different links, and thus more likely to be detected by one of the entities
involved in making the chain.
How about displaying "strange" domain names differently? By "strange", I mean
ones that use characters outside the "normal" set for the user's language
(which we know, because they are using a particular localized version, right?).
This different display could be something as simple as placing a '!' before
the offending characters or a certain amount of space. If we want to go even
further, we could use a bold font or a larger font or both.

IDNs are intended to allow people all over the world to use their own languages
in domain names. They are not intended to allow domain owners to register
"cute" misspellings (or spoofed names). So it's OK to penalize these "strange"
domain names by displaying them differently in the URL and status bars (and
an even more conspicuous display in security dialogs such as certs).
For the tactic of displaying characters outside the user's expected language in
a distinctive way, the following expired Internet-Draft might be useful:
http://www.alvestrand.no/ietf/lang-chars.txt

This seems to be mostly useful for Latin-based languages, and some
Cyrillic-based languages.

However, to quote from the Internet-Draft:

    There are a lot of languages in the world. Estimates vary between
    500 and 6000, with some eternal conflicts about the difference
    between a language and a dialect guaranteeing that any list
    claiming to be authoritative will be the source of endless debate.

    Many of these languages have a writing system. Some have several.
    These are also likely to have changed over time, with the meaning
    of character symbols changing, the shape of the characters
    changing, or completely new characters being added, or old ones
    removed from the set. This means that even within a single
    language, a list of characters is likely to be controversial.

    These problems have made several experts in the field of languages
    and characters refuse to even consider the idea of working out
    such a list.

For other languages, we will probably have to use the Unicode code point ranges.

So, now we have three proposals for anti-spoofing techniques, which are each
potentially complementary:
* detecting broken Unicode and cross-writing-system mixes in IDN labels
* attempting to detect possible spoofs by doing a DNS lookup on an
accent-stripped version of the IDN, and checking if the two resources are the
same if both lookups succeed
* displaying characters outside the user locale in a distinctive way, or
otherwise providing a warning that they are being used

The first approach is linguistic; the second requires lookups; the third is
GUI-based.

Are there any other techniques which can be added to these?
Thinking about high-level requirements:

-We must not place excessive burden on non-latin scripts users
-We must detect spoofed domains 
-We must try to not create situations where users just 'click yes' (This is what
most users do today when getting ssl warnings). 
-We must make the IDN spoofing solution accessible (for example, only shading
background colors wouldn't meet this requirement)


Some ideas for meeting those requirements:

1.  Assuming we can come up with a clear 'name' for each script (German,
Russian, Latin, etc), we can create a whitelist of scripts which users can
manage.   If a domain gets requested that's not in the user's 'whitelist', they
get prompted/warned.  Note that all that is really required then to spoof the
domain is "Latin+ Cyrillic", but it's a step in the right direction.  

2.  Heuristic matching of IDNs - - create a database of commonly 'forged' chars
between scripts.  For example:  Cyrillic-lowcase-a looks just like
ASCII/latin-lowercase-a.  Mark these chars as 'suspect'.  If a domain has either
a very high ratio of suspect chars, or a very low ratio of suspect chars, warn
the user.   This gets rather nasty with Traditional Chinese/Japanese.   Even if
the 'suspect chars' db won't work (due to the effort required to populate it),
you can do a similar thing by matching ratios of codepages; if it's 90% latin &
10% Cyrillic, it's likely we are being spoofed (see the sketch after this list).

3.  You can detect the script/codepage the target HTML webpage is in, in order to
see if it matches the script(s) the domain uses.

4.  For SSL-related sites, the browser could display the punycode version of the
IDN next to the lock icon (today it displays the UTF-8).
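
A minimal sketch of the ratio idea from item 2. The two-script mapping and the
20% threshold are arbitrary illustrations:

def script_of(ch):
    # Minimal two-script mapping, just for this demo.
    cp = ord(ch)
    if 0x0041 <= cp <= 0x024F:
        return "LATIN"
    if 0x0400 <= cp <= 0x04FF:
        return "CYRILLIC"
    return None

def script_ratios(label):
    # Share of each script among the label's mapped characters.
    counts = {}
    for ch in label:
        s = script_of(ch)
        if s:
            counts[s] = counts.get(s, 0) + 1
    total = sum(counts.values())
    return {s: n / total for s, n in counts.items()} if total else {}

def looks_spoofy(label, threshold=0.2):
    # Suspicious if a second script appears but stays below the threshold.
    ratios = script_ratios(label)
    return len(ratios) > 1 and min(ratios.values()) < threshold

print(looks_spoofy("p\u0430ypal"))   # True: Cyrillic share is only ~0.17
print(looks_spoofy("paypal"))        # False: single script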
Yes, lists of characters used in languages are controversial, but we do not
have to use authoritative lists or even fixed ones. We should come up with an
API that can hide the strategy in the implementation. The lists of chars or
char ranges (if any) can change over time. For Japanese, for example, we might
even consider testing whether the Unicode Han character is in one of their
standard sets (i.e. JIS X 0208, JIS X 0212, etc). A Chinese character that
falls outside the primary JIS set can be flagged with a different display.

Note that this only affects the *display* of the domain name, not its actual
lookup. So it doesn't have to be perfect or exhaustive.
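
One cheap way to approximate that JIS X 0208 test in Python is to lean on a
codec whose repertoire is (close to) that set. This is a rough sketch; JIS X
0212 would need a wider codec such as iso2022_jp_2:

def in_primary_jis(ch):
    # The iso2022_jp codec covers JIS X 0201/0208, so a character that
    # fails to encode falls outside the primary JIS repertoire.
    try:
        ch.encode("iso2022_jp")
        return True
    except UnicodeEncodeError:
        return False

print(in_primary_jis("\u6f22"))   # True: common Han character (漢)
print(in_primary_jis("\u9fa5"))   # False: Han character outside JIS X 0208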
We shouldn't put the punycode next to the lock icon in Firefox, the name must be
readable. Yes, it would have helped in this paypal spoof case to be able to say
"hey, random garbage, must not be the real paypal." But for users in regions
where IDN is heavily used it isn't going to help anyone to have a large
percentage of the legit sites display unreadable random garbage. It won't help
the users tell whether they're on the right site, and it just makes the browser
look broken.
Regarding a big homograph pair table: yes, that was my first idea, and I even
attempted to compile one. After generating a _long_ list programmatically, I
realised that it didn't cover many that I could see by eye from the code tables.

Given that there are 12,886 alphabetic or symbol characters in Unicode 3.2 even
discounting CJK, Hangul and so on (which would give a grand total of 95,156 if
included), you would have to inspect n(n+1)/2 - n = 83,018,055 possible
character pairs. Including CJK etc, that would give a ludicrous 4,527,284,590
possible pairs to inspect. Eliminating great swaths of pairs by character set
alone does not work; some of the trickiest homographs are between semantically
unrelated characters in quite unrelated character sets. Remember that in the
long term, we have to consider not only the IDN-to-ASCII spoofing problem, but
IDN-to-IDN spoofing.

On the other hand, restricting each label to a single writing system, (or family
of mutually compatible writing systems, in a few special cases) works rather
well at eliminating the possible spoofing pairs, reducing the total by many
orders of magnitude. Enforcing the limit is also firmly within the spirit of
IANA's recommendations, which recommend that each label come from a single
well-defined language.

Blacklisting symbol and other specialized character sets makes the total smaller
still. Now, with all the characters in a single label limited to a single
writing-system-group (to coin a term), we need only to consider homographs
_within a particular writing-system-group_, or the possibility of constructing a
whole-label homograph. Now, whole-label homographs are possible in theory if an
entire label consists entirely of homographs from the same script-pair (consider
faking ayayay.com using Cyrillic characters), but in practice the statistical
properties of most languages are such as to make these unlikely (consider
finding a script with homographs for each of 'g', 'o', 'l' and 'e' (google), or
'a', 'm', 'z', 'o', 'n' (amazon), 'm', 'i', 'c', 'r', 'o', 's', 'f', 't'
(microsoft)). 

Once we've done this, we can then worry about confusable characters within the
writing system of a particular label, and in particular, collisions with ASCII
domain names. Only when we've done this, should we add extra code to deal with
specific dangerous character pairs we already know about.
FWIW, I created a simple Firefox extension that effectively kills IDN support:
http://friedfish.homeip.net/extensions/no-idn.xpi
(In reply to comment #62)
> 3.  You can detect the script/codepage the target HTML webpage is in, in order to
> see if it matches the script(s) the domain uses.

HTML doesn't normally have any script labelling, but the "codepage" (i.e.
charset) is sometimes indicated. When the charset is a universal one like
UTF-8, it is harder to tell what language the document is written in.

If ICANN and the like do not already have this recommendation, perhaps we
could have them add that security-conscious sites should label their HTML
documents with the language so that browsers can check that the domain name
used to get to their site does not contain characters normally found outside
their language.
(In reply to comment #65)
> On the other hand, restricting each label to a single writing system, (or family
> of mutually compatible writing systems, in a few special cases) works rather
> well at eliminating the possible spoofing pairs, reducing the total by many
> orders of magnitude. Enforcing the limit is also firmly within the spirit of
> IANA's recommendations, which recommend that each label come from a single
> well-defined language.

But some domains do have a few characters that are of a different writing system
than the rest of the domain. For example,

In literal form:
http://www.färgbolaget.nu
http://www.bücher.de
http://www.brændendekærlighed.com
http://www.räksmörgås.se
http://www.färjestadsbk.net
http://www.mäkitorppa.com
http://www.ma&#776;kitorppa.com

In escaped form:
http://www.f&#x00E4;rgbolaget.nu
http://www.b&#x00FC;cher.de
http://www.br&#x00E6;ndendek&#x00E6;rlighed.com
http://www.r&#x00E4;ksm&#x00F6;rg&#x00E5;s.se
http://www.f&#x00E4;rjestadsbk.net
http://www.m&#x00E4;kitorppa.com
http://www.m&#x0061;&#x0308;kitorppa.com
*** Bug 281674 has been marked as a duplicate of this bug. ***
> But some domains do have a few characters that are of a different writing system
> than the rest of the domain. For example,

All of your examples use LATIN characters; there are no different writing
systems in sight. See the IANA language tables referenced in comment 35.
How about showing something like this in the status bar when the user moves
the mouse over a suspicious link:

  http://www.payp!a!l.com/

If the user doesn't have the sidebar turned on and clicks on the link, then
we can show a similar string in the location bar. However, cut/copy/paste of
the location bar should not include the '!' characters.

If the user doesn't have the location bar turned on, then they won't see the
lock icon either, which means that they don't know or don't care about
security anyway. Educating the user about security is also important.
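
A rough sketch of that display transform. LATIN_OK is a stand-in for whatever
per-user "expected characters" set we settle on, and the marked string is for
display only, per the cut/copy/paste caveat above:

LATIN_OK = set("abcdefghijklmnopqrstuvwxyz0123456789-./:")

def mark_suspicious(url, ok=LATIN_OK):
    # Display-only transform: wrap each unexpected character in '!'.
    return "".join(ch if ch in ok else "!%s!" % ch for ch in url.lower())

# The Cyrillic 'а' in the paypal spoof gets flagged:
print(mark_suspicious("http://www.p\u0430ypal.com/"))
# -> http://www.p!а!ypal.com/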
(In reply to comment #71)
> If the user doesn't have the sidebar turned on and clicks on the link, then
> we can show a similar string in the location bar.

I meant to say status bar, *not* sidebar. Oops...
(In reply to comment #71)
> How about showing something like this in the status bar when the user moves
> the mouse over a suspicious link:
> 
>   http://www.payp!a!l.com/
>
> If the user doesn't have the [statusbar] turned on and clicks on the link,
> then we can show a similar string in the location bar.

I think it would be just as effective and more user friendly if suspicious
characters were rendered in a different font and/or colour, much like Konqueror
does as shown in comment 12.  Adding superfluous characters like '!' would just
be confusing.
(In reply to comment #73)
> I think it would be just as effective and more user friendly if suspicious
> characters were rendered in a different font and/or colour, much like Konqueror
> does as shown in comment 12.  Adding superfluous characters like '!' would just
> be confusing.

I had a look at comment 12, but the font does not look that different and the
color doesn't either. I still like the '!' chars.

Maybe we should go even further and refuse to load a document from a domain
name containing characters outside the user's language. After all, IE doesn't
even support IDNs out-of-the-box. In countries where IDN is used a lot, we
could support IDNs using characters from the user's language only. Or they
could add languages that they need.

Or we could load any document, but only after the user has selected an item
buried deep inside lots of disclaimers.

If some domain registrars are not following the IDN guidelines, then the
browser may be the last line of defense. We could send a strong message to
these registrars by making it more difficult for users to reach the Web
sites with names that they neglected to filter.
(In reply to comment #73)
> I think it would be just as effective and more user friendly if suspicious
> characters were rendered in a different font and/or colour, much like Konqueror
> does as shown in comment 12. 

Can't just have the characters appear in different colors because of users who
are  color blind. Very hard for some or most to tell the difference in colors
without starring at it...and who really analyzes the address bar and link anyways?

I think there should be a small dialog box (i know, i know we all hate them)
like Thunderbird implemented for suspected phishing sites...also provide a link
in the dialog box to explain what exactly they got the warning for.
(In reply to comment #53)
> Regarding problem 2, homographs within the Latin writing system, here is a
> heuristic that will probably catch a great many current spoofing attempts:
> 
> * next, look at the length of the TLD name. If it is two characters long, I
> believe that IANA policy requires it to be a country-code TLD (ccTLD). In this
> case, assume that the ccTLD owner knows what they are doing with regard to
> language and character set filtration, and return OK.

This needs exceptions though, as the .cc ending, for example, is effectively
used like a gTLD.
Also, I don't think ccTLDs are safe. http://www.amazon.de/ for example, the
German local variant of Amazon, could probably be spoofed. (Although admittedly,
this particular URL doesn't contain any characters that could really be spoofed
in German.)
(In reply to comment #60)
> How about displaying "strange" domain names differently? By "strange", I mean
> ones that use characters outside the "normal" set for the user's language
> (which we know, because they are using a particular localized version, right?).

Not right. I might be an exception, but I always try to get the English versions
of programs, even though my native language is German. Given that the internet
is mostly English, I feel more comfortable in an English environment. That
doesn't mean, however, that I don't want to access http://www.öbb.at/ (the IDN
URL of the homepage of the Austrian railway) once in a while, without getting
warnings.

Also, there aren't localizations for all languages out there.
I've been doing some more exploring, and I've found this interesting triple.

http://www.bücher.ch/ redirects you to a German online bookstore
http://www.bucher.ch/ takes you to the web site of Bucher Biotec AG

Both entirely valid domain names, registered by different entities. And before
you think "ah, yes, but in German ü is another way of saying ue" consider that:

http://www.buecher.ch/ takes you to a Swiss online bookstore
A question: what do people consider as confusable characters within the Latin
writing system?

Let's try an easy example first: does the Latin Extended-A s-cedilla in
"micro&#351;oft" jump out at the viewer if they are not looking carefully?

Less noticeable, how about the Latin Extended-A dotless 'i' in "mıcrosoft"?

Or the Latin-1 accented 'i' in "mìcrosoft"? (This last being the nastiest, as it
is both the least visible, and also in the base Latin-1 set).
(In reply to comment #77)
> Not right. I might be an exception, but I always try to get the English versions
> of programs, even though my native language is German. Given that the internet
> is mostly English, I feel more comfortable in an English environment. That
> doesn't mean, however, that I don't want to access http://www.öbb.at/ (the IDN
> URL of the homepage of the Austrian railway) once in a while, without getting
> warnings.

OK, so how about checking against a set of languages for certain TLDs? For
ccTLDs that mostly use one language, we just check against that language.
Countries that normally use more than one language would be checked against
each language, but *not* the union of the languages.

There is an Internet Draft that discusses a "one domain label, one language"
rule:

http://www.ietf.org/internet-drafts/draft-klensin-reg-guidelines-05.txt

However, we can set stricter rules for ourselves if we think that that might
protect the user from phishers. For example, we might have a "one FQDN, one
language" rule.

> Also, there aren't localizations for all languages out there.

True. Those users are out of luck. But we can use other sources for the
language(s), such as ccTLDs, as described above. We could also look at the
set of languages preferred by the user. In Firefox, you can find these
under General > Languages in the preferences/options.
See the list of attachments for some code I've written to enforce "one language,
one label" -- this maps characters to code-point ranges, and sets of code-point
ranges to writing systems, and thence to languages. This would at a stroke catch
all of the Cyrillic/Latin alphabet exploits that are the subject of the recent
announcements, and a lot of potential future nastiness between other pairs of
script systems.

However, this does not quite slam the door shut, as there is still room for
exploits within a single writing system; notably, this is worst within the Latin
writing system with its many local variants. However, we may be able to do
something there based on the user locale.

As a first question:
* can anyone think of any reason, ever, for allowing multiple writing systems in
a single label, other than in the special exceptions given for Chinese, Japanese
and Korean? Can anyone point to an existing legitimate domain name which breaks
this rule?

IMO the proper fix for the ssl case (https://paypal.com) is to remove the
UserTrust network certificate from the store. Obviously they are not doing their
job and therefore they shouldn't be trusted.
(In reply to comment #81)
> this maps characters to code-point ranges, and sets of code-point
> ranges to writing systems, and thence to languages.

Instead of using code-point ranges and writing systems, it might be better
to use a set of characters for each language, as is done in the IANA registry.
This would be more in the spirit of IANA.
Whilst this is a very complex issue, we seem to be moving towards a rough
consensus about what needs to be done...

How about doing this multi-layered set of fixes:

* firstly, we make sure users can easily turn off IDNA entirely

* secondly, we ENFORCE "one language's writing-system-set, one label", and
blacklist all symbols, dead-language scripts, and other exotica, roughly in the
way I've coded in my example programs. This kills all cross-script exploits
stone dead. This can be done at the name-normalization level, or even better at
the DNS-lookup level, so that looking up these bogus names simply gives an error:
we can give a distinct error code for this, if needed. Note that we can't
enforce "one language, one FQDN", consider "www.<something in Thai>.com" as an
example.

* thirdly, we display characters in domain names which are outside of the user's
acceptable set, as defined in (for Firefox) Options > General > Languages, in
some distinctive way: for example, by adding a question mark, so that, for an
English user, "http://www.mìcrosoft.com/" would be displayed
"http://www.mì?crosoft.com/" (see the sketch after this list). Whilst this is
'soft-security', not many people will mistake "mì?crosoft" for "microsoft". Note
that by doing a simple text-substitution on name display, this can quite easily
work in every part of the GUI, without code being needed to do exotic font or
style changes. Note that this involves more paranoid character set selection
than simply referencing code pages; however, the Alvestrand Internet-Draft cited
above seems to be a good reference for Latin languages, and these are the ones
currently with the greatest risk exposure.
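A minimal sketch of that substitution, assuming the acceptable set has already
been derived from the user's language preferences (the set below is only an
en-US-flavoured stand-in):

ACCEPTABLE = set("abcdefghijklmnopqrstuvwxyz0123456789-.")

def flag_unfamiliar(host):
    # "www.mìcrosoft.com" -> "www.mì?crosoft.com"
    return "".join(c if c in ACCEPTABLE else c + "?" for c in host)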

We could also use a page-top-banner like the one used for popup blocking in
Firefox, or spam-blocking in Thunderbird, to warn the user "This page is from a
web address which contains unfamiliar symbols outside your preferred
language(s): you might want to check if it is genuine" ... can anyone think of
better wording for the banner?

And that's about as far as we can go, for now. Unfortunately, the "spotting the
correct version by lookup" technique is dead in the water for the time being,
due to existing allocations by registrars, and major technical problems in the
concept itself. There are also all sorts of horrible semantic problems lurking
with Chinese names, and the Chinese Unicode and IDN community are well aware of
this, and trying to find a solution through using IDN bundles. However, I think
this is out of the scope of the immediate problem, which needs an immediate fix.
(In reply to comment #76)
> Also, I don't think ccTLDs are safe. http://www.amazon.de/ for example, the
> German local variant of Amazon, could probably be spoofed.

.de domains only allow a very specific set of characters. (bottom of
http://www.denic.de/de/richtlinien.html)
(In reply to comment #85)
> .de domains only allow a very specific set of characters. (bottom of
> http://www.denic.de/de/richtlinien.html)

Have they tried to register this set with IANA?
(In reply to comment #84)
> make sure users can easily turn off IDNA entirely

And the default should probably be IDNA turned on.

> ENFORCE "one language's writing-system-set, one label", and
> blacklist all symbols, dead-language scripts, and other exotica

Why bother with blacklists when we can use whitelists a la IANA?

> This can be done at the name-normalization level, or even better at
> the DNS-lookup level

I'm not sure about doing it at the DNS level. If a user clicks on a link with
suspicious characters in the domain name, we should probably give some kind
of warning. But what about <img src="http://www.foo.com/image.gif">?

> Note that we can't
> enforce "one language, one FQDN", consider "www.<something in Thai>.com" as an
> example.

Yes, we can. Take a look at the Thai IDN registration at IANA. (I think they
may have made a mistake by including Latin capital letters, though.)
> Why bother with blacklists when we can use whitelists a la IANA?

Please read my previous postings here, and the numerous links to background
papers provided. This is a _hard_ problem, and it's clear the IDNA/registrar
community has not been thinking about this hard enough, or we would not have got
into this state in the first place. 

If we want to do "hard" blocking using whitelists, there are a vast number of
whitelists to draw up, and that will take lots and lots of time, and would
postpone a fix almost indefinitely until a whitelist had been drawn up for every
conceivable language.

The IANA whitelists only cover a tiny number of languages, and even then are
registrar-dependent (two different registrars could select different character
sets for the same language, for example). In addition, the IANA whitelists are
dependent on the language used _for the given label_, which is not 1:1
obtainable from the name of the TLD: for example, see the .info registration for
the de: language. Neither is it available from the DNS.   

In any case, it won't work. Consider the dotless-i and i-acute examples I cited
before. Both entirely valid strings in European languages that would pass a
whitelist.

In the long run, both blacklists and carefully-selected whitelists would be a
good idea; but for the moment, blacklists and "soft" whitelists as proposed
above give the greatest security gain for the least investment in time and
effort. I think that some of the examples given above show that this is not a
problem which is amenable to a perfect fix, given that human beings have chosen
to use languages which contain easily-confusable characters in the same writing
system. Nor is it possible to anticipate all possible attacks; during my research
over just a couple of days, I've found a number of possible new attacks (all of
which are dealt with in the latest version of my proposal, by the way). If I can
do that in a couple of days, I'm sure there are many more left. However, we can
make life many, many, orders of magnitude more difficult for spoofers by doing
some relatively simple things, including defeating all of the existing known
attacks.

Sometimes "good" is better than "best", if "best" means waiting a long time
first with the security hole still open. Remember that after we've rolled out a
"good" solution in the very near future, we can always work on making it
tighter, and aiming towards perfection in the long run.
(In reply to comment #88)
> > Why bother with blacklists when we can use whitelists a la IANA?
> 
> Please read my previous postings here, and the numerous links to background
> papers provided.

Um, let's take an example. The spoofed paypal.com one. The gTLD doesn't give
us any languages. Suppose the user uses US English, and hasn't changed
Firefox's General > Languages setting. So the only language we can check
against is en-US. It only contains Latin small letters, digits and a few
others.

So with my proposal, the status bar would show payp!a!l and if the user
clicked it anyway, your top banner would appear with a warning and my '!'
characters would appear in the location bar.

Do you see any blacklists in this picture? (I may have missed references to
blacklists in the RFCs and official guidelines. If so, please let me know
the specific location, chapter and verse.)

> it's clear the IDNA/registrar
> community has not been thinking about this hard enough, or we would not have got
> into this state in the first place.

The IDNA community *has* thought about it. Look at all their RFCs, guidelines
and IANA registrations.

The problem is the registrars. They don't seem to care. The Secunia test case
shouldn't even have been possible to register.

And the other problem is our browser, of course. IDN was checked into the
tree without whitelist checking.

> If we want to do "hard" blocking using whitelists, there are a vast number of
> whitelists to draw up, and that will take lots and lots of time, and would
> postpone a fix almost indefinitely until a whitelist had been drawn up for every
> concievable language.

Nope. We could put the whitelists on the mozilla.org site *too*, and the fixed
version of Firefox could download the most up-to-date versions. The initial
whitelists would come with the product.

> The IANA whitelists only cover a tiny number of languages, and even then are
> registrar-dependent (two different registrars could select different character
> sets for the same language, for example). In addition, the IANA whitelists are
> dependent on the language used _for the given label_, which is not 1:1
> obtainable from the name of the TLD: for example, see the .info registration for
> the de: language. Neither is it available from the DNS.

My wording may have been too vague, but I didn't mean to say that we would
use the IANA registrations themselves. We will have to come up with our own.
I used words like "a la IANA" and "spirit".

> In any case, it won't work. Consider the dotless-i and i-acute examples I cited
> before. Both entirely valid strings in European languages that would pass a
> whitelist.

Dotless-i and i-acute do not occur in *all* European languages.

> Sometimes "good" is better than "best", if "best" means waiting a long time
> first with the security hole still open. Remember that after we've rolled out a
> "good" solution in the very near future, we can always work on making it
> tighter, and aiming towards perfection in the long run.

We are in violent agreement here. :-)
I think we are settling down to a relatively small set of practical methods to
prevent spoofing; perhaps we could start to write some code soon?

Here is another justification for blacklists.

Just to give you nightmares, here is another scenario. As you know, NAMEPREP
will map quite a lot of things to the lowercase ASCII letters. Now suppose that
someone has coded up a spoofed address in just these characters, Punycoded it,
and slipped it past a dumb registrar, perhaps using an automated domain
transfer. Now, we have two different DNS names, foo.com and xn-<something>.com,
both of which will map to the ASCII string foo.com after being un-Punycoded and
(for reasons of caution) re-normalized with NAMEPREP. Unfortunately, the owner
of xn-<something> can now pass out links to their site, and "foo.com" will be
displayed in the browser bar in 100% correct ASCII. Disaster!

So, what we need to do here is to apply some of the blacklist logic prior to
using NAMEPREP, as well as applying blacklist/whitelist logic after. Belt and
braces.

So, the methods seem to be boiling down to:
* Character range blacklists, both before and after NAMEPREP, for totally
unreasonable characters, like Linear B, surrogates, control-image graphics and
so on and so forth.
* Enforce the prevention of script-family mixing in labels, except as permitted
in the CJK languages
* Make the data tables for these script-family lists auto-updatable, so we can
fix it if we get it wrong, or if the Unicode standard changes?
* Per-language strict whitelists for user-specified "accept" languages, to be
worked out on a language-by-language basis and auto-updatable from the Mozilla
site, which are used to add warning characters to displayed text in the GUI.

If this overall approach is OK by people, I can start generating data tables and
Python proof-of-concept code ASAP.

Or -- if this is not a good idea -- please explain why, and propose something
else that's better!
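To make the "belt and braces" point concrete, here is a Python sketch using the
stdlib's RFC 3454 tables; NFKC stands in for full NAMEPREP here, and the choice
of tables is illustrative only:

import stringprep
import unicodedata

def passes_blacklists(label):
    # Belt and braces: run the character blacklist over the raw label
    # and again after normalization (NFKC standing in for NAMEPREP).
    for text in (label, unicodedata.normalize("NFKC", label)):
        for ch in text:
            if (stringprep.in_table_c3(ch)          # private use
                    or stringprep.in_table_c4(ch)   # non-character code points
                    or stringprep.in_table_c5(ch)   # surrogate codes
                    or stringprep.in_table_c6(ch)): # inappropriate for plain text
                return False
    return True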
As per an earlier comment requesting chapter and verse on blacklists:

Section 5 of the NAMEPREP RFC, RFC 3491, says:

>5. Prohibited Output
>
>   This profile specifies prohibiting using the following tables from
>   [STRINGPREP]:
>
>   Table C.1.2
>   Table C.2.2
>   Table C.3
>   Table C.4
>   Table C.5
>   Table C.6
>   Table C.7
>   Table C.8
>   Table C.9

and these tables are defined in RFC 3454.

Also, this ICANN document:

Internationalized Domain Names (IDN) Committee Input to the IETF on Permissible
Code Point Problems http://www.icann.org/committees/idn/idn-codepoint-input.htm, 
which recommends a whitelist-based scheme, but then goes on to specify a
_blacklist_ of what should not be included in any of the whitelists, as follows:

> ...at least the following sets of characters not be included, pending further
> analysis:
>
>    * line and symbol-drawing characters,
>    * symbols and icons that are neither alphabetic nor ideographic language 
> characters, such as typographical and pictographic dingbats,
>    * punctuation characters, and
>    * spacing characters.


Also, I seem to remember some RFC language somewhere saying that application
writers can apply their own extra constraints to IDNA interpretation... I'm
still looking for that.

By the way, note that one way of interpreting the RFC is that these forbidden
outputs are to be removed on a per-character basis: that would be a big mistake,
as it would allow the domain www.micro<forbiddencharacter>soft.com to be
registered, and then NAMEPREP will remove the character to generate a spoofed
name...
(In reply to comment #86)
> (In reply to comment #85)
> > .de domains only allow a very specific set of characters. (bottom of
> > http://www.denic.de/de/richtlinien.html)
> 
> Have they tried to register this set with IANA?

Interestingly, no they haven't registered it. Unlike the rather small list
registered for de: by .info, this one contains a vast number of characters,
including the eminently spoof-worthy accented and dotless 'i' variants, and
characters like LATIN SMALL LIGATURE OE, LATIN SMALL LETTER T WITH CEDILLA, and
LATIN SMALL LETTER KRA.

Again, this raises the issue of how we should compile whitelists: just because
this is the official .de registrar list for .de, does not make it a good list
for spoof detection. On the other hand, a "soft" whitelist which does not
include KRA, for example, will clearly flag that character as unusual in a
domain name, but not prevent users going to a page containing it.
(In reply to comment #90)
> Just to give you nightmares, here is another scenario. As you know, NAMEPREP
> will map quite a lot of things to the lowercase ASCII letters. Now suppose that
> someone has coded up a spoofed address in just these characters, Punycoded it,
> and slipped it past a dumb registrar, perhaps using an automated domain
> transfer. Now, we have two different DNS names, foo.com and xn-<something>.com,
> both of which will map to the ASCII string foo.com after being un-Punycoded and
> (for reasons of caution) re-normalized with NAMEPREP. Unfortunately, the owner
> of xn-<something> can now pass out links to their site, and "foo.com" will be
> displayed in the browser bar in 100% correct ASCII. Disaster!

I just tried moving the mouse over a link to www.xn--amazn-mye.com and Firefox
showed the same string in the status bar. It did not un-Punycode it.

Am I misunderstanding your example?
*** Bug 281831 has been marked as a duplicate of this bug. ***
(In reply to comment #90)
> transfer. Now, we have two different DNS names, foo.com and xn-<something>.com,
> both of which will map to the ASCII string foo.com after being un-Punycoded and
> (for reasons of caution) re-normalized with NAMEPREP. Unfortunately, the owner
> of xn-<something> can now pass out links to their site, and "foo.com" will be
> displayed in the browser bar in 100% correct ASCII. Disaster!

What we probably should do in that case is actually connect to foo.com.  That
doesn't seem to be the case now, but we also don't currently display foo.com in
the URL bar.
(In reply to comment #93)
> (In reply to comment #90)
> > Just to give you nightmares, here is another scenario. As you know, NAMEPREP
> > will map quite a lot of things to the lowercase ASCII letters. Now suppose that
> > someone has coded up a spoofed address in just these characters, Punycoded it,
> > and slipped it past a dumb registrar, perhaps using an automated domain
> > transfer. Now, we have two different DNS names, foo.com and xn-<something>.com,
> > both of which will map to the ASCII string foo.com after being un-Punycoded and
> > (for reasons of caution) re-normalized with NAMEPREP. Unfortunately, the owner
> > of xn-<something> can now pass out links to their site, and "foo.com" will be
> > displayed in the browser bar in 100% correct ASCII. Disaster!
> 
> I just tried moving the mouse over a link to www.xn--amazn-mye.com and Firefox
> showed the same string in the status bar. It did not un-Punycode it.
> 
> Am I misunderstanding your example?

Yes you are. You are using 0x43e, CYRILLIC SMALL LETTER O, which NAMEPREP
normalizes to itself, not to LATIN SMALL LETTER O. By the way, I note that
someone has already registered that URL. I'll see if I can manufacture an example.
(In reply to comment #95)
> What we probably should do in that case is actually connect to foo.com.  That
> doesn't seem to be the case now, but we also don't currently display foo.com in
> the URL bar.

*If* we decide to decode a Punycoded domain name in a link clicked by the
user, then we should also check whether it is converted back to the original
when we run it through nameprep and punycode. If not, we should either warn
the user or pass the original to DNS or both, since it is malformed.
(In reply to comment #97)
> (In reply to comment #95)
> > What we probably should do in that case is actually connect to foo.com.  That
> > doesn't seem to be the case now, but we also don't currently display foo.com in
> > the URL bar.
> 
> *If* we decide to decode a Punycoded domain name in a link clicked by the
> user, then we should also check whether it is converted back to the original
> when we run it through nameprep and punycode. If not, we should either warn
> the user or pass the original to DNS or both, since it is malformed.


Yes! That would work. Similarly if the user were to type in a Punycoded domain
name in the browser bar.
Also, if the Punycoded name does not convert back to itself, then the original
(malformed) Punycode should be displayed in the status bar when the user mouses
over it.
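A Python sketch of that round-trip test, using the stdlib "idna" codec as a
stand-in for our own ToUnicode/ToASCII:

def display_form(ace_label):
    # Decode Punycode for display only if re-encoding reproduces exactly
    # the same ACE string; otherwise keep showing the raw (malformed) form.
    try:
        unicode_label = ace_label.encode("ascii").decode("idna")
        if unicode_label.encode("idna").decode("ascii") == ace_label.lower():
            return unicode_label
    except UnicodeError:
        pass
    return ace_label

So display_form("xn--pypal-4ve") yields the Unicode form, while a label that
does not round-trip stays in raw Punycode.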
Coming back to the black/white list issue, it seems that our disagreement was
due to some confusion over nameprepping. Mozilla already runs the string
through nameprep, I believe (mozilla/netwerk/dns/src).

I was talking about what to do *after* nameprepping. I believe we only need
whitelisting after nameprepping.
"Must have feature:  Disable/enable IDN in all mozilla products."

Disabling IDN should not just be a feature, but, since IDN is not in wide use,
should simply be disabled as a default in all Mozilla products and released
ASAP, possibly with a hidden pref to turn it on.
Tying up loose ends: What to do about <img src=...>. Should we just silently
load the image even if the domain name contains characters outside the user's
or ccTLD's language(s)?

Of course, if it's <img src="https://..."> then we need to check the cert.
(In reply to comment #101)
> "Must have feature:  Disable/enable IDN in all mozilla products."
> 
> Disabling IDN should not just be a feature, but, since IDN is not in wide use,
> should simply be disabled as a default in all Mozilla products and released
> ASAP, possibly with a hidden pref to turn it on.

I agree. Disabling IDN by default is the right thing to do _until_ there is a
verified working fix for the bulk of spoofing scenarios. It's quick, simple,
and will sort the PR problem in a snap, whilst allowing users who want IDN to
re-enable it by using a preference.

Given the range of problems here, it looks like we could be talking for at least
a couple of weeks before we reach consensus on a detailed set of fixes for all
of the more general problems, implement them, make test cases, and validate
them. We can't wait that long.

Perhaps this bug should be divided into a set of smaller bugs:
* disable IDN by default, and provide GUI for a user pref to turn it on, with an
appropriate warning.
* roll this out to existing Mozilla suite and Firebird users by automatic update.
This deals with the immediate pressing problem of poor security and hence bad PR.

* deal with cross-script IDN spoofing, blacklist "must not happen" characters in
IDNs, more paranoid Unicode string syntax checking (eg. no leading combining marks)
* use locale-based whitelists for displaying IDNs to prevent same-script IDN
visual spoofing
* deal properly with literal Punycode in domain names (ie. test for
NAMEPREP/Punycode round-tripping, then treat as if entered as Unicode)
* put all this in Gecko 1.8 / Firefox 1.1
This generates good PR by being the first with a proper fix for spoofing.

Now that we have four quite discrete sub-tasks, we can start solving them
one-by-one, without any inter-problem interactions, instead of all together at
once, which creates an apparently much larger problem.

(In reply to comment #102)
> Tying up loose ends: What to do about <img src=...>. Should we just silently
> load the image even if the domain name contains characters outside the user's
> or ccTLD's language(s)?
> 
> Of course, if it's <img src="https://..."> then we need to check the cert.

I think so. After all, we are implicitly trusting the page content generator at
this point, who is at liberty to point their image sources wherever they like.
On the other hand, when we go "View Image", or otherwise inspect the image URL,
it should go through the same URL display code that _will_ flag non-locale
characters, just as if they were in a page link or a user-entered link.

By the way, I have now generated language whitelists for all the languages in
the Internet-Draft, namely Afrikaans, Albanian, Basque, Breton, Bulgarian,
Byelorussian, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto,
Estonian, Faeroese, Finnish, French, Frisian, Gaelic, Galician, German,
Greenlandic, Hungarian, Icelandic, Irish, Italian, Latin, Latvian, Lithuanian,
Macedonian, Maltese, Norwegian, Polish, Portuguese, Rhaeto-Romance, Romanian,
Russian, Sami, Serbian, Slovak, Slovenian, Sorbian, Spanish, Swedish, Turkish,
Ukrainian, and Welsh.

*** Bug 281863 has been marked as a duplicate of this bug. ***
(In reply to comment #103)
> * deal with cross-script IDN spoofing, blacklist "must not happen" characters in
> IDNs, more paranoid Unicode string syntax checking (eg. no leading combining
> marks)

If you're talking about adding more steps to the DNS label conversion, I
don't think you can do this and still claim conformance to the nameprep and
punycode RFCs.

This is an interoperability issue: The server sends us an HTML document, the
user clicks on a link, we convert it from HTML to Unicode and then pass it
through nameprep and punycode before sending it to the DNS server. The HTML
document author assumes that we will adhere to the HTML, Unicode, Nameprep
and Punycode specs. If we don't, we have an interop problem.

> * use locale-based whitelists for displaying IDNs to prevent same-script IDN
> visual spoofing

No, my language-based whitelist proposal is intended to alert the user to
potential spoofing across scripts as well as within a single script. For
example, the Cyrillic small letter 'a' is outside the en-US language *and*
outside the Latin script. The i-acute is also outside en-US but inside Latin.
This is a series of character repertoires for various languages.

Source: from expired Internet-Draft "Characters and character sets for various
languages", by Harald Tveit Alvestrand, draft-alvestrand-lang-char-03.txt
The characters given here for a language include the base set, the Required
characters, and the Important characters.
See the Internet-Draft for the definitions of these terms. 
One correction has been made: the entry for German contained a control
character. This has been removed.
This is a draft document generated from another draft; there is absolutely no
guarantee of correctness or fitness for any use; this information is provided
for research and entertainment purposes only.
(In reply to comment #106)
> (In reply to comment #103)
> > * deal with cross-script IDN spoofing, blacklist "must not happen" characters in
> > IDNs, more paranoid Unicode string syntax checking (eg. no leading combining
> > marks)
> 
> If you're talking about adding more steps to the DNS label conversion, I
> don't think you can do this and still claim conformance to the nameprep and
> punycode RFCs.
> 
> This is an interoperability issue: The server sends us an HTML document, the
> user clicks on a link, we convert it from HTML to Unicode and then pass it
> through nameprep and punycode before sending it to the DNS server. The HTML
> document author assumes that we will adhere to the HTML, Unicode, Nameprep
> and Punycode specs. If we don't, we have an interop problem.
> 

One person's interoperability problem is another person's security precaution.

We are not the document author's agent; we are the _user_'s agent. Many document
authors would like us to see their pop-up ads. Our users generally don't. We
explicitly choose not to interoperate with standards-based ECMAScript behaviour
by default in this case. Similarly, we should refuse by default to interoperate
with URLs which contain domain names with no plausible origin in any human
language, or syntactic brokenness in the structure of their Unicode character
stream, in the spirit of IANA's recommendations, and in spite of some
registrars' willingness to register essentially any pattern of bits registrants
are willing to pay for.

Remember, IE has 100% non-interoperability with IDN, and it looks like we will
probably have to go back to that too in the short run. What I'm proposing will
enable all conceivable reasonable IDN labels to work, and auto-reject the
three-dollar labels, _without_ the user needing to keep glancing back at the URL
bar every click to see if they are about to fall down a hole.

Can you suggest a plausible reason for mixing writing systems in the same DNS
label (ie not in the same _name_, just a single dot-delimited segment), other
than in the "safe" combinations such as hiragana-katakana-kanji-latin? Or indeed
to use dingbats, character graphics, musical notes, or cuneiform characters in a
domain name? 

Remember, if this turns out to be a long-term problem, we can always re-allow
this behaviour at some time in the future: indeed, this is just the sort of
behaviour to hide under a hidden pref or two with names like:

* dns.unicode.blacklist-bad-codepoints
* dns.unicode.prevent-script-mixing
Indeed, just to amplify my previous comments a bit more, this is _not_ a 100%
solvable problem, given the current state of IDNA, and the apparent lack of
coordination in the standards and domain registration communities to do anything
about it.

However, codepoint blacklisting and preventing script-mixing probably catch >
90% of all possible problems with zero cost in user attention, and intelligent
locale-based display of URLs will catch perhaps a bit more than another 90%, to
get a > 99% coverage of possible spoofing. That's about the best we can do at
the moment, without the cooperation of registrars, or the creation of new protocols.

Notice that even with both proposals in effect, spelling a domain name with an
i-acute instead of an 'i' will get past the browsers of anyone who has their
browser set to read any language containing that character (for example, any of
Danish, Faeroese, Icelandic, Greenlandic, Irish, Welsh, Dutch, Catalan, Spanish,
Galician, Portuguese, Italian, Hungarian, Slovak, or Czech). Perhaps an
even-longer term solution is a special font for the URL bar with exaggerated
accents and clearly different letter-forms for near-homographs within the same
script family?
Added Thai, Arabic, Hebrew and Greek to the above document, based on IANA
registry data for these languages given by the .pl registrar.
Attachment #173999 - Attachment is obsolete: true
It seems to me that the real problem is not IDN, but that it is not obvious when
you are going to a new site.  To fix that problem, I would recommend that next
to the lock icon, a history icon should be placed.  If the site is not in the
history of the browser, then a 'new' icon could be displayed.  If the site is in
the 'history', than a history icon could be displayed.  And, if the user has
manually approved the site, then a 'trusted' icon could be displayed.  Another
possibility is to display an icon in the location bar, and allow option
background and forground colors on the location bar.  This way, it is obvious
that you are going to a new site, and if you thought that you were going to an
old site, you should think twice.   
As before, but now with lowercase letters only, as NAMEPREP will ensure we
don't have to worry about capital letters.
Attachment #174006 - Attachment is obsolete: true
(In reply to comment #108)
> (1) dns.unicode.blacklist-bad-codepoints
> (2) dns.unicode.prevent-script-mixing

Let me add another to this list:

(3) dns.unicode.whitelist-good-lang-chars

I guess I thought that (3) would be a more fine-grained solution than (1)
and (2), and would make the first two unnecessary. But perhaps people
want to implement (1) and (2) to reject those domain names, while (3) allows
them but displays them differently to alert the user.

The link click actions could be enumerated along a different axis:

(a) Silently refuse to perform the DNS lookup (and subsequent connection)
(b) Refuse DNS and connection, but warn the user
(c) Refuse DNS and connection, but warn user and allow user to connect
(d) Do DNS and connection, but indicate suspicious domain in location bar
(e) Do DNS and connection and don't indicate suspicious domain anywhere

So what is your proposal? Is it the following:

(1), (2) -> (b)
and
(3) -> (d)

Or maybe you had other actions in mind?
After some more analysis, it appears that attacks between Latin scripts, and
attacks from Cyrillic to Latin or vice-versa are the main threats, with other
threats having lesser numbers of homographs for attackers to play with, the risk
exposure rising exponentially with the number of attackers.

Just to get some order-of-magnitude insight, I looked at the number of
English and Russian words which consist _entirely_ of letters which are
homograms between the Latin script and the subset of Cyrillic script used in my
Russian dictionary. These are "acxeoyp", and their Cyrillic counterparts.

In each case, I get roughly the same figure, of around 0.00075 (0.075 %) of the
vocabulary being made up of such words; that's 24 out of 31801 words for
Russian, and 35 out of 47158 words for English. 

Unfortunately, when you consider Cyrillic languages other than Russian, there
are more spoofable characters, notably "ijs", which expands the English
spoofable range by more than a factor of 4 to 183 out of 47158 words, 0.004
(0.4%) of the vocabulary.

By the way, compare this with the situation where we allow script mixing, where
_every_ word containing _any_ of the homographs is a threat: that is to say,
99.7% of all words. Clearly allowing script mixing is a bad thing.
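For anyone who wants to reproduce these figures, a rough Python sketch of the
experiment (the word-list file name is hypothetical; any one-word-per-line
dictionary will do):

HOMOGRAMS = set("acxeoyp")               # Russian/Latin homogram letters
HOMOGRAMS_EXT = HOMOGRAMS | set("ijs")   # adding other Cyrillic languages

def spoofable_fraction(words, repertoire):
    hits = [w for w in words if w and set(w) <= repertoire]
    return len(hits) / float(len(words))

words = [line.strip().lower() for line in open("words.txt")]
print(spoofable_fraction(words, HOMOGRAMS))      # ~0.00075 for my English list
print(spoofable_fraction(words, HOMOGRAMS_EXT))  # ~0.004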

So, given that Netcraft assumes that there are approximately 60,000,000 domain
names registered, and assuming that the word statistics are similar to those of
my English dictionary, a 0.075% share would represent roughly 45,000 spoofable
domains in the ASCII namespace, if we allow Cyrillic labels to spoof them.
Similarly, a 0.4% share would represent 240,000 spoofable labels.

On the other hand, the total number of Cyrillic labels to date is presumably
rather small, and a 0.075% share of that rather smaller.

This leads me to make the following proposal (please bear with me, this is only
the first hack at the logic):

We should consider special processing for Cyrillic labels in names if they
consist _entirely_ of homographs for Latin letters, if they are in a
"non-Cyrillic context". 

A "non-Cyrillic context" is a name with no other _unambiguous_ Cyrillic labels,
which is not in a TLD where Cyrillic characters might be expected to have
priority, such as a TLD for a country where Cyrillic script is the norm.

There are two reasonable courses of action:
* either reject them as a probable spoofing attempt, or
* rewrite them to the equivalent Latin alphabet label, so that "what you see is
what you get"

Similarly, when we _are_ in a "Cyrillic context", we should consider similar
treatment for Latin labels consisting _entirely_ of homographs for Cyrillic
characters.

This can easily be extended to other languages, such as Greek and Coptic.

It has the following advantages:
* it accepts the "grandfathering in" of the existing Latin namespace
* it does not prevent the use of _unambiguous_ labels from any writing system in
any TLD
* it does not disadvantage any user of a ccTLD for a country with a different
native script, instead giving that script priority locally, as users might expect
* it allows the use of any script in any TLD, when "full IDN" is available: the
use of a TLD in a given script will automatically signal which script is to be
given priority in name interpretation
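A minimal Python sketch of the core test, with an illustrative subset of the
homograph map:

# Cyrillic letters that are visually (near-)identical to Latin ones.
CYR_TO_LAT = {"\u0430": "a", "\u0441": "c", "\u0445": "x", "\u0435": "e",
              "\u043e": "o", "\u0443": "y", "\u0440": "p"}

def all_latin_homographs(label):
    # True if every character of this Cyrillic label spoofs a Latin one.
    return bool(label) and all(ch in CYR_TO_LAT for ch in label)

def latin_equivalent(label):
    # The "what you see is what you get" rewrite option.
    return "".join(CYR_TO_LAT[ch] for ch in label)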
Can we fix this issue by changing the UI? For example, display the raw DNS URL,
but offer a tooltip that shows the IDN on mouse-over?

I do not like the idea of a whitelist / blacklist. (To solve the problem
without one, we can disable the feature, which is not nice, but it solves the
problem.) I do want to show people the IDN. But that does not mean we need to
display it exactly where we display the URL. Can we display it by a different
means? I once thought about putting the raw DNS at the end of the URL, but that
won't work, since a long URL can push it out of the box. Using another color to
display the IDN will be meaningful for people who know what it means, but it
won't be a good solution for the general public. Adding another text field
below the URL on the fly to display the IDN would be good, but the UI would be
strange. How about an IDN icon next to the URL bar, so that when people click
it, it shows the IDN to the user, but we always display the raw DNS in the URL
bar? People can still type the IDN into the URL bar, but then it will be
converted to the raw DNS and displayed there. If people click the IDN icon,
then a tooltip below the URL bar shows the IDN.
> how about an IDN icon next to the URL bar, so that when people click it, it
> shows the IDN to the user, but we always display the raw DNS in the URL bar?
or an option in the context menu...

The other thing we may want to do, instead of using the blacklist to block the
use of IDN, is to use it to show a warning dialog box when the user first
enters that domain. This can be done by an observer which watches for changes
to the URL box: whenever the URL changes, scan the URL bar, and if it is a new
domain (different from the previous one) and any character in the URL falls
into that blacklist, show a warning dialog box to the user.
(In reply to comment #114)
> * it does not prevent the use of _unambiguous_ labels from any writing system in
> any TLD
> * it allows the use of any script in any TLD, when "full IDN" is available: the
> use of a TLD in a given script will automatically signal which script is to be
> given priority in name interpretation

This would not be in the spirit of the ICANN guidelines:

http://www.icann.org/general/idn-guidelines-20jun03.htm

I will now try a different stance. Bear with me:

Since IDN is not widely used yet, this might be the time to decide that we
simply will not accept non-ASCII characters in the US TLDs (i.e. .com, .org,
etc). I mean, what business do these Cyrillic registrants have in .com,
anyway? Why can't they just stay where they belong, in *.ru and the like?

If we can get Microsoft to agree with this stance, and they decide to reject
non-ASCII characters in the US TLDs too (when they get around to supporting
IDN), then the world's dominant browsers (i.e. IE and Firefox) will
effectively be enforcing the *correct* IDN rules.

The browsers are the last line of defense. We are the user's agent. We will
protect them from lazy/negligent registrars.

--------------

On the other hand, if MSIE and Firefox *do* allow non-ASCII characters in US
TLDs, then the flood-gates are open. We are making ourselves vulnerable. We
are asking for trouble. Kinda like ActiveX in IE and *.exe in Outlook.
.com, .org, .net.... are *not* US TLDs.
Cyrillic registrants can have them if they want, if their site is commercial, an
organisation, linked with network activity....
(In reply to comment #118)
> .com, .org, .net.... are *not* US TLDs
> Cyrillic registrants can have them if they want, if their site is commercial, an
> organisation, linked with network activity....

Couldn't agree more, the gTLDs are absolutely not US-specific in any way. 
And even if they were, don't forget the millions of US citizens who speak
languages other than English and have a legitimate need for IDNs.
(In reply to comment #119)
> Couldn't agree more, the gTLDs are absolutely not US-specific in any way. 
> And even if they were, don't forget the millions of US citizens who speak
> languages other than English and have a legitimate need for IDNs.

Continuing to try this other hat on:

What characters are permitted in a person's name on a US driver's license?
What characters are permitted in the name of a US corporation? Are there
legitimate reasons to keep those rules in place? Do the non-English speakers
in the US transcribe their personal/company names into English letters in
some contexts?

Should the ASCII-only rule only apply to *.us, leaving *.com open to the
world (and the spoofers)?

Should we go for a complex system where domain labels consisting entirely
of homographs are caught?

What happened to KISS (Keep It Simple, ...)?
(In reply to comment #109)
> Perhaps an
> even-longer term solution is a special font for the URL bar with exaggerated
> accents and clearly different letter-forms for near-homographs within the same
> script family?

I think we need to do something like this. Even in English, there are
characters that look very similar in some fonts (e.g. letter l and digit 1
but not capital letter I in domain names, since it's capital).

One possible concern might be the expanded width of the URI in the location
bar. This might be too complicated, but maybe we can use a different font
for the domain name part of the URI, and keep the compact sans-serif in the
rest.
Hardware: PC → All
(In reply to comment #120)
> (In reply to comment #119)
> > Couldn't agree more, the gTLDs are absolutely not US-specific in any way. 
> > And even if they were, don't forget the millions of US citizens who speak
> > languages other than English and have a legitimate need for IDNs.
> 
> Continuing to try this other hat on:
> 
> What characters are permitted in a person's name on a US driver's license?
> What characters are permitted in the name of a US corporation? Are there
> legitimate reasons to keep those rules in place? Do the non-English speakers
> in the US transcribe their personal/company names into English letters in
> some contexts?
> 
> Should the ASCII-only rule only apply to *.us, leaving *.com open to the
> world (and the spoofers)?
> 
> Should we go for a complex system where domain labels consisting entirely
> of homographs are caught?
> 
> What happened to KISS (Keep It Simple, ...)?


The answer to that question is: yes, as simple as possible, but _no simpler_.
Unfortunately, this is not a simple problem, unless you can either detect or
visually eliminate homographs. 

Earlier comments referenced the ICANN rules. Registrars and registries are
simply ignoring the ICANN rules; that's one of the principal reasons why we are
in this mess. Unfortunately, we have no power to force them to do so. That's why I
think my proposal is actually more "in the spirit" of the ICANN recommendation
than the current status quo, where we have rules, but nobody follows them. Note
that in the presence of an ICANN-compliant setup, my suggested strategy is
completely invisible, and equivalent to the identity function.

And yes, I'm working on just such a "complex system". It's table driven, and
will probably end up as one page of code, plus language tables many of which are
already known. I've now tabulated the current "accidental spoofing" rates
between the major Latin-script languages, and between Cyrillic and these
languages, and I will continue to work on this proposal to take this new data
into account. 
(In reply to comment #122)
> Unfortunately, this is not a simple problem

Well, that depends on the chosen policy. If Mozilla decides to disable IDN
by default for now, that is very simple. Another simple solution is to stick
to ASCII characters in *.com (using a white list, naturally :-)

> And yes, I'm working on just such a "complex system".

I think it's great that you're doing all this work! Have you heard from any
of the module owners at mozilla.org regarding the type of patch that they
would like to consider? Darin, any thoughts?
(In reply to comment #123)
> (In reply to comment #122)
> > Unfortunately, this is not a simple problem
> 
> Well, that depends on the chosen policy. If Mozilla decides to disable IDN
> by default for now, that is very simple. Another simple solution is to stick
> to ASCII characters in *.com (using a white list, naturally :-)
> 
> > And yes, I'm working on just such a "complex system".
> 
> I think it's great that you're doing all this work! Have you heard from any
> of the module owners at mozilla.org regarding the type of patch that they
> would like to consider? Darin, any thoughts?


I agree, pushing out a fix that will turn off IDN by default is the best
short-term option, unless a properly tested and audited fix can be deployed
first. Then we can set IDN to be on by default in the next release (Firefox 1.1?).

Please note that I have very little desire to code C++ at the moment -- however,
I can provide "executable pseudocode" in the form of Python, as that is ideal as
a rapid development and testing platform, and the test vectors for the Python
testbed can be used for any eventual patch. One of the advantages of a
table-driven approach is that it minimizes code size, and allows just this sort
of development route.
FWIW, not necessarily an endorsement, but Mozilla does have some related code:

mozilla/intl/unicharutil/util/nsCompressedCharMap.cpp
mozilla/intl/uconv/public/nsICharRepresentable.h

There are some tables of characters used in languages at fontconfig.org:

fc-lang/*.orth

We have to be careful with tables we get from elsewhere. They may include too
many characters. We need to be conservative in our solution to the spoofing
problem.
(In reply to comment #117)
> Since IDN is not widely used yet, this might be the time to decide that we
> simply will not accept non-ASCII characters in the US TLDs...
> 
> If we can get Microsoft to agree with this stance, and they decide to reject
> non-ASCII characters in the US TLDs too (when they get around to supporting
> IDN), then the world's dominant browsers (i.e. IE and Firefox) will
> effectively be enforcing the *correct* IDN rules.

That sounds very much like trying to use a dominant position to enforce and
create a defacto proprietary standard, which is just a bad idea.


(In reply to comment #122)
> Earlier comments referenced the ICANN rules. Registrars and registries are
> simply ignoring the ICANN rules; that's one of the principal reasons why we
> are in this mess. Unfortunately, we have no power to force them to do so.

It's a shame that most registrars don't follow the rules; however, that doesn't
apply to all of them. I'm fairly certain, given the relatively strict guidelines
enforced by the auDA for the registration of .au domains which are followed by
all accredited .au registrars, that such spoofed variants simply wouldn't be
allowed.  Thus, I believe, any request for a paypal.com.au variant, for example,
would be rejected by the registrar which, IMHO, is the right level to handle
this problem.  Though, I reluctantly agree, given the situation with other TLDs,
that handling this at the UA level may just be something we have to accept.
Re: de facto standards, I was playing devil's advocate. Just ignore those
comments.

Tonal's doing some great work here, ensuring that Mozilla leads the way.
This is a table of some homograms for ASCII lowercase characters, with
confusion distances. Note that this list is neither definitive nor exhaustive,
and only covers the Cyrillic, Greek and Coptic, Latin-1 Supplement, Latin
Extended-A and Latin Extended-B, and is only provisional even within these
tables. The main purpose of this table is to aid research into homograph
spoofing, and to inspire other developers to inspect the Unicode code charts or
Unicode rendering in other fonts and to extend this table.

Key to confusion distances:
0 => visually identical
1 => almost identical
2 => easily confusable at small font sizes
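To show the intended shape of the data, a tiny Python sketch with illustrative
entries:

# (ascii_char, lookalike, confusion_distance) triples.
HOMOGRAMS = [
    ("a", "\u0430", 0),   # CYRILLIC SMALL LETTER A
    ("o", "\u043e", 0),   # CYRILLIC SMALL LETTER O
    ("o", "\u03bf", 0),   # GREEK SMALL LETTER OMICRON
    ("y", "\u0443", 1),   # CYRILLIC SMALL LETTER U
    ("i", "\u0131", 1),   # LATIN SMALL LETTER DOTLESS I
]

def lookalikes(ascii_char, max_distance=1):
    return [ch for a, ch, d in HOMOGRAMS
            if a == ascii_char and d <= max_distance]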
(In reply to comment #90)

> Just to give you nightmares, here is another scenario. As you know,
> NAMEPREP will map quite a lot of things to the lowercase ASCII
> letters. Now suppose that someone has coded up a spoofed address in
> just these characters, Punycoded it, and slipped it past a dumb
> registrar, perhaps using an automated domain transfer. Now, we have
> two different DNS names, foo.com and xn-<something>.com, both of
> which will map to the ASCII string foo.com after being un-Punycoded

This can't happen.  See RFC3490, specifically steps 6 and 7 of
ToUnicode.  The algorithm for decoding a punycode-encoded domain name
checks that it encodes to the same punycode as you started with, and
leaves it unchanged (ie as punycode) if it doesn't.

It would probably be good to verify that Mozilla implements ToUnicode
correctly, though.
(In reply to comment #128)
> Created an attachment (id=174139) [edit]
> Experimental table of some homograms, with confusion distances

I suspect GREEK SMALL LETTER GAMMA is confusable with y at small font sizes (in
some fonts).
(In reply to comment #18)
> On the other hand, the Unicode .pdf charts _do_ appear to contain a detailed
> cross reference of visually confusable characters, as do the charts in the
> Unicode book.

Under "Cross References" "Explicit Inequality" the Unicode book says:

"The two characters are not identical, although the glyphs that depict them
are identical or very close."

However, they do not seem to include cross references for *all* of the
spoofs. For example, Cyrillic small 'a' does not have a cross ref.

Maybe they would update the Unicode charts if you send them your info?
You say your table is only provisional at this point, so maybe you would
want to wait until it's more or less "ready".
Another solution to this problem is to pop up a dialog the first time the
user clicks on a link containing a domain name with characters normally
found outside the user's language. (We could still check in the homogram
table solution too.)

The dialog I have in mind would explain the issue and then allow the user
to specify that the browser should allow certain other languages, chosen
from a list that we can generate based on the characters found in the
domain name.

The user could even view the entire list of languages and select some
from there, or just tell the browser to allow any language.

The reason I'm mentioning this white list approach again is because I feel
that the homogram approach is essentially a black list approach, and we
cannot deploy a black list approach until it is complete, whereas we can
start using white lists right away, even before they are complete. Over
time, we can expand the white lists with characters that we deem "safe".

But that's just me. I have no idea how others feel about this...
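A Python sketch of how the dialog's language list could be generated; the
per-language character sets are tiny illustrative stand-ins for real
whitelists:

LANG_WHITELISTS = {
    "de": set("äöüß"),
    "fr": set("àâæçéèêëîïôùûüÿœ"),
    "ru": set("абвгдеёжзийклмнопрстуфхцчшщъыьэюя"),
}

def candidate_languages(domain):
    # Offer every language whose whitelist covers all of the domain's
    # non-ASCII characters.
    extra = {ch for ch in domain.lower() if ord(ch) > 0x7f}
    return [lang for lang, chars in LANG_WHITELISTS.items() if extra <= chars]

For "www.öbb.at" this suggests only "de"; for the spoofed paypal.com it
suggests only "ru", which should already make the user suspicious.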
(In reply to comment #132)
> ...
> 
> The user could even view the entire list of languages and select some
> from there, or just tell the browser to allow any language.
> 
> The reason I'm mentioning this white list approach again is because I feel
> that the homogram approach is essentially a black list approach, and we
> cannot deploy a black list approach until it is complete, whereas we can
> start using white lists right away, even before they are complete. Over
> time, we can expand the white lists with characters that we deem "safe".
> 
> But that's just me. I have no idea how others feel about this...

Erik, I see the blacklist and whitelist approaches as complementary, not
competitive. Neither is 100% guaranteed to work. For example: consider a user
who can read both Russian and English, and thus has chosen to accept URLs
containing domain names in either script. 

Unfortunately, this also means that they will not be alerted if a domain name
contains a label that is 100% Cyrillic characters, but exactly spoofs a
Latin-script name, as this:
* is a 100% conformant IDN which follows the current IANA "one label, one
language" policy precisely
* follows the "no-script mixing" principle suggested earlier
* follows the principle of no graphical characters suggested earlier
* is visually indistinguishable _by design_ from the Latin equivalent, and
cannot be distinguished even by a bilingual Russian/English reader

Note that the reverse would also apply for a Latin spoof of a Cyrillic-script
word. (Consider Latin versions of the Russian words орех, ореха, рас, раса, расе,
рос, роса, росе, сер, сера, серо, серое, ссора, ссоре, ссору, сух, сухо, сухое,
ура, уха, ухо, уху, хаосе, хор).

This is where we introduce the idea of "script preference" for top-level
domains. Supposedly, registries should filter each label they issue with a
character set from a single specified language, and they should register their
character sets and languages with IANA. However, in the absence of compliance
with these rules, we can help them along a bit in cases where there is ambiguity. 

_This_ is where homograph tables, and the principle of assigning language/script
family precedence to TLDs, are useful; we can close the door on all or nearly
all of the possible spoofing options that remain. (Notice that we've already
squeezed down the possible cases to a very small portion of the namespace, less
than 0.1%, by applying the no-script-mixing and no-graphic-characters rules).

If a URI with a Cyrillic domain name label made up entirely of Latin homographs
is in a domain which has "Latin-script precedence",  we should treat it as
potentially spoofed. Similarly vice-versa. It is then a matter for browser
design policy whether the domain name containing this label should be:
* treated as malformed, and lookups return an error
* generate a warning to the user, and prompt them as to whether they are really sure
* simply provide an on-screen warning banner
* or even attempt to guess the "correct" domain name (DON'T DO THIS LAST ONE!
IT'S ONLY AN EXAMPLE!)

Even this algorithm is not perfect. But it can be very good.
Note that the homograph tables don't need to be 100% perfect to reduce the last
remaining options by many orders of magnitude. If there are (say) 4 distinct
characters in a name, all of whose characters are spoofable, forbidding
script-mixing requires every one of them to be spoofed. Now, only about 0.1% of
the name repertoire will have all-spoofable characters (based on experiments
with wordlists -- this is conservative, because many words are short), and a
homograph table that only has 90% coverage will reduce the number of spoofing
possibilities by a factor of 10**4 = 10,000. So, at the end of this process, we
might expect 0.00001% of domains to be spoofable.

According to Netcraft, there are currently roughly 27,000,000 domains active.
So, with a 0.00001% failure rate, we can expect roughly 2.7 domains to fall
through the cracks and remain spoofable. Increasing the accuracy of the
homograph table to 95%, if you believe this analysis, leaves an expected count
of effectively zero sites left spoofable.
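Spelling the arithmetic out as a quick Python check:

all_spoofable = 0.001        # ~0.1% of names use only spoofable characters
miss_per_char = 0.10         # homograph table with 90% coverage
chars = 4                    # distinct spoofable characters per name
residual = all_spoofable * miss_per_char ** chars
print(residual)              # 1e-07, i.e. 0.00001%
print(residual * 27000000)   # ~2.7 domains expected to slip through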

So, what are the tables we need to implement this? 
1. A table of _assigned_ codepoint ranges containing characters that will not be
used in any language
2. A mapping of codepoint ranges to script systems
3. A number of special-case lists of script systems for languages which use more
than one script system (essentially the CJK languages)
4. A homograph table, giving equivalence classes of visually confusable characters 
5. A mapping of ccTLDs to script systems via languages, via existing
machine-readable linguistic sources 

How much code is needed to implement this? Probably (judging by my Python test
programs, and allowing for a less concise language) between 1 and 3 pages of C++.

Note that all of the tables involved are likely to have order 100 entries or
less, and are easily compiled from existing sources. Note that none of them
dictate the character assignments within any language, allowing Unicode to add
new codepoints within a language, and if we know the pattern of future Unicode
assignments (which are pre-planned) we can be forward-compatible with new
updates to Unicode, even without the ability to update the tables. Add the
ability to update the tables, and we have a maintainable, forwards- and
backwards- compatible system which could effectively end the current spoofing
worries regarding IDN, and allow its continued rapid deployment, whilst
providing yet another strong incentive to use Mozilla products.

By the way, please don't treat any of this as a rejection of whitelist
techniques: no method is perfect, and attackers may be ably to find a way
through even the best-designed defences given enough time and ingenuity; this
proposal involves multiple layers of defence, and I think that adding a
whitelist scheme is another good way of aiming for the same objective of
preventing spoofing, in particular for users who are non-European and less
accent-aware.
You're absolutely right. If the user reads both English and Russian, we need
to watch out for homographs. Thanks for being so patient with me.
There's a lot of discussion going on here :-)

One idea which met with approval on the Mozilla security list was the following:

Most domain registrars have been correctly implementing the guidelines for
avoiding IDN-related spoofing problems. AIUI, the .jp registry even delayed
issuing IDN names for six months until the guidelines were finished.
Unfortunately, there are a few rather large exceptions to this - .com being one.

So, the suggestion is to have a blacklist of those TLDs, and display the IDN in
raw punycode form throughout the UI until such time as the registrars get their
act together. Later Firefox releases, or automatically-pushed updates, can
shrink (or expand) the blacklist.

This has many significant advantages. It's fairly simple to code, and doesn't
penalise IDN domain owners and registrars who have been doing the right thing.
It doesn't place any restrictions on what domains are allowed. It requires no
user configuration, and no assumptions about what characters a given user might
be familiar with. It involves no pop-ups. It places the blame and the
responsibility where it really belongs, and kills any homograph attacks stone dead.

Gerv
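A minimal Python sketch of this per-TLD display policy (the blacklist contents
below are purely illustrative, not a policy decision):

# TLDs whose registries are, hypothetically, not enforcing the guidelines.
IDN_TLD_BLACKLIST = {"com", "net", "org"}

def display_host(ace_host):
    tld = ace_host.rsplit(".", 1)[-1].lower()
    if tld in IDN_TLD_BLACKLIST:
        return ace_host                                 # show raw Punycode
    return ace_host.encode("ascii").decode("idna")      # show the IDN form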
(In reply to comment #135)
> There's a lot of discussion going on here :-)
> 
> One idea which met with approval on the Mozilla security list was the following:
> 
> Most domain registrars have been correctly implementing the guidelines for
> avoiding IDN-related spoofing problems. AIUI, the .jp registry even delayed
> issuing IDN names for six months until the guidelines were finished.
> Unfortunately, there are a few rather large exceptions to this - .com being one.
> 
> So, the suggestion is to have a blacklist of those TLDs, and display the IDN in
> raw punycode form throughout the UI until such time as the registrars get their
> act together. Later Firefoxe releases, or automatically-pushed updates, can
> shrink (or expand) the blacklist.
> 
> This has many significant advantages. It's fairly simple to code, and doesn't
> penalise IDN domain owners and registrars who have been doing the right thing.
> It doesn't place any restrictions on what domains are allowed. It requires no
> user configuration, and no assumptions about what characters a given user might
> be familiar with. It involves no pop-ups. It places the blame and the
> responsibility where it really belongs, and kills any homograph attacks stone
dead.
> 
> Gerv
> 


Neat. Alternatively, you can have a _whitelist_ of TLDs which are known to be
following the ICANN / IANA rules. This is more "politically" neutral, avoids the
issues associated with a blacklist, and yet will act in the same way as a strong
incentive for non-conformant TLD registries to follow best practices. This also
deals better with new TLD allocations.
Whether we go for a white or a blacklist probably depends on getting a much
better view of how widespread the problem is. What I'm hearing from the IDN
community is that most people are playing by the rules - it's just a few
high-profile registrars and TLDs which aren't. If that's the case, a blacklist
is probably good - we do want to send a message. After all, their negligence has
put our users at risk.

On the other hand, if the picture is more mixed than I understand, then perhaps
a whitelist approach might be better. 

Gerv
I would tend to agree that if you're going to have a list of TLDs, then a
white list would be better since we can't anticipate whether new TLDs will
be served by registrars that follow rules.

Also, I think the solution discussed here should still be worked on, even
if Mozilla decides to use the TLD black/white list solution in the interim
since we don't know what Microsoft is going to release when they get around
to it. If they implement the ideas discussed here or some other ideas and
end up supporting IDNs in *.com, then we will probably want to be able to
start supporting it in short order too (via auto-updates or whatever).

Finally, I still think we should seriously consider doing something about
the font in the status and location bars. If expanded width is indeed a
concern, how about my idea of using a good font for the domain name part
only? Maybe this part could be separated out to a different bug number.
Eric: I'm not sure what you mean by "what Microsoft will do" - IE doesn't
support IDNs, and I've not seen it mentioned among any of the things they plan
to do for the next release. In any case, that's ages away - in the Longhorn
timeframe. Hopefully, by then, registrars will have sorted their lives out and
all this will be but a distant memory.

I don't know who writes the IDN plugin for IE. I haven't heard any comments from
them on the situation.

I personally don't think we need to change the status bar font - but you are
right, that should be a separate bug. It's not IDN-specific.

Gerv
(In reply to comment #139)
> Eric: I'm not sure what you mean by "what Microsoft will do" - IE doesn't
> support IDNs, and I've not seen it mentioned among any of the things they plan
> to do for the next release. In any case, that's ages away - in the Longhorn
> timeframe. Hopefully, by then, registrars will have sorted their lives out and
> all this will be but a distant memory.

OK, you're probably right.

> I don't know who writes the IDN plugin for IE. I haven't heard any comments from
> them on the situation.

I found an IDN plug-in for IE at a domain owned by Verisign:

http://www.idnnow.com/index.jsp

> I personally don't think we need to change the status bar font - but you are
> right, that should be a separate bug. It's not IDN-specific.

OK, bug 282079.
> I found an IDN plug-in for IE at a domain owned by Verisign:

That plug-in was developed by VeriSign, who are energetic about making it easily
available to registries wishing to implement IDN. (Please note that the IDN
policies for a TLD are set by the *registry*, not the *registrars*.) The
registries that prefer to recommend the Mozilla browsers to their clients are
probably despairing of the discussion in the present forum, which appears eager
to have Mozilla voluntarily take itself off the IDN market. VeriSign, whose .com
policies triggered the current concern, has everything to gain by their plug-in
becoming the only game in town.
> VeriSign ... has everything to gain by their plug-in becoming the only game
> in town.

I should have mentioned that there is also an open source IDN plug-in for IE.
This is presented as alpha, among other things, because it leaks punycode in the
status line. As things are now developing, this may end up being a strong
feature. It can also be toggled on and off, which, together with the status line
display of punycode, may be all that's really called for to ease present
concerns. The IDN-OSS plug-in performs as described here even when stacked on
top of the VeriSign plug-in (although other negative interaction cannot be
discounted) and probably deserves some attention from the participants in the
present thread. The VeriSign plug-in is available at http://idnnow.com and the
open-source one at http://idn.isc.org.
I agree, the prospect of a proprietary plug-in monopolizing the market is the
worst of all possible worlds. 

I now think the best short-term solution to the spoofing problem is the one
proposed by Gerv, namely that domains run by non-standards-compliant registrars
get their Punycode made visible -- it's neat, easy to code, and does not require
disabling IDN support.

I continue to believe that stricter IDN filtering rules, both at the registrar
and browser, as suggested in my earlier proposals, are necessary in the medium-
and longer term. However, based on input off-line from a number of people, I now
believe that this can best be achieved by working with IANA / ICANN and the
registry community, so that we do not have to take responsibility for an ad-hoc
non-standard implementation, but can instead be seen to be implementing a
solution based on authoritative standards.

Incidentally, the discussion in this bug has kicked off a related discussion in
the Unicode mailing list, where it has been mentioned that there is now a
proposal to create an "official" homograph list.
> I now think the best short-term solution to the spoofing problem is ... that
> domains run by non-standards-compliant registrars get their Punycode made
> visible -- it's neat, easy to code, and does not require disabling IDN 
> support.

How do you propose determining the identity of the registrar?
What standards is registrar behavior to be judged against?
Rather than just showing the punycode in the status bar - which many people
either don't have turned on or don't notice, and which may even be altered by a
script on the website (unless that ability has been disabled by the user) - how
about displaying an information bar at the top, just like the popup blocker
does, that explains that it is an internationalised domain name, notes the
possible security implications, and shows the punycode version.

It should provide a more information link/button and the option to add the site
to trusted and untrusted lists.  This could be used in conjunction with any of
the other proposed checks so that it's not shown for every single IDN, just
those that mozilla detects as the likely candidates for spoofs.
I like the idea in comment 145. It may harm valid IRIs a bit, but they are not
widely deployed, and I guess the option is pref-controlled so people can turn it
off.

(By "valid IRIs" I mean IRIs that are registered without the intention to spoof
users.)
> (Please note that the IDN
> policies for a TLD are set by the *registry*, not the *registrars*.)

Well that's good, because it's easy for us to determine the registry (just look
at the TLD), but hard for us to determine the registrar (requires a WHOIS).

If the policies for .com do not protect against phishing, then we should not
display IDN domains in their full form in that TLD, because to do so is a
security risk. It's as simple as that.

I don't quite understand how Mozilla not displaying IDN for .com gives Verisign
a monopoly on anything. But if Verisign want to have a monopoly on putting their
customers at risk of phishing, let them.

I strongly believe that whatever solution we implement should allow full,
uncrippled and first class implementation of IDN in those cases, whatever they
may be, where we have established that there is no more risk than in the ASCII
domain name space.

(In reply to comment #145)
> Rather than just showing the punycode in the status bar - which many people
> either don't have turned on or don't notice, and which may even be altered by
> a script on the website (unless that ability has been disabled by the user) -

My suggestion is not to only show the punycode in the status bar, but to use it
everywhere for TLDs which have poor homograph control policies.

The status bar is always-on in Firefox, unless the user specifically disables
it. This is a security feature. The security area of the status bar (to the
right) cannot be altered by script.

> how about
> displaying an information bar at the top, just like the popup blocker does,
> that explains that it is an internationalised domain name, notes the possible
> security implications, and shows the punycode version.

A strong characteristic of a good solution is that it does not discriminate
against all IDN domain names. This solution, in its plain form, does.

There is definitely value in using a phishing detection heuristic to display
such a bar - but that's fixing the more general phishing problem, not just the
homograph one.

Gerv
> I don't quite understand how Mozilla not displaying IDN for .com gives
> Verisign a monopoly on anything.

Take a look at the documentation provided by the TLDs that support IDN.
Prominent in every such text is a clear reference to the need for an
IDN-compliant browser, and a list of available alternatives. Such lists are
almost always headed by the VeriSign IE plug-in and Mozilla, often listing no
further alternatives. Regardless of VeriSign's IDN policies in .com, their IE
plug-in is a sound implementation of IDNA. The same goes for Mozilla. What do
you think the maintainers of the TLD documentation are going to do if one of
these two decides that it is no longer going to provide rigorous support for IDNA?

> if Verisign want to have a monopoly on putting their customers at risk of
> phishing, let them.

How are you defining the concept of VeriSign customer?  Someone who expects to
be able to use the Unicode form of an IDN in .com?  Someone who is using IE for
the task?
(In reply to comment #148)
> Take a look at the documentation provided by the TLDs that support IDN.

Could you give links to such documentation for a few different TLDs?

> Prominent in every such text is a clear reference to the need for an
> IDN-compliant browser, and a list of available alternatives. Such lists are
> almost always headed by the VeriSign IE plug-in and Mozilla, often listing no
> further alternatives. Regardless of VeriSign's IDN policies in .com, their IE
> plug-in is a sound implementation of IDNA. The same goes for Mozilla. What do
> you think the maintainers of the TLD documentation are going to do if one of
> these two decides that it is no longer going to provide rigorous support for IDNA?

So your argument is "People won't use or recommend your browser if you try and
protect them from security problems"?

Are you saying that providing "Warning! This could be a scam!" information on a
subset of IDN names is "rigorous support", whereas allowing all IDNs except
those in known-risky TLDs is not?

I would hope that, in a few months, the problematic registrars will see the
writing on the wall and fix their policies. By the time IDN use becomes
widespread, everything will be sweetness and light again. However, some pressure
needs to be put on them to achieve this aim.

If we accept responsibility for the problem, say "Yeah, you keep on registering
what domains you like. We'll try and sort out the phishing problem at our end",
then we're opening ourselves up to massive and unnecessary liability and bad
publicity every time a bug is found in e.g. our embedded homograph tables (if
that's the solution chosen - it's an example).

> > if Verisign want to have a monopoly on putting their customers at risk of
> > phishing, let them.
> 
> How are you defining the concept of VeriSign customer?  Someone who expects to
> be able to use the Unicode form of an IDN in .com?  Someone who is using IE for
> the task?

Someone who registers a domain in .com - like Paypal, Inc. or Bank Of America.
These companies are put at greater risk of damaged reputations and irate
customers with monetary losses because of Verisign's (and the other .com
registrars') lack of control over domain registration.

Gerv
>> Take a look at the documentation provided by the TLDs that support IDN.
> 
> Could you give links to such documentation for a few different TLDs?

Most of it is in the national languages of ccTLD registries. Dot-com provides a
good example of the way a large commercial gTLD is doing this, but given their
proprietary interest in browser support, they only point users in the direction
of their own plug-in:

http://verisign.com/products-services/naming-and-directory-services/naming-services/internationalized-domain-names/index.html

The other end of the gTLD scale -- small domain, non-profit operation -- is
dot-museum, http://about.museum/idn/. Their list of supported languages contains
links to a number of ccTLD IDN support sites.

> Are you saying that providing "Warning! This could be a scam!" information on
> a subset of IDN names is "rigorous support", whereas allowing all IDNs except
> those in known-risky TLDs is not?

I believe the warning text to be an excellent means for balancing the two
considerations. What I don't want to see happen is the resolution of IDNs made
conditional.
 
> I would hope that, in a few months, the problematic registrars will see the
> writing on the wall and fix their policies.

Again, registrars are not responsible for the IDN policies of TLDs.

> By the time IDN use becomes widespread, everything will be sweetness and
> light again. However, some pressure needs to be put on them to achieve this
> aim.

The question of who needs to apply what kind of pressure to whom, and how that
might effectively be done, goes way, way, beyond the scope of the present
discussion. The whom is, however, not the TLD registrars.
> I believe the warning text to be an excellent means for balancing the two
> considerations. What I don't want to see happen is the resolution of IDNs made
> conditional.

So people are not going to recommend Mozilla if we disable IDN in problematic
TLDs, but they are if some (random, to the user) uses of an IDN pop up a scary
warning message?

> > I would hope that, in a few months, the problematic registrars will see the
> > writing on the wall and fix their policies.
> 
> Again, registrars are not responsible for the IDN policies of TLDs.

The link's not hard to understand. This is the way we put pressure on:

- disable IDN for problematic TLDs
- fewer people register IDN names in those TLDs, because
- registrars get less money
- registrars either get together to solve it, or put pressure on the registry
- registry or registrars implement sensible policies
- we lift the block.

How else do you suggest that we persuade them to sort their acts out?

> The question of who needs to apply what kind of pressure to whom, and how that
> might effectively be done, goes way, way, beyond the scope of the present
> discussion. 

It's precisely the present discussion - because if we are going to take
responsibility in the browser for solving this problem (which would be contra to
Opera's stance, and my understanding of current mozilla.org staff opinion) then
our course of action is going to be very different to that if we are making it
clear it's a registry problem.

Gerv 
My apologies; in a rush to catch Match of the Day, I left that message
incomplete and unnecessarily brusque. Attempt 2 at the middle section:

> Here's how we establish a link:
> 
> - disable IDN for problematic TLDs
> - fewer people register IDN names in those TLDs, because they appear ugly
> - registrars get less money
> - registrars either get together to solve it, or put pressure on the registry
> - registry or registrars implement sensible policies
> - we lift the block.
> 
> How else do you suggest that we persuade them to sort their acts out?

Gerv
How do we determine which TLDs are "safe" (or unsafe)? There's a fairly short
list registered with IANA (comment 35).  Other comments state belief that other
TLDs are responsible (e.g. au,de). Some suggest going by statements made by the
registries themselves, but Verisign says the right things
(http://verisign.com/products-services/naming-and-directory-services/naming-services/internationalized-domain-names/page_001394.html#01000006)
while "paypal.com" in a mixture of latin and cyrillic shows .com is broken.
It is difficult to decide on a black (or white) list of TLDs. However:

(In reply to comment #135)
> This has many significant advantages. It's fairly simple to code, and doesn't
> penalise IDN domain owners and registrars who have been doing the right thing.
> [It doesn't place any restrictions on what domains are allowed.] It requires no
> user configuration, and no assumptions about what characters a given user might
> be familiar with. It involves no pop-ups. It places the blame and the
> responsibility where it really belongs, and kills any homograph attacks stone
dead.

I have another proposal that meets all of the above criteria, except for the
one enclosed in [ and ] which I feel is inappropriate to begin with.

We simply use the subset of IANA IDN tables that we deem safe as a filter.
If a domain name contains characters outside the TLD's table, we present
the Punycode form of the name in the UI.

The safe IANA tables are ones that either have a single language or have
multiple languages but do not have homographs. The JP table is a good
example of a safe table.

On the other hand, the .biz German table is unsafe because it implies that
other languages such as Russian might be registered in the future. This
means that .biz allows homographs, so it doesn't pass our test.

This places the pressure where it belongs, i.e. on the guidelines authors,
the IANA registry and the domain registries.

Mozilla could either start with the very small number of safe IANA IDN
tables (putting some pressure on registries that haven't registered their
table yet) or a larger number of tables that we come up with on our own
(which the TLD registries can use for their IANA submission if they wish).
TLDs without tables would default to US-ASCII, a safe set.

Mozilla would enlarge its set of tables via new releases, auto-updates or
user-intervention-less secure downloads.

This means that our rules would never allow Cyrillic small letter 'a' to
be used in a *.com domain name, but that's OK because the Latin small
letter 'a' looks the same to a human and its character code should not
be a concern.
(In reply to comment #154)
> This places the pressure where it belongs, i.e. on the guidelines authors,
> the IANA registry and the domain registries.

It also pressures the domain registrars.

> This means that our rules would never allow Cyrillic small letter 'a' to
> be used in a *.com domain name, but that's OK because the Latin small
> letter 'a' looks the same to a human and its character code should not
> be a concern.

Sorry, what I meant to say is that we would allow any domain name but we
would display the Punycode form if it didn't follow the TLD's rules.

I should add that my proposal provides for a second filter. If the domain
registrar fails to filter a new domain name or to remove an old spoof,
then our filter will catch it.
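(A sketch of this table-based filter in Python; the table below is a
deliberately tiny illustrative excerpt -- real tables would come from the IANA
IDN registry, and the real JP table also includes kanji:)

ASCII_DNS = set("abcdefghijklmnopqrstuvwxyz0123456789-")
SAFE_TABLES = {
    "jp": ASCII_DNS | {chr(c) for c in range(0x3041, 0x30FF)},  # kana excerpt
}

def follows_tld_table(host):
    # Every character of every name label must be in the TLD's table;
    # TLDs without a registered table default to US-ASCII, a safe set.
    labels = host.lower().split(".")
    allowed = SAFE_TABLES.get(labels[-1], ASCII_DNS)
    return all(ch in allowed for label in labels[:-1] for ch in label)

def display_form(host):
    # Outside the table: present the Punycode form in the UI.
    return host if follows_tld_table(host) else host.encode("idna").decode("ascii")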
My most recent proposal is in some sense taking Mozilla's standards stance
to its logical extreme. I.e. Mozilla does not simply follow the ECMAScript
standard as is; it blocks pop-up windows.

My proposal does not simply follow the ICANN guidelines as is; it blocks
homographs.

However, a lot of people would point out that this is like the tail wagging
the dog. Me being the tail and ICANN, IETF, IANA, Unicode Consortium,
domain registries and registrars being the dog.

So it would probably be prudent for Mozilla to heed Neil's advice:

(In reply to comment #143)
> I continue to believe that stricter IDN filtering rules, both at the registrar
> and browser, as suggested in my earlier proposals, are necessary in the medium-
> and longer term. However, based on input off-line from a number of people, I now
> believe that this can best be achieved by working with IANA / ICANN and the
> registry community, so that we do not have to take responsibility for an ad-hoc
> non-standard implementation, but can instead be seen to be implementing a
> solution based on authoritative standards.

Having said that, the easiest way to come up with a list of TLDs for Gerv's
proposal is to first choose to use a white list of such TLDs and then to
take a closer look at the IANA IDN registry.

I propose to use the following list of TLDs initially: jp, kr and th.
(In reply to comment #156)
> My most recent proposal is in some sense taking Mozilla's standards stance
> to its logical extreme. I.e. Mozilla does not simply follow the ECMAScript
> standard as is; it blocks pop-up windows.

window.open is not part of any standard, _especially_ not ECMAScript.
(In reply to comment #157)

Chuckle. I really ought to just shut up... :-)
The ICANN guidelines include the following:

"top-level domain registries will (a) associate each registered
internationalized domain name with one language or set of languages"

I wonder if this language or set of languages can be looked up via DNS
itself. I.e. is there a DNS record for the language(s)?

Or are the registrars only expected to apply language rules at the time
of registration itself?
(In reply to comment #145)
> Rather than just showing the punycode in the status bar - which many people
> either don't have turned on or don't notice, and which may even be altered by
> a script on the website (unless that ability has been disabled by the user) -
> how about displaying an information bar at the top, just like the popup
> blocker does, that explains that it is an internationalised domain name,
> notes the possible security implications, and shows the punycode version.
> 
> It should provide a more information link/button and the option to add the site
> to trusted and untrusted lists.  This could be used in conjunction with any of
> the other proposed checks so that it's not shown for every single IDN, just
> those that mozilla detects as the likely candidates for spoofs.

I like this idea a lot. To me, it'll be the best method of alerting the user
without causing inconvenience, and this will also give them a better sense of
security.
(In reply to comment #156)
> I propose to use the following list of TLDs initially: jp, kr and th.

I guess many people would complain that this list is too short. There appear
to be quite a few IDNs registered around the world, e.g. Europe, China.

If Mozilla requires these TLD representatives to register their table with
IANA in order to be included in Mozilla's white list, they might be in too
much of a rush to compile the table and submit it, increasing the risk of
mistakes.

Perhaps we should instead have them point us at any existing tables they
have (e.g. the DE table mentioned here earlier) and have them state their
intent, in writing, to register with IANA.

This way, they can take their time to polish the table(s) and also their
implementations of filters at registrars, etc.
(In reply to comment #151)
> So people are not going to recommend Mozilla if we disable IDN in problematic
> TLDs, but they are if some (random, to the user) uses of an IDN pop up a
> scary warning message?

If someone is actively maintaining a list of IDNA-compliant applications, it
would be reasonable for them to remove an item from the list if it ceased to
fulfill all "must" requirements stated in the protocol, or was otherwise
rendered inapplicable to the entire TLD namespace. If useful functionality is
added to an IDNA-aware application, it would be equally reasonable for the
application to remain on the list.

> - disable IDN for problematic TLDs
> - fewer people register IDN names in those TLDs, because they appear ugly
> - registrars get less money
> - registrars either get together to solve it, or put pressure on the registry
> - registry or registrars implement sensible policies
> - we lift the block.
>
> How else do you suggest that we persuade them to sort their acts out?

Registrars provide automated front-ends to the TLD registries, with the policy
engines residing on the latter platform. Registrars may freely decide which TLDs
they wish to service, but then need to support each selected TLD fully.

Registrars compete fiercely with each other on a market that is still only a
shadow of what it once was. If you wish to teach individual registries a lesson
by somehow whipping their sales agents into compliance, you'd need to be able to
do this without leaving any remaining registration channel into a shunned TLD.
As it happens the only gTLD where there is real IDN money is .com. Network
Solutions (the largest of the registrars, previously doing business as VeriSign
Registrar) is certain to support this domain regardless of what any other
registrar may feel compelled to do -- and all the other registrars know it.
Note that if Mozilla is going to use published language tables, you're going to
have to look rather harder than just at the IANA registry.

The ICANN guidelines only require the registry to publish their language tables;
they can do so by any appropriate means (e.g. by placing them on their web site).
 Use of the IANA registration mechanism is entirely optional, so it would be
inappropriate to penalize those registries that haven't used it.
This was just posted on the Unicode mailing list:

From: 	Mark Davis <mark.davis@jtcsv.com>
To: 	Unicode Mailing List <unicode@unicode.org>, UnicoRe Mailing List
<unicore@unicode.org>
Subject: 	IDN Security
Date: 	Mon, 14 Feb 2005 09:20:06 -0800  (20:50 IRST)

There were a few items coming out of the UTC meeting in regards to IDN.

1. We will be adding to draft UTR #36: Security Considerations for the
Implementation of Unicode and Related Technology
(http://unicode.org/reports/tr36/). In particular, this will include more
background information and a set of specific recommendations for both
browsers and registrars; both to be refined over time.

2. The UTC has authorized the editorial committee to make updates to #36
between UTC meetings, to allow for faster turn-around in presenting both
background material and recommendations. We will try to incorporate ideas
presented on these lists and others, so suggestions are welcome.

3. The UTR had for some time recommended the development of data on visually
confusables, and we will be starting to collect data to test the feasibility
of different approaches. In regards to that, I'll call people's attention to
the chart on http://www.unicode.org/reports/tr36/idn-chars.html, that shows
the permissible IDN characters, ordered by script, then whether decomposable
or not, then according to UCA collation order. (These are characters after
StringPrep has been performed, so case-folding and normalization have
already been applied.)

&#8206;Mark
Does the IDN display only happen in the URL bar?
What about email headers? If I register a domain as "ao" + Cyrillic l + ".com"
and send an email as "ytang0648@ao" + Cyrillic l + ".com" to you, and you look
at it in Thunderbird, when you reply, will it go back to ytang0648@aol.com, or
may it go back to "ytang0648@ao" + Cyrillic l + ".com"?
about "mix set"- considering someone use "www." + "ÈZÈQÈUÈi" + ".com" for www.ebay.com 

all the characters in "ÈZÈQÈUÈi" are in Cyrillic block. Not a mix set. 
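(For what it's worth: IDNA requires the ACE (punycode) form on the wire, so a
reply goes back to whichever domain that ACE name identifies -- which for the
spoofed address is not aol.com. A quick round trip with Python's built-in IDNA
codec, using the Cyrillic letter named in the comment above:)

spoof = "ao\u043b.com"                          # "ao" + CYRILLIC SMALL LETTER EL
print(spoof.encode("idna") == "aol.com".encode("idna"))  # False: different domains
print("aol.com".encode("idna"))                 # b'aol.com'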
It's going to take longer to sort this out.  Minus for 1.0.1 and plus for 1.1.
Flags: blocking1.8b2+
Flags: blocking1.8b-
Flags: blocking-aviary1.1+
Flags: blocking-aviary1.0.1?
Flags: blocking-aviary1.0.1-
I think trying to map against specific glyphs that are similar is always going
to be error-prone and difficult unless all browsers standardise on a font across
platforms for display of URLs.

My suggestion for immediate response to this bug is as follows:

1) Use a different colour for the address bar for domains that are not in the
range provided by the user's default character encoding (even if this is ASCII
for, say, Japanese users). This treats all domains equally. What a user will need
to know is when a domain is not in their default encoding (otherwise they can
basically trust the glyphs I guess).

2) For domains covered by 1) above, also include the raw Unicode character codes
of the domain (as opposed to the friendly view) in brackets after the domain name.
Are these adequate visual cues?

3) The first time any site is visited that meets the condition 1) above, display
a warning to the user, explaining what has happened, and give them the option to
permanently disable this warning.

4) I think blacklists (other than country and international standards/guideline
based) should be handled at the proxy layer using real-time block lists. If
these things aren't handled locally in this way, there is a danger that
legitimate users who are unfortunately caught by them might consider that they
are being denied service.
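(A rough sketch of check 1) in Python, using the locale charset as a stand-in
for the user's default character encoding; note that a UTF-8 locale accepts
every character, so a real implementation would need per-language repertoires
instead:)

import locale

def outside_default_encoding(host):
    charset = locale.getpreferredencoding()      # e.g. "cp1252"
    try:
        host.encode(charset)                     # representable: leave alone
        return False
    except UnicodeEncodeError:
        return True                              # flag: recolour the address bar

# outside_default_encoding("www.p\u0430ypal.com") -> True under cp1252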
First off, here's an attachment showing how the IDN URI looked in my address
bar the first time I went to it.

Imagine my surprise when I saw it and said, "that doesn't work at all... what
the hell is that letter?"

I'm not espousing that we make different fonts appear with impossible to read
characters (my OS has upsettingly done that for me already by screwing with my
Intl. fonts it seems)... What I am saying is that we should notice how
different we can make something (and how obvious it can appear) when we use
different fonts, make things bold, etc.

Now, Paul Hoffman suggests using this same sort of solution in his blog
http://lookit.proper.com/archives/000302.html

We have to remember that making the IDN feature obnoxious will only breed
dissatisfaction, but making it noticeable will hopefully alert people to what is
happening.

Also, we should not (and i'd imagine "Must Not" in some spec somewhere)
manipulate the URI in such a way that renders it useless. In other words, we
cannot put extra characters in the Address bar like exclamation marks, etc.
While our browser might be able to handle them, we have to remember that people
bookmark, copy/paste, save, and physically write (using pen and paper) URIs all
the time. We can't have "but if I type it in my browser it works fine,
Grandma".

So setting colors, spacing, weight, etc. of various character sets by default
should seem to be a valid solution. Obviously, we should choose fonts that
match the user's localization as best as possible (i.e. Cyrillic users could
have the Cyrillic letters appear less intrusively since they'd most likely be
seeing them more.)


Also, as Paul Hoffman suggests, information about the specifics of the issue
should be presented to the user. I think we've all decided that dialogs do more
harm than good, but the new "Alert Bars" that firefox and thunderbird use when
installing software or loading remote images all seem like valid (less)
obtrusive notifications. And having them constantly wouldn't be necessary... we
could simply state to the user, "This page features mixed characters from
different languages. If this is unexpected, this page may be fraudulent." Then
we could have a button (much like the "allowed sites button" for firefox) that
would dismiss this message for particular combinations of character sets.

In this fashion, a user could easily deactivate the warning for certain
character set combinations that they commonly visit. Localizations could even
make the setting for them. Other users will still be notified (both in the
Address bar itself, and in the "Alert Bar" message)


Obviously, no solution will fully remove the danger of this type of spoof, but
a simple consistent system of alerting the user will sufficiently enable them
to make informed decisions.
(In reply to comment #169)
> Created an attachment (id=174352)
> Screenshot showing Mozilla's default rendering on my computer.
> 
> First off, here's an attachment showing how the IDN URI looked in my address
> bar the first time I went to it.
> 
> Imagine my suprise when I saw it and said, "that doesn't work at all... what
> the hell is that letter?"

Well, it didn't work well in your case because you used an X11core font build of
Mozilla on a Unix-like platform. Mozilla built with a modern font system (i.e.
Xft) on Unix as well as on Windows and Mac OS X makes it all but impossible to
tell Cyrillic 'a' from Latin 'a' because in some (truetype and opentype) fonts
covering both Latin and Cyrillic (there are many of them), a *single* glyph is
very likely to be shared by two letters so that they look 100% identical.

As demonstrated by your screenshot, what's been regarded as a hindrance to a
good looking rendering (an inflexible partitioning of characters into font
character sets in X11)  could give us a hint about a potential solution.

  
> [remainder of comment #169 snipped - see above]
Here is my proposal (I'm a native Russian speaker who often visits English,
German, Russian and Ukrainian sites).
In short: display the alphabet name next to the second-level part of the
domain. Let's take for example the abbreviation "pap" written in Latin and in
Cyrillic letters; it looks absolutely the same in Russian and Ukrainian, but I
can't blacklist it (remember, I read Russian, Ukrainian and English). Here is
how the Latin "pap" can be displayed in different domains:

pap.com
pap.de
pap.ru
pap.ua

Here is how pap written in Cyrillic letters can be displayed:
pap--Cyrillic.com
pap--Cyrillic.de
pap--Cyrillic.ru
pap--Cyrillic.ua

If we mix Cyrillic and Latin letters the browser will display them as
pap--UNKNOWN.com
pap--UNKNOWN.de
pap--UNKNOWN.ru
pap--UNKNOWN.ua

Good, but now we have a problem with the Ukrainian language: it
uses a Latin i. So we have to add a Ukrainian language detector.
For example, for the word pip where p is Cyrillic and i is
Latin, the browser will show:

pip--Ukrainian.com
pip--Ukrainian.de
pip--Ukrainian.ru
pip--Ukrainian.ua


URLs must be displayed this way everywhere, not only in the address
area! The alphabet detector can return three types of results: an
alphabet name, UNKNOWN and INVALID. When detectors for all Cyrillic
languages are completed, it will be possible to forbid any other mixes of
Cyrillic and Latin alphabets, so paypal with some Cyrillic letters
will simply be an error. You won't even be able to visit it.
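(A sketch of this alphabet-annotation idea in Python; deriving the script from
Unicode character names is a crude stand-in for real Unicode script data, and
it deliberately leaves out the Ukrainian special case discussed in the
following comments:)

import unicodedata

def script_of(ch):
    return unicodedata.name(ch, "UNKNOWN").split()[0]   # "LATIN", "CYRILLIC", ...

def annotate(label):
    scripts = {script_of(ch) for ch in label if ch.isalpha()}
    if not scripts or scripts == {"LATIN"}:
        return label                                      # the familiar case
    if len(scripts) == 1:
        return label + "--" + scripts.pop().capitalize()  # e.g. "pap--Cyrillic"
    return label + "--UNKNOWN"                            # mixed alphabets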
  I don't know about others, but I tend to ignore the banners at the top of a
page unless I have a specific reason to look at them.  I like the banner concept
(ie. that it's less annoying than a pop-up).
  In order to avoid a user simply ignoring the banner or mindlessly clicking
away a pop-up box, I think that a user should be unable to submit information to
a site without dismissing the banner.  I'm imagining that if the banner is not
dismissed, a pop-up box would appear saying something like: "Due to a potential
homograph attack, mozilla has blocked submitting data to this site.  Please follow _this
link_ for more information on homograph attacks.  To allow data submission,
please dismiss the homograph attack banner. [Cancel]"
  The wording probably needs work, but I hope it gets the idea across.  I think
the pop-up box has to have only one button on it in order to force a user to
read it.  I don't expect this kind of confirmation if the user has submitted
information to the site previously, but for the first visit I think it would be
acceptable.
(In reply to comment #172)
>   I don't know about others, but I tend to ignore the banners at the top of a
> page unless I have a specific reason to look at them.

When you are about to enter your credit card number, do you look at the
location bar? (And do you like Mozilla's default font there? :-)

>   In order to avoid a user simply ignoring the banner or mindlessly clicking
> away a pop-up box, I think that a user should be unable to submit information to
> a site without dismissing the banner.

What if the page says:

"Due to an increased number of identity theft cases on the Internet
recently, we strongly urge you to use registered mail to confirm your
password. Our secure address is P.O. Box 2369, Miami, Florida 34210."
(In reply to comment #45)
> ... For example, a different color ...

I've been arguing in Bug 22183 for a color-coded URL, which would (I believe)
help with the IDN issue as well. 
* https://bugzilla.mozilla.org/show_bug.cgi?id=22183#c233
* https://bugzilla.mozilla.org/show_bug.cgi?id=22183#c237

Then, today I found on boingboing.net a link to
http://lookit.proper.com/archives/000302.html which talks about having a
different background color for homographs in a tooltip. 

I think that a different background color (or style) would work well
in the URL bar.
The following announcement was posted to the mozilla.{seamonkey,security}
newsgroups recently:

http://weblogs.mozillazine.org/gerv/archives/007556.html
Here is some info about 3 IDN plug-ins for MSIE and whether MSIE might
support IDN in the future:

http://support.microsoft.com/?kbid=842848

I found the above link in the following:

http://www.w3.org/International/articles/idn-and-iri/
(In reply to comment #171)

Thank you for sending this info, especially the Ukrainian info.

Do you think that Cyrillic domain registrants might also wish to include
some "Latin" (ASCII) letters in their domain names? (For example, some
foreign names like "IBM" or some better example?)

Neil, would it be possible to use code points instead of ranges in your
proposal's script detection in order to support Ukrainian?
(In reply to comment #173)
> >   In order to avoid a user simply ignoring the banner or mindlessly clicking
> > away a pop-up box, I think that a user should be unable to submit information to
> > a site without dismissing the banner.
> 
> What if the page says:
> 
> "Due to an increased number of identity theft cases on the Internet
> recently, we strongly urge you to use registered mail to confirm your
> password. Our secure address is P.O. Box 2369, Miami, Florida 34210."

That's just suspicious to begin with ;)  Short of incredibly annoying actions on
every potential homograph page (such as blacking out the displayed page),
there's no real way (for mozilla) to stop that.

Given that the confirmation I gave will only (incorrectly) occur if all the
following are true:
  * The page is falsely detected as a homograph by mozilla
  * The user ignored the warning banner
  * The site requires the user to type in and submit information
    (as opposed to selecting choices from drop down boxes or clicking links)
  * It's the first time the user submits information to the site
I don't think the forced acknowledgement is unreasonable...
(In reply to comment #171)
> Good, but now we have a problem with the Ukrainian language: it
> uses a Latin i. So we have to add a Ukrainian language detector.
> For example, for the word pip where p is Cyrillic and i is
> Latin, the browser will show:

Actually, I don't understand this. The Unicode book says that the following
character exists:

0456 CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
The test from secunia.com (http://www.paypаl.com/ where the second 'a' is fake)
works even when network.enableIDN is set to false, at least on SuSE 9.1, Firefox
1.0 i686.
(In reply to comment #179)
> (In reply to comment #171)
> > Good, but now we have a problem with the Ukrainian language: it
> > uses a Latin i. So we have to add a Ukrainian language detector.
> > For example, for the word pip where p is Cyrillic and i is
> > Latin, the browser will show:
> 
> Actually, I don't understand this. The Unicode book says that the following
> character exists:
> 
> 0456 CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I

I can confirm that this character should indeed be used in Ukrainian. Someone
else on the IDN list also raised that Tajik mixes Cyrillic and Latin, which
again turned out to be wrong. As far as I know, the only language that mixed
Latin and Cyrillic
is a very old orthography of Kurdish, which is rarely used today. Most Kurdish
writers use either Arabic or Latin.
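(This is easy to confirm against the Unicode database, e.g. with Python's
unicodedata module -- so an all-Cyrillic label check needs no Latin-i
exception for Ukrainian:)

import unicodedata
print(unicodedata.name("\u0456"))
# CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I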
How about: if each label in the https domain name is composed of characters in
one of the languages the user understands, then in the status bar display
"unicode domain name + padlock", else display "punycode domain name + padlock".
At the top of the browser, the address is always in Unicode (to avoid cultural
offence), but there could be a small IDN symbol to the left of the favicon,
clickable to see the punycode and IP address.

If the accept-language list was empty, the browser locale language would be
used.  Otherwise the accept-language list should be used and the locale ignored.
 (If I'm in an internet cafe in Germany (I don't speak German), I add "en, eo,
fr" to the accept-languages (as I read English/Esperanto/French but not German).)

What characters are in each language?  For Europe, see
http://www.evertype.com/alphabets/index.html .  Each domain label should be in
one and only one of the user's understood languages (but edge conditions do
exist, e.g. see http://www.toysrus.co.uk -- could the r be Cyrillic?).

cheers,
Aaron
http://lingvo.org/idnd
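(A sketch of this per-language test in Python; the repertoires are
illustrative stubs, not real alphabet data such as the evertype.com tables:)

LETTERS = "abcdefghijklmnopqrstuvwxyz0123456789-"
REPERTOIRES = {
    "en": set(LETTERS),
    "fr": set(LETTERS) | set("àâæçéèêëîïôœùûüÿ"),
}

def label_ok(label, langs):
    # A label passes if it fits entirely within at least one of the
    # user's accept-language repertoires.
    return any(set(label.lower()) <= REPERTOIRES[lang]
               for lang in langs if lang in REPERTOIRES)

def host_ok(host, langs):
    return all(label_ok(label, langs) for label in host.split("."))

# host_ok("www.toysrus.co.uk", ["en", "eo", "fr"]) -> True; the same name
# with a Cyrillic look-alike letter would fail every repertoire.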
Another way to protect the user against this
(and phishing attempts) is to check when
the server's DNS record was registered.

If it's less than (say) 1 week ago, then alert the user,
and warn that the server is new, and to be cautious
with personal / financial information.

Of course, this will encourage phishers to sit on
a newly registered site before using it....
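(A rough sketch of the registration-age check in Python; it shells out to a
Unix whois binary and assumes a "Creation Date:" line as found in .com
records -- formats vary by registry, so real code would need per-TLD parsing:)

import re
import subprocess
from datetime import datetime, timedelta, timezone

def is_newly_registered(domain, days=7):
    out = subprocess.run(["whois", domain],
                         capture_output=True, text=True).stdout
    match = re.search(r"Creation Date:\s*(\d{4}-\d{2}-\d{2})", out)
    if not match:
        return False                      # age unknown: don't warn
    created = datetime.strptime(match.group(1), "%Y-%m-%d")
    created = created.replace(tzinfo=timezone.utc)
    return datetime.now(timezone.utc) - created < timedelta(days=days)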
A couple of comments.

1. White/blacklist of TLDs ignores other spoofing possibilities.  It is possible
for IDN characters to appear in more than just the domain name.  Combined with a
DNS cache poisoning attack, it is possible to then spoof a third-level hostname.
(e.g. &omega;&omega;&omega;.foobank.de)

2. Language detection by registrars seems pretty unreliable.  I registered a
Japanese-language IDN .com domain with a French registrar and they assigned the
language to be French.

3. I like the both the information bar and highlighting ideas.  Ideally, only
the non-ASCII characters will be highlighted in the host name.  Mousing over
would then provide more information.

-Chris
(In reply to comment #184)

Re: 1., the TLD white/black list proposal would simply apply to *all*
domains under those TLDs, so the 3rd level IDN domain name spoof you
mention is simply reduced to the DNS cache poisoning problem itself,
which I submit is outside the scope of this bug report. No?

Re: 2., when/how did you find out that the registrar assigned the French
language to your domain name? Thanks!
The article "Phishing - Browser-based Defences" at
http://www.gerv.net/security/phishing-browser-defences.html
makes a lot of good points. Here are two comments
about it, though.

First, I think it sells the colorizing of letters
a little short.  There are millions of users who would
NEVER visit a DNS site name that had mixed scripts.
Obviously, the millions of people who only read & write
English are unlikely to WANT to visit a site that uses
non-English letters, since such sites tend to not use English (!).
Only colorizing when they're mixed, and letting people
turn it off if it's "ugly", would help millions of people.
Yes, the colorblind & some users will not be helped, but
if you help the majority, it's unlikely phishers will perform
the attack.  A phishing attack that only works against
the colorblind is less likely to be attempted.

One simple solution: you can often guess the best default
("should I colorize or not?") based on the user's language
settings; then let the user set it differently as an option.

Second, I think you'll need more glyphs than 2
if you do the "symbol glyphs" approach (which is
an interesting idea!).  A phisher
could create a program that randomly morphs a
domain name in many different ways, trying to
find a good substitution, and then hashing the result
to see if the glyphs match.

The way to figure out the required number of glyphs
is to imagine a program that can create a large number of
"phish food" domain names from a given name
(substitute l for 1, substitute O for 0, do both, ...),
and see how many alternatives you can find for a
given name.  Here's a back-of-the-envelope calculation;
say a phished domain name has no more
than 10 DNS characters in the name, and on
average each character can be reasonably substituted
with 3 other characters. (These are guessed numbers,
but it should be possible to figure out REAL values
from these using common phished domains like
paypal.com, ebay.com, etc.). That means that the set of
alternatives is (3+1)^10-1, i.e., 1,048,575 alternatives -
about one million.  A two-char glyph only gives
64^2 = 4,096 hashes, so a phisher is almost
certain to find several alternatives with the same visual hash.
Four glyphs gives you 64^4 = 16,777,216... a phisher
only has approximately 1/16 chance of finding a match.
Five glyphs gives you 1,073,741,824... the phisher
has around a 0.1% chance of finding a match.
Shorter domain names are even harder to forge.

--- David A. Wheeler 
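(The back-of-the-envelope numbers above check out; spelled out in Python with
the same assumed figures -- 10 characters, 3 substitutes each, 64 possible
symbols per hash glyph:)

candidates = (3 + 1) ** 10 - 1        # 1,048,575 look-alike names
for glyphs in (2, 4, 5):
    hashes = 64 ** glyphs
    print(glyphs, hashes, candidates / hashes)
# 2 glyphs ->         4,096 hashes: ~256 expected collisions per target
# 4 glyphs ->    16,777,216 hashes: roughly a 1/16 chance of a collision
# 5 glyphs -> 1,073,741,824 hashes: roughly a 0.1% chance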
"the millions of people who only read & write
English are unlikely to WANT to visit a site that uses
non-English letters, since such sites tend to not use English (!)."

So some 350 million native English speakers (assuming none of them speaks a
foreign language) are a majority now? ;) But I'm in favor of some sort of color
code as well. It's quite similar to the glyphs Gerv is suggesting in that people
see a difference between their usual site and a phishing site. While a hash
can be "spoofed" with a hash collision or a similar-looking sign, the color
coding would only tell you about the character encoding. The difficulty will be
to compartmentalize the character encodings in such a way that it is unlikely
that two different encodings with the same color could be used to spoof a
similar-looking domain.
(In reply to comment #38)
> Created an attachment (id=173729)
> Proposed blacklist of Unicode code points that should never occur in URLs

Please don't blacklist the Runic alphabet, as proposed by your list!

In general, I'm against blacklisting by alphabet, because this introduces issues
of favoritism.  There are homograph attacks in almost every alphabet, especially
considering that many of them have common origins (Phoenician, etc.).

A selfish reason: I put up a site just for fun, (Thurisaz).com: xn--9ve.com

There's still users of the Runic alphabet out there (mostly scholarly/religious,
as with other "obsolete" alphabets).  Google "futhark".

Proposed solutions that are more inclusive (and have already been discussed in
this bug):

* For any characters that are not traditional domain name characters (A-Z, 0-9,
hyphen), loudly mark them: perhaps with a red background behind them.  Bounds on
min/max kerning distance would ensure a spammer couldn't sneak invisible
characters in there.
* Pop up a warning box that shows the user-visible domain name, and also the
encoded (xn--) domain name that it maps to.  Ask the user if they truly want to
go to the xn-- site.
* Any other solution, except a blacklist that imposes an outright ban on entire
alphabets....
(In reply to comment #185)

Re: #1, No, it is more than that.  Consider that perhaps the .de domain does IDN
well, checking for homographs in registered domains.  Mozilla then whitelists
IDNs for .de domains, considering that the TLD should be safe.  Registrars
however have no clue about third-level domains, and anyone running a DNS server
(or doing cache poisoning attacks) can create any third-level domain (and lower)
that they want.  So, although the .de TLD checks all the xxxxx.de domains to not
have homograph attacks, there is NO guarantee about homograph attacks further
down the chain, in perhaps yyIDNyy.xxxxx.de .

Re: #2, When I registered the domain, it said the language assignment was
French... but it doesn't appear in the published whois record.
On http://4t2.cc/mozilla/idn/ I have a small extension for Firefox that warns on
an IDN and shows the corresponding punycode. Could this be a way to prevent IDN
phishing?
(In reply to comment #190)
> On http://4t2.cc/mozilla/idn/ I have a small extension for Firefox that warns on
> an IDN and shows the corresponding punycode. Could this be a way to prevent
> IDN phishing?

Certainly, that interface is much like what I had in mind for my suggestion in
comment 145.  However, the message needs to be greatly improved.  A typical user
won't have a clue about IDN or punycode.  How about something more user friendly:

Warning: www.paypal.com contains some characters from international alphabets.
Some international characters look very similar to, or the same as, each other,
which may be used to spoof web site addresses. _More information_

I'm sure it can be improved a lot and the more information link/button could
reveal detailed information, much like Paul Hoffman suggested [1] as well as
providing more user friendly explanations.

[1] http://lookit.proper.com/
(in reply to comment 133)

If a domain name is a Russian word written entirely in homographically
equivalent Latin letters instead of the original Cyrillic letters, it does not
have to be a spoof. Please note: long before IDN was first presented, some
Russian sites already had domain names which contained only Latin letters
homographically equivalent to Cyrillic -- this was a pre-IDN hack to include
Russian words in ASCII domain names.

The Latin homographs are: A/a, B/b, C/c, E/e, H, K, M/m, n, O/o, P/p, T, u, X/x, y

Their respective Cyrillic equvalents (named according to
http://www.unicode.org/charts/PDF/U0400.pdf chart): A (capital/small), VE
(capital)/SOFT SIGN, ES (capital/small), IE (capital/small), EN (capital), KA
(capital), EM (capital)/TE (small, cursive variation), PE (small, cursive
variation), O (capital/small), ER (capital/small), TE (capital), I (small,
cursive variation), HA (capital/small), U (small)

Please also note that Cyrillic letter ZE is pretty much homographical to DIGIT
THREE, and BE (small) is more or less homographical to DIGIT SIX.

Cyrillic letter YERU is homographical to "bI" or "bl" (two symbols together).

So we have more than half of the alphabet -- if you carefully avoid Russian
letters GHE, DE, IO, ZHE, SHORT I, EL, EF, TSE, CHE, SHA, SHCHA, HARD SIGN, E,
YU, and YA, then you may write Russian words with Latin letters (either capital
or small).

There are 33 letters in Russian alphabet (see
http://learningrussian.com/alphabet.htm for details). Only 15 don't have
homographs in Latin or digits. Please note also: it is allowed by Russian rules
to use IE letter ('e' letter, don't think of MSIE) instead of IO letter in most
words. So, effectively, only 14 Russian letters cannot be presented
homographically in pure ASCII.



The Russian Alphabet (Unicode names --> ASCII homographs, if exist):

A   --> A/a
BE  --> 6
VE  --> B
GHE
DE
IE  --> E/e
IO  --> E/e (not allowed in some words)
ZHE
ZE  --> 3
I   --> u
SHORT I
KA  --> K
EL
EM  --> M
EN  --> H
O   --> O/o
PE  --> n
ER  --> P/p
ES  --> C/c
TE  --> T/m
U   --> y
EF
HA  --> X/x
TSE
CHE
SHA
SHCHA
HARD SIGN
YERU --> bI/bl
SOFT SIGN -> b
E
YU
YA



Some existing (registered and working) domain names using this technique (some I
knew of, some I've found just now by combining the letters enumerated above into
valid Russian words -- these homographs are also used to represent the Russian
words below in round brackets, because Bugzilla does not support Unicode yet,
AFAIK):

http://www.XAKEP.ru/ (Russian word 'XAKEP' means 'hacker')
http://www.PEKA.ru/ (Russian word 'PEKA' means 'river')
http://CTEHA.ru/ (Russian word 'CTEHA' means 'wall')
http://www.CblP.ru/ (Russian word 'CblP' means 'cheese')
http://www.TEMA.ru/ (Russian word 'TEMA' means 'theme'; and, back-replacing
IE-->IO, we get a variant of the name of this domain's owner)
http://ABTO.ru/ (Russian word 'ABTO' means 'auto')
http://3ByK.ru/ (Russian word '3ByK' means 'sound')
http://KOCMOHABT.ru/ (Russian word 'KOCMOHABT' is equivalent to 'astronaut')
http://MATPAC.ru/ (Russian word 'MATPAC' means 'mattress')
http://MEXA.ru/ (Russian word 'MEXA' means 'furs' -- yes, plural; singular form
http://MEX.ru/ is cybersquatted)
http://MPAMOP.ru/ (Russian word 'MPAMOP' means 'marble')
http://OXPAHA.ru/ (Russian word 'OXPAHA' means 'guard' or process of guarding)

And some not so useful (registered for sale or otherwise cybersquatted) domains:

http://www.BAHHA.ru/ (Russian word 'BAHHA' means 'bath')
http://CyKA.ru/ (Russian word 'CyKA' means 'bitch')
http://MOCKBA.ru/ (Russian word 'MOCKBA' means city of Moscow)
http://EBPO.ru/ (Russian word 'EBPO' means 'euro')
http://KOCMOC.ru/ (Russian word 'KOCMOC' means 'outer space')
http://KPACKA.ru/ (Russian word 'KPACKA' means 'paint, dye, colour')



This is a proof for two more or less separate ideas:

1) Full homography of a domain name can be a legacy of pre-IDN times, a basis
for someone's ethical and legal business, which must not be ruined.

2) We should not stop at considering only symbol-to-symbol homography; two
adjacent symbols of one alphabet may happen to look like a single glyph of
another alphabet.



The hunt for Russian domain names written in pure ASCII will continue.
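(A sketch of the symbol-level folding Sergey describes, in Python; the table
is only a partial excerpt of his list. Folding both names to a common
"skeleton" flags a homograph pair, but note his point 1): the same fold maps
legitimate pre-IDN names like CTEHA.ru onto themselves, so a match cannot by
itself mean "block". His point 2) -- multi-character homographs such as "bI"
for YERU -- needs sequence-level rules that a per-character table cannot
express:)

CYR_TO_LAT = {
    "\u0430": "a", "\u0435": "e", "\u043e": "o", "\u0440": "p",
    "\u0441": "c", "\u0443": "y", "\u0445": "x", "\u044c": "b",
}

def skeleton(name):
    return "".join(CYR_TO_LAT.get(ch, ch) for ch in name.lower())

# skeleton("p\u0430yp\u0430l") == "paypal" -> potential homograph pair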
Sergey: that's very useful - thanks :-)

Everyone else: I'm currently up to my eyeballs in IDN lists and blog posts and
emails. I'm trying to get on top of what everyone is saying this weekend, and
see what emerges.
> Russian words below in round brackets, because Bugzilla does not support 
> Unicode yet, AFAIK):

Well, all you have to do is set 'View | Character Encoding' to UTF-8 before
posting any comment with non-ASCII characters and do the same when viewing any
comment posted in UTF-8. We'd not have NCRs as in comment #133. Please,
everybody, set 'Character Encoding' to UTF-8 before *posting* comments with
non-ASCII characters here and in other bugs at bugzilla.mozilla.org. (Be aware
that changing 'character encoding' resets the content of a textarea - you would
lose everything you've written there, so before changing 'character encoding'
make sure to copy it to the clipboard or elsewhere.)


> http://www.XAKEP.ru/ (Russian word 'XAKEP' means 'hacker')

  'ХАКЕР' in Cyrillic 

One idea (as a part of *multiple* lines of defense): we may render characters
belonging to the 'minority' scripts of a given domain component in a
conspicuous color (and/or font) different from the color used to render
characters in the 'majority' script (the script with the largest count in a
given domain component). For 'pаypаl', where 'а' is Cyrillic, Cyrillic would
be the minority script while Latin would be the majority script.
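(A sketch of this minority-script idea in Python, again using Unicode
character names as a stand-in for real script properties:)

import unicodedata
from collections import Counter

def minority_positions(label):
    # Script of each character, then the positions of every character
    # outside the most common ("majority") script.
    scripts = [unicodedata.name(ch, "?").split()[0] for ch in label]
    majority = Counter(scripts).most_common(1)[0][0]
    return [i for i, s in enumerate(scripts) if s != majority]

# minority_positions("p\u0430yp\u0430l") -> [1, 4]: the two Cyrillic a's,
# which the UI could render in a conspicuous colour or font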
Thank you, Jungshik Shin, this helped.

Ok, now 19 more domains I've found last night (use UTF-8 to view Russian words
below).

Website: http://PEMOHT.ru/
Russian word: ремонт
Translation: 'repair' (noun)
Status: cybersquatted

Website: http://caxap.ru/
Russian word: сахар
Translation: 'sugar' (noun)
Status: used by sugar traders

Website: http://COK.ru/
Russian word: сок
Translation: 'juice' (noun)
Status: list of winners of some lottery drawing among apple juice customers

Website: http://COCKA.ru/
Russian word: соска
Translation: 'comforter, dummy teat'
Status: pornocybersquat

Website: http://coyc.ru/
Russian word: соус
Translation: 'sauce, gravy'
Status: sauce recipe list, FAQ, etc.

Domain: cyxapu.ru
Russian word: сухари
Translation: 'rusks, pieces of dried bread' (plural)
Status: DNS works, but no route to host

Website: http://www.MAKCu.ru/
Russian word: МАКСИ
Translation: this is a trademark that has no direct meaning and translation; it
is most likely derived from the word 'максимум', which means 'maximum' or 'at most'
Status: some cellphone-related business and FAQ

Website: http://yxo.ru/
Russian word: ухо
Translation: 'ear'
Status: webmail provider, hosting provider

Website: http://yKcyc.ru/
Russian word: уксус
Translation: 'vinegar'
Status: cybersquatted by international drug dealers

Website: http://xop.ru/
Russian word: хороший
Translation: 'good' or 'fine' (there's a kind of pun in this domain name:
Russian word 'хор' means 'chorus')
Status: furniture shop

Website: http://XPyCT.ru/
Russian word: хруст
Translation: 'crunch' (noun)
Status: website temporarily closed (probably it exceeded its bandwidth or
other hosting limit)

Website: http://KAPTA.ru/
Russian word: карта
Translation: 'map' or 'card'
Status: communication service card dealer

Website: http://KOBEP.ru/
Russian word: ковёр
Translation: 'carpet' or 'rug'
Status: cybersquatted

Website: http://MAPKA.ru/
Russian word: марка
Translation: '(postage-)stamp' or 'trade-mark' or 'brand'
Status: philatelic activity

Domain: HAyKA.ru
Russian word: наука
Translation: 'science' (noun)
Status: DNS works, but no route to host

Website: http://npoKaT.ru/
Russian word: прокат
Translation: 'hire' (noun)
Status: merchandise for hire

Website: http://PECTOPAH.ru/
Russian word: ресторан
Translation: 'restaurant'
Status: internet shop selling goods and services somehow related to restaurants

Website: http://CTAHOK.ru/
Russian word: станок
Translation: (noun) 'machine-tool' or 'lathe' or 'printing-press'
Status: somehow related to machine-building or machine works; not yet open

Website: http://TypucT.ru/
Russian word: турист
Translation: 'tourist' (noun)
Status: site is under construction
setting bug 237820 as a blocked meta tracker
Blocks: IDN
First let me give a quick reminder of the difference between registries
and registrars:  Each top-level domain has exactly one registry, who
maintains the list of all second-level domains therein.  A TLD may have
many registrars, who interface between the registry and the registrants
(customers).  It's the registry who sets and enforces the policies
regarding which names are allowed; the registrars have no control over
that.  So let's stop picking on the registrars.  :)

A good solution to the problem of homograph attacks is going to take
weeks or months (or longer) to develop.  Therefore it would be good to
immediately deploy something very simple to reduce the severity of the
problem.  I suggest:

1) Have a user-configurable set of TLDs for which domain names show
in ASCII form instead of readable form.  If the browser is following
the IDNA spec then it's already calling ToUnicode() before it ever
displays any domain name; therefore a simple hook or wrapper could be
used to make it call ToASCII() instead for certain TLDs.  The user could
choose whether to use a blacklist (show these TLDs in ASCII form) or
a whitelist (show all TLDs except these in ASCII form).  I think the
default should probably be to just blacklist .net and .com, because I
think those are the only target-rich TLDs whose registries admit IDNs
indiscriminately.  There might be other indiscriminate TLDs (.nu?), but
how many people have important trust relationships with sites in those
TLDs that phishers would be interested in?  In particular, I haven't
heard of problems with .org, which is not managed by Verisign.

2) Make it easy to switch a global setting between "always ASCII",
"always readable", and "use the TLD list".

This is obviously nowhere near a complete solution, but I think it
would improve the situation significantly, and it is very simple--there
is no fancy UI with colors and fonts to design, no character table to
design, and the code changes can be narrowly focused (in theory, but
I'm not familiar with the code).  Importantly, this measure would avoid
penalizing the communities centered around sites in responsible TLDs.
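
A minimal sketch of the TLD-list decision described above (the function name,
the example list, and the wiring are assumptions for illustration, not
existing Mozilla code):

  #include <iostream>
  #include <set>
  #include <string>

  // Decide, per TLD, whether a domain name may be displayed in readable
  // (Unicode) form. With a blacklist, listed TLDs are forced to ASCII/ACE
  // form; with a whitelist, only listed TLDs are shown readable.
  bool ShowAsUnicode(const std::string& host,
                     const std::set<std::string>& tlds,
                     bool listIsBlacklist) {
    std::string tld = host.substr(host.rfind('.') + 1);
    bool listed = tlds.count(tld) != 0;
    return listIsBlacklist ? !listed : listed;
  }

  int main() {
    std::set<std::string> blacklist = {"com", "net"};  // suggested default
    // When this returns false, the browser would display the ToASCII()
    // form instead of the ToUnicode() form.
    std::cout << ShowAsUnicode("www.example.com", blacklist, true)    // 0
              << ShowAsUnicode("www.example.org", blacklist, true);   // 1
  }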

Also, I'm surprised that comment #82 got no responses:

> IMO the proper fix for the ssl case (https://paypal.com) is to remove
> the UserTrust network certificate from the store.  Obviously they are
> not doing their job and therefore they shouldn't be trusted.

I don't really understand the SSL trust model, but this sounds like an
interesting idea.  What exactly is UserTrust's job?  What exactly does
the certificate they issued to the bogus paypal supposedly assert?

Switching topics, I'd like to say something about Nameprep, since people
have mentioned using it for various purposes that aren't clear to me.
Nameprep is intended as a generalization of tolower, which converts
uppercase ASCII letters to the corresponding lowercase ASCII letters,
and leaves other ASCII characters unchanged.  For ASCII domain names,
there are certain situations where it is appropriate to call tolower.
For IDNs, Nameprep plays the analogous role.  In fact, Nameprep behaves
exactly like tolower when its input is ASCII, so you can simply replace
tolower with Nameprep for all domain names.

The important point here is that Nameprep is appropriate *only* in
situations where tolower was already appropriate for ASCII domain names.
If you're in a situation where you wouldn't want to apply tolower to an
ASCII domain name, then you shouldn't be applying Nameprep to an IDN
either.

Usually tolower is not applied to domain names for display purposes; it
is used internally for doing case-insensitive comparisons.  Comparison
of IDNs is done using ToASCII followed by tolower, and ToASCII uses
Nameprep internally.
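
To make the tolower analogy concrete, here is a sketch of the ASCII case -- a
case-insensitive domain comparison, which is exactly the role Nameprep
generalizes for IDNs (hypothetical helper, not browser code):

  #include <algorithm>
  #include <cctype>
  #include <iostream>
  #include <string>

  // Case-insensitive comparison of ASCII domain names; Nameprep reduces to
  // exactly this when its input is ASCII.
  bool SameDomain(std::string a, std::string b) {
    auto lower = [](std::string& s) {
      std::transform(s.begin(), s.end(), s.begin(), [](unsigned char c) {
        return static_cast<char>(std::tolower(c));
      });
    };
    lower(a);
    lower(b);
    return a == b;
  }

  int main() {
    std::cout << SameDomain("www.CS.Berkeley.EDU", "www.cs.berkeley.edu");  // 1
  }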

Firefox seems to use tolower or Nameprep for display:  When I type
http://www.CS.Berkeley.EDU/ into the location bar, it gets changed to
http://www.cs.berkeley.edu/.  I'm not sure that's consistent with the
spirit of the domain name specs.  If DNS servers and resolvers are
required to preserve case when possible, why is my browser altering it?
I suppose there might be a good reason, but I find this surprising (even
though I generally think domain names look better in all lower case).
"It's the registry who sets and enforces the policies
regarding which names are allowed; the registrars have no control over
that.  So let's stop picking on the registrars."

That's an oversimplification though. It is the registrars that apply (or fail to
apply) the policies in the first instance. The registries then enforce (or fail
to enforce) their policies with registrars that are not implementing them properly.

If the registry is failing to enforce good policy, then the position may well be
that some registrars are better than others. Even if the registry is doing the
enforcing, then there will be a lag between registrars allowing bad
registrations and the registry getting the registrar to correct the problem.  So
while it's no good picking on registrars exclusively, they do have a role to play.
Commercial registrars generally do not enforce policies, and simply rely on the
registry to perform the necessary checks. And rightfully so, since things can
get pretty hairy when it comes to IDN. After all, it is the registry's
responsibility to ensure that no rogue names exist in its database.

(In reply to comment #195)
> Website: http://PEMOHT.ru/

Sergey, thank you for all this info and good work! I would just like to point
out one thing regarding IDNs. They use a spec called stringprep that includes
lowercasing, so it may be more interesting for you to find existing Russian
domain names in ASCII that only contain lowercase. It is impossible to
register IDNs with uppercase Cyrillic (unless the registry is breaking the
IDN rules).
(In reply to comment #191)
> 
> Warning: www.paypal.com contains some characters from international alphabets. 
> Some international characters look very similar or the same as each other which
> may be used to spoof web site addresses. _More information_

Thanks for your feedback. In addition to the (extended) popup message and the
changed statusbar icon, I have also added an icon in the location bar.

http://4t2.cc/mozilla/idn/
(In reply to comment #197)
> Firefox seems to use tolower or Nameprep for display:  When I type
> http://www.CS.Berkeley.EDU/ into the location bar, it gets changed to
> http://www.cs.berkeley.edu/.  I'm not sure that's consistent with the
> spirit of the domain name specs.  If DNS servers and resolvers are
> required to preserve case when possible, why is my browser altering it?
> I suppose there might be a good reason, but I find this surprising (even
> though I generally think domain names look better in all lower case).

I don't really know why Firefox lowercases the ASCIIs. It might just have
fallen out of the IDN work. It may be against DNS conventions too. However,
one advantage is that you can more easily spot the difference between
capital I and lowercase l if you lowercase the name.
*** Bug 283013 has been marked as a duplicate of this bug. ***
Depends on: 283016
Ok, so rip this comment to shreds if you want (I can take it) -- or maybe it's
right -- or maybe it will spark a better thought from someone else -- but it's
worth thinking about.

I thought of it as I was waking up this morning; it seems simple (which, if it's
correct, would be nice), but it could be oversimplifying.

What I'm thinking is:
* Don't we just have to worry about MIXED character encodings?
* Can you spoof the string "paypal" (for example) without mixed encodings?
* What COULD you spoof without mixed encodings? (the Russian, maybe?)
> 
> I thought of it as I was waking up this morning; it seems simple (which, if it's
> correct, would be nice), but it could be oversimplifying.
> 
> What I'm thinking is:
> * Don't we just have to worry about MIXED character encodings?
> * Can you spoof the string "paypal" (for example) without mixed encodings?
> * What COULD you spoof without mixed encodings? (the Russian, maybe?)

Oh, well, off the top of my head:

asap, ascii, arab, arabia, arabic, arabs, archie, aries, asia, bach, ceo, cpu,
cpus, cray, crays, europe, ieee, jr, ok, os, ohio, pc, pcs, popek, popeks, rcs,
rsx, rick, roy, sccs, sr, usc, xeroxes, york, yorker, yorkers, yorks, aback,
abase, abaser, abases, abash, abashes, abbe, abbey, abbeys, abhor, abhorrer,
abhors, abjure, abjurer, abjures, abscess, abscesses, abscissa, abscissas,
absorb, absorber, absorbs, abuse, abuser, abusers, abuses, abyss, abysses,
acacia, access, accesses, accessories, accessory, accrue, accrues, accuracies,
accuracy, accuse, accuser, accusers, accuses, ace, acer, aces, ache, aches,
acre, acres, across, aerobic, aerobics, aerospace, ah, air, airer, airers,
airier, airs, airship, airships, airspace, airy, ajar, apace, ape, aper, apes,
apex, apexes, aphasia, aphasic, apiaries, apiary, apiece, apish, apocrypha,
appear, appearer, appearers, appears, appease, appeaser, appeases, appraise,
appraiser, appraisers, appraises, apprise, appriser, apprisers, apprises,
approach, approacher, approachers, approaches, apropos, apse, apses, apsis, arc,
arch, archaic, archbishop, archer, archers, archery, arches, arcs, are, area,
areas, ares, arise, ariser, arises, ark, arose, arouse, arouses, arrack, array,
arrayer, arrays, arrears, arroyo, arroyos, as, ascribe, ascribes, ash, asher,
ashes, ashore, ask, asker, askers, asks, asp, asper, asphyxia, aspic, aspire,
aspirer, aspires, ass, assay, assayer, assayers, asses, assess, assesses,
assessor, assessors, assure, assurer, assurers, assures, aura, auras, aurora,
auspice, auspices, auspicious, ax, axe, axer, axers, axes, axis, aye, ayer,
ayers, ayes, babe, babes, babies, baby, babyish, back, backache, backaches,
backer, backers, backpack, backpacker, backpackers, backpacks, backs, backspace, ...
(In reply to comment #205)

Or, with a tighter definition of "homograph", using only the really good ones:

asap, ascii, asia, ceo, ieee, os, pc, pcs, sccs, acacia, access, accesses, ace,
aces, apace, ape, apes, apex, apexes, apiece, appease, appeases, apse, apses,
apsis, as, asp, aspic, ass, assay, asses, assess, assesses, ax, axe, axes, axis,
aye, ayes, cap, cape, capes, caps, case, cases, cease, ceases, coax, coaxes,
cocoa, coo, coop, coops, cop, cope, copes, copies, cops, copse, copses, copy,
ease, eases, easy, epic, epics, escape, escapee, escapees, escapes, espies,
espy, essay, essays, excess, excesses, excise, excises, expose, exposes, eye,
eyepiece, eyepieces, eyes, ice, ices, icy, is, ix, jay, jeep, jeeps, joy, joys,
oasis, oops, oppose, opposes, ox, pa, pace, paces, papa, pas, pass, passe,
passes, pay, pays, pea, peace, peaces, peas, peep, peeps, pep, pi, pie, piece,
pieces, pies, pipe, pipes, ****, ****, poise, poises, pop, pope, popes,
poppies, poppy, pops, pose, poses, possess, possesses, pox, poxes, sap, saps,
say, says, ...

(In reply to comment #206)

And going the other way, to Russian, I can do:

ага, гарь, гор, гора, горах, горгор, горе, гору, грех, его, орех, ореха, рас,
раса, расе, рог, рога, рогах, рос, роса, росе, сер, сера, серо, серого, серое,
ссора, ссоре, ссору, сух, сухо, сухого, сухое, угас, ура, уха, ухо, уху, хаосе,
хор, царь, ...
(in reply to comment 200)

Ok, Erik, here they are (use UTF-8 to read Russian):

http://caxap.ru/ -- сахар
http://coyc.ru/  -- соус
cyxapu.ru        -- сухари (the last letter is homographic in some fonts)
http://yxo.ru/   -- ухо
http://xop.ru/   -- хор
http://nana.ru/  -- папа (homographic in some fonts)



(in reply to comment 204)

Yes, Zachariah, you can easily spoof a Russian IDN without mixing encodings --
see the above reply to comment 200.



(retyping comment 207 in UTF-8)

> ага, гарь, гор, гора, горах, горгор, горе, гору, грех, его,
> орех, ореха, рас, раса, расе, рог, рога, рогах, рос, роса,
> росе, сер, сера, серо, серого, серое, ссора, ссоре, ссору,
> сух, сухо, сухого, сухое, угас, ура, уха, ухо, уху, хаосе,
> хор, царь

Interesting. How do you spoof "царь" without mixing encodings?
Once again, to make it clearer for those who did not read the whole list of
bug comments (which is large already) -- the six existing domains enumerated
above are not spoofs: they reflect the old (pre-IDN) way of registering Russian
domain names, using the Latin alphabet instead of the homographically equivalent
Russian letters. They were useful before IDN and should remain useful, and not
be broken, after any anti-spoofing measure is implemented. Comment 194, with the
idea of majority/minority scripts, seems to be the right way of avoiding harm to
the existing pre-IDN homographs.
On the Unicode mailing list, Rick McGowan <rick@unicode.org> has announced that
a new revision of Draft UTR #36: Security Considerations for the Implementation
of Unicode and Related Technology, is now available at

        http://www.unicode.org/reports/tr36/tr36-2.html

and that comments for official consideration can be made at 

        http://www.unicode.org/reporting.html

The review period closes on May 3, 2005. 
The gTLD registries, several ccTLD registries, and ICANN have posted statements
about IDN abuse that are listed on a resource page that ICANN has just started:

          http://www.icann.org/topics/idn.html

They are also opening a new discussion forum which has a potential advantage
over all the others by being immediately visible to ICANN.
(In reply to comment #209)

Sergey, does .ru currently allow IDN registration? If so, what rules are there?
If not, are they thinking about IDN, and if so, what kind of rules? Thanks!
Unfortunately, I could not find a definitive answer. According to some docs
-- http://info.nic.ru/st/10/out_863.shtml for one -- the question is still
under discussion. However, if I go directly to https://www.nic.ru/dns/ and
enter something like xn--80aswg.ru to register in .Ru, the first three steps are
OK. (I did not finish the process, because I'm not going to spend $20 for the
proof of possibility.)
(translating the above to UTF-8)

http://президент.ru

http://кремль.ru

These are websites registered for the President of Russia; they may be a
technical exception.
(In reply to comment #212)

The text entry window in Bugzilla echoes the Cyrillic characters as they should
appear, but the posted comment uses numeric character references. (Bugzilla
bug?). The latter form is, in fact, the only one of the two that is legal in a
URL. Regardless of the promise that IRI has for remedying this, it still
highlights the need for an LDH format for communicating scripts across cultural
boundaries beyond which they are unlikely to be recognized. Punycode and NCR are
obvious candidates for this role, as far as appearance in a URL goes, and we can
debate which is the uglier. When we get around to printing IDN e-mail addresses
on business cards, the parallel communication of Punycode may prove a necessary
adjunct, with no competition in the aesthetics department.
(In reply to comment #216)

See comment #194 from Jungshik Shin. It is not a Bugzilla bug directly. However,
Bugzilla should add charset=UTF-8 to its HTTP Content-Type response header...
(In reply to comment #217)

I just filed bug 285255 to try to get bugzilla.mozilla.org to announce its
charset as UTF-8.
Erik, Sergey: bug 126266 explains why b.m.o. can't just set the charset to UTF-8.

Gerv
For the record, the new bug form on b.m.o is currently hacked to force UTF-8. 
We can't do that on show bug because of legacy data problems on existing bugs,
so if someone adds the first comment containing non-ascii characters at some
point later than the opening of a new bug it's going to be whatever charset
their browser used.  Please read bug 126266 before making any "but you can just
do ******" comments, and please make any such comments on that bug if you come
up with something that hasn't already been suggested and shot down there already :)
Dear all,

I just thought I'd try to summarize a number of the threads in this discussion
in one place; please bear with me if I'm repeating the obvious in some places.
I've broken the discussion into two parts: "global issues" and "threat analysis
and possible solutions".

== Global issues ==

1. The homograph problem is in the eye and brain of the user, and is therefore
necessarily a fuzzy and subjective problem.

2. Because of the above, we can therefore only _approximately_ solve this
problem.  However, that approximation can be very good indeed, and there's no
reason why we should not aim for near-perfection in a solution. _We should think
in terms of probabilities as engineering targets_.

3. Many parties are involved in this, and every one of them will have to
contribute to the solution. They each have different constituencies, policies,
interests, and technical constraints. Fortunately, the problem is also
multi-dimensional, and its solution can be sliced up in such a way that each
group can contribute something to the mix. Although none of these sub-solutions
can be perfect (see above), they can together provide multiple opportunities for
catching homographs, allowing a very high probability of the overall solution
working for any given TLD label.

4. Punycode display eliminates the homograph problem for many purposes, but also
defeats the usefulness of IDN at the same time: still, at least it does not
break links to IDN websites. It's the least-worst fix until we can do something
better. It may also have long-term dangers when IDNs become widespread (see below).

5. The homograph problem is a _combinatorial_ problem. Increasing the size of
the character set from 37 to 40,000 has caused a disproportionate exponential
explosion in the number of possible homograph combinations. Applying
restrictions in a number of intersecting ways will enable us to exponentially
_implode_ those possibilities again.

6. The consensus appears to be that only top and second-level labels matter: top
labels are not currently a problem (but they may be when "full" IDN arrives).
Users are by now well-accustomed to interpreting second-level labels as
identifying a commercial or other entity.

7. The above is good, because it means that we can make everything hinge on the
TLD registries as trust brokers. Doing things on a per-TLD basis allows the
registry part of the solution to scale horizontally, so registries with
effective policies can be unblocked ASAP. It also deals effectively with the
case of non-compliant registries, and market pressure (non-IE market share
heading for 10% and beyond) will do the rest.

8. No-one is talking about the timescale for a fix, or what the definition of "a
fix" would be. What is the expected timescale: a month, three months, six
months, a year, five years? Again, slicing the problem up will allow multiple
bodies to move forwards on multiple tracks, and the browser vendors can act as
gatekeepers for their users to decide what is "good enough".

== Threat analysis and possible solutions ==

Here are the major threats:

* Writing-system-mixing homographs: for example, Cyrillic 'a' in Latin 'paypal'.
 Partial solution:  make sure that individual domain names are allocated from
character sets without internal homographs. [Only needs internal inspection of
each sub-character-set for homographs, so vastly less work than checking the
whole Unicode set for homographs]. At the moment, the ICANN rules justify this
on the basis of language assignments, but it's really about forbidding
unnecessary script mixing. (Note that I say "writing system" here: a single
writing system can use several scripts: for example, Japanese uses four scripts,
but they are not mutually confusable).

* Non-writing-system-mixing homographs: for example, Cyrillic 'assay.tld' vs.
Latin 'assay.tld'. These are less easy to forge, as the structure of languages
provides some entropy that makes collisions less likely than with cross-script
attacks. However, they still exist, and we cannot rely on users to select "safe"
names. Partial solution: bundling at the registry. [Needs a global homograph
list, but is fairly tolerant of error in this list; for example, the above
contains homographs for 'a', 's', and 'y', three characters. If we had a
homograph list that was 95% accurate, we'd have a probability of 1-0.05**3 =
0.99987 of catching this. A list with 98% accuracy would have a 0.99999
probability of catching it. Clearly the rule here would be: if you have a high
value domain, make sure it has lots of different characters in it].

Note that we should distinguish 'blocking' bundling, where registration of new
homographs is blocked to anyone but the registrant of the 'root' name, from
'permissive' bundling, where all the homographs actually resolve to the same
place as the root name. In the case of 'grandfathered' names, we would need some
procedures to resolve conflicts: perhaps where two root names exist in a
homograph tree, neither of the registrants should be allowed to register new
names, or the first registrant should prevail?

Note also that bundling can also mop up the remaining within-writing-system
homographs, if, for example, a new exploit was later found (for example, on the
lines of "rn" for "m", something simple homograph tables could not catch).

[An aside: even if a super-paranoid browser could have the full
homograph-risk-detection algorithm built in, it would not solve the problem of
non-writing-system-mixing homographs, because it could not resolve which was the
"real" name, and which the "fake" name.]

* Attacks on protocol characters: fake slash, dot, hash, percent characters and
so on. This is a severe risk, that allows forgery of TLDs and other evil attacks
that subvert some of the other solutions above. Partial solution: make these
characters illegal at both the browser and registry end. (Belt and braces). How
can we know we've got them all? Someone's got to check really seriously through
the entire Unicode character set. However, it's only a book-length volume, with
most of it being CJK characters; you could do it in a few days, particularly if
you could a priori ignore many character ranges (see below). Caveat: what if
someone's language actually _requires_ a character that looks like a protocol
character: what do we do then? (This is where intersection with per-label
character set restriction may help).

* In general: any restrictions we can make on character repertoires, either by
conservative whitelisting or aggressive blacklisting (preferably both) reduce
the combinatorial possibilities for homograph attacks by many orders of
magnitude, as well as making the generation of accurate homograph lists much easier.

In particular, there are wide ranges of characters which exist only for
round-trip compatibility reasons with old character sets, such as the Videotex
characters, box graphics, dingbats, and presentation forms for various
alphabets. There is no reason why we should support these. Perhaps the Unicode
people can give us an official list of "deprecated" code points?

* Chinese characters are a special case, because of the tens of thousands of
characters in the CJK repertoire, as well as cultural concerns, such as
traditional/simplified and Japanese/Chinese versions of the same characters.
This is a _huge_ problem requiring scholarly expertise in oriental languages
that the people in this discussion do not have. There are groups working on
solving this: let's let them get on with solving it: their current solution
seems to revolve around bundling. Fortunately, CJK characters look so different
from other scripts that this should not stop us from attacking the homograph
problem for alphabetic scripts and syllabaries.

* Note that Punycode itself can be an attack vector: if users who really need to
access IDN sites become used to clicking on "xn--ASCII NONSENSE.tld", and don't
bother to understand or remember the ASCII nonsense, there is a chance that they
may be fooled into visiting "xn--OTHER ASCII NONSENSE.tld" at a later date:
particularly if they do not read the Latin alphabet as their native script. (For
example, to me, Thai script just looks like squiggles; it's entirely possible
that to many Thai people, Latin script may also look like squiggles). For this
reason, it makes sense to get the registries to sort their end as soon as
possible: Punycode is an excellent mitigation technique that works best for
Latin script readers in a world where > 99.9% of all domains are currently ASCII
LDH-only, but it is not a panacea for the long run, when I expect that at the
very least 50% of all domains will be IDNs.
>
> 2. Because of the above, we can therefore only _approximately_ solve this
problem.  However, that approximation can be very good indeed, and there's no
reason why we should not aim for near-perfection in a solution. _We should think
in terms of probabilities as engineering targets_.
>
> 3. Many parties are involved in this, and every one of them will have to
contribute to the solution. They each have different consituencies, policies,
interests, and technical constraints. Fortunately, the problem is also
multi-dimensional, and its solution can be sliced up in such a way that each
group can contribute something to the mix. Although none of these sub-solutions
can be perfect (see above), they can together provide multiple opportunities for
catching homographs, allowing a very high probability of the overall solution
working for any given TLD label.


Just to elaborate this point slightly more, this gets around two major
objections to a timely, workable solution:

* it means that no-one can duck out of providing their piece of the solution, on
the basis that someone else should solve the problem "perfectly" at their end;
since a layered solution is required, everyone must contribute to make the
overall reliability of the system as high as possible.
* it takes the teeth out of objections to other people's solutions on the basis
that they are not perfect, and that no solution can be implemented until it is
perfect. Proposed solutions can still be criticised by comparing them against
proposals for better solutions, but they cannot be stalled by comparing them
against hypothetical (but unspecified) perfect solutions.

My proposed reliability target? A five nines minimum requirement for SLDs with
three distinct letters; this corresponds to a reliability target of > 98% for
the global homograph list. So, out of the 11195 non-Han, non-Hangul characters
in Unicode 3.2, that's a target of no more than 223 missed between-script
homographs. If we can reduce that to (say) no more than 50, then the
three-character reliability estimate is roughly (1-(50/10000)**3) = 0.99999987,
which is almost seven nines.

Of course, Chinese is another matter entirely, but I believe that substantial
efforts are being devoted to provide a reliable solution for Chinese characters.

OK, you're probably getting bored now, but here are some calculations, of the
sort that I hope will cast some more light on the problem.

For different amounts of coverage of the homograph list, assuming perfect
bundling and statistical independence, using an English word list as a source of
statistics for an estimate of the relative probabilities of different numbers of
distinct characters in typical labels, and assuming there are currently 50
million domains registered (Source: http://www.whois.sc/internet-statistics/), I
get the following:

Homograph list reliability  Est. antispoof reliability  Est. vulnerable domains
95%                         99.998806%                  596
98%                         99.999761%                  119
99%                         99.999911%                   45
99.5%                       99.999962%                   19
99.75%                      99.999983%                    9
99.9%                       99.999993%                    3

Note that this takes the ultra-cautious definition that if "homograph list
reliability" is 95%, fully 5% of the remaining characters are uncaught homographs.

Note that at the bottom end, the stats are entirely dominated by domain names
with one and two distinct characters. Make the requirement that SLD labels need
to have at least two distinct characters, and I get:

Homograph list reliability  Est. antispoof reliability  Est. vulnerable domains
95%                         99.999124%                  438
98%                         99.999888%                   56
99%                         99.999974%                   13
99.5%                       99.999994%                    3
99.75%                      99.999998%                    0.76
99.9%                       99.999999%+                   0.12

Again, the stats are dominated by the names with the smallest number of distinct
characters.

Finally, making the requirement that SLD labels have at least _three_ distinct
characters (but are otherwise distributed as normal for English), I get:

Homograph list reliability  Est. antispoof reliability  Est. vulnerable domains
95%                         99.999723%                  139
98%                         99.999984%                    8
99%                         99.999998%                    0.95
99.5%                       99.999999%+                   0.11
99.75%                      99.999999%+                   0.014
99.9%                       99.999999%+                   0.00092
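
For reference, the per-label arithmetic behind these estimates: a label whose
k distinct characters all have uncaught homographs is spoofable with
probability (1-r)^k for homograph-list reliability r (assuming independence,
as above); the table values are higher because they aggregate over the English
word statistics, which are not reproduced here. A sketch of the k = 3 floor:

  #include <cmath>
  #include <cstdio>

  int main() {
    // Catch probability for a label with exactly three distinct
    // characters: 1 - (1 - r)^3.
    const double reliabilities[] = {0.95, 0.98, 0.99, 0.995, 0.9975, 0.999};
    for (double r : reliabilities)
      std::printf("list %7.2f%%  ->  per-label %10.6f%%\n", 100 * r,
                  100 * (1 - std::pow(1 - r, 3)));
  }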

Finally, as an illustration, I list below the numbers of distinct characters in
the top 100 domains, according to alexa.com, sorted by number of distinct
chars, and making reasonable assumptions about which part of the DN is allocated
by the TLD registrar. If I was the BBC, CNN, go.com, goo.com, or qq.com, I'd be
nervously eyeing up the Unicode tables.

Although 1 or 2-distinct character strings appear to be a disproportionately
large part of this list, the threat is not as big as it seems: apart from people
who insist on registering "aaaaaaa.tld" (which tend not to be memorable: was
that six 'a's or seven?), most diversity-poor labels will tend to be the very
short ones. One-letter ones are forbidden by RFC, so there are, for example,
only 52022 possible two and three-letter LDH domains currently available to be
registered, roughly 0.1% of the 50 million currently registered domains. Of
these, only 5402 have fewer than three distinct characters, about 0.01% of the
total registered domains, and only 36 of them will have only one distinct
character. If we assume all of these are registered and that we have only a 95%
accurate homograph list, then the expected number of spoofable domains in this
class will be approximately 36 * 0.05 + (5402-36) * 0.05 * 0.05 = 15 spoofable
domains. Making the homograph list 98% accurate reduces this to a more
comfortable 2.8 domains, and 99.5% accurate would reduce it to an expected 0.31
domains per TLD in this elite set of registrations, mostly consisting of the
risk associated with the single-character-repetition domains such as "aaa.tld".

So, in conclusion, a statistical approach to risk estimation and mitigation can
be very powerful, and (in principle) reduce spoofing risks to very low levels.
Most of the risk is concentrated in labels with low levels of character
diversity, information which may be useful to potential registrants who wish to
avoid spoofing. It would be interesting to perform this kind of analysis on some
real TLD registry data.

[data follows: "high value" targets marked as "***"]

= The top 100 websites, as per Alexa.com, sorted by number of distinct chars in
TLD-registrar-allocated label =

== Few distinct characters, look out! ==
www.qq.com, 1 distinct chars in 'qq'
www.bbc.co.uk, 2 distinct chars in 'bbc'
www.cnn.com, 2 distinct chars in 'cnn'
www.go.com, 2 distinct chars in 'go'
www.goo.ne.jp, 2 distinct chars in 'goo'

== >= 3 distinct characters, lower risk ==
www.126.com, 3 distinct chars in '126'
www.163.com, 3 distinct chars in '163'
www.aol.com, 3 distinct chars in 'aol' ***
www.ask.com, 3 distinct chars in 'ask'
www.avl.com.cn, 3 distinct chars in 'avl'
www.dell.com, 3 distinct chars in 'dell' ***
www.free.fr, 3 distinct chars in 'free'
www.msn.co.jp, 3 distinct chars in 'msn' ***
www.msn.com, 3 distinct chars in 'msn' ***
www.nba.com, 3 distinct chars in 'nba'
www.tom.com, 3 distinct chars in 'tom'
www.uol.com.br, 3 distinct chars in 'uol'
www.21cn.com, 4 distinct chars in '21cn'
www.3721.com, 4 distinct chars in '3721'
www.alibaba.com, 4 distinct chars in 'alibaba'
www.apple.com, 4 distinct chars in 'apple'
www.daum.net, 4 distinct chars in 'daum'
www.ebay.co.uk, 4 distinct chars in 'ebay' ***
www.ebay.com, 4 distinct chars in 'ebay' ***
www.ebay.com.cn, 4 distinct chars in 'ebay' ***
www.ebay.de, 4 distinct chars in 'ebay' ***
www.google.ca, 4 distinct chars in 'google' ***
www.google.co.jp, 4 distinct chars in 'google' ***
www.google.co.uk, 4 distinct chars in 'google' ***
www.google.com, 4 distinct chars in 'google' ***
www.google.de, 4 distinct chars in 'google' ***
www.google.es, 4 distinct chars in 'google' ***
www.google.fr, 4 distinct chars in 'google' ***
www.hkjc.com, 4 distinct chars in 'hkjc' ***
www.imdb.com, 4 distinct chars in 'imdb'
www.myway.com, 4 distinct chars in 'myway'
www.nate.com, 4 distinct chars in 'nate'
www.sina.com, 4 distinct chars in 'sina'
www.sina.com.cn, 4 distinct chars in 'sina'
www.sina.com.hk, 4 distinct chars in 'sina'
www.sohu.com, 4 distinct chars in 'sohu'
www.taobao.com, 4 distinct chars in 'taobao'
www.xanga.com, 4 distinct chars in 'xanga'
www.yahoo.co.jp, 4 distinct chars in 'yahoo' ***
www.yahoo.com, 4 distinct chars in 'yahoo' ***
www.about.com, 5 distinct chars in 'about'
www.aisex.com, 5 distinct chars in 'aisex'
www.allyes.com, 5 distinct chars in 'allyes'
www.amazon.com, 5 distinct chars in 'amazon' ***
www.atnext.com, 5 distinct chars in 'atnext'
www.baidu.com, 5 distinct chars in 'baidu'
www.china.com, 5 distinct chars in 'china'
www.gator.com, 5 distinct chars in 'gator'
www.hinet.net, 5 distinct chars in 'hinet'
www.lycos.com, 5 distinct chars in 'lycos' ***
www.match.com, 5 distinct chars in 'match'
www.naver.com, 5 distinct chars in 'naver'
www.sex141.com, 5 distinct chars in 'sex141'
www.yisou.com, 5 distinct chars in 'yisou'
www.adserver.com, 6 distinct chars in 'adserver'
www.comcast.net, 6 distinct chars in 'comcast'
www.download.com, 6 distinct chars in 'download'
www.hao123.com, 6 distinct chars in 'hao123'
www.hkflash.com, 6 distinct chars in 'hkflash'
www.neopets.com, 6 distinct chars in 'neopets'
www.overture.com, 6 distinct chars in 'overture'
www.passport.net, 6 distinct chars in 'passport'
www.pchome.com.tw, 6 distinct chars in 'pchome'
www.poptang.com, 6 distinct chars in 'poptang'
www.weather.com, 6 distinct chars in 'weather'
www.blogspot.com, 7 distinct chars in 'blogspot'
www.chinaren.com, 7 distinct chars in 'chinaren'
www.infoseek.co.jp, 7 distinct chars in 'infoseek'
www.livedoor.com, 7 distinct chars in 'livedoor'
www.myspace.com, 7 distinct chars in 'myspace'
www.netscape.com, 7 distinct chars in 'netscape'
www.nytimes.com, 7 distinct chars in 'nytimes'
www.pconline.com.cn, 7 distinct chars in 'pconline'
www.rakuten.co.jp, 7 distinct chars in 'rakuten'
www.sayclub.com, 7 distinct chars in 'sayclub'
www.webshots.com, 7 distinct chars in 'webshots'
www.casalemedia.com, 8 distinct chars in 'casalemedia'
www.craigslist.org, 8 distinct chars in 'craigslist'
www.fastclick.com, 8 distinct chars in 'fastclick'
www.friendster.com, 8 distinct chars in 'friendster'
www.mapquest.com, 8 distinct chars in 'mapquest'
www.mediaplex.com, 8 distinct chars in 'mediaplex'
www.microsoft.com, 8 distinct chars in 'microsoft' ***
www.net-offers.net, 8 distinct chars in 'net-offers'
www.xinhuanet.com, 8 distinct chars in 'xinhuanet'
www.coolmanmusic.com, 9 distinct chars in 'coolmanmusic'
www.doubleclick.com, 9 distinct chars in 'doubleclick'
www.netvigator.com, 9 distinct chars in 'netvigator'
www.newsgroup.com.hk, 9 distinct chars in 'newsgroup'
www.offeroptimizer.com, 9 distinct chars in 'offeroptimizer'
www.searchscout.com, 9 distinct chars in 'searchscout'
www.adultfriendfinder.com, 10 distinct chars in 'adultfriendfinder'
www.internet-optimizer.com, 10 distinct chars in 'internet-optimizer'
www.mywebsearch.com, 10 distinct chars in 'mywebsearch'
www.tribalfusion.com, 11 distinct chars in 'tribalfusion' 
Note that the calculations above assume the existence of a homograph table with
a certain _absolute_ uncaught error rate: that is, a 95% reliability means that
_fully 5%_ of the entire Unicode character set are homographs.

* Firstly, the number of characters that are potential homographs is almost
certainly less than 10% of the Unicode repertoire (ignoring CJK). If our actual
error rate is measured relative to a population of 10% of codepoints being
actual homographs, only getting 95% of _real_ homographs will actually mean that
only 0.5% of the Unicode codepoints will be uncaught homographs, corresponding
to a reliability of 99.5% in the charts above.

* Even if a given label consists entirely of homographs, the correct combination
of all of the necessary homographs may not be available in any other single
legal label's character set, because of character set restrictions on those
other labels. Any other combinatorial limit on the number of possible
combinations of characters will have the same effect. This may account for a
substantial improvement in the risk estimates. The calculations to go into this
in detail are somewhat involved, though. Still, nothing a computer algebra
system can't cope with.

* We can easily estimate both the number of homographs, and the reliability of
their classification, by using several different independently-compiled lists,
and performing capture-recapture analysis.

* Chance spoof rates will be lower again, due to the fact that you need _2_
spoof candidates to get a confusion pair, and stuff to do with Poisson
statistics and finite population sizes (p[2 or more spoofable] within a single
spoof-set is less than expectation rate, for values of expectation < 1); this
means that the legacy bundling conflict problem may not be as bad as feared,
even with some errors in the initial homograph list used to construct bundles.

* However, a more sophisticated analysis would take into account some of the
well-known human perception phenomena that may make names more spoofable, such
as a tendency to ignore minor typos when speed reading. We probably need to have
a better-defined criterion for the reasonable minimum perceptible difference
between two labels, probably in terms of the number of unique non-spoofed
characters that need to be present. This would tend to wind the figures back
upwards again.

Would anyone be interested in having a list of possible homographs constructed,
or a more detailed analysis performed?

Now it's time for coffee. 
If people attempt phishing with homograph attacks, they rely on people being
familiar with the name of a website. Of course, a homograph attack that spoofs
an unknown site is possible, but it would make little sense (in that
case, the homograph aspect of the fraud would not actually help the
perpetrators). There might be targeted phishing attacks against specific people
or groups of people, but this is very difficult to do on a large scale. So, for
untargeted phishing attacks that can easily be done with spamming, the
perpetrators have to use widely known websites. Also, the problem of spoofing
does not concern all kinds of websites equally. Therefore, it might make sense
to make a list of the most important potential victims of spoofing (maybe a few
hundred common websites, maybe a few thousand), and the browser should give a
strong warning and not proceed without user action when something looks like a
spoof of such a website.
I think this is not only a temporary solution: even if we have a relatively good
way of treating homographs in general, it would probably make sense to make such
a distinction. There can be legitimate reasons for mixing scripts (e.g.
XML-документы for collections of Russian XML documents) and there can be other
reasons for false alerts. Therefore, the warnings in the general case should not
be too disruptive for people who use these possibly legitimate domains. On the
other hand, if a domain name looks the same as a well-known website, such as
Paypal, the warning and disruption level should be different. I think, in the
end, using a list of major well-known websites will be useful, whatever the
solution for the general problem is.
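
A sketch of the lookup such a warning could use: fold known cross-script
homographs down to a Latin "skeleton" and compare against the list of
high-value sites. The four-entry table is an illustrative fragment, not a real
confusables list, and the check would only be applied to names that contain
non-ASCII characters, so the genuine sites are unaffected:

  #include <iostream>
  #include <map>
  #include <set>
  #include <string>

  // Fold a handful of known Cyrillic-Latin homographs to their Latin twins.
  std::string Skeleton(const std::u32string& host) {
    static const std::map<char32_t, char> kHomograph = {
        {0x0430, 'a'}, {0x0435, 'e'}, {0x043E, 'o'}, {0x0440, 'p'}};
    std::string out;
    for (char32_t c : host) {
      auto it = kHomograph.find(c);
      if (it != kHomograph.end())
        out += it->second;
      else if (c < 0x80)
        out += static_cast<char>(c);
      else
        out += '?';  // unknown non-ASCII stays visibly different
    }
    return out;
  }

  bool LooksLikeHighValueTarget(const std::u32string& host) {
    static const std::set<std::string> kTargets = {"www.paypal.com",
                                                   "www.ebay.com"};
    return kTargets.count(Skeleton(host)) != 0;
  }

  int main() {
    // The spoof with Cyrillic 'а' (U+0430) folds to "www.paypal.com", so
    // the browser could demand explicit confirmation before proceeding.
    std::cout << LooksLikeHighValueTarget(U"www.p\u0430ypal.com");  // 1
  }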

Of course, this presupposes that the homograph characters are known. Homographs
between Latin, Cyrillic, Greek and Coptic are basically known, but beyond that
it gets more difficult -- this is, of course, necessary both for the general
case and for this suggestion of treating spoofs of "major" websites specially.

I think that the practical problem is not as big as it may seem at first sight.
Theoretically, any homography between characters of any character set is a
problem, but practically, only cases where at least one element of the homograph
pair belongs to a widespread writing system really matter. Phishing attacks
targeting small groups of people are not very likely (though, on the other hand,
the TLD of e-mail addresses may in many cases facilitate country- and
language-specific phishing attacks). This still means that people have to go
through the whole book of Unicode characters, but they only have to check
whether a character looks similar to one of their own writing system.

Are there countries where the IDN system is already widely in use? At least in
Europe, this does not seem to be the case, neither in Central Europe nor in
countries where the Cyrillic alphabet is used. Therefore, looking for characters
that are homographs of Cyrillic characters or of non-ASCII Latin characters is
not urgent, because right now there are hardly any widespread IDN addresses that
many people are so familiar with that it would make sense to spoof them in a
phishing attack (and a Russian site with Latin characters that are homographic
with a Russian word is likely to be a legitimate one). So, what is urgent now is
determining homographs and near-homographs of the ASCII character set.

It seems that widespread adoption of IDN will take enough time that data about
homographs with additional characters - e.g. Cyrillic, non-ASCII Latin
characters in European languages - can be collected in the meantime. Then,
however, the problem will be bigger for Cyrillic than it is for ASCII
characters. While Cyrillic characters in the address of a website where nothing
Cyrillic is expected (e.g. in the language configuration of the browser) are in
any case suspicious, domains with Latin characters will probably remain common
in countries with the Cyrillic alphabet because they have been in use for such a
long time, and as has been pointed out in this discussion, many existing Russian
addresses are legitimate homographs of Russian words created with Latin
characters. Furthermore, there is Serbia, where both the Cyrillic and the Latin
alphabets are used. A good solution would, of course, be if registrars in these
countries prevented the registration of new domains that look the same as
existing ones (taking into account at least the Cyrillic and Latin alphabets;
homographs from other character sets could still be treated as suspicious by the
browser).

When the IDN system is widely adopted, good browsers for people in Eastern
Europe will probably have to display Latin and Cyrillic characters differently,
anyway, not only to prevent phishing attacks, but also just to avoid confusion,
especially with domains that are abbreviations and can easily be accidental
homographs.
(In reply to comment #224)

The idea of displaying Latin and Cyrillic characters differently in domain
names is very interesting. It seems to me that there may already be
conventions in ordinary documents (e.g. email, Web pages), such as the
hyphen between Latin and Cyrillic. I think one Russian person on the Unicode
mailing list even said he had trouble thinking of any examples *without* a
hyphen. Is the hyphen the only way it is done? The best way? What other ways
are used? Thanks.
(In reply to comment #225)
> (In reply to comment #224)
> 
> ... Is the hyphen the only way it is done? The best way? What other ways
> are used? Thanks.

I think someone already struck down color-coding different character sets (which
is what I would have liked, especially if it meant color-coding the whole URI,
not just the domain), but it's an example of an alternative way to tell them apart.
Depends on: 286534
Depends on: 286535
I'm glad that there's so much thought going into this issue, but I think much of
it isn't practical. To the end user, this is a browser-level issue, and we need
to treat it as such.
Next, the problem is much simpler. Forgetting about punycode, can you really
tell that www.paypal.com and www.paypa1.com are different? Can you do it in
Times New Roman? Now add in punycode and a multi-language, multi-charset
world, and we've got a headache. Somehow, the user needs to be alerted to the
possibility that they're going to a different site, but without being too intrusive.

Why not just have two URL boxes, or a URL box and a label next to it:
   www.paypal.com          www.xn--pypal-4ve.com
Somehow, get the UI to look nice, maybe a mouse-over or an alert bubble. For
users going to IDN sites, they need to deal with this. For everyone else, they
can ignore it. It's one better than "just display everything in punycode".

Another note-- most people see this as a problem with people thinking they're at
ASCII sites, but are actually at puny-coded sites, but if you're living in
Spain, going to www.aΙa.com (with GREEK CAPITAL LETTER IOTA, U+0399), but click
on a link that goes to www.aІa.com (with CYRILLIC CAPITAL LETTER
BYELORUSSIAN-UKRAINIAN I, U+0406), then they both look the same, and neither is
ASCII. With the above method, they'd see:
   www.aΙa.com               www.xn--aa-09b.com
They'd probably not know that they're going to the wrong address, but short of
maintaining a list of valid "similar" DNS entries, I don't see any general
solution to this problem.
This bug is on my plate for 1.8, but I'm not exactly working on a solution and
time's ticking.  I have many other important things to do for 1.8, and I'm
personally fine with the current solution of rendering only punycode because I
believe that the IDN spec is pretty broken (homographs of '/' considered valid
-- come on!).

If someone wants to champion a solution for Mozilla that would enable us to
safely enable IDN in some form, then by all means run with it.  I'll help where
I can, but I don't have the time to develop a solution myself.

I'm reducing the severity of this bug to minor because it only applies when the
default preferences are changed.  The original setting of critical was correct
for Firefox 1.0 and earlier Mozilla-based browsers, but it no longer applies.

I half expect my comments to raise a ruckus in this bug.  Please keep any
comments brief and constructive.  Already, this bug report has grown to a length
that would deter most from venturing to read it, let alone actually work on it.
 Not that there aren't plenty of great comments here... let's just keep it that
way ;-)
Severity: critical → minor
Priority: -- → P3
Target Milestone: mozilla1.8beta1 → mozilla1.8beta2
*** Bug 288667 has been marked as a duplicate of this bug. ***
(In reply to comment #228)
> If someone wants to champion a solution for Mozilla that would enable us to
> safely enable IDN in some form, then by all means run with it.  I'll help
> where I can, but I don't have the time to develop a solution myself.

What would need to be done? Would it be enough to maintain a whitelist of TLDs
in all.js, and then, in nsIDNService::Normalize
(http://lxr.mozilla.org/seamonkey/source/netwerk/dns/src/nsIDNService.cpp#253),
changing:

  if (mShowPunycode)
    return ConvertUTF8toACE(input, output);

to something like:

  if (mShowPunycode || !domainIsInWhitelist(input))
    return ConvertUTF8toACE(input, output);

If you think this would be enough, I might take a shot at it...
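
For what it's worth, a sketch of what the hypothetical domainIsInWhitelist()
might look like (the list contents, and reading them from a pref in all.js,
are assumptions, not existing Mozilla API):

  #include <set>
  #include <string>

  // Accept the Unicode display form only when the name's TLD is on a
  // whitelist of registries with sound anti-homograph policies; everything
  // else falls through to the ACE/punycode form.
  static bool domainIsInWhitelist(const std::string& host) {
    static const std::set<std::string> kWhitelist = {"jp", "kr", "de"};  // example
    std::string::size_type dot = host.rfind('.');
    if (dot == std::string::npos) return false;
    return kWhitelist.count(host.substr(dot + 1)) != 0;
  }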
(In reply to comment #0)

A coworker of mine, while reading the IDNA spec (RFC 3490), found a potentially insidious variant on
this vulnerability.  Apparently it is possible to encode the label separator (typically ASCII 0x2E, or '.'),
as well as the other valid label separators specified in RFC 3490, as an HTML entity embedded within
a URL.  This includes U+002E (full stop), U+3002 (ideographic full stop), U+FF0E (fullwidth full stop),
and U+FF61 (halfwidth ideographic full stop).

A sample of this is here: http://www.sleepwalk.org/279099_test.html

Essentially this means that the following URLs all resolve to www.google.com:
http://www.google&#x002E;com
http://www.google&#x3002;com
http://www.google&#xFF0E;com
http://www.google&#xFF61;com

This is insidious because it's somewhat different from the homograph attack described in the bug.
Instead of a URL using punycode to look like something it isn't (and thus redirecting a user to a location
different from the expected destination), there is an underlying translation going on that makes these
different label separators equivalent!  This is bad because it makes it difficult for software to
programmatically parse URLs if there is a way to obscure the label separator.

Also consider that encoding the separator as an HTML entity is not the only way to obscure the URL.  A
malicious sender could simply insert one of these equivalent separators in UTF-8 (<E3><80><82> for
instance).  See the above sleepwalk.org URL for an example that also resolves to www.google.com.

It seems to be a bug that Firefox is treating these stop characters as equivalent.
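
A sketch of the separator normalization under discussion (the byte strings are
the UTF-8 encodings of U+3002, U+FF0E and U+FF61; illustrative only, not the
actual Gecko code path):

  #include <iostream>
  #include <string>

  // Replace the non-ASCII label separators from RFC 3490 with an ASCII
  // full stop, since the spec requires them to be treated equivalently.
  std::string NormalizeSeparators(std::string host) {
    const char* kDots[] = {"\xE3\x80\x82",   // U+3002
                           "\xEF\xBC\x8E",   // U+FF0E
                           "\xEF\xBD\xA1"};  // U+FF61
    for (const char* dot : kDots) {
      std::string::size_type pos;
      while ((pos = host.find(dot)) != std::string::npos)
        host.replace(pos, 3, ".");
    }
    return host;
  }

  int main() {
    // "www.google<U+3002>com" normalizes to "www.google.com".
    std::cout << NormalizeSeparators("www.google\xE3\x80\x82" "com") << '\n';
  }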
The IDN spec requires that we treat those characters like a period.  That we
display them in the status bar and URL bar in the normalized form is a good
thing:  it means that authors can't get the strange forms that mean the same
thing but look different into the URL bar or status bar.  So early normalization
is a good thing here, and I'm not sure why (or perhaps even whether) you think
otherwise (unless you're testing 1.0 rather than 1.0.1).
(In reply to comment #232)
> The IDN spec requires that we treat those characters like a period.  That we
> display them in the status bar and URL bar in the normalized form is a good
> thing:  it means that authors can't get the strange forms that mean the same
> thing but look different into the URL bar or status bar.  So early normalization
> is a good thing here, and I'm not sure why (or perhaps even whether) you think
> otherwise (unless you're testing 1.0 rather than 1.0.1).

I totally agree that the IDN spec requires these separator characters be declared equivalent by a
compliant application.  However, I disagree with Firefox's handling of data that contain Unicode
characters.  According to the IDN spec, properly-encoded (ACE form) domain names should always 
contain *only* ASCII characters.  Firefox is recognizing malformed domains -- those that contain 8-bit 
data.  

If my IDN was "www.google\u3002com", toASCII() would output "www.google.com."  This output is 
correct and is the ACE form that should appear in data accepted and interpreted by Firefox.  Firefox 
should only accept valid ACE-encoded domains in urls.  By recognizing malformed  (8-bit) domains, 
we're opening up a big hole.  A malicious user could easily obscure a domain in this way.

For a demo of the output of toASCII/toUnicode, see
http://www-950.ibm.com/software/globalization/icu/demo/domain



(In reply to comment #233)
> Firefox should only accept valid ACE-encoded domains in urls.

Why?  It defeats a significant part of the point of IDN if we require authors to
have the ACE in their HTML rather than the Unicode, and has no security
advantages whatsoever unless we're depending on view-source for security, which
we're not.

> By recognizing malformed (8-bit) domains, 
> we're opening up a big hole.  A malicious user could easily obscure a
> domain in this way.

What hole?  We normalize before showing a URL in the status bar, the URL bar, or
even copying to the clipboard (copy link location).  (See attachment 174532 to
test this.)  So there's no way the user will ever see the non-normalized form
unless they view the source of the HTML.
(In reply to comment #234)
> Why?  It defeats a significant part of the point of IDN if we require authors to
> have the ACE in their HTML rather than the Unicode, and has no security
> advantages whatsoever unless we're depending on view-source for security, which
> we're not.

> What hole?  We normalize before showing a URL in the status bar, the URL bar, or
> even copying to the clipboard (copy link location).  (See attachment 174532 to
> test this.)  So there's no way the user will ever see the non-normalized form
> unless they view the source of the HTML.

The primary hole that concerns me is in HTML email, specifically spam/phishing scams/etc.  Anti-spam 
software tends to look at URLs included in messages for suspect domains, from RBLs or other sources.  
By recognizing malformed domains in Firefox (as well as other browsers), we've just created an easy 
way for spammers to get around mail filters.

I suppose that the anti-spam community could modify their programs to parse these malformed 
domains.  

Note also that, according to the IDN spec, "domain name slots" should always contain ACE (ASCII) 
domain labels, the output of toASCII(), and this includes URIs in HTML data:

> A "domain name slot" is defined in this document to be a protocol
> element or a function argument or a return value (and so on)
> explicitly designated for carrying a domain name.  Examples of domain
> name slots include: the QNAME field of a DNS query; the name argument
> of the gethostbyname() library function; the part of an email address
> following the at-sign (@) in the From: field of an email message
> header; and the host portion of the URI in the src attribute of an
> HTML <IMG> tag.

(In reply to comment #235)

RFC 3490 (IDNA) section 3.1 requirement 2 appears to require the Punycode
(ASCII) form, as you say. However, there are implementations that support
numeric character references (e.g. &#x3002;) in domain names in URIs in HTML,
including Mozilla, Opera and i-Nav, I believe. They may have been supporting
this for a while, and there may now be quite a lot of HTML pages out there
that depend on this behavior, so I don't know how realistic it would be to
try to get the implementations to comply with this part of the spec.
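
To make the layering concrete: the numeric character reference is resolved by the HTML parser
before the hostname ever reaches the IDNA code. A Python 3 sketch for illustration, with
html.unescape standing in for the browser's entity handling (this is not Mozilla's actual pipeline):

>>> import html
>>> html.unescape("www.google&#x3002;com")
'www.google。com'
>>> html.unescape("www.google&#x3002;com").encode("idna")
b'www.google.com'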

In any case, this issue is separate from the homograph issue. If you would
like to pursue it, may I suggest filing a separate bug?
(In reply to comment #236)
> (In reply to comment #235)
> In any case, this issue is separate from the homograph issue. If you would
> like to pursue it, may I suggest filing a separate bug?

Done.  Filed as bug 289183. 

I have checked the recent archive of spam on our lab machines (I work at an anti-spam company) and 
have not seen this in the field.  Not yet.  I assume this is because IE doesn't incorrectly interpret these 
malformed domains (I guarantee if it worked this way in IE, we'd see it).  This is definitely the time to fix 
it in Firefox, before the spam starts coming in!
(In reply to comment #237)

Thanks for filing the new bug.

MSIE doesn't support IDNA yet. That's why I mentioned i-Nav (an IDN plug-in
for MSIE).
Another possible new issue related to this bug: see Erik van der Poel's comments
on the IDN mailing list, idn=at=ops=dot=ietf=dot=org.

According to Erik, U+1160, HANGUL JUNGSEONG FILLER, is displayed in IDNs by
Firefox (and presumably other Gecko-based products) as a wide space, and is
therefore a homograph for ASCII space. This is a potentially large security hole
for phishing/spoofing. (The same is apparently true of the Internet Explorer
plug-in).

In a reply to that, Soobok Lee states that U+1160 is not touched by NFC
normalization, and therefore gets through Nameprep/Stringprep. Apparently,
U+1160 is only meaningful in conjunction with Hangul characters, and he
recommends that a standalone U+1160 should always be deleted, regardless of what
the existing IDN standards say.

This also raises interesting questions about stray combining characters in general.
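
A quick check with Python's unicodedata module bears this out: a standalone U+1160 survives both
NFC and the NFKC form that Nameprep actually applies (illustrative snippet):

>>> import unicodedata
>>> unicodedata.normalize("NFC", "\u1160") == "\u1160"
True
>>> unicodedata.normalize("NFKC", "\u1160") == "\u1160"
True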
(In reply to comment #239)

Filed bug 289588 to address the U+1160 font display issue itself.

We need to watch IETF and Unicode to see how they respond to the Korean fillers
and leading combining marks in IDNA/Stringprep.
Depends on: 290275
Darin, is any more work here planned to happen in the next few days? If not,
then this probably needs to get pushed out to 1.8b3 or beyond. 
No, I have no plans to work on this for 1.8b2.  I'm not even sure that I will
have time for Gecko 1.8.  Help would be greatly appreciated.
Target Milestone: mozilla1.8beta2 → mozilla1.8beta3
darin doesn't have any time for this in beta2, and may not have time to get it
into 1.1. 
Flags: blocking1.8b3?
Flags: blocking1.8b2+
Flags: blocking1.8b-
Flags: blocking-aviary1.0.1-
Bug 286534 fixes part of this bug. 

We also need to have a small blacklist of characters which IDN allows but in
fact we never allow because they are confusable with URL delimiters. I don't
know if there is a bug for this yet. This will not cause significant
interoperability problems because none of those characters are in the character
tables of the TLDs which will be whitelisted.


Gerv
The character blacklist issue is bug 283016 and the IDN tracking bug is
bug 237820.
Blocks: sbb-
No longer blocks: sbb?
Flags: blocking1.8b3? → blocking1.8b3-
Can I reassign this bug to someone (gerv, jshin, ?) who is actually working on
this?  Thanks!
I'm not sure that this bug has significant remaining value, but I'll assign it
to me for the moment.

Gerv
Assignee: darin → gerv
Status: ASSIGNED → NEW
Whiteboard: [sg:fix] → [sg:spoof]
Flags: blocking-aviary1.5+
What is required for detecting mixed scripts as outlined in UTR#36?

Does Mozilla have an internal data structure that stores the properties and scripts
of Unicode characters?
jshin: are you able to answer the question in comment #248?

Gerv
gerv, 
The intl library has a (currently disabled) API for Unicode character properties, but it
doesn't have an API for script identification. gfx/src/win has an internal
routine for that, but it's not public yet. Perhaps we should move it to intl,
refine it, and make it accessible to others.
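
For what it's worth, until such an API exists, a rough cross-check is possible from character names
alone. The Python sketch below is illustrative only; the helper names are invented, and deriving a
"script" from the first word of the Unicode character name is just an approximation of the real
Scripts.txt property:

import unicodedata

def rough_script(ch):
    # Approximate the script from the first word of the character
    # name, e.g. "CYRILLIC" from "CYRILLIC SMALL LETTER A".
    try:
        return unicodedata.name(ch).split()[0]
    except ValueError:
        return "UNKNOWN"

def is_mixed_script(label):
    # Flag labels whose letters come from more than one script.
    scripts = {rough_script(c) for c in label if c.isalpha()}
    return len(scripts) > 1

print(is_mixed_script("p\u0430ypal"))  # True: Latin plus Cyrillic 'а'
print(is_mixed_script("paypal"))       # False: all Latin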
(In reply to comment #250)
> Perhaps we should move it to intl, refine it, and make it accessible to others.

Sounds good to me :)
Blocks: 316730
Cross-reference: see bug 316727 for mixed-script detection code, which I currently plan to use to trigger Punycoded display when incompatible scripts are mixed.

This is designed to be consistent with the version 2.0 ICANN IDN recommendations, which directly address homograph spoofing issues.
*** Bug 319397 has been marked as a duplicate of this bug. ***
Flags: testcase+
Flags: in-testsuite+ → in-testsuite?
Regarding blacklisting.
I would be sad if you blacklisted certain characters and/or prevented mixing.
I might want a subdomain in a certain language or with an odd character just for fun.
For example:
http://xn--7xa.m8y.org/
http://φ.m8y.org/
http://xn--h4h.m8y.org/
http://☠.m8y.org/
http://xn--j4j.m8y.org/
http://☢.m8y.org/

Heck.  There are a reasonable number of combinations like that already registered as domains.
They are harmless and fun.  Kind of a nice "extra" for browsers that can handle it.
I would prefer if options like notifying/colouring were not combined with blacklisting.
Or at least, if blacklisting didn't eliminate all non-linguistic symbols.

Thanks.
Oh. And regarding colouring and concerns for the visually impaired.
Even if there were no additional notification text, wouldn't someone using a screen reader get the actual character name for a spoof?
Like, if it looked like an 'i' but was a Unicode char, wouldn't it read the Unicode character name?

I don't know, not having a screen reader handy to test.
We now implement a whitelist of TLDs which have sensible practices.

Gerv
Status: NEW → RESOLVED
Closed: 16 years ago
Resolution: --- → FIXED
Any pointers to a bug where that was implemented? Can't seem to find it.
(In reply to comment #257)
> Any pointers to a bug where that was implemented? Can't seem to find it.
> 

bug 286534

http://www.mozilla.org/projects/security/tld-idn-policy-list.html
There's now a useful tool for investigating spoofing at the Unicode Consortium site:

http://unicode.org/cldr/utility/confusables.jsp
Is there a reason why Mozilla wouldn’t whitelist some obviously harmless characters for all TLDs? I am thinking of Latin-1 characters 192 through 255 (excluding 215: ×) for a start. I mean, the current policy makes IDNs practically useless for the most common TLDs, and this would make them work at least for some of the most common Latin-based languages.
We don't have a character whitelist.

Gerv
I see, but why? It doesn’t seem like something that is hard to implement. If I had to guess, it would take some lines in nsIDNService::isInWhitelist, a key like 'network.IDN.whitelist_chars' in all.js, and some definitions.
Consider http://www.paypäl.com/, which uses only the characters you are proposing to whitelist. Yet in spite of being entirely made out of common Latin-1 characters, this is clearly a spoofing risk for http://www.paypal.com/

The registry is potentially in a position to prevent this sort of confusion, since it already knows which domains have already been issued, but a browser-based algorithm is not.
(In reply to comment #263)
> Consider http://www.paypäl.com/, which uses only the characters you are
> proposing to whitelist. Yet in spite of being entirely made out of common
> Latin-1 characters, this is clearly a spoofing risk for http://www.paypal.com/

This is not a particularly strong argument; otherwise you'd have to disallow 0/o/O or 1/I/l for being too similar. And German readers, for example, aren't likely to take an ä for an a anyway...

> The registry is potentially in a position to prevent this sort of confusion,
> since it already knows which domains have already been issued, but a
> browser-based algorithm is not.

Most homograph attacks are based on similar characters from different "scripts" or "alphabets", e.g. "a" (0x61) vs. "а" (0x430). That's why .eu IDNs must not mix Latin, Greek, and Cyrillic characters.
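
(An illustrative Python check: the pair renders identically in many fonts but is trivially
distinguishable at the codepoint level.)

>>> "a" == "\u0430"        # Latin a vs Cyrillic а
False
>>> [hex(ord(c)) for c in "a\u0430"]
['0x61', '0x430']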
> otherwise you'd have to disallow
> 0/o/O or 1/I/l for being too similar

You're right about that: it's actually quite a strong argument for disallowing those at the registry, _in addition_ to non-ASCII confusables. In my opinion, the registries should do exactly that.

Regarding German users: yes, German users might well be more likely to see the umlaut than others, but 

1) most Internet users are not literate in German, and 
2) even German-literate readers will generally read what they expect, if they are already expecting to read "paypal".

Finally, regarding "whole-script" confusables, you might want to take a look at this 

http://unicode.org/cldr/utility/confusables.jsp?a=paypal&n=on&x=on&s=on

for examples of how mere constraints on script mixing are not nearly enough to prevent confusion, even when substantial efforts have been made to restrict the character repertoire.
Hmm, I would have thought that paypäl is impossible to confuse with paypal; that’s why I said “obviously harmless characters”. Maybe that really is a question of being used to diacritics. But still, unicode.org does not list a and ä as confusables. Also, Opera and IE would show that kind of IDN. So this is probably not a clear case. I think it should be reconsidered.

Indeed, paypa1.com would be far more dangerous (if it weren’t registered to an anti-fraud company), but you wouldn’t want to disallow 1und1.com and the like, even though registrars don’t check if there is a site lundl.com (actually, there is).
There's still more that can be done here.

See http://www.idnnews.com/?p=7109

Chrome displays the fake www.аmazon.com as http://www.xn--mazon-3ve.com/ on hover, but Firefox still shows it as http://www.аmazon.com
I filed bug 750587 for Brad's concern.
This seems to be back, and is being publicly discussed.

https://www.wordfence.com/blog/2017/04/chrome-firefox-unicode-phishing/
There is also a report on SUMO (Support Mozilla), a question asking about the same Wordfence blog:
> https://support.mozilla.org/t5/Firefox/firefox-phishing-warning/m-p/1391610
The poster of that question marked it solved after using about:config to toggle the pref
> network.IDN_show_punycode
to true. That is the workaround suggested in the Wordfence blog.


I note that unlike the examples in comment 2
(In reply to Daniel Veditz [:dveditz] from comment #2)
> Created attachment 171916 [details]
> more examples
> 
> from a spreadfirefox.com blog I found out this morning about
> http://www.retrosynth.com/misc/phishing.html which plays with the same idea:
>   www.xn--amazn-mye.com
>   www.xn--micrsoft-qbh.com
>   www.xn--papal-fze.com
>  ....
where the fake and real URLs give distinct displays on mouseover.

The example from Wordfence gives a mouseover result from the fake URL that visually matches the genuine URL,
using
> <a href="https://www.xn--e1awd7f.com/" target="_blank">
to spoof
> https://www.еріс.com/

As an additional twist, they have also obtained an SSL cert from the Mozilla-affiliated https://letsencrypt.org/ for the fake site.
Agreed that this bug is not yet fixed. This URL also popped up on Hackaday: https://www.xn--80ak6aa92e.com/ which spoofs apple.com. The idea behind how Firefox deals with IDN is explained in https://wiki.mozilla.org/IDN_Display_Algorithm:

> Instead, we now augment our whitelist with something based on ascertaining whether all the characters in a label all come from the same script, or are from one of a limited and defined number of allowable combinations. The hope is that any intra-script near-homographs will be recognisable to people who understand that script. 
> We retain the whitelist as well, because a) removing it might break some domains which worked previously, and b) if a registry submits a good policy, we have the ability to give them more freedom than the default restrictions do. So an IDN is shown as Unicode if the TLD was on the whitelist or, if not, if it met the criteria above. 

The example I linked to uses only the Cyrillic alphabet and is thus displayed in Unicode form, per the single-script rule of the algorithm. 

Perhaps, even if you allow IDN labels, you need to visually distinguish them, for example by marking the domain in a different color.
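
As an aside, the raw ACE form makes the trick visible immediately. A sketch using Python 3's
built-in "punycode" codec (the codec handles only the Punycode encoding itself, so the "xn--" ACE
prefix has to be stripped by hand):

>>> b"80ak6aa92e".decode("punycode")
'аррӏе'
>>> [hex(ord(c)) for c in b"80ak6aa92e".decode("punycode")]
['0x430', '0x440', '0x440', '0x4cf', '0x435']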
This bug is old and is already resolved and marked as fixed. 

It is probably not productive to continue commenting further in this bug.

A newer and currently reopened bug covering the subject is Bug 1332714. Bugzilla is, however, not the best place for general discussion of complex issues involving languages and ICANN policy. Any attempt to mitigate these issues is likely to have downsides, like hitting legitimate sites with either blocks or display problems. 

There are always the standard Mozilla forums:
https://www.mozilla.org/about/forums/
https://www.mozilla.org/en-US/about/forums/#dev-security 
https://groups.google.com/forum/#!forum/mozilla.dev.security.policy