Bug 41489 - HTTP authentication does not support non-ISO-8859-1 characters

Status: ASSIGNED
Whiteboard: [necko-would-take]
Keywords: dev-doc-needed, intl
Product: Core
Classification: Components
Component: Networking: HTTP
Version: Trunk
Hardware: All All
Importance: -- major with 15 votes
Assigned To: for nothing
URL: http://greenbytes.de/tech/webdav/rfc7...
Duplicates: 55738 88432 263576 264044 337130 340438 352953 420668 453342 576949 629962 656213
Depends on: 656503 656213
Blocks: 546330 61681
Reported: 2000-06-04 12:48 PDT by Stephen P. Morse
Modified: 2016-05-17 11:08 PDT
CC: 66 users


Attachments

- Patch for UTF-8 encode (1.45 KB, patch). 2008-04-15 11:30 PDT, Andrey M. No flags.
- Simple http authentication (290 bytes, application/x-php). 2008-04-15 11:42 PDT, Andrey M. No flags.
- Convert basic credentials to UTF-8 prior to base64 encoding them (7.96 KB, patch). 2010-01-09 06:58 PST, Robert Sayre. No flags.
- Allow converting basic creds to UTF-8 before base64 encoding (16.07 KB, patch). 2011-03-24 12:59 PDT, Nicholas Hurley [:nwgh][:hurley]. brian: review-; bzbarsky: superreview+.
- Allow utf-8 encoded username in digest authentication response header (4.17 KB, patch). 2013-04-07 23:58 PDT, ggo. No flags.
- Allow utf-8 encoded username in digest authentication response header (4.17 KB, patch). 2013-04-22 09:23 PDT, ggo. brian: review-.
- to do here as draft-ietf-httpauth-digest-15 (26.67 KB, patch). 2015-04-02 23:09 PDT, for nothing. honzab.moz: review-.
- to do here as draft-ietf-httpauth-digest-15 (refreshed as it should be) (25.95 KB, patch). 2015-04-07 14:45 PDT, Honza Bambas (:mayhemer). honzab.moz: review-.
- to do here as draft-ietf-httpauth-digest (26.84 KB, patch). 2016-02-01 19:34 PST, for nothing. No flags.
- to do here as rfc 7616 (26.02 KB, patch). 2016-02-08 09:46 PST, for nothing. honzab.moz: review-.
- patch for basic auth (2.38 KB, patch). 2016-05-09 23:14 PDT, for nothing. honzab.moz: review-.
- patch for digest auth (22.38 KB, patch). 2016-05-09 23:21 PDT, for nothing. honzab.moz: feedback+.

Description Stephen P. Morse 2000-06-04 12:48:26 PDT
I was just looking over the code for doing http authentication and I 
happened to notice that the username and password are being kept in a char*.  
That won't work with Japanese characters, for example.  You need to either work 
exclusively with PRUnichar* or at some point do a conversion 
from PRUnichar to UTF-8 encoding.

I picked this up inadvertently in a code reading, so I attempted to see what 
would happen if I used Japanese characters for HTTP authentication.  I tried it 
and got an assertion failure at nsBasicAuth.cpp line 60.  The comment preceding that 
line says "we work with ASCII around these here parts."

You can continue to work with ASCII if you do the UTF-8 conversion first.  Take a 
look at wallet.cpp and see how I handle double-byte characters and the UTF-8 
conversion.

BTW, you'll need to be able to enter and display a Japanese character set in 
order to demonstrate this problem.  I questioned the i18n team a long time ago 
about that and was given instructions (see bug 23037) for solving these two 
problems.  Basically you first need to do the following simple steps:

1. Get a Japanese font from http://jazz/users/teruko/publish/fonts/ie3lpkja.exe 
and install it.  You need to reboot the system after installation.

2. Modify the browser prefs to use that font (set the variable-width Japanese 
encoding to use MS Gothic).  Now you can display Japanese characters.

3. In order to enter Japanese characters without a Japanese keyboard, do a 
copy-and-paste from http://babel/testdata/double_byte/Selected_data_sjis.htm.
Comment 1 Stephen P. Morse 2000-08-10 14:14:01 PDT
I know what "future" means -- it means we aren't going to do it.  Is i18n 
comfortable with the fact that a Japanese user will never be able to enter his 
Japanese username in an HTTP authentication box?  Is he able to do so now in 
4.x?

It's not that difficult to continue working with ASCII and do a UTF-8 conversion 
first.  I'll be glad to work with you if you don't know how to do it.
Comment 2 Gagan 2000-08-10 15:18:09 PDT
Steve-- the bugs are not being marked future because we don't know how to do them. 
We are prioritizing. We have much more pressing bugs to look at before getting 
around to these. If you want to spend your free time on this-- go ahead.
Comment 3 Stephen P. Morse 2000-08-18 07:24:14 PDT
Haven't heard any comments from the i18n team about this so I guess they don't 
consider this important after all.  Therefore I'll leave it futured and give it 
back to gagan.
Comment 4 bobj 2000-08-18 15:38:04 PDT
Steve,
Thanks for the pro-active efforts on this.
I don't think non-ASCII usernames/passwords for HTTP authentication are a
high priority for PR3 or maybe even 6.0.  But does this gracefully degrade?
You mention you got an assertion.  In a non-debug build, what will happen?
Comment 5 Stephen P. Morse 2000-08-18 15:49:24 PDT
I don't recall what happened after I got the assertion, it was a while ago that 
I tested this.
Comment 6 bobj 2000-08-21 11:25:42 PDT
Teruko, Can we have someone try this with Japanese characters and add a
comment to this bug about what happens?
Comment 7 Darin Fisher 2000-11-28 15:48:27 PST
This is probably a more serious concern with the realm string.  The realm
will more than likely be sent in a localized form.  However, RFC 2616 states
that the realm string must be of type quoted-string, which means that it
can include any TEXT (including non-ASCII characters) except for " and
certain control characters.  If the TEXT contains characters from character
sets other than ISO-8859-1, then the TEXT must be encoded according to the
rules of RFC 2047.  I haven't had a chance to read RFC 2047, so I don't know
exactly what that means. 
Comment 8 Darin Fisher 2000-11-30 19:57:41 PST
->me
Comment 9 Darin Fisher 2001-11-27 11:44:43 PST
-> upping priority
Comment 10 Darin Fisher 2001-11-27 12:15:00 PST
*** Bug 88432 has been marked as a duplicate of this bug. ***
Comment 11 Darin Fisher 2002-01-23 22:20:57 PST
do we have existing code for handling text strings encoded according to RFC 2047?
Comment 12 Darin Fisher 2002-01-23 22:27:07 PST
nsHttpBasicAuth.cpp still has this problem... here's the problem code:

    // we work with ASCII around here
    nsCAutoString userpass;
    userpass.AssignWithConversion(username);
    if (password) {
        userpass.Append(':');
        userpass.AppendWithConversion(password);
    }

according to RFC2617 section 2, the username and password need to be of type
TEXT before they are base64 encoded, and according to RFC2616 section 2.2:

  Words of *TEXT MAY contain characters from character sets other than
  ISO-8859-1 only when encoded according to the rules of RFC2047.

so, for properly interpreting the realm string, we need to decode using RFC2047,
and for properly constructing an Authorization request header, we need to encode
using RFC2047.

so, it comes down to this: is there any existing support for RFC2047 within mozilla?
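The narrowing conversion in the snippet above can be reproduced outside of necko. A minimal Python sketch (illustrative only, not the Mozilla code) of what keeping just the low byte of each UTF-16 code unit does to Basic credentials:

```python
import base64

def lossy_basic(username: str, password: str) -> str:
    """Mimic the lossy wide-to-narrow conversion: keep only the low
    byte of each UTF-16 code unit, then base64-encode the result."""
    def narrow(s: str) -> bytes:
        return bytes(ord(c) & 0xFF for c in s)
    userpass = narrow(username) + b":" + narrow(password)
    return "Basic " + base64.b64encode(userpass).decode("ascii")

# U+5353 (a CJK character) narrows to 0x53, i.e. plain ASCII "S":
print(lossy_basic("\u5353", "password"))  # Basic UzpwYXNzd29yZA==
```

This silently corrupted credential is exactly what comment 30 later observes on the wire.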
Comment 13 bobj 2002-01-24 04:34:15 PST
> so, it comes down to this: is there any existing support for RFC2047
> within mozilla?

Yes, because we encode mail headers according to rfc2047.
It's probably in source/mailnews/mime/src/

Naoki, can you help?
Comment 14 nhottanscp 2002-01-24 13:01:32 PST
nsIMimeConverter::EncodeMimePartIIStr
http://lxr.mozilla.org/seamonkey/source/mailnews/mime/public/nsIMimeConverter.h#87

Comment 15 Darin Fisher 2002-01-24 20:36:48 PST
oh.. this is bad.  perhaps we need to move some of this mime stuff into a more
generic place.  necko cannot depend on mailnews being installed for proper intl
support.  suggestions?
Comment 16 nhottanscp 2002-01-25 10:57:06 PST
>perhaps we need to move some of this mime stuff into a more generic place
Cc to mail engineers
Comment 17 Darin Fisher 2002-01-28 19:24:21 PST
can someone on the mail team comment on this bug?  i'm wondering if there is a
way to leverage some of the existing mime encoding/decoding facilities for HTTP.
 we need to handle RFC2047 encoding, but necko cannot depend on mailnews.

any suggestions on how best to handle this?
Comment 18 nhottanscp 2002-01-30 10:04:54 PST
To encode, we need to specify a charset name; is that available in the code?
Also, we need to convert the string from UTF-8 to that charset, so we need uconv. Does
necko currently depend on uconv?
Taka is working on MIME encoder, cc to him.
Comment 19 Takayuki Tei 2002-01-30 11:12:38 PST
Do we really want to do this?  I see lots of implications here which are
not easy to get answers.

For instance, RFC-2617 "3.2.2 The Authorization Request Header" has the
following BNF:

       credentials      = "Digest" digest-response
       digest-response  = 1#( username | realm | nonce | digest-uri
                       | response | [ algorithm ] | [cnonce] |
                       [opaque] | [message-qop] |
                           [nonce-count]  | [auth-param] )

       username         = "username" "=" username-value
       username-value   = quoted-string

And, RFC-2047 says:

       + An 'encoded-word' MUST NOT appear within a 'quoted-string'.

I don't know how we can get over this conflict.
Comment 20 Darin Fisher 2002-01-30 12:11:53 PST
RFC2617 depends on the definitions from RFC2616, which specifies that
quoted-string can contain RFC2047 encoded words.

nhotta: necko currently depends on uconv.
Comment 21 Darin Fisher 2002-02-06 18:36:30 PST
-> future
Comment 22 Darin Fisher 2002-02-06 18:37:07 PST
not critical for mozilla 1.0
Comment 23 Andrew Clover 2002-04-10 15:24:11 PDT
[disclaimer: I am unfamiliar with Digest authentication. I just want Basic to work.]

Is the suggestion that user/password values using non-ASCII characters should be
encoded something like:

  =?shift_jis?q?=93=fa?=:=?shift_jis?q?=96=7b?=

Is there any server-side software at all that will understand this? It seems a
break from established practice even if it is to the letter of the standard.

Using an encoding intended for mail-header parts, where there are many
out-of-bounds characters, seems nonsensical for something that's about to be
base64-encoded, i.e. has no out-of-bounds characters. It's especially silly if the
coding is 'b' - as recommended for primarily non-US-ASCII input - as you then
end up base64-encoding everything twice!

It seems Opera6 just submits basic auth values as UTF-8. This seems fairly
reasonable to me. (maybe any charset given in the Content-Type header of the
response including the WWW-Authenticate header could be used as a hint,
otherwise?) On the other hand, IE6 shares Moz's current inability to submit
non-US-ASCII characters in auth.
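The double encoding objected to above can be made concrete. A hypothetical sketch (Python, illustrative only) that builds an RFC 2047 'b'-coded word for the two Shift-JIS characters from the example, then base64-encodes the whole thing again as Basic requires:

```python
import base64

# Hypothetical two-character Shift-JIS username (the same characters
# as the =93=fa / =96=7b bytes quoted above).
raw = "\u65e5\u672c".encode("shift_jis")                 # 4 bytes
word = "=?shift_jis?b?%s?=" % base64.b64encode(raw).decode("ascii")
print(word)        # 24 ASCII characters carrying 4 payload bytes

# Basic auth then base64-encodes the user:pass string *again*:
cred = base64.b64encode((word + ":" + word).encode("ascii")).decode("ascii")
print(len(cred))   # 68 characters on the wire for 9 bytes of credentials
```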
Comment 24 Darin Fisher 2002-04-10 15:32:47 PDT
yeah, i don't know why the spec doesn't say to use base64 encoded UTF-8.  that
would be fairly sensible, but there is a defined standard.  opera's
implementation sounds non-conformant, which is generally a bad thing.  internet
standards exist for a reason, right? ;-)
Comment 25 Andrew Clover 2002-04-11 03:07:48 PDT
/me grudgingly accepts the Internet Standards Existing for a Reason argument.
(Although secretly I'd be quite happy if the Mozilla team said "Hey, look over
there! A dinosaur!" and hacked UTF-8 in whilst no-one was paying attention.)

PRO: as well as allowing non-Latin-1 characters to be used in
usernames/passwords, encoding them could finally allow colons to be used.
Currently all browsers submit an entered username of 'a:b' with password of
'c:d' as 'a:b:c:d', leaving the unlucky server to work out what was meant.

CON: as far as I know, no server software will support this. Certainly a quick
flick through the Apache 1.3 auth code and modules reveals no such handling.

Basic Authentication really isn't very well designed is it?
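The colon ambiguity described above is easy to demonstrate; a small illustrative sketch with made-up credentials:

```python
import base64

# Username "a:b" and password "c:d" are joined with ":" before encoding,
# so the wire form is indistinguishable from other user/password splits.
token = base64.b64encode(b"a:b" + b":" + b"c:d").decode("ascii")

# A server following RFC 2617 splits on the *first* colon:
user, _, pw = base64.b64decode(token).partition(b":")
print(user, pw)  # b'a' b'b:c:d' -- not what the user typed
```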
Comment 26 Dmitry Marochko 2004-10-12 10:27:11 PDT
*** Bug 264044 has been marked as a duplicate of this bug. ***
Comment 27 Darin Fisher 2004-10-15 20:59:30 PDT
*** Bug 263576 has been marked as a duplicate of this bug. ***
Comment 28 benc 2004-11-14 23:04:23 PST
There was a comment in bug 263576 about IE doing this a bit better than us; I
think that referred to displaying the realm.
Comment 29 Simon Montagu :smontagu 2006-06-06 05:27:03 PDT
*** Bug 340438 has been marked as a duplicate of this bug. ***
Comment 30 Simon Montagu :smontagu 2006-06-06 05:43:08 PDT
Bug 340438 contains the headers sent by Firefox and IE when attempting to log in with the user name "卓" (U+5353).

We send Authorization: Basic UzpwYXNzd29yZA==
which decodes as S:password

IE sends Authorization: Basic 1786cGFzc3dvcmQ=
which decodes as <0xd7bf>:password. 0xd7bf is 卓 in cp936
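Both observations can be verified by decoding the tokens; a quick sketch:

```python
import base64

firefox = base64.b64decode("UzpwYXNzd29yZA==")
print(firefox)                 # b'S:password' -- U+5353 truncated to its low byte

ie = base64.b64decode("1786cGFzc3dvcmQ=")
print(ie[:2].hex(), ie[2:])    # d7bf b':password' -- the cp936 bytes
```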
Comment 31 Jungshik Shin 2006-06-08 06:15:24 PDT
(In reply to comment #30)

> We send Authorization: Basic UzpwYXNzd29yZA==
> which decodes as S:password

So, we're just truncating 0x5353 (=U+5353 in UTF-16) to 0x53 as expected.
 
> IE sends Authorization: Basic 1786cGFzc3dvcmQ=
> which decodes as <0xd7bf>:password. 0xd7bf is 卓 in cp936

IE perhaps uses the default character encoding, while Opera uses UTF-8. 

And RFC 2616 stipulates that RFC 2047 be used.  This is a total mess. 
How about major web servers? I wonder whether Apache 2.x has anything better than Apache 1.x.

BTW, bug 295084 is about moving RFC 2047/2231 encoding routines from mail to necko.

Comment 32 Christian :Biesinger (don't email me, ping me on IRC) 2006-06-27 03:56:03 PDT
*** Bug 337130 has been marked as a duplicate of this bug. ***
Comment 33 Christian :Biesinger (don't email me, ping me on IRC) 2006-09-16 10:51:46 PDT
*** Bug 352953 has been marked as a duplicate of this bug. ***
Comment 34 Robert Sayre 2006-10-14 19:08:11 PDT
The IETF is spinning some wheels on RFC2617 internationalization. I'll watch them and fix this when they have a solution.
Comment 35 Robert Sayre 2007-05-03 20:13:53 PDT
OK, according to spec, it's clear that non-encoded fields like realm should be rfc2047-encoded, but it's not clear which character set they should be encoded from. The same encoding conundrum applies to hexdigest- and base64-encoded fields.
Comment 36 David Nesting 2007-05-07 13:16:52 PDT
Ick.

The WWW-Authenticate header is sent in response to a request, so the server may have client cues (Accept-Charset) to go on in formatting this header per RFC2047.  We (the browser) shouldn't need to care what's in the realm, except when we need to display it to the user, but even then, things should "work" even if we don't understand how to display it.  (Is the realm "foo" equivalent to the realm "=?US-ASCII?Q?foo?="?)

Andrew's comment 23 would seem to be the proper approach, according to RFC 2617.  This seems very inelegant, though, and I agree that nothing anywhere is going to support this any time soon, and in the mean time, lots of stuff will break.  I would advocate a change to the standards, moving to UTF-8 in base64-encoded fields.  The real constraint is the need for the headers to remain US-ASCII, and by encoding with base64, you've already accomplished that.
Comment 37 Robert Sayre 2007-05-13 21:48:35 PDT
(In reply to comment #36)
> Ick.
> 
> The WWW-Authenticate header is sent in response to a request, so the server may
> have client cues (Accept-Charset) to go on in formatting this header per
> RFC2047.  We (the browser) shouldn't need to care what's in the realm, except
> when we need to display it to the user, but even then, things should "work"...

There is a general problem here, regarding HTTP authentication headers, but you have made conclusions that apply only to Basic authentication. For example, the Realm is part of the "response" field in Digest. That field is hex-encoded, so there are no immediate problems with it, but it's not clear how to encode the given fields in order to perform the Digest calculations. Authentication needs to work if the username is in Mongolian, the password is in Japanese, and the Realm contains a Euro symbol.

> 
> Andrew's comment 23 would seem to be the proper approach, according to RFC
> 2617.  This seems very inelegant, though, and I agree that nothing anywhere is
> going to support this any time soon, and in the mean time, lots of stuff will
> break.  I would advocate a change to the standards, moving to UTF-8 in
> base64-encoded fields.

Yeah, that is one option. Unfortunately, it is not compatible with IE, afaik.
Comment 38 Zhang Weiwu 2007-12-30 04:55:43 PST
Is there a way to push this? I made a bad mistake: yesterday I announced that each username would be an office name, and only later learned of this bug. When I decided the usernames would be office names, I never imagined Fx would have an i18n problem in this area, because Fx holds the i18n crown IMHO. Now I am frustrated to have to announce something different again, since users come from all different places and any change in this organization is painful. Yes, I should have checked Fx before setting any user-management policy, but who could expect such a bug?

Send the username the way IE does, or the way Opera does, or the way the RFC says; whatever you do, I can modify my web application to cope with it. But if you drop information from the username, that makes it impossible for the web-developer side to do anything to save the situation. ANY CHANGE IS BETTER THAN THE CURRENT SITUATION. However, I know it is probably impossible in this atmosphere; things are like that because they used to be like that, and that is enough to make sure it can hardly be changed.

Please understand that this really breaks things completely for non-ASCII users. Sometimes I think of situations like Opera's or IE's: in a non-democratic organization a manager can make an arbitrary decision, and that happens to help solve problems faster. (Though one can argue it also helps create problems faster.)
Comment 39 Andrew Clover 2008-01-05 08:32:51 PST
Had a further rummage through the RFCs and to be honest none of it makes much sense. RFC2616's references to RFC2047 are in the context of TEXT, which in all the cases that matter to us are completely against the spirit of RFC2047 encoded-words-as-atoms - indeed in the case of the quoted-string realm, as Taka posted, it's also against the letter of RFC2047.

I think this is an error in 2616 (it wouldn't be the only one). HTTP is not explicitly an RFC822-family protocol: it defines its own header structures that are incompatible with 822 in significant ways instead of deferring to the older standard. If it really wanted to say we should be putting 2047 encoded-words inside 2616 quoted-strings, and base64-encoding them before putting them in auth headers, it should have specifically said so. Just referencing 'the rules of RFC2047' is not enough when those rules are incompatible with the rules of 2616.

In summary, the HTTP standard is self-contradictory and nobody has managed to interpret and implement it in any consistent way. So we should simply ignore it and not attempt to put 2047 encoded-words in HTTP headers at all(*).

The remaining options are: (a) continue to simply drop non-ISO-8859-1 characters on the floor (status quo), (b) just use UTF-8 (Opera method), (c) use the system codepage (IE method), and (d) guess.

If (a), we should at least stop non-ISO-8859-1 characters from being entered in 
the UI, to make it clear they're not permissible.

If (b), we potentially break any sites that are currently working with ISO-8859-1 (eg. if I have a username with an acute accent on it), when running on non-Western European machines. Are Opera users significantly affected by this? I haven't seen any reports of it.

If (c), we potentially break any sites not running in the same region as the client, and introduce an obscure dependency the user won't know how to sort out. How important is IE compatibility these days?

(d) might work, perhaps based on the Content-Type;charset of the page served to us with the WWW-Authenticate header, if any, falling back to pref intl.charset.default.
Comment 40 Andrew Clover 2008-01-05 08:45:14 PST
heh, forgot to add the off-topic footnote -

* - in the main entity headers, that is.

When posting eg. a multipart/form-data, the headers in each part are out of the scope of 2616, which explicitly (in section 3.7.2) hands their definition over to RFC2046, making them real 822-family headers, where RFC2047 and RFC2231 apply in the usual way.

That would mean that technically we might be submitting non-ASCII form fields as something like:

  Content-Disposition: form-data; name*=iso-8859-1'en'n=E9e

However, RFC2388, defining multipart/form-data again makes the same mistake as RFC2616 and HTML 4.01 in recommending RFC2047 encoded-words for use inside parameter values, as specifically disallowed in 2047. Sigh, them standards eh...

Of course in practice Mozilla should do neither, as it would solve little and break a lot, and continue with the current strategy of encoding field names to the form submission charset. Whether it is worth correctly following the quoted-pair scheme for backslash-escaping " and \ inside a quoted-string is another matter. (Opera does; IE does not.)
Comment 41 Christian :Biesinger (don't email me, ping me on IRC) 2008-03-03 02:44:18 PST
*** Bug 420668 has been marked as a duplicate of this bug. ***
Comment 42 Alexander 2008-03-03 15:46:55 PST
Eight years of this bug. Maybe it should be fixed? All you need is to encode the data to UTF-8, as I understand it.
Comment 43 Christian :Biesinger (don't email me, ping me on IRC) 2008-03-04 00:42:53 PST
HTTP does not allow non-latin1 headers, see RFC 2616 2.2 (definition of TEXT), they have to be encoded.
Comment 44 Alexander 2008-03-04 00:57:19 PST
Is utf-8 not compatible with this standard?
Comment 45 Zhang Weiwu 2008-03-04 01:05:19 PST
"HTTP does not allow non-latin1 headers, see RFC 2616 2.2 (definition of TEXT),
they have to be encoded."

That doesn't mean we (the users) should only use latin1, which is what's happening now. I use my own language everywhere; why should I be restricted to latin1 because of Fx?

Whatever solution it is, just solve it. (UTF-8 is most attractive for being simple and effective).
Comment 46 Christian :Biesinger (don't email me, ping me on IRC) 2008-03-04 02:43:21 PST
All I was saying is that it's not as simple as comment 42 made it sound.
Comment 47 Zhang Weiwu 2008-03-04 02:51:37 PST
"Is utf-8 not compatible with this standard?"

Unfortunately, no. First, it is an RFC: a regulation or (strong) recommendation, not strictly a standard. However, I am not sure whether it is practical to wait for the IETF to correct it upstream before we do something. On the other hand, perhaps Fx could correct it NOW (say, by using UTF-8); then the IETF would face the fact that NO BROWSER FOLLOWS their RFC and that the method used by most browsers is UTF-8, and it would have to recognize this fact and update the RFC, say, by 2028.

I am completely unsure how the IETF works and whether this is practical, but letting the slow pace of RFC updates hurt the end-user experience is the downside of open source. Commercial software faces pressure from the user side not to wait for a mature standard (e.g. Opera and IE).
Comment 48 Alexander 2008-03-04 19:01:16 PST
I support Zhang Weiwu. Why must non-English users be limited to ASCII characters?
Comment 49 Alexey Gubanov 2008-03-19 21:51:20 PDT
This bug has been added to our (Mozilla Russia) bug bounty program. The bounty is $300. For more details, visit the program page: http://www.mozilla-russia.org/contribute/bounty-en.html
Comment 50 Andrey M. 2008-04-15 11:30:41 PDT
Created attachment 315810 [details] [diff] [review]
Patch for UTF-8 encode

This patch changes the behaviour of HTTP authentication to send the username and password in UTF-8. With this patch, authentication in Mozilla happens exactly as in Opera.
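The Opera-style behaviour this patch adopts can be sketched outside the tree (illustrative Python, not the actual patch):

```python
import base64

def basic_utf8(username: str, password: str) -> str:
    # Encode the credentials as UTF-8 before base64, instead of
    # truncating non-ISO-8859-1 characters to a single byte.
    userpass = (username + ":" + password).encode("utf-8")
    return "Basic " + base64.b64encode(userpass).decode("ascii")

# The U+5353 username from comment 30 now survives intact:
print(basic_utf8("\u5353", "password"))  # Basic 5Y2TOnBhc3N3b3Jk
```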
Comment 51 Andrey M. 2008-04-15 11:42:54 PDT
Created attachment 315813 [details]
Simple http authentication

This PHP script implements simple HTTP authentication for testing this bug.
Comment 52 Zhang Weiwu 2008-04-15 19:50:58 PDT
A user's voice: SUPPORT THIS MOVE!
Comment 53 Julian Reschke 2008-05-27 10:00:05 PDT
It appears that FF indeed uses UTF-8, but only if the request is sent via XMLHttpRequest (otherwise, as discussed above, ISO-8859-1). This really looks bizarre. Does this deserve its own bug?
Comment 54 Wladimir Palant 2008-09-02 11:57:02 PDT
*** Bug 453342 has been marked as a duplicate of this bug. ***
Comment 55 Evgeniy Ivanov 2008-09-28 04:43:45 PDT
(In reply to comment #50)
> Created an attachment (id=315810) [details]
> Patch for UTF-8 encode
> 
> This patch change behaviour of http authentication to send username and
> password in UTF-8. After this patch authentication in Mozilla happen exactly
> like Opera.

Since we should respect standards, I suggest doing this only for non-Latin characters, and leaving ASCII for Latin usernames/passwords. That way we do not change current behavior, but we fix the bug for non-Latin characters.
Comment 56 Julian Reschke 2008-09-28 06:37:00 PDT
(In reply to comment #55)
> Since we should to respect standards I can suggest to do it only for non-Latin
> characters. And leave ASCII for latin usernames/passwords. Doing things in this
> way we will not change current behavior, but fix the bug for non-latin
> characters.

That doesn't compute. USASCII is a subset of UTF-8.

That being said: the spec says it's ISO-8859-1, and changing this would break existing servers.
Comment 57 Evgeniy Ivanov 2008-10-03 07:02:55 PDT
(In reply to comment #56)
> That doesn't compute. USASCII is a subset of UTF-8.
> 
> That being said: the spec says it's ISO-8859-1, and changing this would break
> existing servers.
Oops, I meant leaving ISO-8859-1 for Latin characters and switching to UTF-8 for non-Latin ones. I don't think it can break anything, and it would help with server configurations that understand UTF-8.
Comment 58 Andrew Clover 2008-10-03 10:37:20 PDT
> Oops, I meant leaving ISO-8859-1 for latin characters. And switching to UTF-8
for non-Latin.

I don't think having mixed-encoding strings is a useful option; it makes no sense and is compatible with no other browser.

Having discarded the option of RFC2047 encoding on the grounds that it is a logical contradiction and not compatible with anything, the basic options are:

1. ISO-8859-1. Compatible with the RFC and with IE in the US and Western Europe.

2. The system codepage. Compatible with IE in all locales.

3. UTF-8. Compatible with Opera, never compatible with IE (the system codepage cannot be UTF-8 under Windows.)

In my opinion none of these are really satisfactory.

To 2 we can add 2a: pref intl.charset.default. This is eminently preferable to the system codepage in my view as it is easier for the user to set without negative cross-platform consequences.

Better would be if we could guess an encoding based on any charset given in the Content-Type of the response containing the original WWW-Authenticate header that drove the authentication request. (With fallback to options 1 or 2a where no such parameter is set.) However, this is not a simple hack I can immediately see how to do; we would have to remember a charset everywhere auth details are stored.
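The incompatibility between these options shows up with a single accented character; an illustrative sketch (hypothetical username):

```python
import base64

def basic_token(userpass: str, charset: str) -> str:
    return base64.b64encode(userpass.encode(charset)).decode("ascii")

creds = "ren\u00e9:secret"                  # hypothetical user with U+00E9
latin1 = basic_token(creds, "iso-8859-1")   # option 1
cp1252 = basic_token(creds, "cp1252")       # option 2 on Western Windows
utf8   = basic_token(creds, "utf-8")        # option 3

# ISO-8859-1 and cp1252 agree for U+00E9, but UTF-8 differs, so a server
# provisioned against one cannot transparently accept the other:
print(latin1 == cp1252, latin1 == utf8)  # True False
```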
Comment 59 Julian Reschke 2008-10-03 10:45:42 PDT
Another option is to define "basic2" and "digest2", let them use UTF-8, and try to get them implemented in Apache and IIS as well.
Comment 60 Wladimir Palant 2008-10-03 10:47:11 PDT
(In reply to comment #58)
> Better would be if we could guess an encoding based on any charset given in the
> Content-Type of the response containing the original WWW-Authorize header that
> drove the authentication request.

That would be the charset of the server's 401 page - but this is usually a standard page meaning that its charset isn't related to the charset of the page you are sending the password to.
Comment 61 Magnus Melin 2008-10-03 11:13:01 PDT
> 2. The system codepage. Compatible with IE in all locales.

Though usable only by coincidence, as the server side wouldn't know what to decode to. My 2c would certainly be to go with the UTF-8 approach.
Comment 62 Andrew Clover 2008-10-03 13:05:05 PDT
> this is usually a standard page meaning that its charset isn't related to the charset of the page

It's not necessarily a standard page. If you're writing a web application you'll be returning an error page, probably in the same charset as every other page, which will be in the same charset you want for the authentication (UTF-8 if you're sensible).

If you get the standard page, the web server probably won't be serving it with a charset parameter so it falls through to the default.

> Though usable only by coincidence, as the server side wouldn't know what to
decode to. My 2c would certainly be to go with the UTF-8 approach.

Yes, but compatible by coincidence is still better than compatible never!

Putting UTF-8 in as-is does break existing IE compatibility for the US and Western European locales, giving site authors no way to be compatible with both browsers.

> Another option is to define "basic2" and "digest2"

That would be nice, but it's a lot of work pushing it through the WHATWG and browsers before site authors can use it! Is there any way it could be done retaining backwards compatibility for the cases where no non-ASCII characters are used? e.g.

WWW-Authenticate: basic realm="My realm;charset=utf-8"

(Looks ugly if the browser doesn't strip off the trailer, but at least it would still work.)
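An in-band signal along these lines is what RFC 7617 (the later revision of Basic) eventually standardized as the charset auth-param, whose only permitted value is "UTF-8". A deliberately simplified client-side sketch (real challenge parsing must handle quoting and multiple challenges):

```python
import base64
import re

def charset_for_challenge(www_authenticate: str) -> str:
    # Simplified: look for a charset auth-param; RFC 7617 only allows
    # the value "UTF-8", so anything else falls back to ISO-8859-1.
    m = re.search(r'charset\s*=\s*"?([\w-]+)"?', www_authenticate, re.I)
    if m and m.group(1).upper() == "UTF-8":
        return "utf-8"
    return "iso-8859-1"

challenge = 'Basic realm="My realm", charset="UTF-8"'
enc = charset_for_challenge(challenge)
token = base64.b64encode("ren\u00e9:secret".encode(enc)).decode("ascii")
print(enc, token)
```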
Comment 63 Julian Reschke 2008-10-03 13:42:12 PDT
(In reply to comment #62)
> > Another options is to define "basic2" and "digest2"
> 
> That would be nice, but a lot of work pushing it through WHAT-WG and browsers
> before site authors can use it! Is there any way it could be done retaining
> backwards compatibility for the cases where no non-ASCII characters are used?
> eg.
> 
> WWW-Authenticate: basic realm="My realm;charset=utf-8"
> 
> (Looks ugly if the browser doesn't strip off the trailer, but at least it would
> still work.)

The WHATWG isn't relevant for that. The HTTPbis WG (IETF) could be, although it's not chartered to define new authentication schemes.

That being said, new schemes can easily be introduced (from a technical p.o.v.) - the server can offer several different schemes, and the client can pick what it knows about.
Comment 64 Kyle Huey [:khuey] (khuey@mozilla.com) 2009-05-14 10:49:51 PDT
This bug has been stalled for almost a decade now, and it's certainly not the only HTTP Auth bug that's been around this long.  It would be nice if we could make some progress on this.

My two cents:
Providing some mechanism to allow non-ASCII characters is better than providing no mechanism.
UTF-8 encoding makes the most sense of the options presented.  If the IETF standard is unimplementable, as it seems to be from the above description, we should do what seems most sensible, IMO.

There's already a patch here for this (I haven't checked for bitrot, though), so if we can come to a consensus on the approach to take, this bug can be resolved shortly.
Comment 65 Julian Reschke 2009-05-14 11:03:05 PDT
"This bug has been stalled for almost a decade now, and it's certainly not the
only HTTP Auth bug that's been around this long.  It would be nice if we could
make some progress on this."

Yes.

"Providing some mechanism to allow non-ASCII characters is better than providing
no mechanism."

There is a mechanism, it's ISO-8859-1. There are servers out there that rely on it.

"UTF-8 encoding makes the most sense of the options presented.  If the IETF
standard is unimplementable as it seems to be from the above description, we
should do what seems the most sensible IMO."

The IETF standard is implementable; it "just" doesn't handle non-ISO-8859-1 characters.

"There's already a patch here for this (haven't checked for bitrot though), so
if we can come to a consensus on the approach to take this bug can be resolved
shortly."

Big -1. You will break stuff.

The right fix is to define a new scheme, let's say "Basic2", which differs from "Basic" in the encoding being UTF-8 instead of ISO-8859-1, and get it deployed. For instance, by adding support both in Firefox and Apache httpd.
Comment 66 Andrew Clover 2009-05-21 04:32:27 PDT
Kyle> It would be nice if we could make some progress on this.

Well, yes, but no-one can agree which direction counts as ‘progress’! There is no one direction that improves every use case from what we have at the moment. Whilst I agree that UTF-8 is a more sensible encoding in general (and using it was my original suggestion), we'd lose the accidental-compatibility with IE we currently have for Western European users (other than the differences between cp1252 and 8859-1).

Julian> There is a mechanism, it's ISO-8859-1.

I'd still dispute that RFC2616 says that. Whilst HTTP/1.1 headers are explicitly ISO-8859-1, authentication details are not an HTTP header, even if they are encapsulated in one. The base64 token decodes only to a bunch of bytes; there is no standards-endorsed way of turning that into characters.

Indeed, the current mechanism Moz uses isn't even ISO-8859-1 as such, it's just taking the least-significant byte of a UTF-16 codepoint. I would much rather have the user's configured default encoding used for this; this could retain the IE-partial-compatibility and allow other users to at least get sites to work until such time as a proper charset-signalling method is available.

> The right fix is to define a new scheme, let's say "Basic2"

Well, right or not, I can't see it getting support within another decade! :-(

Most importantly, could it be deployed in a backwards-compatible way? Do UAs even understand multiple ‘WWW-Authenticate’ challenges ATM? If not, do they at least read the first (Basic) challenge from a list of challenges?

If not, could we get away with adding an optional charset auth-param to Basic challenges and/or responses? (RFC2617 implies they should be extensible, but I don't know what real-world UAs and servers do here.) If not, all we're left with is the ugly option of an ‘X-Authorization-Charset’ request header.

Incidentally, the good news is that HTTPbis has recently removed the broken references to RFC2047 that got us all so confused above.
Comment 67 Julian Reschke 2009-05-21 04:53:19 PDT
Andrew> I'd still dispute that RFC2616 says that. Whilst HTTP/1.1 headers are
Andrew> explicitly ISO-8859-1, authentication details are not an HTTP header,
Andrew> even if they are encapsulated in one. The base64 token decodes only to a
Andrew> bunch of bytes; there is no standards-endorsed way of turning that into
Andrew> characters.

I agree it's not obvious, and that the spec sucks with respect to that. But, it *does* say:

   base64-user-pass  = <base64 [4] encoding of user-pass,
                    except not limited to 76 char/line>
   user-pass   = userid ":" password
   userid      = *<TEXT excluding ":">
   password    = *TEXT

So it re-uses the TEXT production, and that is defined in RFC2616 with the words (Section 2.2):

"The TEXT rule is only used for descriptive field contents and values that are not intended to be interpreted by the message parser. Words of *TEXT MAY contain characters from character sets other than ISO-8859-1 [22] only when encoded according to the rules of RFC 2047 [14].

    TEXT           = <any OCTET except CTLs,
                     but including LWS>"

So at least some people read this as defining it.

Andrew> Well, right or not, I can't see it getting support within another
Andrew> decade! :-(

It will never happen if nobody tries.

Andrew> Most importantly, could it be deployed in a backwards-compatible way?
Andrew> Do UAs even understand multiple ‘WWW-Authenticate’ challenges ATM?
Andrew> If not, do they at least read the first (Basic) challenge from a
Andrew> list of challenges?

We discussed that not so long ago in the context of http://tools.ietf.org/html/draft-broyer-http-cookie-auth-00, and my recollection is that only Opera had problems, and their developers signaled it was a known issue that needs to be fixed.

Andrew> If not, could we get away with adding an optional charset auth-param
Andrew> to Basic challenges and/or responses? (RFC2617 implies they should be
Andrew> extensible, but I don't know what real-world UAs and servers do here.)

The extension mechanism is "must ignore", in which case "old" UAs will do whatever they did for Basic, and thus fail to authenticate. If they did *not* ignore, it would be impossible to deploy.

Andrew> Incidentally, the good news is that HTTPbis has recently removed
Andrew> the broken references to RFC2047 that got us all so confused above.

They are gone from Part1, but Part7 (Auth) still delegates most to RFC2617 which still has it (HTTPbis is not chartered to revise RFC2617).
Comment 68 Julian Reschke 2009-05-21 05:29:34 PDT
I just did a quick test with

  WWW-Authenticate: Basic "foo" enc=utf-8

and it seems that all (Windows)-UAs ignore the parameter except for Opera and Chrome, for which I'll open bug reports.
Comment 69 Julian Reschke 2009-05-21 05:33:48 PDT
Of course that should have been

  WWW-Authenticate: Basic realm="foo" enc=utf-8

which makes it work in Opera, but still fails in Chrome.
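What this exercises is the "must ignore unknown auth-params" behaviour discussed above. A minimal sketch (illustrative only, not any browser's actual parser) of a tolerant challenge parser that keeps working whether or not the hypothetical enc parameter is present:

```python
import re

def parse_basic_challenge(header: str) -> dict:
    """Split a Basic challenge into its auth-params, ignoring none of them.

    Unknown parameters (like the proposed enc=utf-8) are simply collected;
    a client that only looks up 'realm' effectively ignores the rest.
    """
    scheme, _, params = header.partition(" ")
    assert scheme.lower() == "basic"
    out = {}
    # Accept both quoted-string and token values, comma- or space-separated.
    for m in re.finditer(r'(\w+)=(?:"([^"]*)"|([^\s,]+))', params):
        key, quoted, bare = m.groups()
        out[key.lower()] = quoted if quoted is not None else bare
    return out

c = parse_basic_challenge('Basic realm="foo", enc=utf-8')
print(c)  # {'realm': 'foo', 'enc': 'utf-8'}
```

A client built this way authenticates against both old servers (realm only) and servers that send the extra parameter.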
Comment 70 Henrik Nordstrom 2009-09-09 13:59:46 PDT
Just noticed this bug when reading the source trying to figure out why Digest auth completely fails with non-ASCII characters in the login even if within ISO-8859-1...
Comment 71 Zhang Weiwu 2009-09-09 18:48:31 PDT
I have been watching this bug for years. I've decided to second Andrew Clover's earlier idea again, to try to bring this 9-year-old bug a little step further.

I recommend adding a config option, "charset for HTTP authentication", defaulting to UTF-8, or, if that's not acceptable to you all, defaulting to ASCII (the current situation). That way admins could at least set the option to a reasonable value for all of their users in their own deployment. E.g. I would change it to UTF-8 as part of our desktop and notebook maintenance policy to make our intranet work, and I can imagine someone at a company whose intranet is Microsoft-based changing it to GB18030 (or whatever charset is the default for the majority of their company) to smooth the switch to Firefox without breaking existing web apps deployed on the MS stack.

I am afraid that if my suggestion is taken, the demand for a clean, final solution (defaulting to UTF-8, or a better RFC) will weaken, and the unreasonable ASCII default would sleep for a few more decades, maybe until Firefox fades away from the market, if there is such a day.

I also wonder how Konqueror and Google Chrome handle this case.
Comment 72 Jason Duell [:jduell] (needinfo? me) 2009-09-10 13:35:23 PDT
It would be very good to have some data on how Safari and/or Chrome are handling this.  Does anyone know?  Also, do we know how well Opera's decision to use UTF-8 has turned out?
Comment 73 Jason Duell [:jduell] (needinfo? me) 2009-10-09 12:09:37 PDT
*** Bug 55738 has been marked as a duplicate of this bug. ***
Comment 74 Andrew Clover 2009-12-01 05:40:50 PST
Chrome uses UTF-8.

Safari uses ISO-8859-1 (proper 8859, not cp1252) and silently fails to send any authorization header at all when there is a non-ISO-8859-1 character present in the username or password. (Safari also has some other fairly bad auth problems.)

It looks like every site is going to have dropped HTTP Basic Authentication in favour of cookies before this sees any resolution.
Comment 75 Robert Sayre 2010-01-08 06:49:34 PST
(In reply to comment #74)
> Chrome uses UTF-8.

Hmm, if we go with that, I think the two implementations can force the issue. Let's do it.
Comment 76 Julian Reschke 2010-01-08 07:39:16 PST
This *will* break with existing servers, unless Firefox retries with ISO-8859-1 upon 401.

Again: the proper solution is either to extend Basic with a parameter, or to define a new authentication scheme.
Comment 77 Robert Sayre 2010-01-09 06:56:52 PST
(In reply to comment #76)
> This *will* break with existing servers, unless Firefox retries with ISO-8859-1
> upon 401.

Firefox has never sent anything other than ASCII, so no servers that work with Firefox now will break. No browser actually follows the spec here, afaik, so utf8 seems like just the thing. We should certainly try the patch I'm about to attach and see how things go.
Comment 78 Robert Sayre 2010-01-09 06:58:40 PST
Created attachment 420886 [details] [diff] [review]
Convert basic credentials to UTF-8 prior to base64 encoding them
Comment 79 Robert Sayre 2010-01-09 07:00:15 PST
Comment on attachment 420886 [details] [diff] [review]
Convert basic credentials to UTF-8 prior to base64 encoding them

This is Andrey M.'s patch with a test added.
Comment 80 Justin Dolske [:Dolske] 2010-01-09 21:20:17 PST
(In reply to comment #77)

> Firefox has never sent anything other than ASCII, so no servers that work with
> Firefox now will break.

Are you sure? From the patch it looks like all we did was send the lower byte of a 2-byte UCS2 character; now any character between U+0080 and U+00FF will be encoded as 2 bytes (for UTF-8). So an existing username/password like "MëtälMøøse" would stop working... Servers that assumed we were sending ISO-8859-1, Windows-1252, or other charsets that happen to overlap with this range would have largely worked fine.
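Dolske's concern can be checked directly. A small sketch (the credentials are hypothetical, taken from the example above) comparing the bytes the old low-byte truncation sends with what a UTF-8 switch would send:

```python
import base64

# Hypothetical credentials from the comment above; not from any real site.
creds = "MëtälMøøse:secret"

# Old Firefox behaviour: keep only the low byte of each UTF-16 code unit,
# which for U+0000-U+00FF is byte-for-byte identical to ISO-8859-1.
low_byte = bytes(ord(c) & 0xFF for c in creds)

# Proposed behaviour: encode the credentials as UTF-8 first.
utf8 = creds.encode("utf-8")

print(base64.b64encode(low_byte).decode())  # what servers receive today
print(base64.b64encode(utf8).decode())      # what they'd receive after the change

# The header value changes, so existing Latin-1 logins would break.
assert low_byte != utf8
assert low_byte == creds.encode("latin-1")
```

Any username or password containing a character in U+0080..U+00FF produces a different Authorization header under the two schemes, which is exactly the regression being debated.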
Comment 81 Robert Sayre 2010-01-10 08:25:21 PST
(In reply to comment #80)
> 
> Are you sure? From the patch it looks like all we did was send the lower byte
> of a 2-byte UCS2 character, now any character between U+0080 and U+00FF will be
> encoded as 2 bytes (for UTF-8). So, an existing username/password like
> "MëtälMøøse" would stop working... 

oh shoot, I think you're right. More testing needed, I guess.
Comment 82 Emil Ivanov 2010-01-10 09:19:40 PST
At the least, is it possible to show the decoded content of these prompts? Popups like the one at http://lib.homelinux.org are unreadable.
Comment 83 Zhang Weiwu 2010-01-10 17:17:06 PST
Oops. Glad to know something other than Opera works with UTF-8. Not that I hate Opera, but our deployed web applications have some bugs with Opera that no one can fix because they are rooted too deep. I am migrating my users (a corporate deployment) to Chrome; changing from HTTP authentication to form-based authentication would create more problems than forcing users to change browsers. Luckily we have an IT department that can decide what users use.
Comment 84 Boris Zbarsky [:bz] (Out June 25-July 6) 2010-01-10 19:38:21 PST
Hmm.  Retrying with byte-truncation upon 401 might actually be reasonable...  and get us out of this impasse.  How hard would that be to do?
Comment 85 Honza Bambas (:mayhemer) 2010-01-11 12:42:17 PST
We could introduce a simple continuation state object for basic auth. It would keep the state of the attempt to send creds in UTF-8 or lossy-convert form. In case we failed with UTF-8, nsHttpBasicAuth::ChallengeReceived would then set *identityInvalid to PR_FALSE and move the state to lossy-convert, and on the following attempt it would set *identityInvalid to PR_TRUE as it does now, breaking the basic auth cycle. nsHttpBasicAuth::GenerateCredentials would then just convert according to the state object. We have to invalidate the state object when the challenge (actually the realm) changes in ChallengeReceived.

Relatively simple. Does that make sense?

Could this new code be potentially exploitable via a UTF-8 and lossy-convert username or password overlap (two usernames that convert to the same byte representation, one via UTF-8 and the other via lossy-convert, for instance)?
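For illustration only, here is a minimal Python sketch of the continuation state machine comment 85 describes (all names are hypothetical; the real implementation would live in nsHttpBasicAuth and use the Gecko interfaces named above):

```python
# Sketch, not Gecko code: try UTF-8 first; on a repeat 401 for the same
# realm, fall back to the lossy low-byte conversion; then give up and
# re-prompt the user (breaking the basic auth cycle).

UTF8, LOSSY, PROMPT = "utf-8", "lossy", "prompt"

class BasicAuthState:
    def __init__(self, realm):
        self.realm = realm
        self.mode = UTF8

    def challenge_received(self, realm):
        """Return True when the cached identity should be treated as invalid
        (the analogue of setting *identityInvalid to PR_TRUE)."""
        if realm != self.realm:
            # Realm changed: invalidate the state object, start over at UTF-8.
            self.realm, self.mode = realm, UTF8
            return False
        if self.mode == UTF8:
            # UTF-8 failed once: silently retry with the lossy conversion.
            self.mode = LOSSY
            return False
        # Both encodings failed: break the cycle and re-prompt.
        self.mode = PROMPT
        return True

    def generate_credentials(self, user_pass):
        if self.mode == LOSSY:
            return bytes(ord(c) & 0xFF for c in user_pass)
        return user_pass.encode("utf-8")
```

With this flow the same typed credentials are sent at most twice per realm, once per encoding, before the user is asked again.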
Comment 86 Robert Sayre 2010-01-13 13:26:39 PST
(In reply to comment #85)
> 
> Could this new code be potentially exploitable with a UTF-8 and lossy-convert
> user name or password overlap

I don't think so, but maybe I'm missing the point. How would the attack work? The credentials are pretty much in the clear, so what does it matter?
Comment 87 Johnny Stenback (:jst, jst@mozilla.com) 2010-01-28 16:40:00 PST
Not blocking 1.9.3 on this.
Comment 88 Boris Zbarsky [:bz] (Out June 25-July 6) 2010-02-15 18:19:47 PST
So one thing confusing me here... The attached patch doesn't change AppendQuotedString, right?  Why not?  See bug 546330.
Comment 89 Julian Reschke 2010-08-03 06:14:51 PDT
I have started work on an Internet Draft defining an auth param expressing the server's expected character encoding, see http://greenbytes.de/tech/webdav/draft-reschke-basicauth-enc-latest.html. Feedback, such as test cases, appreciated. Co-authors as well.
Comment 90 Andrew Clover 2010-08-03 15:57:44 PDT
@Julian: Looks fairly simple and watertight, good. The comma-separated syntax is consistent with Digest auth and of the browsers I have to hand, none have tripped on it -

Already use UTF-8 anyway:
    Opera 10
    Chrome 5

Uses a different encoding but has no problem with req param (so ASCII is still OK):
    Firefox 1-4  (UTF-16 lower bytes)
    Safari 4-5   (ISO-8859-1)
    IE:Mac 5     (MacRoman)
    IE 5-8       (CP_ACP)
    IEMobile 6-8 (CP_ACP)
    Netscape 4   (CP_ACP)
    Opera 5      (CP_ACP)

Assuming 'Deployment Advice' is to be aimed at webapp/server authors (rather than browser vendors), testing incoming credentials against both UTF-8 and ISO-8859-1 is a good start, though it should be stressed that this doesn't fully cover real-world browser behaviour.

The system codepage of Western Windows machines (cp1252) is close enough to ISO-8859-1 for the purposes of the simple Western European accented letters the encodings share, but other marks like smart quotes or the euro won't come through. And of course in other territories sites are left guessing what codepage their users have, like the Japanese sites where you can only log in with IE using cp932.
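The cp1252/ISO-8859-1 near-overlap described above is easy to verify with a quick sketch:

```python
# cp1252 and ISO-8859-1 agree on the accented letters Western European
# passwords typically contain, but diverge on cp1252-only characters
# such as the euro sign and smart quotes.
for ch in "äöüéñ":
    assert ch.encode("cp1252") == ch.encode("latin-1")

# The euro exists in cp1252 (at 0x80)...
assert "€".encode("cp1252") == b"\x80"

# ...but has no ISO-8859-1 representation at all.
try:
    "€".encode("latin-1")
except UnicodeEncodeError:
    print("euro sign is not representable in ISO-8859-1")
```

So a cp1252-typed password "just works" against an ISO-8859-1 server only as long as it stays inside the shared accented-letter range.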
Comment 91 Matthias Versen [:Matti] 2011-01-29 21:15:25 PST
*** Bug 629962 has been marked as a duplicate of this bug. ***
Comment 92 Nicholas Hurley [:nwgh][:hurley] 2011-03-24 12:59:46 PDT
Created attachment 521605 [details] [diff] [review]
Allow converting basic creds to UTF-8 before base64 encoding
Comment 93 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-03-25 10:02:23 PDT
It is more important to be compatible with the servers than it is to be compatible with Opera and Chrome, since the servers are the ones we have to work with. How do the most recent versions of IIS, Apache, Tomcat, and PHP (at least) decode this? If that's UTF-8, then great. If not, then we should provide the server a way to say "use UTF-8" instead. The enc parameter proposed by Julian seems like a reasonable approach, but also it could just be flagged in a separate header. Also, when we're encoding in UTF-8, we should send some indication (e.g. in another header field), besides the User-Agent, that we are using UTF-8, unless we have evidence that servers are already expecting UTF-8.

The idea of trying one encoding and then trying another will work poorly with sites that lock your account after a small number of login failures. Plus, it is added complexity that we don't need.
Comment 94 Julian Reschke 2011-03-25 10:18:41 PDT
(In reply to comment #93)
> It is more important to be compatible with the servers than it is to be
> compatible with Opera and Chrome, since the servers are the ones we have to
> work with. How do the most recent versions of IIS, Apache, Tomcat, and PHP (at
> least) decode this? If that's UTF-8, then great. If not, then we should provide
> the server a way to say "use UTF-8" instead. The enc parameter proposed by
> Julian seems like a reasonable approach, but also it could just be flagged in a
> separate header. Also, when we're encoding in UTF-8, we should send some

I believe we can get away with the new parameter; after all, that's what they are for.

> indication (e.g. in another header field), besides the User-Agent, that we are
> using UTF-8, unless we have evidence that servers are already expecting UTF-8.

See <http://greenbytes.de/tech/webdav/draft-reschke-basicauth-enc-00.html#rfc.section.B.3>.

I'm not convinced it's needed; could you explain a use case where the server could take advantage from that information?

(I'm not strictly against it; I'd just like to avoid information nobody needs).

> The idea of trying one encoding and then trying another will work poorly with
> sites that lock your account after a small number of login failures. Plus, it
> is added complexity that we don't need.

I agree that this might be a problem; on the other hand it might be the way of least resistance, although a hack.

-> http://greenbytes.de/tech/webdav/draft-reschke-basicauth-enc-01.html#rfc.section.A.1.1 (will mention this in the next revision)
Comment 95 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-03-25 12:28:52 PDT
(In reply to comment #94)
> I believe we can get away with the new parameter; after all, that's what they
> are for.

RFC2617 doesn't provide for new parameters for BASIC.

> > indication (e.g. in another header field), besides the User-Agent
> 
> I'm not convinced it's needed; could you explain a use case where the server
> could take advantage from that information?

Some servers will be able to just try to decode using multiple encodings until one works. But, if the account lockout controls are behind the web server (which is the case for some Mozilla services, for example), then this could lead to similar account-lockout problem I mentioned above for the case where clients retry with multiple encodings. If the server knows which encoding the client used then it can choose that encoding on the first try and avoid the issue. User-Agent isn't the appropriate way of conveying this information about the encoding and/or whether the UA ignored the server's indication of the encoding it wants to use.

The nice thing about Julian's parameter-based solution or a new server-sent header field is that it is easy for server admins to configure their servers to work with us without code changes, using mod_headers or similar, as long as we are willing to support encodings other than UTF-8 like cp932 (for Japanese) sites. Any other solution requires server code changes and means that we are still going to be incompatible with some existing sites.

My concern with Julian's parameter-based solution is that it seems like it will break some parsers (like Chrome's), and those parsers are compliant because RFC2617 doesn't require parsers to accept additional parameters. New header fields (one for Proxy-Auth and one for Auth) make more sense, considering maximizing interoperability between new and old implementations is the goal.
Comment 96 Julian Reschke 2011-03-25 12:41:28 PDT
(In reply to comment #95)
> (In reply to comment #94)
> > I believe we can get away with the new parameter; after all, that's what they
> > are for.
> 
> RFC2617 doesn't provide for new parameters for BASIC.

RFC 2617 says that the challenge can contain auth params, it just doesn't define any for Basic.

Anyway, the spec would update 2617 anyway. 

> > > indication (e.g. in another header field), besides the User-Agent
> > 
> > I'm not convinced it's needed; could you explain a use case where the server
> > could take advantage from that information?
> 
> Some servers will be able to just try to decode using multiple encodings until
> one works. But, if the account lockout controls are behind the web server

That sounds like a potential security hole. Do you have evidence of servers behaving like that? And if they do, wouldn't it make sense to always start with UTF-8?

> (which is the case for some Mozilla services, for example), then this could
> lead to similar account-lockout problem I mentioned above for the case where
> clients retry with multiple encodings. If the server knows which encoding the
> client used then it can choose that encoding on the first try and avoid the
> issue. User-Agent isn't the appropriate way of conveying this information about
> the encoding and/or whether the UA ignored the server's indication of the
> encoding it wants to use.
> 
> The nice thing about Julian's parameter-based solution or a new server-sent
> header field is that it is easy for server admins to configure their servers to
> work with us without code changes, using mod_headers or similar, as long as we
> are willing to support encodings other than UTF-8 like cp932 (for Japanese)
> sites. Any other solution requires server code changes and means that we are
> still going to be incompatible with some existing sites.
> 
> My concern with Julian's parameter-based solution is that it seems like
> it will break some parsers (like Chrome's), and those parsers are compliant

If it breaks Chrome, let's raise a bug report.

> because RFC2617 doesn't require parsers to accept additional parameters. New

It does:

  challenge   = auth-scheme 1*SP 1#auth-param

So recipients must accept params; it just doesn't require them to do anything useful with them for Basic.

If you believe there's a spec problem, then let's fix it in draft-ietf-httpbis-p7-auth (which replaces the authentication framework parts of 2617).

> header fields (one for Proxy-Auth and one for Auth) make more sense,
> considering maximizing interoperability between new and old implementations is
> the goal.

Implementations currently update fast. The only thing that would scare me would be a problem in IE8.
Comment 97 Nicholas Hurley [:nwgh][:hurley] 2011-04-06 16:10:18 PDT
The servers themselves don't do anything involving character set encoding/decoding of basic auth. To them, it's just a sequence of bytes that gets hashed using whatever hash they may use (if any) and compared to the stored password (hashed or not). So that brings us to the question of how are the stored hashes computed? In the case of apache with a htpasswd file, it's based entirely on whatever input encoding the user happens to be using when they run "htpasswd". For apache with some other backing store for the passwords, as well as PHP using some random backing store, it will be whatever encoding is used by whatever piece of software reads the password from the user and writes it (in hashed or otherwise form) to the backing store. Tomcat's hash generator program takes a "-e" argument to tell it what encoding to use. By default, IIS is tied to only using windows accounts, so I imagine that's all done using the system codepage (I couldn't find any documentation on using IIS with non-windows passwords in my quick googling).

So where does this leave us? For the most part, we're at the mercy of whoever wrote whatever software is requiring authentication, or worse yet, the user that set his/her own password. Even if we could get all UA vendors and all server vendors to implement a scheme where the UA tells the server what encoding it used, that doesn't do us much good unless everyone rewrites their custom-rolled basic auth systems to honor the server when it tells them "Hey, the UA gave me this password encoded in UTF-8".

Just using UTF-8 has the advantages of (1) not breaking ascii passwords that already work with fx just fine, and (2) giving us SOME sort of compatibility with non-latin character sets that makes some sort of sense, instead of just stripping out all but the lowest byte of whatever characters were input. We still won't be compatible with non-latin IIS installations that use the system codepage for usernames/passwords, but we weren't before. If UTF-8 is made the default, we also lose default compatibility with latin charsets in the case when a username/password contains a non-ascii character, but that is mitigated with a pref to enable or disable utf-8 encoding of basic auth info (see my current patch).
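To make the "sequence of bytes" point concrete, here is a small sketch (using SHA-1 purely for illustration; real htpasswd files use crypt, MD5, or bcrypt variants) of why the stored hash silently pins down a byte encoding the server never recorded:

```python
import hashlib

# A server stores only a hash of whatever bytes the password tool happened
# to receive, so the same characters typed under different locales yield
# different, mutually incompatible stored hashes.
password = "Müller"
stored = {enc: hashlib.sha1(password.encode(enc)).hexdigest()
          for enc in ("utf-8", "latin-1", "cp1252")}

# For this password latin-1 and cp1252 happen to agree; UTF-8 does not.
assert stored["latin-1"] == stored["cp1252"]
assert stored["utf-8"] != stored["latin-1"]

for enc, digest in stored.items():
    print(enc, digest)
```

A browser sending UTF-8 can therefore only log in if the tool that wrote the hash was also fed UTF-8 bytes, which is exactly the "mercy of whoever wrote the software" problem.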
Comment 98 Boris Zbarsky [:bz] (Out June 25-July 6) 2011-04-22 22:20:56 PDT
Comment on attachment 521605 [details] [diff] [review]
Allow converting basic creds to UTF-8 before base64 encoding

One nit.  This pattern:

>+    if (prefs)
>+        if (NS_FAILED(prefs->GetBoolPref(kUseUTF8, &utf8auth)))

is probably better as:

  if (prefs && NS_FAILED(prefs->GetBoolPref(kUseUTF8, &utf8auth)))

(less indentation and such).

sr=me with that fixed for getting this in and then at least asking some users in non-Western locales (and maybe some Western ones) to flip the pref and so we can gather some data on what servers are actually expecting....

We'll need to broadcast the need for such testing somewhat; maybe we can ask the Mozilla Europe and Mozilla Japan folks to help.  Please let me know if you need help with that part once you land the patch, ok?
Comment 99 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-05-11 00:17:58 PDT
(In reply to comment #96)
> RFC 2617 says that the challenge can contain auth params, it just doesn't
> define any for Basic.

>   challenge   = auth-scheme 1*SP 1#auth-param
> 
> So recipients must accept params; it just doesn't require them to do
> anything useful with them for Basic.

But, for BASIC it is restricted to:

    challenge   = "Basic" realm

Anyway, bug 656213 is now the bug about Gecko discovering what encoding the server wants us to use. By default, we will use UTF-8 unless/until there's a different consensus reached between us and the other browsers.
Comment 100 Julian Reschke 2011-05-11 00:20:11 PDT
(In reply to comment #99)
> (In reply to comment #96)
> > RFC 2617 says that the challenge can contain auth params, it just doesn't
> > define any for Basic.
> 
> >   challenge   = auth-scheme 1*SP 1#auth-param
> > 
> > So recipients must accept params; it just doesn't require them to do
> > anything useful with them for Basic.
> 
> But, for BASIC it is restricted to:
> 
>     challenge   = "Basic" realm

No, it's not "restricted". This is an extension point.

> Anyway, bug 656213 is now the bug about Gecko discovering what encoding the
> server wants us to use. By default, we will use UTF-8 unless/until there's a
> different consensus reached between us and the other browsers.

Using UTF-8 will break existing deployments that rely on ISO-8859-1.
Comment 101 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-05-11 00:56:26 PDT
(In reply to comment #100)
> Using UTF-8 will break existing deployments that rely on ISO-8859-1.

These ISO-8859-1 passwords already don't work in Firefox (unless they are already valid UTF-8), so it would not be a regression in our behavior. And, like I said, if/when there's consensus on an alternative solution, we will almost definitely implement that solution. (But, I'm not holding my breath on there being a consensus any time soon.)
Comment 102 Julian Reschke 2011-05-11 01:04:36 PDT
(In reply to comment #101)
> (In reply to comment #100)
> > Using UTF-8 will break existing deployments that rely on ISO-8859-1.
> 
> These ISO-8859-1 passwords already don't work in Firefox (unless they are

As far as I can tell, they do work, because Firefox just drops the top 8 bits, and thus ISO-8859-1 passes through unharmed. The same, IMHO, is true for IE. See the summary in comment 58.

> ...
Comment 103 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-05-11 17:02:39 PDT
(In reply to comment #102)
> As far as I can tell, they do work, because Firefox just drop the top 8
> bits, and thus ISO-8859-1 passes through unharmed. The same IMHO is true for
> IE. See summary in comment 58.

Thanks Julian. The key point that was overlooked previously is that code points 0-255 are the same in ISO-8859-1 and Unicode, so chopping off the high byte works pretty well for sites that are expecting ISO-8859-1, which is probably most of the Americas and Western Europe. Consequently, switching to UTF-8 by default would *not* be a regression-free change, as I think some were assuming.

Our goal here is to get non-ISO-8859-1 characters in usernames and passwords to work as well as the ISO-8859-1 characters already do. We don't have any evidence that switching to UTF-8 actually improves things here. We should avoid switching the default to UTF-8 unless we have some evidence that it actually helps. I filed two bugs blocking this one to improve things here.
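The "chopping off the high byte works for ISO-8859-1" observation can be verified mechanically (a quick sketch):

```python
# For every code point U+0000-U+00FF, keeping only the low byte of the
# UTF-16 code unit produces exactly the ISO-8859-1 encoding, which is why
# the old truncation behaviour "works" for Western European credentials.
for cp in range(0x100):
    ch = chr(cp)
    assert bytes([ord(ch) & 0xFF]) == ch.encode("latin-1")

# Beyond U+00FF, truncation silently corrupts: the euro sign U+20AC
# collapses to the unrelated byte 0xAC.
assert bytes([ord("€") & 0xFF]) == b"\xac"
print("truncation == ISO-8859-1 for U+0000..U+00FF")
```

That corruption above U+00FF is exactly the class of username/password this bug is about.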
Comment 104 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-05-11 17:07:42 PDT
Comment on attachment 521605 [details] [diff] [review]
Allow converting basic creds to UTF-8 before base64 encoding

I am r-'ing the patch because it would cause regressions in current behavior.
Comment 105 Boris Zbarsky [:bz] (Out June 25-July 6) 2011-05-11 18:31:31 PDT
Hold on.  Did you read the patch?  The patch adds a _pref_ to allow the user to switch to UTF-8 for the encoding for auth.  The default value of the pref is set such that the current ISO-8859-1 behavior is used, precisely because a wholesale switch to UTF-8 would not be backwards-compatible, but the patch gives users the _option_ of doing something else.  See comment 98 for why I think such a pref could be useful.
Comment 106 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2011-05-11 18:56:19 PDT
(In reply to comment #105)
> Hold on.  Did you read the patch?  The patch adds a _pref_ to allow the user
> to switch to UTF-8 for the encoding for auth.  The default value of the pref
> is set such that the current ISO-8859-1 behavior is used, precisely because
> a wholesale switch to UTF-8 would not be backwards-compatible, but the patch
> gives users the _option_ of doing something else.  See comment 98 for why I
> think such a pref could be useful.

All the information we've been given indicates that UTF-8 is the one encoding least likely to work. I expect the IE way (which is never UTF-8) to dominate, especially in Asian markets that are IE-centric like China, Korea, and Japan. The patch would be useful if it let the user select a non-UTF-8 encoding to use, because then the user could experiment with different settings and report back what encoding each of the sites they are testing is actually using, instead of being limited to "ISO-8859-1", "UTF-8", and "not ISO-8859-1 and not UTF-8".
Comment 107 Honza Bambas (:mayhemer) 2011-05-12 07:55:46 PDT
I tried the patch with IIS and an account name with diacritics.  Either way (w/o or w/ the patch and the pref turned on) I could not log in.

I can try the same on my virtual linux box, if someone else doesn't already have a test environment to try.
Comment 108 Jo Hermans 2011-08-16 03:50:20 PDT
*** Bug 576949 has been marked as a duplicate of this bug. ***
Comment 110 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2012-01-16 16:46:27 PST
Sorry, comment 109 is in the wrong bug.
Comment 111 Julian Reschke 2012-01-29 06:51:50 PST
(In reply to Nick Hurley [:hurley] from comment #97)
> The servers themselves don't do anything involving character set
> encoding/decoding of basic auth. To them, it's just a sequence of bytes that
> gets hashed using whatever hash they may use (if any) and compared to the
> stored password (hashed or not). So that brings us to the question of how
> are the stored hashes computed? In the case of apache with a htpasswd file,
> it's based entirely on whatever input encoding the user happens to be using
> when they run "htpasswd". For apache with some other backing store for the
> passwords, as well as PHP using some random backing store, it will be
> whatever encoding is used by whatever piece of software reads the password
> from the user and writes it (in hashed or otherwise form) to the backing
> store. Tomcat's hash generator program takes a "-e" argument to tell it what
> encoding to use. By default, IIS is tied to only using windows accounts, so
> I imagine that's all done using the system codepage (I couldn't find any
> documentation on using IIS with non-windows passwords in my quick googling).
> 
> So where does this leave us? For the most part, we're at the mercy of
> whoever wrote whatever software is requiring authentication, or worse yet,
> the user that set his/her own password. Even if we could get all UA vendors
> and all server vendors to implement a scheme where the UA tells the server
> what encoding it used, that doesn't do us much good unless everyone rewrites
> their custom-rolled basic auth systems to honor the server when it tells
> them "Hey, the UA gave me this password encoded in UTF-8".
> ...

Point taken; but in these cases the problem essentially is not solvable, right? After all, the *user* enters characters, not octets, in their password prompt.

I don't believe we should reject a fix just because it doesn't fix a problem that by definition can not be solved.
Comment 112 Nicholas Hurley [:nwgh][:hurley] 2012-01-29 14:16:16 PST
(In reply to Julian Reschke from comment #111) 
> Point taken; but in these cases the problem essentially is not solvable,
> right? After all, the *user* enters characters, not octets, in their
> password prompt.
> 
> I don't believe we should reject a fix just because it doesn't fix a problem
> that by definition can not be solved.

For the record, I'm fairly certain that was the point I was trying to make as well, but it's been so long since I made that comment that I'm a little fuzzy on the details :)
Comment 113 ggo 2013-04-05 10:31:29 PDT
My 2 cents: the current version of Firefox (20 under Windows) still does not work with a non-ASCII userid (at least with Digest authentication).
However, a non-ASCII password with an ASCII username is handled "correctly".

Actually, a non-ASCII password with a non-ASCII username is handled correctly until the username is added to the response header.
This is because the nsHttpDigestAuth::AppendQuotedString() method in nsHttpDigestAuth.cpp checks for non-ASCII characters and fails if any are found.

What about having a version of AppendQuotedString() (say, AppendUTF8QuotedString) that is used to append the username to the response header?

Actually, this is what I have done today: I downloaded, built, and modified nsHttpDigestAuth.cpp, rebuilt, and it seems to work very well (at least, it works like some other browsers do...), and it does NOT break anything AFAIU...

BTW, I would like to submit the modified version, but I'm not sure how to do it: can I just check out nsHttpDigestAuth.cpp and then check it in..?

Any comments are welcome.
Comment 114 ggo 2013-04-05 10:36:22 PDT
I've read that I must have this bug assigned to me to submit a modification.

Can someone with the required authority do that please?

Or how do I "take" the bug as suggested by the text beside the "Assigned To" field : "Nobody; OK to take it and work on it" ?

Thank you.
Comment 115 Patrick McManus [:mcmanus] 2013-04-05 10:47:11 PDT
btw, you don't really need to be assigned the bug to attach patches and request reviews. Note that there is already a patch here with an r- attached to it. Best to understand the issues there (by talking to the author, the reviewer, and other commenters) before reliving the experience :)
Comment 116 Nicholas Hurley [:nwgh][:hurley] 2013-04-05 11:09:00 PDT
As the author of that (long ago) r-'d patch, I was just about to chime in :)

I will start by saying this: please don't let any of my notes below discourage you from contributing to this (or any other!) bug. This is all just background information on why (unless people's minds have changed) I would be surprised if we accepted a "switch to UTF-8" approach for this bug. I definitely would like to see this bug fixed, though!

Now that that's been said, here's the info dump:

One concern (if I remember correctly) is that "just" changing auth to be UTF-8 (or having a pref for that, as in my patch) could easily lead to silent breakage from servers that have (over the 12+ years this bug has existed) been configured to expect our broken behavior.

Even worse, sending UTF-8 auth may fix some sites, but break others, so we would need to have some way to detect a server that broke with UTF-8 and change the encoding on the fly to fall back to ASCII.

I had a patch that, at one point, did the fallback, but there was yet another problem: some (many?) sites enforce a lockout after N failed login attempts, with 3 being a (in my experience) common value for N. So imagine the case where a user is logging into a server that breaks with UTF-8 but works with ASCII. Imagine again that the user enters their password incorrectly on the first try. We then have this flow of events:

1. User sends incorrect password, encoded incorrectly (first login failure)
2. We automatically fall back to the expected encoding, but still with the wrong password, and send that (second login failure)
3. User gets prompted for their password again, and enters the correct password
4. We send the correct password, encoded incorrectly (third login failure)

At this point, assuming N=3 for lockout, the user is locked out of their account, even though (in their experience) they've only tried entering their password twice, instead of the 3 chances that they are supposed to get.
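The flow above can be sketched as a toy simulation (all names hypothetical; lockout threshold N=3). After one wrong human attempt plus the silent fallback, the correct password still triggers the lockout:

```python
# Toy model of the lockout scenario: server expects Latin-1 bytes,
# client tries UTF-8 first and silently falls back to Latin-1.
class Server:
    def __init__(self, expected_bytes, lockout=3):
        self.expected = expected_bytes
        self.failures = 0
        self.lockout = lockout

    def check(self, raw_bytes):
        if self.failures >= self.lockout:
            return "locked"
        if raw_bytes == self.expected:
            return "ok"
        self.failures += 1
        return "fail"

def client_attempt(server, password):
    # Each human attempt can burn up to two server-side attempts.
    for enc in ("utf-8", "iso-8859-1"):
        result = server.check(password.encode(enc))
        if result != "fail":
            return result
    return "fail"

server = Server("sécret".encode("iso-8859-1"))
first = client_attempt(server, "sekret")    # wrong password: 2 failures
second = client_attempt(server, "sécret")   # right password: UTF-8 try is failure #3
```

After the second human attempt the account is already locked, matching the four-step flow described above.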

The final issue was one of incompleteness. Adding UTF-8 as an option might make a few more sites work, but the risk of breaking other sites (and the fact that it still would not fix many others) seems to have outweighed the limited gains we could have made with the "simple fix".

Extensions to the HTTP spec to allow the server to say "encode your authentication in encoding X" have been suggested, but I don't believe any of them have gotten anywhere.

Unfortunately a large portion of this discussion happened between me and bsmith in person, so there wasn't a record of it in this bug (or if there was, it's scattered in many different comments, which makes it hard to get at).
Comment 117 ggo 2013-04-06 00:15:45 PDT
Ok, thank you :)
Comment 118 ggo 2013-04-06 12:49:48 PDT
(In reply to Nick Hurley [:hurley] from comment #116)
> As the author of that (long ago) r-'d patch, I was just about to chime in :)
> [...]


Hi Nick,

First, thank you very much for your comments and the valuable information you provide.

I've seen your patch, and that there has been a lot of discussion about this issue, and that some thought it was not a good idea to send back UTF-8 encoded data (the username in this case).

I understand very well the need not to break existing solutions, but note that I am talking about Digest authentication;
I have not checked precisely how Basic authentication is implemented, because it is really unsafe (except over SSL).
Please correct me if I'm wrong, but I believe your patch is for Basic authentication.

What I understand from the existing Digest authentication code is that it already handles the creation of the response digest using UTF-8.
The code "fails" when adding the username to the response header, because only non-control ASCII characters are allowed, which is what the original specification says (once again, I may be wrong, but that is how it looks from what I have read so far).

Now, what will happen if the username is sent in the response header UTF-8 encoded (which is what some other browsers already do)?
1) A username containing only ASCII characters maps to the same byte string in UTF-8, so nothing breaks for existing implementations.
2) A username containing non-ASCII Unicode characters maps to a byte string containing non-ASCII (but not control) characters.

On the server side, the digest authentication module (called "DAM" below) receives a username containing bytes with the highest bit set to 1 (i.e. bytes with a value >= 128).

I think the "DAM" *should* detect this and, if it follows the specification strictly, reject the authentication attempt without any further processing.

But maybe the "DAM" won't do that, and some problems may occur here
(and maybe there are others I'm not aware of yet):

Problem: the "DAM" might just IGNORE the highest bit, and, if we are unlucky enough, the resulting username (called "username7bits" below) might match an existing userid. This leads to two possible consequences:

a) After the usual 3 login attempts, the "username7bits" is locked, as you mentioned.

b) Since MD5 has collisions (I'm no expert on this, I only know it like most of us), the digest answer *might* match what would have been the digest for "username7bits" + its password. In this unlikely case, the original user might be authenticated as "username7bits"...

Now, what is the original problem on the server side in this hypothetical implementation? The "DAM" truncates the original username bytes,
so... this is a problem with the server, not a problem with Firefox!
If a server does this, then it *must* be fixed on the server.
And the fix is simple: if UTF-8 support for usernames is not wanted on the server, then the authentication attempt has to be rejected without any further processing.
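The high-bit truncation above can be sketched in a few lines (illustrative Python; the username is hypothetical): stripping bit 7 from the UTF-8 bytes of a non-ASCII name yields a completely different, but valid, ASCII string:

```python
# Illustrative only: simulate a server-side module that masks off the
# high bit of each byte, producing the "username7bits" discussed above.
utf8_bytes = "rené".encode("utf-8")                  # b'ren\xc3\xa9'
username7bits = bytes(b & 0x7F for b in utf8_bytes)  # b'renC)'

# The result is syntactically valid ASCII, so a naive backend could
# accidentally match it against an unrelated existing account.
```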

Last words: the idea that Firefox would end up being the *only* web browser
restricted to ASCII-only usernames sounds bad to me.

So... I will try to submit the minor modifications to mozilla-central;
if they are rejected, there will probably be good reasons, regarding respect for the standards, or whatever.
Unfortunately, users who wish to use Unicode usernames with Digest authentication (and it does happen) will still not be able to use Firefox...

Maybe I'm completely wrong about this, so I would be glad if you and/or others wish to add comments/remarks.

Have a good day :)
Comment 119 ggo 2013-04-07 23:58:32 PDT
Created attachment 734492 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

Setting the network.auth.digest-response-header-username-utf8 pref to true (through about:config) enables the new behavior for the username in the digest authentication response header.
Comment 120 Patrick McManus [:mcmanus] 2013-04-08 04:39:03 PDT
Comment on attachment 734492 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

Hi - Thanks for the patch.

Please see
https://developer.mozilla.org/en-US/docs/Developer_Guide/How_to_Submit_a_Patch
for information on how the review flags work. You need to submit r? to someone to get a review. You added r+, which indicates you have given the code a positive review; you can't review your own code. You also set checkin+, which indicates the patch is ready for check-in, and we can't do that without the right reviews.

So your next step is to request a review. You probably want to request it from bsmith, because he had issues with the prior approach. You should also add to the comment trail in the bug how this effort differs from the last one (or why we should change our minds about the reasons Nick and Brian have mentioned in the comment trail).

Thanks for the work!
Comment 121 ggo 2013-04-08 06:19:37 PDT
(In reply to Patrick McManus [:mcmanus] from comment #120)

Hi, thank you for the information :)
I'll keep your reply in order to remember how to submit a patch correctly.
I will read the "how to submit a patch" documentation again in order to learn how to ask for a review.

Olivier.
Comment 122 ggo 2013-04-08 06:38:11 PDT
Comment on attachment 734492 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

This patch allows the HTTP digest authentication module to send back a UTF-8 encoded username in the response header, in order to enable support for non-ASCII usernames.
It can be enabled by setting the "network.auth.digest-response-header-username-utf8" flag to true in the FF config (about:config).

The patch could be improved by using the charset parameter as defined in RFC 2831, even if the flag is not set. I can probably do that if asked.
Comment 123 ggo 2013-04-22 09:23:22 PDT
Created attachment 740318 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

This patch allows the HTTP digest authentication module to send back a UTF-8 encoded username in the response header, in order to enable support for non-ASCII usernames.
It can be enabled by setting the "network.auth.digest-response-header-username-utf8" flag to true in the FF config (about:config).

The patch could be improved by using the charset parameter as defined in RFC 2831, even if the flag is not set. I can probably do that if asked.
Comment 124 Justin Dolske [:Dolske] 2013-04-22 13:09:16 PDT
Comment on attachment 740318 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

>diff -r fd264d551130 modules/libpref/src/init/all.js
>--- a/modules/libpref/src/init/all.js	Fri Apr 19 07:45:15 2013 -0400
>+++ b/modules/libpref/src/init/all.js	Mon Apr 22 18:13:28 2013 +0200
>@@ -1280,6 +1280,14 @@
> // Specify if the gss lib comes standard with the OS
> pref("network.negotiate-auth.using-native-gsslib", true);
> 
>+// Controls whether to allow sending back UTF8 username 
>+// or ASCII only characters in the digest authentication response header
>+// False: non-ascii usernames will cause the authentication to fail, 
>+// (which is the default behavior up to now).
>+// True: non-ascii usernames will be sent back UTF-8 encoded in the 
>+// in the digest authentication response header .
>+pref("network.auth.digest-response-header-username-utf8", false);
>+
> #ifdef XP_WIN
> 
> // Default to using the SSPI intead of GSSAPI on windows 
>diff -r fd264d551130 netwerk/protocol/http/nsHttpDigestAuth.cpp
>--- a/netwerk/protocol/http/nsHttpDigestAuth.cpp	Fri Apr 19 07:45:15 2013 -0400
>+++ b/netwerk/protocol/http/nsHttpDigestAuth.cpp	Mon Apr 22 18:13:28 2013 +0200
>@@ -156,6 +156,8 @@
>   return NS_OK;
> }
> 
>+static const char kAllowUTF8UserNameInResponseHeader[] = "network.auth.digest-response-header-username-utf8";
>+
> NS_IMETHODIMP
> nsHttpDigestAuth::GenerateCredentials(nsIHttpAuthenticableChannel *authChannel,
>                                       const char *challenge,
>@@ -178,6 +180,13 @@
>   bool isDigestAuth = !PL_strncasecmp(challenge, "digest ", 7);
>   NS_ENSURE_TRUE(isDigestAuth, NS_ERROR_UNEXPECTED);
> 
>+  // we work with ASCII around here
>+  nsCOMPtr<nsIPrefBranch> prefs = do_GetService(NS_PREFSERVICE_CONTRACTID);
>+  bool allowUTF8UserNameInResponseHeader = false; // Default to the old behavior
>+  if (prefs)
>+    if (NS_FAILED(prefs->GetBoolPref(kAllowUTF8UserNameInResponseHeader, &allowUTF8UserNameInResponseHeader)))
>+      allowUTF8UserNameInResponseHeader = false;
>+
>   // IIS implementation requires extra quotes
>   bool requireExtraQuotes = false;
>   {
>@@ -314,7 +323,10 @@
>   nsAutoCString authString;
> 
>   authString.AssignLiteral("Digest username=");
>-  rv = AppendQuotedString(cUser, authString);
>+  if (allowUTF8UserNameInResponseHeader)
>+    rv = AppendUTF8QuotedString(cUser, authString);
>+  else
>+    rv = AppendQuotedString(cUser, authString);
>   NS_ENSURE_SUCCESS(rv, rv);
> 
>   authString.AppendLiteral(", realm=");
>@@ -688,4 +700,40 @@
>   return NS_OK;
> }
> 
>+nsresult
>+nsHttpDigestAuth::AppendUTF8QuotedString(const nsACString & value,
>+                                     nsACString & aHeaderLine)
>+{
>+  nsAutoCString quoted;
>+  nsACString::const_iterator s, e;
>+  value.BeginReading(s);
>+  value.EndReading(e);
>+
>+  //
>+  // Encode string according to RFC 2616 quoted-string production, 
>+  // but with NON-ascii characters allowed.
>+  // (not a standard, but what other browsers already 
>+  // do to support username with unicode characters)
>+  quoted.Append('"');
>+  for ( ; s != e; ++s) {
>+    //
>+    // CTL = <any US-ASCII control character (octets 0 - 31) and DEL (127)>
>+    //
>+    if (((unsigned char)(*s)) <= 31 || *s == 127) {
>+      return NS_ERROR_FAILURE;
>+    }
>+
>+    // Escape two syntactically significant characters
>+    if (*s == '"' || *s == '\\') {
>+      quoted.Append('\\');
>+    }
>+
>+    quoted.Append(*s);
>+  }
>+  quoted.Append('"');
>+  aHeaderLine.Append(quoted);
>+  return NS_OK;
>+}
>+
>+
> // vim: ts=2 sw=2
>diff -r fd264d551130 netwerk/protocol/http/nsHttpDigestAuth.h
>--- a/netwerk/protocol/http/nsHttpDigestAuth.h	Fri Apr 19 07:45:15 2013 -0400
>+++ b/netwerk/protocol/http/nsHttpDigestAuth.h	Mon Apr 22 18:13:28 2013 +0200
>@@ -79,6 +79,11 @@
>     // append the quoted version of value to aHeaderLine
>     nsresult AppendQuotedString(const nsACString & value,
>                                 nsACString & aHeaderLine);
>+    // append the quoted version of value to aHeaderLine, 
>+    // with bytes values from 128 to 255 allowed 
>+    // (to be able to send back the utf8 encoded username)
>+    nsresult AppendUTF8QuotedString(const nsACString & value,
>+                                    nsACString & aHeaderLine);
> 
>   protected:
>     nsCOMPtr<nsICryptoHash>        mVerifier;
Comment 125 Justin Dolske [:Dolske] 2013-04-22 13:17:50 PDT
[Ack! Not sure what I fubared to trigger the last comment, I was just setting the ispatch flag...]

I think a global pref is a really bad idea. It makes the browser behave differently to the server in an undetectable way that breaks the spec, isn't a very good option if the user needs to log in to a different site that needs the opposite behavior, and all-around feels like a big footgun that can lead to the perception that Firefox is broken (perhaps long after the user has flipped the pref and forgotten about it).

Nor does it really address the problem for the vast majority of users, because we can't expect many people to be using this pref (most people don't change defaults, and also the above).
Comment 126 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2013-05-28 20:36:09 PDT
ggo98: Thank you very much for working on this.

I agree with Dolske regarding the non-value of having a pref. This kind of pref tends to make one website work while another website breaks.

Our goals should be to (a) work as well as we can with existing servers, and (b) push the web forward towards a reasonable way to use UTF-8 for HTTP auth. For (b), the new parameter-based mechanism Julian suggested above, tracked in bug 656213, seems like the way to go.

For (a), perhaps a not-completely-and-totally-horrible solution would be to modify the UI of the HTTP auth prompt so that the user can override the choice of encoding, just like we allow the user to override the choice of encoding in the good ol' charset menu. Is it bad UI? Well, it isn't great UI. However, saying that European and American people can use passwords in their languages, but others can't, is a horrible (at best) policy. And, HTTP basic/digest auth is rarely used, so few users would be exposed to the less-than-optimal-but-important UI.

Even if we were to implement (b), we will still need (a) to cope with servers that don't implement that new mechanism. And, I don't think it matters whether we implement (a) before (b) or vice-versa.

I doubt any Mozilla Corp. employees are going to sit down and fix this problem any time soon because of the other things we're busy with. But, luckily, we have some people who have stepped up to write some code to solve this problem. The pref is not the way to do that, but we should steer people towards a solution that is acceptable. So, here's my non-UX-designer suggested UI:

    Username: [                   ]
    Password: [                   ]
    Encoding: [                ][v]

"Encoding" is a drop-down list of encodings. It is disabled by default and its default value is "Automatic (ASCII)" which means ASCII. If the user types a non-ASCII username or a non-ASCII password, then the Encoding drop-down gets enabled and the user can see a list of encodings that looks similar to the good ol' charset menu, sorted and separated in the same way, which they can choose from. When the user chooses a non-ASCII encoding, then we send the auth data using that encoding, exactly like MSIE encodes it.

Bonus points (perhaps in a follow-up bug): filter the Encodings list down to those encodings that can represent the non-ASCII characters the user typed.

If we ever implement the mechanism tracked by bug 656213, or a similar mechanism, then we will disable the Encodings field when the server uses that mechanism, and set its value to the value that the server chose.

dolske: Does this seem like a reasonable approach, in general, to you? Is there somebody on Firefox or UX team that needs to be involved before we would suggest potential contributors take a go at implementing it, for us to review?

Does anybody else see any tragic flaws with changing the UI in this way, or a similar manner? Anybody got a better idea?
Comment 127 Brian Smith (:briansmith, :bsmith, use NEEDINFO?) 2013-05-28 20:38:13 PDT
Comment on attachment 740318 [details] [diff] [review]
Allow utf-8 encoded username in digest authentication response header.

Review of attachment 740318 [details] [diff] [review]:
-----------------------------------------------------------------

ggo, Thanks for the patch. I am going to r- it, based on the reasons given above. If you disagree and think that I should take a look at this patch, please respond to the comments that dolske and I made above and r?bsmith again. Thanks again!
Comment 128 Justin Dolske [:Dolske] 2013-05-28 21:29:50 PDT
Asking the user to deal with encoding issues in UI is just making things worse. :(

TBH, the only path suggested so far that sounds likely to fly is speccing a new auth method (like comment 59) that's explicitly UTF-8, and getting sites to use that. Everything else breaks stuff to some unknown degree.

Someone just needs to do that. It sounds straightforward, and should be pretty simple for servers and clients to implement.
Comment 129 Honza Bambas (:mayhemer) 2013-05-30 07:45:07 PDT
(In reply to Brian Smith (:bsmith) from comment #126)
>     Encoding: [                ][v]

My half-cent: it should be hidden until we detect non-ASCII chars, or until we get a 401/407.  Explanatory text would be good to show then as well, or we could just tell the user "we are going to try another way if you failed to auth".  Persisting the option or the detected encoding is probably a must.

So, not just a few lines of code anyway...
Comment 130 ggo 2013-06-24 23:39:39 PDT
Thanks, all, for your comments :)
I'll try to work on this again if necessary (unfortunately I don't have time these days..., so I'll "Reset Assignee to default" if I have the right to do it).
Comment 131 ggo 2013-06-24 23:58:50 PDT
(In reply to Brian Smith (:bsmith) from comment #127)

Hi Brian,

I don't disagree with the "r-".
I understand the patch is not perfect, and that a pref is not the best way, for the reasons explained (people usually don't play with prefs).
The only point is that I don't understand how it breaks anything, because:
1) without the pref set, only ASCII chars are supported (ISO-8859-1 chars are currently rejected, at least in the code I know: FF 19 & 20)
2) with the pref set (which may be acceptable under certain circumstances), ASCII chars are supported the same way as they are without the pref set, and chars outside the ASCII charset are too... but it's true that this way is not compliant with a standard method.
The most standard way, I think, would be to check the "charset" authentication directive (already suggested by others and me) in order to see whether the Web server supports the UTF-8 charset, as specified in RFC2831:
"charset
      This directive, if present, specifies that the server supports
      UTF-8 encoding for the username and password. If not present, the
      username and password must be encoded in ISO 8859-1 (of which
      US-ASCII is a subset). The directive is needed for backwards
      compatibility with HTTP Digest, which only supports ISO 8859-1.
      This directive may appear at most once; if multiple instances are
      present, the client should abort the authentication exchange.
"

Thank you,
Olivier.

> Comment on attachment 740318 [details] [diff] [review]
> Allow utf-8 encoded username in digest authentication response header.
> 
> Review of attachment 740318 [details] [diff] [review]:
> -----------------------------------------------------------------
> 
> ggo, Thanks for the patch. I am going to r- it, based on the reasons given
> above. If you disagree and think that I should take a look at this patch,
> please respond to the comments that dolske and I made above and r?bsmith
> again. Thanks again!
Comment 132 Julian Reschke 2013-06-25 00:43:44 PDT
(In reply to ggo from comment #131)
> ...
> The only point is that I don't understand what it breaks something , because:
> 1) without the pref set, ASCII chars only are supported (iso8859-1 chars are
> currently rejected, at least with the code I know : FF 19 & 20)

As far as I understand, FF currently uses ISO-8859-1. Are you saying it does not?

> 2) with the pref set (which may be asked/acceptable under certain
> circumstances), ASCII chars are supported the same way as they are without
> the pref set, and chars out of the ASCII charset are too... but it's true
> that this way is not compliant with a standard method.

Changing the default from ISO-8859-1 to UTF-8 will break those sites that rely on it being ISO-8859-1.

> The most standard way I think would be to check the "charset" authentication
> header (already suggested by others and me) in order to check if the Web
> server supports the UTF8 charset, as specified in RFC2831:

That parameter does not apply to Basic and/or Digest. That being said, you may want to help fix the issue in the specs. See <http://greenbytes.de/tech/webdav/draft-ietf-httpauth-basicauth-enc-latest.html>.

> ...
Comment 133 ggo 2013-06-25 01:26:15 PDT
(In reply to Julian Reschke from comment #132)
> (In reply to ggo from comment #131)
> > ...
> > The only point is that I don't understand what it breaks something , because:
> > 1) without the pref set, ASCII chars only are supported (iso8859-1 chars are
> > currently rejected, at least with the code I know : FF 19 & 20)
> 
> As far as I understand, FF currently uses ISO-8859-1. Are you saying it does
> not?

Yes, that's what I'm saying, for Digest Authentication.
see nsHttpDigestAuth::AppendQuotedString() in netwerk/protocol/http/nsHttpDigestAuth.cpp:

.
.

    //
    // CTL = <any US-ASCII control character (octets 0 - 31) and DEL (127)>
    //
    if (*s <= 31 || *s == 127) {
      return NS_ERROR_FAILURE;
    }
.
.

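One possible explanation for why this check also blocks ISO-8859-1 characters (an assumption about the build, not something the code excerpt states): on platforms where plain C `char` is signed, bytes 0x80-0xFF compare as negative values and therefore satisfy `*s <= 31`. That comparison can be simulated in Python:

```python
# Simulate the C check `if (*s <= 31 || *s == 127)` under the assumption
# that plain char is signed (common on x86 compilers).
import struct

def rejected(byte_value):
    # Reinterpret the byte as a signed char, as a signed *s would.
    signed = struct.unpack("b", bytes([byte_value]))[0]
    return signed <= 31 or signed == 127

# ASCII letters pass, control chars are rejected as the comment says,
# but so is every byte >= 0x80, e.g. 0xE9 ('é' in ISO-8859-1).
```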

> 
> > 2) with the pref set (which may be asked/acceptable under certain
> > circumstances), ASCII chars are supported the same way as they are without
> > the pref set, and chars out of the ASCII charset are too... but it's true
> > that this way is not compliant with a standard method.
> 
> Changing the default from ISO-8859-1 to UTF-8 will break those sites that
> rely on it being ISO-8859-1.

Agree with that. Except that, according to the code and some tests I have done, it does not seem to work with ISO-8859-1 at this time.

> 
> > The most standard way I think would be to check the "charset" authentication
> > header (already suggested by others and me) in order to check if the Web
> > server supports the UTF8 charset, as specified in RFC2831:
> 
> That parameter does not apply to Basic and/or Digest. That being said, you
> may want to help fixing the issue in the specs. See
> <http://greenbytes.de/tech/webdav/draft-ietf-httpauth-basicauth-enc-latest.
> html>.

Right, it does not apply to basic authentication.
However, if I understand correctly, RFC2831 (which talks about "HTTP Digest Access Authentication") seems to say the "charset" parameter could be used...


> 
> > ...

Thank you,
Olivier.
Comment 134 Julian Reschke 2013-06-25 01:54:20 PDT
(In reply to ggo from comment #133)
> (In reply to Julian Reschke from comment #132)
> > (In reply to ggo from comment #131)
> > > ...
> > > The only point is that I don't understand what it breaks something , because:
> > > 1) without the pref set, ASCII chars only are supported (iso8859-1 chars are
> > > currently rejected, at least with the code I know : FF 19 & 20)
> > 
> > As far as I understand, FF currently uses ISO-8859-1. Are you saying it does
> > not?
> 
> Yes, that's what I'm saying, for Digest Authentication.
> see nsHttpDigestAuth::AppendQuotedString() in
> netwerk/protocol/http/nsHttpDigestAuth.cpp:
> 
> .
> .
> 
>     //
>     // CTL = <any US-ASCII control character (octets 0 - 31) and DEL (127)>
>     //
>     if (*s <= 31 || *s == 127) {
>       return NS_ERROR_FAILURE;
>     }
> .
> .

What does rejecting control characters have to do with ISO-8859-1 support?

> > > 2) with the pref set (which may be asked/acceptable under certain
> > > circumstances), ASCII chars are supported the same way as they are without
> > > the pref set, and chars out of the ASCII charset are too... but it's true
> > > that this way is not compliant with a standard method.
> > 
> > Changing the default from ISO-8859-1 to UTF-8 will break those sites that
> > rely on it being ISO-8859-1.
> 
> Agree with that. Except that according to the code and some tests I have
> done, it seems that it does not work with ISO-8859-1 at this time.

Then it's probably time to write tests first.

> > > The most standard way I think would be to check the "charset" authentication
> > > header (already suggested by others and me) in order to check if the Web
> > > server supports the UTF8 charset, as specified in T:
> > 
> > That parameter does not apply to Basic and/or Digest. That being said, you
> > may want to help fixing the issue in the specs. See
> > <http://greenbytes.de/tech/webdav/draft-ietf-httpauth-basicauth-enc-latest.
> > html>.
> 
> Right, it does not apply to basic authentication.
> However if I understand correctly, RFC2831 (which talks about "HTTP Digest
> Access
>    Authentication") seems to say the "charset" parameter could be used...

But that RFC is not about the Digest HTTP auth mechanism.

Yes, the HTTPAUTH WG might end up adopting this approach for HTTP as well, but we're not there yet.

That being said, *if* we use parameters, we'll have first to fix the code that parses the WWW-Authenticate header field. See test cases at <http://greenbytes.de/tech/tc/httpauth/>.
Comment 135 ggo 2013-06-25 02:31:04 PDT
(In reply to Julian Reschke from comment #134)
> (In reply to ggo from comment #133)
> > (In reply to Julian Reschke from comment #132)
> > > (In reply to ggo from comment #131)
> > > > ...
> > > > The only point is that I don't understand why it would break something, because:
> > > > 1) without the pref set, ASCII chars only are supported (iso8859-1 chars are
> > > > currently rejected, at least with the code I know : FF 19 & 20)
> > > 
> > > As far as I understand, FF currently uses ISO-8859-1. Are you saying it does
> > > not?
> > 
> > Yes, that's what I'm saying, for Digest Authentication.
> > see nsHttpDigestAuth::AppendQuotedString() in
> > netwerk/protocol/http/nsHttpDigestAuth.cpp:
> > 
> > .
> > .
> > 
> >     //
> >     // CTL = <any US-ASCII control character (octets 0 - 31) and DEL (127)>
> >     //
> >     if (*s <= 31 || *s == 127) {
> >       return NS_ERROR_FAILURE;
> >     }
> > .
> > .
> 
> What does rejecting control characters have to do with ISO-8859-1 support?

You're right, that is what this code is supposed to do. But since s comes from an nsACString (i.e. plain chars), and since char is signed (at least by default), the "*s <= 31" test is also true for any char whose value would be >= 128 if it were stored in an unsigned char. So in the end, the only allowed char values are those > 31 and < 127.
(*** at least in the FF 20 code I have; maybe it has been changed in FF 21, I did not check yet...)

> 
> > > > 2) with the pref set (which may be asked/acceptable under certain
> > > > circumstances), ASCII chars are supported the same way as they are without
> > > > the pref set, and chars out of the ASCII charset are too... but it's true
> > > > that this way is not compliant with a standard method.
> > > 
> > > Changing the default from ISO-8859-1 to UTF-8 will break those sites that
> > > rely on it being ISO-8859-1.
> > 
> > Agree with that. Except that according to the code and some tests I have
> > done, it seems that it does not work with ISO-8859-1 at this time.
> 
> Then it's probably time to write tests first.

probably yes.

> 
> > > > The most standard way I think would be to check the "charset" authentication
> > > > header (already suggested by others and me) in order to check if the Web
> > > > server supports the UTF8 charset, as specified in T:
> > > 
> > > That parameter does not apply to Basic and/or Digest. That being said, you
> > > may want to help fixing the issue in the specs. See
> > > <http://greenbytes.de/tech/webdav/draft-ietf-httpauth-basicauth-enc-latest.
> > > html>.
> > 
> > Right, it does not apply to basic authentication.
> > However if I understand correctly, RFC2831 (which talks about "HTTP Digest
> > Access
> >    Authentication") seems to say the "charset" parameter could be used...
> 
> But that RFC is not about the Digest HTTP auth mechanism.
OK.

> 
> Yes, the HTTPAUTH WG might end up adopting this approach for HTTP as well,
> but we're not there yet.
> 
> That being said, *if* we use parameters, we'll have first to fix the code
> that parses the WWW-Authenticate header field. See test cases at
> <http://greenbytes.de/tech/tc/httpauth/>.

OK.

Thanks for the feedback :)
Comment 136 z.reddish.j 2013-10-19 21:24:55 PDT Comment hidden (me-too)
Comment 137 z.reddish.j 2013-10-19 21:35:09 PDT Comment hidden (off-topic)
Comment 138 for nothing 2015-04-02 23:09:22 PDT
Created attachment 8587817 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest-15

This is incompatible with Opera and Chromium (they just send the username as raw UTF-8 bytes without transport encoding), but it follows http://tools.ietf.org/html/draft-ietf-httpauth-digest-15. Even though draft-ietf-httpauth-digest-15 is only a draft, it is unlikely to change significantly at this point, so at least the charset parameter can already be used.
Comment 139 Patrick McManus [:mcmanus] 2015-04-03 05:37:12 PDT
Comment on attachment 8587817 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest-15

Review of attachment 8587817 [details] [diff] [review]:
-----------------------------------------------------------------

honza, I think you have the most domain knowledge here and have shown an interest in bug 656213... but let me know if the review should be rerouted
Comment 140 Honza Bambas (:mayhemer) 2015-04-07 14:45:10 PDT
Created attachment 8589323 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest-15 (refreshed as it should be)
Comment 141 Honza Bambas (:mayhemer) 2015-04-07 14:51:48 PDT
I'll try to get to this within a few days.
Comment 142 Honza Bambas (:mayhemer) 2015-06-22 03:30:35 PDT
FYI: review is in progress!
Comment 143 Honza Bambas (:mayhemer) 2015-07-17 10:33:09 PDT
Review draft lost!!!!  I have to start all over again!
Comment 144 Jason Duell [:jduell] (needinfo? me) 2015-08-04 17:59:10 PDT
*** Bug 656213 has been marked as a duplicate of this bug. ***
Comment 145 Honza Bambas (:mayhemer) 2016-01-06 05:06:03 PST
Comment on attachment 8589323 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest-15 (refreshed as it should be)

Review of attachment 8589323 [details] [diff] [review]:
-----------------------------------------------------------------

There are plenty of formatting issues.  The original patch was not submitted with 8 lines of context and function names.  Please read https://developer.mozilla.org/en-US/docs/Mozilla/Developer_guide/Coding_Style and maybe also https://developer.mozilla.org/en-US/docs/Mozilla/Developer_guide/How_to_Submit_a_Patch

::: netwerk/protocol/http/nsHttpBasicAuth.cpp
@@ +75,3 @@
>      nsAutoCString userpass;
> +    nsAutoCString charset;
> +    if(strstr(challenge,"charset=")){

please don't use strstr or any of the CRT string functions.  Use mozilla::Tokenizer (see bug 1024056).  The challenge header should be parsed correctly.

@@ +75,4 @@
>      nsAutoCString userpass;
> +    nsAutoCString charset;
> +    if(strstr(challenge,"charset=")){
> +	const char *p = strstr(challenge,"charset=") + 8;

this is very unsafe...

@@ +95,5 @@
> +	if (password)
> +		AppendUTF16toUTF8(password, userpass);
> +    }
> +    else{
> +	//actually it is ISO-8859-1

if charset == "" (or there is no "charset" option in the challenge); otherwise, on unknown values, we should fail the authentication.  Better to be safe.

::: netwerk/protocol/http/nsHttpDigestAuth.cpp
@@ -61,2 @@
>    if (NS_FAILED(rv)) return rv;
> -

please leave the blank lines that make the code more readable.  and add some to your new code as well.

@@ -204,5 @@
>    if (NS_FAILED(rv)) {
>      LOG(("nsHttpDigestAuth::GenerateCredentials [ParseChallenge failed rv=%x]\n", rv));
>      return rv;
>    }
> -

again, why are you unreasonably removing blank lines?

@@ +302,5 @@
> +  else{
> +	//this true lossy convert UTF16 to ASCII
> +	//ASCII is an 7-bit encoding
> +    const char16_t *p = username;
> +		cUser.Assign(*p % 128);

you really can't do this...  who is using real 7-bit ASCII these days?  the correct way is escaping, and that's probably not what you want here, is it?

TODO for me: check the spec

@@ +338,5 @@
> +  if(userhash == 2){
> +	  char hashuser[EXPANDED_DIGEST_SHA256_LENGTH+1];
> +    	  cUser.Append(":");
> +    	  cUser.Append(realm);
> +	  aHash(cUser.get(), cUser.Length(), algorithm);

check result

@@ +345,5 @@
> +	  authString += hashuser;
> +	  authString += '\"';
> +	  goto appendRealm;
> +  }
> +  if(charset.EqualsLiteral("ISO-8859-1") || charset.EqualsLiteral("UTF-8")){

please use else if () instead of a goto here.

@@ +353,5 @@
> +	  else{
> +		authString.AssignLiteral("Digest username*=UTF-8\'\'");
> +	  }
> +	  nsAutoCString escUser;
> +	  NS_Escape(cUser, escUser,  url_XAlphas);

TODO for me: check this escaping

@@ +453,5 @@
> +	  len = 2*EXPANDED_DIGEST_SHA256_LENGTH + nonce.Length() + 2;
> +  }
> +  else{
> +	  len = 2*EXPANDED_DIGEST_LENGTH + nonce.Length() + 2;
> +  }

please have functions DigestLength(algorithm) and ExpadedDigestLength(algorithm).  It will make the code much cleaner and clearer in many places.  When |algorithm| is not one of the expected values, have those functions MOZ_ASSERT(false);

These particular lines should then be:
uint32_t len = nonce.Length() + 2 + ExpadedDigestLength(algorithm) * 2;

@@ +454,5 @@
> +  }
> +  else{
> +	  len = 2*EXPANDED_DIGEST_LENGTH + nonce.Length() + 2;
> +  }
> +  if (qop == QOP_AUTH || qop == QOP_AUTH_INT) {

TODO for me: check this condition

@@ -468,2 @@
>    nsAutoCString contents;
> -  contents.SetCapacity(len + 1);

all these calculations are perf optimizations.  but definitely not critical.

::: netwerk/protocol/http/nsHttpDigestAuth.h
@@ +84,3 @@
>  
>      // result is in mHashBuf
> +    nsresult aHash(const char *buf, uint32_t len, uint16_t algorithm);

Hash or DoHash
Comment 146 Honza Bambas (:mayhemer) 2016-01-06 05:07:37 PST
Comment on attachment 8587817 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest-15

Please see review comments in comment 145.  I'm really sorry for such a huge delay.  This patch simply got lost among all my other reviews and more priority work.  I'll do better next time.
Comment 147 for nothing 2016-02-01 19:34:31 PST
Created attachment 8714628 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest

About it:

> +  else{
> +	//this true lossy convert UTF16 to ASCII
> +	//ASCII is an 7-bit encoding
> +    const char16_t *p = username;
> +		cUser.Assign(*p % 128);

you really can't do this...  who is using real 7-bit ASCII these days?  the correct way is escaping, and that's probably not what you want here, is it?

I cannot do otherwise. Escaping requires "charset" with the extended notation username*=charset''username; without it, we are required to use an ASCII quoted-string, or not use the ABNF at all.
Comment 148 Masatoshi Kimura [:emk] 2016-02-01 22:17:47 PST
First of all, the internet draft is RFC 7616 (for digest auth)/7617 (for basic auth) now.

(In reply to for nothing from comment #147)
> I can not do otherwise. For escaping required "charset" to use extended
> notation username*=charset''username, without it required to use quoted
> string ascii or do not use at all ABNF.

You don't have to care about charsets other than UTF-8. UTF-8 is the only valid value per RFC 7616. For all other charset values (or servers that do not send the charset parameter), just use |username="<username without escape>"| as before (that is, the RFC 2617-compatible header). The quoted-string production can contain non-ASCII octets.

https://tools.ietf.org/html/rfc7230#section-3.2.6
>     quoted-string  = DQUOTE *( qdtext / quoted-pair ) DQUOTE
>     qdtext         = HTAB / SP / %x21 / %x23-5B / %x5D-7E / obs-text
>     obs-text       = %x80-FF

Although obs-text is obsolete, the RFC 2617-compatible header is obsolete by definition anyway. It will be needed for compatibility with old servers; RFC 7616-unaware servers are unlikely to support "username*" or "userhash".
Comment 149 Honza Bambas (:mayhemer) 2016-02-04 06:39:14 PST
Comment on attachment 8714628 [details] [diff] [review]
to do here as draft-ietf-httpauth-digest

The patch is malformed at the binary level and cannot be applied.  Please submit the result of hg diff -U 8 -p (or qdiff).
Comment 150 for nothing 2016-02-08 09:46:29 PST
Created attachment 8717027 [details] [diff] [review]
to do here as rfc 7616
Comment 151 Honza Bambas (:mayhemer) 2016-04-22 06:33:51 PDT
Comment on attachment 8717027 [details] [diff] [review]
to do here as rfc 7616

Review of attachment 8717027 [details] [diff] [review]:
-----------------------------------------------------------------

Thanks for the update and my apologies for taking that long to respond.

I'll ask you to do one major thing: please split this patch into two pieces, one for each type of authentication.  One for basic auth, which I believe will be easier to bring to the final state, and a second for digest auth.  This will make progress on this bug faster.  Thanks.

::: netwerk/protocol/http/nsHttpBasicAuth.cpp
@@ +78,5 @@
> +    nsAutoCString charset;
> +    Tokenizer p(challenge);
> +    Tokenizer::Token t;
> +    while (p.Next(t)) {
> +      if (t.AsString() == "charset" && p.Next(t) && (t.AsChar() == '=')) {

Note: p.Next(t) && (t.AsChar() == '=') can better be written as p.CheckChar('=')

@@ +80,5 @@
> +    Tokenizer::Token t;
> +    while (p.Next(t)) {
> +      if (t.AsString() == "charset" && p.Next(t) && (t.AsChar() == '=')) {
> +        p.Record();
> +        while (p.Next(t) && !t.Equals(Tokenizer::Token::Char(',')));

Note: now we have a convenient ReadUntil method

@@ +84,5 @@
> +        while (p.Next(t) && !t.Equals(Tokenizer::Token::Char(',')));
> +        p.Claim(charset);
> +        charset.StripChar('"', 0);
> +      }
> +    }

You must respect the grammar here.  This (even if your code were written correctly, by the way) would match any occurrence of "charset" within the string.

The grammar is:

Schema header="value"[,header="value"]*

It also allows white space (including CR/LF) between each element.  I think we are OK with having just a necessary subset of https://tools.ietf.org/html/rfc5234#section-4

Roughly looking at the spec, it seems the header name is what Tokenizer recognizes as a "word", with "-" added as a word char (see the constructor).  The value seems to be anything between the quotes, delimited by ','.  Not sure how quotes are escaped.

You will need to write a better loop for it or even a simple recursive descent (optionally by deriving from Tokenizer).  You have to walk the headers one by one and pick charset value.

@@ +91,5 @@
> +      userpass.Append(':');
> +      if (password) {
> +        AppendUTF16toUTF8(password, userpass);
> +      }
> +    } else {

we must fail when charset is present and is something other than "UTF-8".  The spec seems to allow only that value.

::: netwerk/protocol/http/nsHttpDigestAuth.cpp
@@ +212,5 @@
>  
> +  char ha1_digest[EXPANDED_DIGEST_SHA256_LENGTH+1];
> +  char ha2_digest[EXPANDED_DIGEST_SHA256_LENGTH+1];
> +  char response_digest[EXPANDED_DIGEST_SHA256_LENGTH+1];
> +  char upload_data_digest[EXPANDED_DIGEST_SHA256_LENGTH+1];

I'd slightly prefer to have a LONGEST_DIGEST_LENGTH define, simply defined as the longest of the defined hash lengths, so that when a new one is added we just redefine it (when needed) and don't crash.

@@ +319,5 @@
> +    for (;*p != 0;p++) {
> +      if (*p % 128 >= 32) {
> +        cUser.Append(*p % 128);
> +      }
> +    }

can you please explain what you are doing here?  we might already have some helper functions for this (if I knew what's going on).

@@ +358,5 @@
>    //
>  
>    nsAutoCString authString;
>  
> +  if (userhash == 2) {

please have defines for possible values of |userhash| to make it clear what is happening here.

@@ +374,5 @@
> +    if (charset.EqualsLiteral("ISO-8859-1")) {
> +      authString.AssignLiteral("Digest username*=ISO-8859-1\'\'");
> +    } else {
> +      authString.AssignLiteral("Digest username*=UTF-8\'\'");
> +    }

maybe: 

authString.AssignLiteral("Digest username*=")
authString.Append(charset);

?

@@ +405,2 @@
>    } else {
> +    authString.AppendLiteral("MD5");

We need to know when we don't understand the algorithm.  But here, for unknown values, you simply fall back to MD5, which is the default only when the "algorithm" header is not present at all.  That is the whole purpose of the ALGO_SPECIFIED flag, which your patch effectively removes.

@@ +722,5 @@
>      else if (nameLength == 9 &&
>          nsCRT::strncasecmp(challenge+nameStart, "algorithm", 9) == 0)
>      {
>        // we want to clear the default, so we use = not |= here
>        *algorithm = ALGO_SPECIFIED;

with this usage, this should be renamed to ALGO_SPECIFIED_UNKNOWN

@@ +739,3 @@
>        else if (valueLength == 8 &&
> +               nsCRT::strncasecmp(challenge+valueStart, "MD5-sess", 8) == 0) {
> +        *algorithm = ALGO_MD5_SESS;

should we pick the strongest algo when this header is present more than once?  or should we ignore the challenge completely when it is present more than once?

::: netwerk/protocol/http/nsHttpDigestAuth.h
@@ +24,4 @@
>  #define QOP_AUTH 0x01
>  #define QOP_AUTH_INT 0x02
>  
>  #define DIGEST_LENGTH 16

rename to DIGEST_MD5_LENGTH

@@ +91,5 @@
>      nsresult AppendQuotedString(const nsACString & value,
>                                  nsACString & aHeaderLine);
>  
> +    int16_t DigestLength(int16_t algorithm);
> +    int16_t ExpadedDigestLength(int16_t algorithm);

Expanded? (typo)
Comment 152 for nothing 2016-05-09 23:14:02 PDT
Created attachment 8750616 [details] [diff] [review]
patch for basic auth

I don't know in which bug the patch for basic authentication should go, so I leave it here.
Comment 153 for nothing 2016-05-09 23:21:33 PDT
Created attachment 8750619 [details] [diff] [review]
patch for digest auth

This one is for digest auth.
Comment 154 Honza Bambas (:mayhemer) 2016-05-11 02:05:39 PDT
Comment on attachment 8750616 [details] [diff] [review]
patch for basic auth

Review of attachment 8750616 [details] [diff] [review]:
-----------------------------------------------------------------

::: netwerk/protocol/http/nsHttpBasicAuth.cpp
@@ +78,5 @@
> +    nsAutoCString charset;
> +    Tokenizer p(challenge);
> +    Tokenizer::Token t;
> +    while (p.Next(t)) {
> +      if (t.AsString() == "charset" && p.CheckChar('=')) {

This is still not a grammar-like parsing loop, sorry.  Also, doing t.AsString() on a returned token w/o checking its type first is wrong.  The correct way to check it's a word and has an expected content is to do:

t.Equals(Tokenizer::Token::Word("charset"))

which does the correct type and value checks for you.  Just read the comments in https://mxr.mozilla.org/mozilla-central/source/xpcom/ds/Tokenizer.h to better understand how it works.

@@ +91,5 @@
> +    }
> +    if (!charset.IsEmpty()) {
> +      if (charset.EqualsLiteral("UTF-8")) {
> +        CopyUTF16toUTF8(user, userpass);
> +        userpass.Append(':');

please leave the "// always send a ':' (see bug 129565)" comment
Comment 155 Honza Bambas (:mayhemer) 2016-05-17 10:04:31 PDT
Comment on attachment 8750616 [details] [diff] [review]
patch for basic auth

Oops.
Comment 156 Honza Bambas (:mayhemer) 2016-05-17 11:08:12 PDT
Comment on attachment 8750619 [details] [diff] [review]
patch for digest auth

Review of attachment 8750619 [details] [diff] [review]:
-----------------------------------------------------------------

Thanks.  This looks mostly good, just a few formatting issues and details to be updated.

The other thing we need is a test.  Probably just expand https://mxr.mozilla.org/mozilla-central/source/netwerk/test/unit/test_authentication.js to also exercise the SHA-256 code.  The more corners of your added/modified code you cover, the better - ideally, all of them.

::: netwerk/protocol/http/nsHttpDigestAuth.cpp
@@ +59,5 @@
>    }
>  
> +  if (algorithm & ALGO_SHA256 || algorithm & ALGO_SHA256_SESS) {
> +    rv = mVerifier->Init(nsICryptoHash::SHA256);
> +  } else {

for safety, you should check for the ALGO_MD5* flags here as well.  and NS_ERROR("Algorithm not set")+return NS_ERROR_UNEXPECTED when none of the flags is found.

@@ +296,5 @@
>    // calculate credentials
>    //
>  
> +  if (!charset.IsEmpty()) {
> +    if (!charset.EqualsLiteral("UTF-8")) {

if (!charset.IsEmpty() && !charset.EqualsLiteral("UTF-8"))

@@ +324,5 @@
>    //
>  
>    nsAutoCString authString;
>  
> +  if (userhash & USERHASH_TRUE) {

just == ?

@@ +329,5 @@
> +    char hashuser[EXPANDED_DIGEST_LENGTH+1];
> +    cUser.Append(":");
> +    cUser.Append(realm);
> +    rv = DoHash(cUser.get(), cUser.Length(), algorithm);
> +    if (NS_FAILED(rv)) return rv;

if (NS_FAILED(rv)) {
  return rv;
}

or better

NS_ENSURE_SUCCESS(rv, rv);

@@ +334,5 @@
> +    ExpandToHex(mHashBuf, algorithm, hashuser);
> +    authString.AssignLiteral("Digest username=\"");
> +    authString += hashuser;
> +    authString += '\"';
> +  } else if (charset.EqualsLiteral("UTF-8")) {

would be nice to somehow cache this test

@@ +339,5 @@
> +    authString.AssignLiteral("Digest username*=");
> +    authString.Append(charset);
> +    authString.AppendLiteral("\'\'");
> +    nsAutoCString escUser;
> +    NS_Escape(cUser, escUser,  url_XAlphas);

two spaces after the second comma

@@ +400,5 @@
>    }
>  
> +  if (userhash) {
> +    authString.AppendLiteral(", userhash=");
> +    if (userhash & USERHASH_TRUE) {

userhash == USERHASH_TRUE ?  I didn't really mean these should be flags, just named constants to make it clearer what's going on.  They should not be combined.

@@ +475,3 @@
>  {
> +  int16_t index, value, digestLen;
> +  digestLen = DigestLength(algorithm);

nit, maybe just:

int16_t index, value, digestLen = DigestLength(algorithm);

@@ +489,5 @@
>      else
>        result[(index*2)+1] = value - 10 + 'a';
>    }
>  
> +  result[digestLen * 2] = 0;

ExpandedDigestLength(algorithm) instead of digestLen * 2 please

@@ +601,4 @@
>    *stale = false;
>    *algorithm = ALGO_MD5; // default is MD5
>    *qop = 0;
> +  uint16_t algo = 0;

if you really need this, please declare before first use

@@ +668,5 @@
> +    else if (nameLength == 7 &&
> +	      nsCRT::strncasecmp(challenge+nameStart, "charset", 7) == 0)
> +    {
> +      if (!charset.EqualsLiteral("UTF-8")) {
> +        charset.Assign(challenge+valueStart, valueLength);

please explain what the condition exactly means.  are you just ignoring everything but UTF-8?  a comment is appropriate here.

@@ +713,5 @@
> +        algo = ALGO_MD5_SESS;
> +      }
> +      if (*algorithm < algo){
> +      *algorithm |= algo;
> +      }

isn't it OK to just always do *algorithm |= algo;?  we select strongest algo later in GenerateCredentials anyway.

nit: indention and spaces.

@@ +779,5 @@
> +nsHttpDigestAuth::DigestLength(int16_t algorithm)
> +{
> +  MOZ_ASSERT(algorithm >= ALGO_SPECIFIED && algorithm <= ALGO_SHA256_SESS + 1);
> +  int16_t len;
> +  if (algorithm & ALGO_SHA256 || algorithm & ALGO_SHA256_SESS) {

nit: algorithm & (ALGO_SHA256 | ALGO_SHA256_SESS)

::: netwerk/protocol/http/nsHttpDigestAuth.h
@@ +30,3 @@
>  #define NONCE_COUNT_LENGTH 8
> +#define USERHASH_FALSE 0x01
> +#define USERHASH_TRUE 0x02

We may also want USERHASH_UNKNOWN or _NOT_SET defined as 0 which would be the default value.
