<a class="header-button" href="https://bugzilla.mozilla.org/home" title="Go to home page"> Bugzilla

Comment 6

•

12 years ago

Comment on attachment 718718 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (ICU stubs) Review of attachment 718718 [details] [diff] [review]: ----------------------------------------------------------------- Bunches of style nits, but otherwise fine enough. ::: js/src/builtin/Intl.cpp @@ +20,5 @@ > #include "vm/Stack.h" > > #include "jsobjinlines.h" > > +#include "unicode/utypes.h" Assuming this is a reference to intl/icu/source/common/unicode/utypes.h, is this patch predicated on starting to use ICU? Right now it looks like this patch wouldn't build. @@ +22,5 @@ > #include "jsobjinlines.h" > > +#include "unicode/utypes.h" > + > +using namespace icu; Hmm. We *are* the implementers of namespace js, so it's okay to open that. Here...this looks dodgy. I think probably we want to have the icu:: prefix in all the type names for this. Either that, or we should have |using icu::Foo;| for the particular symbols we want. We've run into issues opening up whole non-SpiderMonkey namespaces in the past, so once bitten, twice shy. @@ +30,5 @@ > +/******************** ICU stubs ********************/ > + > +#if !ENABLE_INTL_API > + > +/* When the Internationalization API isn't enabled, we also shouldn't link /* * When the... is SpiderMonkey comment style for multiline comments. @@ +36,5 @@ > + * bit rot. The following stub implementations for ICU functions make this > + * possible. The functions using them should never be called, so they assert > + * and return error codes. Signatures adapted from ICU header files locid.h, > + * numsys.h, ucal.h, ucol.h, udat.h, udatpg.h, uenum.h, unum.h; see the ICU > + * directory for license. Hmm. Historically we haven't done it this way, and predictably bitrot has ensued. In the long run I expect we won't want to allow disabling ICU, and this will die. In the short run, eh. We can do it this way if it (probably) helps you out. :-) @@ +46,5 @@ > + JS_ASSERT(false); > + return 0; > +} > + > +typedef struct UEnumeration UEnumeration; struct UEnumeration; @@ +49,5 @@ > + > +typedef struct UEnumeration UEnumeration; > + > +static int32_t > +uenum_count(UEnumeration* en, UErrorCode* status) SpiderMonkey style has * next to the variable name, sadly. (Applies everywhere else here, too.) @@ +51,5 @@ > + > +static int32_t > +uenum_count(UEnumeration* en, UErrorCode* status) > +{ > + JS_ASSERT(false); Let's make these all MOZ_NOT_REACHED("<function name>: Intl API disabled"); @@ +59,5 @@ > + > +static const char* > +uenum_next(UEnumeration* en, > + int32_t* resultLength, > + UErrorCode* status) This should all be on one line if it fits in 99ch. Same throughout. @@ +93,5 @@ > + UCOL_ON = 17, > + UCOL_SHIFTED = 20, > + UCOL_LOWER_FIRST = 24, > + UCOL_UPPER_FIRST = 25, > +} UColAttributeValue; enum UColAttributeValue { ... }; and the same for all the enums here. @@ +135,5 @@ > +ucol_strcoll( const UCollator *coll, > + const UChar *source, > + int32_t sourceLength, > + const UChar *target, > + int32_t targetLength) Fix the bizarre whitespacing, please. There are a few others, too, below. @@ +159,5 @@ > + return NULL; > +} > + > +typedef struct UParseError UParseError; > +typedef struct UFieldPosition UFieldPosition; Just forward-declare these structs too. @@ +251,5 @@ > +} > + > +class Locale { > +public: > + Locale( const char * language, public: should be indented two spaces. All the parameters should be |const char *foo| or |const char *foo = 0| as well. Looks like a search for "* " in these changes would point out a bunch of places. @@ +267,5 @@ > +} > + > +class NumberingSystem { > +public: > + static NumberingSystem* createInstance(const Locale & inLocale, UErrorCode& status); & by parameter name.

Attachment #718718 - Flags: review?(jwalden+bmo) → review+

Comment 7

•

12 years ago

Comment on attachment 718720 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 1) Review of attachment 718720 [details] [diff] [review]: ----------------------------------------------------------------- ::: js/src/builtin/Intl.cpp @@ +15,5 @@ > #include "jsatom.h" > #include "jscntxt.h" > #include "jsinterp.h" > #include "jsobj.h" > +#include "jstypes.h" What types are you accessing through this #include? I'm surprised you wouldn't already have them here. @@ +29,5 @@ > +#include "unicode/numsys.h" > +#include "unicode/ucal.h" > +#include "unicode/ucol.h" > +#include "unicode/udat.h" > +#include "unicode/udatpg.h" I assume all these horrible header names date back to 8.3 days? :-) @@ +480,5 @@ > +(* GetAvailable)(int32_t localeIndex); > + > +static bool > +intl_availableLocales(JSContext *cx, CountAvailable countAvailable, > + GetAvailable getAvailable, MutableHandleValue result) Needs whitespace formatting fixing. I find myself wondering if this wouldn't be much better as a HashSet<const char *>, or something. I guess let's run with this for now, and we can figure out something better later if it comes to it. ...or is it this way because it feeds directly into self-hosted code? Then I guess my assumption that [[availableLocales]] wasn't actually a thing in implementations was wrong! I'd gotten the impression it was sort of a latent property of the system that arbitrates internationalization queries. @@ +483,5 @@ > +intl_availableLocales(JSContext *cx, CountAvailable countAvailable, > + GetAvailable getAvailable, MutableHandleValue result) > +{ > +#if ENABLE_INTL_API > + int32_t count = countAvailable(); This always returns a value >= 0, right? Please use uint32_t, then, and for the other available-locales-indexes. If ICU mandates int32_t (...not int, or some other integer type, seeing as <stdint.h> didn't exist until 1999, and wasn't in MSVC til 2010?), use uint32_t for value types when you can, and leave int32_t to signatures that have to match up with ICU typedefs or method signatures or whatever. @@ +484,5 @@ > + GetAvailable getAvailable, MutableHandleValue result) > +{ > +#if ENABLE_INTL_API > + int32_t count = countAvailable(); > +#endif Not that it really matters, but why not put this below and have one #if ENABLE_INTL_API for the entire function? @@ +491,5 @@ > + return false; > + > +#if ENABLE_INTL_API > + RootedValue t(cx, BooleanValue(true)); > + for (int32_t i = 0; i < count; i++) { uint32_t as well. @@ +492,5 @@ > + > +#if ENABLE_INTL_API > + RootedValue t(cx, BooleanValue(true)); > + for (int32_t i = 0; i < count; i++) { > + const char* locale = getAvailable(i); const char *locale @@ +493,5 @@ > +#if ENABLE_INTL_API > + RootedValue t(cx, BooleanValue(true)); > + for (int32_t i = 0; i < count; i++) { > + const char* locale = getAvailable(i); > + char *lang = JS_strdup(cx, locale); Use ScopedJSFreePtr<char> here -- lets you remove/not worry about the js_free(lang). @@ +499,5 @@ > + return false; > + char *p; > + while ((p = strchr(lang, '_'))) > + *p = '-'; > + RootedAtom a(cx, Atomize(cx, lang, strlen(lang), InternAtom)); Interning's not such a great concept, the way it's currently formulated (it's not amenable to compacting GC). Could you skip interning, and if we find perf bottlenecks from this later we can add it then? @@ +504,5 @@ > + js_free(lang); > + if (!a) > + return false; > + if (!JSObject::defineProperty(cx, locales, a->asPropertyName(), t, > + JS_PropertyStub, JS_StrictPropertyStub, JSPROP_ENUMERATE)) { { aligns with } if the condition of an if spans multiple lines. @@ +517,5 @@ > +/** > + * Returns the object holding the internal properties for obj. > + */ > +static bool > +GetInternals(JSContext *cx, HandleObject obj, MutableHandleObject internals) Hoo boy, this is some fun times. We really need to clean up the self-hosting/native code boundary and transitions and how you do them at some point.

Attachment #718720 - Flags: review?(jwalden+bmo) → review+

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 3) (obsolete) — Details — Splinter Review

Updated per comment 8.

Attachment #718723 - Attachment is obsolete: true

Attachment #718723 - Flags: review?(jwalden+bmo)

Attachment #722075 - Flags: review?(jwalden+bmo)

Assignee

Comment 13

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 4) (obsolete) — Details — Splinter Review

Attachment #718724 - Attachment is obsolete: true

Attachment #722077 - Flags: review?(jwalden+bmo)

Assignee

Comment 14

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 5) (obsolete) — Details — Splinter Review

Attachment #722078 - Flags: review?(jwalden+bmo)

Assignee

Comment 15

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (remainder) (obsolete) — Details — Splinter Review

Assignee

Updated

•

12 years ago

Attachment #722071 - Flags: checkin?(jwalden+bmo)

Comment 16

•

12 years ago

Comment on attachment 722071 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (ICU stubs) https://hg.mozilla.org/integration/mozilla-inbound/rev/14f64332f3ea

Attachment #722071 - Flags: checkin?(jwalden+bmo)

Comment 17

•

12 years ago

Comment on attachment 722073 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 1) Review of attachment 722073 [details] [diff] [review]: ----------------------------------------------------------------- ::: js/src/builtin/Intl.cpp @@ +37,1 @@ > #include "unicode/utypes.h" Hmm. I think we might want to move these above the jsobjinlines.h #include, so that we have all *.h headers, then all {*-inl,*inlines}.h headers. Sorry for all this code motion. :-( @@ +38,5 @@ > > +#if ENABLE_INTL_API > +using icu::Locale; > +using icu::NumberingSystem; > +#endif I think we've usually put using-declarations beneath using-namespaces. @@ +524,5 @@ > +} > + > +// Simple RAII for ICU objects. MOZ_TYPE_SPECIFIC_SCOPED_POINTER_TEMPLATE > +// unfortunately doesn't work because of namespace incompatibilities > +// (TypeSpecificDelete cannot be in icu and mozilla at the same time) Hmm? I'm not sure I quite see how both-in-icu and mozilla is happening here. But the void* issue is obvious enough regardless. (If only C/C++ had generative typedefs...) @@ +532,5 @@ > +class ScopedICUObject > +{ > + typedef T* type; > + type ptr; > + void (* deleter)(type); |type| being purely for convenience in the scoped stuff, just use T* directly everywhere in here, and get rid of type. @@ +537,5 @@ > + > + public: > + ScopedICUObject(type ptr, void (* deleter)(type)) : > + ptr(ptr), > + deleter(deleter) SpiderMonkey style is trending toward trailing _ on field names. Also some whitespace alignment nits: ScopedICUObject(T *ptr, void (*deleter)(T*)) : ptr_(ptr), deleter_(deleter) {} @@ +550,5 @@ > + // but returned to the caller if everything goes well, call keep() > + // just before returning. > + void keep() { > + ptr = NULL; > + } Canonically this would be: T * forget() { T *tmp = ptr; ptr = NULL; return tmp; } then making the return-the-value idiom read |return scope.forget();| or so. @@ +553,5 @@ > + ptr = NULL; > + } > +}; > + > +static const int STACK_STRING_SIZE = 50; Are subsequent uses of this going to require int? size_t would be better. @@ +555,5 @@ > +}; > + > +static const int STACK_STRING_SIZE = 50; > + > +static const int ICU_OBJECT_SLOT = 0; JS slot numbers are uint32_t.

Attachment #722073 - Flags: review?(jwalden+bmo) → review+

Assignee

Comment 18

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 1) — Details — Splinter Review

Updated per comment 17. Carrying r+jwalden. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #17) > > +// (TypeSpecificDelete cannot be in icu and mozilla at the same time) > > Hmm? I'm not sure I quite see how both-in-icu and mozilla is happening > here. See comment 11.

Attachment #722073 - Attachment is obsolete: true

Attachment #724287 - Flags: review+

Attachment #724287 - Flags: checkin?(jwalden+bmo)

https://hg.mozilla.org/mozilla-central/rev/14f64332f3ea

Comment 19

•

12 years ago

Comment 20

•

12 years ago

https://hg.mozilla.org/integration/mozilla-inbound/rev/acad61792b53 to kill off all the unused-function warnings introduced by that single push. :-( Should have noticed those when reviewing, and/or I should have test-built it before pushing.

•

12 years ago

Comment on attachment 722078 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 5) Review of attachment 722078 [details] [diff] [review]: ----------------------------------------------------------------- Given the fragility of memory management, especially at API mismatch points like this, I think I want to see a new version for the StringBuffer use in it. ::: js/src/builtin/Intl.cpp @@ +1241,5 @@ > + RootedObject internals(cx); > + if (!GetInternals(cx, numberFormat, &internals)) > + return NULL; > + > + if (!JS_GetProperty(cx, internals, "locale", value.address())) { Same cx->names().locale mumble mumble again. @@ +1270,5 @@ > + if (equal(style, "currency")) { > + if (!JS_GetProperty(cx, internals, "currency", value.address())) > + return NULL; > + // uCurrency remains owned by value.toString() > + uCurrency = JS_GetStringCharsZ(cx, value.toString()); This comment is sort of right. The chars are owned by the string, but the string's not rooted locally after the next getProperty. It's rooted through the rooting of the internals object, which would root the string, but this rooting chain gets a bit strained. Could you save the string into a root that lives for the whole method, instead? Also I think you need a SkipRoot skip(cx, &uCurrency); so that the in-progress rooting analysis builds don't get confused and try to poison this pointer under you. This would need to stay valid until uCurrency is used, so probably it goes next to the declaration of that. No, we don't particularly have documentation of this. :-\ I only knew just enough to know to ask someone if that was necessary, and really no more than that -- we have just about nothing for this now, except some tinderbox builds that will probably turn orange without it. Later on you implicitly assume this string's length is 3. That follows from IsWellFormedCurrencyCode, right? Add a MOZ_ASSERT(value.toString()->length() == 3, "IsWellFormedCurrencyCode permits only length-3 strings"); so the reason for this is clear. @@ +1281,5 @@ > + uStyle = UNUM_CURRENCY_ISO; > + else if (equal(currencyDisplay, "symbol")) > + uStyle = UNUM_CURRENCY; > + else > + uStyle = UNUM_CURRENCY_PLURAL; Assert this is "name"? @@ +1285,5 @@ > + uStyle = UNUM_CURRENCY_PLURAL; > + } else if (equal(style, "percent")) { > + uStyle = UNUM_PERCENT; > + } else { > + uStyle = UNUM_DECIMAL; Assert this is "decimal"? @@ +1294,5 @@ > + return NULL; > + if (hasP) { > + if (!JS_GetProperty(cx, internals, "minimumSignificantDigits", value.address())) > + return NULL; > + uMinimumSignificantDigits = value.toInt32(); Hmm. If this is actually always guaranteed to be an int32_t Value -- that is, isInt32(), versus isDouble() -- it's pretty obscure. (I suspect this value can flow from script in various ways, which can produce isDouble() versions of the integer values. An attacker would have to know that certain methods produce isDouble() values, and combine them just right to get an integer-valued double, but it *is* a hazard.) Could you use int32_t(value.toNumber()) to be safe? Same for all these minimum/maximum values. @@ +1315,5 @@ > + return NULL; > + uUseGrouping = value.toBoolean(); > + > + UErrorCode status = U_ZERO_ERROR; > + UNumberFormat *nf = unum_open(uStyle, NULL, 0, icuLocale(locale.ptr()), NULL, &status); It's again permissible to pass in language tags here, and not _-separated stuff? The UParseError* here is only for parse errors in the pattern we're not providing, correct? @@ +1345,5 @@ > + return NULL; > + } > + > + toClose.keep(); > + return nf; return toClose.forget(); @@ +1353,5 @@ > +intl_FormatNumber(JSContext *cx, UNumberFormat *nf, double x, MutableHandleValue result) > +{ > + if (x == 0.0) > + // could be -0.0, which we don't want to see in output > + x = 0.0; Style nit -- don't want a multiline unbraced if -- and a substantive bit about using something clearer about what's being done: // FormatNumber doesn't consider -0.0 to be negative. if (MOZ_DOUBLE_IS_NEGATIVE_ZERO(x)) x = 0.0; @@ +1356,5 @@ > + // could be -0.0, which we don't want to see in output > + x = 0.0; > + > + jschar stackChars[STACK_STRING_SIZE + 1]; > + jschar *chars = stackChars; Rather than manual memory management of maybe-stacky memory, you should use StringBuffer from vm/StringBuffer.h. It's a little awkward because you'll have to resize(32) (that's the current internal buffer size) and then use begin() to access the raw internal pointer. (You'll have to change STACK_STRING_SIZE to 32 as well.) But it seems better reusing that allocation logic than open-coding it. Note that because you don't actually care about null-termination, there's no reason not to use the full 32 for the input length here. unum_formatDouble's documentation is actually surprisingly vague about whether the output is null-terminated. If I read the tea leaves of the implementation just right (it flows into UnicodeString::extract(UChar* dest, int32_t destCapacity, UErrorCode&) which is clear about this), I *think* if the output is length-32 without null termination, and the input buffer is length 32, status will be U_STRING_NOT_TERMINATED_WARNING and not a U_FAILURE. (size is the length of the formatted string not including '\0'.) (This actually seems like a documentation bug to me, that unum_formatDouble doesn't clearly say what null-termination behavior is.) We don't care about null-termination, because we're using a CopyN function below currently. So we shouldn't add in this +1 everywhere. @@ +1372,5 @@ > + js_free(chars); > + JS_ReportErrorNumber(cx, js_GetErrorMessage, NULL, JSMSG_INTERNAL_INTL_ERROR); > + return false; > + } > + Apropos of previous patches, I just stumbled across * ICU functions that take a reference (C++) or a pointer (C) to a UErrorCode * first test if(U_FAILURE(errorCode)) { return immediately; } * so that in a chain of such functions the first one that sets an error code * causes the following ones to not perform any operations. So ignore my comment about scoped and checking for failure every time in one of the previous patches. \o/ @@ +1373,5 @@ > + JS_ReportErrorNumber(cx, js_GetErrorMessage, NULL, JSMSG_INTERNAL_INTL_ERROR); > + return false; > + } > + > + RootedString str(cx, JS_NewUCStringCopyN(cx, chars, size)); Once StringBuffer is used, this can be bs.finishString() returned into a JSString *. You'll have to null-check the result, as string creation can fail if the stackful buffer was used, and allocating for the chars or whatever failed. @@ +1397,5 @@ > + // Obtain a UNumberFormat object, cached if possible. > + bool isNumberFormatInstance = numberFormat->getClass() == &NumberFormatClass; > + UNumberFormat *nf; > + if (isNumberFormatInstance) { > + nf = (UNumberFormat *) numberFormat->getReservedSlot(ICU_OBJECT_SLOT).toPrivate(); I think I forgot to mention this reviewing the last patch or two, but add new constants for the slot and slot count for these classes. ::: js/src/builtin/Intl.h @@ +93,5 @@ > extern JSBool > intl_numberingSystem(JSContext *cx, unsigned argc, Value *vp); > > +/** > + * Returns a String value representing x (which must be a Number value) "must be a number" is slightly clearer, and eliminates |new Number(5)|, if the reader started to wonder.

Attachment #722078 - Flags: review?(jwalden+bmo) → review-

Assignee

Comment 26

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 2) — Details — Splinter Review

Updated per comment 22. Carrying r+jwalden.

Attachment #722074 - Attachment is obsolete: true

Attachment #725280 - Flags: review+

Attachment #725280 - Flags: checkin?(jwalden+bmo)

Assignee

Comment 27

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 3) — Details — Splinter Review

Updated per comment 23. Carrying r+jwalden. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #23) > General note, but could you add SUPPRESS_UNUSED_WARNING to every function > that's not yet used, please? It'll save you the hassle of dealing with my > manual additions of that, in your own patch queue. So we don't have to add and remove these macros like crazy, can you just check in parts 1-3 together? At that point, there shouldn't be any unused non-stub functions left. > > + RootedObject internals(cx); > > + if (!GetInternals(cx, collator, &internals)) > > + return NULL; > > This is the object that you originally had __locale and all those internal > properties underscore-prefixed, until we realized it was never exposed and > so not necessary, right? Correct. For Collator instances, this object has exactly the internal properties specified for these objects. For NumberFormat and DateTimeFormat, there are some variations. > > + // UCollator options with default values. > > I find myself wondering if these would be better placed next to the code > that modifies them. It would be clearer what value's used at each point, > then. (I'd also not be scrolling back and forth between default and > modification, when reviewing this. :-) ) Would make knowing you'd set all > values more difficult, tho. Dunno. Anyway, not asking for a change, just > raising the point in case you have particular feelings here. No particular feelings. At some point I had shortcuts for argument-less localeCompare and toLocaleString, and in that shortcut the default values were filled in without even creating a Collator, NumberFormat, or DateTimeFormat. Now that doesn't matter any more. > > + UColAttributeValue uCaseFirst = UCOL_DEFAULT; > > The header I'm reading says acceptable values for UCOL_CASE_FIRST are > upper-first, lower-first, or off. Should this be UCOL_OFF? A bit further up in ucol.h there's the hint "All the attributes can take UCOL_DEFAULT value", and since I want the locale default here, using UCOL_DEFAULT seems appropriate. > > + const char *oldLocale = locale.ptr(); > > + localeLen = strlen(oldLocale); > > This is guaranteed to be a structurally-valid locale tag, right? Yes. > > + if (!JS_GetProperty(cx, internals, "ignorePunctuation", value.address())) > > + return NULL; > > + if (value.toBoolean()) > > + uAlternate = UCOL_SHIFTED; > > I will candidly admit to not understanding at all what's going on here. > Could you explain how this is supposed to effect [[ignorePunctuation]]'s > behavior? Added comment. The ICU documentation is no help here. > > + ucol_setAttribute(coll, UCOL_CASE_LEVEL, uCaseLevel, &status); > > + ucol_setAttribute(coll, UCOL_ALTERNATE_HANDLING, uAlternate, &status); > > + ucol_setAttribute(coll, UCOL_NUMERIC_COLLATION, uNumeric, &status); > > + ucol_setAttribute(coll, UCOL_NORMALIZATION_MODE, uNormalization, &status); > > + ucol_setAttribute(coll, UCOL_CASE_FIRST, uCaseFirst, &status); > > I don't see in docs where it's valid to keep setting attributes after a > failure. Added comment. > > + jschar const *chars2 = str2->getChars(cx); > > + if (!chars2) > > + return false; > > + > > + UCollationResult uresult = ucol_strcoll(coll, chars1, length1, chars2, length2); > > Hmm. I guess we're assuming jschar == UChar, above and beyond the two types > being compatible. Both are specified to be UTF-16 code units: http://userguide.icu-project.org/unicode#TOC-Programming-using-UTFs http://ecma-international.org/ecma-262/5.1/#sec-8.4 > Hopefully that won't break anywhere, but I have my > doubts. :-\ Do you have try server access to test this, or should I do so > before pushing? Most recent try server build with Intl enabled: https://tbpl.mozilla.org/?tree=Try&rev=4b5645a415c7 Collation demo: http://lindenbergsoftware.com/demo/Collation.html v1-v4 swap in different source code variants, which you can also edit. Editable input is in the lower left text box. The Sort button runs the source and places the output into the lower right text box. Conformance tests: http://test262.ecmascript.org/testcases_intl402.html Relevant here are the 10.3.2_CS_* tests.

Attachment #722075 - Attachment is obsolete: true

Attachment #725282 - Flags: review+

Attachment #725282 - Flags: checkin?(jwalden+bmo)

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 4) — Details — Splinter Review

Updated per comment 24. Carrying r+jwalden. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #24) > > + const char *name = numbers->getName(); > > + JSString *jsname = JS_NewStringCopyZ(cx, name); > > + delete numbers; > > Hoo boy, this is some implementation dumpster-diving. Meaning...? Suggestions? > > + * Usage: defaultNumberingSystem = intl_numberingSystem(locale) > > The docs I saw suggested |locale| could be something like "en_US", for the > NumberingSystem stuff you're passing along. Are language tags also accepted? Hmm. Yes in the sense that I know the ICU team has done a lot of work to enable BCP 47 language tags in their API, and that I do get properly localized behavior when I pass in language tags. No in the sense that there's no documentation I could use to prove to you that this is actually supposed to work. I filed yet another ICU ticket: http://bugs.icu-project.org/trac/ticket/10040

Attachment #722077 - Attachment is obsolete: true

Attachment #725687 - Flags: review+

Attachment #725687 - Flags: checkin?(jwalden+bmo)

Assignee

Comment 34

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 5) (obsolete) — Details — Splinter Review

Updated per comment 25. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #25) > > + UNumberFormat *nf = unum_open(uStyle, NULL, 0, icuLocale(locale.ptr()), NULL, &status); > > It's again permissible to pass in language tags here, and not _-separated > stuff? Same story as in comment 33. > The UParseError* here is only for parse errors in the pattern we're not > providing, correct? Correct. > > + if (x == 0.0) > > + // could be -0.0, which we don't want to see in output > > + x = 0.0; > > Style nit -- don't want a multiline unbraced if -- It seems you guys spend way too much brain power thinking about braces. In Java-land, there's a simple rule: use braces for any compound statement; opening brace at the end of the line, closing at the beginning. Most JavaScript style guides have adopted the same rule. My right pinky knows the rule; from 1997 until I started working on SpiderMonkey my brain never had to worry about it. But in SpiderMonkey there's a page of rules to consider, and undocumented add-ons to the rule still appear, and adding an assertion or comment in one line can trigger multiple brace changes across dozens of lines. That's a lot of effort just to save a bit of space here and there... > (This actually seems like a documentation bug to me, that unum_formatDouble > doesn't clearly say what null-termination behavior is.) I filed http://bugs.icu-project.org/trac/ticket/10042 > > + * Returns a String value representing x (which must be a Number value) > > "must be a number" is slightly clearer, and eliminates |new Number(5)|, if > the reader started to wonder. Number objects are not Number values - see ES5 4.3.19 and 4.3.21.

Attachment #722078 - Attachment is obsolete: true

Attachment #725688 - Flags: review?(jwalden+bmo)

Assignee

Comment 35

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 6) (obsolete) — Details — Splinter Review

Attachment #725703 - Flags: review?(jwalden+bmo)

Assignee

Comment 36

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 7) (obsolete) — Details — Splinter Review

Attachment #725704 - Flags: review?(jwalden+bmo)

Assignee

Comment 37

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 8) (obsolete) — Details — Splinter Review

Attachment #725705 - Flags: review?(jwalden+bmo)

https://hg.mozilla.org/mozilla-central/rev/e1cc50bfee41 https://hg.mozilla.org/mozilla-central/rev/f723856dac07 https://hg.mozilla.org/mozilla-central/rev/c3d72dcbbe94 https://hg.mozilla.org/mozilla-central/rev/0ed3d38e0e4d

Assignee

Comment 38

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 9) (obsolete) — Details — Splinter Review

Attachment #722079 - Attachment is obsolete: true

Attachment #725706 - Flags: review?(jwalden+bmo)

Phil Ringnalda (:philor)

Comment 39

•

12 years ago

Comment 40

•

12 years ago

Comment on attachment 725687 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 4) https://hg.mozilla.org/integration/mozilla-inbound/rev/605348ff1ee6 (In reply to Norbert Lindenberg from comment #33) > > > + const char *name = numbers->getName(); > > > + JSString *jsname = JS_NewStringCopyZ(cx, name); > > > + delete numbers; > > > > Hoo boy, this is some implementation dumpster-diving. > > Meaning...? Suggestions? I was referring to the ugliness of knowing that it's proper to dispose of the NumberSystem object by deleting it. I had to dig into the initial creation method quite a ways to see the |new| this pairs up with. But hey, it works...at least up until we start building against system ICU, at which point I suspect this delete will be a mismatch with the corresponding new in ICU. (Mozilla -- not SpiderMonkey -- defines its own malloc implementation.) Bug 724531 comment 29 mentions this; hopefully whoever adds --with-system-icu can make things kosher here somehow. I don't have any suggestions for anything better here. If I'd had them I'd have said it more clearly. :-)

Attachment #725687 - Flags: checkin?(jwalden+bmo)

Assignee

Comment 41

•

12 years ago

(In reply to Norbert Lindenberg from comment #38) > Created attachment 725706 [details] [diff] [review] > Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, > Intl.DateTimeFormat (part 9) I should highlight that this change renders BasicFormatMatcher and BestFitFormatMatcher unused, and relies on ICU's DateTimePatternGenerator to construct the pattern for the provided options. As it turns out, DateTimePatternGenerator doesn't conform to the spec for BasicFormatMatcher, and I have an item on my to-do list to find a solution for this issue.

Assignee

•

12 years ago

Comment on attachment 725706 [details] [diff] [review] Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 9) Review of attachment 725706 [details] [diff] [review]: ----------------------------------------------------------------- It is entirely possible I misunderstood a whole bunch of this patch. In any case, it definitely needs enough changes for another review of it. ::: js/src/builtin/Intl.js @@ +1598,5 @@ > var value = GetOption(options, prop, "string", dateTimeComponentValues[prop], undefined); > opt[prop] = value; > } > > + // Steps 20-21 provided by ICU. Hmm. I guess all these step numbers became unsynced at some point between the i18n spec we were looking at when the patch adding this landed, and the 2013-02-28 draft with errata. I guess once everything's stabilized I should go back and adjust numbering appropriately. @@ +1625,5 @@ > // Step 31. > internals.initializedDateTimeFormat = true; > } > > Understanding nothing of patterns, skeletons, or date/time formatting using ICU before reading this patch, I hope it will come as no surprise that this was all very, very confusing. :-) A brief overview comment discussing why things are as they are here, how ICU works to implement this, and how SpiderMonkey uses ICU internally here would be incredibly helpful to future readers. In pursuit of understanding everything, I wrote up such a comment. Please critique it, tell me where I totally don't know what I'm talking about, and include an appropriately-fixed version in this patch -- here or thereabouts seems a reasonable place. """ Different locales have different optimal ways to display dates using the same basic components. For example, en-US might use "Sept. 24, 2012" while fr-FR might use "24 Sept. 2012". The intent of Intl.DateTimeFormat is to permit production of the "optimal" format for the locale, without the user having to pick it (or even know what it is). ICU implements this behavior with a two-level pattern system. The skeleton level consumes the options -- options.weekday, options.era, options.day, options.minute, options.hour, options.hour12, etc. (see Table 3 - Components of date and time formats) -- provided when an Intl.DateTimeFormat is initialized. These inputs are used to form a skeleton. A skeleton is a string like "yyyyMMDD", specifying the components of the date/time to be included in the ultimate formatted string. A skeleton is consumed by a UDateTimePatternGenerator to produce a pattern. The various skeleton patterns are defined in unicode/udat.h The pattern level takes a pattern string like "yyyy.MM.dd" and produces an exact corresponding string like "2012.09.14". This level does only exact pattern substitutions (and unescaping of any escaped portions of the pattern). The overall format of the corresponding string is identical to that of the pattern string. The components of the string are substituted with locale-sensitive values -- "December" for en-US or "Dezember" for de-DE, say -- but the overall format is determined entirely by the input pattern, locale-insensitively. Inside this Intl.DateTimeFormat implementation, skeletons exist only temporarily within |toBestICUPattern|. The created skeleton is immediately passed to js::intl_patternForSkeleton, a C++ method which gets the best pattern for the specified locale for those components. Patterns initially exist as the return value of |toBestICUPattern|. This value then propagates to the |internals.pattern| property for the specific DateTimeFormat-initialized object. From here it's primarily consumed when producing a formatted string. But there's one additional place that consumes the pattern: Intl.DateTimeFormat.prototype.resolvedOptions(), to expose the values of the Table 3 properties. In the spec all the properties listed in Table 3 exist as separate internal properties on |internals|. But in our implementation, these properties are latent in the value of |internals.pattern| -- they're not saved or tracked. So to reconstitute them when computing resolved options, we have to parse |internals.pattern| to extract the components and set the appropriate properties on the returned object. This task is performed by |resolveICUPattern|. """ @@ +1626,5 @@ > internals.initializedDateTimeFormat = true; > } > > > +function toBestICUPattern(locale, options) { An overview comment for this method would have been helpful in helping me understand it. Something like this, maybe: """ Computes an ICU pattern for the given locale, usable in producing formatted dates/times for the locale. (Internally this computes an ICU skeleton containing the requested components in |options|, then it gets an ICU pattern corresponding to that skeleton in the given locale.) """ @@ +1627,5 @@ > } > > > +function toBestICUPattern(locale, options) { > + var skeleton = ""; Is there specific documentation for the symbols in a skeleton anywhere? Somehow I found <http://userguide.icu-project.org/formatparse/datetime>, but that's not a complete guide to all the skeleton symbols used here. udat.h seems to include other symbols, some of which might be applicable at the skeleton level -- UDAT_HOUR as "j" is obviously used below here, for one. Having one document to point to would be ideal here, but if it has to be two, somehow, that's better than not having it spelled out at all. @@ +1636,5 @@ > + case "short": > + skeleton += "E"; > + break; > + case "long": > + skeleton += "EEEE"; FormatDateTime 7.a.ix says the interpretations of narrow/short/long for all of these are implementation-defined, correct? Not disagreeing with these choices here, or saying they're not sensible, just making sure I understand the extent they're required by the spec. Given that for numeric fields, field width corresponds to output width, I'm a little sad that non-numeric fields don't at least roughly try to correspond to length ordering -- "E", "EEEE", "EEEEE" from shortest to longest, I mean. :-( This confused me for a bit, until I noticed the rough correspondence had been specified for *numeric* fields only. Is it at all possible we might want "e" instead, here? I have no idea whether Intl.DateTimeFormat output should use local day-of-week or not. If we want "e" here, I assume we'd also want to use "e" in |icuPatternCharToComponent|, probably. @@ +1638,5 @@ > + break; > + case "long": > + skeleton += "EEEE"; > + } > + switch (options.era) { When I look at http://userguide.icu-project.org/formatparse/datetime I only see the single code "G" for era designator. Surely I'm missing something here, right? Because otherwise it looks like these would produce "ADADADADAD" for "narrow" and "ADADADAD" for "long", which is obviously not acceptable. Hmm. Further down that doc it says that "yyyyy.MMMM.dd GGG hh:mm aaa" would correspond to "01996.July.10 AD 12:08 PM". Maybe there's some sort of API rule that unrecognized-length sequences correspond to one of the recognized-length versions? (And we're just using the different lengths to hopefully trigger better-targeted era behavior should it ever be implemented?) Let me know where this is listed/documented, please! @@ +1654,5 @@ > + case "2-digit": > + skeleton += "yy"; > + break; > + case "numeric": > + skeleton += "y"; Nitpick, but "yyyy" more clearly to me suggests what this actually expands to. ...or, after more reading/grokking, I guess this is what UDAT_YEAR is. Right? So it should be one "y", because "yyyy" would constrain exactly to four-digit years, but we want whatever the locale-preferred format is. Is that right? @@ +1683,5 @@ > + skeleton += "d"; > + break; > + } > + var hourSkeletonChar = "j"; > + if (options.hour12 !== undefined) { Where's "j" documented? It's not listed at the link mentioned previously, although both "h"/"H" are. (Maybe you meant this to be "k"? Or perhaps I'm totally wrong, and my head's just spinning right now from so much pattern-comparison. :-) ) ...oh, later after finding the bits in udat.h I see this is UDAT_HOUR, with "the locale's preferred hour format (12 or 24)". That makes sense, then. @@ +1691,5 @@ > + hourSkeletonChar = "H"; > + } > + switch (options.hour) { > + case "2-digit": > + skeleton += hourSkeletonChar + hourSkeletonChar; It *is* documented somewhere (where exactly?) that "jj" produces a two-digit 24-hour number, right? In combining docs from two different places, I don't see anything that tells me this work. (I guess this is the hour-equivalent to the era/"G" issue mentioned before.) @@ +1715,5 @@ > + break; > + } > + switch (options.timeZoneName) { > + case "short": > + skeleton += "z"; Since this corresponds to PDT and such, I'd prefer "zzz" as well here. Or does "zzz" not correspond to PDT? http://userguide.icu-project.org/formatparse/datetime seems to say "z"/"zz"/"zzz" all correspond to the same thing, but I'm not entirely sure that's the case, or if only "z" does and the others correspond to slightly longer things. ...or no, reading further along, maybe this is so |resolveICUPattern| can reconstitute this value when required by resolvedOptions()? (And maybe this was the "why" for "y" versus "yyyy", and "jj" versus "j".) Given this re-parsing complexity, I seriously wonder whether we wouldn't be better just saving these options on the internals data, so we don't have to go to such work to recompute them later... @@ +1918,5 @@ > > function dateTimeFormatLocaleData(locale) { > + return { > + ca: intl_availableCalendars(locale), > + nu: getNumberingSystems(locale) I'm confused. Doesn't this value correspond to "The value of the [[localeData]] internal property" in 12.2.3? And doesn't that have a number of requirements that aren't followed here, and that InitializeDateTimeFormat depends on? And that were basically followed in the previous shim? I'm very confused about this! Or is this going to be filled in in subsequent patches? ...or no, after further review I see everything else here is latent in the pattern value computed by |js::intl_patternForSkeleton|. Right? There should be a comment here saying that the missing values are latent in the pattern computed by |toBestICUPattern|, or that this implementation doesn't need to track them, or something. @@ +1979,5 @@ > calendar: internals.calendar, > numberingSystem: internals.numberingSystem, > timeZone: internals.timeZone > }; > + resolveICUPattern(internals.pattern, result); This method implements DateTimeFormat.prototype.resolvedOptions(), defined in 12.3.3. Unless I'm seriously confused, @@ +1999,5 @@ > + m: "minute", > + s: "second", > + z: "timeZoneName", > + v: "timeZoneName", > + V: "timeZoneName" Is there specific documentation for the symbols that make up a pattern anywhere? As mentioned before I found <http://userguide.icu-project.org/formatparse/datetime> and bits at the top of udat.h, but nothing comprehensive and specifically saying it was a reference to all pattern symbols. Well, the link may be a claim to complete documentation, if I read it closely. Is it? Whatever the proper reference is, this needs a comment by it that points to it, either as a link or as a header path, or as something that'll let the reader see where all these characters come from. @@ +2003,5 @@ > + V: "timeZoneName" > +}; > + > + > +function resolveICUPattern(pattern, result) { """ Recomputes the internal properties (12.4) of a DateTimeFormat-initialized object that are latently represented in the ICU pattern string. Each property/value pair represented in the pattern ("hour12" is only present if "hour" is) is then defined upon the |result| object. """ You should also |assert(IsObject(result), "resolveICUPattern")|, come to think of it. @@ +2008,5 @@ > + var i = 0; > + while (i < pattern.length) { > + var c = pattern[i++]; > + if (c === "'") { > + while (i < pattern.length && pattern[i] !== c) Since we're skipping over a quoted (escaped) subcomponent til the closing single-quote, I'd rather see a comparison to "'" than to |c| again. @@ +2022,5 @@ > + switch (c) { > + // "text" cases > + case "G": > + case "E": > + case "a": What's "a" doing here? There's no "a" in |icuPatternCharToComponent|, so you're not going to do anything except uselessly set the value here, I think. @@ +2037,5 @@ > + // "number" cases > + case "y": > + case "d": > + case "h": > + case "H": Does "j" need to be handled here?

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 7) — Details — Splinter Review

Updated per comment 47. Carrying r+jwalden. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #47) > > + UCalendar *cal = (UCalendar *) udat_getCalendar(df); > > I don't see a need for this cast, looking at the current headers. udat_getCalendar returns a const UCalendar *; ucal_setGregorianChange wants a UCalendar *. If I'm in trouble, so is Apple: http://www.opensource.apple.com/source/CF/CF-744/CFDateFormatter.c

Attachment #725704 - Attachment is obsolete: true

Attachment #726520 - Flags: review+

Attachment #726520 - Flags: checkin?(jwalden+bmo)

Comment 55

•

12 years ago

(In reply to Norbert Lindenberg from comment #51) > > I think the problem you have is you aren't, and haven't, edited existing > > code much. > > SpiderMonkey code, you mean. Yes, sorry for not being quite precise about that. :-)

Terrence Cole [:terrence]

Assignee

Comment 56

•

12 years ago

(In reply to Terrence Cole [:terrence] from comment #48) > > +var count = Intl.Collator.supportedLocalesOf(locales).length; > > + > > +reportCompare(locales.length, count, "Number of supported locales in Intl.Collator"); > > This is really gross. I'd prefer to assert this property for some reasonably > stable subset so that the test doesn't constantly break and so that it will > run on platforms where people have built their own firefox or ICU. My thinking was that Mozilla would want to notice if a new version of ICU drops support for a language that Mozilla cares about, so I listed the locales for those languages. I didn't list ICU languages that Mozilla currently doesn't localize into [1]. You're right that people who use a slimmed version of ICU would see the test fail and would have to change it, but I can't predict which locales they'd keep or drop. The lists of English/French/Spanish locales are probably longer than needed because I don't have information on which language/country combinations Mozilla cares about. If you still think the list is too long, how would you construct a shorter list? [1] http://www.mozilla.org/en-US/firefox/all/

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 8) — Details — Splinter Review

Updated per comment 49. Carrying r+jwalden. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #49) > > + var localeOfLastResort = "en-GB"; > > I assume this change is because of bug 851763? Add a comment like // XXX > TEMPORARY HACK FOR BUG 851763 if so. No. I added a comment explaining en-GB. Previously, I used "und" because the implementation didn't have any real locale data. > @@ +1116,5 @@ > > > > > > function collatorSortLocaleData(locale) { > > + var collations = intl_availableCollations(locale); > > + callFunction(std_Array_unshift, collations, null); > > For a start this is okay, I guess. > > But unshift is inherently a linear-time operation, in the absence of a pile > of hacks, and particular requirements that only *sometimes* make it fast. > (I am assuming intl_availableCollations(locale) is large; I don't remember > for sure that it is.) Can we follow up and make intl_availableCollations > reserve space for this |null| if we're going to add it like this later? > Perhaps as an optional mode or something, as I assume most users don't need > any extra space at the start. intl_availableCollations(locale) is small; 1 element for most locales; 6 for Chinese. While rigging up the function to provide that number I found that it's called twice per Collator instance; localeData() is called in both ResolveLocale and InitializeCollator. I changed ResolveLocale to return the locale data to InitializeCollator, but it made no measurable difference, so I reverted the change. > @@ +1134,1 @@ > > sensitivity: "variant" > > This is implementation-defined per "The default search sensitivity per > locale (10.2.3)" as mentioned informatively in Appendix A, correct? Correct.

Attachment #725705 - Attachment is obsolete: true

Attachment #727084 - Flags: review+

Attachment #727084 - Flags: checkin?(jwalden+bmo)

Assignee

Comment 63

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 9) (obsolete) — Details — Splinter Review

Update per comment 50. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #50) I apologize for not providing adequate documentation with this patch. The interaction with ICU is impossible to understand without, and the related ICU documentation is incomplete - I filed a bug about that a long time ago. You guessed a lot of the background correctly, but were still missing critical bits. I hope the comment after InitializeDateTimeFormat provides the necessary information. > > + // Steps 20-21 provided by ICU. > > Hmm. I guess all these step numbers became unsynced at some point between > the i18n spec we were looking at when the patch adding this landed, and the > 2013-02-28 draft with errata. I guess once everything's stabilized I should > go back and adjust numbering appropriately. I can see why you'd want to use the February spec draft, since it addresses some spec problems that you found in the code reviews and adds a feature that we still want to add to this initial implementation (time zones). But there are still some open issues that I'll have to address in a later draft. Let's stick with the 1.0 spec as a reference for the time being. > > + case "short": > > + skeleton += "E"; > > + break; > > + case "long": > > + skeleton += "EEEE"; > > FormatDateTime 7.a.ix says the interpretations of narrow/short/long for all > of these are implementation-defined, correct? Not disagreeing with these > choices here, or saying they're not sensible, just making sure I understand > the extent they're required by the spec. Correct. Think of it as 9 knobs connected to the engine via rubber bands. In fact, the rubber bands for the era and timeZoneName knobs aren't required to exist at all... > Given that for numeric fields, field width corresponds to output width, I'm > a little sad that non-numeric fields don't at least roughly try to > correspond to length ordering -- "E", "EEEE", "EEEEE" from shortest to > longest, I mean. :-( This confused me for a bit, until I noticed the rough > correspondence had been specified for *numeric* fields only. The set of pattern symbols has grown organically since about 1996. If someone designed it from scratch today, it would surely look different. > Is it at all possible we might want "e" instead, here? I have no idea > whether Intl.DateTimeFormat output should use local day-of-week or not. If > we want "e" here, I assume we'd also want to use "e" in > |icuPatternCharToComponent|, probably. I have no idea what "local day of week" really means, and this pattern character isn't used at all in CLDR in any of its hundreds of locales. "E" is used everywhere. > > + case "2-digit": > > + skeleton += "yy"; > > + break; > > + case "numeric": > > + skeleton += "y"; > > Nitpick, but "yyyy" more clearly to me suggests what this actually expands > to. > > ...or, after more reading/grokking, I guess this is what UDAT_YEAR is. > Right? So it should be one "y", because "yyyy" would constrain exactly to > four-digit years, but we want whatever the locale-preferred format is. Is > that right? "y" uses as many digits as necessary. In the Japanese imperial calendar, we're only in the year 25 of the current era, and that shouldn't turn into 0025. > ...or no, reading further along, maybe this is so |resolveICUPattern| can > reconstitute this value when required by resolvedOptions()? (And maybe this > was the "why" for "y" versus "yyyy", and "jj" versus "j".) Given this > re-parsing complexity, I seriously wonder whether we wouldn't be better just > saving these options on the internals data, so we don't have to go to such > work to recompute them later... resolveICUPattern gets the pattern, not the skeleton, and the pattern may be somewhat different from what the skeleton requests (it all gets filtered through what UDateTimePatternGenerator can find for the locale). > > function dateTimeFormatLocaleData(locale) { > > + return { > > + ca: intl_availableCalendars(locale), > > + nu: getNumberingSystems(locale) > > I'm confused. Doesn't this value correspond to "The value of the > [[localeData]] internal property" in 12.2.3? And doesn't that have a number > of requirements that aren't followed here, and that InitializeDateTimeFormat > depends on? And that were basically followed in the previous shim? I'm > very confused about this! > > Or is this going to be filled in in subsequent patches? Correct, there is an issue that UDateTimePatternGenerator with the underlying locale data is an adequate implementation of BestFitFormatMatcher, but doesn't conform to the BasicFormatMatcher spec. I've filed https://bugzilla.mozilla.org/show_bug.cgi?id=852837 > @@ +2037,5 @@ > > + // "number" cases > > + case "y": > > + case "d": > > + case "h": > > + case "H": > > Does "j" need to be handled here? No, "j" is only allowed in skeleton strings, not in pattern strings.

Attachment #725706 - Attachment is obsolete: true

Attachment #727085 - Flags: review?(jwalden+bmo)

Assignee

Comment 64

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (cleanup) — Details — Splinter Review

Fixes a small bug and removes no-longer-needed warning suppression. (In reply to Jeff Walden [:Waldo] (remove +bmo to email) from comment #61) > One minor substantive change I made to the patches, beyond style: I moved > the second StringBuffer::resize() calls into the buffer-overflowed ifs. It > makes more sense that way, and on the off chance of a bogus error happening, > it's probably not valid to treat the returned size as being well-defined, > anyway. I had moved it above so that it would also trim the buffer to the actual size of the content. But you're right, errors need to be handled first.

Attachment #727088 - Flags: review?(jwalden+bmo)

Assignee

Comment 65

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (tests) — Details — Splinter Review

Updated per comment 48 and comment 57. Carrying r+terrence.

Attachment #725926 - Attachment is obsolete: true

Attachment #727089 - Flags: review+

Attachment #727089 - Flags: checkin?(jwalden+bmo)

Updated

•

12 years ago

Depends on: 852912

Comment 66

•

•

12 years ago

Attached patch Implement ICU dependent functions of Intl.Collator, Intl.NumberFormat, Intl.DateTimeFormat (part 9) — Details — Splinter Review

Updated per comment 69. Carrying r+jwalden.

Attachment #727085 - Attachment is obsolete: true

Attachment #728329 - Flags: review+

Attachment #728329 - Flags: checkin?(jwalden+bmo)

Assignee

Updated

•

12 years ago

Attachment #727088 - Flags: checkin?(jwalden+bmo)

https://hg.mozilla.org/mozilla-central/rev/100e46502574 https://hg.mozilla.org/mozilla-central/rev/e513c5d3b416

•

12 years ago

Status: NEW → RESOLVED

Closed: 12 years ago

Resolution: --- → FIXED

Assignee

Updated

•

12 years ago

Blocks: 864843

Assignee

Updated

•

12 years ago

Blocks: 866301