BigInt.prototype.toLocaleString #218

littledan · 2018-02-14T19:50:04Z

In the Stage 3 BigInt proposal, toLocaleString is defined. We should have a definition for this in the ECMA-402 specification.

I wasn't able to find an ICU API which would be useful here; does one exist?

cc @jakobkummerow

anba · 2018-03-06T13:57:53Z

ICU4C has unum_formatDecimal, ICU4J has NumberFormat#format(java.math.BigInteger).

littledan · 2018-03-06T16:20:22Z

That looks perfect for this case. Seems like we could pass the output of ToString(bigInt) to unum_formatDecimal as its number/length arguments, if I'm understanding the Decimal Arithmetic Specification correctly.

@anba Would you be interested in writing specification text for BigInt.prototype.toLocaleString in ECMA 402? The design I was imagining would be to base it on an overload of Intl.NumberFormat.prototype.format for BigInts--Number Format Functions could start with ToNumeric rather than ToNumber, and then PartitionNumberPattern would somehow be either generalized or cloned to handle BigInt in addition to Number.

cc @cxielarko

anba · 2018-03-15T15:48:50Z

AFAICT the descriptions for FormatNumberToString, PartitionNumberPattern, FormatNumber, ToRawPrecision, and ToRawFixed only need to be changed to allow BigInt values for the x parameter. BigInt.prototype.toLocaleString can then be defined as:

Let x be ? thisBigIntValue(this value).
Let numberFormat be ? Construct(%NumberFormat%, « locales, options »).
Return FormatNumber(numberFormat, x).

And maybe s/Number/Numeric/ for FormatNumberToString, PartitionNumberPattern, and FormatNumber?

Do we also want to modify (or add a new method) to allow BigInt values for Intl.NumberFormat.prototype.format and Intl.NumberFormat.prototype.formatToParts? The latter may not be implementable in ICU4C because it lacks a unum_formatDecimalForFields function.

sffc · 2018-03-16T04:57:50Z

FYI, better than unum_formatDecimal and NumberFormat#format(java.math.BigInteger) is to use the new methods offered in icu.number and numberformatter.h. The former two methods are not deprecated (at least not yet), but they are discouraged for new users.

numberformatter.h has an endpoint for the same syntax as unum_formatDecimal, but it also has full support for the field position iterator needed for formatToParts. You use it like this:

FormattedNumber result = NumberFormatter::withLocale(...).formatDecimal(..., status);
result.populateFieldPositionIterator(...);
UnicodeString resultString = result.toString();

anba · 2018-03-16T11:30:06Z

Thanks for the info that the new number formatting API will support this use case! For SpiderMonkey we may need to wait for https://ssl.icu-project.org/trac/ticket/13597, because we generally only use the C-API for its compatibility across different releases.

sffc · 2018-04-20T19:06:02Z

As a general comment, I think that we should interpret string inputs as the highest precision datatype supported. So if I give a string containing something like "987654321987654321987654321", that should be interpreted as a BigInteger. However, "9876.543" should be interpreted as a double until Ecma adds a datatype for BigDecimal, for example.

littledan · 2018-04-21T14:28:57Z

I'd prefer not to do this sort of "magical" semantic overloading. If your intuition is that that should exist, then I'm inclined to add another method, and avoid overloading, e.g., Intl.NumberFormat.prototype.formatBigInt/formatBigIntToParts. What would you think of that?

sffc · 2018-04-22T00:26:14Z

I don't see this as "magical" overloading. I see it as the method taking the input data type and then internally choosing the highest precision data type available to represent it.

My intuition is that it's weird to ever convert strings to Numbers, and we should avoid it when possible with better alternative behavior.

littledan · 2018-04-24T19:39:13Z

My intuition is that it's weird to ever convert strings to Numbers, and we should avoid it when possible with better alternative behavior.

Following this intuition, I'd like to mentally classify the format method accepting String arguments at all as weird random legacy behavior, and based on that analysis, not upgrade it to get higher precision. We can't remove it, but we also don't have to improve it. Leaving format(String)'s behavior unchanged will simplify the specification and implementation.

This patch brings Intl.NumberFormat support to BigInt, and adds a BigInt.prototype.toLocaleString method based on it. The design here is to include overloading between BigInt and Number as arguments for the format and formatToParts methods based on ToNumeric. This means that, for example, string arguments are cast to Number, rather than BigInt. This design preserves compatibility and consistency with operators like unary - This definition permits options in the NumberFormat to force decimal places, e.g., 1n formatting as 1.00000 if the minimum fractional digits is 5. Alternative semantics would be to throw an exception in this case. For the algorithm text itself: the specification algorithms ToRawPrecision and ToRawFixed are now used for both Numbers and BigInts. Given the ECMAScript specification's use of implicit coercisions between Numbers and mathematical values, I believe that this is valid without any special changes; the phrasing may change in the future [1]. ICU4C-based implementations of ECMAScript can use LocalizedNumberFormatter::formatDecimal [2] or unum_formatDecimal [3] to implement the algorithms in this patch. [1] tc39/ecma262#1135 [2] http://icu-project.org/apiref/icu4c/classicu_1_1number_1_1LocalizedNumberFormatter.html#a29cd3d107b784496e19175ce0115f26f [3] http://icu-project.org/apiref/icu4c/unum_8h.html#a59870a322f012dc1b9d99cf8a7b708f1 Closes tc39#218

FrankYFTang · 2019-01-19T00:01:45Z

Could someone show me a js example code of how to use BigInt with the toLocaleString?

IS it like

12345678901234567890n.toLocaleString("fr")

?
Is it adding a "n" after the digit good enough to denote the BigInt? Sorry of my lack of confidence of knowing BigInt well here.

jakobkummerow · 2019-01-19T00:26:21Z

Yes, that looks correct.

FrankYFTang · 2019-01-25T04:31:45Z

Thanks for the code review from @jakobkummerow the v8 implementation sync with the current spec ( 040f809 ) is landed into v8 tree behind the flag --harmony-intl-bigint . You can also see the test I wrote here
https://chromium-review.googlesource.com/c/v8/v8/+/1424021/8/test/intl/bigint/tolocalestring.js
Notice in this implementation BigInt(-0).toLocaleString() will output "0" instead of "-0". @jakobkummerow said that is the expected behavior. In the other hand (-0).toLocaleString() will return "-0".

littledan · 2019-01-25T10:08:34Z

Great work, @FrankYFTang !

That's right; for BigInt, -0n is 0n.

I'm going to present on this issue in the upcoming TC39 meeting, arguing we should instead make a separate method, formatBigInt, and consider that a pattern going forward. See slides.

ray007 · 2019-01-25T12:03:47Z

I can understand there being a difference between -0 and 0 for double/float, but for integers?

FrankYFTang · 2019-01-25T19:39:46Z

The reason my current v8 implementation output BigInt(-0) to "0" is because the BitInt.prototype.toString output "0" instead of "-0".

This patch brings Intl.NumberFormat support to BigInt, and adds a BigInt.prototype.toLocaleString method based on it. The design here is to include overloading between BigInt and Number as arguments for the format and formatToParts methods based on ToNumeric. This means that, for example, string arguments are cast to Number, rather than BigInt. This design preserves compatibility and consistency with operators like unary - This definition permits options in the NumberFormat to force decimal places, e.g., 1n formatting as 1.00000 if the minimum fractional digits is 5. Alternative semantics would be to throw an exception in this case. For the algorithm text itself: the specification algorithms ToRawPrecision and ToRawFixed are now used for both Numbers and BigInts. Given the ECMAScript specification's use of implicit coercisions between Numbers and mathematical values, I believe that this is valid without any special changes; the phrasing may change in the future [1]. ICU4C-based implementations of ECMAScript can use LocalizedNumberFormatter::formatDecimal [2] or unum_formatDecimal [3] to implement the algorithms in this patch. [1] tc39/ecma262#1135 [2] http://icu-project.org/apiref/icu4c/classicu_1_1number_1_1LocalizedNumberFormatter.html#a29cd3d107b784496e19175ce0115f26f [3] http://icu-project.org/apiref/icu4c/unum_8h.html#a59870a322f012dc1b9d99cf8a7b708f1 Closes tc39#218

jakobkummerow · 2019-01-25T23:21:24Z

And the reason that (-0n).toString() === "0" is because there is no negative-zero BigInt. It's not a question of what toString does. -0.0 is an IEEE floating-point concept; integers of any kind don't have it.

See our implementation of BigInt negation:

Handle<BigInt> BigInt::UnaryMinus(Isolate* isolate, Handle<BigInt> x) {
  // Special case: There is no -0n.
  if (x->is_zero()) {
    return x;
  }
  Handle<MutableBigInt> result = MutableBigInt::Copy(isolate, x);
  result->set_sign(!x->sign());
  return MutableBigInt::MakeImmutable(result);
}

This patch brings Intl.NumberFormat support to BigInt, and adds a BigInt.prototype.toLocaleString method based on it. The design here is to include overloading between BigInt and Number as arguments for the format and formatToParts methods based on ToNumeric. This means that, for example, string arguments are cast to Number, rather than BigInt. This design preserves compatibility and consistency with operators like unary - This definition permits options in the NumberFormat to force decimal places, e.g., 1n formatting as 1.00000 if the minimum fractional digits is 5. Alternative semantics would be to throw an exception in this case. For the algorithm text itself: the specification algorithms ToRawPrecision and ToRawFixed are now used for both Numbers and BigInts. Given the ECMAScript specification's use of implicit coercisions between Numbers and mathematical values, I believe that this is valid without any special changes; the phrasing may change in the future [1]. ICU4C-based implementations of ECMAScript can use LocalizedNumberFormatter::formatDecimal [2] or unum_formatDecimal [3] to implement the algorithms in this patch. [1] tc39/ecma262#1135 [2] http://icu-project.org/apiref/icu4c/classicu_1_1number_1_1LocalizedNumberFormatter.html#a29cd3d107b784496e19175ce0115f26f [3] http://icu-project.org/apiref/icu4c/unum_8h.html#a59870a322f012dc1b9d99cf8a7b708f1 Closes #218

littledan mentioned this issue May 1, 2018

Normative: Support BigInt in NumberFormat and toLocaleString #236

Merged

sffc added c: numbers Component: numbers, currency, units s: in progress Status: the issue has an active proposal labels Mar 19, 2019

leobalter closed this as completed in #236 Jun 13, 2019

littledan mentioned this issue Jan 8, 2020

BigInt + Scale tc39/proposal-decimal#32

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BigInt.prototype.toLocaleString #218

BigInt.prototype.toLocaleString #218

littledan commented Feb 14, 2018

anba commented Mar 6, 2018

littledan commented Mar 6, 2018

anba commented Mar 15, 2018

sffc commented Mar 16, 2018 •

edited

Loading

anba commented Mar 16, 2018

sffc commented Apr 20, 2018

littledan commented Apr 21, 2018

sffc commented Apr 22, 2018

littledan commented Apr 24, 2018

FrankYFTang commented Jan 19, 2019 •

edited

Loading

jakobkummerow commented Jan 19, 2019

FrankYFTang commented Jan 25, 2019

littledan commented Jan 25, 2019

ray007 commented Jan 25, 2019

FrankYFTang commented Jan 25, 2019

jakobkummerow commented Jan 25, 2019

BigInt.prototype.toLocaleString #218

BigInt.prototype.toLocaleString #218

Comments

littledan commented Feb 14, 2018

anba commented Mar 6, 2018

littledan commented Mar 6, 2018

anba commented Mar 15, 2018

sffc commented Mar 16, 2018 • edited Loading

anba commented Mar 16, 2018

sffc commented Apr 20, 2018

littledan commented Apr 21, 2018

sffc commented Apr 22, 2018

littledan commented Apr 24, 2018

FrankYFTang commented Jan 19, 2019 • edited Loading

jakobkummerow commented Jan 19, 2019

FrankYFTang commented Jan 25, 2019

littledan commented Jan 25, 2019

ray007 commented Jan 25, 2019

FrankYFTang commented Jan 25, 2019

jakobkummerow commented Jan 25, 2019

sffc commented Mar 16, 2018 •

edited

Loading

FrankYFTang commented Jan 19, 2019 •

edited

Loading