ARROW-10600: [Go] Implement Decimal256 #13792

zeroshade · 2022-08-03T21:38:38Z

No description provided.

github-actions · 2022-08-03T21:39:00Z

https://issues.apache.org/jira/browse/ARROW-10600

pitrou

I didn't go through everything, assuming much of the code here is copied and adapted from decimal128?

go/arrow/datatype_fixedwidth.go

pitrou · 2022-08-04T15:23:32Z

go/arrow/decimal256/decimal256.go

+func fromPositiveFloat32(v float32, prec, scale int32) (Num, error) {
+	var pscale float32
+	if scale >= -76 && scale <= 76 {
+		pscale = float32PowersOfTen[scale+76]


Wouldn't it be better to cast v to float64 and then call fromPositive64? You would 1) remove a lot of code 2) get better precision (float32PowersOfTen contains some zeros and infinities for very small or large scales)

So i tried that originally when I was writing the ToFloat functionalities for decimal128 and found (very) small precision issues which I assumed would carry through to the From functions. though right now trying this locally I can't reproduce those precision issues, maybe it was a windows thing? I don't know. That said, given that those precision issues would be outside the bounds of what is considered even reasonable for a float, I agree with you that it's probably just easier/better to cast to float64 and use the fromPositiveFloat64 / tofloat64Positive functions and remove the excess code. I'll go do that.

go/arrow/decimal256/decimal256.go

pitrou · 2022-08-04T15:31:28Z

go/arrow/decimal256/decimal256.go

+	if n == (Num{}) {
+		return 0
+	}
+	return int(1 | (int64(n.arr[3]) >> 63))


Is the purpose here to avoid branching? (otherwise why not a simple conditional?)

yes, the purpose here was to minimize branching as this could be potentially called frequently. I actually pulled this from code in Go's internals

go/arrow/decimal256/decimal256.go

pitrou · 2022-08-04T15:40:48Z

go/arrow/scalar/scalar.go

+
+	switch to.ID() {
+	case arrow.DECIMAL256:
+		return NewDecimal256Scalar(s.Value, to), nil


I think this is missing some casting of s.Value in case the scales don't match.
And even if the scales match, should probably check that the value still fits in the new precision.

fixed this, calling Rescale and FitsInPrecision

pitrou · 2022-08-04T15:41:22Z

go/arrow/scalar/scalar.go

@@ -297,6 +298,8 @@ func (s *Decimal128) CastTo(to arrow.DataType) (Scalar, error) {
 	switch to.ID() {
 	case arrow.DECIMAL128:
 		return NewDecimal128Scalar(s.Value, to), nil
+	case arrow.DECIMAL256:
+		return NewDecimal256Scalar(decimal256.FromDecimal128(s.Value), to), nil


Same as below: should take into account differences in scale or precision.

fixed calling Rescale and FitsInPrecision

go/arrow/array/decimal256_test.go

Co-authored-by: Antoine Pitrou <[email protected]>

zeroshade · 2022-08-04T19:49:48Z

I'm gonna bug you again @wolfeidau to have a look as we still lack many Go developers here who can take a look at these. 😄 After this, the union PR and one more PR afterwards (implementing unions in the IPC handling) Go will officially support all of the Arrow data types!

wolfeidau

Again only one small observation.

Happy to do what i can to review this, great work getting this last type working!

wolfeidau · 2022-08-05T05:34:24Z

go/arrow/array/decimal256.go

+// all values in v are appended and considered valid.
+func (b *Decimal256Builder) AppendValues(v []decimal256.Num, valid []bool) {
+	if len(v) != len(valid) && len(valid) != 0 {
+		panic("len(v) != len(valid) && len(valid) != 0")


Is this missing the convention i have seen with the message being prefixed by "arrow: "?

good point! I've added the prefixes.

Thanks for the reviews! technically Go has better Type support than C++ now! Since C++ doesn't support Float16 lol!

pitrou · 2022-08-05T06:33:52Z

There seem to be a lot of Go CI failures suddenly.

zeroshade · 2022-08-05T14:21:07Z

@pitrou Yea, when I merged the Union type PR it added new requirements to the Builder types which I hadn't implemented in the Decimal256Builder so the build failed. I just had to add the new required interface methods to the Decimal256Builder and now all should be good.

ursabot · 2022-08-05T18:41:40Z

Benchmark runs are scheduled for baseline = 6a3fb97 and contender = 6d1bc62. 6d1bc62 is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Finished ⬇️0.0% ⬆️0.0%] ec2-t3-xlarge-us-east-2
[Failed ⬇️0.14% ⬆️0.71%] test-mac-arm
[Finished ⬇️0.0% ⬆️0.0%] ursa-i9-9960x
[Finished ⬇️0.39% ⬆️0.92%] ursa-thinkcentre-m75q
Buildkite builds:
[Finished] 6d1bc624 ec2-t3-xlarge-us-east-2
[Failed] 6d1bc624 test-mac-arm
[Finished] 6d1bc624 ursa-i9-9960x
[Finished] 6d1bc624 ursa-thinkcentre-m75q
[Finished] 6a3fb97a ec2-t3-xlarge-us-east-2
[Finished] 6a3fb97a test-mac-arm
[Finished] 6a3fb97a ursa-i9-9960x
[Finished] 6a3fb97a ursa-thinkcentre-m75q
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

ursabot · 2022-08-05T18:41:54Z

['Python', 'R'] benchmarks have high level of regressions.
test-mac-arm

zeroshade added 2 commits August 3, 2022 16:05

first pass of decimal256

76d2319

ipc, arrjson and so on

6d02ecb

zeroshade requested review from amol-, andygrove, kszucs, pitrou and emkornfield August 3, 2022 21:38

github-actions bot added Component: Documentation Component: Go labels Aug 3, 2022

forgot to add the bitWidth to the expected JSON

aa4cfa9

pitrou reviewed Aug 4, 2022

View reviewed changes

zeroshade and others added 5 commits August 4, 2022 12:09

Update go/arrow/datatype_fixedwidth.go

f85b07b

Co-authored-by: Antoine Pitrou <[email protected]>

Update go/arrow/datatype_fixedwidth.go

b6e95e7

Co-authored-by: Antoine Pitrou <[email protected]>

changes from review feedback

21aef6d

check FitsInPrecision when casting scalars

c47ef3e

default to decimal128 when missing bitWidth in JSON integration test

45baa6d

zeroshade added 2 commits August 4, 2022 20:47

Merge branch 'master' into arrow-10600-decimal256

bd44686

add missing AppendEmptyValue

d5df467

wolfeidau reviewed Aug 5, 2022

View reviewed changes

zeroshade added 2 commits August 5, 2022 10:19

add Type method to decimal256

0c69de0

error conventions

71312b7

no more need for unsupportedArrayType

fe152cc

zeroshade merged commit 6d1bc62 into apache:master Aug 5, 2022

zeroshade deleted the arrow-10600-decimal256 branch August 5, 2022 16:28

k-anshul mentioned this pull request Aug 28, 2023

[Go] Arrow to parquet conversion fails for arrow.DECIMAL256 type #37419

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-10600: [Go] Implement Decimal256 #13792

ARROW-10600: [Go] Implement Decimal256 #13792

zeroshade commented Aug 3, 2022

github-actions bot commented Aug 3, 2022

pitrou left a comment

pitrou Aug 4, 2022

zeroshade Aug 4, 2022

pitrou Aug 4, 2022

zeroshade Aug 4, 2022

pitrou Aug 4, 2022

zeroshade Aug 4, 2022

pitrou Aug 4, 2022

zeroshade Aug 4, 2022

zeroshade commented Aug 4, 2022

wolfeidau left a comment

wolfeidau Aug 5, 2022

zeroshade Aug 5, 2022

pitrou commented Aug 5, 2022

zeroshade commented Aug 5, 2022

ursabot commented Aug 5, 2022

ursabot commented Aug 5, 2022

ARROW-10600: [Go] Implement Decimal256 #13792

ARROW-10600: [Go] Implement Decimal256 #13792

Conversation

zeroshade commented Aug 3, 2022

github-actions bot commented Aug 3, 2022

pitrou left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zeroshade commented Aug 4, 2022

wolfeidau left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pitrou commented Aug 5, 2022

zeroshade commented Aug 5, 2022

ursabot commented Aug 5, 2022

ursabot commented Aug 5, 2022