speed up Ryu.pow5 #46764

oscardssmith · 2022-09-14T16:28:38Z

The current algorithm does up to p divisions, while this algorithm does only 1. I've assigned @quinnj to review since I'm not 100% sure that there isn't a very good reason that this function isn't written like this.

Seelengrab · 2022-09-15T06:16:06Z

I'm not 100% sure that there isn't a very good reason that this function isn't written like this.

My guess would be overflow protection for sufficiently large p? Does wraparound prevent any shenanigans that could happen due to that?

quinnj

Most of this was a straight port of the code in this repo, so there are probably ways to simplify/clean up (I think @simonbyrne did some of that initially when reviewing).

oscardssmith · 2022-09-15T18:21:09Z

Do you know if this needs an overflow check?

quinnj · 2022-09-15T18:53:48Z

I don't off the top of my head

KristofferC · 2022-09-22T11:11:27Z

Maybe run a PkgEval and merge if it looks ok?

oscardssmith · 2022-09-22T14:37:13Z

It feels kind of silly to use pkgeval for something this small, but I guess we might as well.

KristofferC · 2022-09-22T14:44:32Z

Or just merge, tests pass after all and I guess we will find it in release PkgEval if it is too bad

quinnj · 2022-10-06T06:22:15Z

#yolo

StefanKarpinski · 2022-10-27T17:00:38Z

Felt I should look at this since printing numbers incorrectly would be bad. This use of pow5 seems safe:

julia/base/ryu/shortest.jl

Lines 58 to 66 in f71b839

    
           if q <= qinvbound(T) 
        
               if ((v % UInt32) - 5 * div(v, 5)) == 0 
        
                   b_allzero = pow5(v, q) 
        
               elseif mf_iseven 
        
                   a_allzero = pow5(u, q) 
        
               else 
        
                   c -= pow5(w, q) 
        
               end 
        
           end

My reasoning is that 5^p is correct for p ≤ 27 and here we have that q ≤ qinvbound(T) defined here:

julia/base/ryu/utils.jl

Lines 17 to 19 in f71b839

    
           qinvbound(::Type{Float16}) = 4 
        
           qinvbound(::Type{Float32}) = 9 
        
           qinvbound(::Type{Float64}) = 21

You can see that q can be at most 21, so it's safe. The other use of pow5 is less clear:

julia/base/ryu/exp.jl

Lines 152 to 159 in f71b839

    
           rexp = precision - e 
        
           requiredTwos = -e2 - rexp 
        
           trailingZeros = requiredTwos <= 0 || 
        
               (requiredTwos < 60 && pow2(m2, requiredTwos)) 
        
           if rexp < 0 
        
               requiredFives = -rexp 
        
               trailingZeros = trailingZeros & pow5(m2, requiredFives) 
        
           end

Does anyone know what the range of possible values for precision and e are here? If e can be more than 27 greater than precision then we might have a problem with this change.

Follows up #46764

StefanKarpinski · 2022-11-08T15:34:48Z

Bump: @oscardssmith, @quinnj do you know about the range of values that e can take here?

oscardssmith · 2022-11-08T16:21:09Z

ah it appears that e can possibly be large if someone writes a decimal with a ton of digits.

KristofferC · 2022-11-08T17:02:40Z

Do you have an example?

LilithHafner · 2022-11-09T13:27:46Z

julia> using Printf

julia> @printf "%.8g" 4.645833859177319e63 # master
4.6458338e+63

julia> @printf "%.8g" 4.645833859177319e63 # 1.8
4.6458339e+63

There are two ways that pow5 can be wrong,

returning false when m2 is divisible by big(5)^requiredFives
returning true when m2 is not divisible by big(5)^requiredFives
In the first case, for pow5 to be wrong, big(5)^requiredFives must be greater than typemax(Int), but m2 is always less than 1<<53, and a small number is not divisible by a large number. Thus m2 will never be divisible by the true big(5)^requiredFives when there is overflow and case 1 is okay.

For the second case, we need to find a requiredFives such that pow5 may wrongly return true. That is, find a requiredFives such that there exists m2 where m2 % 5^requiredFives == 0 For this to be the case, either m2 must be 0 (handled much earlier as a special case) or abs(5^requiredFives) <= m2 < 1<<53. Searching exhaustively from the cases that have overflow, findfirst(requiredFives -> abs(5^requiredFives) < 1<<53, 28:10^5)+27 == 55. Note that 5^55 < 0. If, on the other hand, we used unsinged(5)^requiredFives, then we'd have findfirst(requiredFives -> unsigned(5)^requiredFives < 1<<53, 28:10^5)+27 == 1048 and IIUC requiredFives is at most log10(floatmax(Float64)) == 308.25471555991675.

So using unsigned(5) should fix this and I benchmark it as comparable to signed exponentiation.

Fixup for #46764

Fixup for #46764 (cherry picked from commit 02aa0b0)

speed up Ryu.pow5

54ee9ef

oscardssmith requested a review from quinnj September 14, 2022 16:28

quinnj approved these changes Sep 15, 2022

View reviewed changes

oscardssmith added the performance Must go faster label Sep 15, 2022

quinnj merged commit ce04b75 into master Oct 6, 2022

quinnj deleted the oscardssmith-faster-ryu-pow5 branch October 6, 2022 06:22

LilithHafner added a commit that referenced this pull request Nov 4, 2022

Make tiny function definition inline (style)

1dab2bd

Follows up #46764

LilithHafner mentioned this pull request Nov 4, 2022

Make tiny Ryu.pow5 function definition inline (style) #47446

Merged

LilithHafner added a commit that referenced this pull request Nov 4, 2022

Make tiny function definition inline (style) (#47446)

4053f69

Follows up #46764

LilithHafner mentioned this pull request Nov 9, 2022

Fix overflow in pow5 #47511

Merged

LilithHafner added a commit that referenced this pull request Nov 26, 2022

Fix overflow in pow5 (#47511)

02aa0b0

Fixup for #46764

KristofferC pushed a commit that referenced this pull request Nov 28, 2022

Fix overflow in pow5 (#47511)

24505fc

Fixup for #46764 (cherry picked from commit 02aa0b0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speed up Ryu.pow5 #46764

speed up Ryu.pow5 #46764

oscardssmith commented Sep 14, 2022

Seelengrab commented Sep 15, 2022

quinnj left a comment

oscardssmith commented Sep 15, 2022

quinnj commented Sep 15, 2022

KristofferC commented Sep 22, 2022

oscardssmith commented Sep 22, 2022

KristofferC commented Sep 22, 2022

quinnj commented Oct 6, 2022

StefanKarpinski commented Oct 27, 2022

StefanKarpinski commented Nov 8, 2022

oscardssmith commented Nov 8, 2022

KristofferC commented Nov 8, 2022

LilithHafner commented Nov 9, 2022

speed up Ryu.pow5 #46764

speed up Ryu.pow5 #46764

Conversation

oscardssmith commented Sep 14, 2022

Seelengrab commented Sep 15, 2022

quinnj left a comment

Choose a reason for hiding this comment

oscardssmith commented Sep 15, 2022

quinnj commented Sep 15, 2022

KristofferC commented Sep 22, 2022

oscardssmith commented Sep 22, 2022

KristofferC commented Sep 22, 2022

quinnj commented Oct 6, 2022

StefanKarpinski commented Oct 27, 2022

StefanKarpinski commented Nov 8, 2022

oscardssmith commented Nov 8, 2022

KristofferC commented Nov 8, 2022

LilithHafner commented Nov 9, 2022