Add factorize!() method for BunchKaufman #38

MichelJuillard · 2024-01-21T14:34:47Z

The factorize!() method is missing for BunchKaufman (see https://discourse.julialang.org/t/ann-fastlapackinterface-jl-v1-0-0-non-allocating-lapack-factorizations/83354/10?u=micheljuillard).
Also check the other methods.


RoyiAvital · 2024-01-21T18:33:06Z

Appreciate the support.
Will it support using MKL.jl and Accelerate.jl?

MichelJuillard · 2024-01-22T07:31:45Z

I will test for MKL. I don't have the hardware to test for Accelerate. @RoyiAvital could you do that?

RoyiAvital · 2024-01-22T13:14:48Z

I will. Let me know when it is ready.

MichelJuillard · 2024-01-27T15:38:12Z

@RoyiAvital Sorry, I got confused with the actual syntax

factorize!() returns a tuple with the sytrf!() output, not the factorization returned by BunchKaufman(). It is therefore still necessary to call BunchKaufman() after factorize!().
The shorter syntax factorize!(ws, Symmetric(x)) allocates more than factorize!(ws, 'U', x).
In the end, for Bunch-Kaufman, there is little difference between using factorize!() and the lower-level direct call to LAPACK.sytrf!().
Here is a working example with both approaches:

using FastLapackInterface
using LinearAlgebra

function loop_1!(vXs, mCs, ws)
    for mC in mCs    
        # factorization
        F1 = factorize!(ws, 'U', mC)
        F = BunchKaufman(mC, F1[2], 'U', true, false, BLAS.BlasInt(0))    
        # solving linear systems
        for vX in vXs
            ldiv!(F, vX)
        end 
    end
end

function approach1(mCs, vXs)
    # create workspace
    ws = Workspace(LAPACK.sytrf!, mCs[1])
    mCs_1 = deepcopy(mCs)
    vXs_1 = deepcopy(vXs)
    loop_1!(vXs_1, mCs_1, ws)
    mCs_1 = deepcopy(mCs)
    vXs_1 = deepcopy(vXs)
    @time loop_1!(vXs_1, mCs_1, ws)
end    

function loop_2!(vXs, mCs, ws)
    for mC in mCs    
        # factorization    
        A, ipiv, info = LAPACK.sytrf!(ws, 'U', mC)        
        F = BunchKaufman(mC, ipiv, 'U', true, false, BLAS.BlasInt(0))
        # solving linear systems
        for vX in vXs
            ldiv!(F, vX)
        end 
    end
end

function approach2(mCs, vXs)
    # create workspace
    ws = BunchKaufmanWs(mCs[1])
    
    mCs_1 = deepcopy(mCs)
    vXs_1 = deepcopy(vXs)
    loop_2!(vXs_1, mCs_1, ws)
    mCs_1 = deepcopy(mCs)
    vXs_1 = deepcopy(vXs)
    @time loop_2!(vXs_1, mCs_1, ws)
end    

order = 100
iterations = 10

mCs = []
vXs = []
for i = 1:iterations
    mC = hermitianpart!(randn(order, order)).data
    push!(mCs, mC)
    push!(vXs, randn(order))
end

approach1(mCs, vXs)
approach2(mCs, vXs)

It works with MKL (but seems slower than OpenBLAS). Could you please try it with Accelerate?
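For readers without FastLapackInterface installed, the point made in this comment (sytrf!() returns a tuple, and the BunchKaufman object has to be built from it) can be sketched with the standard library alone. This is only an illustration under stated assumptions: it uses the allocating LAPACK.sytrf!('U', A) from LinearAlgebra rather than the workspace variant discussed here, and the 3x3 matrix and right-hand side are made-up test data.

```julia
using LinearAlgebra

# Hypothetical test data: a small symmetric indefinite matrix and a right-hand side.
A = [4.0  1.0  2.0;
     1.0 -3.0  0.0;
     2.0  0.0  5.0]
b = [1.0, 2.0, 3.0]

# sytrf! overwrites its input, so factorize a copy.
A_fact = copy(A)

# The stdlib call returns a tuple (factored matrix, pivots, info),
# not a BunchKaufman object.
_, ipiv, info = LAPACK.sytrf!('U', A_fact)

# Wrap the raw output: uplo = 'U', symmetric = true, rook = false.
F = BunchKaufman(A_fact, ipiv, 'U', true, false, info)

# Solve A * x = b in place; x starts as a copy of b and is overwritten.
x = copy(b)
ldiv!(F, x)

residual = norm(A * x - b)
println("info = ", info, ", residual = ", residual)
```

The workspace version in the example above follows the same three steps; the difference is that the pivot and work buffers come from ws instead of being allocated on every call.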

RoyiAvital · 2024-01-27T15:46:22Z

Do these lines allocate?

F1 = factorize!(ws, 'U', mC)
F = BunchKaufman(mC, F1[2], 'U', true, false, BLAS.BlasInt(0))  
ldiv!(F, vX)

If not, this is perfect.

I will test on Accelerate.jl and report, no problem.

MichelJuillard · 2024-01-27T15:53:42Z

It still allocates, for a reason that I don't understand, but very little. The amount doesn't depend on the size of the matrix.

RoyiAvital · 2024-01-27T16:00:31Z

I assume F = BunchKaufman(mC, F1[2], 'U', true, false, BLAS.BlasInt(0)) is the allocating line, right?

MichelJuillard · 2024-01-27T17:23:24Z

F1 = factorize!(ws, 'U', mC)

allocates 64 bytes per iteration

F = BunchKaufman(mC, F1[2], 'U', true, false, BLAS.BlasInt(0))

allocates 48 bytes per iteration
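For comparison, here is a rough sketch of how per-call allocations can be measured with @allocated, applied to the standard-library LAPACK.sytrf!('U', A), which creates its pivot and work buffers on every call. The helper names symmetric_matrix and sytrf_bytes are hypothetical, and the byte counts depend on the Julia version and BLAS backend, so none are quoted here. The expectation is only that the stdlib figure grows with the matrix size, unlike the small constant amounts reported above for the workspace version.

```julia
using LinearAlgebra

# Hypothetical helper: build a symmetric test matrix of the given size.
function symmetric_matrix(n)
    A = randn(n, n)
    return (A + A') / 2
end

# Hypothetical helper: bytes allocated by one stdlib factorization call,
# measured after a warm-up call so that compilation is not counted.
function sytrf_bytes(n)
    LAPACK.sytrf!('U', symmetric_matrix(n))   # warm-up
    A = symmetric_matrix(n)
    return @allocated LAPACK.sytrf!('U', A)
end

small = sytrf_bytes(10)
large = sytrf_bytes(200)
println("n = 10:  ", small, " bytes")
println("n = 200: ", large, " bytes")
```

Wrapping the same measurement around factorize!(ws, 'U', mC) and the BunchKaufman(...) constructor, one call at a time, is one way to attribute the remaining bytes to a specific line.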
