Feat libfasttransforms #75

MikaelSlevinsky · 2019-08-30T22:37:59Z

This PR adds a Julia wrapper around the C library of the same name. It also removes the Julia code that the wrapper makes redundant. The 26 exported transforms are:

julia> FastTransforms.kind2string.(0:25)
26-element Array{String,1}:
 "Legendre--Chebyshev"                                
 "Chebyshev--Legendre"                                
 "ultraspherical--ultraspherical"                     
 "Jacobi--Jacobi"                                     
 "Laguerre--Laguerre"                                 
 "Jacobi--ultraspherical"                             
 "ultraspherical--Jacobi"                             
 "Jacobi--Chebyshev"                                  
 "Chebyshev--Jacobi"                                  
 "ultraspherical--Chebyshev"                          
 "Chebyshev--ultraspherical"                          
 "Spherical harmonic--Fourier"                        
 "Spherical vector field--Fourier"                    
 "Zernike--Chebyshev×Fourier"                         
 "Proriol--Chebyshev²"                                
 "Proriol--Chebyshev³"                                
 "FFTW Fourier synthesis on the sphere"               
 "FFTW Fourier analysis on the sphere"                
 "FFTW Fourier synthesis on the sphere (vector field)"
 "FFTW Fourier analysis on the sphere (vector field)" 
 "FFTW Chebyshev×Fourier synthesis on the disk"       
 "FFTW Chebyshev×Fourier analysis on the disk"        
 "FFTW Chebyshev synthesis on the triangle"           
 "FFTW Chebyshev analysis on the triangle"            
 "FFTW Chebyshev synthesis on the tetrahedron"        
 "FFTW Chebyshev analysis on the tetrahedron"

Each of them creates a parameterized FTPlan. The first 11 support both the standard normalization and orthonormalization, selected via Bools.

Surprisingly, the Linux and macOS builds succeed. The build uses BinaryProvider to query the user's version of Homebrew/apt gcc (via detect_compiler_abi()), then downloads the matching pre-compiled binary from the v0.2.6 release (https://github.com/MikaelSlevinsky/FastTransforms/releases/tag/v0.2.6).
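The selection step can be sketched roughly as follows. This is only an illustration: the asset naming scheme and the list of supported gcc versions are hypothetical, and the gcc major version is passed in directly rather than read from BinaryProvider, so the snippet runs with no packages installed. Only the release tag and the standard GitHub release-download URL layout are taken as given.

```julia
# Sketch only: map a detected gcc major version to a release asset URL.
# In a real build script the version would come from
# BinaryProvider.detect_compiler_abi(); here it is an ordinary argument.

const RELEASE_TAG = "v0.2.6"
const RELEASE_BASE = "https://github.com/MikaelSlevinsky/FastTransforms/releases/download"

# Hypothetical: gcc major versions assumed to have pre-compiled binaries.
const SUPPORTED_GCC = 4:9

# Hypothetical asset naming scheme; the actual file names may differ.
function asset_name(gcc_major::Integer, os::Symbol)
    gcc_major in SUPPORTED_GCC || error("no pre-compiled binary for gcc $(gcc_major)")
    ext = os === :macos ? "dylib" :
          os === :linux ? "so" : error("unsupported OS: $(os)")
    return "libfasttransforms.$(RELEASE_TAG).gcc-$(gcc_major).$(ext)"
end

# GitHub serves release assets under <repo>/releases/download/<tag>/<asset>.
asset_url(gcc_major::Integer, os::Symbol) =
    string(RELEASE_BASE, "/", RELEASE_TAG, "/", asset_name(gcc_major, os))

println(asset_url(8, :macos))
```

Keeping the lookup a pure function of (gcc version, OS) makes it easy to test without touching the network or the compiler.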

Windows support will have to be dropped to Tier 3, and all other platforms and non-x86_64 chips to Tier 4.

TODO:

- Full use of BinaryBuilder and BinaryProvider. It would be helpful if BinaryBuilder worked on macOS. Help with this would be appreciated.
- Update documentation. Perhaps the best would be to refer to the C documentation (which also needs work).
- Export tetrahedral transform.

MikaelSlevinsky · 2019-08-30T22:45:03Z

One niche, but really cool, improvement is to the multi-precision transforms. Previously, these were only available through the Toeplitz--Hankel transforms, which were slow:

julia> begin
    @time p = FastTransforms.th_leg2chebplan(BigFloat, 1000)
    @time x = rand(BigFloat, 1000)
    @time p*x
end;
3.462209 seconds (50.44 M allocations: 2.628 GiB, 26.64% gc time)
0.000181 seconds (2.01 k allocations: 117.625 KiB)
68.177021 seconds (902.38 M allocations: 46.999 GiB, 35.17% gc time)

compared with the direct mpfr_t routines from C:

julia> begin
    @time p = plan_leg2cheb(BigFloat, 1000)
    @time x = rand(BigFloat, 1000)
    @time p*x
end;
0.321481 seconds (5 allocations: 192 bytes)
0.000196 seconds (2.01 k allocations: 117.625 KiB)
0.019480 seconds (5.02 k allocations: 409.250 KiB)
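To put the two runs side by side, the ratios can be computed directly from the printed timings. This is just arithmetic on the single-run numbers above; they are not averaged benchmarks, and the variable names are illustrative.

```julia
# Timings in seconds, copied from the two @time runs above (n = 1000, BigFloat).
th_plan, th_exec = 3.462209, 68.177021   # Toeplitz--Hankel plan
c_plan,  c_exec  = 0.321481, 0.019480    # plan_leg2cheb via the C mpfr_t routines

plan_speedup = th_plan / c_plan
exec_speedup = th_exec / c_exec

println("planning speedup:  ", round(plan_speedup, digits = 1), "x")
println("execution speedup: ", round(exec_speedup, sigdigits = 2), "x")
```

On these numbers, planning is roughly 11 times faster and execution roughly 3500 times faster.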

MikaelSlevinsky · 2019-08-30T23:12:15Z

The main reason for this PR is that transforms are essentially imperative. Writing them in Julia was good for experimental purposes, but Julia's development is more active and volatile than C's.

While a 1024x2047 spherical harmonic transform used to take 4 seconds to plan and 0.6 seconds to execute when I first wrote it, careless syntax changes and compounding performance regressions led to an approximately 100-fold increase in execution time (see #69).

On this branch, we again have something reasonable:

julia> F = sphrandn(Float64, 1024, 2047); # note the change to `sphrandn`. Second integer denotes exact number of columns.

julia> @time G = sph2fourier(F);
  0.118161 seconds (9 allocations: 15.993 MiB)

julia> @time H = fourier2sph(F);
  0.122915 seconds (9 allocations: 15.993 MiB)

So, needless to say, closing #69 will be exciting! CC @AshtonSBradley

EDIT: the advanced interface cuts this in half yet again:

julia> F = sphrandn(Float64, 1024, 2047);

julia> P = plan_sph2fourier(Float64, 1024)
FastTransforms Spherical harmonic--Fourier plan for 1024×2047-element array of Float64

julia> @time lmul!(P, F);
  0.062377 seconds (4 allocations: 160 bytes)

julia> @time ldiv!(P, F);
  0.056915 seconds (4 allocations: 160 bytes)
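The saving comes from applying the plan in place: lmul!(P, F) overwrites F with the forward transform and ldiv!(P, F) overwrites it with the inverse, so no output array is allocated. The pattern can be tried without the package by using a stand-in operator. In the sketch below, an UpperTriangular matrix from the LinearAlgebra standard library plays the role of P; it is only an assumption for illustration and is not a FastTransforms plan.

```julia
using LinearAlgebra

# Stand-in for a plan: any invertible operator that supports in-place
# lmul!/ldiv!. An UpperTriangular matrix is used purely for illustration.
n = 4
P = UpperTriangular([i <= j ? 1.0 / (1 + j - i) : 0.0 for i in 1:n, j in 1:n])

F = [Float64(i + 2j) for i in 1:n, j in 1:3]
F0 = copy(F)                      # keep the original for comparison

lmul!(P, F)                       # "forward transform": overwrites F with P*F
println(F ≈ P * F0)

ldiv!(P, F)                       # "inverse transform": overwrites F with P\F
println(F ≈ F0)                   # the round trip recovers the input
```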

MikaelSlevinsky · 2019-08-30T23:16:43Z

The one performance regression is in plan_leg2cheb and plan_cheb2leg, which take about 5 times longer to plan. Execution time is almost the same. The planning gap can be closed in time.

But what's lost in pre-computation is gained in generality. The new method, no longer the Alpert--Rokhlin scheme, solves triangular banded generalized eigenvalue problems, and thus is applicable to all of the Jacobi--Jacobi transforms and Laguerre--Laguerre transforms as well. Look out for associated OP transforms in a future release!
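For readers unfamiliar with what a Legendre--Chebyshev plan computes, here is a dense O(n²) reference sketch. It is neither the Alpert--Rokhlin scheme nor the triangular banded eigenvalue method used in the C library; it only builds the upper-triangular connection matrix from the standard closed-form coefficients, with Λ(z) = Γ(z + 1/2)/Γ(z + 1), and checks a round trip. All function names are illustrative.

```julia
using LinearAlgebra

# Λ(j) = Γ(j + 1/2) / Γ(j + 1) for integers j = 0, ..., n, via the
# recurrence Λ(0) = √π, Λ(j) = Λ(j-1) * (j - 1/2) / j.
function lambda_table(n::Integer)
    Λ = Vector{Float64}(undef, n + 1)
    Λ[1] = sqrt(π)
    for j in 1:n
        Λ[j + 1] = Λ[j] * (j - 0.5) / j
    end
    return Λ
end

# Dense n×n matrix M with P_j(x) = Σ_k M[k+1, j+1] T_k(x).
# Only entries with j - k even and k ≤ j are nonzero.
function leg2cheb_matrix(n::Integer)
    Λ = lambda_table(n)
    M = zeros(n, n)
    for j in 0:n-1, k in j:-2:0
        c = k == 0 ? 1 / π : 2 / π
        M[k + 1, j + 1] = c * Λ[(j - k) ÷ 2 + 1] * Λ[(j + k) ÷ 2 + 1]
    end
    return UpperTriangular(M)
end

# Direct evaluation of both expansions, used only as a cross-check.
cheb_eval(c, x) = sum(c[k + 1] * cos(k * acos(x)) for k in 0:length(c)-1)

function leg_eval(c, x)
    p0, p1 = 1.0, x                       # P_0 and P_1
    s = c[1] * p0
    for j in 1:length(c)-1
        s += c[j + 1] * p1
        # (j+1) P_{j+1} = (2j+1) x P_j - j P_{j-1}
        p0, p1 = p1, ((2j + 1) * x * p1 - j * p0) / (j + 1)
    end
    return s
end

x = 1.0 ./ (1:10)                         # Legendre coefficients
M = leg2cheb_matrix(length(x))
c = M * x                                 # Chebyshev coefficients
println(norm(M \ c - x))                  # dense round trip
println(abs(cheb_eval(c, 0.3) - leg_eval(x, 0.3)))
```

Both printed values should be at the level of rounding error. The fast plans aim to reproduce the same map without forming M.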

MikaelSlevinsky · 2019-09-07T20:28:50Z

I have come to learn a bit about building and providing binaries. And yet, I've concluded that the user should always have the right to build from source. One compelling reason is that Travis-hosted (cross-)compilation may not turn on all the best optimization flags for one's personal computer. My Mac Pro with AVX-512 would never get to use it!

As well, notwithstanding the issue I've filed, BinaryBuilder.jl comes with compromises. It currently requires all of a binary library's dependencies to be installed as well. This is sub-optimal because a user does not need so many copies of, e.g. OpenBLAS (and especially so on macOS since it can use system BLAS).

Therefore, three build strategies will ultimately be available to the user, selected by an environment variable: build from BinaryBuilder.jl (most reliable; the default, when this works), build from FastTransforms releases (fastest; assumes the user has dependencies from the same package managers), and build from source (best optimization).
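A minimal sketch of how a build script might dispatch on such a variable is given below. The variable name FT_BUILD_STRATEGY and its three accepted values are hypothetical, since the comment above does not name them; only the three strategies and the default are taken from the text.

```julia
# Hypothetical variable name and values; the PR discussion does not name them.
const STRATEGY_VAR = "FT_BUILD_STRATEGY"

# `env` defaults to the process environment; any dictionary-like object works,
# which makes the function easy to test.
function build_strategy(env = ENV)
    choice = lowercase(get(env, STRATEGY_VAR, "binarybuilder"))
    if choice == "binarybuilder"
        return :binarybuilder   # default: most reliable, when it works
    elseif choice == "release"
        return :release         # fastest: pre-compiled binary from FastTransforms releases
    elseif choice == "source"
        return :source          # best optimization: compile locally
    else
        error("unknown $(STRATEGY_VAR) value: $(choice)")
    end
end

println(build_strategy(Dict("FT_BUILD_STRATEGY" => "source")))
println(build_strategy(Dict{String,String}()))
```

Failing loudly on an unrecognized value avoids silently falling back to a strategy the user did not ask for.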

Since one of these already works with Travis CI, it's time to merge.

MikaelSlevinsky added 14 commits September 21, 2018 13:26

trial support of libfasttransforms

ee98f40

no binarybuilder yet

Merge branch 'master' into feat-libfasttransforms

2aa1a6d

tests pass locally!

b967e6f

builds locally on osx!

51177ec

drop 0.7

716cbc5

view directory

cf5b09b

more precise find_library

d645800

whoops!

c3c989e

use download in using to see if test exists

06b6dba

skip find_library

659cb4d

fix warn => @warn,

ccb62a5

change tests

add basic interface

5a9d218

expert interface unchanged Basically: x = 1.0./(1:10) norm(leg2cheb(cheb2leg(x)) - x) is small

expand gcc versions, try apt/homebrew addons with gcc@8

4a0d4d9

why gcc-8? I think that's what's current in julia

avoid upcoming 5-arg mul!

b59f86e

MikaelSlevinsky added 3 commits September 7, 2019 11:42

Merge branch 'master' into feat-libfasttransforms

c4f44ae

add tetrahedral transforms

8cbc6e5

rework readme

89d3e97

MikaelSlevinsky merged commit 8dd3943 into master Sep 7, 2019

MikaelSlevinsky mentioned this pull request Dec 29, 2019

cheb2leg is slow #97

Open

MikaelSlevinsky deleted the feat-libfasttransforms branch October 2, 2020 17:40
