Speed up Cholesky Jacobi matrices #169

TSGut · 2024-01-21T22:23:56Z

WIP to address #167. It will also fix some docstring issues.

For now, a lot of speed is gained by creating a dedicated getindex method for Symmetric{Clenshaw} and CholeskyJacobiData, as well as by better resizing in cholesky_jacobimatrix.

For weights of low polynomial degree, the method is approaching acceptable speeds again. However, even with Symmetric Clenshaw now as fast as regular Clenshaw, the Clenshaw evaluations are still a major bottleneck:

```julia
julia> P = Normalized(Legendre());
julia> x = axes(P,1);
julia> J = jacobimatrix(P);
julia> wf(x) = (2 .+ x.^4);
julia> W = Symmetric(P \ (wf.(x) .* P));
# raw Clenshaw timing
julia> @time W[1:1000,1:1000];
  0.027092 seconds (18 allocations: 485.281 KiB)
# Cholesky X timing
julia> Jchol = cholesky_jacobimatrix(W, P);
julia> @time Jchol[1:1000,1:1000];
  0.061909 seconds (4.57 k allocations: 8.131 MiB)
```
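For context, the computation being timed can be sketched on a finite truncation using only LinearAlgebra. This is a minimal illustration, not the package's implementation: it assumes the standard relation that if W = w(J) = R'R is the Cholesky factorization of the weight matrix, then the Jacobi matrix for the modified weight is R J R⁻¹. The function names `legendre_jacobi` and `cholesky_jacobi_sketch` are hypothetical, and the weight 2 + x^4 is hard-coded to match the example above.

```julia
using LinearAlgebra

# Leading N×N block of the Jacobi matrix of the orthonormal Legendre polynomials:
# zero diagonal, off-diagonal entries n / sqrt(4n^2 - 1).
legendre_jacobi(N) = SymTridiagonal(zeros(N), [n / sqrt(4n^2 - 1) for n in 1:N-1])

# Hypothetical helper (not part of the package): leading block of the Jacobi
# matrix for the weight w(x) = 2 + x^4 relative to the Legendre weight.
function cholesky_jacobi_sketch(N)
    M = N + 4                          # pad by the degree so truncation does not pollute the N×N block
    J = Matrix(legendre_jacobi(M))
    W = (2I + J^4)[1:N, 1:N]           # leading block of w(J)
    R = cholesky(Symmetric(W)).U       # W = R'R with R upper triangular
    Jw = R * J[1:N, 1:N] / R           # R J R^{-1}
    return Jw[1:N-1, 1:N-1]            # the last row and column are affected by truncation
end

Jw = cholesky_jacobi_sketch(50)
```

Up to rounding, the result should be symmetric and tridiagonal, which is a quick sanity check that the truncation sizes are consistent. The dense matrices here cost far more than the banded, lazily cached approach the PR works on, so the sketch says nothing about the timings above.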

Of course, for high polynomial degree the Clenshaw bottleneck causes major slowdowns, although there are clearly other things to optimize as well:

```julia
julia> r = 0.5;
julia> lmin, lmax = (1-sqrt(r))^2,  (1+sqrt(r))^2;
julia> P = Normalized(chebyshevu(lmin..lmax));
julia> x = axes(P,1);
julia> J = jacobimatrix(P);
julia> wf(x) = 1/(x*r);
# raw Clenshaw timing
julia> W = Symmetric(P \ (wf.(x) .* P));
julia> @time W[1:1000,1:1000];
  3.562982 seconds (18 allocations: 10.764 MiB)
# Cholesky X timing
julia> Jchol = cholesky_jacobimatrix(W, P);
julia> @time Jchol[1:1000,1:1000];
  5.413114 seconds (4.60 k allocations: 19.039 MiB)
```
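To illustrate why the cost grows with the degree of the weight, here is a minimal dense sketch of a Clenshaw recurrence with a matrix argument. It is an assumption-laden illustration rather than the code in src/clenshaw.jl: it uses the Chebyshev T recurrence for simplicity (the examples above use Legendre and Chebyshev U bases), ignores bandedness, and `clenshaw_chebyshevt` is a hypothetical name.

```julia
using LinearAlgebra

# Evaluate sum_k c[k+1] * T_k(X) for a square matrix X, where T_k are
# Chebyshev polynomials of the first kind, using the Clenshaw recurrence
#   B_k = c_k I + 2 X B_{k+1} - B_{k+2},   f(X) = c_0 I + X B_1 - B_2.
function clenshaw_chebyshevt(c::AbstractVector, X::AbstractMatrix)
    n = size(X, 1)
    Id = Matrix{Float64}(I, n, n)
    B1 = zeros(n, n)   # holds B_{k+1}
    B2 = zeros(n, n)   # holds B_{k+2}
    for k in length(c):-1:2
        # one matrix product per coefficient
        B1, B2 = c[k] * Id + 2 * X * B1 - B2, B1
    end
    return c[1] * Id + X * B1 - B2
end

# Small check against the explicit polynomials T_0, ..., T_3.
X = Matrix(SymTridiagonal(zeros(5), fill(0.5, 4)))
c = [1.0, 0.5, 0.25, 0.125]
direct = c[1] * I + c[2] * X + c[3] * (2X^2 - I) + c[4] * (4X^3 - 3X)
@assert clenshaw_chebyshevt(c, X) ≈ direct
```

In this sketch each coefficient costs one matrix product, so a weight that needs many coefficients for an accurate expansion does proportionally more work than a low-degree polynomial weight.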

Todo list:

- While symmetric and non-symmetric Clenshaw are now equal in speed, Clenshaw evaluation is still very slow and accounts for 50%+ of the compute time spent in cholesky_jacobimatrix.
- Make other optimizations and reduce allocations.
- Also make the corresponding changes for the QR method once Cholesky is satisfactory.
- Add tests.

TSGut · 2024-01-22T07:49:42Z

OK, as I said, based on profiling and benchmarking, the remaining cost is in evaluating the Clenshaw matrix and performing its Cholesky decompositions.

(I still need to add tests and make the existing tests pass, so this is not ready for merge yet.)

@dlfivefifty How do you want to proceed with this? Is there hope for speeding up Clenshaw or the Cholesky of a Clenshaw? If not, then I think we are bottlenecked at the speed of this PR.

To be fair, the computation is fast for weight modifications of low to medium degree, just not for high degree, where Clenshaw starts to struggle. We can compute a 1,000,000 × 1,000,000 block of the Jacobi matrix in under a second for a degree-2 weight:

```julia
julia> P = Normalized(Legendre());
julia> x = axes(P,1);
julia> J = jacobimatrix(P);
julia> wf2(x) = (2 .+ x.^2);
julia> W2 = Symmetric(P \ (wf2.(x) .* P));
julia> Jchol2 = cholesky_jacobimatrix(W2, P);
julia> @time Jchol2[1:1000000,1:1000000];
  0.762198 seconds (16.00 M allocations: 1.103 GiB, 5.34% gc time)
```

Barring Clenshaw and Cholesky speed-ups, I suspect implementing the rational case is the next best thing.

dlfivefifty · 2024-01-22T12:07:09Z

That speed looks fine

codecov · 2024-01-23T06:12:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (5256e8e) 92.70% compared to head (148429d) 92.68%.

Additional details and impacted files

```diff
@@            Coverage Diff             @@
##             main     #169      +/-   ##
==========================================
- Coverage   92.70%   92.68%   -0.03%
==========================================
  Files          17       17
  Lines        1864     1886      +22
==========================================
+ Hits         1728     1748      +20
- Misses        136      138       +2
```

TSGut · 2024-01-23T07:20:36Z

@dlfivefifty Ok this is ready for merge or review. The patch coverage is 100%.

I have also bumped the version number, so it would be good to tag after merging. I will then make sure SemiclassicalOPs still works as intended.

TSGut added 4 commits January 21, 2024 14:01

symmetric clenshaw getindex + better resizing

a73ec41

adjustments to caching + doc improvements

f4fd642

simplify

fd68e69

minor change

9d2c12f

TSGut added 3 commits January 22, 2024 21:45

fix a bunch of tests

51e59e9

bugfix

0e1930a

another bugfix

8ff7898

TSGut added 2 commits January 22, 2024 22:14

increase coverage

50bfe48

Update Project.toml

ba6c277

TSGut changed the title ~~WIP: Speed up Cholesky Jacobi matrices~~ Speed up Cholesky Jacobi matrices Jan 23, 2024

TSGut requested a review from dlfivefifty January 27, 2024 07:06

dlfivefifty requested changes Feb 7, 2024

Three review comments on src/clenshaw.jl (all outdated and resolved).

dlfivefifty added 3 commits February 7, 2024 09:39

Update src/clenshaw.jl

9b12a93

Update src/clenshaw.jl

346e595

Update src/clenshaw.jl

148429d

dlfivefifty approved these changes Feb 7, 2024

dlfivefifty merged commit d3d03ef into JuliaApproximation:main Feb 7, 2024
