
Add general TensorProduct kernel #81


Merged
devmotion merged 7 commits into JuliaGaussianProcesses:master from tensor on Apr 16, 2020

Conversation

devmotion (Member)

See #80 (comment).

As discussed in the outdated PR #56 (comment), it is still unclear what type of arguments kernelmatrix and kerneldiagmatrix should take, or how to implement this in a general way that supports both data layouts: collections of joint observations across all spaces (e.g., of type Vector{Tuple{Vector{Float64},Vector{Int}}}), or collections of per-space collections of observations (e.g., of type Tuple{Matrix{Float64},Matrix{Int}}).
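For concreteness, the two layouts could look like this (illustrative values only; the types match the ones mentioned above):

    # One entry per observation, bundling the components from all spaces:
    x_joint = [([1.0, 2.0], [1]), ([3.0, 4.0], [2])]   # Vector{Tuple{Vector{Float64},Vector{Int}}}

    # One container per space, holding all observations of that component:
    x_grouped = ([1.0 3.0; 2.0 4.0], [1 2])            # Tuple{Matrix{Float64},Matrix{Int}}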

willtebbutt (Member) commented Apr 15, 2020

It would be good to get a basic implementation off the ground, so it should be fine to make this a Matrix for now, in line with the rest of the package. In the future, when we refactor the input types as per the discussion in #43, we could consider having some custom input type that plays nicely with this type of kernel, e.g. an internal representation consisting of a Vector or Tuple of D Vectors that looks like a Vector of N Vectors from the outside.
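A rough sketch of what such a wrapper could look like (all names here are hypothetical and not part of the package; each observation is returned as a D-tuple rather than a Vector, but the idea is the same):

    # Stores D component vectors internally, but is indexed like a collection of
    # N observations, each returned as a D-tuple of components.
    struct GroupedInputs{T<:Tuple}
        components::T  # Tuple of D Vectors, each of length N
    end

    Base.length(x::GroupedInputs) = length(first(x.components))
    Base.getindex(x::GroupedInputs, i::Integer) = map(c -> c[i], x.components)
    Base.iterate(x::GroupedInputs, i = 1) = i > length(x) ? nothing : (x[i], i + 1)

    x = GroupedInputs(([1.0, 2.0, 3.0], [4, 5, 6]))
    x[2]  # (2.0, 5)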

In an initial implementation, we could just assume 1 dimension per group.

theogf (Member) commented Apr 15, 2020

Realising that we now have quite a collection of kernels that only work with kappa(k, x, y), shouldn't we create generic kernelmatrix and kerneldiagmatrix implementations for these kernels (we could add a trait) that iterate over all columns/rows?
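Something along these lines, as a sketch (function names hypothetical; observations are assumed to be stored in the columns of X, and kappa(k, x, y) is assumed to be defined for the kernel):

    # Generic fallback: iterate over all pairs of observations (columns of X).
    generic_kernelmatrix(k, X::AbstractMatrix) =
        [kappa(k, X[:, i], X[:, j]) for i in 1:size(X, 2), j in 1:size(X, 2)]

    # Diagonal only: one evaluation per observation.
    generic_kerneldiagmatrix(k, X::AbstractMatrix) =
        [kappa(k, X[:, i], X[:, i]) for i in 1:size(X, 2)]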

willtebbutt (Member) commented Apr 15, 2020

> Realising that we now have quite a collection of kernels that only work with kappa(k, x, y), shouldn't we create generic kernelmatrix and kerneldiagmatrix implementations for these kernels (we could add a trait) that iterate over all columns/rows?

I would be up for that in general, but in a separate PR. In this case I think we're best off implementing kernelmatrix and kerneldiagmatrix directly, since they can be written in terms of other calls to kernelmatrix and kerneldiagmatrix, and that is what is going to be most efficient in general.

theogf (Member) commented Apr 15, 2020

Ok, I will take care of this tomorrow then. This could also be a good start for tackling the "my data is not a matrix" problem.

devmotion (Member Author)

> In an initial implementation, we could just assume 1 dimension per group.

So you suggest implementing kernelmatrix and kerneldiagmatrix for Matrix-like inputs, where columns (or rows, depending on obsdim) with n elements represent observations of n groups that are handled by n different kernels?

> In this case I think we're best off implementing kernelmatrix and kerneldiagmatrix directly, since they can be written in terms of other calls to kernelmatrix and kerneldiagmatrix

Intuitively, I would say in the setting described above it's more efficient to fill the matrix by evaluating kappa(::TensorProduct, obs_i, obs_j) for each pair of observations (exploiting symmetry as well, I guess) instead of multiplying together n different matrices?
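For illustration, the pairwise strategy described here might look roughly like this (a sketch with hypothetical names; kappa(ki, xi, yi) is assumed to be defined for scalar inputs of the component kernels):

    # Per-pair evaluation of the tensor product kernel: product over the
    # component kernels, with one dimension per kernel.
    tensor_kappa(kernels, x, y) = prod(kappa(ki, xi, yi) for (ki, xi, yi) in zip(kernels, x, y))

    # Fill the kernel matrix pairwise over the columns of X, exploiting symmetry.
    function tensor_kernelmatrix_pairwise(kernels, X::AbstractMatrix)
        n = size(X, 2)
        K = Matrix{Float64}(undef, n, n)
        for j in 1:n, i in 1:j
            K[i, j] = tensor_kappa(kernels, view(X, :, i), view(X, :, j))
            K[j, i] = K[i, j]
        end
        return K
    end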

willtebbutt (Member)

> So you suggest implementing kernelmatrix and kerneldiagmatrix for Matrix-like inputs, where columns (or rows, depending on obsdim) with n elements represent observations of n groups that are handled by n different kernels?

Yeah, although in my notation I was thinking of D groups of inputs with D different kernels.

> Intuitively, I would say in the setting described above it's more efficient to fill the matrix by evaluating kappa(::TensorProduct, obs_i, obs_j) for each pair of observations (exploiting symmetry as well, I guess) instead of multiplying together n different matrices?

I can see your point, but it's not generally the case that every one of the D kernel matrices is most efficiently computed in this manner. For example, if one of the D groups has P-dimensional inputs and the Exponentiated Quadratic kernel, the most efficient thing is generally going to be to make a call out to Distances.jl to compute that kernel matrix and then include it in the product, since the naive elementwise implementation of the kernel matrix in that case is really bad.
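For instance, a snippet like the following (illustrative only, not package code, unit lengthscale assumed) computes the exponentiated quadratic kernel matrix of a whole P-dimensional group from a single pairwise-distance call to Distances.jl:

    using Distances

    # Columns of X are P-dimensional observations; one pairwise call replaces
    # n^2 scalar kernel evaluations.
    eq_kernelmatrix(X::AbstractMatrix) = exp.(-0.5 .* pairwise(SqEuclidean(), X; dims = 2))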

I think the high level point is that any given kernel knows best how to execute kernelmatrix and kerneldiagmatrix, so if you don't use their implementation it's quite possible that you'll be stuck with sub-optimal behaviour.

devmotion (Member Author)

I completely agree that a general implementation should just call kernelmatrix and kerneldiagmatrix of the individual kernels. But in the case of matrices as inputs, P always has to be 1, doesn't it? I got confused since we were talking about implementing this special case with its very particular structure, but at the same time the discussion of kernelmatrix and kerneldiagmatrix seemed to suggest that one should still use the more general, but in this case maybe less efficient, implementation.

willtebbutt (Member)

That's a fair point; my previous argument really only applies to P > 1.

I think there are other reasons though, including code complexity and ease of doing reverse-mode AD. The former is in the eye of the beholder, of course, and the latter is hard to assess without a decent suite of benchmarks.

devmotion (Member Author)

I ended up using mapreduce for kernelmatrix and kerneldiagmatrix. I'm not completely happy with the implementation of kernelmatrix! and kerneldiagmatrix!; somehow the use of Iterators.drop feels unsatisfying. It's basically just a more explicit version of the out-of-place implementation, in which the output array is filled with kernelmatrix! and kerneldiagmatrix! in the initial step.
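A minimal sketch of the mapreduce formulation, assuming one dimension (row) of X per component kernel, observations in columns, and the usual kernelmatrix(kernel, X::AbstractMatrix; obsdim) methods for the component kernels (the function name is hypothetical, and this is not necessarily identical to the merged code):

    # Each component kernel computes its own kernel matrix from its row of X;
    # the results are combined with an elementwise product.
    function tensorproduct_kernelmatrix(kernels, X::AbstractMatrix)
        return mapreduce((K1, K2) -> K1 .* K2, zip(kernels, eachrow(X))) do (k, x)
            kernelmatrix(k, reshape(x, 1, :); obsdim = 2)
        end
    end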

I also had to add some methods for constructing kernel matrices (and their diagonals) from vector-valued inputs (it felt silly to reshape the slices only in tensorproduct.jl; IMO this should work for other kernels as well).

willtebbutt (Member) left a comment

Quite a few readability-related comments; also it would be great if we could have some more tests, in particular for the edge case in which there's only a single kernel in the TensorProduct.

(edit: this looks great other than the above / below 🙂 )


    featuredim = feature_dim(obsdim)
    if !check_dims(X, X, featuredim, obsdim)
        throw(DimensionMismatch("Dimensions of the target array K $(size(K)) are not consistent with X $(size(X))"))
willtebbutt (Member)

Not within the 92-character limit. Please wrap the string over multiple lines.

devmotion (Member Author)

Yeah, I just copied this from kernelmatrix.jl (which is not great either) and missed that it doesn't follow the style guide.

willtebbutt (Member)

Happy for you to merge when you're happy, @devmotion. Nice work.

devmotion (Member Author)

Just want to make sure that tests pass. Otherwise I'm happy.

devmotion (Member Author)

OK, tests pass (test failures on Julia master are unrelated).

devmotion merged commit bb3e859 into JuliaGaussianProcesses:master on Apr 16, 2020.
devmotion deleted the tensor branch on April 16, 2020 at 18:11.