This is a comparison of statistical analysis software that allows doing inference with Gaussian processes. This article is written from the point of view of Bayesian statistics, which may use a terminology different from the one commonly used in kriging. The next section should clarify the mathematical/computational meaning of the information provided in the table independently of contextual terminology.
Exact: whether generic exact algorithms are implemented. These algorithms are usually appropriate only up to some thousands of datapoints.
Specialized: whether specialized exact algorithms for specific classes of problems are implemented. Supported specialized algorithms may be indicated as:
* Kronecker: algorithms for separable kernels on grid data.
* Toeplitz: algorithms for stationary kernels on uniformly spaced data.
* Block: algorithms optimized for block diagonal covariance matrices.
Approximate: whether generic or specialized approximate algorithms are implemented. Supported approximate algorithms may be indicated as:
* Sparse: algorithms based on choosing a set of "inducing points" in input space.
* Hierarchical: algorithms which approximate the covariance matrix with a hierarchical matrix.
Input
These columns are about the points on which the Gaussian process is evaluated, i.e. if the process is.
ND: whether multidimensional input is supported. If it is, multidimensional output is always possible by adding a dimension to the input, even without direct support.
Non-real: whether arbitrary non-real input is supported.
Output
These columns are about the values yielded by the process, and how they are connected to the data used in the fit.
Likelihood: whether arbitrary non-Gaussian likelihoods are supported.
Errors: whether arbitrary non-uniform correlated errors on datapoints are supported for the Gaussian likelihood. Errors may be handled manually by adding a kernel component, this column is about the possibility of manipulating them separately. Partial error support may be indicated as:
* Uncorrelated: the datapoints must be independent, but can have different distributions.
* Stationary: the datapoints can be correlated, but the covariance matrix must be a Toeplitz matrix, in particular this implies that the variances must be uniform.
Hyperparameters
These columns are about finding values of variables which enter somehow in the definition of the specific problem but that can not be inferred by the Gaussian process fit, for example parameters in the formula of the kernel.
These columns are about the possibility of fitting datapoints simultaneously to a process and to linear transformations of it.
Deriv.: whether it is possible to take an arbitrary number of derivatives up to the maximum allowed by the smoothness of the kernel, for any differentiable kernel. Example partial specifications may be the maximum derivability or implementation only for some kernels. Integrals can be obtained indirectly from derivatives.
Finite: whether finite arbitrary linear transformations are allowed on the specified datapoints.
Sum: whether it is possible to sum various kernels and access separately the processes corresponding to each addend. It is a particular case of finite linear transformation but it is listed separately because it is a common feature.