Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs
Research output: Contribution to journal › Journal article › Research
Standard
Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs. / Hoffmann, Adam Gorm; Ekstrøm, Claus Thorn; Jensen, Andreas Kryger.
In: arXiv, 2024.Research output: Contribution to journal › Journal article › Research
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs
AU - Hoffmann, Adam Gorm
AU - Ekstrøm, Claus Thorn
AU - Jensen, Andreas Kryger
N1 - 48 pages, 3 figures
PY - 2024
Y1 - 2024
N2 - Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.
AB - Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.
KW - stat.ME
KW - stat.CO
KW - 62F15, 60G15, 62G08
KW - G.3
U2 - 10.48550/arXiv.2406.13691
DO - 10.48550/arXiv.2406.13691
M3 - Journal article
JO - arXiv
JF - arXiv
SN - 2331-8422
ER -
ID: 398795189