Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs

Research output: Contribution to journalJournal articleResearch

Standard

Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs. / Hoffmann, Adam Gorm; Ekstrøm, Claus Thorn; Jensen, Andreas Kryger.

In: arXiv, 2024.

Research output: Contribution to journalJournal articleResearch

Harvard

Hoffmann, AG, Ekstrøm, CT & Jensen, AK 2024, 'Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs', arXiv. https://doi.org/10.48550/arXiv.2406.13691

APA

Hoffmann, A. G., Ekstrøm, C. T., & Jensen, A. K. (2024). Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs. arXiv. https://doi.org/10.48550/arXiv.2406.13691

Vancouver

Hoffmann AG, Ekstrøm CT, Jensen AK. Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs. arXiv. 2024. https://doi.org/10.48550/arXiv.2406.13691

Author

Hoffmann, Adam Gorm ; Ekstrøm, Claus Thorn ; Jensen, Andreas Kryger. / Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs. In: arXiv. 2024.

Bibtex

@article{cea11439813a4829b88f1fad9a0cc9a3,
title = "Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs",
abstract = "Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.",
keywords = "stat.ME, stat.CO, 62F15, 60G15, 62G08, G.3",
author = "Hoffmann, {Adam Gorm} and Ekstr{\o}m, {Claus Thorn} and Jensen, {Andreas Kryger}",
note = "48 pages, 3 figures",
year = "2024",
doi = "10.48550/arXiv.2406.13691",
language = "English",
journal = "arXiv",
issn = "2331-8422",

}

RIS

TY - JOUR

T1 - Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs

AU - Hoffmann, Adam Gorm

AU - Ekstrøm, Claus Thorn

AU - Jensen, Andreas Kryger

N1 - 48 pages, 3 figures

PY - 2024

Y1 - 2024

N2 - Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.

AB - Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.

KW - stat.ME

KW - stat.CO

KW - 62F15, 60G15, 62G08

KW - G.3

U2 - 10.48550/arXiv.2406.13691

DO - 10.48550/arXiv.2406.13691

M3 - Journal article

JO - arXiv

JF - arXiv

SN - 2331-8422

ER -

ID: 398795189