Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank
Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Standard
Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank. / Li, Ruilin; Chang, Christopher; Justesen, Johanne M.; Tanigawa, Yosuke; Qiang, Junyang; Hastie, Trevor; Rivas, Manuel A; Tibshirani, Robert.
I: Biostatistics, 2020.Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank
AU - Li, Ruilin
AU - Chang, Christopher
AU - Justesen, Johanne M.
AU - Tanigawa, Yosuke
AU - Qiang, Junyang
AU - Hastie, Trevor
AU - Rivas, Manuel A
AU - Tibshirani, Robert
N1 - © The Author 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
PY - 2020
Y1 - 2020
N2 - We develop a scalable and highly efficient algorithm to fit a Cox proportional hazard model by maximizing the $L^1$-regularized (Lasso) partial likelihood function, based on the Batch Screening Iterative Lasso (BASIL) method developed in Qian and others (2019). Our algorithm is particularly suitable for large-scale and high-dimensional data that do not fit in the memory. The output of our algorithm is the full Lasso path, the parameter estimates at all predefined regularization parameters, as well as their validation accuracy measured using the concordance index (C-index) or the validation deviance. To demonstrate the effectiveness of our algorithm, we analyze a large genotype-survival time dataset across 306 disease outcomes from the UK Biobank (Sudlow and others, 2015). We provide a publicly available implementation of the proposed approach for genetics data on top of the PLINK2 package and name it snpnet-Cox.
AB - We develop a scalable and highly efficient algorithm to fit a Cox proportional hazard model by maximizing the $L^1$-regularized (Lasso) partial likelihood function, based on the Batch Screening Iterative Lasso (BASIL) method developed in Qian and others (2019). Our algorithm is particularly suitable for large-scale and high-dimensional data that do not fit in the memory. The output of our algorithm is the full Lasso path, the parameter estimates at all predefined regularization parameters, as well as their validation accuracy measured using the concordance index (C-index) or the validation deviance. To demonstrate the effectiveness of our algorithm, we analyze a large genotype-survival time dataset across 306 disease outcomes from the UK Biobank (Sudlow and others, 2015). We provide a publicly available implementation of the proposed approach for genetics data on top of the PLINK2 package and name it snpnet-Cox.
U2 - 10.1093/biostatistics/kxaa038
DO - 10.1093/biostatistics/kxaa038
M3 - Journal article
C2 - 32989444
JO - Biostatistics
JF - Biostatistics
SN - 1465-4644
ER -
ID: 249810844