The additive hazards model with high-dimensional regressors

Research output: Contribution to journalJournal articleResearchpeer-review

This paper considers estimation and prediction in the Aalen additive hazards model in the case where the covariate vector is high-dimensional such as gene expression measurements. Some form of dimension reduction of the covariate space is needed to obtain useful statistical analyses. We study the partial least squares regression method. It turns out that it is naturally adapted to this setting via the so-called Krylov sequence. The resulting PLS estimator is shown to be consistent provided that the number of terms included is taken to be equal to the number of relevant components in the regression model. A standard PLS algorithm can also be constructed, but it turns out that the resulting predictor can only be related to the original covariates via time-dependent coefficients. The methods are applied to a breast cancer data set with gene expression recordings and to the well known primary biliary cirrhosis clinical data.
Original languageEnglish
JournalLifetime Data Analysis
Volume15
Issue number3
Pages (from-to)330-342
Number of pages12
ISSN1380-7870
DOIs
Publication statusPublished - 2009

Bibliographical note

Keywords: Algorithms; Breast Neoplasms; Data Interpretation, Statistical; Female; Gene Expression Profiling; Humans; Kaplan-Meiers Estimate; Least-Squares Analysis; Liver Cirrhosis, Biliary; Oligonucleotide Array Sequence Analysis; Proportional Hazards Models; Regression Analysis

ID: 14828772