Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Power in the phenotypic extremes : a simulation study of power in discovery and replication of rare variants. / Guey, Lin T; Kravic, Jasmina; Melander, Olle; Burtt, Noël P; Laramie, Jason M; Lyssenko, Valeriya; Jonsson, Anna Elisabet; Lindholm, Eero; Tuomi, Tiinamaija; Isomaa, Bo; Nilsson, Peter; Almgren, Peter; Kathiresan, Sekar; Groop, Leif; Seymour, Albert B; Altshuler, David; Voight, Benjamin F.

I: Genetic Epidemiology, 09.02.2011.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Guey, LT, Kravic, J, Melander, O, Burtt, NP, Laramie, JM, Lyssenko, V, Jonsson, AE, Lindholm, E, Tuomi, T, Isomaa, B, Nilsson, P, Almgren, P, Kathiresan, S, Groop, L, Seymour, AB, Altshuler, D & Voight, BF 2011, 'Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants', Genetic Epidemiology. https://doi.org/10.1002/gepi.20572

APA

Guey, L. T., Kravic, J., Melander, O., Burtt, N. P., Laramie, J. M., Lyssenko, V., Jonsson, A. E., Lindholm, E., Tuomi, T., Isomaa, B., Nilsson, P., Almgren, P., Kathiresan, S., Groop, L., Seymour, A. B., Altshuler, D., & Voight, B. F. (2011). Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants. Genetic Epidemiology. https://doi.org/10.1002/gepi.20572

Vancouver

Guey LT, Kravic J, Melander O, Burtt NP, Laramie JM, Lyssenko V o.a. Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants. Genetic Epidemiology. 2011 feb. 9. https://doi.org/10.1002/gepi.20572

Author

Guey, Lin T ; Kravic, Jasmina ; Melander, Olle ; Burtt, Noël P ; Laramie, Jason M ; Lyssenko, Valeriya ; Jonsson, Anna Elisabet ; Lindholm, Eero ; Tuomi, Tiinamaija ; Isomaa, Bo ; Nilsson, Peter ; Almgren, Peter ; Kathiresan, Sekar ; Groop, Leif ; Seymour, Albert B ; Altshuler, David ; Voight, Benjamin F. / Power in the phenotypic extremes : a simulation study of power in discovery and replication of rare variants. I: Genetic Epidemiology. 2011.

Bibtex

@article{9d5ec3f3de14421a98997c8c988ea13e,
title = "Power in the phenotypic extremes: a simulation study of power in discovery and replication of rare variants",
abstract = "Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined in independent samples. We evaluate three parameters: first, the gain in statistical power due to extreme sampling to discover causal variants; second, the informativeness of initial (Phase I) association statistics to select genes/variants for follow-up; third, the impact of extreme and random sampling in (Phase 2) replication. We present a quantitative method to select individuals from the phenotypic extremes of a binary trait, and simulate disease association studies under a variety of sample sizes and sampling schemes. First, we find that while studies sampling from extremes have excellent power to discover rare variants, they have limited power to associate them to phenotype-suggesting high false-negative rates for upcoming studies. Second, consistent with previous studies, we find that the effect sizes estimated in these studies are expected to be systematically larger compared with the overall population effect size; in a well-cited lipids study, we estimate the reported effect to be twofold larger. Third, replication studies require large samples from the general population to have sufficient power; extreme sampling could reduce the required sample size as much as fourfold. Our observations offer practical guidance for the design and interpretation of studies that utilize extreme sampling. Genet. Epidemiol. 2011.  {\textcopyright} 2011 Wiley-Liss, Inc.",
author = "Guey, {Lin T} and Jasmina Kravic and Olle Melander and Burtt, {No{\"e}l P} and Laramie, {Jason M} and Valeriya Lyssenko and Jonsson, {Anna Elisabet} and Eero Lindholm and Tiinamaija Tuomi and Bo Isomaa and Peter Nilsson and Peter Almgren and Sekar Kathiresan and Leif Groop and Seymour, {Albert B} and David Altshuler and Voight, {Benjamin F}",
note = "{\textcopyright} 2011 Wiley-Liss, Inc.",
year = "2011",
month = feb,
day = "9",
doi = "10.1002/gepi.20572",
language = "English",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "JohnWiley & Sons, Inc.",

}

RIS

TY - JOUR

T1 - Power in the phenotypic extremes

T2 - a simulation study of power in discovery and replication of rare variants

AU - Guey, Lin T

AU - Kravic, Jasmina

AU - Melander, Olle

AU - Burtt, Noël P

AU - Laramie, Jason M

AU - Lyssenko, Valeriya

AU - Jonsson, Anna Elisabet

AU - Lindholm, Eero

AU - Tuomi, Tiinamaija

AU - Isomaa, Bo

AU - Nilsson, Peter

AU - Almgren, Peter

AU - Kathiresan, Sekar

AU - Groop, Leif

AU - Seymour, Albert B

AU - Altshuler, David

AU - Voight, Benjamin F

N1 - © 2011 Wiley-Liss, Inc.

PY - 2011/2/9

Y1 - 2011/2/9

N2 - Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined in independent samples. We evaluate three parameters: first, the gain in statistical power due to extreme sampling to discover causal variants; second, the informativeness of initial (Phase I) association statistics to select genes/variants for follow-up; third, the impact of extreme and random sampling in (Phase 2) replication. We present a quantitative method to select individuals from the phenotypic extremes of a binary trait, and simulate disease association studies under a variety of sample sizes and sampling schemes. First, we find that while studies sampling from extremes have excellent power to discover rare variants, they have limited power to associate them to phenotype-suggesting high false-negative rates for upcoming studies. Second, consistent with previous studies, we find that the effect sizes estimated in these studies are expected to be systematically larger compared with the overall population effect size; in a well-cited lipids study, we estimate the reported effect to be twofold larger. Third, replication studies require large samples from the general population to have sufficient power; extreme sampling could reduce the required sample size as much as fourfold. Our observations offer practical guidance for the design and interpretation of studies that utilize extreme sampling. Genet. Epidemiol. 2011.  © 2011 Wiley-Liss, Inc.

AB - Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined in independent samples. We evaluate three parameters: first, the gain in statistical power due to extreme sampling to discover causal variants; second, the informativeness of initial (Phase I) association statistics to select genes/variants for follow-up; third, the impact of extreme and random sampling in (Phase 2) replication. We present a quantitative method to select individuals from the phenotypic extremes of a binary trait, and simulate disease association studies under a variety of sample sizes and sampling schemes. First, we find that while studies sampling from extremes have excellent power to discover rare variants, they have limited power to associate them to phenotype-suggesting high false-negative rates for upcoming studies. Second, consistent with previous studies, we find that the effect sizes estimated in these studies are expected to be systematically larger compared with the overall population effect size; in a well-cited lipids study, we estimate the reported effect to be twofold larger. Third, replication studies require large samples from the general population to have sufficient power; extreme sampling could reduce the required sample size as much as fourfold. Our observations offer practical guidance for the design and interpretation of studies that utilize extreme sampling. Genet. Epidemiol. 2011.  © 2011 Wiley-Liss, Inc.

U2 - 10.1002/gepi.20572

DO - 10.1002/gepi.20572

M3 - Journal article

C2 - 21308769

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

ER -

ID: 46404312