SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method. / Andersen, Mikkel Meyer; Christiansen, Steffan Noe; Andersen, Jeppe Dyrberg; Eriksen, Svante; Morling, Niels.

I: Scientific Reports, Bind 12, 1713, 2022.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Andersen, MM, Christiansen, SN, Andersen, JD, Eriksen, S & Morling, N 2022, 'SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method', Scientific Reports, bind 12, 1713. https://doi.org/10.1038/s41598-022-22162-8

APA

Andersen, M. M., Christiansen, S. N., Andersen, J. D., Eriksen, S., & Morling, N. (2022). SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method. Scientific Reports, 12, [1713]. https://doi.org/10.1038/s41598-022-22162-8

Vancouver

Andersen MM, Christiansen SN, Andersen JD, Eriksen S, Morling N. SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method. Scientific Reports. 2022;12. 1713. https://doi.org/10.1038/s41598-022-22162-8

Author

Andersen, Mikkel Meyer ; Christiansen, Steffan Noe ; Andersen, Jeppe Dyrberg ; Eriksen, Svante ; Morling, Niels. / SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method. I: Scientific Reports. 2022 ; Bind 12.

Bibtex

@article{ee3a4f12eb214354a593bd07b6161936,
title = "SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method",
abstract = "We introduce a within-sample SNP calling method, called the “butterfly method”, that improves the quality of SNP calling with the Illumina Infinium Omni5-4 SNP Kit. This was done by improving how no-calls are determined from allele signal intensities. High confidence of SNP allele calling is extremely important in forensic genetics and clinical diagnostics. This paper is accompanied by two open-source R packages, omni54manifest and snpbeadchip that make SNP calling easy by helping with bookkeeping and giving easy access to meta-information about the SNPs typed with the Illumina Infinium Omni5-4 Kit (including chromosome, probe type, and SNP bases). We compared the results from our method with those obtained with the Illumina GenomeStudio software (which does not provide sample and SNP specific genotype probabilities or other quality measures), and with whole-genome sequencing (WGS). Given the signal intensities, the SNP calling quality was optimised using a threshold for the a posteriori probability of a SNP belonging to a SNP cluster. By lowering the a posteriori probability threshold for no-calls, we obtained a higher call rate than GenomeStudio. Using a higher a posteriori probability threshold, we achieved a higher concordance with the WGS data than GenomeStudio. Our method had SNP call and concordance rates with WGS data of approximately 99%.",
author = "Andersen, {Mikkel Meyer} and Christiansen, {Steffan Noe} and Andersen, {Jeppe Dyrberg} and Svante Eriksen and Niels Morling",
year = "2022",
doi = "10.1038/s41598-022-22162-8",
language = "English",
volume = "12",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "nature publishing group",

}

RIS

TY - JOUR

T1 - SNP allele calling of Illumina Infinium Omni5-4 data using the butterfly method

AU - Andersen, Mikkel Meyer

AU - Christiansen, Steffan Noe

AU - Andersen, Jeppe Dyrberg

AU - Eriksen, Svante

AU - Morling, Niels

PY - 2022

Y1 - 2022

N2 - We introduce a within-sample SNP calling method, called the “butterfly method”, that improves the quality of SNP calling with the Illumina Infinium Omni5-4 SNP Kit. This was done by improving how no-calls are determined from allele signal intensities. High confidence of SNP allele calling is extremely important in forensic genetics and clinical diagnostics. This paper is accompanied by two open-source R packages, omni54manifest and snpbeadchip that make SNP calling easy by helping with bookkeeping and giving easy access to meta-information about the SNPs typed with the Illumina Infinium Omni5-4 Kit (including chromosome, probe type, and SNP bases). We compared the results from our method with those obtained with the Illumina GenomeStudio software (which does not provide sample and SNP specific genotype probabilities or other quality measures), and with whole-genome sequencing (WGS). Given the signal intensities, the SNP calling quality was optimised using a threshold for the a posteriori probability of a SNP belonging to a SNP cluster. By lowering the a posteriori probability threshold for no-calls, we obtained a higher call rate than GenomeStudio. Using a higher a posteriori probability threshold, we achieved a higher concordance with the WGS data than GenomeStudio. Our method had SNP call and concordance rates with WGS data of approximately 99%.

AB - We introduce a within-sample SNP calling method, called the “butterfly method”, that improves the quality of SNP calling with the Illumina Infinium Omni5-4 SNP Kit. This was done by improving how no-calls are determined from allele signal intensities. High confidence of SNP allele calling is extremely important in forensic genetics and clinical diagnostics. This paper is accompanied by two open-source R packages, omni54manifest and snpbeadchip that make SNP calling easy by helping with bookkeeping and giving easy access to meta-information about the SNPs typed with the Illumina Infinium Omni5-4 Kit (including chromosome, probe type, and SNP bases). We compared the results from our method with those obtained with the Illumina GenomeStudio software (which does not provide sample and SNP specific genotype probabilities or other quality measures), and with whole-genome sequencing (WGS). Given the signal intensities, the SNP calling quality was optimised using a threshold for the a posteriori probability of a SNP belonging to a SNP cluster. By lowering the a posteriori probability threshold for no-calls, we obtained a higher call rate than GenomeStudio. Using a higher a posteriori probability threshold, we achieved a higher concordance with the WGS data than GenomeStudio. Our method had SNP call and concordance rates with WGS data of approximately 99%.

U2 - 10.1038/s41598-022-22162-8

DO - 10.1038/s41598-022-22162-8

M3 - Journal article

C2 - 36224332

VL - 12

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

M1 - 1713

ER -

ID: 322785613