ngsLCA — A toolkit for fast and flexible lowest common ancestor inference and taxonomic profiling of metagenomic data

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 844 KB, PDF-dokument

Metagenomic data generated from environmental samples is increasingly common in the analysis of modern and ancient biological communities. To obtain taxonomic profiles from this type of data, DNA sequences are aligned against large genomic reference databases and the lowest common ancestor (LCA) needs to be inferred for each sequence with multiple alignments. To date, efforts have mainly focused on improving the speed, sensitivity and specificity of alignment tools, and little effort has been applied to the LCA algorithm that generates the taxonomic profiles from alignments. We present ngsLCA, a command-line toolkit with two separate modules: the main program (in C/C++) performing LCA inference, and an R package for generating tables and visualisations of the taxonomic profiles. ngsLCA processed large datasets in BAM/SAM alignment format 4–11 times faster and used less memory compared to other available programs. It is compatible with the NCBI taxonomy and has flexible parameter settings. Furthermore, the toolkit offers functions for filtering, contamination removal, taxonomic clustering, and multiple ways of visualising the generated taxonomic profiles. ngsLCA bridges a gap in current metagenomic analyses by supplying a computationally light, easy-to-use, accurate, fast and flexible LCA algorithm with R functions for processing and illustrating the taxonomic profiles.

OriginalsprogEngelsk
TidsskriftMethods in Ecology and Evolution
Vol/bind13
Udgave nummer12
Sider (fra-til)2699-2708
Antal sider10
ISSN2041-210X
DOI
StatusUdgivet - 2022

Bibliografisk note

Funding Information:
Y.W., M.W.P. and T.S.K. were funded by the Carlsberg Foundation (CF16‐0728, CF16‐0913, CF18‐0024, and CF19‐0712). L.H. was supported by the Natural Environmental Research Council (NE/L002531/1). We thank Eske Willerslev and Antonio Fernandez‐Guerra for the discussions and inputs on this project.

Publisher Copyright:
© 2022 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society.

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk


Ingen data tilgængelig

ID: 323855167