AlphaPept: a modern and open framework for MS-based proteomics
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
AlphaPept : a modern and open framework for MS-based proteomics. / Strauss, Maximilian T; Bludau, Isabell; Zeng, Wen-Feng; Voytik, Eugenia; Ammar, Constantin; Schessner, Julia P; Ilango, Rajesh; Gill, Michelle; Meier, Florian; Willems, Sander; Mann, Matthias.
In: Nature Communications, Vol. 15, No. 1, 09.03.2024, p. 2168.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - AlphaPept
T2 - a modern and open framework for MS-based proteomics
AU - Strauss, Maximilian T
AU - Bludau, Isabell
AU - Zeng, Wen-Feng
AU - Voytik, Eugenia
AU - Ammar, Constantin
AU - Schessner, Julia P
AU - Ilango, Rajesh
AU - Gill, Michelle
AU - Meier, Florian
AU - Willems, Sander
AU - Mann, Matthias
N1 - © 2024. The Author(s).
PY - 2024/3/9
Y1 - 2024/3/9
N2 - In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
AB - In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
KW - Software
KW - Proteomics/methods
KW - Mass Spectrometry/methods
KW - Proteome
U2 - 10.1038/s41467-024-46485-4
DO - 10.1038/s41467-024-46485-4
M3 - Journal article
C2 - 38461149
VL - 15
SP - 2168
JO - Nature Communications
JF - Nature Communications
SN - 2041-1723
IS - 1
ER -
ID: 397722269