spillR: spillover compensation in mass cytometry data

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

spillR : spillover compensation in mass cytometry data. / Guazzini, Marco; Reisach, Alexander G.; Weichwald, Sebastian; Seiler, Christof.

In: Bioinformatics, Vol. 40, No. 6, btae337, 2024.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Guazzini, M, Reisach, AG, Weichwald, S & Seiler, C 2024, 'spillR: spillover compensation in mass cytometry data', Bioinformatics, vol. 40, no. 6, btae337. https://doi.org/10.1093/bioinformatics/btae337

APA

Guazzini, M., Reisach, A. G., Weichwald, S., & Seiler, C. (2024). spillR: spillover compensation in mass cytometry data. Bioinformatics, 40(6), [btae337]. https://doi.org/10.1093/bioinformatics/btae337

Vancouver

Guazzini M, Reisach AG, Weichwald S, Seiler C. spillR: spillover compensation in mass cytometry data. Bioinformatics. 2024;40(6). btae337. https://doi.org/10.1093/bioinformatics/btae337

Author

Guazzini, Marco ; Reisach, Alexander G. ; Weichwald, Sebastian ; Seiler, Christof. / spillR : spillover compensation in mass cytometry data. In: Bioinformatics. 2024 ; Vol. 40, No. 6.

Bibtex

@article{df3ee32d4d5242ab931c82097b4c9182,
title = "spillR: spillover compensation in mass cytometry data",
abstract = "Motivation: Channel interference in mass cytometry can cause spillover and may result in miscounting of protein markers. Chevrier et al. introduce an experimental and computational procedure to estimate and compensate for spillover implemented in their R package CATALYST. They assume spillover can be described by a spillover matrix that encodes the ratio between the signal in the unstained spillover receiving and stained spillover emitting channel. They estimate the spillover matrix from experiments with beads. We propose to skip the matrix estimation step and work directly with the full bead distributions. We develop a nonparametric finite mixture model and use the mixture components to estimate the probability of spillover. Spillover correction is often a pre-processing step followed by downstream analyses, and choosing a flexible model reduces the chance of introducing biases that can propagate downstream. Results: We implement our method in an R package spillR using expectation-maximization to fit the mixture model. We test our method on simulated, semi-simulated, and real data from CATALYST. We find that our method compensates low counts accurately, does not introduce negative counts, avoids overcompensating high counts, and preserves correlations between markers that may be biologically meaningful. Availability and implementation: Our new R package spillR is on bioconductor at bioconductor.org/packages/spillR. All experiments and plots can be reproduced by compiling the R markdown file spillR_paper.Rmd at github.com/ChristofSeiler/spillR_paper.",
author = "Marco Guazzini and Reisach, {Alexander G.} and Sebastian Weichwald and Christof Seiler",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024. Published by Oxford University Press.",
year = "2024",
doi = "10.1093/bioinformatics/btae337",
language = "English",
volume = "40",
journal = "Bioinformatics (Online)",
issn = "1367-4811",
publisher = "Oxford University Press",
number = "6",

}

RIS

TY - JOUR

T1 - spillR

T2 - spillover compensation in mass cytometry data

AU - Guazzini, Marco

AU - Reisach, Alexander G.

AU - Weichwald, Sebastian

AU - Seiler, Christof

N1 - Publisher Copyright: © The Author(s) 2024. Published by Oxford University Press.

PY - 2024

Y1 - 2024

N2 - Motivation: Channel interference in mass cytometry can cause spillover and may result in miscounting of protein markers. Chevrier et al. introduce an experimental and computational procedure to estimate and compensate for spillover implemented in their R package CATALYST. They assume spillover can be described by a spillover matrix that encodes the ratio between the signal in the unstained spillover receiving and stained spillover emitting channel. They estimate the spillover matrix from experiments with beads. We propose to skip the matrix estimation step and work directly with the full bead distributions. We develop a nonparametric finite mixture model and use the mixture components to estimate the probability of spillover. Spillover correction is often a pre-processing step followed by downstream analyses, and choosing a flexible model reduces the chance of introducing biases that can propagate downstream. Results: We implement our method in an R package spillR using expectation-maximization to fit the mixture model. We test our method on simulated, semi-simulated, and real data from CATALYST. We find that our method compensates low counts accurately, does not introduce negative counts, avoids overcompensating high counts, and preserves correlations between markers that may be biologically meaningful. Availability and implementation: Our new R package spillR is on bioconductor at bioconductor.org/packages/spillR. All experiments and plots can be reproduced by compiling the R markdown file spillR_paper.Rmd at github.com/ChristofSeiler/spillR_paper.

AB - Motivation: Channel interference in mass cytometry can cause spillover and may result in miscounting of protein markers. Chevrier et al. introduce an experimental and computational procedure to estimate and compensate for spillover implemented in their R package CATALYST. They assume spillover can be described by a spillover matrix that encodes the ratio between the signal in the unstained spillover receiving and stained spillover emitting channel. They estimate the spillover matrix from experiments with beads. We propose to skip the matrix estimation step and work directly with the full bead distributions. We develop a nonparametric finite mixture model and use the mixture components to estimate the probability of spillover. Spillover correction is often a pre-processing step followed by downstream analyses, and choosing a flexible model reduces the chance of introducing biases that can propagate downstream. Results: We implement our method in an R package spillR using expectation-maximization to fit the mixture model. We test our method on simulated, semi-simulated, and real data from CATALYST. We find that our method compensates low counts accurately, does not introduce negative counts, avoids overcompensating high counts, and preserves correlations between markers that may be biologically meaningful. Availability and implementation: Our new R package spillR is on bioconductor at bioconductor.org/packages/spillR. All experiments and plots can be reproduced by compiling the R markdown file spillR_paper.Rmd at github.com/ChristofSeiler/spillR_paper.

U2 - 10.1093/bioinformatics/btae337

DO - 10.1093/bioinformatics/btae337

M3 - Journal article

C2 - 38848472

AN - SCOPUS:85196779717

VL - 40

JO - Bioinformatics (Online)

JF - Bioinformatics (Online)

SN - 1367-4811

IS - 6

M1 - btae337

ER -

ID: 396980749