Predicting the impact of rare variants on RNA splicing in CAGI6

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

  • Jenny Lord
  • Carolina Jaramillo Oquendo
  • Htoo A. Wai
  • Andrew G.L. Douglas
  • David J. Bunyan
  • Yaqiong Wang
  • Zhiqiang Hu
  • Zishuo Zeng
  • Daniel Danis
  • Panagiotis Katsonis
  • Amanda Williams
  • Olivier Lichtarge
  • Yuchen Chang
  • Richard D. Bagnall
  • Stephen M. Mount
  • Brynja Matthiasardottir
  • Chiaofeng Lin
  • Raphael Leman
  • Alexandra Martins
  • Claude Houdayer
  • Sophie Krieger
  • Constantina Bakolitsa
  • Yisu Peng
  • Akash Kamandula
  • Predrag Radivojac
  • Diana Baralle

Variants which disrupt splicing are a frequent cause of rare disease that have been under-ascertained clinically. Accurate and efficient methods to predict a variant’s impact on splicing are needed to interpret the growing number of variants of unknown significance (VUS) identified by exome and genome sequencing. Here, we present the results of the CAGI6 Splicing VUS challenge, which invited predictions of the splicing impact of 56 variants ascertained clinically and functionally validated to determine splicing impact. The performance of 12 prediction methods, along with SpliceAI and CADD, was compared on the 56 functionally validated variants. The maximum accuracy achieved was 82% from two different approaches, one weighting SpliceAI scores by minor allele frequency, and one applying the recently published Splicing Prediction Pipeline (SPiP). SPiP performed optimally in terms of sensitivity, while an ensemble method combining multiple prediction tools and information from databases exceeded all others for specificity. Several challenge methods equalled or exceeded the performance of SpliceAI, with ultimate choice of prediction method likely to depend on experimental or clinical aims. One quarter of the variants were incorrectly predicted by at least 50% of the methods, highlighting the need for further improvements to splicing prediction methods for successful clinical application.

OriginalsprogEngelsk
TidsskriftHuman Genetics
ISSN0340-6717
DOI
StatusAccepteret/In press - 2024

Bibliografisk note

Funding Information:
We thank the CAGI organisers for their commitment to improving variant interpretation and for making this challenge happen. The CAGI experiment is supported by NIH U24 HG007346. We acknowledge the NIHR Clinical Research Network (CRN) in recruiting the participants and the Musketeers Memorandum, as well as support from the NIHR UK Rare Genetic Disease Consortium. The authors acknowledge the use of the IRIDIS High Performance Computing Facility, and associated support services at the University of Southampton, in the completion of this work. For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

Funding Information:
The Baralle Lab is supported by the NIHR Research Professorship awarded to DB (RP-2016-07-011). JL is supported by an Anniversary Fellowship from the University of Southampton. Some of the functional validations of variants were funded by a Wessex Medical Research Innovation Grant awarded to JL. RDB is supported by a New South Wales Health Cardiovascular Disease Senior Scientist Grant.

Publisher Copyright:
© 2024, The Author(s).

ID: 382438193