Standardized evaluation of algorithms for computer-aided diagnosis of dementia based on structural MRI

Standardized evaluation of algorithms for computer-aided diagnosis of dementia based on structural MRI: The CADDementia challenge

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt

Esther E. Bron
Marion Smits
Wiesje M. van der Flier
Hugo Vrenken
Frederik Barkhof
Philip Scheltens
Janne M. Papma
Rebecca M.E. Steketee
Carolina Méndez Orellana
Rozanna Meijboom
Madalena Pinto
Joana R. Meireles
Carolina Garrett
António J. Bastos-Leite
Ahmed Abdulkadir
Olaf Ronneberger
Nicola Amoroso
Roberto Bellotti
David Cárdenas-Peña
Andrés M. Álvarez-Meza
Chester V. Dolph
Khan M. Iftekharuddin
Simon Fristed Eskildsen
Pierrick Coupé
Vladimir S. Fonov
Katja Franke
Christian Gaser
Christian Ledig
Ricardo Guerrero
Tong Tong
Katherine R. Gray
Elaheh Moradi
Jussi Tohka
Alexandre Routier
Stanley Durrleman
Alessia Sarica
Giuseppe Di Fatta
Francesco Sensi
Andrea Chincarini
Garry M. Smith
Zhivko V. Stoyanov
Lauge Emil Borch Laurs Sørensen
Sabina Tangaro
Paolo Inglese
Christian Wachinger
Martin Reuter
John C. van Swieten
Wiro J. Niessen
Stefan Klein

Abstract Algorithms for computer-aided diagnosis of dementia based on structural MRI have demonstrated high performance in the literature, but are difficult to compare as different data sets and methodology were used for evaluation. In addition, it is unclear how the algorithms would perform on previously unseen data, and thus, how they would perform in clinical practice when there is no real opportunity to adapt the algorithm to the data at hand. To address these comparability, generalizability and clinical applicability issues, we organized a grand challenge that aimed to objectively compare algorithms based on a clinically representative multi-center data set. Using clinical practice as the starting point, the goal was to reproduce the clinical diagnosis. Therefore, we evaluated algorithms for multi-class classification of three diagnostic groups: patients with probable Alzheimer's disease, patients with mild cognitive impairment and healthy controls. The diagnosis based on clinical criteria was used as reference standard, as it was the best available reference despite its known limitations. For evaluation, a previously unseen test set was used consisting of 354 T1-weighted MRI scans with the diagnoses blinded. Fifteen research teams participated with a total of 29 algorithms. The algorithms were trained on a small training set (n = 30) and optionally on data from other sources (e.g., the Alzheimer's Disease Neuroimaging Initiative, the Australian Imaging Biomarkers and Lifestyle flagship study of aging). The best performing algorithm yielded an accuracy of 63.0% and an area under the receiver-operating-characteristic curve (AUC) of 78.8%. In general, the best performances were achieved using feature extraction based on voxel-based morphometry or a combination of features that included volume, cortical thickness, shape and intensity. The challenge is open for new submissions via the web-based framework: http://caddementia.grand-challenge.org.

Originalsprog	Engelsk
Tidsskrift	NeuroImage
Vol/bind	111
Sider (fra-til)	562-579
Antal sider	18
ISSN	1053-8119
DOI	https://doi.org/10.1016/j.neuroimage.2015.01.048
Status	Udgivet - 2015

Forskningsområder

Alzheimer's disease, Challenge, Classification, Computer-aided diagnosis, Mild cognitive impairment, Structural MRI

ID: 138173157