An algorithm competition for automatic species identification from herbarium specimens
Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Standard
An algorithm competition for automatic species identification from herbarium specimens. / Little, Damon P.; Tulig, Melissa; Tan, Kiat Chuan; Liu, Yulong; Belongie, Serge; Kaeser-Chen, Christine; Michelangeli, Fabián A.; Panesar, Kiran; Guha, R. V.; Ambrose, Barbara A.
I: Applications in Plant Sciences, Bind 8, Nr. 6, e11365, 01.06.2020.Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › fagfællebedømt
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - An algorithm competition for automatic species identification from herbarium specimens
AU - Little, Damon P.
AU - Tulig, Melissa
AU - Tan, Kiat Chuan
AU - Liu, Yulong
AU - Belongie, Serge
AU - Kaeser-Chen, Christine
AU - Michelangeli, Fabián A.
AU - Panesar, Kiran
AU - Guha, R. V.
AU - Ambrose, Barbara A.
N1 - Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA‐1444192, DEB‐1343612 and DEB‐0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA-1444192, DEB-1343612 and DEB-0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Publisher Copyright: © 2020 Little et al. Applications in Plant Sciences is published by Wiley Periodicals, LLC on behalf of the Botanical Society of America
PY - 2020/6/1
Y1 - 2020/6/1
N2 - Premise: Plant biodiversity is threatened, yet many species remain undescribed. It is estimated that >50% of undescribed species have already been collected and are awaiting discovery in herbaria. Robust automatic species identification algorithms using machine learning could accelerate species discovery. Methods: To encourage the development of an automatic species identification algorithm, we submitted our Herbarium 2019 data set to the Fine-Grained Visual Categorization sub-competition (FGVC6) hosted on the Kaggle platform. We chose to focus on the flowering plant family Melastomataceae because we have a large collection of imaged herbarium specimens (46,469 specimens representing 683 species) and taxonomic expertise in the family. As is common for herbarium collections, some species in this data set are represented by few specimens and others by many. Results: In less than three months, the FGVC6 Herbarium 2019 Challenge drew 22 teams who entered 254 models for Melastomataceae species identification. The four best algorithms identified species with >88% accuracy. Discussion: The FGVC competitions provide a unique opportunity for computer vision and machine learning experts to address difficult species-recognition problems. The Herbarium 2019 Challenge brought together a novel combination of collections resources, taxonomic expertise, and collaboration between botanists and computer scientists.
AB - Premise: Plant biodiversity is threatened, yet many species remain undescribed. It is estimated that >50% of undescribed species have already been collected and are awaiting discovery in herbaria. Robust automatic species identification algorithms using machine learning could accelerate species discovery. Methods: To encourage the development of an automatic species identification algorithm, we submitted our Herbarium 2019 data set to the Fine-Grained Visual Categorization sub-competition (FGVC6) hosted on the Kaggle platform. We chose to focus on the flowering plant family Melastomataceae because we have a large collection of imaged herbarium specimens (46,469 specimens representing 683 species) and taxonomic expertise in the family. As is common for herbarium collections, some species in this data set are represented by few specimens and others by many. Results: In less than three months, the FGVC6 Herbarium 2019 Challenge drew 22 teams who entered 254 models for Melastomataceae species identification. The four best algorithms identified species with >88% accuracy. Discussion: The FGVC competitions provide a unique opportunity for computer vision and machine learning experts to address difficult species-recognition problems. The Herbarium 2019 Challenge brought together a novel combination of collections resources, taxonomic expertise, and collaboration between botanists and computer scientists.
KW - artificial intelligence
KW - computer vision
KW - FGVC
KW - herbarium specimen
KW - Kaggle
KW - machine learning
KW - Melastomataceae
UR - http://www.scopus.com/inward/record.url?scp=85087175472&partnerID=8YFLogxK
U2 - 10.1002/aps3.11365
DO - 10.1002/aps3.11365
M3 - Journal article
AN - SCOPUS:85087175472
VL - 8
JO - Applications in Plant Sciences
JF - Applications in Plant Sciences
SN - 2168-0450
IS - 6
M1 - e11365
ER -
ID: 301822923