The Plant Pathology Challenge 2020 data set to classify foliar disease of apples

Research output: Contribution to journal › Journal article › Research › peer-review

Standard

The Plant Pathology Challenge 2020 data set to classify foliar disease of apples. / Thapa, Ranjita; Zhang, Kai; Snavely, Noah; Belongie, Serge; Khan, Awais.

In: Applications in Plant Sciences, Vol. 8, No. 9, e11390, 01.09.2020.

Research output: Contribution to journal › Journal article › Research › peer-review

Harvard

Thapa, R, Zhang, K, Snavely, N, Belongie, S & Khan, A 2020, 'The Plant Pathology Challenge 2020 data set to classify foliar disease of apples', Applications in Plant Sciences, vol. 8, no. 9, e11390. https://doi.org/10.1002/aps3.11390

APA

Thapa, R., Zhang, K., Snavely, N., Belongie, S., & Khan, A. (2020). The Plant Pathology Challenge 2020 data set to classify foliar disease of apples. Applications in Plant Sciences, 8(9), [e11390]. https://doi.org/10.1002/aps3.11390

Vancouver

Thapa R, Zhang K, Snavely N, Belongie S, Khan A. The Plant Pathology Challenge 2020 data set to classify foliar disease of apples. Applications in Plant Sciences. 2020 Sep 1;8(9). e11390. https://doi.org/10.1002/aps3.11390

Author

Thapa, Ranjita ; Zhang, Kai ; Snavely, Noah ; Belongie, Serge ; Khan, Awais. / The Plant Pathology Challenge 2020 data set to classify foliar disease of apples. In: Applications in Plant Sciences. 2020 ; Vol. 8, No. 9.

Bibtex

@article{a6fda7c157e4406aa3ac9f8dc79745a7,

title = "The Plant Pathology Challenge 2020 data set to classify foliar disease of apples",

abstract = "Premise: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemicals, with increased production costs and increased environmental and health impacts. Methods and Results: We have manually captured 3651 high-quality, real-life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. A subset of images, expert-annotated to create a pilot data set for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for the Plant Pathology Challenge as part of the Fine-Grained Visual Categorization (FGVC) workshop at the 2020 Computer Vision and Pattern Recognition conference (CVPR 2020). Participants were asked to use the image data set to train a machine learning model to classify disease categories and develop an algorithm for disease severity quantification. The top three area under the ROC curve (AUC) values submitted to the private leaderboard were 0.98445, 0.98182, and 0.98089. We also trained an off-the-shelf convolutional neural network on this data for disease classification and achieved 97% accuracy on a held-out test set. Discussion: This data set will contribute toward development and deployment of machine learning–based automated plant disease classification algorithms to ultimately realize fast and accurate disease detection. We will continue to add images to the pilot data set for a larger, more comprehensive expert-annotated data set for future Kaggle competitions and to explore more advanced methods for disease classification and quantification.",

keywords = "apple orchards, computer vision, convolutional neural network, disease classification, machine learning",

author = "Ranjita Thapa and Kai Zhang and Noah Snavely and Serge Belongie and Awais Khan",

note = "Funding Information: Financial support was received from the Cornell Initiative for Digital Agriculture (CIDA). The authors thank Zach Guillian (summer intern at Cornell AgriTech, Geneva, New York, USA) for help with data collection. Publisher Copyright: {\textcopyright} 2020 Thapa et al. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America",

year = "2020",

month = sep,

day = "1",

doi = "10.1002/aps3.11390",

language = "English",

volume = "8",

journal = "Applications in Plant Sciences",

issn = "2168-0450",

publisher = "Botanical Society of America",

number = "9",

}

RIS

TY - JOUR

T1 - The Plant Pathology Challenge 2020 data set to classify foliar disease of apples

AU - Thapa, Ranjita

AU - Zhang, Kai

AU - Snavely, Noah

AU - Belongie, Serge

AU - Khan, Awais

N1 - Funding Information: Financial support was received from the Cornell Initiative for Digital Agriculture (CIDA). The authors thank Zach Guillian (summer intern at Cornell AgriTech, Geneva, New York, USA) for help with data collection. Publisher Copyright: © 2020 Thapa et al. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America

PY - 2020/9/1

Y1 - 2020/9/1

N2 - Premise: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemicals, with increased production costs and increased environmental and health impacts. Methods and Results: We have manually captured 3651 high-quality, real-life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. A subset of images, expert-annotated to create a pilot data set for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for the Plant Pathology Challenge as part of the Fine-Grained Visual Categorization (FGVC) workshop at the 2020 Computer Vision and Pattern Recognition conference (CVPR 2020). Participants were asked to use the image data set to train a machine learning model to classify disease categories and develop an algorithm for disease severity quantification. The top three area under the ROC curve (AUC) values submitted to the private leaderboard were 0.98445, 0.98182, and 0.98089. We also trained an off-the-shelf convolutional neural network on this data for disease classification and achieved 97% accuracy on a held-out test set. Discussion: This data set will contribute toward development and deployment of machine learning–based automated plant disease classification algorithms to ultimately realize fast and accurate disease detection. We will continue to add images to the pilot data set for a larger, more comprehensive expert-annotated data set for future Kaggle competitions and to explore more advanced methods for disease classification and quantification.

AB - Premise: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemicals, with increased production costs and increased environmental and health impacts. Methods and Results: We have manually captured 3651 high-quality, real-life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. A subset of images, expert-annotated to create a pilot data set for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for the Plant Pathology Challenge as part of the Fine-Grained Visual Categorization (FGVC) workshop at the 2020 Computer Vision and Pattern Recognition conference (CVPR 2020). Participants were asked to use the image data set to train a machine learning model to classify disease categories and develop an algorithm for disease severity quantification. The top three area under the ROC curve (AUC) values submitted to the private leaderboard were 0.98445, 0.98182, and 0.98089. We also trained an off-the-shelf convolutional neural network on this data for disease classification and achieved 97% accuracy on a held-out test set. Discussion: This data set will contribute toward development and deployment of machine learning–based automated plant disease classification algorithms to ultimately realize fast and accurate disease detection. We will continue to add images to the pilot data set for a larger, more comprehensive expert-annotated data set for future Kaggle competitions and to explore more advanced methods for disease classification and quantification.

KW - apple orchards

KW - computer vision

KW - convolutional neural network

KW - disease classification

KW - machine learning

UR - http://www.scopus.com/inward/record.url?scp=85091611121&partnerID=8YFLogxK

U2 - 10.1002/aps3.11390

DO - 10.1002/aps3.11390

M3 - Journal article

AN - SCOPUS:85091611121

VL - 8

JO - Applications in Plant Sciences

JF - Applications in Plant Sciences

SN - 2168-0450

IS - 9

M1 - e11390

ER -

ID: 301822705

Forskning