Front. Plant Sci. Frontiers in Plant Science Front. Plant Sci. 1664-462X Frontiers Media S.A. 10.3389/fpls.2021.673505 Plant Science Original Research Wheat Spike Blast Image Classification Using Deep Convolutional Neural Networks Fernández-Campos Mariela 1 Huang Yu-Ting 2 Jahanshahi Mohammad R. 2 3 Wang Tao 4 Jin Jian 4 Telenko Darcy E. P. 1 Góngora-Canul Carlos 1 5 Cruz C. D. 1 * 1Department of Botany and Plant Pathology, Purdue University, West Lafayette, IN, United States 2Lyles School of Civil Engineering, Purdue University, West Lafayette, IN, United States 3School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, United States 4Department of Agricultural and Biological Engineering, Purdue University, West Lafayette, IN, United States 5Tecnológico Nacional de México/IT Conkal, Conkal, Yucatán, Mexico

Edited by: Michele Pisante, University of Teramo, Italy

Reviewed by: Alexandr Muterko, Russian Academy of Sciences, Russia; Pouria Sadeghi-Tehran, Rothamsted Research, United Kingdom

*Correspondence: C. D. Cruz cruz113@purdue.edu

This article was submitted to Technical Advances in Plant Science, a section of the journal Frontiers in Plant Science

17 06 2021 2021 12 673505 27 02 2021 10 05 2021 Copyright © 2021 Fernández-Campos, Huang, Jahanshahi, Wang, Jin, Telenko, Góngora-Canul and Cruz. 2021 Fernández-Campos, Huang, Jahanshahi, Wang, Jin, Telenko, Góngora-Canul and Cruz

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

Wheat blast is a threat to global wheat production, and limited blast-resistant cultivars are available. The current estimations of wheat spike blast severity rely on human assessments, but this technique could have limitations. Reliable visual disease estimations paired with Red Green Blue (RGB) images of wheat spike blast can be used to train deep convolutional neural networks (CNN) for disease severity (DS) classification. Inter-rater agreement analysis was used to measure the reliability of who collected and classified data obtained under controlled conditions. We then trained CNN models to classify wheat spike blast severity. Inter-rater agreement analysis showed high accuracy and low bias before model training. Results showed that the CNN models trained provide a promising approach to classify images in the three wheat blast severity categories. However, the models trained on non-matured and matured spikes images showing the highest precision, recall, and F1 score when classifying the images. The high classification accuracy could serve as a basis to facilitate wheat spike blast phenotyping in the future.

wheat blast convolutional neural networks inter-rater agreement severity classification plant disease phenotyping breeding deep learning controlled conditions National Institute of Food and Agriculture10.13039/100005825 Foreign Agricultural Service10.13039/100009170

香京julia种子在线播放

    1. <form id=HxFbUHhlv><nobr id=HxFbUHhlv></nobr></form>
      <address id=HxFbUHhlv><nobr id=HxFbUHhlv><nobr id=HxFbUHhlv></nobr></nobr></address>

      Introduction

      Wheat blast is an emergent disease caused by the Ascomycetous fungus Magnaporthe oryzae Triticum (MoT). MoT was first detected in Brazil in 1985, with successive spread to Bolivia, Paraguay, and Argentina (Igarashi et al., 1986; Barea and Toledo, 1996; Viedma, 2005; Cabrera and Gutiérrez, 2007; Perello et al., 2015). In 2016, a wheat blast outbreak was first reported in Bangladesh, apparently due to the unintended importation of MoT-infected South American grain (Aman, 2016; Malaker et al., 2016). Many countries in South Asia are actively monitoring wheat fields for the presence of MoT (Bhattacharya and Pal, 2017; Mottaleb et al., 2018). In 2020, MoT presence was reported in Zambia, Africa, which summates another continent to the list (Tembo et al., 2020). Cruz et al. (2016b) predicted areas at risk in the United States (southern and pacific northwest states) for MoT establishment and the threat of this pathogen to soft- and hard-red winter wheat production.

      MoT can infect leaves, stems, and seeds, although the most remarkable and studied symptoms are associated with the spike (Igarashi et al., 1986; Cruz et al., 2015; Cruz and Valent, 2017; Ceresini et al., 2019). The infection by MoT of the spike, spikelets, or rachis causes the wheat spike blast, inducing partial or complete bleaching of the spikes (Igarashi, 1986). Infection can cause shriveled grain reducing the grain quality and yield. A wide range of disease intensities can occur depending on the susceptibility of cultivars planted and the prevalent weather conditions (Goulart and Paiva, 1992).

      Warm temperatures, excessive rain, long and frequent spike wetness, and limited fungicide efficacy exacerbate the intensity of wheat blast epidemics, especially in susceptible cultivars (Goulart et al., 2007). The optimum conditions for wheat blast development include a temperature range between 25 and 30°C and spike surface wetness between 25 and 40 h (based on controlled conditions) (Cardoso et al., 2008). Under conducive field conditions, the fungus can kill up to 100% of susceptible wheat spikes in a period of 2.5–3 weeks (Gongora-Canul et al., 2020).

      Since 1985, when wheat spike blast was first detected, intense efforts have been undertaken to identify resistance (Igarashi et al., 1986; Urashima et al., 2004; Prestes et al., 2007; Cruz et al., 2016b; Ceresini et al., 2019; Cruppe et al., 2020). Recently, two new genes, Rmg8 and RmgGR119, were found to generate resistance to wheat blast (Wang et al., 2018). However, the only currently effective resistance provided by the 2NVS translocation from Aegilops ventricosa (Tausch) confers useful yet partial and environment and/or genetic background-dependent resistance to wheat blast (Cruz et al., 2016a; Valent, 2016; Cruppe et al., 2019, 2020). Obtaining tissue samples from phenotyped wheat entries and testing for the presence or absence of the 2NVS segment is relatively easy and routine (Cruz et al., 2016b; Yasuhara-Bell et al., 2018; Cruppe et al., 2019). Although there is evidence that 2NVS-based resistance may be overcome, additional sources of wheat spike blast resistance should be identified (Cruz et al., 2016b; Cruppe et al., 2019, 2020; Juliana et al., 2020). Thus, there is a continued need to find new sources of resistance to wheat blast.

      Plant disease estimations, or phytopathometry, refer to the measurement and quantification of plant disease severity (DS)or incidence that is essential when studying and analyzing diseases at organ, plant, or population levels (Large, 1966; Bock et al., 2010). Plant disease estimations by human raters are the standard method used for plant disease phenotyping. Humans are trained to perform visual disease evaluations of incidence and severity, and their reliability can be improved with experience. These estimations are helpful, but they are subjective evaluations that can introduce variability and can be time-consuming and labor-intensive (Nutter et al., 1993; Madden et al., 2007; Bock et al., 2010, 2020). Due to issues associated with an agreement in data acquisition, inter-rater agreement among other statistical tests can be used to compare the consensus or agreement between estimations of raters of DS (Nutter et al., 1993; Madden et al., 2007; Bock et al., 2010, 2020). These agreement analyses are relevant in plant pathology and plant breeding since inaccurate disease estimations can cause imprecision and unreliability leading to incorrect conclusions (Chiang et al., 2016; Singh et al., 2021).

      A bottleneck in the identification of novel sources of resistance is measuring disease intensity (i.e., plant disease phenotyping), which is considered a limiting factor in the assessment of genotype performance in plant breeding programs (Mahlein, 2015; Shakoor et al., 2017). Therefore, innovative and transformative solutions for the quantification of plant disease symptoms at the individual and host population levels are needed (Camargo and Smith, 2009; Kumar et al., 2020). Implementation of advanced computer vision and machine learning techniques could reduce the phenotyping bottleneck during breeding and enhance the understanding of genotype–phenotype relationships (Fiorani and Schurr, 2013; Kruse et al., 2014; Shakoor et al., 2017; Yang et al., 2020; Singh et al., 2021).

      Computer vision, machine learning, and deep learning methods have recently been adapted to agriculture due to increased knowledge of algorithms and model capabilities that can learn and make predictions from images Red Green Blue (RGB), multispectral, or hyperspectral (Barbedo, 2016; Kersting et al., 2016; Mahlein et al., 2018). There are two ways in which these models are trained, one is supervised learning, which depends on an annotated dataset, and another is unsupervised learning, which does not rely on annotations (Mahlein et al., 2018). The most frequently used deep learning methods are the Convolutional Neural Networks (CNN). The CNN is characterized by high-accuracy metrics for image recognition and image segmentation. Recent studies have further enhanced the scope of a deep-learning-based approach for classifying, identifying, and quantifying plant diseases (Mahlein et al., 2018; Singh et al., 2018; Barbedo, 2019).

      A variety of CNN classification models are available for plant diseases. These include models for bacterial pustule (Xanthomonas axonopodis pv. glycines), sudden death syndrome (SDS, Fusarium virguliforme), Septoria brown spot (Septoria glycines), bacterial blight (Pseudomonas savastanoi pv. glycinea), and several abiotic stresses in soybean (Ghosal et al., 2018). In tomato (Solanum lycopersicum), deep-learning models were developed with and without pre-training models with images from nine leaf tomato diseases from the website www.PlantVillage.org, obtaining better performance using pre-training models (Brahimi et al., 2018). A total of 54,306 leaf images from several crops with 26 diseases were obtained from PlantVillage.org and trained using AlexNet and GoogleLeNet pre-trained models with a leaf-segmented dataset, obtaining an accuracy of 99.35% (Mohanty et al., 2016). On wheat, an in-field automatic diagnosis system for powdery mildew (Blumeria graminis f. sp. tritici), smut (Urocystis agropyri), leaf blotch (Septoria tritici), black chaff (Xanthomonas campestris pv. undulosa), stripe rust (Puccinia striiformis f. sp. tritici), and leaf rust (Puccinia recondita f. sp. tritici) were developed using deep-learning, and multiple instances–learning techniques from the Wheat Disease Database 2017 (Lu et al., 2017). Although this database is a significant contribution to wheat disease identification based on images, aspects regarding the reliability of the labeler may be compromised (Lobet, 2017). It is appropriate that detection and quantification studies of plant disease provide evidence of (“true”) estimation agreement analysis before using the labeled images as a dataset for training deep-learning models. Currently, phenotyping of wheat spike blast DS relies on a visual estimation made by humans (Cruz et al., 2016a). We hypothesized that deep CNN models can be trained for wheat spike blast severity image classification under a controlled environment. To test this hypothesis, we focused on the following objectives:

      Evaluate the agreement in data acquisition of the human rater who collected and classified datasets.

      Develop an accurate deep CNN model to detect and classify wheat spike blast symptoms in three severity categories.

      Materials and Methods Ethics

      A written informed consent was obtained from the individual for the publication of any potentially identifiable images or data included in this article.

      Plant Cultivation and Genetic Materials

      Two experiments were conducted under controlled conditions in a growth room at the Asociación de Productores de Oleaginosas y Trigo (ANAPO) research facility in Santa Cruz de la Sierra, Bolivia. Wheat cultivars were planted in pots of 15 cm diameter, filled with vermicast:silt (3:1 [v/v]), and grown at 18−25°C, 14 h light/10 h dark photoperiod, and 50–60% relative humidity. Plants were fertilized, and insecticides were sprayed when needed. Plants were arranged in a randomized complete block design with wheat cultivars having various levels of resistance to MoT, two inoculation levels (inoculated and non-inoculated), and four replicates. Wheat cultivars with a range of sensitivity to the wheat blast were used for the experiments. Experiment one included Bobwhite and South American spring cultivars Atlax, BR-18, Motacú, Urubó, AN-120, Sossego, and San Pablo and for experiment two the cultivars included BR-18, San Pablo, Bobwhite, and Atlax (Baldelomar et al., 2015; Fernández-Campos et al., 2020).

      Inoculation

      Plants were inoculated at the growth-stage Feekes 10.5, when the spike had completely emerged, with MoT isolate 008-C (Figure 1A), according to a modified inoculation protocol previously published (Cruz et al., 2016a). A conidial suspension was adjusted to 20,000 spores/ml, and each spike received 1 ml of the spore suspension. Immediately after the spikes were sprayed with the MoT inoculum, plants were moved to a dew chamber (Figure 1B) to induce MoT infection (i.e., 24–26°C, 95–98% RH, and 14 h light photoperiod). Forty-eight hours after inoculation, plants were removed from the dew chamber and left under controlled environment room conditions [(24–26°C and relative humidity of 50–60%), until day 19 after inoculation; Figure 1B].

      Wheat blast image collection flow process: (A) Magnaphorthe oryzae pathotype Triticum inoculation, (B) After inoculation, plants were moved to the yellow dew chamber that provided optimal conditions for fungal infection for 48 h, later transfer to the black trays, (C) wheat spike imaging set up, and (D) an image was captured perpendicular to the spike.

      Data Collection, DS, and Disease Measurements

      Following phytopathometry terminology, we used the term “estimate” for visual disease estimations made by humans and the term “measurement” for estimations made by image analysis (Bock et al., 2010; Gongora-Canul et al., 2020). Visual estimate of DS was obtained by observing the disease area covered in the spike and assigned a corresponding severity value from 0 to 100%. In this study, image analysis disease measurements were achieved by manually measuring spike disease area (pixels) using RGB color threshold segmentation with the image analysis software Fiji ImageJ v.1.52a (Schindelin et al., 2012; Sibiya and Sumbwanyambe, 2019). First, the measurement of the total spike area was obtained, then the diseased area was measured. Finally, the percentage of diseased severity (DS) of the individual spike was calculated (Equation 1), where ADiseased is the proportion of the area of spike that is diseased divided by the total area of the spike ATotal (See Video 1 in Supplementary Material).

      DS= ADiseasedATotal ×100

      Visual estimations of wheat spike blast symptoms were taken seven times after inoculation in each experiment. In experiment one, visual estimations and images were collected 4, 6, 9, 12, 14, 16, and 19 days after inoculation (DAI) and in experiment two, 0, 5, 7, 10, 12, 14, and 19 DAI. Each spike side (four sides total) was visually estimated for DS by Rater 1 (a plant pathologist with experience on wheat blast, rice blast, and other diseases). Simultaneously, an image from each spike side was captured perpendicular to the spike with a distance of 50 cm approximately with a DSLR EOS 6D Canon camera (Canon Inc., Tokyo, Japan) (Figure 1D) using a photography studio set up with umbrellas, lights, and screens (Neewer 2.6 m × 3 m/8.5 ft × 10 ft Background Support System and 800 W 5,500 K Umbrellas Softbox Continuous Lighting Kit for Photo Studio Product) that helped create a uniform light and smooth environment (Figure 1C).

      DS Categories

      The total spike disease estimations of Rater 1 paired with the corresponding image were converted to a three-category scale according to the amount of severity that served to fed training and testing dataset of CNN model. The category selection was based on wheat blast results from published work conducted over the last decade (Baldelomar et al., 2015; Cruz et al., 2016b; Vales et al., 2018; Cruppe et al., 2020; Fernández-Campos et al., 2020). Category 1 (healthy spikes) was used as a baseline (i.e., negative control or fully immune). Category 2 showed 0.1–20% severity (resistant and moderately resistant/low levels of symptoms) corresponding to the selected putative population for successive trials under variable conditions (controlled environment or field). Category 3 showed 20.1–100% severity (moderately susceptible and susceptible/intermediate and high levels of symptoms) corresponding to the plant population that will not be selected for successive trials because of the high potential to be or to become susceptible to the disease studied (Figure 2).

      Examples of images per category: (A–D) healthy wheat spikes no disease (0% severity, Category 1); (E–H) spikes with moderate severity (0.1–20%, Category 2); and (I–L) spikes with high severity (20.1–100%, Category 3).

      Testing Reliability of Rater 1: Inter-rater Analysis of Wheat Spike Blast Severity Estimations

      Rater 1 played a critical role in estimating DS and classifying into categories of all the images belonging to Dataset 1 and Dataset 2 (Datasets are described in the section, generation of data sets according to wheat spike physiological changes). Therefore, an inter-rater analysis was needed to determine the reliability of visual estimations of Rater 1. Inter-rater agreement assesses the degree of agreement between two and more raters who obtain independent ratings about the characteristics of a set of subjects. Subjects of interest include people, things, or events that are rated (Madden et al., 2007; Everitt and Anders, 2010; Bock et al., 2020).

      To determine the agreement of disease estimations of Rater 1, we performed an inter-rater analysis including a second-rater, and ImageJ was used as an image analysis software baseline. Rater 2, is a plant pathologist and expert in the wheat blast. ImageJ is an image analysis software used to measure plant diseases from images.

      We used the power analysis Wilcoxon signed-rank test to determine the sample size for the inter-rater agreement studies of the two training datasets. The test consisted of DS estimations or measurements of 31 and 29 images from the CNN training Dataset 1 and training Dataset 2, respectively. From now on, the 31 images selected from Dataset 1 will be called sample Dataset 1 and the 29 images from Dataset 2 will be referred to as sample Dataset 2. Rater 2, who is an experienced researcher with more than 4 years of working with the wheat blast disease, visually estimated DS from the sample Dataset 1 and Dataset 2. Additionally, disease measurements were obtained from the sample Dataset 1 and the sample Dataset 2 using ImageJ software as indicated above. Ultimately, the DS results of visual disease estimations of human raters and ImageJ measurements were compared. The estimated and measured DS values from both samples were analyzed for inter-rater agreement in two scenarios, one with a scale of 0–100% DS (continuous data), and the other with the images divided into three categories of DS (ordinal data). We, therefore, computed Lin's Concordance Coefficient, Fleiss kappa, and weighted kappa statistics.

      The Lin's concordance coefficient (ρc or CCC) is used to estimate the accuracy1 between two raters using continuous data. From the analysis, we obtained the estimation of accuracy1, precision1, and bias of the disease estimations and disease measurements between the two raters (Lin, 1989; Madden et al., 2007; Bock et al., 2010). For accuracy1 (ρc) and precision1 (r), values range from 0 to 1; values close to 1 indicate high accuracy1 and precision1. Bias (Cb) ranges from 0 to 1, and values close to 1 indicate less bias (Nita et al., 2003). Lin's concordance analysis was performed by using PROG REG ALL procedure on SAS v.9.4 (Cary, NC), based on the macro developed by Lawrence Lin and verified by Min Yang (Lin et al., 2002).

      To determine the degree of association between the estimation of categorical information provided by the two raters (inter-rater agreement), the weighted kappa statistics were computed (Chmura, 1992; Graham and Jackson, 1993; Nelson and Edwards, 2015). The Fleiss kappa coefficient was used to compare the agreement of categorical information among all raters, (i.e., Rater 1, Rater 2, and ImageJ) (Fleiss et al., 2003). The values of both the weighted kappa and Fleiss kappa coefficients range from 0 to 1. Values from 0.5 to 1 indicate that the agreement is better than what is expected by chance (Nelson and Edwards, 2015; Tang et al., 2015; Mitani et al., 2017; Gamer et al., 2019). The Fleiss kappa statistics and weighted kappa were computed with the irr package of the R software (Team, 2017).

      Generation of Datasets According to Wheat Spike Physiological Changes

      Wheat was inoculated at the growth-stage Feekes 10.5 (spike completely emerged) of the host plant. Approximately every 2 days after the inoculation, the spike images were collected to capture the changes developed. Indirectly, progressive physiological changes in spikes were recorded, as maturing begins at wheat growth-stage Feekes10.5.4 (kernels watery ripe) and continues through the growth-stage Feekes 11.4 (mature kernels) (Large, 1954; Wise et al., 2011). During this period, the kernel hardened, and the green spike lose its color (maturing), which mimic the typically bleached spikes caused by wheat spike blast symptoms.

      Two datasets were generated considering the (color) physiological changes that can lead to confusion when training the CNN model. Dataset 1, included maturing and non-matured wheat spikes; and Dataset 2 included only non-matured spikes (data available at: https://purr.purdue.edu/publications/3772/1). The proposed CNN model was trained using the two datasets. Each dataset was randomly separated into the training and testing datasets. The CNN model automatically extracted the features of each image in the training dataset to learn a good classifier, whereas the testing dataset was used to evaluate the performance of the trained CNN model. In general, an unseen dataset was applied to evaluate the CNN model to ensure that the model was not under-fitting or over-fitting. In this research, 80% of the images were categorized and used as the training set, and the remaining 20% as the testing set. Table 1 lists the original distribution of the number of images in Dataset 1 and Dataset 2. Although Category 3 covers a large variability, it does not mean the number of the data in Category 3 is larger than the other two categories. The number of images in each category was extremely imbalanced and using them indiscriminately could have resulted in a biased model. Fortunately, there are several viable methods to cope with the disproportionate training data in each category.

      Training and testing data distribution and the number of images used in Dataset 1 and Dataset 2.

      Sets Category 1 Category 2 Category 3
      Dataset 1 (Maturing and non-matured spikes)
      Training 1,595 640 402
      Augmented training 1,595 1,920 1,608
      Testing 381 178 110
      Dataset 2 (Non-matured spikes only)
      Training 1,430 386 307
      Augmented training 1,430 1,544 1,535
      Testing 327 120 90

      Category 1: 0% severity, Category 2: 0.1–20% severity, Category 3: 20.1–100% severity.

      Data augmentation is a common technique providing a viable solution to data shortage issues by adding copies of original images with modification or noise (Boulent et al., 2019). Data augmentation was used in this study to balance the number of images in each category. In this study, images were randomly flipped horizontally and vertically in order to increase the number of images in Categories 2 and 3. Thus, for Dataset 1, training data were triplicated in Category 2 and quadrupled in Category 3 (Table 1). For Dataset 2, training data were quadrupled in Category 2 and quintupled in Category 3 (Table 1).

      Deep CNN Model

      In recent years, the feasibility of using artificial intelligence, in particular deep learning, has been expanded into a variety of applications (Atha and Jahanshahi, 2018; Chen and Jahanshahi, 2018; Kumar et al., 2018; Wu and Jahanshahi, 2019). Deep learning is a subset of machine learning that enables computers to automatically extract features from a huge amount of data and learn to classify data.

      In this study, wheat spike blast symptoms were automatically detected and classified into three severity categories using a pre-trained CNN model. This model may be more efficient than classifying images visually. To obtain a general and reliable CNN model, the network needed to be trained using a large labeled training dataset. The performance of the CNN model is highly dependent on the number and quality of the training data. However, it was hard to collect a wheat blast dataset having a million images in a short time. The performance CNN model can easily lead to under- or over-fitting due to the lack of a large dataset for training. To address this issue, transfer learning was used as a practical solution where a network was trained using a typically different larger dataset such as ImageNet. A major advantage of using transfer learning is that it can adapt the parameters trained from an abundant number of images. Transfer learning starts with a pre-trained model, e.g., VGG16 model, and replaces the fully-connected (FC) layers of the model with new FC layers. A network trained on the ImageNet dataset was used to initialize the network parameters, and the whole network was fine-tuned since the nature of our dataset was very different from the ImageNet dataset. In this study, an FC layer that consisted of three nodes, representing three categories, were appended to the end of the network. A residual neural network architecture (ResNet101), a CNN model with 101 layers with recurrent connection trained on ImageNet data (He et al., 2015, 2016), was selected as the pre-trained model. Furthermore, as shown in Table 1, it was extremely difficult to obtain a large number of images in each category. An unbalanced dataset can result in a biased CNN model. To address this issue in the dataset, the loss function, which was used to optimize the parameter in a neural network, was transformed into a weighted loss function (Equation 2) by assigning individual weights to each category. Equation (2) defines the cross-entropy loss function in the CNN model, where ωcategory is the assigned weight to each of the categories, the first term in Equation (2) is a negative log-likelihood loss, and the second term in Equation (2) is log-softmax. Four cases of study were tested with an individual weight set to the loss functions assigned to different categories. In the experiments, “cases” refer to specific combinations of weight loss functions for each of the three DS categories (Table 2). Case 1 was the non-weight set [1, 1, 1], with all categories sharing the same class weight. Case 2 used [1, 10, 1] class weights in the loss function, meaning that the highest weight was for Category 2, which includes plants at early disease stages and low levels of disease symptoms. Case 3 used [2, 5, 1] class weights in the loss function, meaning that the higher weight was assigned to Categories 1 (no symptoms) and 2 (early stages and low levels of disease symptoms). Case 4 had class weights [2, 1, 1] in the loss function, assigning a higher weight to category 1 (no symptoms) (Table 2).

      loss(x, category)=-ωcategory *logexcategoryj=1Nexj                     =-ωcategory  (-xcategory + log (jexp(xj)))

      The network was trained for 15 epochs using a stochastic gradient descent optimizer (Bottou, 2010), a learning rate of 0.0001 was used, and the batch size was 16. Additionally, 5-folds cross-validation was applied to the training process. The training took place on a Linux server with Ubuntu 14.04. The server included two Intel Xeon E5-2620 v4 CPUs, 256-GB DDR4 memories, and four NVIDIA Titan X Pascal GPUs. Pytorch (Paszke et al., 2017) was used to implement the CNN.

      Two datasets trained the CNN model with four cases of the study through different weights in loss functions for each category.

      Values of weighted loss function per category [1, 2, 3]
      Model Dataset 1 (Maturing and non-matured spikes) Dataset 2 (Non-matured spikes)
      Case 1 [1, 1, 1] [1, 1, 1]
      Case 2 [1, 10, 1] [1, 10, 1]
      Case 3 [2, 5, 1] [2, 5, 1]
      Case 4 [2, 1, 1] [2, 1, 1]

      [Category 1: 0% severity, Category 2: 0.1–20% severity, Category 3: 20.1–100% severity].

      Model Performance Evaluation

      The performance of the CNN model was evaluated via the classified results of the testing dataset. A 3 × 3 confusion matrix was used to describe the prediction result of the model. Each row of the confusion matrix represented the ground truth of the data, and each matrix column corresponded to a predicted category by the CNN model. Thus, the diagonal elements of the matrix, called true positive (TP), were the number of wheat images correctly classified into the ground truth. The false positive (FP) for each Category was the sum of all errors in that column. For example, the FP of Category 1 was the number of Category 2 and Category 3 severities that were incorrectly classified as Category 1. Based on the confusion matrix, additional evaluation metrics were calculated.

      Accuracy2 was defined as the total number of TP among three categories divided by the total number of the predictions. Precision2 was defined as the total number of the TP instances divided by the total number of predicted positive examples, which was the summation of TP and FP instances in the binary classification task (Equation 3). Similarly, the precision2 of the multi-classes task illustrates the number of instances that were correctly predicted given all the predicted labels for a given category. Recall was defined as the TP instance divided by all the positive samples (TP and FN) (Equation 4). F1 score is a single metric that encompasses both precision2 and recall (Equation 5). Accuracy2, precision2, recall, and F1 score metrics ranged from 0 to 1, where higher values indicate the high predictive ability of the model.

      Precision2= TPTP+FP  Recall= TPTP+FN F1score=2 × precision × recallprecision+ recall 
      Results Cultivar Response to Wheat Spike Blast Under Controlled Conditions

      The final wheat spike blast severity was at day 19 after inoculation when cultivar Atlax reached 100% average DS, followed by Bobwhite (99.7%), San Pablo (32.9%), BR-18 (8.7%), Motacú (3.7%), AN-120 (3.31%), Urubó (1.9%), and Sossego (0.83%). Wheat spike blast symptoms developed on all tested cultivars, with reactions to MoT infection consistent with previous reports, except for cultivar San Pablo that showed moderate susceptibility (Baldelomar et al., 2015; Cruz et al., 2016b; Cruppe et al., 2020; Fernández-Campos et al., 2020; Gongora-Canul et al., 2020). Cultivar Atlax exhibited the highest DS of all the cultivars and had a high level of susceptibility to wheat spike blast.

      Inter-rater Agreement Analysis

      The Lin's concordance correlation analysis showed a high accuracy1 (ρc = 0.89–0.91), high precision1 (r = 0.91–0.94), and less bias (Cb = 0.95–0.99) in the sample Dataset 2 than in the sample Dataset 1 (ρc = 0.77–0.85, precision1 r = 0.80–0.87, and bias Cb = 0.93–0.98) (Figure 3). In the sample Dataset 1, the highest accuracy1 was between Rater 1 and Rater 2 (ρc = 0.85) and between Rater 1 and ImageJ (ρc = 0.85). In the sample Dataset 2, the highest accuracy1 value was between Rater 1 and ImageJ (ρc = 0.92), followed by between Rater 1 and Rater 2 (ρc = 0.91). In both sample datasets, strong accuracy1, high precision1, and low bias involved Rater 1, providing evidence that ratings of disease based on continuous data were done correctly for further classification of the images into categories for model training.

      Regression analysis of wheat spike blast DS estimations made by Rater 1 (responsible to estimate the severity of total image dataset) vs. Rater 2 (expert in wheat blast) and ImageJ DS measurements (image analysis software). Graphs show accuracy (ρc), precision(r), bias (Cb), scale shift (ν), and location shift (u) for wheat spike blast continuous Dataset 1 (A–C) (n = 31 images) and Dataset 2 (D–F) (n = 29 images). (A) Disease estimation comparison from images Dataset 1 between Rater 1 and Rater 2. (B) Disease estimation and disease measurement comparison from images Dataset 1 between Rater 1 and ImageJ. (C) Disease estimation and disease measurement comparison from images Dataset 1 between Rater 2 and ImageJ. (D) Disease estimation comparison from images Dataset 2 between Rater 1 and Rater 2. (E) Disease estimation and disease measurement comparison from images Dataset 2 between Rater 1 and ImageJ. (F) Disease estimation and disease measurement comparison from images Dataset 2 between Rater 2 and ImageJ.

      The weighted kappa statistics (κ), used to quantify inter-rater agreement, were higher in the sample Dataset 1 than in the sample Dataset 2, with κ = 0.72–0.88 (p < 0.01) and κ = 0.78–0.85 (p < 0.01), respectively (Table 3). In the sample Dataset 1, the highest agreement occurred between Rater 1 and ImageJ (κ = 0.88), and in the sample Dataset 2, the highest agreement was between Rater 1 and Rater 2 (κ = 0.85). In both sample datasets, the substantial agreement involved the ground truth (Rater 1), providing evidence that ratings were done correctly for further classification of the images into categories for model training.

      Values of weighted Kappa (κ) analysis for inter-rater agreement between raters and ImageJ in Dataset 1 (maturing and non-matured spikes) and Dataset 2 (non-matured spikes) of wheat spike blast under controlled environment.

      Dataset 1 Dataset 2
      Categories κ z κ z
      Rater 1x vs. ImageJz 0.882** 4.93 0.822** 4.45
      Rater 2y vs. ImageJ 0.727** 4.13 0.776** 4.32
      Rater 1 vs. Rater 2 0.747** 4.32 0.849** 4.65

      p < 0.01.

      Rater 1: Responsible to estimate the severity of the total image dataset.

      Rater 2: Expert in the wheat blast.

      ImageJ: Image analysis software.

      The Fleiss kappa coefficient (Fκ), which compared the association of ordinal categorical information of two or more raters, showed an Fκ = 0.771 (n = 31, z = 9.26, p < 0.001) for the sample Dataset 1 and 0.697 (n = 29, z = 8.1, p < 0.001) for the sample Dataset 2, indicating substantial agreement among the human raters and ImageJ in both datasets. However, the sample Dataset 1 possessed a higher Fleiss kappa coefficient index than the sample Dataset 2, both presented substantial agreement between the rates and ImageJ. Yet, the evidence supported the fact that the three raters correctly estimated the amount of the disease from the same image.

      Deep CNNs Model Performance

      To train the proposed CNN model, two different datasets were used. As mentioned above in the section Generation of Datasets According to Wheat Spike Physiological Changes, testing reliability of Rater 1, Dataset 1 included matured and non-matured wheat spikes and Dataset 2 included only non-matured spikes (Table 1). Four cases applied different weight set of loss functions in both Datasets (Table 2, Supplementary Figures 1, 2). The performance of the CNN model was evaluated via the classified result of the testing data.

      The testing accuracy2 of the model trained with Dataset 1 was 90.1% in Case 1, 90.4% in Case 2, 90.0% in Case 3, and 87.7% in Case 4. The testing accuracy2 of Dataset 2 was 98.4% in Case 1, 93.9% in Case 2, 95.0% in Case 3, and 94.2% in Case 4. Dataset 2 presented higher accuracy2 values compared to Dataset 1, suggesting that the model was accurate. However, it was not sufficient to claim that the model was reliable based on accuracy2 alone since the dataset in this study was unbalanced. In addition to accuracy2, other metrics can help evaluate the performance of the CNN model, such as precision2, recall, and F1 score.

      Precision2 indicates the ability to correctly classify an instance in all predicted positive instances. The focus was on the performance of the CNN model in Category 2 as this was the category that breeders and pathologists will concentrate on for breeding purposes. Dataset 1 Case 2 showed the lowest precision2 (75.4%) among all cases values (Table 4). Moreover, the confusion matrix of Dataset 1 Case 2 showed that the model misclassified 38 images of Category 1 (no symptoms) as Category 2 (early disease stages and low levels of disease symptoms), which was the highest number of wrongly classified images among all the cases (Figure 4B). This suggested that the class weight of Category 2 might be too high since its misclassified images that belonged to other categories as Category 2. Hence, the class weight combination was modified by lowering the weight in Category 2 and increasing the weight in Category 1 as to not overemphasize the impact from Category 2. Precision2 of Category 2 significantly increased from 75.4% in Case 2 to 84.1% in Case 3, and to 85.0% in Case 4 (Table 4). In Case 2, precision2 of Category 2 significantly increased from 75.4% in Dataset 1 to 90.2% in Dataset 2 (Table 4). Precision2 of Category 2 significantly increased from 90.2% in Case 2 to 92.7% in Case 3 and from 90.2% in Case 2 to 94.1% in Case 4 (Table 4).

      Classification performance of the CNN model when classifying the testing set of Dataset 1 (maturing and non-matured spikes) and Dataset 2 (non-matured spikes) in the cases of the study presented different weights in the loss function [weight in Category 1, weight in Category 2, weight in Category 3].

      Dataset 1 Dataset 2
      Model Performance Index Category 1 Category 2 Category 3 Category 1 Category 2 Category 3
      Case 1(A) Precision 0.891 0.852 0.955 0.923 0.918 0.967
      Recall 0.945 0.742 0.955 0.985 0.750 0.967
      F-1 score 0.917 0.793 0.955 0.953 0.826 0.967
      Case 2(B) Precision 0.926 0.754 0.950 0.952 0.902 0.936
      Recall 0.890 0.860 0.864 0.963 0.842 0.978
      F-1 score 0.908 0.803 0.905 0.957 0.871 0.957
      Case 3(C) Precision 0.915 0.841 0.938 0.953 0.927 0.967
      Recall 0.929 0.803 0.955 0.985 0.842 0.967
      F-1 score 0.922 0.822 0.946 0.968 0.882 0.967
      Case 4(D) Precision 0.915 0.850 0.946 0.942 0.941 0.946
      Recall 0.937 0.798 0.964 0.991 0.792 0.967
      F-1 score 0.926 0.823 0.955 0.966 0.860 0.956

      Case 1 [1, 1, 1],

      Case 2 [1, 10, 1],

      Case 3 [2, 5, 1], and

      Case 4 [2, 1, 1].

      The performance measures per class considered were precision, recall, and F1 score.

      Confusion matrix of the images of Dataset 1 (non-matured spikes only) showing “true” categories by Rater 1 (y-axis) and predicted categories by the CNN model (x-axis). Category 1: contained images with 0% severity, Category 2: 0.1–20% severity, Category 3: 20.1–100% severity. The cases of study presented different weights in the loss function [weight in Category 1, weight in Category 2, weight in Category 3]. (A) Case 1 [1, 1, 1], (B) Case 2 [1, 10, 1], (C) Case 3 [2, 5, 1], and (D) Case 4 [2, 1, 1]. Values and color intensity represent number of images.

      The recall metric for evaluating the CNN model that indicates the ability to correctly recognize a category was also used. In datasets 1 and 2, the recall of Category 2 was the lowest, illustrating the challenge of the model to classify images of Category 2 (early disease stages and low levels of disease symptoms) (Table 4). The highest recall of Dataset 1 Category 2 was 86.0% in Case 2, and the lowest was 74.2% in Case 1 (Table 4). This was expected given that Case 2 had a higher weight in the loss function of Category 2 compared to Case 1 (non-weighted loss function). In Case 2, Dataset 1, the recall values were similar among the three categories (Table 4). In Dataset 2 Category 2, the lowest recall was 75.0% in Case 1, and the highest recall was 84.2% in cases 2 and 3 (Table 4). The model in these two cases had the highest weight in loss function of Category 2 (early disease stages and low levels of disease symptoms).

      F1 score is a common indicator of the overall performance of the CNN model. In datasets 1 and 2, the F1 score of Category 2 was the lowest, reaffirming the difficulty of classifying images of Category 2 by the model (Table 4). The lowest F1 score of Dataset 1 Category 2, was 79.3% in Case 1, while the highest was 82% in both Case 3 and Case 4 (Table 4). In Dataset 2 Category 2, the lowest F1 score was 82.6% in Case 1, and the highest F1 score was 88.2% in Case 3 followed by Case 2 with 87.1% (Table 4).

      A comparison of outcomes revealed that Category 2 was the most difficult category to classify correctly (Figure 4). This difficulty was attributed to the disease symptoms being barely visible at the early stage of infection, and some wheat spikes in Category 1 were maturing, and their color was similar to that of MoT infected spikes. We observed that the highest number of images exactly classified as Category 2 was obtained with the Case 2 Dataset 1 (Figure 4B). These results suggested that Case 2 was the most appropriate to classify wheat spike blast images in Dataset 1 because it was capable of detecting the infection at an early stage. Even though Case 2 had a slightly lower precision, this is considered the usual trade-off between precision2 and recall for disease classification purposes. The recall, precision2, and F1 score increased after the images of maturing spikes were omitted when training the model with Dataset 2 (Figure 5). The cases 2 and 3 of Dataset 2 presented the highest number of images exactly classified as Category 2 (Figures 5B,C). Cases 2 and 3 were the most appropriate to detect the wheat spike blast in Dataset 2 because the model was capable of detecting the infection in the early stages. Additionally, in all the cases, the model was more stable predicting Category 3, which is relevant because it covers DS from 20.1 to 100%, potentially aiding breeders and pathologists to discern higher levels of susceptibility among cultivars. Although the CNN model misclassified some images of Category 2, it still provided a promising approach to classify the severity of the disease. It demonstrated that the CNN model is potentially a good method for breeders and pathologists.

      Confusion matrix of the images of Dataset 2 (non-matured spikes only) showing “true” categories by Rater 1 (y-axis) and the predicted categories by the CNN model (x-axis). Category 1: contained images with 0% severity, Category 2: 0.1–20% severity, Category 3: 20.1–100% severity. The cases of study presented different weights in the loss function [weight in Category 1, weight in Category 2, and weight in Category 3]. (A) Case 1 [1, 1, 1], (B) Case 2 [1, 10, 1], (C) Case 3 [2, 5, 1], and (D) Case 4 [2, 1, 1]. Values and color intensity represent number of images.

      Discussion

      Wheat blast is spreading worldwide, the identification of durable and broad-spectrum resistance is urgently needed (Valent et al., 2021). There are a few known sources of effective resistance, and therefore it is crucial to identify more genetic resources. Plant disease phenotyping is a bottleneck in the identification of novel sources of resistance. We developed the first deep CNNs model for wheat spike blast phenotyping under controlled environment.

      This study results demonstrated that the agreement between disease estimations and disease measurements was more significant than what could have been expected to occur by chance. Rater 1 (a pathologist with expertise in multiple diseases besides blast) consistently obtained the higher kappa coefficient (substantial agreement), higher accuracy, and lower bias in all the performed analyses than disease estimations of an expert (Rater 2) in the wheat blast and the disease measurements of ImageJ software. These results are relevant because Rater 1 estimated the DS and classified the entire image dataset into three categories. Therefore, the agreement analysis supports an accurate classification of the images before they were used to train and test the CNN model. The inter-rater agreement analysis also showed that accuracy, precision, and bias are highly dependent on the nature of the dataset. Dataset 1 included images showing disease symptoms and natural plant physiological changes. However, although Dataset 2 was preferred due to higher concordance, results showed that DS assessments among raters were never perfect.

      In the present study, the applicability of CNNs for wheat spike blast severity classification from spring wheat images was investigated. Currently, the CNN approach can classify three severity levels (0%, 0.1–20%, and 20.1–100% severity) and was trained using a reliable wheat spike blast dataset. The advantage of this three categories CNN model is that it detects the infected wheat spike and provides further information on the corresponding blast severity level. It is useful to have such a model to classify different infection levels and identify the resistant cultivars from the susceptible ones. Despite the wheat blast dataset comprising of imbalanced data that could have led to a biased CNN model, two techniques, including data augmentation and weighted loss function, were applied to the training process. The loss function is a function map of the difference between the ground truth and predicted output of the model. The importance of a category with a larger error can be enhanced by assigning a weighted variable in the loss function. The results indicate that the performance of the model has a significant improvement when the weighted loss function is applied. In particular, the model has gained the ability to detect Category 2 using a weighted loss function. These encouraging results demonstrate that the proposed CNN model can distinguish Category 1 and Category 2 even though there is a relatively little difference between both the categories. More significant, the CNN could classify the images of Category 3 with low error, which contained infected spikes with severities higher than 20%.

      The results showed that the CNN models trained in both datasets (Fernandez-Campos et al., 2021) presented good performance classifying the wheat spike blast images in the corresponding severity categories. However, the models trained without images of wheat maturing spikes showed higher precision2, recall, and F1 score when classifying the images than the models trained with maturing and not matured wheat spikes. The performance of the model trained with maturing and non-matured spikes is a critical finding from a biological/physiological point of view. These symptoms on spikes are often reported when wheat has reached the medium milk-to-dough growth stage (Cruz et al., 2016b). The reason why the rating is often stopped at the milk-to-dough stage is that from that point forward, physiological maturity starts to kick in. Our findings will serve to provide future and explicit guidelines to potential users of the preferred model. Users will need to acknowledge the natural wheat maturity process (which alters spike color from green to yellow/white), which can confuse the CNN model. This statement applies when phenotyping for wheat blast or similar diseases with symptoms characterized by spike bleaching [e.g., Fusarium graminearum (Fusarium head blight)].

      Different software based on image analysis are currently available to measure DS (Lamari, 2002; Vale et al., 2003). We used ImageJ, a free image-processing software, and manually thresholded images to measure wheat spike blast severity. de Melo et al. (2020) indicated the inevitable error when delineating the disease area with image analysis software (Bock et al., 2008). This is a challenge that future research needs to address when disease symptoms are not well-defined.

      Researchers could benefit from the proposed approach promising for wheat spike blast severity measurements under controlled environmental conditions. Results are supported by a substantial agreement with “true” data obtained from Rater 1, compared against disease estimations of Rater 2, and disease measurements of ImageJ. In collaboration with data scientists, breeders could pre-select wheat cultivars under controlled environments by automatically analyzing and classifying images using the wheat spike blast CNN model preferably trained with Dataset 2. Next, the breeders can focus on the cultivars that fall into categories 1 and 2, which in general terms, are considered resistant or moderately resistant. This may reduce the high number of cultivars tested under field conditions, accelerating the cultivar screening process. A limitation of the study is that the CNN was trained to classify only images of wheat spike blast (spring wheat) under controlled conditions. Further research is required to improve the generalizability of the CNN model using a greater wheat spike blast dataset consisting of controlled and field images. In addition, the results in this study show an opportunity that could be applied similar to other pathogens.

      The next step in this research is to validate the model with other images with a similar background and deploy it in a Web application. This future option might allow breeders and pathologists to submit their images and have the model classify them by categories automatically. As more images of various cultivars infected with different isolates can be added to the dataset, increasing symptom variability, a more refined and robust model can be developed. To our knowledge, this is the first study presenting a deep CNN model trained to detect and classify wheat spike blast symptoms. The model might help in the pre-screening of wheat cultivars against the blast fungus under controlled conditions in the future.

      Data Availability Statement

      The raw data supporting the conclusions of this article and corresponding models are available at: https://purr.purdue.edu/projects/wheatspikeblastcnn/publications/3772.

      Author Contributions

      MF-C, CDC, MJ, Y-TH, TW, and JJ contributed to the study's conception and design. MF-C and CDC conducted the experiment. MF-C collected data and wrote the first draft of the manuscript. MF-C and CG-C performed the statistical analysis. Y-TH and TW wrote the code for the model. Y-TH wrote sections of the manuscript. CDC, MJ, Y-TH, DT, and CG-C edited the manuscript. All authors approved the submitted version.

      Conflict of Interest

      The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

      We thank the Asociación de Productores de Oleaginosas y Trigo (ANAPO) and the Centro de Investigación Agrícola Tropical (CIAT) for the support provided with the experiments conducted in Bolivia and M. G. Rivadeneira from CIAT for the help with inoculum preparation. We acknowledge the Iyer-Pascuzzi Lab and A. P. Cruz for their guidance in plant phenotyping and D. F. Baldelomar, L. Calderón, J. Cuellar, F. Cortéz, and D. Coimbra from ANAPO for their help with research activities. We also thank Dr. Barbara Valent (Kansas State University) and Gary Peterson for their contribution and commitment to the wheat blast work in South America. Borlaug Fellows Drs. Mr. Kabir and S. Das participated and were trained while conducting experiments associated with this work.

      Supplementary Material

      The Supplementary Material for this article can be found online at: /articles/10.3389/fpls.2021.673505/full#supplementary-material

      References Aman A. (2016). Wheat blast' threatens yield-farmers in 6 districts complain of infection. Dailystar. Available online at: https://www.thedailystar.net/backpage/wheat-blast-threatens-yield-784372 (accessed June 6, 2020). Atha D. J. Jahanshahi M. R. (2018). Evaluation of deep learning approaches based on convolutional neural networks for corrosion detection. Struct. Health Monit. 17, 11101128. 10.1177/1475921717737051 Baldelomar D. Cardenas S. Quispe K. (2015). Caracterización de genotipos de trigo (Triticum aestivum) con resistencia a piricularia durante el verano 2014/2015 en Quirusillas, Vol. 1. Santa Cruz: Revista científica de investigación INFO-INIAF, 1721. Barbedo J. G. A. (2016). A review on the main challenges in automatic plant disease identification based on visible range images. Biosyst. Eng. 144, 5260. 10.1016/j.biosystemseng.2016.01.017 Barbedo J. G. A. (2019). Plant disease identification from individual lesions and spots using deep learning. Biosyst. Eng. 180, 96107. 10.1016/j.biosystemseng.2019.02.002 Barea G. Toledo J. (1996). Identificación y zonificación de pyricularia (Pyricularia oryzae) en el cultivo de trigo en el departamento de Santa Cruz, Informe Técnico. Proyecto de Investigación Trigo, 7686. Bhattacharya R. Pal S. (2017). Deadly wheat blast symptoms enters India through the Bangladesh border, Bengal govt burning crops on war footing. Hindustan Times. Kolkata. Available online at: http://www.hindustantimes.com/kolkata/deadly-wheat-blast-symptoms-enters-india-through-the-bangladesh-border-bengal-govt-burning-crops-on-war-footing/story-3zoWQ0H7sdMU4HxQyzWUsN.html (accessed August 2020). Bock C. H. Barbedo J. G. A. Del Ponte E. M. Bohnenkamp D. Mahlein A. K. (2020). From visual estimates to fully automated sensor-based measurements of plant disease severity: status and challenges for improving accuracy. Phytopathol. Res. 2, 130. 10.1186/s42483-020-00049-8 Bock C. H. Parker P. E. Cook A. Z. Gottwald T. R. (2008). Visual rating and the use of image analysis for assessing different symptoms of citrus canker on grapefruit leaves. Plant Dis. 92, 530541. 10.1094/PDIS-92-4-053030769647 Bock C. H. Poole G. H. Parker P. E. Gottwald T. R. (2010). Plant disease severity estimated visually, by digital photography and image analysis, and by hyperspectral imaging. Crit. Rev. Plant Sci. 29, 59107. 10.1080/07352681003617285 Bottou L. (2010). Large-scale machine learning with stochastic gradient descent, in Proceedings of compstat, eds Lechevallier Y. Saporta G. (Heidelberg: Physica-Verlag HD), 177186. 10.1007/978-3-7908-2604-3_16 Boulent J. Foucher S. Théau J. St-Charles P. L. (2019). Convolutional neural networks for the automatic identification of plant diseases. Front. Plant Sci. 10:941. 10.3389/fpls.2019.0094131396250 Brahimi M. Arsenovic M. Laraba S. Sladojevic S. Boukhalfa K. Moussaoui A. (2018). Deep learning for plant diseases: detection and saliency map visualization, in Human Machine Learning, eds Zhou J. Chen F. (Cham: Springer), 93117. 10.1007/978-3-319-90403-0_6 Cabrera M. G. Gutiérrez S. (2007). Primer registro de Pyricularia grisea en cultivos de trigo del NE de Argentina, in Jornada de Actualización en Enfermedades de Trigo, Vol. 60 (Buenos Aires: IFSC Press). Camargo A. Smith J. S. (2009). Image pattern classification for the identification of disease causing agents in plants. Comput. Electron. Agr. 66, 121125. 10.1016/j.compag.2009.01.003 Cardoso C. A. A. Reis E. M. Moreira E. N. (2008). Development of a warning system for wheat blast caused by Pyricularia grisea. Summa Phytopathol. 34, 216221. 10.1590/S0100-54052008000300002 Ceresini P. Castroagudín V. L. Rodrigues A. F. Rios J. Aucique-Pérez C. Moreira S. I. . (2019). Wheat blast: from its origins in South America to its emergence as a global threat. Phytopathology 104, 95107. 10.1111/mpp.1274730187616 Chen F. Jahanshahi M. R. (2018). Deep learning-based crack detection using convolutional neural network and naïve bayes data fusion. IEEE Trans. Ind. Electron. 65, 43924400. 10.1109/TIE.2017.2764844 Chiang K. S. Bock C. H. El Jarroudi M. Delfosse P. Lee I. H. Liu H. I. (2016). Effects of rater bias and assessment method on disease severity estimation with regard to hypothesis testing. Plant Pathol. 65, 523535. 10.1111/ppa.12435 Chmura H. (1992). Measurement of reliability for categorical data in medical research. Stat. Methods Med. Res. 1, 183199. 10.1177/0962280292001002041341657 Cruppe G. Cruz C. Peterson G. Pedley K. Mohammad A. Fritz A. . (2019). Novel sources of wheat head blast resistance in modern breeding lines and wheat wild relatives. Plant Dis. 104, 3543. 10.1094/PDIS-05-19-0985-RE31660799 Cruppe G. Silva P. da Silva C. L. Peterson G. Pedley K. F. Cruz C. D. . (2020). Genome wide association reveals limited benefits of pyramiding the 1B and 1D loci with the 2NvS translocation for wheat blast control. Crop Sci. 61, 115. 10.1002/csc2.20397 Cruz C. D. Bockus W. Stack J. Valent B. Maciel J. N. Peterson G. L. (2016a). A standardized inoculation protocol to test wheat cultivars for reaction to head blast caused by Magnaporthe oryzae (Triticum pathotype). Plant Health Prog. 17, 186187. 10.1094/PHP-BR-16-0041 Cruz C. D. Kiyuna J. Bockus W. Todd T. C. Stack J. P. Valent B. (2015). Magnaporthe oryzae conidia on basal wheat leaves as a potential source of wheat blast inoculum. Plant Pathol. 64, 14911498. 10.1111/ppa.12414 Cruz C. D. Peterson G. L. Bockus W. W. Kankanala P. Dubcovsky J. Jordan K. W. . (2016b). The 2NvS translocation from Aegilops ventricosa confers resistance to the triticum pathotype of Magnaporthe oryzae. Crop Sci. 56, 9901000. 10.2135/cropsci2015.07.0410 Cruz C. D. Valent B. (2017). Wheat blast disease: danger on the move. Trop. Plant Pathol. 42, 210222. 10.1007/s40858-017-0159-z de Melo V. P. da Silva Mendoca A. C. de Souza H. S. Gabriel L. C. Bock C. H. Eaton M. J. . (2020). Reproducibility of the development and validation process of standard area diagram by two laboratories: an example using the Botrytis cinerea/Gerbera jamesonii pathosystem. Plant Dis. 104, 24402448. 10.1094/PDIS-08-19-1708-RE32649269 Everitt B. Anders S. (2010). The Cambridge Dictionary of Statistics. Cambridge: Cambridge University Press. 10.1017/CBO9780511779633 Fernández-Campos M. Góngora-Canul C. Das S. Kabir M. Valent B. Cruz C. (2020). Epidemiological criteria to support breeding tactics against the emerging, high-consequence wheat blast disease. Plant Dis. 104, 138. 10.1094/PDIS-12-19-2672-RE32584156 Fernandez-Campos M. S. Huang Y. Jahanshahi M. Wang T. Jin J. Telenko D. . (2021). Wheat spike blast image classification using deep convolutional neural networks. Purdue University Research Repository. West Lafayette 10.4231/P0Y7-3428 Fiorani F. Schurr U. (2013). Future scenarios for plant phenotyping. Annu. Rev. Plant Biol. 64, 267291. 10.1146/annurev-arplant-050312-120137 Fleiss J. L. Levin B. Paik M. C. (2003). Statistical Methods for Rates and Proportions. John Wiley and Sons, Inc. Gamer M. Lemon J. Fellows I. Singh P. (2019). Package 'irr' Title Various Coefficients of Interrater Reliability and Agreement. Avaialble online at: https://CRAN.R-project.org/package=irr Ghosal S. Blystone D. Singh A. K. Ganapathysubramanian B. Singh A. Sarkar S. (2018). An explainable deep machine vision framework for plant stress phenotyping. Proc. Natl. Acad. Sci. U.S.A. 115, 46134618. 10.1073/pnas.1716999115 Gongora-Canul C. Salgado J. Singh D. Cruz A. Cotrozzi L. Couture J. J. . (2020). Temporal dynamics of wheat blast epidemics and disease measurements using multispectral imagery. Phytopathology 110, 393405. 10.1094/PHYTO-08-19-0297-R31532351 Goulart A. Paiva F. (1992). Incidência da brusone (Pyricularia oryzae) em diferentes cultivares de trigo (Triticum aestivum) em condições de campo. Fitopatol bras. 17, 321325. Goulart A. C. P. Sousa P. G. Urashima A.S. (2007). Damages in wheat caused by infection of Pyricularia grisea. Summa Phytopath. 33, 35863. 10.1590/S0100-54052007000400007 Graham P. Jackson R. (1993). The analysis of ordinal agreement data: beyond weighted kappa. J. Clin. Epidemiol. 46, 10551062. 10.1016/0895-4356(93)90173-X8263578 He K. Zhang X. Ren S. Sun J. (2015). Deep residual learning for image recognition. arXiv [Preprint] 495arXiv:1512.03385.32166560 He K. Zhang X. Ren S. Sun J. (2016). Deep residual learning for image recognition, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016 (IEEE), 770778. 10.1109/CVPR.2016.9032166560 Igarashi S. (1986). Brusone no trigo- histórico e distribuição geográfica no Paraná, in Reunião nacional de pesquisa de trigo (Passo Fundo), 15. Igarashi S. Utiamada C. M. Kazuma I. L. C. H. A. Lopes R. S. (1986). Pyricularia em trigo. 1. Ocorrencia de Pyricularia sp. no estado do Parana. Fitopatol. Bras. 11, 351352. Juliana P. He X. Kabir M. R. Roy K. K. Anwar M. B. Marza F. . (2020). Genome-wide association mapping for wheat blast resistance in CIMMYT's international screening nurseries evaluated in Bolivia and Bangladesh. Sci Rep. 10:15972. 10.1038/s41598-020-72735-833009436 Kersting K. Bauckhage C. Wahabzada M. Mahlein A. K. Steiner U. Oerke E. C. . (2016). Feeding the world with big data: uncovering spectral characteristics and dynamics of stressed plants, in Computational Sustainability, Vol. 645 eds Lässig J. Kersting K. Morik K. (Cham: Springer), 99120. 10.1007/978-3-319-31858-5_6 Kruse O. M. O. Prats-Montalbán J. M. Indahl U. G. Kvaal K. Ferrer A. . (2014). Pixel classification methods for identifying and quantifying leaf surface injury from digital images. Comput. Electron. Agr. 108, 155165. 10.1016/j.compag.2014.07.010 Kumar S. Kashyap P. L. Mahapatra S. Jasrotia P. Singh G. P. (2020). New and emerging technologies for detecting Magnaporthe oryzae causing blast disease in crop plants. Crop Protect. 143:105473. 10.1016/j.cropro.2020.105473 Kumar S. S. Abraham D. M. Jahanshahi M. R. Iseley T. Starr J. (2018). Automated defect classification in sewer closed circuit television inspections using deep convolutional neural networks. Autom. Construc. 91, 273283. 10.1016/j.autcon.2018.03.028 Lamari L. (2002). ASSESS: Image Analysis Software for Plant Disease Quantification. St. Paul, MN: APS Press. Large E. C. (1954). Growth stages in cereals illustration of the Feeks scale. Plant Pathol. 3, 128129. 10.1111/j.1365-3059.1954.tb00716.x Large E. C. (1966). Measuring plant disease. Ann. Rev. Phytopathol. 4, 928. 10.1146/annurev.py.04.090166.000301 Lin L. Hedayat A. S. Sinha B. Yang M. (2002). Statistical methods in assessing agreement. J. Am. Stat. Assoc. 97, 257270. 10.1198/016214502753479392 Lin L. I. K. (1989). A concordance correlation coefficient to evaluate reproducibility. Biometrics 45, 255268. 10.2307/25320512720055 Lobet G. (2017). Image analysis in plant sciences: publish then perish. Trends Plant Sci. 22, 559566. 10.1016/j.tplants.2017.05.00228571940 Lu J. Hu J. Zhao G. Mei F. Zhang C. (2017). An in-field automatic wheat disease diagnosis system. Compt. Electron. Agr. 142, 369379. 10.1016/j.compag.2017.09.012 Madden L. V. Hughes G. van den Bosch F. (2007). The Study of Plant Disease Epidemics. St. Paul, MN: APS Press. Mahlein A. K. (2015). plant disease detection by imaging sensors – parallels and specific demands for precision agriculture and plant phenotyping. Plant Dis. 100, 241251. 10.1094/PDIS-03-15-0340-FE30694129 Mahlein A. K. Kuska M. T. Behmann J. Polder G. Walter A. (2018). Hyperspectral sensors and imaging technologies in phytopathology: state of the art. Ann. Rev. Phytopathol. 56, 535558. 10.1146/annurev-phyto-080417-05010030149790 Malaker P. K. Barma N. C. D. Tiwari T. P. Collis W. J. Duveiller E. Singh P. K. . (2016). First report of wheat blast caused by Magnaporthe oryzae pathotype triticum in Bangladesh. Plant Dis. 100, 23302330. 10.1094/PDIS-05-16-0666-PDN Mitani A. A. Freer P. E. Nelson K. P. (2017). Summary measures of agreement and association between many raters' ordinal classifications. Ann. Epidemiol. 27, 677685.e4. 10.1016/j.annepidem.2017.09.00129029991 Mohanty S. P. Hughes D. P. Salathé M. (2016). Using deep learning for image-based plant disease detection. Front. Plant Sci. 7, 1419. 10.3389/fpls.2016.01419 Mottaleb K. A. Singh P. K. Sonder K. Kruseman G. Tiwari T. P. Barma N. C. D. . (2018). Threat of wheat blast to South Asia's food security: an ex-ante analysis. PLoS ONE. 13:e0197555. 10.1371/journal.pone.019755529782528 Nelson K. Edwards D. (2015). Measures of agreement between many raters for ordinal classifications. Stat. Med. 34, 31163132. 10.1002/sim.654629029991 Nita M. Ellis M. A. Madden L. V. (2003). Reliability and accuracy of visual estimation of Phomopsis leaf blight of strawberry. Phytopathology 93, 9951005. 10.1094/PHYTO.2003.93.8.99518943866 Nutter F. W. Gleason M. Jenco J. Christians N. (1993). Assessing the accuracy, intra-rater repeatability, and inter-rater reliability of disease assessment systems. Phytopathology 83, 806812. 10.1094/Phyto-83-806 Paszke A. Gross S. Chintala S. Chanan G. Yang E. DeVito Z. . (2017). Automatic Differentiation in PyTorch. Nips autodiff workshop. Perello A. Martinez I Molina M. (2015). First report of virulence and effects of Magnaporthe oryzae isolates causing wheat blast in Argentina. Plant Dis. 99, 11771178. 10.1094/PDIS-11-14-1182-PDN Prestes A. M. Arendt P. F. Fernandes J. M. C. Scheeren P. L. (2007). Resistance to Magnaporthe grisea among Brazilian wheat genotypes. Wheat Prod. Stress. Environ. 16, 119123. 10.1007/1-4020-5497-1_16 Schindelin J. Arganda-Carreras I. Frise E. Kaynig V. Longair M. Pietzsch T. . (2012). Fiji: an open-source platform for biological-image analysis. Nat. Methods. 9, 676682. 10.1038/nmeth.201922743772 Shakoor N. Lee S. Mockler T. C. (2017). High throughput phenotyping to accelerate crop breeding and monitoring of diseases in the field. Curr. Opin. Plant Biol. 38, 184192. 10.1016/j.pbi.2017.05.00628738313 Sibiya M. Sumbwanyambe M. (2019). An algorithm for severity estimation of plant leaf diseases by the use of colour threshold image segmentation and fuzzy logic inference: a proposed algorithm to update a “leaf doctor” application. Agri. Eng. 1, 205219. 10.3390/agriengineering1020015 Singh A. Ganapathysubramanian B. Sarkar S. Singh A. (2018). Deep learning for plant stress phenotyping: trends and future perspectives. Trends Plant Sci. 23, 883898. 10.1016/j.tplants.2018.07.00430104148 Singh A. Jones S. Ganapathysubramanian S. Mueller D. Sandhu K. Nagasubramanian K. (2021). Challenges and opportunities in machine-augmented plant stress phenotyping. Trends Plant Sc. 26, 5369. 10.1016/j.tplants.2020.07.01032830044 Tang W. Hu J. Zhang H. Wu P. He H. (2015). Kappa coefficient: a popular measure of rater agreement. Shanghai Arch. Psychiatry. 27, 6267. 10.11919/j.issn.1002-0829.21501025852260 Team R. C. (2017). R: A Language and Environment for Statistical Computing (Computer Software Manual). Vienna. Tembo B. Mulenga R. M. Sichilima S. M'siska K. K. Mwale M. Chikoti P. C. . (2020). Detection and characterization of fungus (Magnaporthe oryzae pathotype Triticum) causing wheat blast disease on rain-fed grown wheat (Triticum aestivum L.) in Zambia. PLoS ONE 15:e0238724. 10.1371/journal.pone.023872432956369 Urashima A. Lavorent N. A. Goulart A. C. P. Mehta Y. R. (2004). Resistance spectra of wheat cultivars and virulence diversity of Magnaporthe grisea isolates in brazil. Fitopatol. Bras. 29, 511518. 10.1590/S0100-41582004000500007 Vale F. X. R. Fernandes-Filho E. I. Liberato J. R. (2003). QUANT: a software for plant disease severity assessment, in Proceedings of the Eighth International Congress of Plant Pathology (Christchurch), 105. Valent B. (2016). Novel Strategies for Managing Blast Diseases on Rice and Wheat. Progress report 01/01/15 to 12/31/15. Available online at: https://portal.nifa.usda.gov/web/crisprojectpages/0231543-novel-strategies-for-managing-blast-diseases-on-rice-and-wheat.html (accessed August 2020). Valent B. Cruppe G. Stack J. P. Cruz C. D. Farman M. L. Paul P. A. . (2021). Recovery Plan for Wheat Blast Caused by Magnaporthe Oryzae Pathotype Triticum. Plant Health Prog. Vales M. Anzoátegui T. Huallpa B. Cazon M. I. (2018). Review on resistance to wheat blast disease (Magnaporthe oryzae Triticum) from the breeder point-of-view: use of the experience on resistance to rice blast disease. Euphytica 214, 119. 10.1007/s10681-017-2087-x Viedma L. (2005). Wheat blast occurrence in Paraguay. Phytopathology 95:152. Wang S. Asuke S. Vy T.T.P. Inoue Y. Chuma I. Win J. . (2018). A new resistance gene in combination with Rmg8 confers strong resistance against triticum isolates of Pyricularia oryzae in a common wheat landrace. Phytopathology. 108, 12991306. 10.1094/PHYTO-12-17-0400-R29767554 Wise K. Shaner G. Mansfield C. (2011). Purdue Extension Managing Wheat by Growth Stage. Retrieved from: www.the-education-store.com Wu R. T. Jahanshahi M. R. (2019). Deep convolutional neural network for structural dynamic response estimation and system identification. J. Eng. Mech. 145:04018125. 10.1061/(ASCE)EM.1943-7889.0001556 Yang W. Feng H. Zhang X. Zhang J. Doonan J. H. Batchelor W. D. . (2020). Crop phenomics and high-throughput phenotyping: past decades, current challenges, and future perspectives. Mol. Plant. 13, 187214. 10.1016/j.molp.2020.01.00831981735 Yasuhara-Bell J. Pedley K. F. Farman M. Valent B. Stack J. P. (2018). Specific detection of the wheat blast pathogen (Magnaporthe oryzae Triticum) by loop-mediated isothermal amplification. Plant Dis. 102, 25502559. 10.1094/PDIS-03-18-0512-RE30320534

      Funding. This work was supported by the USDA National Institute of Food and Agriculture Hatch Project accession number 1016253. We thank Purdue University start-up funds for supporting this work. This work was partially supported by the Borlaug Fellowship Program (Grant nos. FX18BF-10777R014 and FX19BF-10777R001) from the USDA Foreign Agricultural Service.

      ‘Oh, my dear Thomas, you haven’t heard the terrible news then?’ she said. ‘I thought you would be sure to have seen it placarded somewhere. Alice went straight to her room, and I haven’t seen her since, though I repeatedly knocked at the door, which she has locked on the inside, and I’m sure it’s most unnatural of her not to let her own mother comfort her. It all happened in a moment: I have always said those great motor-cars shouldn’t be allowed to career about the streets, especially when they are all paved with cobbles as they are at Easton Haven, which are{331} so slippery when it’s wet. He slipped, and it went over him in a moment.’ My thanks were few and awkward, for there still hung to the missive a basting thread, and it was as warm as a nestling bird. I bent low--everybody was emotional in those days--kissed the fragrant thing, thrust it into my bosom, and blushed worse than Camille. "What, the Corner House victim? Is that really a fact?" "My dear child, I don't look upon it in that light at all. The child gave our picturesque friend a certain distinction--'My husband is dead, and this is my only child,' and all that sort of thing. It pays in society." leave them on the steps of a foundling asylum in order to insure [See larger version] Interoffice guff says you're planning definite moves on your own, J. O., and against some opposition. Is the Colonel so poor or so grasping—or what? Albert could not speak, for he felt as if his brains and teeth were rattling about inside his head. The rest of[Pg 188] the family hunched together by the door, the boys gaping idiotically, the girls in tears. "Now you're married." The host was called in, and unlocked a drawer in which they were deposited. The galleyman, with visible reluctance, arrayed himself in the garments, and he was observed to shudder more than once during the investiture of the dead man's apparel. HoME香京julia种子在线播放 ENTER NUMBET 0016www.mgwfge.com.cn
      getpage.com.cn
      www.ho84.org.cn
      eqltuj.com.cn
      www.hzjyc.com.cn
      www.protestant.com.cn
      www.szcct0755.com.cn
      www.uhfjwz.com.cn
      www.tklvyou.com.cn
      wyggjg.com.cn
      处女被大鸡巴操 强奸乱伦小说图片 俄罗斯美女爱爱图 调教强奸学生 亚洲女的穴 夜来香图片大全 美女性强奸电影 手机版色中阁 男性人体艺术素描图 16p成人 欧美性爱360 电影区 亚洲电影 欧美电影 经典三级 偷拍自拍 动漫电影 乱伦电影 变态另类 全部电 类似狠狠鲁的网站 黑吊操白逼图片 韩国黄片种子下载 操逼逼逼逼逼 人妻 小说 p 偷拍10幼女自慰 极品淫水很多 黄色做i爱 日本女人人体电影快播看 大福国小 我爱肏屄美女 mmcrwcom 欧美多人性交图片 肥臀乱伦老头舔阴帝 d09a4343000019c5 西欧人体艺术b xxoo激情短片 未成年人的 插泰国人夭图片 第770弾み1 24p 日本美女性 交动态 eee色播 yantasythunder 操无毛少女屄 亚洲图片你懂的女人 鸡巴插姨娘 特级黄 色大片播 左耳影音先锋 冢本友希全集 日本人体艺术绿色 我爱被舔逼 内射 幼 美阴图 喷水妹子高潮迭起 和后妈 操逼 美女吞鸡巴 鸭个自慰 中国女裸名单 操逼肥臀出水换妻 色站裸体义术 中国行上的漏毛美女叫什么 亚洲妹性交图 欧美美女人裸体人艺照 成人色妹妹直播 WWW_JXCT_COM r日本女人性淫乱 大胆人艺体艺图片 女同接吻av 碰碰哥免费自拍打炮 艳舞写真duppid1 88电影街拍视频 日本自拍做爱qvod 实拍美女性爱组图 少女高清av 浙江真实乱伦迅雷 台湾luanlunxiaoshuo 洛克王国宠物排行榜 皇瑟电影yy频道大全 红孩儿连连看 阴毛摄影 大胆美女写真人体艺术摄影 和风骚三个媳妇在家做爱 性爱办公室高清 18p2p木耳 大波撸影音 大鸡巴插嫩穴小说 一剧不超两个黑人 阿姨诱惑我快播 幼香阁千叶县小学生 少女妇女被狗强奸 曰人体妹妹 十二岁性感幼女 超级乱伦qvod 97爱蜜桃ccc336 日本淫妇阴液 av海量资源999 凤凰影视成仁 辰溪四中艳照门照片 先锋模特裸体展示影片 成人片免费看 自拍百度云 肥白老妇女 女爱人体图片 妈妈一女穴 星野美夏 日本少女dachidu 妹子私处人体图片 yinmindahuitang 舔无毛逼影片快播 田莹疑的裸体照片 三级电影影音先锋02222 妻子被外国老头操 观月雏乃泥鳅 韩国成人偷拍自拍图片 强奸5一9岁幼女小说 汤姆影院av图片 妹妹人艺体图 美女大驱 和女友做爱图片自拍p 绫川まどか在线先锋 那么嫩的逼很少见了 小女孩做爱 处女好逼连连看图图 性感美女在家做爱 近距离抽插骚逼逼 黑屌肏金毛屄 日韩av美少女 看喝尿尿小姐日逼色色色网图片 欧美肛交新视频 美女吃逼逼 av30线上免费 伊人在线三级经典 新视觉影院t6090影院 最新淫色电影网址 天龙影院远古手机版 搞老太影院 插进美女的大屁股里 私人影院加盟费用 www258dd 求一部电影里面有一个二猛哥 深肛交 日本萌妹子人体艺术写真图片 插入屄眼 美女的木奶 中文字幕黄色网址影视先锋 九号女神裸 和骚人妻偷情 和潘晓婷做爱 国模大尺度蜜桃 欧美大逼50p 西西人体成人 李宗瑞继母做爱原图物处理 nianhuawang 男鸡巴的视屏 � 97免费色伦电影 好色网成人 大姨子先锋 淫荡巨乳美女教师妈妈 性nuexiaoshuo WWW36YYYCOM 长春继续给力进屋就操小女儿套干破内射对白淫荡 农夫激情社区 日韩无码bt 欧美美女手掰嫩穴图片 日本援交偷拍自拍 入侵者日本在线播放 亚洲白虎偷拍自拍 常州高见泽日屄 寂寞少妇自卫视频 人体露逼图片 多毛外国老太 变态乱轮手机在线 淫荡妈妈和儿子操逼 伦理片大奶少女 看片神器最新登入地址sqvheqi345com账号群 麻美学姐无头 圣诞老人射小妞和强奸小妞动话片 亚洲AV女老师 先锋影音欧美成人资源 33344iucoom zV天堂电影网 宾馆美女打炮视频 色五月丁香五月magnet 嫂子淫乱小说 张歆艺的老公 吃奶男人视频在线播放 欧美色图男女乱伦 avtt2014ccvom 性插色欲香影院 青青草撸死你青青草 99热久久第一时间 激情套图卡通动漫 幼女裸聊做爱口交 日本女人被强奸乱伦 草榴社区快播 2kkk正在播放兽骑 啊不要人家小穴都湿了 www猎奇影视 A片www245vvcomwwwchnrwhmhzcn 搜索宜春院av wwwsee78co 逼奶鸡巴插 好吊日AV在线视频19gancom 熟女伦乱图片小说 日本免费av无码片在线开苞 鲁大妈撸到爆 裸聊官网 德国熟女xxx 新不夜城论坛首页手机 女虐男网址 男女做爱视频华为网盘 激情午夜天亚洲色图 内裤哥mangent 吉沢明歩制服丝袜WWWHHH710COM 屌逼在线试看 人体艺体阿娇艳照 推荐一个可以免费看片的网站如果被QQ拦截请复制链接在其它浏览器打开xxxyyy5comintr2a2cb551573a2b2e 欧美360精品粉红鲍鱼 教师调教第一页 聚美屋精品图 中韩淫乱群交 俄罗斯撸撸片 把鸡巴插进小姨子的阴道 干干AV成人网 aolasoohpnbcn www84ytom 高清大量潮喷www27dyycom 宝贝开心成人 freefronvideos人母 嫩穴成人网gggg29com 逼着舅妈给我口交肛交彩漫画 欧美色色aV88wwwgangguanscom 老太太操逼自拍视频 777亚洲手机在线播放 有没有夫妻3p小说 色列漫画淫女 午间色站导航 欧美成人处女色大图 童颜巨乳亚洲综合 桃色性欲草 色眯眯射逼 无码中文字幕塞外青楼这是一个 狂日美女老师人妻 爱碰网官网 亚洲图片雅蠛蝶 快播35怎么搜片 2000XXXX电影 新谷露性家庭影院 深深候dvd播放 幼齿用英语怎么说 不雅伦理无需播放器 国外淫荡图片 国外网站幼幼嫩网址 成年人就去色色视频快播 我鲁日日鲁老老老我爱 caoshaonvbi 人体艺术avav 性感性色导航 韩国黄色哥来嫖网站 成人网站美逼 淫荡熟妇自拍 欧美色惰图片 北京空姐透明照 狼堡免费av视频 www776eom 亚洲无码av欧美天堂网男人天堂 欧美激情爆操 a片kk266co 色尼姑成人极速在线视频 国语家庭系列 蒋雯雯 越南伦理 色CC伦理影院手机版 99jbbcom 大鸡巴舅妈 国产偷拍自拍淫荡对话视频 少妇春梦射精 开心激动网 自拍偷牌成人 色桃隐 撸狗网性交视频 淫荡的三位老师 伦理电影wwwqiuxia6commqiuxia6com 怡春院分站 丝袜超短裙露脸迅雷下载 色制服电影院 97超碰好吊色男人 yy6080理论在线宅男日韩福利大全 大嫂丝袜 500人群交手机在线 5sav 偷拍熟女吧 口述我和妹妹的欲望 50p电脑版 wwwavtttcon 3p3com 伦理无码片在线看 欧美成人电影图片岛国性爱伦理电影 先锋影音AV成人欧美 我爱好色 淫电影网 WWW19MMCOM 玛丽罗斯3d同人动画h在线看 动漫女孩裸体 超级丝袜美腿乱伦 1919gogo欣赏 大色逼淫色 www就是撸 激情文学网好骚 A级黄片免费 xedd5com 国内的b是黑的 快播美国成年人片黄 av高跟丝袜视频 上原保奈美巨乳女教师在线观看 校园春色都市激情fefegancom 偷窥自拍XXOO 搜索看马操美女 人本女优视频 日日吧淫淫 人妻巨乳影院 美国女子性爱学校 大肥屁股重口味 啪啪啪啊啊啊不要 操碰 japanfreevideoshome国产 亚州淫荡老熟女人体 伦奸毛片免费在线看 天天影视se 樱桃做爱视频 亚卅av在线视频 x奸小说下载 亚洲色图图片在线 217av天堂网 东方在线撸撸-百度 幼幼丝袜集 灰姑娘的姐姐 青青草在线视频观看对华 86papa路con 亚洲1AV 综合图片2区亚洲 美国美女大逼电影 010插插av成人网站 www色comwww821kxwcom 播乐子成人网免费视频在线观看 大炮撸在线影院 ,www4KkKcom 野花鲁最近30部 wwwCC213wapwww2233ww2download 三客优最新地址 母亲让儿子爽的无码视频 全国黄色片子 欧美色图美国十次 超碰在线直播 性感妖娆操 亚洲肉感熟女色图 a片A毛片管看视频 8vaa褋芯屑 333kk 川岛和津实视频 在线母子乱伦对白 妹妹肥逼五月 亚洲美女自拍 老婆在我面前小说 韩国空姐堪比情趣内衣 干小姐综合 淫妻色五月 添骚穴 WM62COM 23456影视播放器 成人午夜剧场 尼姑福利网 AV区亚洲AV欧美AV512qucomwwwc5508com 经典欧美骚妇 震动棒露出 日韩丝袜美臀巨乳在线 av无限吧看 就去干少妇 色艺无间正面是哪集 校园春色我和老师做爱 漫画夜色 天海丽白色吊带 黄色淫荡性虐小说 午夜高清播放器 文20岁女性荫道口图片 热国产热无码热有码 2015小明发布看看算你色 百度云播影视 美女肏屄屄乱轮小说 家族舔阴AV影片 邪恶在线av有码 父女之交 关于处女破处的三级片 极品护士91在线 欧美虐待女人视频的网站 享受老太太的丝袜 aaazhibuo 8dfvodcom成人 真实自拍足交 群交男女猛插逼 妓女爱爱动态 lin35com是什么网站 abp159 亚洲色图偷拍自拍乱伦熟女抠逼自慰 朝国三级篇 淫三国幻想 免费的av小电影网站 日本阿v视频免费按摩师 av750c0m 黄色片操一下 巨乳少女车震在线观看 操逼 免费 囗述情感一乱伦岳母和女婿 WWW_FAMITSU_COM 偷拍中国少妇在公车被操视频 花也真衣论理电影 大鸡鸡插p洞 新片欧美十八岁美少 进击的巨人神thunderftp 西方美女15p 深圳哪里易找到老女人玩视频 在线成人有声小说 365rrr 女尿图片 我和淫荡的小姨做爱 � 做爱技术体照 淫妇性爱 大学生私拍b 第四射狠狠射小说 色中色成人av社区 和小姨子乱伦肛交 wwwppp62com 俄罗斯巨乳人体艺术 骚逼阿娇 汤芳人体图片大胆 大胆人体艺术bb私处 性感大胸骚货 哪个网站幼女的片多 日本美女本子把 色 五月天 婷婷 快播 美女 美穴艺术 色百合电影导航 大鸡巴用力 孙悟空操美少女战士 狠狠撸美女手掰穴图片 古代女子与兽类交 沙耶香套图 激情成人网区 暴风影音av播放 动漫女孩怎么插第3个 mmmpp44 黑木麻衣无码ed2k 淫荡学姐少妇 乱伦操少女屄 高中性爱故事 骚妹妹爱爱图网 韩国模特剪长发 大鸡巴把我逼日了 中国张柏芝做爱片中国张柏芝做爱片中国张柏芝做爱片中国张柏芝做爱片中国张柏芝做爱片 大胆女人下体艺术图片 789sss 影音先锋在线国内情侣野外性事自拍普通话对白 群撸图库 闪现君打阿乐 ady 小说 插入表妹嫩穴小说 推荐成人资源 网络播放器 成人台 149大胆人体艺术 大屌图片 骚美女成人av 春暖花开春色性吧 女亭婷五月 我上了同桌的姐姐 恋夜秀场主播自慰视频 yzppp 屄茎 操屄女图 美女鲍鱼大特写 淫乱的日本人妻山口玲子 偷拍射精图 性感美女人体艺木图片 种马小说完本 免费电影院 骑士福利导航导航网站 骚老婆足交 国产性爱一级电影 欧美免费成人花花性都 欧美大肥妞性爱视频 家庭乱伦网站快播 偷拍自拍国产毛片 金发美女也用大吊来开包 缔D杏那 yentiyishu人体艺术ytys WWWUUKKMCOM 女人露奶 � 苍井空露逼 老荡妇高跟丝袜足交 偷偷和女友的朋友做爱迅雷 做爱七十二尺 朱丹人体合成 麻腾由纪妃 帅哥撸播种子图 鸡巴插逼动态图片 羙国十次啦中文 WWW137AVCOM 神斗片欧美版华语 有气质女人人休艺术 由美老师放屁电影 欧美女人肉肏图片 白虎种子快播 国产自拍90后女孩 美女在床上疯狂嫩b 饭岛爱最后之作 幼幼强奸摸奶 色97成人动漫 两性性爱打鸡巴插逼 新视觉影院4080青苹果影院 嗯好爽插死我了 阴口艺术照 李宗瑞电影qvod38 爆操舅母 亚洲色图七七影院 被大鸡巴操菊花 怡红院肿么了 成人极品影院删除 欧美性爱大图色图强奸乱 欧美女子与狗随便性交 苍井空的bt种子无码 熟女乱伦长篇小说 大色虫 兽交幼女影音先锋播放 44aad be0ca93900121f9b 先锋天耗ばさ无码 欧毛毛女三级黄色片图 干女人黑木耳照 日本美女少妇嫩逼人体艺术 sesechangchang 色屄屄网 久久撸app下载 色图色噜 美女鸡巴大奶 好吊日在线视频在线观看 透明丝袜脚偷拍自拍 中山怡红院菜单 wcwwwcom下载 骑嫂子 亚洲大色妣 成人故事365ahnet 丝袜家庭教mp4 幼交肛交 妹妹撸撸大妈 日本毛爽 caoprom超碰在email 关于中国古代偷窥的黄片 第一会所老熟女下载 wwwhuangsecome 狼人干综合新地址HD播放 变态儿子强奸乱伦图 强奸电影名字 2wwwer37com 日本毛片基地一亚洲AVmzddcxcn 暗黑圣经仙桃影院 37tpcocn 持月真由xfplay 好吊日在线视频三级网 我爱背入李丽珍 电影师傅床戏在线观看 96插妹妹sexsex88com 豪放家庭在线播放 桃花宝典极夜著豆瓜网 安卓系统播放神器 美美网丝袜诱惑 人人干全免费视频xulawyercn av无插件一本道 全国色五月 操逼电影小说网 good在线wwwyuyuelvcom www18avmmd 撸波波影视无插件 伊人幼女成人电影 会看射的图片 小明插看看 全裸美女扒开粉嫩b 国人自拍性交网站 萝莉白丝足交本子 七草ちとせ巨乳视频 摇摇晃晃的成人电影 兰桂坊成社人区小说www68kqcom 舔阴论坛 久撸客一撸客色国内外成人激情在线 明星门 欧美大胆嫩肉穴爽大片 www牛逼插 性吧星云 少妇性奴的屁眼 人体艺术大胆mscbaidu1imgcn 最新久久色色成人版 l女同在线 小泽玛利亚高潮图片搜索 女性裸b图 肛交bt种子 最热门有声小说 人间添春色 春色猜谜字 樱井莉亚钢管舞视频 小泽玛利亚直美6p 能用的h网 还能看的h网 bl动漫h网 开心五月激 东京热401 男色女色第四色酒色网 怎么下载黄色小说 黄色小说小栽 和谐图城 乐乐影院 色哥导航 特色导航 依依社区 爱窝窝在线 色狼谷成人 91porn 包要你射电影 色色3A丝袜 丝袜妹妹淫网 爱色导航(荐) 好男人激情影院 坏哥哥 第七色 色久久 人格分裂 急先锋 撸撸射中文网 第一会所综合社区 91影院老师机 东方成人激情 怼莪影院吹潮 老鸭窝伊人无码不卡无码一本道 av女柳晶电影 91天生爱风流作品 深爱激情小说私房婷婷网 擼奶av 567pao 里番3d一家人野外 上原在线电影 水岛津实透明丝袜 1314酒色 网旧网俺也去 0855影院 在线无码私人影院 搜索 国产自拍 神马dy888午夜伦理达达兔 农民工黄晓婷 日韩裸体黑丝御姐 屈臣氏的燕窝面膜怎么样つぼみ晶エリーの早漏チ○ポ强化合宿 老熟女人性视频 影音先锋 三上悠亚ol 妹妹影院福利片 hhhhhhhhsxo 午夜天堂热的国产 强奸剧场 全裸香蕉视频无码 亚欧伦理视频 秋霞为什么给封了 日本在线视频空天使 日韩成人aⅴ在线 日本日屌日屄导航视频 在线福利视频 日本推油无码av magnet 在线免费视频 樱井梨吮东 日本一本道在线无码DVD 日本性感诱惑美女做爱阴道流水视频 日本一级av 汤姆avtom在线视频 台湾佬中文娱乐线20 阿v播播下载 橙色影院 奴隶少女护士cg视频 汤姆在线影院无码 偷拍宾馆 业面紧急生级访问 色和尚有线 厕所偷拍一族 av女l 公交色狼优酷视频 裸体视频AV 人与兽肉肉网 董美香ol 花井美纱链接 magnet 西瓜影音 亚洲 自拍 日韩女优欧美激情偷拍自拍 亚洲成年人免费视频 荷兰免费成人电影 深喉呕吐XXⅩX 操石榴在线视频 天天色成人免费视频 314hu四虎 涩久免费视频在线观看 成人电影迅雷下载 能看见整个奶子的香蕉影院 水菜丽百度影音 gwaz079百度云 噜死你们资源站 主播走光视频合集迅雷下载 thumbzilla jappen 精品Av 古川伊织star598在线 假面女皇vip在线视频播放 国产自拍迷情校园 啪啪啪公寓漫画 日本阿AV 黄色手机电影 欧美在线Av影院 华裔电击女神91在线 亚洲欧美专区 1日本1000部免费视频 开放90后 波多野结衣 东方 影院av 页面升级紧急访问每天正常更新 4438Xchengeren 老炮色 a k福利电影 色欲影视色天天视频 高老庄aV 259LUXU-683 magnet 手机在线电影 国产区 欧美激情人人操网 国产 偷拍 直播 日韩 国内外激情在线视频网给 站长统计一本道人妻 光棍影院被封 紫竹铃取汁 ftp 狂插空姐嫩 xfplay 丈夫面前 穿靴子伪街 XXOO视频在线免费 大香蕉道久在线播放 电棒漏电嗨过头 充气娃能看下毛和洞吗 夫妻牲交 福利云点墦 yukun瑟妃 疯狂交换女友 国产自拍26页 腐女资源 百度云 日本DVD高清无码视频 偷拍,自拍AV伦理电影 A片小视频福利站。 大奶肥婆自拍偷拍图片 交配伊甸园 超碰在线视频自拍偷拍国产 小热巴91大神 rctd 045 类似于A片 超美大奶大学生美女直播被男友操 男友问 你的衣服怎么脱掉的 亚洲女与黑人群交视频一 在线黄涩 木内美保步兵番号 鸡巴插入欧美美女的b舒服 激情在线国产自拍日韩欧美 国语福利小视频在线观看 作爱小视颍 潮喷合集丝袜无码mp4 做爱的无码高清视频 牛牛精品 伊aⅤ在线观看 savk12 哥哥搞在线播放 在线电一本道影 一级谍片 250pp亚洲情艺中心,88 欧美一本道九色在线一 wwwseavbacom色av吧 cos美女在线 欧美17,18ⅹⅹⅹ视频 自拍嫩逼 小电影在线观看网站 筱田优 贼 水电工 5358x视频 日本69式视频有码 b雪福利导航 韩国女主播19tvclub在线 操逼清晰视频 丝袜美女国产视频网址导航 水菜丽颜射房间 台湾妹中文娱乐网 风吟岛视频 口交 伦理 日本熟妇色五十路免费视频 A级片互舔 川村真矢Av在线观看 亚洲日韩av 色和尚国产自拍 sea8 mp4 aV天堂2018手机在线 免费版国产偷拍a在线播放 狠狠 婷婷 丁香 小视频福利在线观看平台 思妍白衣小仙女被邻居强上 萝莉自拍有水 4484新视觉 永久发布页 977成人影视在线观看 小清新影院在线观 小鸟酱后丝后入百度云 旋风魅影四级 香蕉影院小黄片免费看 性爱直播磁力链接 小骚逼第一色影院 性交流的视频 小雪小视频bd 小视频TV禁看视频 迷奸AV在线看 nba直播 任你在干线 汤姆影院在线视频国产 624u在线播放 成人 一级a做爰片就在线看狐狸视频 小香蕉AV视频 www182、com 腿模简小育 学生做爱视频 秘密搜查官 快播 成人福利网午夜 一级黄色夫妻录像片 直接看的gav久久播放器 国产自拍400首页 sm老爹影院 谁知道隔壁老王网址在线 综合网 123西瓜影音 米奇丁香 人人澡人人漠大学生 色久悠 夜色视频你今天寂寞了吗? 菲菲影视城美国 被抄的影院 变态另类 欧美 成人 国产偷拍自拍在线小说 不用下载安装就能看的吃男人鸡巴视频 插屄视频 大贯杏里播放 wwwhhh50 233若菜奈央 伦理片天海翼秘密搜查官 大香蕉在线万色屋视频 那种漫画小说你懂的 祥仔电影合集一区 那里可以看澳门皇冠酒店a片 色自啪 亚洲aV电影天堂 谷露影院ar toupaizaixian sexbj。com 毕业生 zaixian mianfei 朝桐光视频 成人短视频在线直接观看 陈美霖 沈阳音乐学院 导航女 www26yjjcom 1大尺度视频 开平虐女视频 菅野雪松协和影视在线视频 华人play在线视频bbb 鸡吧操屄视频 多啪啪免费视频 悠草影院 金兰策划网 (969) 橘佑金短视频 国内一极刺激自拍片 日本制服番号大全magnet 成人动漫母系 电脑怎么清理内存 黄色福利1000 dy88午夜 偷拍中学生洗澡磁力链接 花椒相机福利美女视频 站长推荐磁力下载 mp4 三洞轮流插视频 玉兔miki热舞视频 夜生活小视频 爆乳人妖小视频 国内网红主播自拍福利迅雷下载 不用app的裸裸体美女操逼视频 变态SM影片在线观看 草溜影院元气吧 - 百度 - 百度 波推全套视频 国产双飞集合ftp 日本在线AV网 笔国毛片 神马影院女主播是我的邻居 影音资源 激情乱伦电影 799pao 亚洲第一色第一影院 av视频大香蕉 老梁故事汇希斯莱杰 水中人体磁力链接 下载 大香蕉黄片免费看 济南谭崔 避开屏蔽的岛a片 草破福利 要看大鸡巴操小骚逼的人的视频 黑丝少妇影音先锋 欧美巨乳熟女磁力链接 美国黄网站色大全 伦蕉在线久播 极品女厕沟 激情五月bd韩国电影 混血美女自摸和男友激情啪啪自拍诱人呻吟福利视频 人人摸人人妻做人人看 44kknn 娸娸原网 伊人欧美 恋夜影院视频列表安卓青青 57k影院 如果电话亭 avi 插爆骚女精品自拍 青青草在线免费视频1769TV 令人惹火的邻家美眉 影音先锋 真人妹子被捅动态图 男人女人做完爱视频15 表姐合租两人共处一室晚上她竟爬上了我的床 性爱教学视频 北条麻妃bd在线播放版 国产老师和师生 magnet wwwcctv1024 女神自慰 ftp 女同性恋做激情视频 欧美大胆露阴视频 欧美无码影视 好女色在线观看 后入肥臀18p 百度影视屏福利 厕所超碰视频 强奸mp magnet 欧美妹aⅴ免费线上看 2016年妞干网视频 5手机在线福利 超在线最视频 800av:cOm magnet 欧美性爱免播放器在线播放 91大款肥汤的性感美乳90后邻家美眉趴着窗台后入啪啪 秋霞日本毛片网站 cheng ren 在线视频 上原亚衣肛门无码解禁影音先锋 美脚家庭教师在线播放 尤酷伦理片 熟女性生活视频在线观看 欧美av在线播放喷潮 194avav 凤凰AV成人 - 百度 kbb9999 AV片AV在线AV无码 爱爱视频高清免费观看 黄色男女操b视频 观看 18AV清纯视频在线播放平台 成人性爱视频久久操 女性真人生殖系统双性人视频 下身插入b射精视频 明星潜规测视频 mp4 免賛a片直播绪 国内 自己 偷拍 在线 国内真实偷拍 手机在线 国产主播户外勾在线 三桥杏奈高清无码迅雷下载 2五福电影院凸凹频频 男主拿鱼打女主,高宝宝 色哥午夜影院 川村まや痴汉 草溜影院费全过程免费 淫小弟影院在线视频 laohantuiche 啪啪啪喷潮XXOO视频 青娱乐成人国产 蓝沢润 一本道 亚洲青涩中文欧美 神马影院线理论 米娅卡莉法的av 在线福利65535 欧美粉色在线 欧美性受群交视频1在线播放 极品喷奶熟妇在线播放 变态另类无码福利影院92 天津小姐被偷拍 磁力下载 台湾三级电髟全部 丝袜美腿偷拍自拍 偷拍女生性行为图 妻子的乱伦 白虎少妇 肏婶骚屄 外国大妈会阴照片 美少女操屄图片 妹妹自慰11p 操老熟女的b 361美女人体 360电影院樱桃 爱色妹妹亚洲色图 性交卖淫姿势高清图片一级 欧美一黑对二白 大色网无毛一线天 射小妹网站 寂寞穴 西西人体模特苍井空 操的大白逼吧 骚穴让我操 拉好友干女朋友3p