Damian Mingle; Amit Kumar

doi:10.19080/ARTOAJ.2021.25.556316

Research Article

Classify Rice Disease Using Self-Optimizing Models and Edge Computing with Agricultural Implications

**Damian Mingle^1,2,3* and Amit Kumar¹**

¹LogicPlum, Inc., Franklin, Tennessee, 37067, USA

²SwitchPoint Ventures, Nashville, Tennessee, 37215, USA

³Lipscomb University, Adjunct of College of Computing & Technology, Nashville, Tennessee, 37204, USA

Submission: March 24, 2020; Published: April 05, 2021

*Corresponding author: Damian Mingle, West McEwen Drive, Suite 300, Franklin, TN, 37067, USA

How to cite this article: Damian Mingle, Amit Kumar. Classify Rice Disease Using Self-Optimizing Models and Edge Computing with Agricultural Implications. Agri Res & Tech: Open Access J. 2021; 25 (5): 556316. DOI: 10.19080/ARTOAJ.2021.25.556316

Abstract

Rice continues to be a primary food for the world’s population. Over its complex history, dating as far back as 8,000 B.C., there have been agricultural challenges, such as a variety of diseases. A consequence of disease in rice plants may lead to no harvest of grain; therefore, detecting disease early and providing expert remedies in a low-cost solution is highly desirable. In this article, we study a pragmatic approach for rice growers to leverage artificial intelligence solutions that reduce cost, increase speed, improve ease of use, and increase model performance over other solutions, thereby directly impacting field operations. Our method significantly improves upon prior methods by combining automated feature extraction for image data, exploring thousands of traditional machine learning configurations, defining a search space for hyper-parameters, deploying a model using edge computing field usability, and suggesting remedies for rice growers. These results prove the validity of the proposed approach for rice disease detection and treatments.

Keywords:Agriculture Technology; Machine Learning Applications; Rice Production, Edge Computing; Precision Farming; Agriculture Education; Pre-Trained Models For Image Classification; Deep Learning Applications; Farming Knowledge; Rice Disease Management

Introduction

Rice supports more than half the world’s population as a primary food source [1]. The quality and quantity of rice production are significantly affected by rice disease. In general, identification of rice disease is made by visual observation of experienced producers in the field. This method requires constant surveillance from manual labor, which could be prohibitively expensive for large farms. However, with the advances in image processing and pattern recognition, a cost-effective method for disease identification is demonstrated. Advances in research continue on image processing and pattern recognition as a result of innovations with digital cameras and the increase in computational capacity. These tools have been effectively applied in many areas [2-5]. Prajapati et al., [6] developed a rice plant disease classification system after detailed experimental analysis of various techniques. Four techniques of background removal and three techniques of segmentation were empirically evaluated. It was proposed for accurate feature extraction, a centroid feedingbased K-means clustering for segmentation of disease from a leaf image was necessary. The output from K-means clustering was enhanced by removing green pixels in the disease portion. Additional feature extraction was done under three categories: (1) color, (2) shape, and (3) texture. Ultimately, Support Vector Machines was chosen to perform a multiclass classifier (Figure 1).

Generally, rice growers identify plant disease through leaves as the first source. This can be detected automatically using computer vision techniques. Until now, there have been several researchers who have conducted experiments with very little utility for rice farms. Considerations for farmers are cost, speed, ease of use, model performance, and direct impact on the field. There has been little attention to structuring a useful machine learning approach that is end-to-end in agriculture. Previous investigations have successfully demonstrated the potential of deep learning algorithms in plant disease detection; yet, the cost associated with such architecture makes this unattainable for many rice growers. The length of training time required for such deep learning models has historically been lengthy, and specialty hardware is needed. Additionally, the expertise necessary to maintain and optimize deep learning network hyper-parameters, such as (a) a comparison of activation functions like ReLU, Sigm, and Tanh, (b) learning rate, (c) quantity of neurons per layer, (d) quantity of hidden layers, and (e) dropout regularization remains unrealistic. Much of the research to date has been concerned with many pre-processing steps and augmentation techniques for images to maximize model performance: (a) resize images, (b) denoise, (c) segmentation, and (d) morphology. In almost all the research, model performance has suffered from over-fitting, evidenced by high accuracy scores for training sets but significantly lower accuracy for validation sets.

Given that growers value more what is likely to happen in day-to-day utilization, the emphasis on a practical solution suggests validation scores matter more than training scores. It will measure how well a solution performs. Lastly, there is little to no connection between identifying plant disease and what action rice farms should do next to experience the benefit of an algorithm detecting a plant disease early. In this work, we studied the benefits of crafting an end-to-end solution for rice farmers using an automated machine learning platform with the aim of building a production-grade solution for agriculture that provides real-time decision support for rice farms. We combine several methods, namely, employing automated feature extraction for image data, exploring thousands of possible traditional machine learning configurations, defining a search space for hyper-parameters, deploying a model built for edge computing for field usability, and suggesting remedies for rice growers. This journal article comprises the following sections: methods and materials, results, discussion, and conclusion.

Methods and Materials

Data Acquisition

The dataset contains 120 jpeg images of disease-infected rice leaves. There are 3 classes of images based on the type of disease, each containing 40 images, and captured with a NIKON D90 digital SLR camera with 12.3 megapixels. This dataset was curated by the research team at the Department of Information Technology, Dharmsinh Desai University, and is made publicly available. The authors gathered these leaves from a rice field in a village called Shertha in Gujarat, India, and in consultation with farmers, grouped the leaves into the aforementioned-diseases categories (Figure 2).

LogicPlum

As part of our research and analysis, we opted to use an A.I. innovation platform named LogicPlum. LogicPlum includes a library of proprietary and open-source model types, including linear, non-linear, and deep learning approaches [7]. While manual interventions are possible during model development, in this study, the autonomous model builder was used; specifically, the platform was provided only with the original data images, and it decided appropriate configurations for machine learning models automatically. Additionally, we chose two different autonomous run types within the LogicPlum platform. The first was Rapid mode, which is designed for model development under 5 minutes. We also used Intensive mode, which is intended for model development that allows for an undefined time but stops after several rounds of non-improvement with a given model evaluation metric. The software considers several families of algorithms and ranks them according to model performance based on the machine learning task. Lastly, a combination of base models is automatically evaluated, and a subsequent composite model is tested for increased lift before a final solution.

Deep Learning for Computer Vision

Within research, education, and industry applications, the most essential step in a computer vision process is to extract features from the images in a dataset. In this context, a feature is a tangible piece of information about a given image, such as color, line, edges, etc. A model needs to observe in order to learn the characteristics of a given image and thereby classify it correctly. Traditional machine learning approaches allow for several different feature extraction methods, which require manual feature selection and engineering. This process relies heavily on domain knowledge, both in computer vision and rice plant disease, to create model inputs that make machine learning algorithms work better. To increase speed to market for the solution and eliminate the need for expertise in machine learning and plant pathology, we explored automatically extracting features using deep learning. The network automatically extracts features and learns their importance based on the output by applying weights to its connections. In practice, an individual feeds the raw image to the network and, as it passes through the network layers, the network identifies patterns within the image to create features.

We use the SqueezeNet network to extract features from the images. SqueezeNet is a lightweight architecture that is extremely useful in low bandwidth scenarios like mobile platforms and has ImageNet accuracy similar to AlexNet, the convolution neural network that began the deep learning revolution in 2012 (Figure 3). The first layer demonstrates that the first layer is a squeeze layer comprised of a 1×1 convolution that reduces the amount of channels, for example, from 64 to 16 in each image. The squeeze layer aims to compress the data so that the 3×3 convolution does not need to learn so many parameters. This is followed by an expand block with two parallel convolution layers: one with a 1×1 kernel, the other with a 3×3 kernel. These convolution layers also increase the quantity of channels again, from 16 back to 64. Their outputs are joined together so the output of this fire module has 128 channels overall. SqueezeNet has 8 of these Fire modules in succession, sometimes with max-pooling layers between them. There are zero fully-connected layers. At the end of the process is a convolution layer that performs the classification, followed by the global average [8] (Figure 4).

Determined Architecture with Rapid Mode

Modeling with ExtraTrees

ExtraTrees Classifier was selected as the top performer. This classifier fits many randomized decision trees on numerous sub-samples of the dataset and uses averaging to enhance the predictive accuracy and control over-fitting. ExtraTrees is considered a perturb-and-combine technique specifically designed for trees. Effectively this means that a diverse set of classifiers is designed by introducing randomness in the classifier construction. The prediction of the collection of weak learners is given as the averaged prediction of the individual classifiers (Figure 5).

Determined Architecture with Autonomous Mode

For a composite model to outperform base models, some samples must be better predicted by one model, and other samples by another model. Stacking is an ensemble learning technique to bring together multiple classification models via a meta-classifier [9]. LogicPlum extends the standard stacking algorithm using cross-validation to arrange the input data for the level-2 classifier.

In the usual stacking procedure, the first-level classifiers fit the same training set used to arrange the inputs for the level-2 classifier, which may lead to overfitting. However, the LogicPlum approach uses the concept of cross-validation: the dataset is split into k-folds, and in k successive sequences, k-1 folds are used to fit the first level classifier. The first-level classifiers are then utilized on the remaining 1 subset that was not used for model fitting in each iteration in each round. The resulting predictions are then stacked and provided – as input data – to the second-level classifier. After the training of the StackedCVClassifier, the first-level classifiers are fit to the entire dataset, as illustrated in the figure below. More formally, the Stacking Cross-Validation algorithm can be summarized as follows: (Table 1)

Modeling with Stochastic Gradient Descent

This estimator applies regularized linear models with stochastic gradient descent learning; the loss’s gradient is estimated each sample at a time. The model is revised along the way with a decreasing learning rate [10]. This implementation makes use of data represented as dense or sparse arrays of floating-point values for the features (Figure 6). The model it fits can be monitored with the loss parameter; by default, it fits a linear support vector machine. The regularizer is a consequence added to the loss function that shrinks model parameters in the direction of the zero vector using the squared Euclidean norm (L2), the absolute norm (L1), or a mixture of both (Elastic Net). Many hyperparameters were considered in optimizing for the Stochastic Gradient Descent Classifier. The constant that multiplies the regularization term, alpha, is set to 0.0001. In general, the higher the value, the stronger the regularization. We did not compute the average Stochastic Gradient Descent weights across all updates and therefore did not store the results as coefficients. We did not set class weights, and thus, all classes are assigned to a weight of one. Early stopping was not engaged, forcing us to not terminate training when validation scores did not improve. The initial learning rate set was 1.0. We did not assume the data was already centered, and chose to estimate the intercept. We used a squared hinge loss function that is equivalent to Support Vector Classification, but is quadratically penalized. For the exponent for inverse scaling learning rate, we used a power_t =0.1. We set the maximum number of passes over the training data to be 1,000. The L1 ratio is defined with a range of 0 to 1, and we set it to 1.0. We used Elastic Net as the penalty term, which brought sparsity to the model. The learning rate schedule used was inverse scaling,

Where eta0 and t power _t are hyperparameters chosen by LogicPlum.

Modeling with Gaussian Naïve Bayes

We implemented the Gaussian Naïve Bayes algorithm for classification. The likelihood of the features is believed to be Gaussian:

Where the parameters y σ and y μ are estimated using maximum likelihood.

The classes’ prior probabilities were not specified as part of our experiment and therefore were not adjusted according to the data. It was determined that variable smoothing should be 1e-9, which was the most considerable variance of all features added to variances for calculation stability.

Connected to farmers for usability

Results

Evaluation Metrics

We use a ground-truth-based approach to compare the results of various machine learning models. Ground truth is a term used in multiple fields to refer to the information provided by direct observation instead of the information provided by inference. We understood the machine learning task to be a multiclass classification problem that could be realized in a binary classification model framework.

The model results were compared concerning the ground truth as follows:

Given the definitions of terms within table 2, we can generate standard evaluation metrics for machine learning classification models:

Accuracy is defined as the number of items correctly identified as either true positive or true negative out of the total number of items. Mathematically described as,

Recall is defined as the number of items correctly identified as positive out of the total actual positives. Mathematically described as,

Precision is defined as the number of items correctly identified as positive out of the total items identified as positive. Mathematically described as,

F1 score is defined as the harmonic average of precision and recall, measures the effectiveness of identification when just as much significance is given to recall as to precision. Mathematically, described as,

Macro Average computes the metric independently for each class then averages the results. Mathematically, described as,

Weighted average weights are calculated by the frequency of a class. Mathematically, described as,

LogicPlum randomly selected 30 images from each class and formed a training dataset of 90 images. The remaining 30 images were portioned as a test set. The data in both train and test consisted of 513 features (Table 3).

Model Performance

Statistics

Our primary evaluation metric for model performance was accuracy. We observed accuracy of 0.90 across all rice plant disease classifications on the Rapid model’s validation dataset. Recall, as it relates to Leaf Smut, is the lowest secondary evaluation metric for model performance. This measure aims to answer the question, “What proportion of actual positives was identified correctly?” In the context of the Intensive model, which was completed in 60 minutes, we observed accuracy of 92.5% across all rice disease classes. However, the lowest secondary measure is recall as it relates to Leaf Smut (Table 4).

To completely evaluate the effectiveness of a model, we must examine both precision and recall. Precision and recall are often in tension. That is, improving precision typically reduces recall and vice versa. Thus, many machine learning practitioners rely on the F1 score, which combines the effects of both precision and recall. An F1 score is considered perfect if it reaches 1.0. When comparing the F1 score from both the Rapid and Intensive mode, we can observe that the Intensive mode does significantly better at classifying Leaf Smut than the Rapid mode, with a 15.68% increase. It is worth noting that while the Intensive mode is superior in almost every respect, it does show a percentage decrease of 3.21% when considering the F1 score for Bacterial Leaf Blight.

Confusion Matrix

Below are the confusion matrices for both models.

Table 5 illustrates where the Rapid mode made incorrect predictions: Leaf Smut for the True Class should be 11, and instead we have 7. We incorrectly classified 4 Leaf Smut cases as Bacterial Leaf Blight in two instances and Brown Spot in the remaining instances (Table 6). In the case of the Intensive mode, there was misclassification that occurs in two classes, Brown Spot and Leaf Smut. However, the total misclassification rate for Intensive was lower by 25% over Rapid mode. Additionally, Bacterial Leaf Blight offered new improvement, and Brown Spot created some minor confusion for the Intensive mode.

Comparative Accuracy of Models

Our experiment was conducted in LogicPlum cloud and only leveraged a CPU configuration. As seen in Table 4, we achieve test accuracy of 90% with the Rapid results model, whereas with the Intensive results model, accuracy goes up to 92.5%. Barring one image, all the test images belonging to Bacteria Leaf Light and Brown Spot are correctly classified.

Discussion

Summary of conclusions

This paper proposed two new approaches for detecting disease in rice plants, Rapid mode and Intensive mode, using a meager number of images for training a classifier. We provide an indepth analysis of our methods, which outperform the original paper results on the same dataset with significantly fewer machine learning techniques. Future work involves exploring the edge computing capabilities of these methods.

Relation to other results

We achieved 90.0% on the test dataset with Rapid mode, which builds the A.I. solution from data upload to prediction within 2 minutes. Additionally, we achieved 92.5% accuracy on the test dataset, which has a training time that completes within 60 minutes. Both approaches increase detection accuracy for rice plant disease over the prior research, which achieved 73.33% accuracy on the dataset [6]. As it relates to model performance, the Rapid mode exhibits a 22.73% increase over the prior research, while the Intensive mode demonstrates a 26.14% percent increase. Furthermore, we reduced the number of technical steps taken by practitioners in the preceding study, from 11 steps to 5 steps in the case of Rapid mode, and 6 steps in the Intensive mode—a 54.54% and 45.45% decrease, respectively, over the prior research (Figure 7).

Prior methods

This paper evaluated four techniques of background removal by applying masks generated based on the following: (1) original image, (2) hue component values of the image in HSV color space, (3) value components of the image in HSV color space, and finally (4) saturation component values of the image in HSV color space. Three techniques of segmentation were utilized: (1) LAB color space based K-means clustering, (2) Otsu’s segmentation technique, and (3) HSV color space based K-means clustering. Using various features under three categories: color, texture, and shape, the authors extracted 88 features from the disease portion of a leaf image. Finally, the paper used Support Vector Machines with Gaussian kernel for multiclass classification of the leaf images.

Implications

Edge computing for smartphone users

Edge computing has the capability to address the concerns of bringing machine learning approaches to the farming fields. Specifically, edge computing deals with response time requirements, battery life consumption, bandwidth cost savings, and data safety and privacy. Edge computing is at the center of several IoT agricultural applications, such as pest identification, safety traceability of farm products, unmanned agrarian machinery, agrarian technology promotion, and in this case, classifying diseases from the images of rice leaves purely because of its speed and efficiency compared to the cloud infrastructure. It offers a potentially tractable model for mainstreaming smart agriculture [11]. Agriculture IoT systems can make informed decisions in the field when using edge computing [12].

We propose an approach that allows for access to our A.I. solution without an internet connection in the field. Figure 8 (A) illustrates the process of a farmer in a field who needs access to rice plant disease classification via her smartphone and does not have access to a network connection. The farmer can make use of the classification algorithm as it is embedded on the phone. (B) demonstrates that the trained model is converted to a LogicPlum Lite file type, which is how the model becomes executable on a mobile phone device. Figure 8.C exemplifies the concept of returning to a location that supplies network connection, and a transfer occurs. If an update exists, then an update is made available.

Borrowing knowledge from plant experts

Making expert plant knowledge readily available to farmers in the field promises a meaningful impact. Edge computing allows farmers with a mobile app to capture the image of infected rice leaf and classify the disease, thereby greatly reducing the need for consultation with plant pathologists, which can be a time-consuming process. Furthermore, once a condition is detected, appropriate expert measures can be applied with various management strategies, namely preventive methods, cultural methods, and chemical methods (Figure 9). The Next Action Model is built on a concept of just-in-time learning, which meets farmers where they are instead of requiring structured education to form concept knowledge. The advent of our machine learning approach, coupled with edge computing and remedies for specific management strategies of rice plant disease, shifts farming further into the 21st century. Many education areas have evolved into a self-paced process of finding information or learning skills exactly when and where they are needed, and farming is no different. Our approach offers flexible delivery of learning to farmers with an “anytime, anyplace” framework. This approach allows farmers to independently access information from plant pathology databases in the context of what they observe within their environment. This approach is linked to the idea of lifelong learning, learner-driven learning, and project- based learning. We have organized expert remedies for each of the three rice disease classes we analyzed: Rice Leaf Blight, Brow Spot, and Leaf Smut. According to Tamil Naud Agricultural University, each of these rice diseases has three management strategies categorized as preventive, cultural, and chemicall (Tables 8-14).

Data science knowledge

Our approach leverages an automated machine learning process that allows for rapid experimentation on real-world problems. This approach covers the entire process from beginning to end, more specifically, from uploading the data to the deployment of a machine learning classifier, with little to no human interaction. This approach has data science expertise built into the process, offering guardrails for lay users of machine learning. In this approach, the emphasis is placed on the creative use of the technology rather than the details of a given algorithm.

Effects of age, educational level, and adoption of farming practices

Children who were raised on family farms are familiar with the farming practices that have proven successful for their parents. So, even when younger family members don’t make identical decisions to those of their parents, their decisions will continue to be informed by years spent under their parents’ guidance [13]. This is known as multi-generational farming, which often doesn’t involve technology in agriculture.

According to Moore’s law, computer processing speed doubles every 18 or so months, and a generation is generally understood to be between 20 and 30 years. This means that processing speeds may double 20 times during a given farming generation, allowing for more insight and actionable machine learning models (Koleva, 2021). Although former generations may not have been raised with digital technology, such significant enhancements in machine learning model performance, along with edge computing, should encourage adoption within agriculture, requiring new behaviors and ways of thinking. We believe that just like rakes, hoes, and shovels are essential for today’s farmers, machine learning will be added to the basic set of farming tools in the 21st century.

Digital farming techniques

Our approach is additive in the context of modern agricultural methods. Successfully delivering productive and sustainable agricultural systems worldwide will help form the foundations for overcoming food insecurity and hunger. Economic viability makes edge commuting one of the emerging technologies staged to transform the agricultural industry. With sensors, actuators, and real-time data-driven models, digitization can help us overcome some of the biggest challenges of our time [14]. Autonomous tractors and robotic machinery, often known as Agribots, can run on autopilot, communicating with nearby sensors to acquire the necessary data about the surrounding environment. The introduction of drones has shown great promise with agricultural implications. These unmanned aerial vehicles can help in various ways, such as monitoring livestock and crop growth, and increasing output with real-time insights. Additionally, the introduction of the 5G mobile network, which is designed to connect virtually everyone and everything together, including machines, objects, and devices, will further drive the adoption of digital farming techniques.

Precision farming

Technology has become an imperative consideration for every stakeholder involved in agriculture, starting from farmer to agronomist. Precision farming makes farming more accurate and controlled when it comes to growing crops and raising livestock. It can decide on and carry out the best technical intervention in the right place at the best possible moment. It makes it simpler to plan ahead of time and to act precisely in terms of space. A vital component of the precision farming management approach is the use of technology with its wide array of instruments, such as robotics, drones, variable rate technology, sensors, GPS-based soil sampling, telematics, and software. A balance must be found between precision farming, capable of determining the correct, limited scale of mediation at the right time and in the right place, and a preventive, systemic approach empowering a cultivated ecosystem to produce without the need for curative treatments. Digital technology will make it possible for targeted interventions, through data processing, forecasting and anticipating, simulating, and safeguarding [15].

Conclusion

The best prediction statistics were achieved with a Gaussian Naïve Bayes stacked classifier that used Stochastic Gradient Descent Classifiers predictions as model inputs. The automated model construction approach resulted in a validation set of 92.5% accuracy. Therefore, it can be recommended for use, with little to no involvement from a machine learning expert or trained plant pathologist. Our approach ranged from as much as 60 minutes in total time to 2 minutes. Since our method was automated compared to a manually crafted process, it is faster loading the data, model construction, optimization, and deployment. This method is inexpensive compared to other methods, not only in time but in economic terms, as our method only uses CPU rather than GPU architecture. Our approach cut the number of steps in half compared to prior methods and is also self-optimizing, permitting users of this approach to be hands-free. Additionally, our process does not end with the identification of rice plant disease. Instead, we combined management strategies for specific rice diseases from known plant experts using edge computing. This was chosen to increase accessibility to the machine learning approach, and allows for our system to meet more farmers where they are and when they need it [16-20].