ORIGINAL RESEARCH article

Front. Plant Sci., 02 May 2025

Sec. Technical Advances in Plant Science

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1499909

This article is part of the Research TopicHarnessing Machine Learning to Decode Plant-Microbiome Dynamics for Sustainable AgricultureView all 16 articles

Optimized classification of potato leaf disease using EfficientNet-LITE and KE-SVM in diverse environments

Gopal Sangar*Gopal Sangar*Velswamy RajasekarVelswamy Rajasekar
  • Department of Computer Science and Engineering, Faculty of Engineering and Technology, SRM Institute of Science and Technology, Vadapalani, Chennai, India

Introduction: Potatoes are a vital global product, and prompt identification of foliar diseases is imperative for sustaining healthy yields. Computer vision is essential in precision agriculture, facilitating automated disease diagnosis and decision-making through real-time data. Inconsistent data in uncontrolled contexts undermines classic image classification techniques, hindering precise illness detection.

Methods: We present a novel model that integrates EfficientNet-LITE for enhanced feature extraction with KE-SVM Optimization for effective classification. KE-SVM Optimization cross-references misclassified instances with correct classifications across kernels, iteratively refining the confusion matrix to improve accuracy across all classes. EfficientNet-LITE improves the model's emphasis on pertinent features through Channel Attention (CA) and 1-D Local Binary Pattern (LBP), while preserving computational economy with a reduced model size of 12.46 MB, fewer parameters at 3.11M, and a diminished FLOP count of 359.69 MFLOPs.

Results: Before optimization, the SVM classifier attained an accuracy of 79.38% on uncontrolled data and 99.07% on laboratory-controlled data. Following the implementation of KE-SVM Optimization, accuracy increased to 87.82% for uncontrolled data and 99.54% for laboratory-controlled data.

Discussion: The model's efficiency and improved accuracy render it especially appropriate for settings with constrained computational resources, such as mobile or edge devices, offering substantial practical advantages for precision agriculture.

1 Introduction

Crop and plant diseases lead to substantial revenue drops, incurring elevated disease management expenses and financial losses for farmers globally. Potatoes serve as a fundamental food source in India, which ranks as the second-largest producer globally, contributing over 15% to worldwide potato production. In India, potatoes are grown on around 2 million hectares, yielding 56 million tons (Mishra et al., 2024), thereby playing a crucial role in food security and the economy of agriculture. Potato crops experience yield losses of 5% to 15% owing to leaf diseases (Mishra et al., 2024), necessitating the implementation of effective disease management methods. Precisely diagnosing and categorizing diseases under diverse conditions is important for effective disease management. Conventional methods (Singla et al., 2024) necessitated manual field scouting, resulting in delayed disease diagnosis. These approaches are both inefficient and subjective, depending on visual evaluations conducted by trained plant pathologists. Computer vision-based image analysis (Gulame et al., 2023; Tholkapiyan et al., 2023) has been developed to address these constraints, enabling rapid and precise disease identification. However, initial solutions primarily focused on feature engineering to define particular attributes for each illness, which is unfeasible for the extensive variety of plant species and diseases. This has concluded in increased dependency on deep learning (DL) to provide more generalized and scalable options.

In recent years, deep learning has gained prominence because to developments in Graphics Processing Units (GPUs), increased storage space, and the availability of vast datasets. Convolutional Neural Networks (CNNs) (Huang et al., 2023) have become highly favored for the recognition and classification of plant diseases owing to their capacity to independently extract and learn optimal features from images. Although they perform well in controlled settings, numerous models fail to reproduce these outcomes with field data acquired under uncontrolled conditions (Shabrina et al., 2024). To mitigate this deficiency, the EfficientNet-LITE model, based on Convolutional Neural Networks (CNN) (Haque et al., 2022; Khamparia et al., 2020; Nagaraju and Chawla, 2022; Thakur et al., 2022), was utilized to extract pertinent and advanced features from images, facilitated by the incorporation of Channel Attention (CA) (Chen et al., 2021) and 1-D Local Binary Pattern (LBP) (Rachmad et al., 2022) features. The incorporation of 1-D LBP for texture analysis from feature maps is a distinctive method that markedly improved the model’s capacity to identify complex patterns in uncontrolled settings. Additionally, Sobel edge-detected samples were incorporated into the improved dataset, providing an innovative method to improve edge information during training. Furthermore, KE-SVM Optimization (Deepti, 2023; Shrivastava et al., 2023) was employed to enhance classification by optimizing (Sorensen and Nielsen, 2018) SVM kernels and producing superior prediction data. This integrated methodology attained elevated precision in both regulated laboratory settings and demanding outdoor environments. The primary contributions of the paper are outlined below.

● The EfficientNet-LITE model, with the innovative incorporation of Channel Attention and the original utilization of 1-D Local Binary Pattern features, substantially enhanced the accuracy of plant disease classification, especially in severe uncontrolled situations. This distinctive integration enabled the model to concentrate more efficiently on pertinent image attributes.

● The incorporation of Sobel edge-detected samples into the supplemented dataset greatly enhanced the model’s capacity to capture and leverage edge information, consequently raising classification performance.

● The KE-SVM Optimization utilized a kernel ensemble and presented an innovative method to enhance the confusion matrix by revisiting misclassified samples and accurately categorizing them with other kernels. This novel approach successfully reduced the constraints of conventional SVMs, resulting in enhanced classification efficiency across various datasets.

● The integration of EfficientNet-LITE with KE-SVM Optimization demonstrated a revolutionary methodology that attained higher accuracy and resilience. The model effectively generalized over both controlled and uncontrolled datasets.

● This research introduced an innovative, rapid, precise, and dependable approach for classifying plant diseases, thereby enhancing agricultural disease management, potentially reducing yield losses, and enabling informed decision-making for farmers.

Effective management of plant diseases requires timely and precise identification and classification. Development in artificial intelligence and machine learning has resulted in substantial enhancements in automated disease detection. This review examines contemporary methodologies and technologies, concentrating on image processing and deep learning models applied to various crops, with the objective of summarizing current achievements and pinpointing research opportunities.

Nabila Husna Shabrina et al. revealed shortcomings in the PlantVillage dataset for the diagnosis of potato leaf diseases in real-world scenarios. To resolve this, they presented a novel dataset of 3,076 pictures obtained in uncontrolled settings, encompassing seven disease varieties. This dataset offers a more precise depiction of potato leaf conditions. Testing EfficientNetV2B3 (Shabrina et al., 2024) resulted in 73.63% accuracy on the new dataset, in contrast to 98.15% on PlantVillage.

Aanis Ahmad et al. investigated (Ahmad et al., 2023) the generalization capacity of deep learning (DL) models for diagnosing corn diseases in field conditions using many datasets, including PlantVillage, PlantDoc, Digipathos, NLB, and a proprietary CD&S dataset. Five deep learning architectures—InceptionV3, ResNet50, VGG16, DenseNet169, and Xception—were trained utilizing diverse dataset pairings. DenseNet169 exhibited enhanced performance, achieving an accuracy of 81.60% using RGBA images from the CD&S dataset after background removal. Furthermore, the amalgamation of field-acquired and laboratory data, encompassing sources from PlantVillage and CD&S, yielded an accuracy range of 77.50% to 80.33%, hence improving model generalization for field application.

Penghui Gui et al. tackled the issue of identifying plant diseases in uncontrolled field environments. They proposed an enhanced CNN model for field plant (Gui et al., 2021) disease identification (FPDR), incorporating strategies such as backdrop substitution and leaf resizing to optimize data augmentation. To improve feature differentiation, they employed a channel orthogonal constraint and utilized species categorization as a supplementary task. Utilizing the proprietary Field-PlantVillage (Field-PV) dataset, comprising 665 field photos, the model attained an accuracy of 72.03%, representing a substantial enhancement from 41.81%, despite being exclusively trained on the PlantVillage dataset.

A. Ubaidillah et al. sought to improve the categorizing of corn diseases using Random Forest, Neural Network, and Naive Bayes (Ubaidillah et al., 2022) techniques. The study utilized a compilation of corn leaf photographs obtained from agricultural regions in the Madura Region, concentrating on four classifications: healthy, gray leaf spot, blight, and common rust. The Neural Network technique outperformed the alternatives, with an AUC of 90.09%, a classification accuracy of 74.44%, an F1-score of 72.01%, precision of 74.14%, and recall of 74.43%, so establishing it as the most effective model for detecting maize diseases.

Priyanka Sahu and associates proposed a Deep-Dream (DD) architecture (Sahu et al., 2023) for Crop Leaf Disease Detection (CLDD), amalgamating deep learning (DL) with machine learning (ML) techniques. The study utilized the tomato crop dataset from PlantVillage and created 24 Hybrid Deep Neural (HDN) models, utilizing EfficientNet (B0-B7) as a feature extractor in conjunction with classifiers such as Random Forest (RF), AdaBoost (ADB), and Stochastic Gradient Boosting (SGB). The DD-EffiNet-B4-ADB model achieved optimal accuracy, ranging from 84% to 96%.

Hieu Phan et al. presented a deep learning approach utilizing Simple Linear Iterative Clustering (SLIC) segmentation (Phan et al., 2022) to identify diseased regions on corn leaves. The study employed five pre-trained models—VGG16, ResNet50, DenseNet121, Xception, and InceptionV3—on the PlantVillage and CD&S datasets, concentrating on super-pixel classes like northern leaf blight, gray leaf spot, and common rust. One hundred models were trained using diverse segments and split ratios. DenseNet121 achieved a peak accuracy of 97.77% on the CD&S dataset, employing five segments per image and an 80:20 split. Web and mobile applications were developed for disease identification, demonstrating the effectiveness of automated disease tracking relative to manual monitoring.

Mohit Agarwal et al. devised an efficient CNN model of 8 hidden layers (Agarwal et al., 2020) for the identification of tomato illnesses, therefore alleviating the computational demands linked to pre-trained models. Their approach, assessed with the PlantVillage dataset, achieved an accuracy of 98.4%, surpassing traditional machine learning methods (94.9% with k-NN) and pre-trained models like VGG16 (93.5%). The research employed image pre-processing techniques to enhance efficiency, achieving an accuracy of 98.7% on additional datasets. This study highlights the effectiveness and efficiency of lightweight (Zhu et al., 2023) CNN (Dai et al., 2023) models for disease detection in tomato crops.

Hasibul Islam Peyal and associates developed a lightweight 2D CNN model employing deep learning for the categorization of diseases in tomato and cotton plants. The algorithm, incorporated into an Android application named “Plant Disease Classifier,” (Peyal et al., 2023) proficiently categorized 14 classifications, consisting of 12 diseased and 2 healthy categories. Despite having fewer variables than pre-trained models like VGG16, VGG19, and InceptionV3, it achieved an impressive average accuracy of 97.36%, with precision, recall, and F1-scores around at 97%, and an Area under Curve (AUC) score of 99.9%. The utilization of Grad-CAM for visual interpretations and the model’s rapid classification time of around 4.84ms highlight its efficiency and effectiveness in disease detection.

Qiang Dai et al. created DATFGAN, a generative adversarial network that employs dual-attention and topology-fusion techniques to enhance the identification of agricultural disease photos. DATFGAN (Dai et al., 2020) improves image clarity and resolution, alleviating issues related to unclear images that hinder identification accuracy. The network’s weight-sharing approach reduces the parameter count, and actual evidence demonstrates that DATFGAN produces visually superior results and significantly outperforms existing methods in practical identification tasks.

Junde Chen et al. developed the Crop Disease Recognition Model (CDRM), including the Location-wise Soft Attention mechanism (Ubaidillah et al., 2022) into a pre-trained MobileNet-V2 to enhance the detection of subtle lesion features. This model addresses challenges associated with chaotic backgrounds and variable lighting in crop disease images. The study’s experimental results demonstrated an average accuracy of 99.71% on an open-source dataset, with a 99.13% accuracy in challenging conditions. The proposed method outperforms prior dominant techniques, showcasing its effectiveness and robustness in detecting agricultural illnesses.

Rabbia Mahum et al. proposed an enhanced deep learning technique for the diagnosis and categorization of potato leaf diseases. Unlike existing methods that categorize potato leaves into two groups utilizing the Plant Village dataset, their approach classifies leaves into five separate categories: Potato Late Blight (PLB), Potato Early Blight (Feng et al., 2023) (PEB), Potato Leaf Roll (PLR), Potato Verticillium Wilt (PVw), and Healthy (PH). Their model achieved an accuracy of 97.2% by utilizing a pre-trained Efficient DenseNet (Mahum et al., 2022) model, integrating an additional transition layer, and implementing a reweighted cross-entropy loss function. This method effectively tackles class imbalance and overfitting, offering a robust solution for comprehensive disease classification in potato leaves.

Zubair Saeed and associates developed a deep learning system focused on computer vision for the early detection and classification of potato leaf diseases. Utilizing deep convolutional neural networks (Saeed et al., 2021), specifically ResNet-152 and InceptionV3, trained on the Kaggle potato dataset, their methodology achieved accuracies of 98.34% and 95.24%, respectively, with a learning rate of 0.0005. The method precisely classifies potato leaves into three categories: healthy, early blight, and late blight. This method aims to mitigate economic losses by enabling the prompt detection of disease outbreaks through accurate image-based categorization.

Kashif Shaheed et al. developed EfficientRMT-Net, a novel model that combines Vision Transformer (ViT) with ResNet-50 (Shaheed et al., 2023) for the automated detection and classification of potato leaf diseases. This technology addresses the limitations of traditional methods, such as labor-intensive procedures and inadequate illness detection. EfficientRMT-Net utilizes CNN for feature extraction, depth-wise convolution to reduce processing demands, and a stage block architecture to enhance scalability and sensitivity. The model, trained on bespoke datasets, achieved accuracies of 97.65% on a generic dataset and 99.12% on a tailored potato leaf dataset. EfficientRMT-Net offers a dependable approach for accurate disease classification, consequently improving crop yield and resource efficiency.

Mingjie Lv and associates devised a maize leaf disease recognition method to tackle challenges including variable lighting and complexities in feature extraction. Their methodology integrates a maize leaf enhancement framework and the DMS-Robust AlexNet, an advanced neural network (Lv et al., 2020) based on AlexNet. This network incorporates dilated and multi-scale convolutions to improve feature extraction. It utilizes batch normalization to reduce overfitting, with the PReLU activation function and Adabound optimizer to improve convergence and precision. Experimental results demonstrate that this technique significantly enhances disease identification in complex scenarios, providing a dependable alternative for advanced plant disease diagnostics.

Hatice Catal Reis and Veysel Turk developed the Multi-head Attention Mechanism Depthwise Separable Convolution Inception Reduction Network (MDSCIRNet) for the early identification of potato leaf diseases. This deep convolutional neural network utilizes depthwise separable convolutions and a multi-head attention mechanism to enhance classification accuracy. MDSCIRNet (Reis and Turk, 2024) achieved an accuracy of 99.33% by combining deep learning with SVM, outperforming contemporary algorithms such as Xception and MobileNet, as well as traditional methods like SVM and Random Forest. The study highlights the effectiveness of MDSCIRNet in improving early disease detection and reducing financial losses for agricultural producers.

Xiangpeng Fan and Zhibin Guan address critical challenges in maize disease identification with their proposed VGNet, a system that employs a pretrained VGG16 model. VGNet incorporates batch normalizing, global average pooling, and L2 normalization to enhance performance. Utilizing transfer learning and the Adam optimizer, the model achieves an accuracy of 98.3% with a learning rate of 0.001, exhibiting remarkable precision and recall for nine maize diseases. VGNet’s small architecture (Fan and Guan, 2023), requiring only 79.5 MB, enables efficient processing, demonstrating effective disease recognition with a testing duration of 75.21 seconds for 230 images.

The reviewed literature demonstrates significant advancements in plant disease classification learning models, (Saritha and Thangaraja, 2023; Shahoveisi et al., 2023) using deep learning and machine learning models, yet several limitations persist. Many studies rely heavily on the PlantVillage dataset, which, while comprehensive, is collected in controlled environments and lacks diversity for real-world applications. For instance, Nabila Husna Shabrina et al. and Penghui Gui et al. highlighted the challenges of generalization in uncontrolled settings. Additionally, while methods such as DenseNet and EfficientNet have been explored, the absence of innovative feature extraction techniques, such as attention mechanisms and edge detection, limits their performance in detecting fine-grained features. Furthermore, traditional classifiers like SVMs, as used by A. Ubaidillah et al., often suffer from limitations in handling misclassified samples, reducing overall efficiency. Despite efforts to enhance accuracy, many studies fail to effectively combine lightweight models with robust optimization techniques for scalable and practical applications.

The proposed methods address these gaps by introducing EfficientNet-LITE with Channel Attention (Haider et al., 2024; Kumar et al., 2023; Navrozidis et al., 2018) and 1-D Local Binary Pattern (LBP) features, enabling precise focus on critical attributes even in uncontrolled environments. The inclusion of Sobel edge-detected samples enhances fine-detail recognition, while KE-SVM Optimization revisits and corrects misclassified samples, significantly improving classification efficiency. This integrated approach achieves superior generalization across diverse datasets, offering a fast, accurate, and reliable solution for real-world agricultural disease management, ultimately empowering farmers to reduce yield losses.

The remainder of the article is organized as follows: Section 2 outlines the structure of the feature extraction and classification model. Section 3 examines the experimental findings and analysis, while Section 4 presents the conclusions and future directions.

2 Materials and methods

The proposed approach initiates with image augmentation and Sobel edge identification to improve and diversity the dataset. Figure 1 illustrates the application of an attention-based EfficientNet-LITE model for feature extraction to identify essential leaf attributes, succeeded by KE-SVM optimization for precise classification of potato leaf diseases across diverse environments.

Figure 1
www.frontiersin.org

Figure 1. Proposed methodology for potato leaf disease classification.

2.1 Dataset collection

This work utilized two datasets for the detection of potato leaf diseases: one from an uncontrolled environment (Shabrina et al., 2024) in Indonesia and the PlantVillage Dataset (Potato Species) (Shaheed et al., 2023) from a controlled laboratory setting. The first dataset, acquired from a Kaggle source, was compiled from multiple potato farms throughout Java Island by teams from Universitas Multimedia Nusantara and Universitas Gadjah Mada. It comprises 3,076 photos categorized into seven disease types: Figure 2 (a). virus, (b). phytophthora, (c). nematode, (d). fungal, (e). bacteria, (f). pest, and (g). healthy, taken under various settings. Figure 2 presents the sample photographs for each class. Each image possesses a resolution of 1500 × 1500 pixels and is stored in.jpg format for accessibility and compatibility with image-processing software.

Figure 2
www.frontiersin.org

Figure 2. Samples of the seven categories in the potato leaf disease dataset: (a) Virus, (b) Phytophthora, (c) Nematode, (d) Fungal, (e) Bacteria, (f) Pest, and (g) Healthy.

The second dataset, PlantVillage (potato species), has 2,152 photos categorized into three classes: Healthy, Potato Late Blight, and Potato Early Blight, captured under uniform lighting circumstances with a resolution of 256 x 256 pixels. Both datasets provide a significant contrast between real-world and controlled settings for assessing model efficacy in disease diagnosis.

2.2 Preprocessing

Use bilinear interpolation (cv2.INTER_LINEAR) (Shabrina et al., 2024) to resize 1500x1500 potato leaf disease images to 224x224 pixels for machine learning models. This scaling was necessary to match image dimensions to models. We picked bilinear interpolation because it smoothed images while maintaining crucial characteristics and particulars from the high-resolution originals. Preprocessing the potato leaf disease images reduced computational effort and memory utilization, optimizing model performance and preparing the dataset for training and evaluation.

2.3 Data augmentation strategy

A complete data augmentation technique was applied to expand the training dataset of potato leaf disease image and improve the performance and resilience of the machine learning model. The initial dataset consisted of 3,076 pictures, with 2,460 allocated for training and 616 left aside for testing. Various augmentation strategies were employed to generate a more diverse and comprehensive training dataset, substantially enhancing the quantity of training samples.

Multiple fundamental augmentation methods were employed (Shabrina et al., 2024) to synthetically enlarge the training dataset. Rotation within a 20-degree range was implemented to imitate diverse viewing angles, enhancing the model’s capacity to generalize across multiple orientations. Width and height adjustments of up to 20% of the image dimensions were executed to simulate differences in image positioning. Furthermore, shear transformations with a magnitude of 0.2 were implemented to produce tilting effects, facilitating the model’s ability to manage images with perspective deviations. Zoom changes, with modifications of up to 20%, emulated various focal lengths and scales. Horizontal flips were utilized to mirror pictures and augment the model’s resilience to variations in orientation.

Sobel edge detection was employed to enhance the edges and transitions in the potato leaf disease images. Employing the OpenCV library, Sobel filters calculated gradients in both the x and y directions, yielding edge-detected representations of the source images. This technique enhanced texture and boundary information, which was integrated into the training dataset. The edge-detected images were merged with the augmented versions generated through fundamental changes, enhancing the dataset with intricate edge information.

The enhancing method was efficiently performed by processing images of potato leaf disease in phases. Each image in a batch was initially converted to float32 format and augmented to incorporate a batch dimension. Six specific augmentations were done to each image with Keras’s ImageDataGenerator class, enabling transformations including rotation, shifting, shearing, zooming, and flipping. Furthermore, Sobel edge detection was executed to produce further variations. Figures 3a–h illustrates the modified photos, accompanied by their respective labels, image names, and class names, which were subsequently gathered and preserved for model training.

Figure 3
www.frontiersin.org

Figure 3. Sample images demonstrating original and augmented versions using various techniques: (a) Original Image, (b) Rotate, (c) Flip, (d) Left Shift, (e) Sobel Edge Sample, (f) Sobel Zoom, (g) Sobel Flip, and (h) Zoom.

This augmentation method led to a significant increase in the quantity of training samples. The initial training dataset of 2,460 photos was enlarged to 14,760 augmented samples (Xiong et al., 2020), incorporating those enhanced by Sobel edge detection. The quantity of original testing samples stayed at 616 and was not increased. The augmentation of the training dataset yielded a more varied collection of images, markedly improving the model’s capacity to generalize and excel in multiple circumstances.

2.4 Feature extraction

EfficientNet-LITE is an enhanced version of the basic EfficientNetB0 (Upadhyay et al., 2024) design, specifically engineered to improve feature extraction through the strategic integration of a Channel Attention (CA) mechanism and 1-D Local Binary Pattern (LBP) for features. The improvements implemented post-Global Average Pooling layer are designed to augment the model’s capacity to concentrate on pertinent features in images of diseased potato leaves, thus improving performance while preserving computational efficiency.

EfficientNet-LITE preserves the key principles of EfficientNetB0, which optimizes network depth, width, and resolution for enhanced accuracy with reduced parameters and FLOPs, while incorporating an attention mechanism for more targeted feature extraction. Figure 4 (Reproduced from (Upadhyay et al., 2024)) shows the combination of EfficientNetB0 with Channel Attention mechanism. In contrast to EfficientNetB0, which depends exclusively on convolutional processes and depthwise separable convolutions (Reis and Turk, 2024), EfficientNet-LITE’s incorporation of Channel Attention and 1-D LBP enables the network to dynamically emphasize significant features. This produces a model that is both efficient and proficient at identifying nuanced patterns and details in potato leaf images, rendering it especially suitable for jobs demanding high accuracy with constrained computational resources.

Figure 4
www.frontiersin.org

Figure 4. The architecture of EfficientNet-LITE Model with channel attention mechanism.

The incorporation of the Channel Attention mechanism with 1-Dimensional LBP in EfficientNet-LITE tackles certain issues in feature extraction.

2.4.1 Channel Attention (CA)

Channel Attention operates by initially condensing the spatial dimensions of the input tensor into a channel descriptor by global average pooling. This description encapsulates the overall context for each channel, succinctly conveying its significance.

zc=1H*Wh=1Hw=1WXb,c,h,w (1)

In Equation 1, zcthe global average is pooled value for channel c, Xb,c,h,w is the value of the input tensor at batch b, channel c, height h, and width w.

The two completely connected layers subsequently convert this description into a series of attention weights. The initial fully connected layer diminishes the descriptor’s dimensionality, whereas the subsequent fully connected layer reverts it to the original channel dimension. The ReLU activation introduces non-linearity, while the sigmoid activation guarantees that attention weights remain constrained between 0 and 1.

The vector z is then passed through two fully connected (FC) layers to generate channel attention weights:

First FC Layer:     y1=ReLU(W1z+b1)(2)

In Equation 2, W1 is the weight matrix of the first fully connected layer, b1 is the bias vector of the first fully connected layer, ReLU is the Rectified Linear Unit activation function.

Second FC Layer:     y2=W2y1+b2(3)

In Equation 3, W2 is the weight matrix of the second fully connected layer, b2 is the bias vector of the second fully connected layer.

Apply a sigmoid activation function to obtain the channel attention weights:

ac=σ(y2)(4)

In Equation 4, σ is the sigmoid function, a2 is the attention weight of channel c.

Ultimately, these attention weights are employed to scale the original input tensor, accentuating channels with greater weights and reducing the influence of channels with lesser weights. This approach allows the model to concentrate on the most pertinent aspects, enhancing its capacity to derive significant information from the incoming data.

2.4.2 1-D Local Binary Pattern (1D LBP):

1-D Local Binary Pattern (1-D LBP) is a method for identifying textural features from one-dimensional data, such sequential signals or feature vectors obtained from photographs. It operates by juxtaposing each data point with its adjacent counterparts to produce a binary pattern, subsequently transformed into a decimal code. The codes are compiled into a histogram that illustrates the distribution of local textures within the data points. This approach is resilient to periodic changes and effectively identifies critical local structures, including edges and peaks. The 1-D LBP (Algorithm 1) histogram offers a concise and distinctive feature descriptor that is efficient for signal classification and texture analysis tasks.

Algorithm 1. 1–D Local Binary Pattern (1–D LBP).

www.frontiersin.org

2.4.3 Model Structure:

The Table 1 below summarizes the modified structure of EfficientNet-LITE, detailing the input and output shapes at each stage, along with the expansion factors, repeat times, and strides.

Table 1
www.frontiersin.org

Table 1. Structure of the proposed model.

The proposed EfficientNet-LITE model was meticulously engineered with a systematic arrangement of layers to attain a compromise between computing efficiency and performance. The input layer received potato leaf pictures measuring 224×224×3, which were subsequently processed through a Conv2D layer that downsampled the input to 112×112×32 with a stride of 2, thus diminishing the spatial dimensions while augmenting the channel depth. Batch Normalization and Swish Activation are utilized to stabilize and non-linearly activate the refined feature maps, priming them for the ensuing MBConv blocks.

The Swish activation function is defined Equation 5 as:

Swishx=x·σx(5)

where σx is the sigmoid function, given by Equation 6:

σx=11+ex(6)

The MBConv layers facilitate effective feature extraction by gradually diminishing spatial dimensions while augmenting the amount of channels, culminating in a dense and compact feature representation. The model subsequently employed a 1x1 convolution to refine the features, followed by global pooling and a Channel Attention mechanism, which improved the model’s capacity to concentrate on the most pertinent channels. This was succeeded by a 1-D Local Binary Pattern (LBP) layer that expanded the feature vector to 1290 dimensions by integrating texture features.

2.4.4 Performance Comparison: EfficientNet-LITE vs EfficientNet-B0

In deep learning, determines like Floating Point Operations (FLOPs), parameter count, model size, and depth are essential for evaluating the performance and efficiency of neural network models. FLOPs measure a model’s computational complexity, whereas the parameter count reflects its ability to learn and express intricate aspects. The model’s size pertains to storage demands, whereas depth frequently associates with the model’s capacity to discern complex patterns within the data.

EfficientNet-LITE had 359.69 MFLOPs, somewhat less than EfficientNet-B0’s 390. EfficientNet-LITE required fewer computational resources due to its lower FLOPs, making it ideal for mobile or edge devices. Despite adding Channel Attention and 1-D LBP features, EfficientNet-LITE maintained a computational efficiency similar to EfficientNet-B0, demonstrating its design efficiency. There are 3.11 million parameters in EfficientNet-LITE, compared to 5.3 million in B0. EfficientNet-LITE’s reduced parameters indicate a more streamlined architecture for memory-constrained applications. EfficientNet-LITE’s 12.46 MB model size was lower than EfficientNet-B0’s 20 MB due to fewer parameters. The compactness of EfficientNet-LITE accelerated model loading, memory usage, and inference times, making it better for real-time applications. Table 2 shows the size of pre-trained network model.

Table 2
www.frontiersin.org

Table 2. The model size of the main networks.

Also important is model depth, as deeper models can learn complex representations. EfficientNet-LITE had 27 layers, compared to 24 for EfficientNet-B0. This increased depth suggested that EfficientNet-LITE could capture more complex data characteristics, improving performance in sophisticated feature extraction tasks. The comparable FLOPs show that the extra depth did not reduce computing efficiency. EfficientNet-LITE balanced computational efficiency with model capacity. EfficientNet-LITE was ideal for mobile or embedded systems with limited computational resources because to its low FLOPs, parameter count, and model size. Despite being smaller, the model’s depth let it accomplish complex tasks well.

Finally, EfficientNet-LITE has fewer parameters (3.11 million) and a smaller model (12.46 MB vs. 20 MB) (Ubaidillah et al., 2022) than EfficientNet-B0. It has more layers (27 vs. 24) but fewer FLOPs (359.69 vs. 390), requiring fewer computations. EfficientNet-LITE was more resource-efficient and performed well.

2.5 KE-SVM optimization (kernel ensemble SVM optimization)

SVMs were widely employed in image classification and machine learning to define class boundaries. By translating input information into high-dimensional spaces, SVM classifiers (Sorensen and Nielsen, 2018) accurately handled complex and non-linear patterns in many applications.

However, datasets from uncontrolled environments with different backdrops, perspectives, and lighting conditions were difficult. Inconsistencies in image acquisition caused SVM kernels to struggle. Ensemble approaches (Sorensen and Nielsen, 2018) in machine learning improve performance by combining different models. This helped classify potato leaf diseases, where the dataset’s unpredictability required a more robust technique.

Kernel-Ensemble SVM (KE-SVM) Optimization used Linear, Polynomial, Radial Basis Function (RBF), and Sigmoid SVM kernels to address these issues. KE-SVM Optimization enhanced classification accuracy and discussed dataset variability by capturing different data features and integrating their predictions. KE-SVM Optimization improves classification by combining SVM kernel strengths. Figures 5, 6 shows the work flow of KE-SVM method. This method compares misclassified instances in one kernel against proper classifications in others. The optimum confusion matrix is iteratively adjusted using this ensemble technique to optimize classification accuracy across all classes.

Figure 5
www.frontiersin.org

Figure 5. Block diagram of KE-SVM optimization.

Figure 6
www.frontiersin.org

Figure 6. Misclassified samples re-evaluate with other kernels.

The novelty of this work lies in the application of Kernel-Ensemble SVM (KE-SVM) Optimization (Algorithm 2) to substantially improve classification efficacy by harnessing the advantages of several SVM kernels. Misclassified samples from the kernel exhibiting the highest accuracy were verified against predictions from alternative kernels, with those accurately classified by other kernels deemed as True Positives. The iterative modification process persisted until all classes were sufficiently addressed, resulting in significant enhancements in classification performance.

Algorithm 2. KE–SVM Optimization.

www.frontiersin.org

The potato leaf disease dataset, obtained from uncontrolled conditions, demonstrated that the SVM RBF kernel initially gave the highest performance among the kernels, attaining an accuracy of 79.38%. The Linear kernel achieved an accuracy of 72.89%, followed by the Polynomial kernel at 71.27%, and the Sigmoid kernel at 64.12%. The classification metrics and confusion matrix indicated a necessity for enhancement owing to the dataset’s heterogeneity, including differing backdrops and lighting conditions.

The SVM classifiers with different kernels attained good accuracy on the lab-controlled dataset from PlantVillage (potato species). The Polynomial kernel attained the maximum accuracy of 99.07%, succeeded by the RBF kernel at 98.84%, the Linear kernel at 98.38%, and the Sigmoid kernel at 96.06%. The classification report indicated an exceptional performance, with an overall accuracy of 1.00. The precision, recall, and F1-scores were remarkably elevated across all categories, indicating the consistent conditions of the dataset. The confusion matrix revealed minimal misclassifications, illustrating the effectiveness of the SVM Polynomial kernel in controlled laboratory circumstances.

The EfficientNet-LITE + SVM model demonstrated higher performance on datasets from both controlled and uncontrolled settings. Following KE-SVM optimization, the accuracy on the PlantVillage dataset rises to 99.54%, while on the uncontrolled environment dataset, it dramatically climbs to 87.82%, showing the model’s improved capacity to manage intricate, uncontrolled conditions.

3 Result and discussions

This study’s results are structured into three primary stages: (1) results before augmentation, (2) results before optimization, and (3) results after KE-SVM optimization. These stages comprehensively illustrate the progression in performance of the SVM classifiers when applied to controlled (PlantVillage) and uncontrolled environment datasets for diagnosing potato leaf diseases. The evaluation metrics employed include accuracy, precision, recall, F1-score, and other relevant measures to validate the model’s effectiveness. Equation 7, Equation 8, Equation 9, Equation 10 employed to calculate these measures were included to clarify the evaluation procedure.

Accuracy=TP+TNTP+TN+FP+FN(7)
Precision=TPTP+FP(8)
Recall=TPTP+FN(9)
F1 Score=2*Precision*RecallPrecision+Recall(10)

Accuracy evaluated overall correctness, whereas precision and recall examined the management of false positives and negatives. The F1-score offered a comprehensive assessment of the model’s classification performance, as illustrated in Table 3 below.

Table 3
www.frontiersin.org

Table 3. Shows the results of both datasets before optimization.

The initial experiments were conducted using the raw dataset without applying Sobel edge filtering or augmentation techniques. The SVM classifier’s performance in uncontrolled and controlled environments revealed significant room for improvement. In the uncontrolled environment dataset, the accuracy was 75.62%, while in the lab-controlled dataset, the accuracy was 98.62%. These results underscore the challenges posed by the inherent variability in the uncontrolled environment dataset.

The lab-controlled dataset demonstrated high accuracy due to reduced variability and noise. Following data augmentation with Sobel edge filtering to enhance feature extraction, the performance of the SVM classifiers was evaluated before applying the KE-SVM optimization technique. The augmented samples contributed to improved classification, particularly in uncontrolled environments.

A comprehensive examination of the SVM model was performed on the uncontrolled environment dataset utilizing four distinct kernels: Linear (Figure 7a), Polynomial (Figure 7b), RBF (Figure 7f), and Sigmoid (Figure 7c). Confusion matrices were produced for each kernel, offering insights into the model’s classification proficiency across diverse categories: 0: Virus, 1: Phytophthora, 2: Nematode, 3: Fungi, 4: Bacteria, 5: Pest, 6: Healthy. Visual representations of these matrices are provided to illustrate the model’s performance.

Figure 7
www.frontiersin.org

Figure 7. Confusion matrices of SVM kernels, AUC-ROC curve, and learning curve for the kernel with maximum accuracy (RBF). (a) Linear, (b) Polynomial, (c) Sigmoid, (d) AUC-ROC Curve, (e) Learning Curve, (f) RBF.

The overall effectiveness was evaluated by plotting the AUC-ROC curve (Figure 7d) and learning curves (Figure 7e) for the kernel exhibiting the highest accuracy. These visuals facilitated awareness of the model’s capacity to generalize to unfamiliar data. To test the model’s dependability, 5-fold cross-validation was employed. Table 4 results demonstrated constant performance across the folds, signifying the resilience of the SVM with RBF kernel, which attained the best accuracy.

Table 4
www.frontiersin.org

Table 4. 5-Fold cross validation for potato leaf disease in uncontrolled environment dataset.

The SVM model utilizing the RBF kernel exhibited robust performance, attaining an average training accuracy of 99.32 and a validation accuracy of 96.94. The minor discrepancy between these metrics signified effective generalization throughout the sample. The uniformity of results over the five folds further emphasized the model’s resilience, even in an uncontrolled setting. The RBF kernel effectively captured intricate correlations within the data, demonstrating its appropriateness for the dataset’s inherent unpredictability. The model’s excellent accuracy underscored its efficacy in classifying leaf diseases.

In the lab-controlled dataset, identical SVM kernels were utilized, and confusion matrices were produced for each kernel: Linear (Figure 8a), Polynomial (Figure 8f), RBF (Figure 8b), and Sigmoid (Figure 8c). It offers insights into the model’s categorization proficiency across different categories: 0: Early Blight, 1: Healthy, 2: Late Blight. The findings from this dataset exhibited remarkably high accuracy owing to the controlled environment, which minimized data fluctuation.

Figure 8
www.frontiersin.org

Figure 8. Confusion matrices of SVM kernels, AUC-ROC curve, and learning curve for the kernel with maximum accuracy (Polynomial). (a) Linear, (b) RBF, (c) Sigmoid, (d) AUC-ROC Curve, (e) Learning Curve, (f) Polynomial.

The model’s performance was additionally assessed by plotting the AUC-ROC curve (Figure 8d) and the learning curve (Figure 8e) for the optimal kernel. These curves demonstrated nearly flawless generalization. Consistent with the uncontrolled dataset, 5-fold cross-validation validated the model’s reliability, with Table 5 indicating minimal variance among the folds.

Table 5
www.frontiersin.org

Table 5. 5-Fold cross validation for PlantVillage dataset (potato species).

The SVM model utilizing a polynomial kernel was assessed on laboratory-controlled data, demonstrating superior performance across all five folds. The model attained an average training accuracy of 99.96 and a validation accuracy of 99.84. The training loss of 0.0004 and validation loss of 0.0024 signify little error and robust generalization in a regulated environment. The results highlight the efficacy of the polynomial kernel in managing clean, organized data, exhibiting little variability relative to uncontrolled contexts.

3.1 After optimization

After implementing KE-SVM Optimization, the model’s performance on the uncontrolled environment dataset shown significant enhancement. The optimal accuracy increased to 87.82%, accompanied by enhancements in precision to 86.77%, recall to 88.18%, and F1-score to 87.19%. The lab-controlled dataset has been somewhat enhanced to 99.54%. (Figure 9a) presents the optimized confusion matrix for uncontrolled data, while (Figure 9b) displays the matrix for the laboratory-controlled dataset.

Figure 9
www.frontiersin.org

Figure 9. Optimized confusion matrices for both datasets: (a) Uncontrolled data and (b) Laboratory-controlled data.

The optimization approach improved classification by cross-validating misclassified examples from the most effective kernel with predictions from other kernels, resulting in a more precise and balanced confusion matrix. Table 6 presents the optimal outcomes of the KE-SVM optimization technique.

Table 6
www.frontiersin.org

Table 6. Optimized results for both lab and uncontrolled dataset.

3.2 Comparative performance

The proposed model (EfficientNet-LITE + KE-SVM Optimization) exhibited substantial enhancements in accuracy relative to previous models utilized on comparable datasets. Prior to optimization, the model attained an accuracy of 79.38%, which rose to 87.82% post-optimization. This performance surpassed those of models like DenseNet121, ResNet50, and MobileNetV3-Large, which exhibited accuracies between 59.16% and 73.63%. This significant enhancement can be ascribed to the ensemble SVM kernel methodology and improved feature extraction with EfficientNet-LITE.

In the lab-controlled PlantVillage dataset, the suggested model attained nearly flawless accuracy both prior to and subsequent to KE-SVM Optimization. The model initially achieved an accuracy of 99.07%, which then increased to 99.54% during optimization. This performance surpassed other prominent models, including ResNet152, InceptionV3, and VGNet, which exhibited accuracies between 95.24% and 98.34%. The substantial enhancement upon optimization is attributable to the improved feature extraction and the strong classification capabilities of KE-SVM.

An optimized version of EfficientNetB0, EfficientNet-LITE, integrated Channel Attention (CA) and 1D Local Binary Pattern (LBP) features to increase feature extraction. This model prioritized potato leaf traits while being computationally efficient, making it ideal for resource-constrained mobile devices. KE-SVM Optimization used linear, polynomial, RBF, and sigmoid kernels to overcome typical SVM constraints. With SMOTE and confusion matrix optimization, classification accuracy improved, handling class imbalance and data variability.

The strengths of EfficientNet-LITE and KE-SVM Optimization were combined. EfficientNet-LITE’s superior feature extraction and KE-SVM Optimization’s classification created a model that could handle complex datasets. This collaboration produced high accuracy and reliable performance in all settings. The combined model exceeded expectations in early illness identification and uncontrolled environment management to satisfy research objectives. The results confirmed the model’s efficacy and versatility in solving research problems.

Tables 7 and 8 highlight the superior performance of our proposed EfficientNet-LITE + KE-SVM Optimization model compared to existing methods. Notably, the model achieved an accuracy of 87.82% on uncontrolled datasets and 99.54% on the PlantVillage dataset, surpassing models such as DenseNet121 and ResNet50. These results underscore the robustness of our approach in handling variability and improving classification accuracy. The enhanced classification accuracy of our model has significant implications for agricultural diagnostics. By addressing challenges posed by uncontrolled environments, our model paves the way for reliable and resource-efficient solutions applicable in real-world farming scenarios. This contributes to the broader goal of precision agriculture and early disease detection.

Table 7
www.frontiersin.org

Table 7. To compare the results with existing state-of-art-methods for uncontrolled dataset.

Table 8
www.frontiersin.org

Table 8. To compare the results with existing state-of-art-methods with PlantVillage(Potato) dataset.

4 Conclusion

In conclusion, our research revealed the effectiveness of combining EfficientNet-LITE with KE-SVM Optimization for the classification of potato leaf diseases. Initially, SVM classifiers demonstrated disparate performance, with the RBF kernel achieving 79.38% accuracy on uncontrolled data and the sigmoid kernel reaching 99.07% accuracy on laboratory-controlled data. Subsequent to KE-SVM Optimization, the accuracy on the uncontrolled dataset markedly increased to 0.8782, with precision at 86.77%, recall at 88.18%, and F1-score at 87.19%. Conversely, the accuracy on the lab-controlled dataset exhibited a minor enhancement to 99.54%. This integrated model adeptly tackles issues associated with early disease classification, dataset variability, and model robustness, demonstrating its versatility and dependability across many settings. Future work could explore integrating more comprehensive datasets that combine image data with clinical parameters such as plant height, size, irrigation schedules, and expert farmer insights. Additionally, leveraging generative AI techniques could provide holistic solutions for farmers, enhancing decision-making and improving crop management practices.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.

Author contributions

GS: Conceptualization, Data curation, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft, Writing – review & editing. VR: Conceptualization, Investigation, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Agarwal, M., Gupta, S. K., Biswas, K. (2020). Development of Efficient CNN model for Tomato crop disease identification. Sustain. Computing Inf. Syst. 28, 100407. doi: 10.1016/j.suscom.2020.100407

Crossref Full Text | Google Scholar

Ahmad, A., Gamal, A. E., Saraswat, D. (2023). Toward generalization of Deep Learning-Based plant disease identification under controlled and field conditions. IEEE Access 11, 9042–9057. doi: 10.1109/access.2023.3240100

Crossref Full Text | Google Scholar

Chen, J., Zhang, D., Suzauddola, M., Zeb, A. (2021). Identifying crop diseases using attention embedded MobileNet-V2 model. Appl. Soft Computing 113, 107901. doi: 10.1016/j.asoc.2021.107901

Crossref Full Text | Google Scholar

Dai, Q., Cheng, X., Qiao, Y., Zhang, Y. (2020). Crop leaf Disease Image Super-Resolution and Identification with dual attention and Topology Fusion Generative Adversarial Network. IEEE Access 8, 55724–55735. doi: 10.1109/access.2020.2982055

Crossref Full Text | Google Scholar

Dai, M., Sun, W., Wang, L., Dorjoy, M. M. H., Zhang, S., Miao, H., et al. (2023). Pepper leaf disease recognition based on enhanced lightweight convolutional neural networks. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1230886

PubMed Abstract | Crossref Full Text | Google Scholar

Deepti, K. (2023). Comparative analysis of machine learning techniques for plant disease detection-data deployment. J. Institution Engineers (India) Ser. B 104, 837–849. doi: 10.1007/s40031-023-00897-w

Crossref Full Text | Google Scholar

Fan, X., Guan, Z. (2023). VGNET: A lightweight intelligent learning method for corn diseases recognition. Agriculture 13, 1606. doi: 10.3390/agriculture13081606

Crossref Full Text | Google Scholar

Feng, J., Hou, B., Yu, C., Yang, H., Wang, C., Shi, X., et al. (2023). Research and validation of potato late blight detection method based on deep learning. Agronomy 13, 1659. doi: 10.3390/agronomy13061659

Crossref Full Text | Google Scholar

Gui, P., Dang, W., Zhu, F., Zhao, Q. (2021). Towards automatic field plant disease recognition. Comput. Electron. Agric. 191, 106523. doi: 10.1016/j.compag.2021.106523

Crossref Full Text | Google Scholar

Gulame, M. B., Thite, T. G., Patil, K. D. (2023). Plant disease prediction system using advance computational Technique. J. Phys. Conf. Ser. 2601, 12031. doi: 10.1088/1742-6596/2601/1/012031

Crossref Full Text | Google Scholar

Haider, I., Khan, M. A., Nazir, M., Hamza, A., Alqahtani, O., Alouane, M. T., et al. (2024). Crops leaf disease recognition from digital and RS imaging using fusion of multi Self-Attention RBNET deep architectures and modified dragonfly optimization. IEEE J. Selected Topics Appl. Earth Observations Remote Sens. 17, 7260–7277. doi: 10.1109/jstars.2024.3378298

Crossref Full Text | Google Scholar

Haque, M. A., Marwaha, S., Deb, C. K., Nigam, S., Arora, A., Hooda, K. S., et al. (2022). Deep learning-based approach for identification of diseases of maize crop. Sci. Rep. 12. doi: 10.1038/s41598-022-10140-z

PubMed Abstract | Crossref Full Text | Google Scholar

Huang, Z., Jiang, X., Huang, S., Qin, S., Yang, S. (2023). An efficient convolutional neural network-based diagnosis system for citrus fruit diseases. Front. Genet. 14. doi: 10.3389/fgene.2023.1253934

PubMed Abstract | Crossref Full Text | Google Scholar

Khamparia, A., Singh, A., Luhach, A. K., Pandey, B., Pandey, D. K. (2020). Classification and identification of primitive kharif crops using supervised deep convolutional networks. Sustain. Computing Inf. Syst. 28, 100340. doi: 10.1016/j.suscom.2019.07.003

Crossref Full Text | Google Scholar

Kumar, V. S., Jaganathan, M., Viswanathan, A., Umamaheswari, M., Vignesh, J. (2023). Rice leaf disease detection based on bidirectional feature attention pyramid network with YOLO v5 model. Environ. Res. Commun. 5, 065014. doi: 10.1088/2515-7620/acdece

Crossref Full Text | Google Scholar

Lv, M., Zhou, G., He, M., Chen, A., Zhang, W., Hu, Y. (2020). Maize leaf disease identification based on feature enhancement and DMS-Robust Alexnet. IEEE Access 8, 57952–57966. doi: 10.1109/access.2020.2982443

Crossref Full Text | Google Scholar

Mahum, R., Munir, H., Mughal, Z., Awais, M., Khan, F. S., Saqlain, M., et al. (2022). A novel framework for potato leaf disease detection using an efficient deep learning model. Hum. Ecol. Risk Assess. an Int. J. 29, 303–326. doi: 10.1080/10807039.2022.2064814

Crossref Full Text | Google Scholar

Mishra, P., Alhussan, A. A., Khafaga, D. S., Lal, P., Ray, S., Abotaleb, M., et al. (2024). Forecasting production of potato for a Sustainable Future: Global Market analysis. Potato Res. 67, 1671–1690. doi: 10.1007/s11540-024-09717-0

Crossref Full Text | Google Scholar

Nagaraju, M., Chawla, P. (2022). Maize crop disease detection using NPNet-19 convolutional neural network. Neural Computing Appl. 35, 3075–3099. doi: 10.1007/s00521-022-07722-3

Crossref Full Text | Google Scholar

Navrozidis, I., Alexandridis, T., Dimitrakos, A., Lagopodi, A., Moshou, D., Zalidis, G. (2018). Identification of purple spot disease on asparagus crops across spatial and spectral scales. Comput. Electron. Agric. 148, 322–329. doi: 10.1016/j.compag.2018.03.035

Crossref Full Text | Google Scholar

Peyal, H. I., Nahiduzzaman, M., Pramanik, A. H., Syfullah, M. K., Shahriar, S. M., Sultana, A., et al. (2023). Plant Disease Classifier: Detection of Dual-Crop diseases using lightweight 2D CNN architecture. IEEE Access 11, 110627–110643. doi: 10.1109/access.2023.3320686

Crossref Full Text | Google Scholar

Phan, H., Ahmad, A., Saraswat, D. (2022). Identification of foliar disease regions on corn leaves using SLIC segmentation and deep learning under uniform background and field conditions. IEEE Access 10, 111985–111995. doi: 10.1109/access.2022.3215497

Crossref Full Text | Google Scholar

Rachmad, A., Syarief, M., Rifka, S., Sonata, F., Setiawan, W., Rochman, E. M. S. (2022). Corn leaf disease classification using Local Binary Patterns (LBP) feature extraction. J. Phys. Conf. Ser. 2406, 12020. doi: 10.1088/1742-6596/2406/1/012020

Crossref Full Text | Google Scholar

Reis, H. C., Turk, V. (2024). Potato leaf disease detection with a novel deep learning model based on depthwise separable convolution and transformer networks. Eng. Appl. Artif. Intell. 133, 108307. doi: 10.1016/j.engappai.2024.108307

Crossref Full Text | Google Scholar

Saeed, Z., Khan, M. U., Raza, A., Sajjad, N., Naz, S., Salal, A. (2021). “Identification of leaf diseases in potato crop using deep convolutional neural networks (DCNNs),” in 2021 16th International Conference on Emerging Technologies (ICET). (Islamabad, Pakistan: IEEE). doi: 10.1109/icet54505.2021.9689807

Crossref Full Text | Google Scholar

Sahu, P., Chug, A., Singh, A. P., Singh, D. (2023). Classification of crop leaf diseases using image to image translation with deep-dream. Multimedia Tools Appl. 82, 35585–35619. doi: 10.1007/s11042-023-14994-x

Crossref Full Text | Google Scholar

Saritha, S., Thangaraja, G. A. (2023). Prediction of crop disease using Rank Regressive Learning and Proaftn Fuzzy Classification models. Soft Computing. doi: 10.1007/s00500-023-08357-9

Crossref Full Text | Google Scholar

Shabrina, N. H., Indarti, S., Maharani, R., Kristiyanti, D. A., Irmawati, N., Prastomo, N., et al. (2024). A novel dataset of potato leaf disease in uncontrolled environment. Data Brief 52, 109955. doi: 10.1016/j.dib.2023.109955

PubMed Abstract | Crossref Full Text | Google Scholar

Shaheed, K., Qureshi, I., Abbas, F., Jabbar, S., Abbas, Q., Ahmad, H., et al. (2023). EfficientRMT-NeT—An efficient RESNET-50 and Vision Transformers approach for classifying potato plant leaf diseases. Sensors 23, 9516. doi: 10.3390/s23239516

PubMed Abstract | Crossref Full Text | Google Scholar

Shahoveisi, F., Gorji, H. T., Shahabi, S., Hosseinirad, S., Markell, S., Vasefi, F. (2023). Application of image processing and transfer learning for the detection of rust disease. Sci. Rep. 13. doi: 10.1038/s41598-023-31942-9

PubMed Abstract | Crossref Full Text | Google Scholar

Shrivastava, V. K., Shelke, C. J., Shrivastava, A., Mohanty, S. N., Sharma, N. (2023). Optimized deep learning model for disease prediction in potato leaves. EAI Endorsed Trans. Pervasive Health Technol. 9, 17. doi: 10.4108/eetpht.9.4001

Crossref Full Text | Google Scholar

Singla, A., Nehra, A., Joshi, K., Kumar, A., Tuteja, N., Varshney, R. K., et al. (2024). Exploration of machine learning approaches for automated crop disease detection. Curr. Plant Biol. 40, 100382. doi: 10.1016/j.cpb.2024.100382

Crossref Full Text | Google Scholar

Sorensen, L., Nielsen, M. (2018). Ensemble support vector machine classification of dementia using structural MRI and mini-mental state examination. J. Neurosci. Methods 302, 66–74. doi: 10.1016/j.jneumeth.2018.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

Thakur, P. S., Sheorey, T., Ojha, A. (2022). VGG-ICNN: A Lightweight CNN model for crop disease identification. Multimedia Tools Appl. 82, 497–520. doi: 10.1007/s11042-022-13144-z

Crossref Full Text | Google Scholar

Tholkapiyan, M., Devi, B. A., Bhatt, D., Kumar, E. S., Kirubakaran, S., Kumar, R. (2023). Performance analysis of rice plant diseases identification and classification methodology. Wireless Pers. Commun. 130, 1317–1341. doi: 10.1007/s11277-023-10333-3

Crossref Full Text | Google Scholar

Ubaidillah, A., Rochman, E. M. S., Fatah, D. A., Rachmad, A. (2022). Classification of corn diseases using random forest, neural network, and naive bayes methods. J. Phys. Conf. Ser. 2406, 12023. doi: 10.1088/1742-6596/2406/1/012023

Crossref Full Text | Google Scholar

Upadhyay, S. K., Jain, J., Prasad, R. (2024). Early blight and late blight disease detection in potato using Efficientnetb0. Int. J. Exp. Res. Rev. 38, 15–25. doi: 10.52756/ijerr.2024.v38.002

Crossref Full Text | Google Scholar

Xiong, Y., Liang, L., Wang, L., She, J., Wu, M. (2020). Identification of cash crop diseases using automatic image segmentation algorithm and deep learning with expanded dataset. Comput. Electron. Agric. 177, 105712. doi: 10.1016/j.compag.2020.105712

Crossref Full Text | Google Scholar

Zhu, X., Li, J., Jia, R., Liu, B., Yao, Z., Yuan, A., et al. (2023). LAD-Net: a novel light weight model for early Apple Leaf Pests and Diseases Classification. IEEE/ACM Trans. Comput. Biol. Bioinf. 20, 1156–1169. doi: 10.1109/tcbb.2022.3191854

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: EfficientNet-LITE, KE-SVM optimization, channel attention, 1-D local binary pattern, Sobel edge augmentation, uncontrolled environment data, potato leaf disease

Citation: Sangar G and Rajasekar V (2025) Optimized classification of potato leaf disease using EfficientNet-LITE and KE-SVM in diverse environments. Front. Plant Sci. 16:1499909. doi: 10.3389/fpls.2025.1499909

Received: 22 September 2024; Accepted: 27 March 2025;
Published: 02 May 2025.

Edited by:

Mohsen Yoosefzadeh Najafabadi, University of Guelph, Canada

Reviewed by:

Hariharan Shanmugasundaram, Vardhaman College of Engineering, India
Kelly Lais Wiggers, Federal Technological University of Paraná, Brazil

Copyright © 2025 Sangar and Rajasekar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Gopal Sangar, c2FuZ2FycmFqYWdvcGFsQGdtYWlsLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.