DATA REPORT article
Front. Agron.
Sec. Pest Management
Volume 7 - 2025 | doi: 10.3389/fagro.2025.1629681
Thermal and RGB Image Dataset for Detection and Management of Fall Army Worm infestation in Maize
Provisionally accepted- VIT University, Vellore, India
The Fall Army Worm (FAW), Spodoptera frugiperda, is an invasive species that has spread rapidly across several continents, causing severe damage to a variety of crops, particularly maize 1 . Its ability to reproduce quickly and migrate long distances makes it difficult to manage. Early detection and continuous monitoring of FAW are critical for timely intervention and effective pest management strategies 2 . However, traditional pest monitoring methods, such as visual inspection, are labour-intensive, time-consuming and inefficient in large-scale agricultural settings.
To address these limitations, recent studies have explored the use of remote sensing 3 and computer vision 4,5 technologies for automated pest detection. Among these, thermal imaging 6 has shown particular promise because it can capture subtle physiological changes in plant tissues through temperature variations, changes that may be early indicators of pest infestation. Combined with RGB imaging, which provides detailed visual information, this multimodal approach could be a powerful tool for improving pest detection accuracy.
In this study, we present a novel dataset consisting of both thermal and RGB images of maize crops, including FAW-infested samples and healthy controls. The images were captured under real field conditions using a FLIR E8 thermal camera and an iPhone RGB camera, ensuring practical relevance.
Few datasets are available for this specific use case involving thermal-RGB fusion, so our dataset fills an important gap and provides a valuable resource for the agricultural and computer vision communities. In addition, we applied a comprehensive set of 38 image augmentation techniques to increase the variability and robustness of the dataset, making it suitable for training deep learning models 7 in FAW detection and classification tasks.
The dataset consists of thermal and RGB images of maize plants, both infested with FAW and healthy, collected under varied environmental conditions. A detailed breakdown of the data collection process and the augmentation techniques is provided below.
The images were collected in agricultural fields where maize plants were actively growing. Two types of images were captured:
- Thermal (Infrared) Images: These images were captured using a FLIR E8 thermal imaging camera, which has a thermal resolution of 320 x 240 pixels. The camera captures temperature variations, which are indicative of plant health, pest infestations, or environmental stress.
- RGB Images: RGB images were collected with both the FLIR E8 camera and an iPhone camera, providing high-resolution visible-light images of maize plants. The images were taken from multiple angles and distances to capture varying perspectives and the spatial distribution of FAW infestations.
The images were taken under varying lighting conditions (e.g., morning, afternoon, cloudy, clear skies) to ensure the dataset is robust to environmental changes. The dataset contains three image types:
- RGB Images (FLIR): RGB images captured using the FLIR E8 camera.
- RGB Images (iPhone): RGB images captured using an iPhone.
- Thermal Images: Corresponding thermal images from the FLIR E8 camera.
To enhance the diversity and size of the dataset and enable the development of robust AI models, we applied 38 image augmentation techniques. These techniques were chosen to simulate various real-world conditions, such as changes in scale, orientation, lighting, and noise, that might occur in agricultural fields.
The augmentation methods applied include:
- Geometric Transformations: skew, rotate, translate, scale, flip, zoom, random cropping, affine transform, perspective transform, elastic distortion, spatial transform, image warp, and deformable convolution.
- Noise Addition: gaussian noise, salt & pepper noise.
- Image Distortion: gaussian blur, sharpen, temperature jitter, random erasing, occlusion, pseudo colouring, and mosaic.
- Color Adjustments: channel shuffle, solarize, invert, cut mix, color jitter, sigmoid contrast, gamma contrast, linear contrast, color shift, and contrast adjustments.
- Advanced Augmentations: bounding box, collar jitter, hide & seek, grid mask, mix up, polar distortion.
These augmentations ensure that the dataset is well-suited for training deep learning models that can generalize across a variety of conditions and environments 8 . The augmented images were generated by applying the above methods to both the thermal and RGB images of healthy and infested maize plants.
- Image Resolution: Thermal images are captured at a resolution of 320 x 240 pixels, while RGB images are of varying resolutions, typically around 640 x 480 (FLIR) and 3024 x 4032 (iPhone) pixels.
- Augmented Images: After augmentation, the dataset is significantly expanded, offering a highly varied set of images for training and testing machine learning models.
The dataset is available on Figshare 9 , an open-access repository that enables users to share, cite, and discover research outputs (DOI: 10.6084/m9.figshare.28388018). It includes images of FAW-infested and healthy maize leaves captured using a FLIR E8 thermal camera and an iPhone, and is structured into the categories 'FAM RGB -IFR', 'Healthy RGB -IFR', 'IFR FAW', 'IFR Healthy', and 'RGB FAW'.
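A few of the augmentation families listed above (flip, gaussian noise, salt & pepper noise, gamma contrast) can be illustrated with a minimal NumPy sketch. This is not the authors' augmentation pipeline, which the report does not describe in code; the function names, the parameter values, and the synthetic 320 x 240 stand-in image are all assumptions made for illustration.

```python
import numpy as np


def horizontal_flip(img: np.ndarray) -> np.ndarray:
    """Mirror the image left to right (a geometric transformation)."""
    return img[:, ::-1].copy()


def add_gaussian_noise(img: np.ndarray, sigma: float, rng: np.random.Generator) -> np.ndarray:
    """Add zero-mean Gaussian noise with standard deviation `sigma` on the 0-255 scale."""
    noisy = img.astype(np.float32) + rng.normal(0.0, sigma, size=img.shape)
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)


def add_salt_and_pepper(img: np.ndarray, amount: float, rng: np.random.Generator) -> np.ndarray:
    """Set roughly a fraction `amount` of the pixels to pure black or pure white."""
    out = img.copy()
    draw = rng.random(img.shape[:2])      # one random draw per pixel
    out[draw < amount / 2] = 0            # pepper
    out[draw > 1 - amount / 2] = 255      # salt
    return out


def gamma_contrast(img: np.ndarray, gamma: float) -> np.ndarray:
    """Apply out = 255 * (in / 255) ** gamma; gamma < 1 brightens, gamma > 1 darkens."""
    norm = img.astype(np.float32) / 255.0
    return np.clip(np.rint(255.0 * norm ** gamma), 0, 255).astype(np.uint8)


# Synthetic stand-in for a 320 x 240 image (hypothetical data, not from the dataset).
rng = np.random.default_rng(seed=0)
image = rng.integers(0, 256, size=(240, 320, 3), dtype=np.uint8)

augmented = {
    "flip": horizontal_flip(image),
    "gaussian_noise": add_gaussian_noise(image, sigma=10.0, rng=rng),
    "salt_pepper": add_salt_and_pepper(image, amount=0.02, rng=rng),
    "gamma": gamma_contrast(image, gamma=0.7),
}
```

Each function returns a new `uint8` array with the input's shape, so the outputs can be saved next to the originals under the same class label.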
To ensure dataset reliability, we:
- Cross-validated RGB and thermal images for consistency.
- Performed manual inspections for labelling accuracy.
- Employed a baseline deep learning model (CNN) to validate the dataset's usability for FAW detection.
The primary application of this dataset is the development and training of machine learning models for the detection of FAW infestations in maize crops. The dataset can be used in several key areas:
Machine learning models, particularly convolutional neural networks (CNNs), can be trained on this dataset to automatically detect and classify images based on the presence or absence of FAW. Thermal images are particularly useful for identifying temperature anomalies caused by pest activity, while RGB images provide detailed visual information about the physical state of the plants.
The dataset is particularly valuable for early-stage pest detection, which is crucial for minimizing crop damage and reducing pesticide use. By leveraging the thermal imaging modality, which can detect heat signatures from pests even before visible signs of damage occur, the dataset can help in the development of AI-based systems that alert farmers to potential infestations in real time 10 .
The dataset can be used as part of precision agriculture initiatives 11, 12, 13 , where machine learning models analyze images of crops to identify pest outbreaks and other environmental stresses 14 , allowing farmers to take targeted actions, such as localized pesticide spraying or other pest control measures. This can reduce costs, minimize pesticide use, and enhance crop yield.
In addition to pest detection, this dataset can also be employed for broader crop health monitoring applications. By analyzing both thermal and RGB images, researchers can study the physiological stress factors affecting maize plants, including water stress, disease, and pest damage.
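To make the idea of a "temperature anomaly" concrete, the sketch below flags pixels whose temperature deviates strongly from the frame median, using a robust (median absolute deviation) scale. It is an illustrative assumption, not a method from this report: it presumes the thermal image has already been converted to a per-pixel temperature array in degrees Celsius, and the function name `thermal_anomaly_mask`, the threshold `k`, and the synthetic frame are hypothetical.

```python
import numpy as np


def thermal_anomaly_mask(temps_c: np.ndarray, k: float = 3.0) -> np.ndarray:
    """Return a boolean mask of pixels more than k robust standard deviations from the median."""
    median = np.median(temps_c)
    mad = np.median(np.abs(temps_c - median))
    robust_sigma = 1.4826 * mad           # MAD scaled to match sigma for Gaussian data
    if robust_sigma == 0:
        return np.zeros(temps_c.shape, dtype=bool)
    return np.abs(temps_c - median) > k * robust_sigma


# Synthetic 320 x 240 frame: about 28 degrees C with sensor noise, plus one warm patch (hypothetical values).
rng = np.random.default_rng(seed=1)
frame = 28.0 + rng.normal(0.0, 0.2, size=(240, 320))
frame[100:110, 150:160] += 3.0            # simulated local hot spot

mask = thermal_anomaly_mask(frame, k=3.0)
hot_fraction = mask.mean()                # share of pixels flagged as anomalous
```

A median-based scale is used here so that a small hot or cold region does not inflate the threshold it is measured against; whether a flagged region corresponds to pest activity would still need to be confirmed against the labelled images.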
To evaluate the effectiveness of the dataset, a simple Convolutional Neural Network (CNN) was implemented to classify maize leaves as either healthy or FAW-infected. The model was trained on both RGB and infrared (IFR) images collected using a FLIR E8 thermal camera and an iPhone. The CNN architecture consisted of multiple convolutional layers, max-pooling layers, and fully connected layers, providing a basic yet effective feature extraction process. The dataset was split into 80% training and 20% validation, with all images resized to 224×224 pixels for uniformity. The results of this preliminary analysis demonstrate the dataset's potential for distinguishing between healthy and infected leaves, serving as a foundation for future, more advanced models. The evaluation results are tabulated below (Table 1).
This dataset is unique in several respects:
- Combination of Thermal and RGB Imaging: This dataset is among the first to combine close-range thermal and RGB images specifically for FAW detection in maize. The dual-modality approach allows for more accurate and robust identification of infestation, capturing both visual features and thermal signatures associated with pest activity, something not possible with RGB or satellite data alone.
- Real-World Applicability: All images were collected under natural, real-field conditions using a FLIR E8 thermal camera and an iPhone for RGB imagery. This enhances the dataset's relevance and applicability for practical deployment in operational agricultural settings.
- Extensive Augmentation Techniques: To further improve the dataset's utility, we applied 38 diverse image augmentation techniques to increase variability and robustness, enabling deep learning models trained on this dataset to generalize better across environmental conditions, infestation levels and image noise.
This dataset offers a fine-grained, multimodal and field-validated resource that is currently lacking in pest detection research.
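The preprocessing described for the baseline evaluation (an 80%/20% training/validation split and resizing to 224×224 pixels) can be sketched as follows. The report does not say how the split was drawn or which interpolation was used, so the seeded shuffle, the nearest-neighbour resize, the function names, and the file names are assumptions for illustration only.

```python
import random

import numpy as np


def split_train_val(paths, val_fraction=0.2, seed=42):
    """Shuffle file paths reproducibly and split them into (train, validation) lists."""
    shuffled = list(paths)
    random.Random(seed).shuffle(shuffled)
    n_val = round(len(shuffled) * val_fraction)
    return shuffled[n_val:], shuffled[:n_val]


def resize_nearest(img: np.ndarray, size=(224, 224)) -> np.ndarray:
    """Resize an H x W (x C) array to `size` by nearest-neighbour sampling."""
    h, w = img.shape[:2]
    rows = np.arange(size[0]) * h // size[0]   # source row for each output row
    cols = np.arange(size[1]) * w // size[1]   # source column for each output column
    return img[rows[:, None], cols]


# Hypothetical file names standing in for the dataset's images.
paths = [f"leaf_{i:03d}.jpg" for i in range(100)]
train_paths, val_paths = split_train_val(paths)

# Synthetic stand-in for a 640 x 480 FLIR RGB image.
sample = np.zeros((480, 640, 3), dtype=np.uint8)
resized = resize_nearest(sample)
```

Nearest-neighbour sampling keeps the example dependency-free; an image library's bilinear or area interpolation would usually be preferable for the 3024 x 4032 iPhone images, and a split stratified by class may be worth considering if the classes are imbalanced.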
This contribution can support the development of more precise and scalable AI models for sustainable pest management in agriculture.
This dataset opens several avenues for future research and application. It can be used directly to develop robust AI models for real-time FAW detection in maize, deployable on mobile devices or UAV-mounted imaging systems. Its dual-modality design (thermal and RGB) makes it well suited for integration into smart farming platforms and early warning systems to minimize crop loss. Moreover, researchers can leverage this dataset to explore multimodal learning, domain adaptation, and generalizable pest detection frameworks across various agricultural environments.
Keywords: FAW, FLIR, RGB, IFR, visible, thermal, Multi-Modal
Received: 16 May 2025; Accepted: 11 Jul 2025.
Copyright: © 2025 Sandhya, B and Kumar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: VENKATARAMANA B, VIT University, Vellore, India
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.