- 1College of Engineering, Nanjing Agricultural University, Nanjing, China
- 2Institute of Agricultural Facilities and Equipment, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- 3Key Laboratory of Agricultural Engineering of the Middle and Lower Reaches of the Yangtze River, Ministry of Agriculture and Rural Affairs of the People’s Republic of China, Nanjing, China
- 4School of Biosciences, University of Nottingham, Nottingham, United Kingdom
- 5Shenzhen Energy Nanjing Holding Co., Ltd., Nanjing, China
- 6China Electric Power Engineering Consulting Group Co., Ltd., Beijing, China
Introduction: Agrivoltaic (AV) systems combine photovoltaic (PV) power generation with agriculture to enhance land use and energy production. However, accurately predicting the microclimate within AV systems remains a challenge, primarily due to existing models failing to get their inherent temporal and spatial variability.
Methods: To address this, this study used long short-term memory (LSTM) networks to process time-series data and incorporated an attention mechanism to adjust the importance of temporal features. The model considered two environmental parameters, including solar radiation intensity and air temperature. Data collected from experimental AV systems with different PV panel density in Nanjing, China. The performance of the LSTM-Attention model was compared with traditional machine learning methods and standard LSTM models.
Results: The results demonstrated that the LSTM-Attention model outperformed the other models in predicting both solar radiation intensity and air temperature within AV systems with different PV panel density. Specifically, the Root Mean Square Error (RMSE) for radiation intensity predictions decreased by 28.0%, 35.7%, and 42.1% at different coverage densities. For air temperature predictions, the RMSE dropped by 39.0% in summer and 18.1% in winter. Importantly, the LSTM-Attention model maintained stable prediction performance even in winter and rainy weather conditions.
Discussion: The results indicated that the LSTM-Attention model could effectively captured the complex temporal variations in solar radiation and air temperature within AV systems, especially under varying weather conditions. The study provides theoretical support for improving crop management strategies within AV systems.
1 Introduction
Agrivoltaics is an emerging agricultural model that combines photovoltaic (PV) power generation with farming, allowing for more efficient use of land (Brown, 2018; Stid et al., 2025). In recent years, agrivoltaics (AV) systems have become increasingly popular as a way to combine renewable energy generation with farming (Li et al., 2022a). They have been given a lot of attention from researchers and businesses due to their dual environmental and economic benefits (Chopdar et al., 2024).
In AV systems, microclimate have a direct impact on the photosynthetic efficiency and physiological metabolic processes of crops, affecting the productivity and benefits of the system (Ghosh, 2023; Moon and Ku, 2023). The intensity of solar radiation that drives the process of photosynthesis is crucial for determining the rate of dry matter production and the final quality parameters of the crop (Gnayem et al., 2024). Air temperature also affects the activity of enzymes and respiration in plants, controlling the growth cycle and stress resistance performance (Gong et al., 2025). It has been found out that the shading caused by the PV panel results in significant changes in the radiation intensity both in space and time, and these changes in the light environment led to complex temperature field changes via alterations in the surface energy balance. All these together can cause physiological problems such as photoinhibition or heat stress for crops. As a non-linear system that is influenced by various factors such as the angle of the sun, clouds, and the shape of the PV array, AV systems display clear temporal and spatial variations in their microclimatic parameters (Pandey et al., 2025). To get a good grasp of the coupling among solar radiation and air temperature and to build up precise prediction models would offer some help to make decisions concerning crop cultivation and at the same time it would also form the basis for designing PV arrays and intelligent control systems (Zainali et al., 2025). Although environmental prediction for PV greenhouses is relatively well-established, there is still limited research on environmental prediction in open-field AV systems. Open-field AV systems differ significantly from greenhouse setups, primarily due to the lack of physical enclosures and the direct interaction between crops and external environmental factors such as wind, precipitation, and solar radiation. Consequently, accurately predicting microclimates in these systems is critical for optimizing both agricultural production and energy generation. By improving the prediction of environmental factors such as solar radiation and air temperature, it could provide data-driven insights that can assist in designing more efficient PV panel layouts and management strategies for crop cultivation.
The current methods used for predicting microclimates in AV systems include three main types of modeling methods: mechanistic approach, computational fluid dynamics (CFD), and data driven method (Li et al., 2021; Williams et al., 2023; Yajima et al., 2023). In practical applications, PV modules block incoming solar radiation in AV systems, forming moving shadows on the ground surface such as farmland, which affects the microclimate. Therefore, mechanistic models generally use solar radiation theory or thermodynamics to study the light distribution and dynamic energy-mass exchange process in these systems. For example, Ma Lu et al. (2022) developed a photosynthetic active radiation (PAR) decomposition model according to the Spitters light response curve theory. By adding cloud cover improvement elements and satellite-based information, they decreased the prediction mistake of the diffuse PAR part to 23.75% at a 30-minute time scale in temperate regions. Zhang et al. (2025) created a spatiotemporal dynamic model for the PV shading width, allowing for the quantitative prediction of the shading effect. Peng et al. (2023) created a light-electricity-heat coupling multi-physics model for PV greenhouses. They achieved a dynamic match between the coverage of PV panels and the photosynthetic characteristics of strawberries by optimizing the PAR and net photosynthetic rate models together. Some parameters are quite hard to get right, such as convective heat transfer coefficients and longwave radiation exchange coefficients. Additionally, the effect of crops on the surrounding environment continues to be a topic of active research, making it challenging to develop complete mechanistic models for AV systems (Zhang et al., 2025).
The CFD model is a special type of mechanistic models (Bellone et al., 2024). Recently, CFD modeling approaches have been extensively employed in studies of AV systems. A representative study by Joseph et al. (2025) implemented coupled multi-physics simulations incorporating source-sink terms within a CFD framework to analyze microclimates in AV systems. The study reduced numerical diffusion artefacts in shaded obstacle simulations through the refinement of the discrete ordinates radiation method. Williams et al. (2023) conducted a comprehensive CFD-based microclimate analysis examining the coupled effects of panel elevation, surface reflectivity, and plant water loss in AV systems. Gong et al. (2025) applied a CFD thermal environment model to explore the thermal regulation in AV systems with different panel heights. While these established CFD models can accurately simulate microclimate distribution within AV systems, they require substantial computational resources. In conclusion, CFD models are better suited for applications that do not have high real-time computation requirements such as agricultural buildings’ structural optimization and energy consumption simulation, rather than real-time environmental control in AV systems (Zainali et al., 2025).
The rapid development of artificial intelligence and computing technologies has prompted increasing adoption of data-driven models for microclimate prediction (Li et al., 2022b; Huang et al., 2024; Li et al., 2025). Although the training process demands considerable computational resources, these models achieve rapid prediction speeds post-training, with their accuracy being highly contingent upon both dataset quality and algorithmic design. Commonly employed machine learning approaches include regression models and convolutional neural networks (Giudici et al., 2023). Among these, the long short-term memory (LSTM) have emerged as a prevalent algorithm for time-series processing (Wu et al., 2024), finding particular application in agricultural building microclimate prediction due to their superior capability in handling sequential data (Geng et al., 2024). However, as the forecasting horizon extends, there is a significant decline in prediction accuracy, making the achievement of precise microclimate predictions under complex scenarios a key research challenge.
As a major innovation for processing sequence data, attention mechanisms dynamically allocate focus among inputs to enhance the model’s capability to recognize and leverage important information (Guo and Feng, 2024). Recent advances have demonstrated its effectiveness in enhancing model predictive performance. Mao et al. (2024) integrated attention mechanisms into BiGRU networks; the computed attention weights for hidden state vectors greatly enhanced the extraction of temporal features in greenhouse environments. Jiang et al. (2022) introduced an Attention-LSTM architecture which yielded impressive results in predicting indoor temperature 90 minutes ahead. They used a four-head attention mechanism to dynamically weight important features such as past temperatures, exterior wall temperatures, and predicted outdoor temperatures, resulting in a coefficient of determination of 0.9, which is a 26.1% improvement over traditional LSTM methods. However, research on applying attention mechanisms to enhance model predictive performance in AV systems remains relatively unexplored.
PV panel coverage density, which is an important structural parameter affecting the microclimate of AV systems, directly affects the spatial distribution patterns and dynamic changes of photothermal resources. The shading effect of PV panels with different coverage densities greatly changes the surface energy balance, causing spatiotemporal differences in the radiation intensity and temperature field, thereby affecting the photosynthesis efficiency and physiological metabolic process of crops (Tan et al., 2025). To conduct microclimate parameter prediction research on multiple density scenarios can not only clarify the quantitative response relationship between the coverage of PV panels and the microenvironmental factors, but also provide data support for optimizing the configuration of PV arrays, so as to achieve the goal of improving the efficiency of power generation and agricultural production at the system level.
To deal with the considerable temporal variation and spatial disparity of microclimate parameters in AV systems, and the reduced predictive precision with longer forecasting intervals, this research created an LSTM-Attention based multi-parameter microclimate prediction model. The model used a multi-dimensional environmental parameter combination such as the PV panel coverage density and photothermal environment and adopted an attention mechanism to dynamically weight important temporal factors. This method was able to make highly accurate forecasts of the microclimate in AV systems during different seasons and weather conditions, which provided a theoretical basis for optimizing the allocation of light and thermal resources and managing crops.
2 Materials and methods
2.1 Experimental AV systems
The experimental AV park located in Nanjing, China (31.6°N, 119.2°E). This park covered a total area of 47 hectares at an altitude of 360 meters, which was constructed in 2016 by Shenzhen Energy Group. Operating under an AV model, it comprised eight array units (Figure 1a). The park was estimated to reduce CO2 emissions by 19,800 t per year, which is equivalent to saving 7,900 t of standard coal. estimated to reduce CO2 emissions by 19,800 t per year, equivalent to saving 7,900 t of standard coal. The experimental area has a subtropical monsoon climate with obvious seasonal changes.
Figure 1. Location of the AV systems and different structure with 3 coverage densities: (a) Experimental AV base; (b) PV array structure; (c) Arrays with 3 different coverage densities of the moderate-density (MD), the high-density (HD) and the full-density (FD).
The AV system adopted a south-facing array configuration with a total span of 6.8 m. Under full-density coverage, the PV modules were arranged in four tightly packed horizontal rows (Figure 1b). The modules were mounted at a 24° tilt angle on Φ300 concrete piles, with the lower edge positioned 2.5 m above ground level. The polycrystalline silicon PV modules had a rated power output of 265 Wp, with a dimension of 1640 × 992 × 35 mm (L × W × H). Considering that low PV array coverage would result in impractical investment recovery periods while accounting for retrofitting costs, the study established two typical PV coverage densities of the moderate-density (MD) and the high-density (HD), based on a tightly packed configuration of 4 horizontal rows representing the full-density (FD) (Figure 1c). The vertical projection ratios of these coverage levels (the total vertical projection area of the modules on the farmland/the area of the farmland where the photovoltaic modules are installed) are 53.3%, 40.0% and 26.7% respectively.
2.2 Data acquisition
To investigate the shading effects of PV arrays, varying PV panel deployment densities (full-density, FD; high-density, HD; and moderate-density, MD) were employed to establish quantitative relationships between shading ratios and microenvironmental parameters, thereby comprehensively covering typical operational scenarios. The monitored AV system in this study comprised four distinct sensor monitoring zones: three inter-panel areas within AV systems with different panel coverage densities and an open-field area without PV panels. The environmental parameters monitored included global solar radiation, temperature, relative humidity, soil moisture, precipitation, wind speed and direction, both within and outside the AV systems. Detailed specifications of the sensor models and their parameters are provided in Table 1. The data recording system integrated a compact weather station recorder, equipped with solar radiation sensors, temperature sensors and other instruments.
As shown in Figure 2a, all sensors were placed for long-term, continuous monitoring at intervals of 10 min. The sensor layout followed the following principles: Under the condition of meeting the information collection requirements, reduce the redundancy of sensor configuration; Taking into full account factors such as shading of PV modules, plant height, and root length of plants, the expected test results can fully reflect objective laws. The spatial distribution of sensor locations was presented in Figure 2b. The data collection period spanned from June 15, 2023, to June 15, 2024, which covered an entire year, including all four seasons. Specifically, the data collection included critical periods such as the summer solstice, with the training set from June 15, 2023, to July 5, 2023, and the validation set from July 16, 2024, to July 24, 2023, representing high-temperature and high-radiation conditions. The winter solstice, with the training set from December 8, 2023, to December 28, 2023, and the validation set from December 29, 2023, to January 6, 2024, representing low-temperature and low-radiation conditions. In addition, we selected data from various weather conditions to enhance the model’s robustness. The sensors operated continuously with 10-minute intervals to monitor parameters. The experimental protocol set daytime as 06:00-18:00 and nighttime as 18:00-06:00+1 (Zhang et al., 2023). According to the observation of cloudiness, precipitation record and sunshine duration measurement, the weather condition during the study period was divided into three types, clear sky, overcast and rainy condition (Zhang et al., 2025).
Figure 2. Monitoring scene and experimental measuring point: (a) Monitoring scene; (b) Arrangement of the experimental measurement points.
2.3 Data preprocessing
2.3.1 Missing values
In order to make sure that data imputation can be feasible and to prevent long-distance false interpolation due to too much missing data, this study used box plots to remove outliers from the collected data. Then, the missing values were filled with mean imputation, linear interpolation, and data from the closest position with similar weather conditions (Ritter, 2023).
2.3.2 Normalization
To ensure data consistency and improve processing efficiency, the study normalized all the input environmental elements to the interval [0, 1]. Normalization of the input data X1, X2, X3… XN was performed by Equation 1 (Singh and Singh, 2020; Yang et al., 2023).
Where, Xi is the original data, Xmin is the minimum values of the original dataset, Xmax is the maximum values of the original dataset, and X is the normalized value after transformation.
2.3.3 Correlation analysis
The collected environmental data were analyzed by the Pearson correlation coefficient (PCC) method. The PCC measured the linear correlation of two continuous random variables, as shown in Equation 2. The PCC ranges from -1 to 1, where values approaching 1 indicate stronger correlations between variables (Jiang et al., 2025).
Where, Xk and Yk are the k-th feature variable and environmental sample data, p is the number of samples, and are the mean values of the feature variables and the environmental data from the AV systems.
2.3.4 Data preparation
To rigorously evaluate model robustness under extreme climatic conditions, we specifically selected data from 30-day periods centered around the summer solstice (representing high-temperature, high-radiation conditions) and winter solstice (representing low-temperature, low-radiation conditions) for model training and validation. The dataset was partitioned such that each seasonal subset contained 21 consecutive days (70% of data) for training and 9 consecutive days (30%) for validation. The sensor records every 10 minutes, and a single environmental parameter generates 1,008 measurements per day. For a single environmental parameter, the training set for each season contained 3,024 environmental data points (21 days), and the validation set contains 1,296 data points (9 days). The whole process of data processing and microclimate prediction of AV systems in this study was shown in Figure 3.
3 Models for solar radiation and air temperature prediction
3.1 Long short-term memory
Microclimate variables in agrivoltaic systems exhibit strong temporal dependence, long-range dependencies, and nonlinear fluctuations due to moving shadows, cloud dynamics, and seasonal changes, which calls for a time-series model that can robustly learn long sequences. LSTM, with its gated architecture, effectively mitigates vanishing/exploding gradients in long-sequence modeling and is therefore well-suited for capturing long-term temporal patterns (Hochreiter and Schmidhuber, 1997; Landi et al., 2021). The LSTM architecture has a complex gate mechanism for controlling the flow of information and updating cell states, thus controlling both information storage and transmission. As shown in Figure 4, the LSTM gating system comprises four fundamental units. At each timestep t, the model processes 3 inputs: (i) the external input at t, (ii) the LSTM unit output at t-1, and (iii) the cell state at t-1. These inputs undergo transformation through the gating units to produce updated outputs and cell states (Fang et al., 2021). The mathematical formulations governing these operations were provided in Equations 3–8.
Figure 4. Door structure of long short-term memory. Ct-1 is cell state at time t-1, Ct is cell state at time t, ht-1 is output at t-1, ht is output at t, ft is forget gate, μt is update gate, ot is output gate, is cell state, σ is activation equation and tanh is hyperbolic tangent function.
Where, tanh denotes the hyperbolic tangent function and σ represents the activation function. The weight matrices (Wf, Wμ, Wc, Wo) and bias vectors (bf, bu, bc, bo) correspond to the forget gate, update gate, cell state, and output gate, respectively. These parameters were initialized randomly and subsequently optimized through data-driven training.
3.2 Attention mechanism
The attention mechanism comes from animal’s visual attention. This mechanism enables models to concentrate more on the most important information from the input sequence, which makes it better at recognizing important features and improves its prediction accuracy. The attention mechanism dynamically assigns higher weights to the most informative time steps, enabling the model to focus on critical information while suppressing noise, thereby improving responsiveness and accuracy under rapidly varying shading (Niu et al., 2021).
The attention mechanism works by taking three input vectors: query (Q), key (K), and value (V). The vector Q gets relevance scores with other vectors, while the key vector K measures the connection strength between Q and other vectors. The value vector V contains the information to be aggregated according to the attention weights gotten from Q. The mechanism calculates the similarity between Q and K to determine how much attention should be given to each element in V during the weighted aggregation process. Specifically, it first calculates the dot product of Q and K, applies a normalization operation to get the importance weights of each element in K, and then performs a weighted sum of V with these weights to get the final representation. As shown in Figure 5, the attention computation has three main steps (Gao et al., 2022).
Figure 5. Schematic diagram of attention mechanism. x1, x2, xn are feature input; WQ, WK and WV are parameter matrix; Q is query vector, K is key vector, V is value vector; qi is the element of Q vectork, ki is the element of K vector, vi is the element of V vector; αi is the attention distribution; Softmax is the normalized function; outi is output of attention mechanism; i is the feature number, the range is 1 to n.
Step 1: The query (Q), key (K), and value (V) matrices were computed using Equations 9–11.
Step 2: The similarity scores between Q and K were calculated via Equation 12. These scores were then normalized using the softmax function to obtain attention weights, where higher values indicate stronger relevance between the i-th input and the task objective.
Step 3: The final attention output was generated by performing a weighted sum of V with the normalized attention weights, as defined in Equation 13.
3.3 Long short-term memory – attention
To enhance the prediction of microclimate within AV systems, the study developed an integrated LSTM-Attention model. The framework of the model was illustrated in Figure 6. The model architecture operates as follows: First, multivariate time-series data were processed through an LSTM layer for feature extraction. Subsequently, an attention mechanism was applied to compute weight coefficients, which quantify the relevance between current and historical data points. By adaptively assigning higher weights to more informative temporal features, the model effectively captured the most critical patterns while suppressing noise. The structure of the LSTM-Attention model includes 5 layers (Ma et al., 2024). The description of each layer is given below:
Figure 6. Network structure of LSTM-Attention prediction model. t1, t2,…, tn are time series; X1, X2,…, Xm are input variables; LSTM is long short-term memory; y1, y2,…, yn are output.
1. Input Layer: Multidimensional time-series features are converted to LSTM-compatible 3D arrays (S, T, X). Where, S is the number of input samples, T is the time dimension, and X is the feature dimension.
2. LSTM Layer: Two LSTM layers are interlaced with dropout layers to prevent overfitting by stochastically deactivating neurons during training, which improves generalization ability.
3. Attention layer: To compute the attention mechanism, we used Equations 9–11 to derive the Q, K, and V vectors for each hidden state. The similarity scores between Q and K were then calculated through Equation 12, followed by normalization via the Softmax function to obtain the attention distribution. Higher values in this distribution indicate stronger relevance between the input information and task objectives. Finally, the attention output was computed as the dot product between the normalized weights and the value vectors (Equation 13).
4. Fully Connected Layer: Attention outputs were processed through dense connections for feature recombination and dimensional transformation.
5. Output Layer: Final predictions were generated with linear activation.
Since the target subjects, regional climatic characteristics, as well as the feature variables, data volume, and data quality of the dataset in this study differ from those in previous studies, it would lack scientific rigor to compare and discuss the predictive performance of the model in this study with that of previous research. To assess the improvement of the LSTM-Attention model over the standard LSTM model and several widely used machine learning models in microclimate prediction, the study conducted comparative analyses under two scenarios: varying prediction time horizons and different weather conditions. Specifically, the LSTM-Attention model was compared with four commonly employed machine learning models—BP, SVM, LSTM, and LSTM-GRU—in terms of their predictive performance for both solar radiation intensity and air temperature within AV systems.
3.4 Algorithm parameter settings
The internal environmental factor model of the AV system was developed using Python 3.11, with PyCharm as the development environment and TensorFlow as the development framework. The training process involved 80 iterations, a learning rate of 0.01, and an input batch size of 32, with the maximum number of epochs set to 100. The “Adam” optimizer and “mean squared error” loss function were employed for model training. Additionally, the number of neurons in both the GRU and LSTM layers was set to 128. To maintain consistency in model comparisons, all modules across were configured with the same parameters.
The study selected several machine learning models for comparison, including BP, SVM, and LSTM-GRU. The rationale for choosing these models was based on their ability to handle sequential data and their inherent differences in handling long-term dependencies. However, these models still faced challenges when dealing with the complex temporal and spatial variations in agrivoltaic systems, particularly under changing weather conditions.
3.5 Evaluation indicators
To assess the model’s performance, the study utilized the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE). The mathematical expressions for these metrics were given in Equations 14–16.
Where, Xk is the predicted value, Yk is the observed value, is the mean of the observed values, and n is the total number of predicted samples.
4 Results and discussions
4.1 Correlation analysis
Microclimate parameters in AV system have complex nonlinear interdependence with different environmental factors. Including all measured factors as model inputs can add noise and reduce the accuracy of predictions because of possible multicollinearity and interference from irrelevant features (Wang and Chen, 2021). Figure 7 showed a complicated nonlinear relationship among the microclimate parameters inside the AV system, where solar radiation intensity and air temperature exhibited differential coupling with various environmental factors. According to Pearson correlation analysis, the internal solar radiation intensity was highly correlated with the PV panel temperature and the external solar radiation, but it had a moderate association with the relative humidity and soil temperature. External air temperature and wind speed showed considerable but less noticeable connections to the inner microclimate parameters, which implied that the atmosphere has a secondary effect on the inner environment through the PV array. These results matched the known rules about energy balance in partly closed farming places, where how the leaves are shaped and what they’re made of affect how they interact with their surroundings (Ali Abaker Omer et al., 2025).
Figure 7. Pearson correlation between the solar radiation (Isr) and air temperature (Iat) inside AV systems with different environmental factors. Isr represents the intensity of internal solar radiation, Iat represents the internal air temperature, Irh represents the internal relative humidity, Ist represents the internal soil temperature, Ism represents the internal soil moisture, Esr represents the external solar radiation, Eat represents the external air temperature, Erh represents the external relative humidity, Est represents the external soil temperature, Esm represents external soil moisture, Pr represents precipitation, Wd represents the wind direction, Ws represents the wind speed, Tp represents the temperature of the PV panel.
For solar radiation intensity, the analysis showed very strong relationship with the external solar radiation and temperature of PV panel, while demonstrating moderate relationships with the external air temperature and internal relative humidity. There were weak correlations between the strength of the internal solar radiation and the moisture level of the soil or the direction of the wind. It suggested that external solar radiation and temperature of PV panel should be the main input variables for radiation prediction, and external air temperature and internal relative humidity could be added to consider the effect of atmospheric attenuation. Differential correlation pattern indicated that we need different features for the model of these two important microclimate parameters. The internal air temperature showed very high correlation with external air temperature and PV panel temperature. It exhibited a moderate correlation with relative humidity and soil temperature, while showing weak correlations with wind speed and precipitation. The results showed that the main factors affecting the prediction of air temperature were the external air temperature, temperature of PV panel, and internal relative humidity, and the internal soil temperature and wind speed should be considered as secondary.
4.2 Analysis of radiation intensity prediction results
4.2.1 Seasonal variations in predicted radiation intensity
Figure 8 presented the comparative curves of predicted and measured solar radiation intensity across different seasons for the BP, SVM, LSTM, LSTM-GRU, and LSTM-Attention models. The results revealed that the maximum prediction deviations for solar radiation intensity in summer under the MD, HD, and FD treatments for the five models were 300.4, 358.1, 252.9, 261.7, and 163.7 W/m2 (MD); 281.5, 344.3, 274.6, 232.9, and 173.4 W/m2 (HD); and 300.6, 343.0, 268.0, 233.2, and 167.4 W/m2 (FD), respectively. In winter, the maximum prediction deviations under the same coverage densities were 156.8, 160.0, 134.6, 116.9, and 80.0 W/m2 (MD); 165.9, 177.1, 169.1, 121.7, and 73.2 W/m2 (HD); and 54.3, 57.8, 73.7, 47.8, and 29.0 W/m2 (FD). These results demonstrated that the LSTM-Attention model achieved superior prediction accuracy for solar radiation intensity in both summer and winter, with its maximum deviation being significantly lower than those of the traditional machine learning models and the standalone LSTM or LSTM-GRU models.
The performance differences between LSTM and its improved versions, LSTM-GRU and LSTM-Attention, showed that improving the structure of the neural network could enhance the accuracy of forecasts in complicated energy areas. The LSTM model had a maximum prediction deviation of 252.9 W/m2 for summer solar radiation intensity under MD treatment. In comparison, the LSTM-Attention model reduced the maximum prediction error by 37.5% compared to the LSTM model under the FD treatment, demonstrating the quantitative analysis capability of the attention mechanism on multiple reflected radiations among PV panels. Furthermore, the LSTM model had a 25.6% decrease in the maximum prediction error for winter solar radiation intensity with HD treatment compared to MD treatment. However, its maximum error at the FD treatment was still higher than LSTM-GRU and LSTM-Attention. Notably, the LSTM-Attention model performed well in winter, it has the smallest maximum error in the whole dataset at the FD treatment, which is 60.7% less than that of the LSTM model. Regarding solar radiation intensity prediction as a particular environmental element, the LSTM-Attention model exhibited excellent seasonal adaptability. In the high summer radiation condition, the model could keep the prediction deviation below 170 W/m2 through the attention-weighted LSTM framework, which was over 50% better than the traditional machine learning model. In winter, under low-radiation conditions, its maximum deviation was only 29.0 W/m2 at the FD treatment, outperforming LSTM-GRU by 39.3%. In addition, it observed that higher PV coverage densities tend to increase error due to shading effects and localized temperature changes, which were harder for the model to capture. This was especially prominent in summer when radiation intensity fluctuations are more pronounced. This difference in performance showed that the attention layer improved important temporal features, and the LSTM part dealt well with the non-linear changes in radiation intensity (Kong et al., 2023; Şener and Tuğal, 2025).
Solar radiation intensity was a key microclimate parameter that was highly influenced by the density of the PV array. Our study showed that as the PV array density increased, shading effects became more pronounced, leading to greater spatial and temporal variation in radiation intensity. Specifically, under the HD and FD configurations, the model’s ability to predict solar radiation intensity decreased. During the summer, the increased shading caused by higher PV array densities resulted in significant fluctuations in radiation intensity, which the model had difficulty capturing. The RMSE increased with the array density, reflecting the growing complexity in predicting radiation under dense PV arrays. The model performed better under the MD configurations, where the shading effects were less severe. These findings indicated that the attention mechanism incorporated in the model helped mitigate the impact of shading to some extent, but further refinement was needed to address the more dynamic shading patterns in high-density configurations.
Figure 9 showed the comparison of different solar radiation prediction models according to their seasonal variation and coverage density. Results show that the LSTM-Attention model did better than the others during both summer and winter. In summer, at MD, HD, and FD treatments, the LSTM-Attention model got a mean RMSE of 41.6 W/m2. It meant that the reduction was 28.0%, 35.7%, 42.1%, and 49.4% less than the LSTM-GRU, LSTM, BP, and SVM models, respectively. MAE had a similar trend, and R2 was around 96.9% to 97.1%, which meant it performed much better than other models. The benefit of LSTM-Attention was even more evident in winter. At the HD treatment, it reached an RMSE of 11.2 W/m2, which was 30.4%, 52.7%, 55.7%, and 57.6% lower than the LSTM-GRU, LSTM, BP, and SVM models, respectively. For this prediction, the R2 value was 97.8%, indicating that the model had a good ability to capture the temporal characteristics of solar radiation intensity.
Figure 9. Performance comparison of five prediction models for solar radiation intensity in different forecasting seasons.
Further analysis showed that the LSTM-Attention model had better stability than the benchmark models under changing environmental conditions. MD treatment RMSE was only 49.7% of the LSTM model’s RMSE in winter. As coverage density increased, the performance of the LSTM-Attention model decreased slightly; the R2 value dropped by only 0.8% at the FD treatment. In contrast, the R2 values of LSTM-GRU and LSTM decreased by 1.9% and 0.3%, respectively. It is likely because the attention mechanism can model the spatiotemporal variations in solar radiation intensity, particularly in winter when the fluctuations were caused by changes in the solar elevation angle. LSTM-Attention could make the key feature more important so as to avoid the sudden error of prediction. Traditional LSTM and LSTM-GRU models responded more slowly to sudden changes in radiation intensity compared to these models. This can be seen from the higher MAE values for these models in summer, which were up to 53.6% higher than the LSTM-Attention model under high coverage density conditions. From this, it can be seen that the mechanism reduced phase lag and peak mismatch during fast changing radiation period and improved robustness when the signal to noise ratio was low, which explained the stronger performance of LSTM-Attention under complex climate conditions.
4.2.2 Out-of-window generalization test of solar radiation under different weather conditions
To partially evaluate the model’s generalization ability outside the training window, an additional validation was performed on a short period of data (3 days) not included in the training phase. This additional period, which was selected outside of both the summer and winter solstices, provided a preliminary check of the model’s robustness to conditions beyond the specific seasonal extremes. We evaluated its solar radiation forecasting performance relative to conventional LSTM and hybrid LSTM-GRU models under seasonally varied weather conditions (Figure 10). As shown in Figure 10, the mean maximum prediction deviations for the three models (LSTM-Attention, LSTM and hybrid LSTM-GRU models) under summer conditions were 139.2, 105.8, and 100.7 W/m2 for sunny days; 67.2, 58.6, and 41.2 W/m2 for cloudy days; and 44.7, 44.3, and 30.4 W/m2 for rainy days. Under winter conditions, the corresponding mean maximum deviations were 82.6, 68.2, and 5.6 W/m2 for sunny days; 47.1, 38.9, and 3.7 W/m2 for cloudy days; and 9.2, 8.3, and 1.0 W/m2 for rainy days. The comparative analysis of these curves and data demonstrated that, the LSTM-Attention model achieved superior prediction accuracy for solar radiation intensity compared to other models under different weather conditions.
Figure 10. Prediction results of solar radiation intensity under different weather conditions in different seasons by LSTM, LSTM-GRU and LSTM-Attention models: (a) Summer; (b) Winter.
Table 2 presented a numerical comparison of how well the LSTM, LSTM-GRU, and LSTM-Attention models predicted solar radiation intensity in the AV system under typical seasonal weather. The LSTM-Attention model performed better than the others in all tested situations. For example, during summer clear-sky conditions at the MD treatment site, this model reached an RMSE of 31.8 W/m2 and an MAE of 25.7 W/m2. These values were 48.0% and 43.0% lower than those of the LSTM model. It also achieved an R2 of 97.6%. The advantage of the LSTM-Attention model was even clearer in cloudy and rainy weather. On rainy days, its MAE decreased to 41.6% of the LSTM model’s value. In winter clear sky, the LSTM-Attention model reduced the RMSE to 5.2 W/m2, improved by 39.5% compared with LSTM-GRU model. These results showed that the LSTM-Attention structure can adapt well to various weather situations.
Table 2. Comparison of model performance for solar radiation intensity under different weather conditions.
The LSTM-Attention model predicted the solar radiation intensity better than the LSTM-GRU model due to the attention mechanism that adjusted the importance of different weather features more efficiently. Although the LSTM-GRU model did not have much improvement in complex weather conditions, the LSTM-Attention model still maintained an R2 of 96.2% in winter rain, which was 5.4% higher than LSTM. It meant that it can deal with the non-linear effect of rainfall on radiation better. Additionally, as the difference among the three models became smaller with an increase in coverage density, LSTM-Attention still reduced the MAE by an average of 42.1% compared to LSTM under FD treatment. It showed that it works reliably in different circumstances. These findings indicated that hybrid models including attention mechanisms could greatly enhance the accuracy of solar radiation intensity prediction tasks, offering more dependable modeling tools for evaluating the performance of an AV system (Sakib et al., 2024).
4.3 Analysis of air temperature prediction results
4.3.1 Seasonal variations in predicted air temperature
Figure 11 displayed comparative curves between predicted and measured air temperature values across different seasons for the BP, SVM, LSTM, LSTM-GRU, and LSTM-Attention models. The maximum prediction deviations for air temperature in summer under the MD, HD and FD treatment were 2.3, 1.6, 2.2, 1.8, and 1.0°C (MD); 2.3, 1.8, 2.3, 1.8, and 1.1°C (HD); and 2.9, 2.2, 1.6, 1.52, and 0.9°C (FD), respectively, for the five models. Correspondingly, the maximum prediction deviations in winter under the same coverage densities were 4.7, 4.3, 5.0, 3.7, and 3.8°C (MD); 6.7, 4.5, 4.7, 4.2, and 3.4°C (HD); and 7.5, 5.0, 4.3, 2.8, and 2.3°C (FD). The results showed that the LSTM-Attention model improved the ability of the model to forecast seasonal temperatures, with the prediction errors being much smaller than those of the traditional methods (BP, SVM) and other neural network structures (LSTM, LSTM-GRU). It is noteworthy that the curves predicted by the LSTM-Attention model were most consistent with the actual measurement data, especially showing better adaptability during periods of fast temperature changes. On the contrary, the BP and SVM models generated smoother prediction curves with less capacity to capture sudden temperature changes. Though the LSTM and LSTM-GRU models outperformed traditional ML techniques, they still had some lag or overshoot in their peak temperature predictions.
According to the error analysis, the LSTM-Attention model performed the best. In summer, under the MD treatment, the maximum deviation was only 1.0°C. It was much less than the deviations of the BP and SVM models, which were 2.3°C and 1.6°C, respectively. As the PV coverage density rose to the FD treatment, the maximum deviation of the LSTM-Attention model decreased further to 0.9°C. In contrast, the other models’ errors grew. In winter, the performance difference became even larger, with LSTM-Attention keeping a max deviation of 2.3°C for the FD treatment. The main reason for this performance improvement was that the attention mechanism could pay attention to the relevant parts of the temperature feature sequence, which improved the model’s ability to find the nonlinear relationship between the environment and temperature, so as to make more accurate predictions of temperature changes under complex environmental conditions. All models were impacted by the PV panel coverage density, but the degree of impact differed greatly. Traditional machine learning models (BP and SVM) had poor adaptability to environmental parameter changes as they showed significant performance deterioration when the coverage density increased. Deep learning models generally performed better, and LSTM-Attention was the most insensitive to changes in coverage density, indicating the best robustness. This observation indicates that for environmental temperature prediction tasks affected by the intensity of solar radiation, deep learning models with attention mechanisms can better deal with the differences in input features caused by different PV panel layouts, making them a more reliable prediction tool for real-world engineering applications, as reported by He et al (He et al., 2022).
Air temperature predictions showed different patterns under varying PV array densities. The temperature field within the AV system was heavily influenced by both solar radiation and shading effects. As the PV array density increased, the shading reduced the amount of direct sunlight reaching the ground, which in turn lowered the overall temperature. The model performed reasonably well in predicting air temperature in scenarios with the MD configuration. However, in the HD and FD configuration, the shading effects significantly altered the temperature distribution, leading to increased model errors, especially during periods of high solar radiation. The RMSE for air temperature predictions was higher in the HD configuration compared to the MD configuration, where temperature variations were more stable. This suggested that the model struggled to adapt to the complex temperature gradients caused by dense PV arrays. As a result, further adjustments were needed to enhance the model’s ability to capture such variations in high-density configurations.
Figure 12 showed the performance of five models in predicting air temperature in different seasons and PV panel coverage levels. LSTM-Attention model always did better than the rest no matter if it was summer or winter, and the coverage density didn’t matter. It has smaller RMSE and MAE values compared to LSTM-GRU and LSTM. For instance, in summer with the MD coverage, the LSTM-Attention model achieved an RMSE of 0.5°C. This was 39.0% lower than LSTM-GRU and 51.0% lower than LSTM. In winter at the same coverage level, it had an RMSE of 0.8°C, which was 18.1% and 35.8% less than LSTM-GRU and LSTM, respectively. The model also kept an R2 over 97.0% in all cases, indicating a strong capacity to explain temperature changes. Interestingly, in summer with HD coverage, the R2 was as high as 98.2%, which was much higher than those of LSTM-GRU and LSTM.
Figure 12. Performance comparison of five prediction models for air temperature in different forecasting seasons.
The improved accuracy of the LSTM-Attention model may have been because it had an attention mechanism that gave different weights to the input data. It helped the model to recognize the complex and changing temperature patterns usually seen in small climate areas inside AV systems. On the other hand, the LSTM-GRU model did not include a feature selection process, which may explain its larger variation in error. For instance, at the FD treatment site in winter, its RMSE was 0.9°C. Although this result was better than that of the standard LSTM model, it was still less accurate than LSTM-Attention. The basic LSTM model, affected by gradient vanishing problems, performed the worst when predicting long sequences. Its R2 value dropped to 92.3% in winter. These findings support the importance of using attention mechanisms to improve the accuracy of air temperature forecasts (Nandi et al., 2022).
4.3.2 Out-of-window generalization test of air temperature under different weather conditions
To partly assess how well the model performs outside the training period, we ran an extra validation using three days of data that were not part of the training set. These three days were chosen to avoid both summer and winter solstices. This helped us do an early check on whether the model could handle conditions that were not tied to the extreme seasons. We compared how three models (LSTM, LSTM-GRU, and LSTM-Attention) performed during summer and winter under three different types of weather: clear, cloudy, and rainy days (Figure 13). In summer, the highest average prediction errors on clear days were 3.9°C for LSTM, 3.5°C for LSTM-GRU, and 2.8°C for LSTM-Attention. On cloudy days, the errors dropped to 1.1°C, 0.8°C, and 0.6°C for the same models. For rainy days, the errors were even lower: 0.4°C for LSTM, 0.3°C for LSTM-GRU, and 0.2°C for LSTM-Attention. In winter, the maximum mean deviation on clear days was 1.8°C, 1.5°C, and 1.1°C; for cloudy days, it was 0.9°C, 1.0°C, and 0.9°C; and for rainy days, it was 2.2°C, 1.2°C, and 0.7°C for the above three models, respectively.
Figure 13. Prediction results of air temperature under different weather conditions in different seasons by LSTM, LSTM-GRU and LSTM-Attention models: (a) Summer; (b) Winter.
Performance comparison showed that the LSTM-Attention model had better performance than both GRU-combined and traditional LSTM models in terms of weather prediction accuracy. This improved performance was especially noticeable during summer clear-sky conditions, when the LSTM-Attention model cut down the prediction error by 27.7% compared to the regular LSTM model, and during winter rainy conditions, the improvement was as high as 67.7%. The stable performance under various weather conditions suggests that the attention mechanism can capture temperature changes affected by weather, which is difficult for traditional models to handle. In addition, during rainy conditions, the model showed higher prediction errors, likely due to rapid changes in radiation and temperature.
Table 3 provided a quantitative comparison of air temperature forecasts in the AV system for different seasons by means of LSTM, LSTM-GRU and LSTM-Attention. The results indicated that the LSTM-Attention model performed better than the other models in terms of accuracy. In the summer clear sky situation at MD treatment, the RMSE of LSTM-Attention model was 1.4°C, which was 31.4% and 17.7% lower than the RMSE of LSTM and LSTM-GRU models, respectively. Its R2 was 95.9%, higher than the 92.1% and 93.9% of the other models. When it was overcast, this model did even better, with the RMSE dropping to 0.3°C and the R2 rising to 97.5%, indicating that it could do a better job capturing changes in air temperature under different lighting conditions. The LSTM-Attention model demonstrated especially strong stability during winter extreme weather events. Rainy condition at the MD treatment, it achieved an RMSE of 0.4°C, an 87.2% reduction from LSTM’s 3.1°C, and maintained a high R2 of 96.5%. Noteworthy is that in winter clear sky situation, the model’s MAE got as low as 0.4°C, which was 29.1% better than LSTM-GRU, proving the accuracy of detecting small changes in air temperature in cold places. This advantage became more obvious when there were more places covered.
In all kinds of weather conditions, LSTM-Attention always performed better than other models in complex meteorological situations. During the summer rainy season, R2 was high, ranging from 93.4% to 98.1%, which was higher than other models by 1.1% to 6.8%. In the winter overcast condition under HD treatment, the model got an MAE of 0.3°C, which was 40.4% lower than the baseline models, and an R2 of 97.0%. It can be seen that it can fit the non-linear relationship of air temperature under different weather conditions. The model consistently performed well throughout all seasons and weather conditions, showing that it can be relied upon to predict air temperature. From this, it can be seen that the LSTM memory retained longer context, which was essential for predicting delayed temperature responses. This mechanism based explanation was consistent with the observation that the gains of LSTM-Attention became more pronounced under complex weather, particularly in winter rainy scenarios.
4.4 Limitations and future work
This study presents some encouraging outcomes using the LSTM-Attention model for forecasting microclimate parameters in AV systems. However, there are some limitations that should be considered. A major limitation is that only one dataset from one particular area, under flat land conditions, was used. This might impact how well the model works in other places or under different terrain conditions. Therefore, future studies could consider validating the microclimate prediction model under various terrain conditions, such as sloping land. Additionally, although the attention mechanism enhances the model’s prediction accuracy, it has difficulty dealing with extreme weather conditions that have highly variable inputs. Future research will focus on optimizing the model to better handle such variations and improve its adaptability across different environments.
Despite these challenges, the results give us a good starting point to improve how we set up PV arrays and manage crops in AV systems. In the future, the proposed model can be integrated with streaming data from in-field sensors and weather stations to deliver rolling short-term forecasts, thereby supporting real-time AV management decisions such as irrigation scheduling and field operation planning. In addition, the model can be evaluated and applied across different PV coverage densities, and the forecasting module can be embedded into a management framework to enable quantitative co-optimization of PV array layouts and crop management under multi-objective trade-offs.
5 Conclusion
In this study, we developed a model based on LSTM and Attention mechanisms to predict microclimate such as solar radiation and air temperature in AV systems. The model was trained with information collected from the experimental AV systems on flat land in Nanjing, China. The results showed that the LSTM-Attention model performed much better than the traditional machine learning models (BP and SVM) and the standard LSTM model. Specifically, during the summer, the model decreased the RMSE of the solar radiation forecast by 28.0%, 35.7%, and 42.1% for the MD, HD, and FD treatments, respectively. In terms of air temperature prediction, the RMSE was reduced by 39.0% in summer and 18.1% in winter compared to the LSTM-GRU model. The attention mechanism allowed the model to focus on the most relevant temporal features, which improved its ability to detect complex changes in solar radiation and air temperature under different weather conditions such as sunny, cloudy, and rainy days. The model also performed well at both ends of the season. In winter, the RMSE for solar radiation predictions was 11.2 W/m2 and for air temperature predictions was 0.4°C under rainy conditions with the FD treatment. These results show that the LSTM-Attention model can do a good job at predicting microclimate parameters. This research provides a theoretical basis for the optimization of the array structure of AV systems and the adjustment of agricultural planting structure. At the same time, it offers important theoretical support for the microclimate prediction of AV projects in different regions. However, it should be noted that this study was only conducted under flat land conditions. Future studies can extend the model to different terrain types, such as sloping land, to further validate its applicability.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
LZ: Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. JHG: Formal Analysis, Validation, Writing – original draft. CW: Funding acquisition, Writing – original draft, Writing – review & editing. EM: Writing – review & editing. AG: Writing – review & editing. BL: Investigation, Resources, Writing – original draft. CY: Investigation, Resources, Writing – original draft. GX: Resources, Software, Writing – original draft. JZ: Methodology, Resources, Writing – original draft. JG: Resources, Software, Writing – original draft. MX: Supervision, Writing – review & editing. EB: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This research was funded by the Major Science and Technology Special Project of China Electric Power Engineering Consulting Group Co., Ltd. (Grant No. DG3-P01-2023), and the Natural Science Foundation of Jiangsu Province (BK20241174).
Acknowledgments
The authors would like to thank the editor and reviewers for their constructive comments.
Conflict of interest
Authors BL, CY and GX were employed by the company Shenzhen Energy Nanjing Holding Co., Ltd. Authors JZ AND JG were employed by the company China Electric Power Engineering Consulting Group Co., Ltd.
The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors declare that this study received funding from China Electric Power Engineering Consulting Group Co., Ltd. The funder had the following involvement in the study: Collection, analysis, interpretation of data.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Ali Abaker Omer, A., Li, M., Zhang, F., Hassaan, M. M. E., El Kolaly, W., Zhang, X., et al. (2025). Impacts of agrivoltaic systems on microclimate, water use efficiency, and crop yield: A systematic review. Renew. Sust. Energ. Rev. 221, 115930. doi: 10.1016/j.rser.2025.115930
Bellone, Y., Croci, M., Impollonia, G., Nik Zad, A., Colauzzi, M., Campana, P. E., et al. (2024). Simulation-based decision support for agrivoltaic systems. Appl. Energy 369, 123490. doi: 10.1016/j.apenergy.2024.123490
Chopdar, R. K., Sengar, N., Giri, N. C., and Halliday, D. (2024). Comprehensive review on agrivoltaics with technical, environmental and societal insights. Renew. Sust. Energ. Rev. 197, 114416. doi: 10.1016/j.rser.2024.114416
Fang, Z., Crimier, N., Scanu, L., Midelet, A., Alyafi, A., and Delinchant, B. (2021). Multi-zone indoor temperature prediction with LSTM-based sequence to sequence model. Energy Build. 245, 111053. doi: 10.1016/j.enbuild.2021.111053
Gao, Y., Miyata, S., and Akashi, Y. (2022). Interpretable deep learning models for hourly solar radiation prediction based on graph neural network and attention. Appl. Energy 321, 119288. doi: 10.1016/j.apenergy.2022.119288
Geng, Q., Wang, L., and Li, Q. (2024). Soil temperature prediction based on explainable artificial intelligence and LSTM. Front. Environ. Sci. 12. doi: 10.3389/fenvs.2024.1426942
Ghosh, A. (2023). Nexus between agriculture and photovoltaics (agrivoltaics, agriphotovoltaics) for sustainable development goal: A review. Sol. Energy 266, 112146. doi: 10.1016/j.solener.2023.112146
Giudici, P., Gramegna, A., and Raffinetti, E. (2023). Machine learning classification model comparison. Socio-Econ. Plan. Sci. 87, 101560. doi: 10.1016/j.seps.2023.101560
Gnayem, N., Magadley, E., Haj-Yahya, A., Masalha, S., Kabha, R., Abasi, A., et al. (2024). Examining the effect of different photovoltaic modules on cucumber crops in a greenhouse agrivoltaic system: A case study. Biosyst. Eng. 241, 83–94. doi: 10.1016/j.biosystemseng.2024.03.012
Gong, W., Zhang, L., Gong, J., Geng, X., Wang, L., Deng, L., et al. (2025). Regulatory effect of agriphotovoltaic systems with different panel heights on the thermal environment. Sci. Rep. 15, 11196. doi: 10.1038/s41598-025-96166-5
Guo, Z. and Feng, L. (2024). Multi-step prediction of greenhouse temperature and humidity based on temporal position attention LSTM. Stochastic Environ. Res. Risk Assess. 38, 4907–4934. doi: 10.1007/s00477-024-02840-x
He, Z., Jiang, T., Jiang, Y., Luo, Q., Chen, S., Gong, K., et al. (2022). Gated recurrent unit models outperform other Machine learning models in prediction of minimum temperature in greenhouse Based on local weather data. Comput. Electron. Agric. 202, 107416. doi: 10.1016/j.compag.2022.107416
Hochreiter, S. and Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9, 1735–1780. doi: 10.1162/neco.1997.9.8.1735
Huang, H., Wang, Z., Liao, Y., Gao, W., Lai, C., Wu, X., et al. (2024). Improving the explainability of CNN-LSTM-based flood prediction with integrating SHAP technique. Ecol. Inform. 84, 102904. doi: 10.1016/j.ecoinf.2024.102904
Jiang, B., Gong, H., Qin, H., and Zhu, M. (2022). Attention-LSTM architecture combined with Bayesian hyperparameter optimization for indoor temperature prediction. Build. Environ. 224, 109536. doi: 10.1016/j.buildenv.2022.109536
Jiang, Y., Zhang, L., Wang, C., Chen, L., Zhang, W., Weng, H., et al. (2025). Multi-step prediction of dissolved oxygen in fish pond aquaculture using feature reconstruction-based deep neural network. Comput. Electron. Agric. 232, 109997. doi: 10.1016/j.compag.2025.109997
Joseph, V., Baptiste, B., Baptiste, A., Sylvain, E., Martin, F., Eric, D., et al. (2025). An innovative method based on CFD to simulate the influence of photovoltaic panels on the microclimate in agrivoltaic conditions. Sol. Energy 297, 113571. doi: 10.1016/j.solener.2025.113571
Kong, X., Du, X., Xue, G., and Xu, Z. (2023). Multi-step short-term solar radiation prediction based on empirical mode decomposition and gated recurrent unit optimized via an attention mechanism. Energy 282, 128825. doi: 10.1016/j.energy.2023.128825
Landi, F., Baraldi, L., Cornia, M., and Cucchiara, R. (2021). Working memory connections for LSTM. Neural Networks 144, 334–341. doi: 10.1016/j.neunet.2021.08.030
Li, L., Liu, L., Gu, Z., Sun, Z., and Zhang, Q. (2022a). Examining the relationships between carbon emissions and land supply in China. Ecol. Inform. 70, 101744. doi: 10.1016/j.ecoinf.2022.101744
Li, M., Liu, Y., Zhang, F., Zhang, X., Zhang, Z., Omer, A. A. A., et al. (2021). Design of multi-passband polymer multilayer film and its application in photovoltaic agriculture. Chin. Opt. Lett. 19, 112201. doi: 10.3788/col202119.112201
Li, M., Wang, W., Li, H., Yang, Z., and Li, J. (2025). Monitoring of vegetation chlorophyll content in photovoltaic areas using UAV-mounted multispectral imaging. Front. Plant Sci. 16. doi: 10.3389/fpls.2025.1643945
Li, Z., Wang, P., Zhang, J., and Mu, S. (2022b). A strategy of improving indoor air temperature prediction in HVAC system based on multivariate transfer entropy. Build. Environ. 219, 109164. doi: 10.1016/j.buildenv.2022.109164
Ma, S., Ding, W., Zheng, Y., Zhou, L., Yan, Z., and Xu, J. (2024). Edge-cloud collaboration-driven predictive planning based on LSTM-attention for wastewater treatment. Comput. Ind. Eng. 195, 110425. doi: 10.1016/j.cie.2024.110425
Ma Lu, S., Zainali, S., Stridh, B., Avelin, A., Amaducci, S., Colauzzi, M., et al. (2022). Photosynthetically active radiation decomposition models for agrivoltaic systems applications. Sol. Energy 244, 536–549. doi: 10.1016/j.solener.2022.05.046
Mao, X., Ren, N., Dai, P., Jin, J., Wang, B., Kang, R., et al. (2024). A variable weight combination prediction model for climate in a greenhouse based on BiGRU-Attention and LightGBM. Comput. Electron. Agric. 219, 108818. doi: 10.1016/j.compag.2024.108818
Moon, H.-W. and Ku, K.-M. (2023). The effect of additional shading utilizing agriphotovoltaic structures on the visual qualities and metabolites of broccoli. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1111069
Nandi, A., De, A., Mallick, A., Middya, A. I., and Roy, S. (2022). Attention based long-term air temperature forecasting network: ALTF Net. Knowl-Based Syst. 252, 109442. doi: 10.1016/j.knosys.2022.109442
Niu, Z., Zhong, G., and Yu, H. (2021). A review on the attention mechanism of deep learning. Neurocomputing 452, 48–62. doi: 10.1016/j.neucom.2021.03.091
Pandey, G., Lyden, S., Franklin, E., Millar, B., and Harrison, M. T. (2025). A systematic review of agrivoltaics: Productivity, profitability, and environmental co-benefits. Sustain. Prod. Consumption 56, 13–36. doi: 10.1016/j.spc.2025.03.006
Peng, Y., Ma, X., Wang, Y., Li, M., Gao, F., Zhou, K., et al. (2023). Energy performance assessment of photovoltaic greenhouses in summer based on coupled optical-electrical-thermal models and plant growth requirements. Energy Convers. Manage. 287, 117086. doi: 10.1016/j.enconman.2023.117086
Ritter, F. (2023). Technical note: A procedure to clean, decompose, and aggregate time series. Hydrol. Earth Syst. Sci. 27, 349–361. doi: 10.5194/hess-27-349-2023
Şener, İ.F. and Tuğal, İ. (2025). Optimized CNN-LSTM with hybrid metaheuristic approaches for solar radiation forecasting. Case Stud. Therm. Eng. 72, 106356. doi: 10.1016/j.csite.2025.106356
Sakib, S., Mahadi, M. K., Abir, S. R., Moon, A. M., Shafiullah, A., Ali, S., et al. (2024). Attention-based models for multivariate time series forecasting: Multi-step solar irradiation prediction. Heliyon 10, e27795. doi: 10.1016/j.heliyon.2024.e27795
Singh, D. and Singh, B. (2020). Investigating the impact of data normalization on classification performance. Appl. Soft Comput. 97, 105524. doi: 10.1016/j.asoc.2019.105524
Stid, J. T., Shukla, S., Kendall, A. D., Anctil, A., Hyndman, D. W., Rapp, J., et al. (2025). Impacts of agrisolar co-location on the food–energy–water nexus and economic security. Nat. Sustain. 8, 702–713. doi: 10.1038/s41893-025-01546-4
Tan, Y., Liu, J., Li, W., Yin, J., Chen, H., Peng, Y., et al. (2025). Agrivoltaics development progresses: From the perspective of photovoltaic impact on crops, soil ecology and climate. Environ. Res. 266, 120540. doi: 10.1016/j.envres.2024.120540
Wang, J. and Chen, Y. (2021). Adaboost-based integration framework coupled two-stage feature extraction with deep learning for multivariate exchange rate prediction. Neural Process Lett. 53, 4613–4637. doi: 10.1007/s11063-021-10616-5
Williams, H. J., Hashad, K., Wang, H., and Max Zhang, K. (2023). The potential for agrivoltaics to enhance solar farm cooling. Appl. Energy 332, 120478. doi: 10.1016/j.apenergy.2022.120478
Wu, S., Dong, Z., Guzmán, S. M., Conde, G., Wang, W., Zhu, S., et al. (2024). Two-step hybrid model for monthly runoff prediction utilizing integrated machine learning algorithms and dual signal decompositions. Ecol. Inform. 84, 102914. doi: 10.1016/j.ecoinf.2024.102914
Yajima, D., Toyoda, T., Kirimura, M., Araki, K., Ota, Y., and Nishioka, K. (2023). Agrivoltaic system: Estimation of photosynthetic photon flux density under solar panels based on solar irradiation data using all-climate solar spectrum model. Cleaner Eng. Technol. 12, 100594. doi: 10.1016/j.clet.2022.100594
Yang, Y., Gao, P., Sun, Z., Wang, H., Lu, M., Liu, Y., et al. (2023). Multistep ahead prediction of temperature and humidity in solar greenhouse based on FAM-LSTM model. Comput. Electron. Agric. 213, 108261. doi: 10.1016/j.compag.2023.108261
Zainali, S., Lu, S. M., Fernández-Solas, Á., Cruz-Escabias, A., Fernández, E. F., Zidane, T. E. K., et al. (2025). Modelling, simulation, and optimisation of agrivoltaic systems: A comprehensive review. Appl. Energy 386, 125558. doi: 10.1016/j.apenergy.2025.125558
Zhang, L., Gong, J., Yang, Z., Wu, X., Wang, W., Yang, C., et al. (2025). Evaluating the contribution of decreasing heights of photovoltaic panels on light environment and agricultural production in agrivoltaic systems. J. Cleaner Prod. 495, 145091. doi: 10.1016/j.jclepro.2025.145091
Keywords: agrivoltaics, microclimate prediction, attention mechanism, long short-term memory, solar radiation, air temperature
Citation: Zhang L, Gong J, Wu C, Murchie EH, Gibbs AJ, Liu B, Yang C, Xu G, Zhang J, Guo J, Xiao M and Bao E (2026) Research on time series prediction of microclimate in agrivoltaic systems based on the long short-term memory and attention mechanism. Front. Plant Sci. 17:1755040. doi: 10.3389/fpls.2026.1755040
Received: 26 November 2025; Accepted: 13 January 2026; Revised: 31 December 2025;
Published: 03 February 2026.
Edited by:
Changkai Wen, China Agricultural University, ChinaReviewed by:
Yi Zhang, Shanxi Agricultural University, ChinaHongjun Xu, Xinjiang Agricultural University, China
Yanfei Cao, Northwest A&F University, China
Copyright © 2026 Zhang, Gong, Wu, Murchie, Gibbs, Liu, Yang, Xu, Zhang, Guo, Xiao and Bao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Encai Bao, MjAxODAwNTFAamFhcy5hYy5jbg==; Maohua Xiao, eGlhb21hb2h1YUBuamF1LmVkdS5jbg==
Jianhui Gong2