Long-Term Global Solar Radiation Prediction in 25 Cities in Morocco Using the FFNN-BP Method

This article presents different combinations of input parameters based on an intelligent technique, using neural networks to predict daily global solar radiation (GSR) for twenty-five Moroccan cities. The collected measured data are available for 365 days and 25 stations around Morocco. Different input parameters are used, such as clearness index KT, day number, the length of the day, minimal temperature T min , maximal temperature T max , average temperature T average , difference temperature Δ T , ratio temperature T-Ratio, average relative humidity RH, solar radiation at the top outside atmosphere TOA, average wind speed Ws, altitude, longitude, latitude, and solar declination. A different combination was employed to predict daily GSR for the considered locations in order to find the most adequate input parameter that can be used in the prediction procedure. Several statistical metrics are applied to evaluate the performance of the obtained results, such as coefficients of determination (R 2), mean absolute percentage error (MAPE), root mean square error (RMSE), normalized root mean square error (NRMSE), mean bias error (MBE), test statistic (TS), linear regression coefficients (the slope “a” and the constant “b”), and standard deviation (σ). It is found that the usage of input parameters gives highly accurate results in the artificial neural network (FFNN-BP) model, obtaining the lowest value of the statistical metrics. The results showed the best input of 25 locations, 12 inputs for Er-Rachidia, Marrakech, Medilt, Taza, Oujda, Nador, Tetouan, Tanger, Al-Auin, Dakhla, Settat, and Safi, seven inputs for Fes, Ifrane, Beni-Mellal, and Meknes, six inputs for Agadir and Rabat, five inputs for Sidi Ifni, Essaouira, Casablanca and Kenitra, four inputs for Ouarzazate, Lareche, and Al-Hoceima. In terms of accuracy, R 2 of the selected best inputs parameters varies between 0.9860% and 0.9920%, the range value of MBE (%) being from −0.1076% to −0.5931%, the RMSE between 0.1990 and 0.4580%, the range value of the NRMSE between 0.0355 and 0.8938, and the lowest value MAPE between 0.0019 and 0.0060%. This technique could be used to predict other parameters for locations where measurement instrumentation is unavailable or costly to obtain.


INTRODUCTION
In the last decade, energy demand has been growing internationally, with the rise in population growth and economic development. Thus, it becomes essential for the fulfillment of economic prosperity (Khan et al., 2016). Lately, the installed capacity of renewable energy has undergone a significant increase, reducing dangerous residues or the emissions of fossil fuel (Sayigh, 2017). All communities call for energy resources that fulfill essential needs, such as wellness, heating, eating, room comfort, accessibility, and connectivity and function as conceptual processes (Owusu and Asumadu-Sarkodie, 2016). Renewable energy, such as solar energy, wind energy, thermal energy, and hydroelectric energy, is widely accepted as the key tool for global sustainable economic development. Likewise, there seem to be two fundamental reasons behind the focus on sustainable energy: one is deforestation and global climate change, and the other is the possibility that fossil fuels will inevitably dry up and there will be no more oil, gas, and coal. That could also be said about nuclear energy since nuclear fuel stocks are not limitless (Kumar and Kumar, 2017). Solar photovoltaic technology is gaining importance when it comes to addressing the issue of energy development for a number of reasons: the potential to transform solar radiation directly into power, the low cost, and the decreased pollution (Yu et al., 2017). In addition, the growth of the renewable energy industry thus produces new employment opportunities and contributes to improvement in air quality. However, the installation of solar electricity, such as photovoltaic (PV), concentrated solar power (CSP), or hybrid PV thermal installations, involves high-quality, decade-long time series of solar radiation data (Polo et al., 2020;. The long-term variability of solar radiation plays a major role in predicting the risk of the potential electricity production of a solar power plant being surpassed or unachieved and impacts the financial incentives that the project is expected to achieve. In this context, the information and behavior of solar radiation are very important for many sectors, such as agriculture, architect, design, and technology (Owusu and Asumadu-Sarkodie, 2016;Kaba et al., 2018). Generally, solar radiation energy has two components: terrestrial global and extraterrestrial solar energy (Rahimikhoob, 2010). The terrestrial solar radiation has direct (H b ) and diffuse (H d ) components. The sum of Hb*cosθ and H d represents the global solar radiation (H g ); θ is the zenith angle. The measurement of energy use among houses, the development of heating ventilation and air-conditioning systems, and the use of solar energy need appropriate H g data (Zhao et al., 2019;Liu et al., 2018). In particular, both direct and diffuse components are required to receive solar radiation on a slanted surface (Muneer et al., 2007). The best method for collecting data regarding solar radiation is focused on observations employing adequate equipment, but the measurements are rather time-consuming, the costs are high and the weather stations installation and management are demanding. For example, since 2019, there have been 17 cities in China where H g and H d , respectively, have been calculated (Feng et al., 2018) and 20 cities in Morocco where the solar radiation and other parameters have been collected through the project "proper.ma" (Aarich et al., 2018). In particular, there are challenges regarding long-term background measurement data, such as station changes and incomplete observations. However, the estimation and prediction of global solar radiation play a very important role. Several models have been developed in the literature to estimate and predict global solar radiation, which means it is difficult to choose the most accepted and efficient model for a particular use. In research areas, various solar radiation models have been developed for several years and they are categorized into three major models: empirical or analytical models, machine learning models, which use computational intelligence techniques, and satellite remote sensing models, as reported in Yang et al. (2020). Among all the presented models, all researchers have agreed on the importance of the results given by the Artificial Neural Network (ANN) (Kashyap et al., 2015;Al-Dahidi et al., 2019). The ANN was reported as the most used artificial intelligence technique for the prediction in several fields (Kashyap et al., 2015;Jiang, 2009). Celik and Muneer have presented and interpreted the relationship between solar radiation and other climatic parameters of the location selected (Celik and Muneer, 2013).
Generally, the ANN is the most used technique for the forecast or estimation of several things. For example, in Turkey, Sözen et al. (2004) have used the simple ANN to predict the solar radiation intensity using various parameters, such as latitude, longitude, altitude, month's number, average sunshine duration, and average temperature. The measurement data were collected from 17 different weather stations, where measurement data from 11 stations for training algorithm and other six measurement data are used for testing. In a different review, the same neural networks were used to predict the solar potential for 12 weather stations from Turkey. The performance of the results is validated by calculating the value MAPE 6.78% and R 2 99.78% (Sözen et al., 2005). Sharifi et al. (2016) have implemented the feedforward neural network (FFNN) with four inputs in one hidden layer, one output layer, and Levenberg-Marquardt as a training algorithm. Using data from six stations, they have estimated the daily global radiation for two different countries, Iran (Tabriz and Urmia) and the United States (Grand Island, North Platte, Lincoln, and Scottsbluff). The used inputs are the daily mean clearness index (K T ), range temperature (ΔT), theoretical sunshine duration (N), and extraterrestrial radiation (R a ). The accuracy of the developed model was tested in terms of the average median absolute error (MAE 7.4%), root mean square error (RMSE 9.8%), and U95 27.35%. In Oman, Al-Alawi and Al-Hinai (1998) have used the FFNN with backpropagation training algorithm to predict global solar radiation. Eight input layers were implemented in ANN, namely, location, month, mean pressure, temperature, vapor pressure, relative humidity, wind speed, and sunshine duration. The accuracy of the results was tested by Mean Absolute Percentage Error percentage (MAPE) with 7.3%. In Morocco, Bounoua et al. (2021) have used the neural network method, 22 empirical models, and tree-based ensemble methods to estimate the daily GSR in five studies locations. In terms of accuracy, the proposed methods were evaluated using R 2 , NRMSE, and NMAE.
As a result, the random forest method outperforms the other techniques with the highest value of R 2 and lowest value of NRMSE and NMAE.
In China, Lam et al. (2008) have applied the multi-layer FNN (MLFFNN) to estimate the daily global solar radiation using measured sunshine duration for 40 cities covering nine major thermal climatic zones and sub-zones in China. Six input layers were used: latitude, longitude, altitude, day number, temperature, and sunshine ratio. The quality of the estimated results was verified by the calculation of the statistical performance. The MBE values vary between 16.9% underestimation to 18.6% overestimation, and RMSE values vary in the range from 9.1 to 20.5%. In another paper, in Zimbabwe, Chiteka and Enweremadu (2016) have implemented the FFANN to predict the global horizontal irradiation (GHI) using geographical and meteorological data for 20 locations. 12 locations were used in the ANN training algorithm and other data from eight locations were included in the simulation with seven inputs, namely, altitude, latitude, longitude, relative humidity, pressure, clearness index, and average ambient temperature, one hidden layer, and one input layer. The quality of the predicted results is carried out by calculating the performance indexes such as coefficient of determination R 2 , RMSE, MAE, and MAPE with the following values 99.894%, 0.223 Kwh/m 2 , 0.17 Kwh/m 2 , and 2.56%, respectively.
In Italy, Alsina et al. (2016) have applied the FFNN to forecast the monthly average daily global solar radiation (MDGSR) for 45 different locations. Different input parameters were taken into account, such as meteorological and geographical coordinates. The measurement data from 17 destinations have been used for training, whereas other collected data have been for testing the ANN. The most appropriate input parameters are the automatic relevance determination (ARD) and Bayesian method. The appropriate architecture of the suggested ANNs contained seven input parameters, i.e., TOA, duration of day, number of rainy days, average temperature, latitude, and altitude. The quality and accuracy of the model were tested and validated by calculating the MPE, which varied from 1.67 to 4.25%, NRMSE 1.01-1.65%, and MPBE from -0.20 to 0.03%.
In Spain, Hontoria et al. (2005) have implemented MLP to predict the solar radiation map. This study collected the measurement data from seven different lections. The aim was to generate a time series of hourly solar radiation using specific characteristics, such as temperature and latitude. In other articles in Egypt, Elminir et al. (2007) have proposed two kinds of ANN to predict the hourly and daily diffuse solar radiation using the Multi-Layer Perceptron (MLP) with backpropagation training algorithm. A different type of parameters is selected for the input of the ANN, such as hour, day, month, year, hourly global radiation, and hourly extraterrestrial solar radiation, and the output was the hourly diffuse radiation. On the other hand, three different inputs were considered input layers of the neural network, namely, global solar radiation, extraterrestrial solar radiation, and sunshine ratio for predicting daily diffuse solar radiation. The results of the study using the ANN model were compared with those of the linear regression models and were used for the models on the hourly and daily scales. In both, the ANN results are more accurate in predicting diffuse solar radiation. In India, Alam et al. (2006) have implemented the feedforward backpropagation network (FFBPN) to predict the solar radiation, and the selected inputs were the latitude, longitude, attitude, months of years, rainfall ration, mean duration of sunshine per hour, and ratio of relative humidity. The performance of the predicted results was calculated using the RMSE 1.65% and MBE 2.79%. Mehleri et al. (2010a) have used the neural network radial basis function (NNRBF) to predict solar radiation on the inclined surface by applying the optimum angle for a specific site. The NNRBF used the tilted angle and orientation as input and fuzzy logic for the training algorithm. The accuracy of the results was validated by calculating the coefficient of determining R 2 99.99%. In another study, the same researchers have used the RBFNN to predict the solar radiation on inclined surfaces and total solar irradiance on a horizontal surface. The extraterrestrial radiation, solar zenith angle, and solar incidence angle on a tilted plane are considered input data for Athens, Greece. The performance of the results is validated by the R 2 96%. On the other hand, three different ANN models were developed in Ajaccio, France, by Notton et al. (2012) to predict the hourly solar radiation on an inclined surface for Mediterranean locations. Different parameters were chosen as inputs: the declination angle, hour, zenith angle, hourly extraterrestrial horizontal irradiation, and hourly global irradiation. The first and second models of ANN predicted the hourly global solar radiation for two angles 45°and 60°and the third model for angles 45°and 90°. The obtained results showed that R 2 in the three models is equal to 99.79%, 99.82%, and 99.70%, respectively. Moreover, Celik and Muneer (2013), in Iskenderun, Turkey, have adopted a model using the generalized regression neural networks (GRNN) to estimate the solar radiation on an inclined surface. Three parameters are used here as input data: global solar irradiation on a horizontal surface, declination, and hour angles. The accuracy of the results is validated by R 2 for evaluated values as 98.7% at 14.9 kwh/m 2 .
In Algeria, a simplified hybrid model was developed by Gairaa et al. (2016) using the linear autoregressive moving average (ARMA) model combined with ANN to predict the daily global solar irradiation using measurement data from two locations. The input of the proposed model is the clearness index, while the output is the daily global solar radiation. The calculated performance error metrics were MPE 18.1 and 2.7% for the first location and MAPE 7.26 and 1.39% for the second locations. In another publication in the same country, Mellit et al. (2005) have combined the ANN with Markov transition matrix (MTM) to predict daily global solar radiation. The output data were the daily global solar radiation and the geographical coordinates, such as latitude, longitude, and altitude, were the input data of the neural network layer. The performance of the hybrid ANN-MTM model was calculated to verify the accuracy of the predicted results and was given as RMSE 8%, R 2 varying in the range of [0.9 0.92] (Belmahdi et al., 2020a).
According to the cited references above, there are various applications of the ANNs to predict/estimate or forecast global solar radiation in different locations and angles. Here, we can cite the predicted solar radiation at the horizontal and tilted surface: different meteorological and geographical coordinates have been considered by multiple authors, e.g., latitude, longitude, altitude, wind speed, clearness index, air temperature, sunshine duration, top of atmosphere (TOA) radiation, and rainfall. Table 1 illustrates the most often used input parameters in ANNs categorized into geographic coordinates and climatologic parameters.
In this context, this research is specifically intended to suggest an intelligent paradigm for evaluating and identifying the appropriate input possibilities that can include the highest potential correlations and simplifications with this proposed study, relying on the fact that predictions of solar radiation can be enhanced with ANN models using different input parameters, e.g., location, weather information, and neighboring stations, and other input parameters. According to , the solar radiation predicted by ANNs is correct compared to that by empirical models. Moreover, the ANNs have different single layers, FFBP, or combined with other techniques, and the second is the most widely used in solar radiation. It has forward relations between inputs and outputs across various hidden layers. The main contribution of this study is to investigate and predict daily global solar radiation based on various combination inputs for 25 cities in Morocco. In fact, relying on the FFNN-BP methods, the suggested method would promote correlations among all feasible input configurations in order to select the best input parameters that can provide better results with the lowest values of statistical indicator performance, which present a good approximation with measured data.

METEOROLOGICAL CLIMATE IN MOROCCO
Morocco has a wonderful opportunity to access sustainable energy options that will address two of the burning issues in the country: electricity availability and global warming challenges. Generally, Morocco has a powerful capacity in sustainable resources, primarily solar and wind, with the potential level of generation varying from 20,000 to 25,000 MW (Šimelytė et al., 2020;. On a world wide scale, Morocco holds the sixth rank for solar and the seventh place for wind. The average daily value of solar radiation intensity in Morocco is close to 5.80 kWh/m 2 /day. Meanwhile, Morocco has many sustainable energy programs being implemented and one of the biggest solar energy programs in the world is expected to cost $7.8 billion. This innovative and practical national effort focuses on creating a potential of 2,000 MW by 2020. This generation capacity constitutes 38% of mounted energy by the end of 2008 and 14% of electrical power by 2020.
In this context, it is necessary to analyze and predict the solar radiation behavior for all considered locations in Morocco. The solar radiation depends on and is affected by various parameters, such as altitude and longitude. However, this research aims to present data for 25 various regions with different climatic conditions and geographical features prevalent in all parts of Morocco. The locations are divided to six zones characterized by 1 | Current parameters adopted in the literature to predict and estimate global solar radiation.

Parameters References
Geographical coordinate Latitude Al-Alawi and Al-Hinai (1998)  Input's vector Input parameters Unit The length of the day hours Average of wind speed (W s ) m/s x 12 Altitude (   Morocco's climate is both Mediterranean and Atlantic, varying from dry to hot season combined with a cool and rainy season. The influence of the sea modulates temperature differences, tempers seasons, and raises the humidity of the environment. Bouhal et al. (2018) and Allouhi et al. (2015) have presented Morocco in six different zones adopted according to the Moroccan Agency for Renewable Energies and Energy Efficiency for concentrated solar power use, as presented in Figure 1.

ARTIFICIAL NEURAL NETWORKS
The application of the ANN has gained growing interest in recent years to address several real-world problems. It has been widely used to address several difficult issues where conventional issue approaches either struggled or proved to be ineffective or unsuccessful (Kamruzzaman et al., 2006). ANN is widely used in various fields, such as solar energy, wind (Tagliaferri et al., 2015), electricity (Khwaja et al., 2020), mechanics (Sterjovski et al., 2005), astrophysics (George and Huerta, 2018), partial differential equations (ShekariBeidokhti and Malek, 2009), polymers (Amirian et al., 2018;Kaplan, 2018) or in optimization, control, modeling, and solar radiation forecasting (Belmahdi et al., 2020b).
The neural network structure schematically mimics a human brain network, containing thousands of neurons that are specifically linked to each other. Throughout the biological context, the neuron receives a variety of inputs from other inputs and integrates them in some direction. Every other neuron is responsible for performing basic computation only. Meanwhile, the effects of the procedure are usually nonlinear, and the output is the final product. In either scenario, the configuration of an artificial neuron is easier than a biological neuron (Pacheco-Vega et al., 2001;Kamruzzaman et al., 2006). In addition, the complex architecture of the biological neural networks translates the area of the ANN. Feedforward ANNs (FF-ANNs) are formed through recurring ANNs and repeated ANNs, like feedback loops (Yalçın et al., 2020). The FNNs are those ANNs with only single-way interactions between their neurons (Bebis and Georgiopoulos, 1994).
The ANNs are considered in the research area as data-driven models (Chakraborty et al., 1992;Boland et al., 2001;Durna and Şahin, 2020) and consist of two layers considered as input and output: the first is to receive the data, while the second is the hidden layer displaying the simulated results (Yalcintas and Akkurt, 2005). The hidden layer also learns, memorizes, and creates relationships between output and input data (Moreno et al., 2011). Generally, the ANN training mechanism is related by changing the weights of every connection. The connection between the neural network layer may be investigated by weighting coefficients (w) and bias (B) for increasing processor and activation function, such that the network tries to generate the optimal performance close to the real performance, as illustrated in Figure 2 (Yilmaz and Özer, 2009).
Following the initial training phase, performance can be viewed as network integration and it is important to follow a backpropagation (BP) appropriate learning algorithm to minimize the errors during the simulation (Riahi-Madvar  . The main objective of using the BP algorithm method is to reduce the size of a network error. The BP based on gradient descent can be calculated by the following equation (Al-Alawi and Al-Hinai, 1998): where N represents the number of processors, O k is the output result in the K-th processor, and t k denotes the target values.
In the BP algorithm, there are two steps of the training process: one to reproduce the input signal across the network and the other to follow the output by adjusting the weights in the network. Those are the error indicators that are backspread to the hidden layer(s) throughout the network operation. The output layer error is being used as the basis for correcting the relation weights between both the origin and the hidden layers (Khare and Nagendra, 2007). The modification of both the relation weights between the input and the hidden layers and the corresponding recalculation of the output values will once more be a small sensitivity point since the iterative procedure is carried out before the error decreases. The BP training algorithm begins by copying and pasting the simulation data collection to the network, and the simulation data collection is made up of input and output vectors. Once certain vectors are dynamically introduced to the neural network, the following equations are implemented: where Z j represents the input to the j-th hidden layer neuron, X i is the numerical value of the i-th input vector, W ij is the weight of the i-th input layer neuron to the j-th hidden layer neuron, n and h represent the number of the input and hidden layer neurons, respectively, and b j is the bias value for the j-th hidden layer neuron.
The function in the output of the hidden layer using the sigmoid function is determined as follows: with f representing the transfer function for the hidden layer. In addition, the performance of the network will not be equivalent to the target performance during the training process. Consequently, the output error is measured as the difference between the output and the desired input of the network. If the output error does not reach the acceptable level, the network adjusts the interaction weights according to the output error significance; instead, the training data are re-entered to the network and the output of the network is determined (Battiti, 1992). The training process will proceed until the network reaches the optimal degree of tolerance. If the error value is below the tolerance range, the network should become a "qualified" network (Haykin, 1988). Then, BP algorithm is outlined in the following seven steps: 1) Initialize the network weights.
2) Analyze the first input vector from the training data and present it to the network. 3) Propagate the input vector across the network to get an output. 4) Determine the error by calculating the difference between the output and the input. 5) The error will be propagated back to the network. 6) Change the weights to eliminate the cumulative error. 7) Repeat steps (2) to (7) for the next input vector until those errors are satisfactorily minimal.
The BP algorithm includes two adaptive variables, a learning rate η and a moment term μ, which will improve the training method by eliminating the local minimum. η identifies the measure provided along the iterative gradient of descent learning method, and μ is used to encourage the gradient descent operation because it is trapped to a local minimum. Figure 3 depicts the FFNN with a backpropagation algorithm structure.
The definition "feedforward" implies that a neuron association occurs either from the neurons in the input layer to other neurons in the hidden layer or from the neurons in the hidden layer to the neurons in the output layer, and the neurons inside the layer are not interconnected.

PREDICTION MODEL ARCHITECTURE
The described FFNN-BP model is implemented to predict the daily global solar radiation in 25 different Moroccan cities (El Mghouchi et al., 2016;El Mghouchi et al., 2019). The FFNN-BP model operates with the monitoring of the learning approach and the Levenberg-Marquardt training algorithm, as illustrated by Al-Alawi and Al-Hinai (1998). The training step is the most important and essential aspect of the neural network. The training algorithm is used to adopt and fix the ANNs by training and changing the weight to find the accurate output (Sözen et al., 2005). The used structure of the FFNN-BP model is shown in Figure 4.
The data collected across different locations were separated into "training dataset" and "validation dataset" and were rendered taking into account weather conditions. The FFNN-BP model used 15 input data organized in the following vector as input parameters: Table 3 presents the vector X input variable of the network. While the ten neurons are used in one hidden layer, they are followed by one output layer. At the first step, we used the data collected from station 1 for 1 year for neural network training, which offered 365 pairs of input and outputs for training when the neural network started the training process. It is noticed that some error function was well-defined until the training process described in Eq. 1 approaches a minimum value. After this step, the predicted model was used for testing, where the data from a similar station are used. Some of the 15 input parameters used and computed are as follows: sun declination δ, the length of the day D length , TOA, and K t . They are calculated using the given formulas in the literature by the following equations (Belmahdi et al., 2020b;Rajendran and Smith, 2016): D length 24 1 − cos −1 [tan(δ). tan(λ)./π] , where N represents the Julian day, λ is the longitude, I 0 is the solar constant, h is the solar elevation, and E 0 is the correction of the Earth-Sun distance.
In order to test the quality and the performance of the used FFNN-BP model, we propose to calculate and compare several errors metrics. These error metrics are widely used in the research area to test the validity and accuracy of the results, like in photovoltaics (Al-Dahidi et al., 2019;Louzazni et al., 2020). In this article, the error metrics are organized into two categories, as given by Gueymard (2014). The details about the error metrics are mentioned in the following equations: Root mean square error (RMSE) Normalized root mean square error (NRMSE) Mean absolute percentage error (MAPE) 2. Indicators of overall performance The linear regression coefficients are as follows: where K is the total number of observations, G forecast and G exp are the forecasted and measured values of daily global solar radiation (daily GSR), respectively, G forecast and G exp represent the average of forecasted and measured values of daily solar radiation. It should be mentioned that the smaller values of MBE, RMSE, NRMSE, and MAPE show more accuracy than the predicted values and an ideal case is zero. All these statistical indicators or error metrics have the added advantage of indicating whether the estimated values are statistically significant or not at a particular confidence level. The error formulations of MBE and RMSE provide quantitative measures that have the same physical units as the dependent variable. The NRMSE and R 2 are dimensionless and the MAPE provides quantitative measures in percentage.

RESULT ANALYSIS AND DISCUSSION
The feedforward backpropagation was used to predict daily global solar radiation using the neural network toolbox of MATLAB 2012 software. Particularly, other authors have pointed out a multi-step prediction model: collecting data, creating a network, configuring the network, modeling, initializing weight and biases, training the network, and validating and using the network. One of the most important issues in this article is how the model was selected to perform a series of solar radiation values transversely.
Firstly, the Quantile-Quantile (Q-Q) graph is used to evaluate the goodness of fit for the used FFNN-BP model and recognize the performance models of global solar radiation values. For daily GSR, the Q-Q graph, presented in Figure 5, shows the comparison of simulated daily global solar radiation generated by ANN model. Against the measured data, an overestimation and an underestimation of solar radiation for 25 locations appear. Although the graphs of all study places accurately simulated the global solar radiation, this result indicates that the FFNN-BP model is accurate to be employed in all study locations of Morocco.
In the following prediction, all inputs parameters variables are applied to train and validate the FFNN-BP model. These input parameters data are divided into three categories, 80% for training, 10% for testing, and 10% for validation, to choose the most suitable input scenario of the models. It is noted that Rao et al. (Siva Krishna Rao et al., 2018) and Huang et al. (Huang and Shih, 2003) have explained the multiple steps phases of the ANN architecture. We adopt in this article the methodology used in (Huang and Shih, 2003), adopted as the first step (training). It is useful to start with a minimum number of neurons in the hidden layer of artificial Intelligent (AI). The number of these neurons will continually increase and retraining of the AI is continued, until the significant training of ANN is achieved and a suitable number is selected, as indicated in Table 4. After several trials, the ten neurons are considered a hidden layer Mba et al., 2016); the standard criteria for choosing among of ANN models are Mean Square Error (MSE). Moreover, various experimental parameters data used in the proposed models are presented in Table 4.
Binomial coefficient or combination can be estimated using the following mathematical expression: where C is the number of all combinations and m is the total number of inputs. Hence, 32,767 combinations were found. All results should be compared for selecting the optimum inputs parameters that can predict the output parameters. After several trials, we found a total of 125 considered an adequate combination to predict the daily global solar radiation of all considered study locations in Morocco. In addition, the ANN process started with the selection of input combinations. For a good fitting, we train the network more before predicting the output results. In another study, Silva et al. (Siva Krishna Rao et al., 2018) have developed six ANNs using several input data with 32 possible combinations of inputs. The input parameters are as follows: theoretical sunshine hours (S 0 ), sunshine hours (S), extraterrestrial radiation (H o ), daily minimum and maximum of temperature, difference of daily maximum temperature, and difference of daily minimum temperature. The methodology adopted in this study is presented in Figure 6. Furthermore, the selected input parameters are tested using the linear relationship with measured daily global solar radiation ( Table A1). The expression of the linear relationship is given in the following equation: For a more specific analysis, the Taylor diagram is used as a comparative method between measured and predicted values. The following statistical error metrics, correlation coefficient (R 2 ), root mean square error (RMSE), and standard deviation (σ), were combined in a polar diagram (two-dimensional). In order to compare the performance of the most suitable input-output parameters of all considered locations, all graphs based on the statistical error metrics are presented in Figure 7. The illustrated figures present the lowest value of RMSE and standard deviation (σ). The highest value of R 2 represents the accuracy relationship between measured and predicted values.
The FFNN-BP model performance is presented with the top five selected input parameters of all twenty-five cities and is shown in Table 5.
The FFNN-BP models, based on different input parameters, have the capability of finding relationships between the

Number of inputs
Combination of inputs K t , TOA, T max , T ratio x 5 K t , TOA, T max , T ratio , ΔT x 6 K t , TOA, T max , T ratio , ΔT, T Average x 7 K t , TOA, T max , T ratio , ΔT, T Average , T min x 8 K t , TOA, T max , T ratio , ΔT, T Average , T min , Long x 9 K t , TOA, T max , T ratio , ΔT, T Average , T min , Long, Alt x 10 K t , TOA, T max , T ratio , ΔT, T Average , T min , Long, Alt, Lat x 11 K t , TOA, T max , T ratio , ΔT, T Average , T min , Long, Alt, Lat, δ x 12 K t , TOA, T max , T ratio , ΔT, T Average , T min , Alt, Long, Lat, δ, D length x 13 K t , TOA, T max , T ratio , ΔT, T Average , T min , Alt, Long, Lat, δ, D length , D number x 14 K t , TOA, T max , T ratio , ΔT, T Average , T min , Alt, Long, Lat, δ, D length , D number , W s x 15 K t , TOA, T max , T ratio , ΔT, T Average , T min , Alt, Long, Lat, δ, D length , D number , W s , R h Frontiers in Energy Research | www.frontiersin.org September 2021 | Volume 9 | Article 733842 variation of each input parameter and the variation of the output parameters. This variation is related to the daily GSR using the proposed method. All statistical metrics performance of the five best inputs parameters indicates that the ANN performs better. Based on R 2 results, all five best inputs of six zones have shown high prediction accuracy. Best performance in the first zone is obtained by the lowest value of MBE, RMSE, NRMSE, MAPE, TS, and σ, respectively. It is reported that the twelve input parameters are the most selected of the following cities: Safi, Settat, Dakhla, and Al-Auin; however, the inputs D number, W s , and R h are based on MBE, RMSE, NMRSE, MAPE, TS and K t , TOA, T max , T ratio , ΔT, T average , T min , Alt, Long, δ, and D length , which have shown excellent estimation. Rabat and Agadir present six input parameters for Kenitra, Casablanca, Essaouira, and Sidi Ifni and five input parameters were selected as the most suitable. For the second zone, the selected input parameters are twelve for Tanger, Tetouan, and Nador, whereas four input parameters are considered as the best input parameters of Al-Hoceima and Laareche. For the third zone, twelve parameters were found for Oujda and Taza as the best input data. For the other locations, seven parameters are found as the best inputs. One site in the fourth and five zones shows that the twelve parameters are the best inputs. The other zones present two best inputs. For Ouarzazate, there are four parameters and for Er-Rachidia, there are twelve parameters. According to the results, it is suitable to obtain the founded input parameters as basic information for developing new solar radiation based on different meteorological data and parameters coordinates. Figure 8 presents the scattered plot of daily GSR values predicted by the proposed model of the best-found parameters (twelve, four to seven inputs) with measured daily GSR values. The figures show that the lowest values of daily GSR below demonstrate that the perfect fit was very close to the observed values. The showed values indicate that the FFNN-BP model shows good interpretation and good practicality for daily GSR.     Table 6 presents a brief comparison of multiple studies for different locations using several meteorological, astronomical, and geographical data as input parameters between the present study and others previously cited in the literature in order to predict the daily global solar radiation. The comparison metric is based on the coefficient of determination (R 2 ). Therefore, most of the cited studies for the prediction of daily global solar radiation used different algorithms have generally presented close results to another one. According to the statistical metrics, it can be noted that the highest value of the coefficient of determination corresponds to the appropriate models. As we can see from Table 6, it clearly appears that the current study presents the appropriate model compared to other studies cited in the literature. The R 2 value of the proposed model is 99.20, which ensures that the proposed model performs better than other existing methods. Figure 9 ranks the twenty-five studied locations in terms of predicted outputs. The best results of the selected input parameters for Ouarzazate, Al-Hoceima, and Lareche cities are K t , TOA, T max , and T ratio . The best results of the selected input parameters for Kenitra, Casablanca, Essaouira, and Sidi Ifni are K t , TOA, T max T ratio , and ΔT. The K t , TOA, T max T ratio , ΔT, and T average are the selected input parameters for Rabat and Agadir city. The best seven input parameters for the considered location (Meknes, Beni-Mellal, Ifrane, and Fes) are K t , TOA, T max T ratio , ΔT, T average , and T min . Er-Rachidia, Marrakech, Midelt, Taza, Oujda, Nador, Tetouan, Tanger, Al-Auin, Dakhla, and Settat City are the locations considered for twelve input parameters (K t , TOA, T max T ratio , ΔT, T average , Alt, Long, Lat, δ, D length , D number , W s , and R h ). The analysis indicator shows that the increase in input numbers improves the accuracy and adaptability of the ANN model.
In order to compare the agreement between measured daily average global solar radiation and the predicted one for twentyfive Moroccan selected locations, the relative difference values are 4.84 and 6.75% for the ANN model and the mean, max, min value of the predicted and measured model are 5. 493, 5.09, 5.19 and 5.796, 5.39, and 5.49, respectively, as presented in Figure 9, revealing the high accuracy of the ANN model in simulation. The selected model also seems well adapted to predict daily global solar radiation in other study locations successfully.

CONCLUSION
The article presents the FFNN-BP model and its application in predicting the daily global solar radiation. A feedforward backpropagation is employed using different metrological and geographical coordinates. The proposed neural network runs with ten neurons and the log-sigmoid transfer function of the hidden layer and the output layers are employed for the prediction model. Geographical and meteorological parameters, such as K t , TOA, T max T ratio , ΔT, T average , T min , Alt, Long, Lat, δ, D length , D number , W s , and R h , have been used as input variables and daily global solar radiation as outputs variables. Each step was increasing the number of inputs to determine the most accurate and suitable parameters. As the results have shown, all parameters are qualified to demonstrate their adaptability and efficiency for the study weather conditions. The best twelve, seven, six, five, and four input parameters with the proposed model in this study can also be preferred to predict daily GSR with high logical accuracy and faster performance analysis. This obtained result is an indication of a good matching between measured and predicted global solar radiation. Following the same procedures, this technique could be used to predict other outputs data of different locations where measurement instrumentation is unavailable or costly to obtain. The statistical error metrics calculated for each predicted value and the Taylor diagrams show the good ability of the proposed models in predicting daily solar radiation. This result can be useful for installing several solar energy capacities, such as photovoltaic farms or solar collectors.

DATA AVAILABILITY STATEMENT
The raw data supporting the conclusions of this article will be made available by the authors without undue reservation.

AUTHOR CONTRIBUTIONS
BB collected the data, conceived the analysis procedure, and provided the analysis tools, the methodology, and the software. BB and ML designed the analysis procedure and supervised it. BB and ML was responsible for the original