A functional autoregressive approach for modeling and forecasting short-term air temperature

Shah, Ismail; Mubassir, Pir; Ali, Sajid; Albalawi, Olayan

doi:10.3389/fenvs.2024.1411237

ORIGINAL RESEARCH article

Front. Environ. Sci., 22 May 2024
Sec. Environmental Informatics and Remote Sensing
Volume 12 - 2024 | https://doi.org/10.3389/fenvs.2024.1411237

A functional autoregressive approach for modeling and forecasting short-term air temperature

Ismail Shah^1,2*^†

Pir Mubassir²^†

Sajid Ali²^†

Olayan Albalawi³*^†

¹Department of Statistical Sciences, University of Padua, Padua, Italy
²Department of Statistics, Quaid-i-Azam University, Islamabad, Pakistan
³Department of Statistics, Faculty of Science, University of Tabuk, Tabuk, Saudi Arabia

A precise forecast of atmospheric temperatures is essential for various applications such as agriculture, energy, public health, and transportation. Modern advancements in technology have led to the development of sensors and other tools to collect high-frequency air temperature data. However, accurate forecasts are challenging due to their specific features including high dimensionality, non-linearity, seasonal dependency, etc. To address these forecasting challenges, this study proposes a functional modeling framework based on the components estimation technique by partitioning the air temperature time series into deterministic and stochastic components. The deterministic component that comprises daily and yearly seasonalities is modeled and forecasted using generalized additive modeling techniques. Similarly, the stochastic component that accounts for the short-term dynamics of the process is modeled and forecasted by a functional autoregressive model, autoregressive integrated moving average, and vector autoregressive models. To evaluate the performance of models, hourly air temperature data are collected from Islamabad, Pakistan, and one-day-ahead out-of-sample forecasts are obtained for a complete year. The forecasting results from all models are compared using the root mean squared error, mean absolute error, and mean absolute percentage error. The results suggest that the proposed FAR model performs relatively well compared to ARIMA and VAR models, resulting in lower out-of-sample forecasting errors. The findings of this research can facilitate informed decision-making across sectors, optimize resource allocation, enhance public safety, and promote socio-economic resilience.

1 Introduction

Air temperature is a crucial meteorological parameter that measures the level of heat or coldness in the air. It is essentially a measure of the kinetic energy, or energy of motion, of the gases that make up the air. The acceleration of the molecular movement of gases corresponds directly to an increase in air temperature (Spiridonov and Ćurić, 2021). Various factors, including solar radiation, atmospheric pressure, and the presence of greenhouse gases, influence this kinetic energy.Air temperature is a fundamental aspect of weather that plays a major role in many areas of our lives including agriculture, energy consumption, public health, and transportation. Accurate air temperature forecasting benefits different stakeholders by providing critical information for decision-making, resource allocation, public safety improvement, and social and economic resilience. The availability of accurate and reliable air temperature forecasts over a short period of time can facilitate farmers, energy, transport, urban planners, and other decision-makers in other sectors to make informed decisions on crop cultivation, energy demand management, traffic management, infrastructure maintenance and many more. Moreover, they help to optimize resource allocation in areas such as energy and water management. For example, utilities can adjust the production and distribution of electricity based on the expected temperature changes. At the same time, water resources managers can plan more effectively for irrigation and water supply management. On the other hand, reliable forecasts of atmospheric temperatures play an important role in public safety, especially in anticipating and preparing for extreme weather events such as heat waves and cold periods. Having accurate forecasts, emergency response agencies, health facilities, and local authorities can take proactive measures to protect vulnerable populations, prevent heat-related diseases, and reduce the impact of extreme temperatures on public health and infrastructure. In addition, forecasting air temperature can contribute to improving social and economic resilience, enabling communities and companies to better anticipate and mitigate the impact of temperature fluctuations. For example, tourism and hospitality companies can adapt their activities according to weather forecasts, while urban planners can design heat-resistant infrastructures to mitigate the impact of urban heat islands (Ostro et al., 2010). Moreover, accurate temperature forecasting plays a significant role in the context of sustainable development goals (SDGs). For example, SDG 13 refers to climate action, which focuses on urgent measures to combat climate change and its impact. A better air temperature forecast is essential to understand climate patterns, predict extreme weather events, and implement strategies to mitigate climate change. In addition, SDG 11, i.e., sustainable cities and communities, aims to promote inclusive, secure, resilient, and sustainable cities. Improvements in air temperature forecasting help urban planners and policymakers design climate-resistance infrastructure, develop heat mitigation strategies, and improve overall city quality of life (United Nations, 2015).

Forecasting hourly air temperature is tricky due to the ever-changing atmospheric factors like sunlight, clouds, wind, and the land’s shape that can significantly affect temperatures. The interaction of these elements can lead to swift temperature changes, making predictions challenging. Moreover, temperature can follow various patterns depending on the location and the time of the year. Air temperature typically follows a daily cycle, with the warmest temperatures occurring in the afternoon and the coolest in the early morning. This phenomenon arises from the earth’s rotation, resulting in varying levels of solar radiation received by different regions of the planet throughout the day. Air temperature also exhibits a seasonal cycle, peaking during summer and reaching its lowest points during winter. This variation is attributed to the earth’s axial tilt, leading to variations in solar radiation received by different regions throughout the year (Yan et al., 2014; Zhu et al., 2022).

Air temperature forecasting is an essential task in many fields of study, and, in the past, many researchers have proposed several methods and techniques to model and forecast air temperature (Asha et al., 2021; Astsatryan et al., 2021; Liu et al., 2021; Ozbek et al., 2021). For instance, Chen et al. (2018) used the seasonal ARIMA (SARIMA) model for predicting monthly mean temperature. The temperature data were collected hourly from a weather station in Nanjing, China, from January 1951 to December 2017. The study evaluated the forecasting accuracy of the proposed model by computing the mean squared error (MSE) of the forecasted values for the period 2015 to 2017. The result concluded that the proposed model demonstrated better forecasting accuracy. Curceac et al. (2019) conducted a study on short-term air temperature prediction using a nonparametric functional (NPF) model and a SARMA model using the air temperature data from the United Arab Emirates (UAE). The data span a period of 29 years, ranging from 1982 to 2010. Forecasts for 1–24 h are obtained from both models, and results are summarized using the MSE, root MSE (RMSE), relative root mean squared error (RMSEr), mean bias (BIAS), and relative mean bias (BIASr). The study’s findings indicated that the SARMA model outperformed during the initial 6 h of a day, while the NPF was more accurate for forecasting durations ranging from 7 to 24 h. Zahroh et al. (2019) presented a study on predicting the daily maximum and minimum air temperatures using the long short-term memory (LSTM) network model and examines the impact of key parameters such as hidden layers, neurons, epochs, and the stochastic gradient algorithm on the accuracy of temperature forecasts. Roy (2020) studied three different models, namely, multilayer perceptron (MLP), LSTM, and a combination of convolutional neural network (CNN) and LSTM to forecast one-day-ahead mean temperature for the next 10 days.

As machine learning techniques are robust and flexible and can account for different features in the data, they are widely used for air temperature forecasting (Agrawal et al., 2012; Kumari et al., 2012; Hossain et al., 2015; Nadtoka and Balasim, 2015; Salcedo-Sanz et al., 2016). For example, Ustaoglu et al. (2008) forecasted the average daily, maximum, and minimum temperature from two meteorological stations, Sakarya and Geyve, in Turkey, ranging from 1989 to 2003, with a total of 5,468 days. For this purpose, three different artificial neural networks (ANN) models, feed-forward back-propagation (FFBP), generalized regression neural network (GRNN), and radial basis function (RBF), were studied. The performance of the ANN models was also compared to the multiple linear regression (MLR) model. The study results suggested that FFBP and RBF models performed superior to GRNN and MLR in predicting daily minimum and maximum temperature. Assuming nonlinearities in the temperature data, Abhishek et al. (2012) used different ANN models with different hidden layers and neurons to forecast the daily maximum temperature for Toronto, Canada, for 1 year. The study found that the ANN model with five hidden layers, 10 or 16 neurons per layer, and ten sigmoid transforms was more effective in predicting weather patterns than only one hidden layer.

The crucial affect of air temperature cannot be ignored in fields like, agriculture, energy, consumption, public health, and transportation. For example, Ali et al. (2013) investigated the relationship between extreme temperatures and electricity demand in Pakistan, finding a positive correlation between the two variables. This implies that as temperatures rise, electricity demand also increases. The study further revealed that this relationship is stronger in urban areas than in rural regions. McFarland et al. (2015) focused on the intricate relationship between rising air temperatures and the performance of the electric grid in the United States (US). As temperatures increase, the grid’s ability to transmit electricity decreases while the demand for electricity increases. From a health perspective, as climate change worsens, its impact on human health becomes more apparent, especially through the combined effects of rising temperatures and worsening air pollution. This results into more heat-related illnesses, respiratory and heart diseases, and the spread of infectious diseases. In addition, it emphasizes that these health problems affect vulnerable populations, such as the elderly, children, and those living in poverty, to a greater extent (Lou et al., 2019). Research in air temperature forecasting is ongoing, driven by the need for more precise and dependable predictions. As statistical models such as time series models get more advanced and data collection technology improves, we can expect more accurate temperature forecasts (Haris et al., 2022; Nandi et al., 2022; Ozbek et al., 2022).

Hourly air temperature forecasting is a challenging task due to the dynamic nature of the atmosphere and the need for precision. Local temperature variations can be significant, and forecasting methods must be able to account for these variations. The traditional forecasting models, including multivariate and univariate, contain many limitations when applied to such datasets. For example, they can only be used to obtain forecasts for a precise time period. In addition, they are less efficient when the data is high-dimensional. The inherent smoothness as well as other properties of the data cannot be used with the classical forecasting models. To overcome these issues, this research proposed a functional time series approach for hourly air temperature forecasts. Within the functional approach, the daily air temperature profile is considered a single functional datum, and unlike the traditional methods, the forecast can be obtained for ultra-short periods. Functional data may or may not be independent of each other and are useful because the derivatives are available for further analysis. Since it is a curve and not like a scalar quantity, being a single datum, the problem of multicollinearity is automatically resolved. It also solves the problem of high dimensionality and removes the noise from the data. Furthermore, it utilizes the inherent smoothness of the data. Functional data analysis (FDA) techniques have been used in various fields, such as bio-statistics, econometrics, engineering, energy, and other sciences (Campbell et al., 2006; Leng and Müller, 2006; Bonner et al., 2014; Jan et al., 2022; Shah et al., 2022). However, the FDA has been less explored in the context of environmental variables forecasting. In addition, neither the proposed model nor the component estimation technique has been used for the air temperature data from the considered site. Furthermore, the proposed model is compared to classical time series models to assess their performance.

The remaining sections of this manuscript are arranged as follows. A brief introduction to the FDA, along with the proposed functional autoregressive model, is provided in Section 2. The general modeling framework and the two competitor models are discussed in Section 3. An empirical investigation of the proposed model and competitors is conducted in Section 4. Finally, the concluding remarks are given in Section 5.

2 Functional Data Analysis

The term “Functional Data Analysis” (FDA) was first introduced by Ramsay in 1982 (Ramsay, 1982) and several traditional statistical tools have been adapted and extended to suit the framework of the FDA (Ferraty, 2006). The FDA is a way of looking at data that are curves, shapes, or patterns rather than just discrete values. Instead of thinking about data as discrete points, like dots on a graph, FDA treats data as curves or functions. This is a convenient way of dealing with information that changes smoothly over time or space (Ramsay and Silverman, 2005).

In general, functional data is gathered on discrete points; however, the frequency of collected data is often very high, and thus, they are easily converted to functional objects. Typically represented by curves, the functional data is constructed using a suitable basis functions system. A system of basis functions denoted as y(j), is defined as a collection of functions that can be expressed as a linear combination of coefficients C_k and basis functions ϕ_k, i.e.,

y (j) = \sum_{k = 1}^{K} C_{k} ϕ_{k} (j), j \in J (1)

where C_k represents the coefficients matrix, and ϕ_k represents the known basis functions. The number of basis functions used to construct the functions is an important issue in the FDA. A penalized residual sum of squares criterion is generally employed to determine the optimal number of basis functions. This criterion balances the smoothness of the curve and avoids an inadequate fit to the data. The argument values j are the discretized points where the function is evaluated in the J domain (Ramsay and Silverman, 2005). For simplicity, the notation (j) will be dropped from the function where the notion is clear.

One of the commonly used basis functions is called the Fourier basis functions which are generally employed when dealing with periodic data. The functions are constructed by linearly combining sine and cosine functions of increasing order and can be expressed mathematically as

y (j) = c_{0} + c_{1} \sin ω j + c_{2} \cos ω j + c_{3} \sin 2 ω j + c_{4} \cos 2 ω j + \dots (2)

where c₀ is a constant term and c₁, c₂, …, c_k represent the coefficients associated with K basis functions. Here, the number of basis functions is always odd due to including one constant term. The parameter ω specifies the period as 2π/a where a represents the periodicity of the series. An example of a Fourier basis function with k = 10 is plotted in Figure 1.

Figure 1

Figure 1. Fourier basis functions with k = 10 and a constant basis function.

2.1 Functional autoregressive model

The functional autoregressive model (FAR) is a statistical technique that helps us to understand how curves change over time when studying functional time series (FTS). It is the extension of the traditional autoregressive (AR) model but in a functional framework. The FAR model assumes that the current state of the function depends on its own past state. This research work used the FAR model of order one within the framework of a Hilbert space $H$ . This approach provides the inherent structure of our data, where observations are not discrete values but dynamic functions evolving over time.

The model is defined within a separable Hilbert space $H$ with the inner product ⟨⋅, ⋅⟩ and the generating norm ‖⋅‖. For specificity, we focus on the Hilbert space L² [0, 1], although our considerations readily extend to other L²-spaces. A sequence $Y_{t}, t \in N$ , of Hilbert-random variables, is termed an Autoregressive Hilbertian Process of order 1 (ARH (1)) or FAR(1) if it is strictly stationary and satisfies

Y_{t} - μ = ψ (Y_{t - 1} - μ) + ε_{n} for n \in N (3)

where Y_t represents functions in L² [0, 1], ɛ_n is a strong $H$ -white noise with zero mean and finite second moment $(E {‖ε_{n}‖}^{2} < \infty)$ , ψ denotes a bounded linear operator, and μ is the functional mean of the process (Bosq, 2000). For simplicity, μ is assumed to be zero, though this may not be the case in practical applications. The primary advantage of this model is its capability to forecast Y_t+1 using ψ(Y_t). The autoregressive operator ψ: L² [0, 1] → L² [0, one] is supposed to be a bounded linear operator. The following section describes the estimation of the operator ψ within the Hilbert space $H$ .

2.1.1 Estimation of the operator ψ

In estimating the autoregressive operator, ψ within the FAR model, specific assumptions must be addressed to ensure a stationary solution. In particular, two key assumptions are crucial for establishing the existence of such a solution. The first assumption is the presence of an integer, s₀ ≥ 1 such that $‖ ψ^{s 0} ‖_{L} < 1$ . The second assumption requires the existence of a > 0 and 0 < b < 1 such that ‖ψ^s‖_L ≤ ab^s for all j ≥ 0. Under certain conditions, these assumptions guarantee the existence of a unique strictly stationary solution, as demonstrated in (Bosq, 2000).

It is crucial to emphasize that the estimation of ψ cannot rely on likelihood due to the non-existence of the Lebesgue measure in non-locally compact spaces, and the concept of density is unavailable for functional data. Instead, the classical method of moments is employed. The estimation of ψ is performed as ψ = CΓ⁻¹, where Γ = E (Y_t ⊗ Y_t) and C = E (Y_t ⊗ Y_t+1) represents the covariance and cross-covariance operators of the process, and ⊗ is the Kronecker product. The estimates of these operators are denoted as $\hat{Γ}$ and $\hat{C}$ , respectively.

Without loss of generality, it is assumed that the mean of the process E (Y_t) = 0 is known. The sample versions of the covariance and cross-covariance operators, denoted as $\hat{Γ}$ and $\hat{C}$ , are given by:

\hat{Γ} = \frac{1}{t} \sum_{t = 0}^{t} Y_{t} \otimes Y_{t}

and

\hat{C} = \frac{1}{t} \sum_{t = 0}^{t - 1} Y_{t} \otimes Y_{t + 1}

The covariance operator Γ possesses key properties, such as being symmetric, positive definite, and compact. It is decomposable into eigenvalues λ_l and ν_l, respectively. However, Γ⁻¹ is not a bounded operator. To overcome this limitation, a practical solution is introduced, involving the use of the m most significant empirical functional principal components (EFPCs) as surrogates for unknown population principal components. This leads to the expression:

{\hat{Γ}}^{- 1} (y) = \sum_{z = 1}^{m} {\hat{λ}}_{z}^{- 1} ⟨ y, {\hat{ν}}_{z} ⟩ {\hat{ν}}_{z} = {\hat{Γ}}^{†} (y) .

Transitioning to the context of the scalar autoregressive process with one lag, FAR(1), a relation emerges when multiplying the equation by Y_t as

Y_{t} \otimes Y_{t + 1} = Y_{t} \otimes (ψ Y_{t}) + Y_{t} \otimes ϵ_{t + 1} = ψ Y_{t} \otimes Y_{t} + Y_{t} \otimes ϵ_{t + 1} .

Taking into account the definitions of covariance and cross-covariance operators within the framework of FAR(1) and accounting for the vanishing of the ϵ term when expectations are considered, we can express the relationships as follows:

C = ψ Γ and ψ = C Γ^{- 1} .

The estimation of ψ is then defined as:

{\hat{ψ}}_{t} (y) = \frac{1}{t - 1} \sum_{k = 1}^{t - 1} \sum_{z = 1}^{m} \sum_{i = 1}^{m} {\hat{λ}}_{z}^{- 1} ⟨ y, {\hat{ν}}_{z} ⟩ ⟨ Y_{k}, {\hat{ν}}_{z} ⟩ ⟨ Y_{k + 1}, {\hat{ν}}_{i} ⟩ {\hat{ν}}_{i} .

This expression is obtained by incorporating an additional smoothing step on Y_t+1 and ${\hat{ν}}_{z}$ . Notably, the empirical eigenfunctions are recognized for their asymptotic convergence to the population eigenfunctions.Once the estimator $\hat{ψ}$ of the population parameter ψ is obtained, it becomes crucial to assess its optimality in accurately estimating the true parameter regarding the FAR parameter ψ. Didericksen et al. (2012) demonstrated that the proposed predictor is best in terms of the MSE and Mean Absolute Error (MAE). This is evident as the prediction error of this estimator is similar to that of the unfeasible predictor ψ(y), given by a suitably chosen m.

3 Modeling framework

This section describes the general modeling framework used for the prediction and understanding of the temporal dynamics of hourly air temperature. In addition, it also provides the details about the competing models, i.e., ARIMA and VAR, that are used in this study.

3.1 The model

This research focuses on the crucial task of one-day-ahead hourly air temperature forecasting which is a significant challenge due to the inherent complexities of atmospheric dynamics. These complexities encompass daily and yearly seasonality, non-stationarity, non-linearity, and diverse influencing factors. To accurately capture them in the model, the air temperature series is partitioned into deterministic and stochastic components and are modeled separately. To be more precise, let S_t,j represents an air temperature for day $t (t \in N)$ and hour j (j = 1, …, 24). Then, the dynamics of this series can be modeled as

S_{t, j} = D_{t, j} + Y_{t, j} (4)

where D_t,j comprises of deterministic components and Y_t,j represents the stochastic component of the series.

The deterministic component captures predictable patterns like daily and annual seasonalities. One way to deal with daily seasonality is to treat each hour series separately which is adopted in this study (Lisi and Shah, 2020). On the other hand, the annual seasonality A_t,j is modeled and forecasted by using a smooth function of time. Mathematically, it can be written as

D_{t, j} = f (A_{t, j}) (5)

where the function f represents a smooth function of time estimated through cubic smoothing splines. Generally, cross-validation techniques are used to select the number of knots when fitting the cubic smoothing splines (Eilers and Marx, 2010). The stochastic component accounts for unpredictable fluctuations and residual behavior. Once the deterministic component is modeled and forecasted, the stochastic component is obtained as

\begin{array}{l} Y_{t, j} = S_{t, j} - {\hat{D}}_{t, j} \\ Y_{t, j} = S_{t, j} - {\hat{A}}_{t, j} . \end{array}

The stochastic component is modeled and forecasted through the proposed FAR(1) and two competing models given in Section 3.2. In the case of FAR(1), the component Y_t,j is first converted to daily functional trajectories using Eq 1 and the model is applied to functional profiles. In the case of the VAR model, the stochastic component Y_t,j is used as a vector of 24 variables, whereas each hourly series is treated independently in the case of ARIMA. Once both components are estimated, the final forecast is obtained by combining the individual forecasts as

{\hat{S}}_{t + 1, j} = {\hat{D}}_{t + 1, j} + {\hat{Y}}_{t + 1, j} (6)

3.2 Competing models

This section describes two competing classical time series models whose results are compared with the proposed FAR model.

3.2.1 Vector autoregressive model (VAR)

The VAR model is an effective tool for examining the dynamic changes of multiple time series variables over time. It was first proposed by Christopher Sims and Thomas Sargent in the 1980s and has become widely used as it can capture complex relationships between different variables Sargent (1984). The VAR model assumes that the present value of each variable is impacted by its past values as well as the past values of all other variables within the system. The VAR model of order “p” is represented as follows.

Y_{t} = α + \sum_{r = 1}^{p} ψ_{r} Y_{t - r} + ϵ_{t} (7)

where Y_t represents a vector of time series variables at time t, α is a vector of constants (intercepts), ψ represents the coefficient matrices for lag p, and ϵ_t is the vector of error terms at time t.For the estimation of parameters, techniques such as the ordinary least squares (OLS) or the maximum likelihood (ML) are generally employed. Note that for fitting a VAR model, all variables included in the time series model must be stationary. The order “p” of a VAR model is selected using different information criteria and cross validation approaches. In a VAR model, the total number of estimated coefficients are K + K²∗p, where K represents the number of coefficients for the intercepts, and K²∗p represents the number of coefficients for the lagged values of each variable up to the order “p”.

3.2.2 Autoregressive integrated moving average (ARIMA) models

The ARIMA models, also known as the Box-Jenkins models, are statistical forecasting methods that have been widely used for time series forecasting. In the context of time series analysis, the ARIMA models stand as a foundation, providing a flexible framework for predicting and understanding data that changes over time. Developed by George Box and Gwilym Jenkins in the 1970s, ARIMA models have gained widespread recognition for their ability to capture underlying patterns and trends in a wide range of time series data (Box et al., 2015).

An ARIMA model contains three components: autoregression, differencing, and moving average. The AR component of the ARIMA model captures the notion that the current value of the time series is affected by its past values and is achieved through a linear combination of past values, with the parameter “p” representing the order of the AR term, indicating the number of lagged values incorporated into the prediction equation. Stationarity is a fundamental requirement for ARIMA models to produce reliable forecasts. However, many times series data exhibit non-stationary behavior, meaning their statistical properties, such as mean and variance, vary over time. Differencing transforms non-stationary data into a stationary series by eliminating trends and seasonalities. The order of differencing “d” indicates how many times the data needs to be differenced to achieve stationarity. Finally, the MA component accounts for the influence of past errors on the current value of the time series. It suggests that the accuracy of current predictions can be enhanced by considering the discrepancies between past predictions and the actual observed values. The order of the MA term “q” determines the number of past errors considered when constructing the prediction equation (Shumway et al., 2000).The ARIMA (p,d,q) model can be expressed mathematically as follows.

Y_{t}^{d} = \sum_{r = 1}^{p} ψ_{r} Y_{t - r}^{d} + \sum_{i = 1}^{q} ϕ_{i} ε_{t - i} + ε_{t} (8)

where $Y_{t}^{d}$ represents the d-th difference of the time series Y at time t, $Y_{t - r}^{d}$ is the lagged and differenced value of the time series Y at time t-1, ψ_r (r = 1,2,…,p) and ϕ_i (i = 1,1,2,…,q) are the parameter of AR and MA, respectively, and $ε_{t} \sim N$ (0, $σ_{ε}^{2}$ ). The orders “p” and “q” are determined using information criteria or by analyzing the autocorrelation function (ACF) and partial autocorrelation function (PACF) of a stationary time series. The model parameters are generally estimated through the MLE approach.

4 Modeling and forecasting air temperature

This section provides an empirical application of the proposed modeling framework using a real dataset. Before going into details, a brief description of the dataset is given as under.

4.1 Data description

The research work used an hourly air temperature dataset collected from Islamabad, Pakistan. The dataset is collected through sensors installed at different location in Islamabad and an average value has been reported for each hour (Power, 2022). Hourly measurements capture the dynamic nature of air temperature changes, offering a more comprehensive picture than daily or monthly averages. No missing observations are present within the dataset, ensuring the integrity and reliability of the data for models training and evaluation. The dataset spans over 5-year period, ranging from 1 January 2018, to 31 December 2022, encompasses 43,824 individual observations. The dataset is plotted in Figure 2, where one can see the patterns and variations in the air temperature throughout the years. The red line distinguishes between the model estimation and the out-of-sample forecasting periods.

Figure 2

Figure 2. Hourly air temperature time series. The red line distinguishes between model estimation and out-of-sample forecasting periods.

The summary statistics listed in Table 1 provide a comprehensive overview of the hourly temperature data. The table shows that the minimum temperature recorded is −0.45 c°, indicating the presence of lower extreme values. The first quartile (Q1) is 14.54 c° and the median at 22.50 c° provides insights into the central tendency, showcasing that at least 25% of the data falls below 14.54 c° and 50% falls below 22.50 c°. The mean temperature is 21.78 c°, indicating the average value. The third quartile (Q3) at 28.61 c° signifies that at least 75% of the data falls below this point. The maximum temperature recorded is 45.51 c°, indicating the presence of higher extreme values. The variability in the dataset is reflected in the variance, calculated at 80.30404, and the standard deviation, which is 8.961,252 c°, indicates a moderate level of variability.

Table 1

Table 1. Descriptive statistics for hourly air temperature in Islamabad.

4.2 Out-of-sample forecasting

To achieve accurate hourly air temperature forecasting, the dataset was divided into training and testing sets utilizing 80/20 splits. More precisely, from 1 January 2018, to 31 December 2021 (35,064 observations covering 1,461 days) the data points were used for training various forecasting models. The remaining 20%, i.e., from 1 January 2021, to 31 December 2022 (8,760 observations, covering 365 days) served as a hold-out set for evaluating one-day-ahead out-of-sample forecasting performance for each model.

To compare the forecasting accuracy of the models, three different types of error metrics are used in the research work including MAE, RMSE, and mean absolute percentage error (MAPE) (Bibi et al., 2021). The MAE, also known as the mean absolute deviation (MAD), is determined by averaging the absolute differences between the forecasts and the actual values at corresponding time points. In mathematical terms, it is represented as follows.

\begin{array}{l} MAE = mean (|S_{t, j} - {\hat{S}}_{t, j}|) \end{array}

where S_t,j is the observed and ${\hat{S}}_{t, j}$ is the one-day-ahead forecast, for j = 1, …, 24 and t = 1, 2, …, 365. The MAPE is calculated by averaging the absolute deviation divided by the corresponding observed value, multiplied by 100. It is a relative error metric, indicating the extent of error observed in the forecast relative to the actual value. The mathematical expression for the MAPE is written as

\begin{array}{l} MAPE = mean (|\frac{S_{t, j} - {\hat{S}}_{t, j}}{S_{t, j}}|) \times 100 . \end{array}

On the other hand, the RMSE is a commonly used metric for measuring the average magnitude of the errors between predicted and actual values. It is an extension of MSE and provides a more interpretable result by taking the square root of the average squared differences. Mathematically, the RMSE is defined as

\begin{array}{l} RMSE = \sqrt{mean {(S_{t, j} - {\hat{S}}_{t, j})}^{2}} . \end{array}

The one-day-ahead out-of-sample air temperature forecasting results are listed in Table 2. These results indicate that the proposed modeling framework efficiently forecasts air temperature as it produces relatively low errors for each model. Comparing the three models, it is evident that the proposed FAR model outperforms the other two models across all metrics. The proposed FAR model achieved an MAE, MAPE, and RMSE of 1.0852, 6.115, and 1.5178, respectively, which are lower than the MAE, MAPE, and RMSE of 1.1075, 6.1320, and 1.5660, respectively of the VAR model, as well as 1.1022, 6.1198, and 1.5535, respectively of the ARIMA model.

Table 2

Table 2. One-day-ahead out-of-sample forecasting errors for FAR, VAR, and ARIMA models.

To investigate the performance of each model more deeply, one-day-ahead out-of-sample forecasting errors for each hour are calculated for each model and the results are listed in Table 3. The table indicates that the forecasting errors generally vary throughout the day. The forecasting errors, in general, are low in the initial hours of the day and are high during the final hours. In the initial hours, the performance of the VAR model is slightly better than that of the proposed FAR model. For example, the VAR model achieved the lowest MAE, MAPE, and RMSE values of 0.2744, 0.2022, and 1.5639 respectively in the first hour of the day, which is slightly better than the MAE, MAPE, and RMSE values (0.3611, 0.2556, and 1.9209, respectively) of the proposed model. However, as the day progresses, the proposed model produced better results compared to the VAR and ARIMA models, by providing lower values of MAE, MAPE, and RMSE. Note that both multivariate (FAR and VAR) models perform relatively better than the univariate (ARIMA) model. These findings can be easily noticed in Figure 3 where the hour-specific MAPE values are depicted for each model.

Table 3

Table 3. Hour specific forecasting errors using FAR, VAR, and ARIMA models.

Figure 3

Figure 3. Hour-specific MAPE values for FAR, VAR, and ARIMA models.

The one-day-ahead forecasting errors are summarized month-wise and listed in Table 4. These results indicate that the proposed model performs better, outperforming the ARIMA and VAR models in most months. The errors are relatively higher in the winter and are lower in the summer. In September, the proposed model achieved the lowest MAE, MAPE, and RMSE values of 0.8671, 0.6106, and 2.3199, respectively, outperforming both the VAR and ARIMA models. However, in January, the VAR model outperforms the proposed model, achieving lower MAE, MAPE, and RMSE values of 1.3845, 1.0592, and 13.1341, respectively. Moreover, it is noteworthy that the ARIMA model consistently performs the worst across all months. For a visual illustration of these results, the month-specific MAPE is plotted in Figure 4. Finally, it is worth mentioning that all computations were performed using the R programming environment (R Core Team, 2023) run on an Intel(R)-Core(TM) i7-4770 CPU running at 3.40 GHz.

Table 4

Table 4. Monthly Forecast Errors for Air Temperature using FAR(1), VAR, and ARIMA models.

Figure 4

Figure 4. Month-specific MAPE values for FAR, VAR, and ARIMA models.

5 Conclusion

Air temperature is a fundamental aspect of weather that plays a significant role in diverse areas of our lives, and thus, its accurate forecast is crucial. However, an air temperature time series is comprised of different deterministic and stochastic variations that make forecasting challenging. This research work proposes a functional data approach to forecast one-day-ahead air temperature. Moreover, the component estimation technique, which divides the data into deterministic and stochastic components, is used to accurately predict the temperature series dynamics. The deterministic part of the series is modeled and forecasted using smoothing splines, whereas FAR, ARIMA, and VAR models are used for the stochastic component. For empirical assessment, air temperature data for Islamabad (Pakistan) are collected and one-day-ahead out-of-sample forecasting results for a complete year are summarized using MAE, MAPE, and RMSE.The results indicated that the proposed component estimation procedure is efficient in forecasting air temperature. In addition, the functional model, i.e., the FAR model, further improves the forecasting accuracy compared to ARIMA and VAR models, resulting in lower out-of-sample forecasting errors. Finally, the multivariate models, VAR and FAR, outperform ARIMA, demonstrating their effectiveness in predicting air temperature.

Despite the valuable insights obtained from this study, recognizing its limitations is important. The current research work considers only parametric (linear) models. In addition, the dataset is used only from one location. As the current study does not account for the effects of exogenous variables in the model, it would be interesting to see their impact on forecasting air temperature using the current approach in the future. Moreover, the proposed model can be compared with machine learning approaches, generally known as nonlinear models, in a future study. Furthermore, the performance of the proposed approach can be assessed by conducting a study on other site datasets.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://power.larc.nasa.gov/data-access-viewer/.

Author contributions

IS: Conceptualization, Investigation, Methodology, Supervision, Writing–original draft. PM: Conceptualization, Formal Analysis, Validation, Writing–review and editing. SA: Data curation, Project administration, Software, Visualization, Writing–review and editing. OA: Funding acquisition, Resources, Validation, Writing–original draft.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. Open Access funding was received for the research provided by Università degli Studi di Padova/University of Padua, Open Science Committee.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The reviewer (SMMR) declared a past co-authorship with the authors IS and SA to the handling editor.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abhishek, K., Singh, M., Ghosh, S., and Anand, A. (2012). Weather forecasting model using artificial neural network. Procedia Technol. 4, 311–318. doi:10.1016/j.protcy.2012.05.047

ORIGINAL RESEARCH article

A functional autoregressive approach for modeling and forecasting short-term air temperature

1 Introduction

2 Functional Data Analysis

2.1 Functional autoregressive model

2.1.1 Estimation of the operator ψ

3 Modeling framework

3.1 The model

3.2 Competing models

3.2.1 Vector autoregressive model (VAR)

3.2.2 Autoregressive integrated moving average (ARIMA) models

4 Modeling and forecasting air temperature

4.1 Data description

4.2 Out-of-sample forecasting

5 Conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

References

People also looked at