Forecasting the Amount of Traffic-Related Pollutant Emissions by Neural Networks

Shepelev, Vladimir; Slobodin, Ivan; Gritsenko, Alexandr; Fadina, Olga

doi:10.3389/fbuil.2022.945615

OPINION article

Front. Built Environ., 14 June 2022

Sec. Transportation and Transit Systems

Volume 8 - 2022 | https://doi.org/10.3389/fbuil.2022.945615

This article is part of the Research TopicSustainable Transport SystemsView all 6 articles

Forecasting the Amount of Traffic-Related Pollutant Emissions by Neural Networks

¹Department of Road Transport, South Ural State University, Chelyabinsk, Russia
²Department of Machine-Tractor Fleet Operation, South Ural State Agrarian University, Chelyabinsk, Russia

1 Introduction

Continuous urbanization has led to a significant increase in traffic density in large cities and a concomitant growth of vehicle emissions (Davis et al., 2005; Perugu, 2019). An effective way to protect public health is air quality forecasting through early warning of the concentration of harmful substances in the air (Bai et al., 2016). The authors proposed a hybrid model to predict the concentration of air pollutants. The model is based on combining the empirical mode decomposition method, the sample entropy index, and a bidirectional neural network with long and short-term memory (Teng et al., 2022). In several-days-ahead forecasting tasks, the researchers presented an ensemble system for multi-step PM 2.5 forecasting in urban areas. The authors applied support vector regression based on the least squares method in conjunction with the capabilities of a neural network (LSTM) (Tong et al., 2019; Ahani et al., 2020; Petry et al., 2021). Studies on training parallel artificial networks based on AutoRegressive with eXternal input models (Alkasassbeh et al., 2013) are focused on modelling air pollution parameters (Feng et al., 2019). presents a method for estimating PM 2.5 transfer rates based on complex relationships between air pollutants, urban development, and meteorology. To obtain a highly accurate forecast of changes in the concentration of harmful pollutants, researchers have developed a hybrid model based on Empirical Wavelet Transform and a deep learning neural network (Kim et al., 2021; Zeng et al., 2022). Several researchers focused on the development of highly accurate emission models at the level of individual vehicles due to their diversity, driving conditions, and other factors (Motallebiaraghi et al., 2021; Makarova et al. (2020)).

Studies on the development of prediction models are generally focused on the final assessment of the concentration of atmospheric emissions, taking into account all urban pollutants (Deep et al., 2021). Thus, Kleine Deters et al. (2017), Rybarczyk and Zalakeviciute (2016), Ni et al. (2017), Han et al. (2018) propose a machine learning approach to forecast PM 2.5 concentrations based on the analysis of meteorological data, including the average regional precipitation, average daily temperature, average relative humidity, average wind speed, maximum wind speed, and pollution data. At the same time, Li et al. (2015) revealed negative correlations between other meteorological parameters and PM 2.5, with the exception of the average atmospheric pressure. A prediction model built on a dataset of many variables with relatively few observations can cause accuracy issues and restrict the performance of a deep learning model (Choi and Kim, 2021).

However, this is still challenging due to limited information on the primary source of emissions (road traffic) and the high uncertainty of dynamic processes (Adams and Kanaroglou, 2016; Shepelev et al., 2021). This study proposes a new hybrid model based on the use of a convolutional (YOLOv4) and recurrent (LSTM) neural network to improve the accuracy of forecasting the changes in the concentration of traffic-related particulate matter.

This study is aimed at developing a hybrid prediction model to quantify road traffic parameters, i.e. forecast the number of vehicles by their driving directions. To achieve this goal, we divide the task into two subtasks: collection of data on the number of vehicles and short-term forecast.

2 Data and Methods

2.1 Data Collection

An important component of urban air quality engineering is traffic-related emission management, which depends on the number and type of vehicles and their dynamic performance indicators (Wen et al., 2019; Kelp et al., 2020; Glushkov and Shepelev, 2021). According to international methodologies and approved standards (Ntziachristos et al., 2009; garantru, 2019; GOST R 56162-2019, 2019), the amount of emissions is primarily determined by the amount, type, and nature of traffic flows (Equation):

M_{LV} = \int_{0}^{t_{1}} (\frac{L_{0}}{1200} \times \sum_{1}^{k} M_{k, i}^{L} \times G_{k}) \times r_{V} (t) dt

where: L₀ is length of the road section, km; $M_{k, i}^{L}$ is specific mileage emission of the ith pollutant of the kth type of vehicle, g/km; G_k is the intensity of the traffic flow of each of the k-groups of a certain section of the road per unit of time in all lanes; k is number of vehicle groups; r_V [V(t)] is a correction factor determined by the current speed of the traffic flow V(t); t1 is the time needed to the traffic flow to cross the intersection of the length L₀.

In this study, we have focused on developing a prediction model for estimating the number of vehicles. Considering that traffic is concentrated at urban intersections, we have focused on one of the busiest intersections in the city of Chelyabinsk, Russia (AIMS eco, 2022 ). A dataset of many variables with relatively few observations can cause a dimensionality issue and restrict the performance of a deep learning model. We received a continuous data stream from a street video surveillance camera with a large viewing angle and a stable video stream (25 frames per second), supporting a 1920 × 1,080 resolution. We trained and modified the YOLOv4 convolutional neural network to collect data on traffic parameters, such as the number, trajectory, speed, and idle time of vehicles (Figure 1) (Gorodokin et al., 2020; Shepelev et al., 2020; Winter et al., 2021; Shepelev et al., 2022).

FIGURE 1

FIGURE 1. An algorithm of a hybrid model for vehicle forecasting.

2.2 Emission Forecasting

The recurrent neural network was implemented in the Python programming language using the Keras library. This is an open library, which is a high-level API facilitating the operation of neural networks and capable of working as an add-in for TensorFlow (an open machine learning library to build and train neural networks).

A dataset was formed for training and testing the neural network. We extracted the latest records from a database created by a system based on the YOLOv4 convolutional neural network and aggregated them into 20-min time intervals, in which the amount of PM2.5 emissions was summed for all transport directions and categories. Thus, we had 1,500 records for about 3 weeks. The data were divided into training and test samples in the ratio of 85%–15%.

A set of 72 24-h time intervals was chosen as the input data of the neural network used as a basis for training and forecasting. The dataset contained three parameters: the amount of PM2.5 emissions, the day of the week, and the index of the time interval in days. Thus, the shape of the two-dimensional input dataset is (72, 3).

A set of 72 time intervals with one parameter (amount of PM2.5 emissions) was chosen as the output (predicted value). The shape of the two-dimensional output dataset is (72, 1).

All input and output data were converted to the interval [0,1] for normalization. For each 20-min time interval, the amount of PM2.5 emissions was divided by the maximum value in the entire dataset equal to 30. Each weekday from the interval 0 … 6, where 0 is Monday and six is Sunday, was divided by 6. Each time interval in the interval 0 … 71, where 0 is the interval from 0:00 to 0:20 and 71 is the interval from 23:40 to 00:00, was divided by 71.

To find the best neural network configuration, we have implemented a program that generates several configurations, trains each configuration, conducts tests, and evaluates the work quality.

To evaluate the quality of the neural network, we chose the mean square error (m_se), the mean absolute error (m_ae), and the maximum absolute error (M_ae).

The following layers were chosen to create configurations:

• LSTM (recurrent layer).

• Dropout with the rate = 0.2 (a layer that prevents over-training by ignoring randomly selected neurons during training). This layer follows each LSTM layer.

• Dense (an output layer reconfiguring the data into the desired format).

We tested 32 configuration options with different parameters:

• the number of LSTM layers: 1, 2, 3, 4;

• the number of neurons in each of the LSTM layers: 50, 100, 250, 400, 600, 800, 1,000, 1,200.

The number of learning epochs: 500.

The following results were obtained as a result of training and testing.

• 1 layer:

o m_se: 0.097–0.194;

o m_ae: 1.143–1.577;

o M_ae: 9.553–10.077;

o training time: 5.17–101.75 s; o average operation time: 0.027–0.065 s;

• 2 layers:

o m_se: 0.092–0.149;

o m_ae: 1.109–1.39;

o M_ae: 8.541–10.058;

o training time: 8.42–307.76 s; o average operation time: 0.033–0.121 s;

• 3 layers:

o m_se: 0.081–0.13;

o m_ae: 1.013–1.326;

o M_ae: 8.347–9.757;

o training time: 12.92–527.34 s; o average operation time: 0.038–0.181 s;

• 4 layers:

o m_se: 0.083–0.124;

o m_ae: 1.068–1.324;

o M_ae: 8.372–10.657;

o training time: 16.77–697.52 s; o average operation time: 0.048–0.251 s

As a result of testing, configuration 24 with 3 layers and 1,200 neurons in each layer showed the minimum errors (m_se: 0.081; m_ae: 1.013; M_ae: 8.347; training time: 527.34 s; average operation time: 0.181 s). However, this configuration spends much time on training and operation. Another optimal configuration is 29 with 4 layers and 600 neurons in each layer (m_se: 0.083; m_ae: 1.068; M_ae: 9.245; training time: 212.32 s; average operation time: 0.086 s). It is close in quality to the first configuration but is 2.5 times faster.

We have also found that configurations with 50 and 100 neurons in any number of layers show poor results. Configurations with 1 and 2 layers are worse than the others. Configurations with 1,000 and 1,200 neurons spend a lot of time on training and work and do not show much better results.

As a result of the experiments, we have determined that the accuracy of forecasting the number of vehicles crossing the studied intersection varies in the range of 80%–96% using a limited number of measurements.

3 Discussion

Accuracy of the hybrid model is superior to the considered methods and allows us to continue optimizing the model to increase the depth of forecasting and taking into account the influence of buildings and additional meteorological predicts.

The proposed predictor architecture does not only use the advantages of the fast extraction of data bulks from a convolutional neural network, but also incorporates the efficiency of the long-term feature extraction of the LSTM recurrent neural network.

Future research will expand this methodology capable of short-term forecasting (per day) and allow us to proceed to long-term forecasting (up to 7 days) with potential model self-learning, based on the continuous accumulation of data history.

4 Conclusion

Thus, we can conclude that the proposed model for forecasting traffic-related pollutant emissions built on convolutional and recurrent neural networks is superior to the competing models in terms of its forecast accuracy. The main findings of this study are summarized as follows:

1. The proposed model is an effective method for improving the accuracy of deep learning neural network models to forecast the amount and concentration of air pollutants.

2. High-quality road traffic monitoring and choosing features, taking into account spatial-time correlations and characteristics of urban development, can improve significantly the predictability.

3. The YOLOv4 neural network is a good option for extracting features to forecast road traffic parameters, which provides for environmental risk management.

Author Contributions

VS: Conceptualization, methodology, investigation, visualization, writing—review and editing, supervision. IS: Software, validation, writing—original draft, AG: Data curation, writing—original draft, OF: review and editing.

Funding

This research was supported by Russian Science Foundation, grant number 22-21-20057.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Adams, M. D., and Kanaroglou, P. S. (2016). Mapping Real-Time Air Pollution Health Risk for Environmental Management: Combining Mobile and Stationary Air Pollution Monitoring with Neural Network Models. J. Environ. Manag. 168, 133–141. doi:10.1016/j.jenvman.2015.12.012