ORIGINAL RESEARCH article

Front. Robot. AI, 26 April 2019

Sec. Computational Intelligence in Robotics

Volume 6 - 2019 | https://doi.org/10.3389/frobt.2019.00032

Forecasting Climatic Trends Using Neural Networks: An Experimental Study Using Global Historical Data

  • 1. Field Science Education and Research Center (FSERC), Kyoto University, Kyoto, Japan

  • 2. Japan Science and Technology Agency (JST), Kawaguchi, Japan

Abstract

Climate change is undoubtedly one of the biggest problems in the 21st century. Currently, however, most research efforts on climate forecasting are based on mechanistic, bottom-up approaches such as physics-based general circulation models and earth system models. In this study, we explore the performance of a phenomenological, top-down model constructed using a neural network and big data of global mean monthly temperature. By generating graphical images using the monthly temperature data of 30 years, the neural network system successfully predicts the rise and fall of temperatures for the next 10 years. Using LeNet for the convolutional neural network, the accuracy of the best global model is found to be 97.0%; we found that if more training images are used, a higher accuracy can be attained. We also found that the color scheme of the graphical images affects the performance of the model. Moreover, the prediction accuracy differs among climatic zones and temporal ranges. This study illustrated that the performance of the top-down approach is notably high in comparison to the conventional bottom-up approach for decadal-scale forecasting. We suggest using artificial intelligence-based forecasting methods along with conventional physics-based models because these two approaches can work together in a complementary manner.

Introduction

Because climate change is the biggest environmental problem currently, it has attracted interest from many researchers and policymakers. For climate forecast, physics-based models have been widely used. General circulation models (GCMs) have been constructed by numerical representations of atmospheric physical conditions (Manabe et al., 1965). Earth system models (ESMs) are advanced models based on GCMs and are mainly used for current climatic studies (e.g., Collins et al., 2006), which considers features such as biogeochemical cycling and atmospheric chemistry. These models are based on the laws of physics such as conservation of mass, energy, and momentum. These models can be referred to as bottom-up approaches because they forecast climate using physical boundary conditions. Although the performance of ESMs is improving, these models still suffer from significant forecast uncertainties. Such uncertainties in future climate may delay amelioration and adaptation to climate change (Intergovernmental Panel on Climate Change, 2013).

There should be another approach for climate forecasting; a top-down, phenomenological approach can complement a bottom-up, mechanistic approach. The most intuitive top-down approach is using statistical models such as regression analysis. For forecasts, there are many statistical approaches for time series, such as Autoregressive Integrated Moving Average. However, a top-down approach is not mainstream for the current research as its forecast ability is believed to be limited, especially under novel environmental conditions. Still, there are several interesting examples of the top-down approach. Sévellec and Drijfhout (2018) predicted climatic trends by probabilistic forecast where trained researchers “picked” seemingly important trends and expressed them in a statistical manner.

We believe that using deep neural networks (DNNs) can be an excellent tool for the top-down approach because DNNs are proven to be very successful in artificial intelligence (LeCun et al., 2015). For example, deep learning has revolutionized computer vision (e.g., Ise et al., 2018), chemical engineering (e.g., Pławiak and Rzecki, 2015), and medical diagnosis (e.g., Yıldırum et al., 2018). It has attracted many users as it has a variety of libraries and computational environments, especially for image detection and classification. DNNs have been used for time-series analysis (e.g., Sak et al., 2014). However, these approaches are mostly unsuitable for finding large-scale features in climatic trends. Although there are a few studies where machine learning has been utilized in parametrization of atmospheric models (Gentine et al., 2018; Rasp et al., 2018), DNNs are not fully utilized in climate studies currently.

In this study, we apply a simple but novel approach to determine climatic trends. Using global historical data of mean monthly temperature for the years 1901–2016, we graphically represent the temperature dynamics and feed these images to a DNN. We classify the images in two categories: when a temperature rise is observed after the training period and when a temperature fall is observed after the training period. This study is a pedagogical experiment as we know the “correct” answer; we verify whether the DNN successfully determines the answer. We test whether this top-down approach can be a tool for phenomenological forecast of future climate. We also test the forecast performance in limited spatiotemporal scales to find specific timings and locations for which the performance is unusually good or bad. Our approach is unique because it employs a DNN-based, top-down approach for climate forecasts. Our aim is to illustrate the performance and characteristics of this new approach and to contribute to the studies on climate change.

Methods

The input data used for this experiment was the global mean monthly temperature data (CRU TS 4.01) from the Climatic Research Unit (Harris et al., 2014). This dataset covers the global terrestrial area in 0.5° × 0.5° grids (Figure 1). Using this dataset, we randomly selected a place and start time for each graphical representation. In one graphical representation, the mean monthly temperature of the selected place was retrieved, and data for 30 years (training period) from the start time were used for the graphical representation; we created an image of 60 × 60 pixels from the temperature data (Figure 2) using R 3.4.4 (R Core Team, 2018). Then, we classified the images into two categories (RISE and FALL) based on the mean temperature for 10 years after the training period. The assigned categories were used as training data paired with the training images for the DNN. The hardware to run and test the DNN had XEON E5-2630v3 CPU, 16 GB RAM, and NVIDIA Quadro K620 GPU; the operating system was Ubuntu 14.04 LTS. The settings and parameters for NVIDIA DIGITS 6.0 (Caffe version: 0.15.13), the platform for the DNN, are summarized in Table 1. We employed convolutional neural network (CNN) with LeNet (LeCun et al., 1998). Randomly selecting 25% of images for validation, accuracy, and loss are calculated in each training epoch. We set the number of training epoch to 30, with systematically decreasing learning rates.

Figure 1

Figure 2

Table 1

% validation images25
Image encodingpng
DB backendlmdb
DB compressionnone
Training epochs30
Snapshot interval1
Validation interval1
Random seednone
Solver typestochastic gradient descent
Base learning rate0.01
NetworkLeNet (LeCun et al., 1998)

NVIDIA DIGITS 6.0 settings and parameters.

In this framework, there is a choice for the DB backend (data format for the database): lmdb and hdf5. We created the database in the former format. Although we set snapshot interval = 1 to store the model for each training epoch, we only used the models from the last (30th) training epoch for analyses in this study. We set validation interval = 1 to calculate accuracy for each training epoch. We did not set the random seed, and the models for this study were individually initialized. We used the stochastic gradient descent solver (default); other choices were Nesterov's accelerated gradient, adaptive gradient, AdaDelta, adaptive moment estimation, and RMSprop. The base learning rate is the parameter to determine efficiency for learning. We set the parameter to 0.01 (default). The network we used was LeNet (LeCun et al., 1998). Other network choices were AlexNet (Krizhevsky et al., 2012) and GoogLeNet (Szegedy et al., 2014). See Discussion for the comparison.

There are 67,420 terrestrial grid cells in a resolution of 0.5°. The length of the time series is 116 years. When we systematically shift the window of 40 years (30 years for training images and 10 years to assign categories: RISE and FALL), there can be 116–40 +1 = 77 different training images. Thus, the maximum number of the training images is 67,420 × 77 = 5,191,340. To accelerate the series of experiments in this study, we randomly chose subsets of training images by assigning a random number c from 0 to 1 for each training image in each experiment. Only when c > ct, where ct is the threshold, the ith training image was chosen for the subset. For example, when ct = 0.99, approximately 1% of the training images (~51,913 images) will be selected as the subset. We analyzed the data to determine the effect of ct.

We also performed analysis to see the difference in the color schemes in R (heat.colors, topo.colors, and rainbow). Moreover, we tested the effect of upper and lower limits of the images. In the default condition, upper and lower limits of an image plot are automatically defined by the “image” function in R, according to the monthly temperature data. In this experiment, we created an alternative condition where universal highest and lowest temperatures of the entire dataset were set as the upper and lower limits of all images; global maximum and minimum temperatures for 1901–2016 were 39.5°C and −59.5°C, respectively. The images for Manaus, Brazil (Figures 2A,E,I), for example, were drawn using identical temperature data. Although the differences were only from the plotting scheme (upper and lower limits and color scheme), these differences somewhat affected the resultant classification accuracy.

Moreover, we made classification experiments with space and time restrictions. For spatial limitation, we selected four climatic zones (Amazon as tropical rainforest, Sahara as desert, Eastern US as temperate, and Siberia as boreal) and evaluated the differences in the model performance. For temporal limitation, we restricted the starting year to a single decade (1900s−1960s) and compared the model performance.

Results

The training process of the DNN in NVIDIA DIGITS 6.0 was successful in classifying images into RISE and FALL categories. The greater number of training images involved, the higher was the attained accuracy of classification (Table 2). We set MODEL3 as the reference model for the following experiments. With the hardware and software configurations for this study (see Methods and Table 1), the time required to train MODEL3 was 13 min. For MODEL4, the required time was increased to 133 min, showing that the training times were linearly correlated to the number of training images. For training of MODELs 1–24, the same settings such as DNN parameters and structure were used (see Table 1). Figure 3 shows the learning process of DIGITS 6.0 with 30 epochs.

Table 2

Model numberctSpatiotemporal restrictionsNumber of training imagesImage designAccuracy (%)
FALLRISETotal
MODEL10.9999none141328469default65.625
MODEL20.999none1,5243,5735,097default78.984
MODEL30.99none15,68536,20651,891default92.026
MODEL40.9none157,042362,676519,718default97.037
MODEL50.9999none148342490topo.colors72.656
MODEL60.999none1,5813,5885,169topo.colors77.210
MODEL70.99none15,52536,41451,939topo.colors91.441
MODEL80.9none157,154362,716519,870topo.colors96.762
MODEL90.9999none146325471rainbow70.313
MODEL100.999none1,5983,5295,127rainbow79.040
MODEL110.99none15,59336,37851,971rainbow91.818
MODEL120.9none157,028361,762518,790rainbow96.913
MODEL130.99none15,62336,02651,649fixed universal min and max88.165
MODEL140.3Amazon15,02036,85151,871default98.468
MODEL150.907Sahara17,25934,72451,983default97.551
MODEL160.1Eastern US21,88729,93551,822default97.145
MODEL170.946Siberia13,79937,99951,798default93.366
MODEL180.9231900s8,41443,60552,019default96.821
MODEL190.9231910s22,81029,48152,291default94.988
MODEL200.9231920s30,69321,40052,093default95.393
MODEL210.9231930s35,49216,16151,653default95.862
MODEL220.9231940s16,96134,89851,859default95.797
MODEL230.9231950s4,80146,81351,614default98.059
MODEL240.9231960s1,04550,72451,769default98.974

Summary of classification experiments.

For image design, the color scheme “default” is for heat.colors, and the maximum and minimum temperatures are set automatically. MODEL3 is the reference model and MODEL4 is the best model. MODEL13–MODEL24 are compared with MODEL3 because these models use approximately the same number of training images.

Figure 3

The performance of the models improved with the increase in the number of training images. However, the change in accuracy per unit number of training images gradually decreased, suggesting saturation. This suggests that researchers should consider cost performance because accuracy and computational burden have a tradeoff relationship.

We tested the effect of color schemes in R. Overall, heat.colors showed the best performance among other color schemes. This implies that the color scheme of artificially generated images can affect the performance of DNNs. We also noted that topo.colors was not the best color scheme when the number of training images was small.

Next, using the reference model (MODEL3) settings, we limited the time or space of the training data to determine the performance in specific conditions. We focused on specific climate zones (MODEL14–MODEL17): tropical rain forest (Amazon, 0-12S, 50-70W), desert (Sahara 12-26N, 10W-30E), temperate (Eastern United States, 30-42N, 70-92W), and boreal (Siberia, 60-70N, 80-140E). The accuracy of these spatially restricted models was generally higher than the reference model (MODEL3) even though the numbers of training images was almost the same. Globally, there are several different climatic zones with different interannual trends. Our results may suggest that the DNN is able to capture climatic trends more successfully when the target area is restricted to a single climatic zone. Among climatic regions, Amazon (tropical rain forests) shows particularly high accuracy, whereas Siberia (boreal forests) has relatively low accuracy. This may be caused by the heterogeneity within the climatic regions. When interannual trends in temperature are homogeneous within the climatic region, we expect higher accuracy. This is understandable from the viewpoints of artificial intelligence and computer vision. For example, to construct a model of human faces, numerous training images are required because human faces are heterogeneous. On the other hand, a homogeneous object such as the trade mark of Coca Cola can be identified relatively easily with a few training images with sample size amplification by modifying colors and shapes.

In experiments with temporal restrictions (MODEL18–MODEL24), the accuracy of the models was in the range 95.0–96.8 for MODEL18 (1900s) to MODEL22 (1940s); however, for MODEL23 (1950s) to MODEL24 (1960s), the accuracy was above 98%. This may be because in MODEL23 and MODEL24, more than 90% of the training images were classified as RISE because global warming became obvious in the latter half of the 20th century. This bias may make predictions easier. Another reason is the data quality; we assume that the quality of the data is improving gradually in the temporal range and this makes predictions more reliable. Although MODEL18 (1900s) also has a bias toward RISE, the accuracy is not particularly high possibly because of the low data quality in the early 20th century.

Discussion

In this study, we showed that, without physics-based mechanisms, a top-down forecast model can successfully predict rise or fall in temperature in a decadal timescale. Although this study has limitations in predictability because only two classes (RISE or FALL) are used for forecast, this top-down approach can be a meaningful measure for climate change studies. We suggest that this top-down approach should be used together with physics-based bottom-up approaches because these approaches can work in a complementary manner. This study is different from the preceding top-down approach using probabilistic forecast (Sévellec and Drijfhout, 2018) in that we did not need to make any assumptions related to climatology; we simply prepared two-dimensional figures using climate data and fed them to a DNN.

Different color schemes lead to some differences in the model performance (Figure 4). We noticed that the rainbow color scheme (see Figures 2I–L) gives more information to human observers but this may not be the case for an artificial intelligence system constructed using a DNN model. When number of training images is small, the performance of the default color scheme (heat.colors) is not good. With large numbers of training images, the performance of heat.colors was better than the other color schemes. These findings may be insightful for various studies on artificial intelligence.

Figure 4

This study can be enhanced with several modifications. For example, we employed LeNet among choices of CNN because of its simplicity. We did not change the default parameter settings of NVIDIA DIGITS 6.0 to show the robustness of our concept. However, other networks such as AlexNet (Krizhevsky et al., 2012) or GoogLeNet (Szegedy et al., 2014) with systematic parameter tuning and multifold cross validation can be used for further improvements in forecasting. Moreover, although this study is a two-class classification in temperature trend (RISE or FALL), it can be possible to use ca. 5 temperature classes for better forecasting. There are advantages and disadvantages for this study. Since this is a top-down approach, it is difficult to measure whether our approach would perform well under novel conditions. This limitation is universal for phenomenological forecasting. Therefore, we suggest that climatic forecasting should integrate both the top-down approach such as this study and the bottom-up approach based on physics.

Conclusion

This study uncovers several insightful research topics. For example, the models constructed in this study can be used to forecast future trends in temperature. Moreover, along with the mean monthly temperature, other meteorological datasets such as maximum and minimum temperatures and precipitation can be used to generate images for DNN training. This may further improve the model performance. We also suggest using temperature data from simulation studies such as ESM. By doing this, we may be able to show whether climate simulations generate interannual trends similar to observed data. Overall, because climate change is a very important problem, many different approaches should be used to address it. We hope that our DNN-based, top-down approach can be one such novel approach.

Statements

Author contributions

TI contributed image processing, coding DNN, and study design. YO contributed executing experiments.

Funding

This study was supported by JST PRESTO, Japan (Grant No. JPMJPR15O1) and the Link Again Program of the Nippon Foundation-Kyoto University Joint Project.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    CollinsW. D.BitzC. M.BlackmonM. L.BonanG. B.BrethertonC. S.CartonJ. A.et al. (2006). The community climate system model version 3 (CCSM3). J. Clim.19, 21222143. 10.1175/JCLI3761.1

  • 2

    GentineP.PritchardM.RaspS.ReinaudiG.YacalisG. (2018). Could machine learning break the convection parameterization deadlock?Geophys. Res. Lett. 45, 57425751. 10.1029/2018GL078202

  • 3

    HarrisI.JonesP. D.OsbornT. J.ListerD. H. (2014). Updated high-resolution grids of monthly climatic observations - the CRU TS3.10 Dataset. Int. J. Climatol. 34, 623642. 10.1002/joc.3711

  • 4

    Intergovernmental Panel on Climate Change (2013). Climate change 2013: the physical science basis, in Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, eds StockerT. F.QinD.PlattnerG. K.TignorM.AllenS. K.BoschungJ.et al. (Cambridge; New York, NY: Cambridge University Press), 1535.

  • 5

    IseT.MinagawaM.OnishiM. (2018). Classifying 3 moss species by deep learning, using the “chopped picture” method. Open J. Ecol. 8, 166173. 10.4236/oje.2018.83011

  • 6

    KrizhevskyA.SutskeverI.HintonG. E. (2012). Imagenet classification with deep convolutional neural networks. Adv. Neural Inform. Proc. Syst. 60, 10971105. 10.1145/3065386

  • 7

    LeCunY.BengioY.HintonG. (2015). Deep learning. Nature521, 436444. 10.1038/nature14539

  • 8

    LeCunY.BottouL.BengioY.HaffnerP. (1998). Gradient-based learning applied to document recognition. Proc. IEEE86, 22782324. 10.1109/5.726791

  • 9

    ManabeS.SmagorinskyJ.StricklerR. F. (1965). Simulated climatology of a general circulation model with a hydrologic cycle. Mon. Wea. Rev.93, 769798.

  • 10

    PławiakP.RzeckiK. (2015). Approximation of phenol concentration using computational intelligence methods based on signals from the metal-oxide sensor array. IEEE Sens. J. 15, 17701783. 10.1109/JSEN.2014.2366432

  • 11

    R Core Team (2018). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna. Available online at: URL https://www.R-project.org/ (accessed February 12, 2019).

  • 12

    RaspS.PritchardM. S.GentineP. (2018). Deep learning to represent subgrid processes in climate models. Proc. Natl. Acad. Sci. USA. 115, 96849689. 10.1073/pnas.1810286115

  • 13

    SakH.SeniorA.BeaufaysF. (2014). Long short-term memory recurrent neural network architectures for large scale acoustic modeling, in Fifteenth Annual Conference of the International Speech Communication Association (Singapore).

  • 14

    SévellecF.DrijfhoutS. S. (2018). A novel probabilistic forecast system predicting anomalously warm 2018-2022 reinforcing the long-term global warming trend. Nat. Commun. 9:3024. 10.1038/s41467-018-05442-8

  • 15

    SzegedyC.LiuW.JiaY.SermanetP.ReedS.AnguelovD.et al. (2014). Going deeper with convolutions. arXiv:1409.4842.

  • 16

    YıldırumO.PławiakP.TanR.-S.AcharyaU. R. (2018). Arrhythmia detection using deep convolutional neural network with long duration ECG signals. Comp. Biol. Med. 102, 411420. 10.1016/j.compbiomed.2018.09.009

Summary

Keywords

climate change, neural networks, big data, historical data, NVIDIA DIGITS, top-down approach, graphical image classification, global environmental change

Citation

Ise T and Oba Y (2019) Forecasting Climatic Trends Using Neural Networks: An Experimental Study Using Global Historical Data. Front. Robot. AI 6:32. doi: 10.3389/frobt.2019.00032

Received

08 January 2019

Accepted

10 April 2019

Published

26 April 2019

Volume

6 - 2019

Edited by

Fabrizio Riguzzi, University of Ferrara, Italy

Reviewed by

Paweł Pławiak, Tadeusz Kościuszko University of Technology, Poland; Wellington Pinheiro dos Santos, Federal University of Pernambuco, Brazil

Updates

Copyright

*Correspondence: Takeshi Ise

This article was submitted to Computational Intelligence, a section of the journal Frontiers in Robotics and AI

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics