Domain of Influence Analysis: Implications for Data Assimilation in Space Weather Forecasting

Millas, Dimitrios; Innocenti, Maria Elena; Laperre, Brecht; Raeder, Joachim; Poedts, Stefaan; Lapenta, Giovanni

doi:10.3389/fspas.2020.571286

ORIGINAL RESEARCH article

Front. Astron. Space Sci., 08 October 2020

Sec. Stellar and Solar Physics

Volume 7 - 2020 | https://doi.org/10.3389/fspas.2020.571286

This article is part of the Research TopicData-driven MHD: Novel Applications to the Solar AtmosphereView all 7 articles

Domain of Influence Analysis: Implications for Data Assimilation in Space Weather Forecasting

Dimitrios Millas^1,2^*

Maria Elena Innocenti¹^*

Brecht Laperre¹

Joachim Raeder³

Stefaan Poedts^1,4

Giovanni Lapenta¹

¹Department of Mathematics, Centre for Mathematical Plasma Astrophysics, KU Leuven, Leuven, Belgium
²Department of Physics and Astronomy, University College London, London, United Kingdom
³Institute for the Study of Earth, Oceans and Space, University of New Hampshire, Durham, NH, United States
⁴Institute of Physics, University of Maria Curie-Skłodowska, Lublin, Poland

Solar activity, ranging from the background solar wind to energetic coronal mass ejections (CMEs), is the main driver of the conditions in the interplanetary space and in the terrestrial space environment, known as space weather. A better understanding of the Sun-Earth connection carries enormous potential to mitigate negative space weather effects with economic and social benefits. Effective space weather forecasting relies on data and models. In this paper, we discuss some of the most used space weather models, and propose suitable locations for data gathering with space weather purposes. We report on the application of Representer analysis (RA) and Domain of Influence (DOI) analysis to three models simulating different stages of the Sun-Earth connection: the OpenGGCM and Tsyganenko models, focusing on solar wind—magnetosphere interaction, and the PLUTO model, used to simulate CME propagation in interplanetary space. Our analysis is promising for space weather purposes for several reasons. First, we obtain quantitative information about the most useful locations of observation points, such as solar wind monitors. For example, we find that the absolute values of the DOI are extremely low in the magnetospheric plasma sheet. Since knowledge of that particular sub-system is crucial for space weather, enhanced monitoring of the region would be most beneficial. Second, we are able to better characterize the models. Although the current analysis focuses on spatial rather than temporal correlations, we find that time-independent models are less useful for Data Assimilation activities than time-dependent models. Third, we take the first steps toward the ambitious goal of identifying the most relevant heliospheric parameters for modeling CME propagation in the heliosphere, their arrival time, and their geoeffectiveness at Earth.

1. Introduction

Solar activity affects the terrestrial environment with a constantly present but highly variable solar wind and with higher energy, transient events, such as flares and Coronal Mass Ejections (CMEs).

“Space weather” (Bothmer and Daglis, 2007) is the discipline that focuses on the impact of these solar drives on the solar system and in particular on the Earth and its near space environment.

Space weather events can have serious effects on the health of astronauts and on technology, with potentially large economic costs (Eastwood et al., 2017). The importance of space weather forecasting has grown with societal dependence on advanced space technology, on communication and on the electrical grid. For example, the Halloween 2003 solar storms that impacted Earth between 19th of October 2003 and 7th of November 2003 caused an hour long power outage in Sweden (Pulkkinen et al., 2005), forced airline flight reroutes, and affected communication and satellite systems (Plunkett, 2005). The “great geomagnetic storm” of March 13–14, 1989 caused, among other disruptions, a blackout of up to 9 h in most of Quebec Province, due to a massive failure experienced by the power grid Hydro-Quebec Power Company (Allen et al., 1989).

In order to improve space weather forecasting, accurate models of the Sun-Earth connection are needed. Such forecasts are challenging because of the complexity of the processes involved and the large range of spatial and temporal scales. Commonly the heliosphere is divided into sub-systems, where each one is simulated with a different model, such that the models feed into each other (Luhmann et al., 2004; Tóth et al., 2005). These models can be either physics-based or empirical. Empirical models (such as, in the solar domain, Altschuler and Newkirk, 1969; Schatten et al., 1969; Schatten, 1971; Wang and Sheeley Jr, 1992; Nikolić, 2019) usually require less computational resources, enabling faster forecasts. They can also serve as a baseline for physics-based models (Siscoe et al., 2004). However, empirical models lack the sophistication of more expensive first-principles based numerical models. Recently, machine learning methods have emerged, that can provide a new approach to space weather forecasting (Camporeale, 2019; Laperre et al., 2020). Most of these methods, while promising, must still undergo extensive validation.

The technique of Data Assimilation (DA) was developed to improve model predictions by properly initializing models from data and by keeping a model on track during its time evolution (Kalnay, 2003; Bouttier and Courtier, 1999; Evensen, 2009). DA methods were originally applied to atmosphere and ocean models, which exhibit a large degree of inertia. The latter is also true for the solar wind, but not for the magnetosphere-ionosphere system, which is strongly driven and dissipative. Therefore, in space weather forecasting, DA aims not only at initializing the models, but also at using information from various observations to bring the evolution of a system as predicted from a model closer to the real system evolution (Kalman, 1960; Bouttier and Courtier, 1999; Bishop and Welch, 2001; Evensen, 2009; Le Dimet and Talagrand, 1986; Innocenti et al., 2011), making up for model deficiencies in the terms of resolution and incomplete physical description.

The quantity and quality of available data is a critical factor in the effectiveness of Data Assimilation. This is the reason why fields where data can be obtained more easily and continuously have shown early successes in DA implementations. Examples of these fields are meteorology and oceanography, and, in space sciences, ionospheric and radiation belt physics (Bennett, 1992; Ghil and Malanotte-Rizzoli, 1991; Egbert and Bennett, 1996; Kalnay, 2003; Rigler et al., 2004; Schunk et al., 2004; Kondrashov et al., 2007). Examples of DA applications targeting specifically the interplanetary space environments are Schrijver and DeRosa (2003), Mendoza et al. (2006), Arge et al. (2010), Innocenti et al. (2011), Skandrani et al. (2014), Lang and Owens (2019), and Lang et al. (2020).

Representer analysis (RA) and Domain of Influence analysis (DOI) (Bennett, 1992; Egbert and Bennett, 1996; Echevin et al., 2000; Evensen, 2009; Skandrani et al., 2014), briefly summarized in section 2, are powerful statistical tools used to estimate the effectiveness of DA techniques when applied to a specific model, without assimilating actual data. Such analysis can be used in several ways. It allows us to optimize assimilation strategies, it may uncover model biases that can then be addressed by further model development, and it may be used to optimize the observation systems that provide operational data for DA. For example, RA/DOI can be used to optimize locations for solar wind monitors, such as locations proposed near the L5 Lagrangian point (Vourlidas, 2015; Lavraud et al., 2016; Pevtsov et al., 2020).

In the present paper, the RA and DOI analysis is applied to three models: the OpenGGCM magnetosphere—ionosphere model (section 3.1), two of the empirical Tsyganenko magnetosphere magnetic field models (section 3.2), and a solar wind simulation based on the PLUTO code (section 3.3). These models simulate critical sub-systems in the Sun-Earth connection with a focus on the terrestrial magnetosphere and Coronal Mass Ejection propagation.

The present paper provides insights into the locations of the terrestrial magnetosphere that should be prioritized (ideally, in absence of orbital constraints) for space weather forecasting and monitoring activity. We compare a time-dependent, physics based model (e.g., OpenGGCM) and time-independent, empirical (e.g., Tsyganenko) models in terms of the expected benefits that DA can provide. We conclude that time-dependent models should be preferentially chosen for DA. We take the first steps toward the goal if understanding the main physical parameters, close to the Sun and in interplanetary space, that control CME propagation and hence their arrival time at Earth.

This manuscript is organized as follows: in section 2 we introduce the theoretical background on RA and the DOI; section 3 discusses the application of the method to the different models; in section 4 we summarize the results and discuss potential improvements and new applications.

2. Representer Analysis and Domain of Influence Analysis

This section introduces the mathematical basis of RA and DOI analysis. The reader is referred to Skandrani et al. (2014) and references therein for an in depth derivation.

Let us start from a system described by the state variable vector $x_{t} \in ℝ^{n}$ .x_t is a vector containing the n state variables that describe the system at a time t. Let us assume that the evolution of the system can be described as a discrete-time process controlled by an evolution law A. The state of the system then evolves as follows: x_t = A(x_t−1) + w_t−1, where w ∈ ℝⁿ is process noise. The process noise is assumed to be Gaussian and with covariance matrix Q.

If a model M (for example, a simulation model) of the evolution law A is available, we can obtain, following Kalman (1960) and Evensen (2009), a prior estimate ${\hat{x}}_{t}^{-}$ of the state variable x_t through the simulation model as

\begin{array}{l} {\hat{x}}_{t}^{-} = M ({\hat{x}}_{t - 1}^{-}) + w_{t - 1} . & (1) \end{array}

Assume now that we have m observational values or measurements $z_{t} \in ℝ^{m}$ . These measurements can be mapped to the current state x_t through the so-called observation operator H ∈ ℝ^m×n, such that z_t = Hx_t + ν_t. Here, ν is the (assumed Gaussian) measurement noise, with a covariance matrix R.

It can be then shown (Bishop and Welch, 2001) that a posterior estimate of the state ( ${\hat{x}}_{t}$ ) can be obtained from the prior estimate of the state ( ${\hat{x}}_{t}^{-}$ ), obtained from Equation (1), as follows:

\begin{array}{l} {\hat{x}}_{t} = {\hat{x}}_{t}^{-} + K_{t} (z_{t} - H {\hat{x}}_{t}^{-}) . & (2) \end{array}

Here, the term $(z_{t} - H {\hat{x}}_{t}^{-})$ is called the “innovation,” and represents the difference between the measurements and their expected values, calculated by applying the observation operator to the prior state estimate. The Kalman gain K_t is obtained by minimizing the posterior error covariance matrix. This is the “correction” step of the Kalman filter, where the Kalman gain is calculated and the estimate and error covariance matrix of the posterior state are updated. The “prediction” (forecast) phase of the filter results in the calculation of the prior state estimate and prior error covariance matrix (used to compute the Kalman gain).

The prior and posterior error covariance matrices are respectively defined as

\begin{array}{l} P_{t}^{-} = 𝔼 [(x_{t} - {\hat{x}}_{t}^{-}) {(x_{t} - {\hat{x}}_{t}^{-})}^{T}], P_{t} = 𝔼 [(x_{t} - {\hat{x}}_{t}) {(x_{t} - {\hat{x}}_{t})}^{T}], & (3) \end{array}

where 𝔼 is the expected value, x_t is the “real,” unknown system state and $ϵ^{-} = ({\hat{x}}_{t}^{-} - x_{t})$ and $ϵ = ({\hat{x}}_{t} - x_{t})$ are the prior and posterior errors, calculated as the difference between the prior ( ${\hat{x}}_{t}^{-}$ )/posterior ( ${\hat{x}}_{t}$ ) state and the real state, x_t. Notice that, although these are the definitions of the error covariances, this is not how they are computed in the filter, since the real state is not known.

The formula for the calculation of the posterior state, Equation (2), can be written as

\begin{array}{l} {\hat{x}}_{t} = {\hat{x}}_{t}^{-} + r b & (4) \end{array}

where r ∈ ℝ^n×m and b ∈ ℝ^m are defined as

\begin{array}{l} r = P^{-} H^{T}, b = {(H P^{-} H^{T} + R)}^{- 1} (z - H {\hat{x}}^{-}), & (5) \end{array}

with R the measurement noise covariance matrix. The time index t has been dropped for ease of reading.

We will from now on assume that the system (and in particular, the observation operator H) is linear. Then, each column of the matrix r, denoted as r_j with j = 1, …, m, is the representer associated to a given observation z_j (remember that z is the vector with m observations), and gives a measure of the impact of that observation in “correcting” the prior state estimate. If we further assume that each observation j is located at grid point k_j, and is associated to a state variable, then each column r_j (now r_{k_j}, given the assumption mentioned above) contains the covariances (“cov”) between the prior errors at the observation point k_j and at every other grid node, for all the state variables.

Since the real state is not available for error covariance calculations, an ensemble of simulations can be used to estimate the prior errors instead. An ensemble (Evensen, 2009) can be generated by perturbing one or several of the sources of model errors. In this work, ensembles are generated for each model by perturbing the respective initial/boundary conditions. Once the ensemble is available, the covariances of the prior errors at a certain simulated time can be approximated as the ensemble covariances (“cov^ens”) of the prior errors. These in turn become the ensemble covariances of the simulated state variables, if one assumes that the prior errors are unbiased. The ensemble covariance between the state variable x and y is defined as

\begin{array}{l} {cov}^{e n s} (x, y) = \frac{1}{N} \sum_{s = 1}^{N} [(x_{s}^{-} - \frac{1}{N} \sum^{N} x_{s}^{-}) (y_{s}^{-} - \frac{1}{N} \sum_{s = 1}^{N} y_{s}^{-})] & (6) \end{array}

where N is the number of members in the ensemble, and x and y represent two state variables (notice that, for ease of reading, we indicate here two state variables as x and y, while earlier we indicated as vector x all the state variables).

In a set of simulations, the prior state variables are the simulation results at a specific time. Being able to calculate the representers associated to observations from the ensemble rather than from assimilating observations simplifies the RA significantly.

Following the discussion of the representer term r in the calculation of the posterior state, Equations (4) and (5), we now examine the term b. Assuming that the only observation point for observation j is at grid node k_j (i.e., the row j of the matrix H has only the term k_j different from zero), the element of b associated to the observation at grid point k_j, denoted as b_{k_j}, becomes, from Equation (5):

\begin{array}{l} b_{k_{j}} = \frac{z_{k_{j}} - {\hat{x}}_{k_{j}}^{-}}{c o v (ϵ_{k_{j}}^{-}, ϵ_{k_{j}}^{-}) + c o v (ϵ_{k_{j}}^{z}, ϵ_{k_{j}}^{z})}, & (7) \end{array}

where we have made use of the simplified form of the matrix H and where $ϵ_{k_{j}}^{z}$ is the observation error associated with the observation z_{k_j}.

If x_i is one of the state variables at grid node i, the correction to x_i brought by the assimilation of the measure z_{k_j}, following Equations (4), (5), (7) and some straightforward manipulation based on the definitions of covariance, variance, correlation, can be written as (Skandrani et al., 2014)

\begin{array}{l} \hat{x_{i}} - {\hat{x}}_{i}^{-} = {corr}^{e n s} ({\hat{x}}_{k_{j}}^{-}, {\hat{x}}_{i}^{-}) F (z_{k_{j}}) . & (8) \end{array}

Here, F(z_{k_j}) is the modulation factor and ${corr}^{e n s} ({\hat{x}}_{k_{j}}^{-}, {\hat{x}}_{i}^{-})$ the correlation. The correlation is computed from the ensemble, and is calculated between the state variable at node k_j and at node i. This correlation reflects how a change at node k_j, caused, e.g., by the assimilation of the measurement z_{k_j}, will influence the node i, and is what we call the DOI. The correlation over the ensemble is defined, using the dummy variables x and y for brevity, as:

\begin{array}{l} {corr}^{e n s} (x, y) = \frac{{cov}^{e n s} (x, y)}{\sqrt{var (x) var (y)}} . & (9) \end{array}

The modulation factor F(z_{k_j}) in Equation (8) depends, among other things, on the measurement z_{k_j} and on the error associated to the measure z_{k_j}. Hence Data Assimilation has to be performed to calculate this term. The ${corr}^{e n s} ({\hat{x}}_{k_{j}}^{-}, {\hat{x}}_{i}^{-})$ term reflects how large we can expect the area that will be affected by the assimilation of z_{k_j} to be. But to know how large the difference between the posterior and prior state, $\hat{x_{i}} - {\hat{x}}_{i}^{-}$ , will be, we need to know the modulation factor as well.

So now the DOI of the measurement z_{k_j} on the state variable at grid point i, x_i, can be defined as

\begin{array}{l} D O I (z_{k_{j}}, x_{i}) = {corr}^{e n s} ({\hat{x}}_{k_{j}}^{-}, {\hat{x}}_{i}^{-}) . & (10) \end{array}

One can see from its definition that the DOI can be calculated before assimilation by computing the ensemble correlation of the state variable value at the grid point k_j with that at grid point i. Dropping the i index, i.e., examining the expected impact of measurement z_{k_j} on all the state variables x at all grid points, we obtain the more general definition of the DOI as

\begin{array}{l} D O I (z_{k_{j}}) = {corr}^{e n s} ({\hat{x}}_{k_{j}}^{-}, {\hat{x}}^{-}) . & (11) \end{array}

We can then draw “DOI maps” that show the correlation between a field at grid point k_j, the “observation point,” and the other grid points.

Notice that in this derivation we have assumed, for simplicity, that the measurement z and the state variable x refer to the same field, for example, the x component of the magnetic field, or of the velocity. This simplifies the formulation of the numerator of Equation (7) and improves the readability of the derivation. Skandrani et al. (2014) shows examples where the DOI is calculated between different fields, e.g., magnetic field and velocity.

DOI analysis has the advantage that it can be calculated for all state variables and at any grid point without actual assimilation, i.e., without the need for measurements z. In order to compute the DOI at a time step t, we only require evolving the ensemble up to said time step t, and then performing the correlation over the ensemble between the state variable value at the observation point k_j and at all other grid points.

Because DOI values are derived from a correlation they are bounded between −1 and 1. |DOI| ~ 1 indicates that the field at that specific point significantly changes when the same field (or a different field, in the case of cross-correlation) varies at the observation point. |DOI| ~ 0 indicates the opposite, i.e., variation at the observation point have little or no effect. Thus, DOI analysis also provides information on how information propagates within the model, and therefore sheds light on the physical processes within the model. We will exploit this property in section 3.2.

We note that in this study we only focus on spatial correlations, neglecting temporal correlations. In other words, the following analysis (i.e., the calculation of variances, DOI, etc.) is restricted to specific instances in time, rather than examining correlations between fields at difference times as well. The dependence on time will be addressed in a future project.

The RA is applied to “artificial” data, obtained from ensembles of simulations focused on different processes of interest in the Sun-Earth connection: the interaction of the solar wind with the terrestrial magnetosphere (via OpenGGCM and Tsyganenko simulations) and the propagation of a CME-like event in the steady solar wind (PLUTO).

OpenGGCM and the Tsyganenko Geomagnetic Field Models both simulate the interaction of the solar wind with the magnetospheric system. OpenGGCM is a physics-based magnetohydrodynamic (MHD) model, while the Tsyganenko models are semi-empirical best-fit representations for the magnetic field, based on a large number of satellite observations (Tsyganenko, 1995, 2002a; Tsyganenko and Sitnov, 2005).

PLUTO is an MHD-modeling software used to simulate the propagation of a CME in the background solar wind. This software can be used to numerically solve the partial differential equations encountered in plasma physics problems, in conservative form, in different regimes (from hydrodynamics to relativistic MHD). The structure of the software is explained in Mignone et al. (2007, 2012). Full documentation and references can be found in the relevant web page: http://plutocode.ph.unito.it/.

Because the DOI analysis is an ensemble based technique the size of the ensemble and its properties matter. In order to test for sufficient size, we performed the DOI calculation using a limited, random subset of the ensemble, which we gradually expanded. We found that using at 25 runs were sufficient to obtain a consistent ensemble mean, variance, and DOI. We note, however, that this may change for different choices of simulation resolution and parameters used for the generation of the ensemble.

3. Applications

3.1. Magnetospheric Applications I: OpenGGCM

The OpenGGCM (Open Geospace General Circulation Model) is a MHD based model that simulates the interaction of the solar wind with the magnetosphere-ionosphere-thermosphere system. OpenGGCM is available at the Community Coordinated Modeling Center at NASA/GSFC for model runs on demand (see: http://ccmc.gsfc.nasa.gov). This model has been developed and continually improved over more than two decades. Besides numerically solving the MHD equations with high spatial resolution in a large volume containing the magnetosphere, the model also includes ionospheric processes and their electrodynamic coupling with the magnetosphere.

The mathematical formulation of the software is described in Raeder (2003). The latest version of OpenGGCM, used here, is coupled with the Rice Convection Model (RCM) (Toffoletto et al., 2001), which treats the inner magnetosphere drift physics better than MHD and allows for more realistic simulations that involve the ring current (Cramer et al., 2017). The model is both modular and efficiently parallelized using the message passing interface (MPI). It is written in Fortran and C, and uses extensive Perl scripting for pre-processing. The software runs on virtually any massively parallel supercomputer available today.

OpenGGCM uses a stretched Cartesian grid (Raeder, 2003) that is quite flexible. There is a minimal useful resolution, about 150 × 100 × 100 cells, that yields the main magnetosphere features but does not resolve mesoscale structures, such as FTEs or small plasmoids in the tail plasma sheet. At the other end, we have run simulations with grids as large as ~1,000³ (on some 20,000 cores). In terms of computational cost that is almost a 10⁴ ratio. Here, we used a grid of 325 × 150 × 150 cells which is sufficient for the purposes of this study and runs faster than real time on a modest number of cores.

OpenGGCM has been used for numerous studies of magnetospheric phenomena, such as storms (Raeder et al., 2001b; Raeder and Lu, 2005; Connor et al., 2016), substorms (Raeder et al., 2001a; Ge et al., 2011; Raeder et al., 2010), magnetic reconnection (Dorelli, 2004; Raeder, 2006; Berchem et al., 1995), field-aligned currents (Moretto et al., 2006; Vennerstrom et al., 2005; Raeder et al., 2017; Anderson et al., 2017), and magnetotail processes (Zhu et al., 2009; Zhou et al., 2012; Shi et al., 2014), to name a few.

The boundary conditions require the specification of the three components of the solar wind velocity and magnetic field, the plasma pressure and the plasma number density at 1 AU, which are obtained from ACE observations (Stone et al., 1998) and applied for the entire duration of the simulation at the sunward boundary.

We generate an ensemble of 50 OpenGGCM simulations by perturbing the v_x component of the input solar wind velocity. Changing this particular parameter guarantees a direct and easy way to interpret magnetospheric response.

First, we run a reference simulation using the observed solar wind values at 1 AU starting from May 8th, 2004, 09:00 UTC (denoted as t₀) until 13:00 UTC on May 8th, 2004. We choose this period because it is relatively quiet: no iCME were registered in the Richardson/Cane list of near-Earth interplanetary CMEs (Richardson and Cane, 2010), and as it is common during the declining phase of the solar cycle, geomagnetic activity is driven by Corotating Interaction Regions and High Speed Streams (Tsurutani et al., 2006). The study of outlier events, such as CME arrival at Earth, is left as future work.

To generate the ensemble, the solar wind compression is changed in each of the “perturbed” simulations. The perturbed velocity in the x (Earth-to-Sun) direction is set, for the entire duration of each simulation, to a constant value obtained by multiplying the average observed $v_{x}^{a v g}$ by a random number S sampled from a normal distribution with mean μ = 1 and standard deviation σ = 0.1:

\begin{array}{l} v_{x} = S v_{x}^{a v g}, with S \in N (1, {(0.1)}^{2}) . & (12) \end{array}

The time period we use to calculate the average is the duration of the reference simulation, between 9:00 UTC and 13:00 UTC on May 8th, 2004.

Our choices for the generation of the ensemble are determined by the necessity to preserve both the Gaussian characteristic of the model error and the physical significance of the simulations: the solar wind compression in all ensemble members is not too far from the reference value. With such low standard deviation, the average of the obtained perturbed value plus/minus several sigmas are still within the typical range for the solar wind: the minimum and maximum values of the constant, perturbed input velocities are |v_x| ~ 363 km/s and |v_x| ~ 583 km/s, respectively. The solar wind velocity v_x is negative in the Geocentric Solar Ecliptic (GSE) coordinates used here. The v_x values that we obtain in this way are not supposed to be representative of the full range of values that v_x can assume; they are used to generate an ensemble of simulations “slightly perturbed” with respect to our reference simulation. We note that the real distribution of solar wind velocity is far from a normal distribution, with two distinct peaks and extreme outliers, and would not be appropriate to produce the required ensemble. We refer the reader to Fortin et al. (2014) for optimal procedures on how to choose the variance of the ensemble.

The ensemble analysis requires running 50 simulations to produce the ensemble members, plus the unperturbed reference simulation. Each run takes ~12 h on 52 cores on the supercomputer Marconi-Broadwell (Cineca, Italy), for a total cost of ~32,000 core hours.

We verify that the prior errors are unbiased (as assumed in the derivation of the method summarized in section 2) by comparing the reference simulation and the average of the ensemble. We note that the ensemble mean is an appropriate metric to use in this case because the perturbed simulations have not been generated in order to represent all possible solar wind velocity values, but small perturbations around the reference case.

Figure 1 shows this comparison in the xz plane for the x component of the velocity and of the magnetic field at a fixed time (172 min after the beginning of the simulation), for both the reference simulation (panels A,B) and the average of the ensemble (panels C,D). The magnetic field lines, depicted in black, are calculated from the reference simulation in panels (A,B) and from the ensemble average in panels (C,D). The distances are normalized by the Earth radius R_E.

FIGURE 1

Figure 1. Reference simulation (A,B), ensemble mean (C,D), and difference between the ensemble mean and the reference simulation results (E,F) for the ensemble of OpenGGCM magnetospheric simulations. The ensemble is generated by perturbing the v_x solar wind boundary condition. v_x is depicted in (A,C,E), b_x in (B,D,F). The coordinate system is GSE The depicted time is 172 min after the beginning of the simulation, May 8th 2004, 9:00 UTC. The boundary conditions at the sunward boundary of the “reference” simulation are observed solar wind values.

Visual inspection of panels (A–D) and of the difference between the ensemble mean and the reference simulation results, depicted in panels (E,F) for v_x and B_x, respectively, highlight the areas where the behavior differs most within the ensemble: the bow shock, the plasma sheet, and the neutral line. The former is a plasma discontinuity that moves back and forth in response to the changing solar wind Mach number, and thus gets smeared out in the ensemble. The latter is a region of marginal stability in the magnetosphere that reacts in a non-linear way to solar wind changes.

In order to determine if 50 ensemble members are sufficient for our analysis, we have compared corresponding plots of the difference between the ensemble mean and the reference simulation for v_x with decreasing number (40, 30, 20) of ensemble members. We find that, with decreasingly smaller ensembles, the plasma sheet structure seen in Figure 1E, is only minimally affected. However, the differences around the bow shock become more pronounced. The velocity difference increases in the solar wind and magnetosheath as well, and the magnetic field structure at the magnetosphere/solar wind interface (as shown by the magnetic field lines, which are drawn for the average field in panels E,F and similar analysis) begin to change significantly with respect to the reference simulation. By comparing the plots with 50, 40, 30, and 20 ensemble members, we conclude that 30 is the minimum number of ensemble members that gives average fields compatible with the reference simulation, with our choice of perturbation to generate the ensemble.

Figures 1A,B show characteristic signatures of magnetic reconnection in the magnetotail, i.e., the X pattern and the formation of dipolarization fronts in the magnetic field lines, and the presence of earthward and tailward jets in v_x departing from the X point. We provide a movie showing the dynamic evolution in the Supplementary Material (ReferenceSimVx.avi). The movie shows the solar wind b_z time variation and the subsequent occurrence of several magnetopause/magnetotail reconnection events. The “formation” of the magnetosphere occurs during the first ~30 min of the simulation and should be disregarded.

The movie MeanVx.avi, also in the Supplementary Material, shows how the global evolution changes in the ensemble mean: the magnetopause and magnetotail reconnection patterns are still overall visible, but smoothed out by the averaging procedure with respect to the reference simulation, since the different ensemble instances reconnect at different times and the smaller scale features of each single run are averaged away.

In Figure 2 and movies LowerVx.avi and HigherVx.avi we compare the evolution of the members of the ensemble generated with the lower (|v_x| ~ 363 km/s, panel A) and higher (|v_x| ~583 km/s, panel B) absolute value of the v_x velocity component. The movies show that the velocity values and magnetic field line patterns are significantly different from the reference simulation and from the ensemble average, demonstrating that the perturbations are not trivial.

FIGURE 2

Figure 2. v_x field component, at the same time as Figure 1, for the ensemble member with the lowest (A) and highest (B) absolute value of the perturbed, inflowing v_x velocity component in the OpenGGCM ensemble.

We now discuss the RA and DOI analysis for a set of different observation points, depicted as white stars in the following figures, in the inflowing solar wind (A), in the magnetosheath (B), in the northern lobe (C), and in the plasma sheet (D), for the same plane and time as Figure 1. The coordinates of each of these points in the xz-plane are given in Table 1, the y coordinate being y/R_E = 0. Figures 3, 4 show the DOI maps for v_x and b_x, respectively. Note that the correlations which are displayed are not cross-correlations: the correlation is done between the value of a field at the observation point and the values of the same field at the other grid points.

TABLE 1

Table 1. Coordinates of the observation points used in the DOI analysis.

FIGURE 3

Figure 3. DOI maps for v_x, computed from the correlation of an ensemble of OpenGGCM magnetospheric simulation, with observation points in the solar wind (A), magnetosheath (B), northern lobe (C), plasma sheet (D).

FIGURE 4

Figure 4. DOI maps for b_x, computed from the correlation of an ensemble of OpenGGCM magnetospheric simulations, with observation points in the solar wind (A), magnetosheath (B), northern lobe (C), plasma sheet (D).

Figures 3, 4 show that the correlations are mostly ordered by the principal regions of the magnetosphere, such as the lobes, the magnetosheath, and the plasma sheet. For example, Figure 3 shows the results for the v_x correlations. The DOI values for the plasma sheet are different from those in the magnetosheath and the lobes in all panels. As expected, the stronger correlations are somewhat localized around the observation point, for example, the strongest correlations in panel (D), where the observation point is in the plasma sheet, are in the plasma sheet itself and its immediate surroundings. However, some other observation points have a much larger DOI, such as the ones in the solar wind and the magnetosheath. This makes physical sense, because v_x variations in those regions will propagate through much of the magnetosphere. Figure 4 shows the B_x correlations. The northern and southern lobes clearly stand out, with opposite DOI values, and the magnetosheath stands out as well. Because the B_x values have opposite signs in the two lobes, the DOI values also have opposite signs. The correlation values depend on the variability of the field value at the observation point, thus the panels that exhibit lower correlations are those with observation point in the plasma sheet, where the intrinsic variance of both v_x and B_x is higher (see Figure 1) due to the different reconnection patterns in the different members of the ensemble. For example, in Figure 3D, the velocity value at the observation point in the plasma sheet exhibits little correlation with the v_x values outside of the plasma sheet and the neighboring areas. This is a consequence of the jet structure which is caused by internal magnetospheric dynamics rather than the solar wind driver.

The temporal dynamic DOI behavior is similar: the DOI maps of v_x and B_x with observation point in the plasma sheet exhibit higher temporal variability than those with a observation point in the magnetosheath, as can be seen in the DOI movies DOI_bx_bx_MSheath.avi, DOI_bx_bx_pSheet.avi, DOI_vx_vx_MSheath.avi, DOI_vx_vx_pSheet.avi and in Figure 5. The Figure shows the DOI map for v_x (panels A,B) and B_x (panels C,D) with observation point in the plasma sheet (panels A,C) and in the magnetosheath (panels B,D) at t₀ + 192 min. All the previous figures, Figures 1–4, were at t₀ + 172 min.

FIGURE 5

Figure 5. DOI maps for v_x (A,B) and B_x (C,D) from an ensemble of OpenGGCM magnetospheric simulations with observation point in the plasma sheet (A,C) and in the magnetosheath (B,D), at t₀+ 192 min. All previous Figures were at t₀+ 172 min.

We note that the plots with observation points in the magnetosheath are not significantly different to earlier plots (see Figures 3, 4), except for the plasma sheet plots, which differ profoundly.

To summarize and interpret the OpenGGCM results, the DOI analysis is well in line with our understanding of the terrestrial magnetosphere. In the v_x case, when the observation point is in the solar wind or in the magnetosheath, the |DOI| values are very high in both the solar wind and the magnetosheath region. This is expected, because v_x in the solar wind is a correlation with itself (and thus a sanity test for the calculation), whereas the magnetosheath is largely driven by the interaction between solar wind and the bow shock, where the Rankine-Hugoniot conditions predict a positive correlation of the downstream velocity with the upstream velocity. When the observation points are in the solar wind and magnetosheath regions, the |DOI| values in the plasma sheet are expected to be lower due to internal transient dynamics (e.g., reconnection events, bursty bulk flows) in the sheet which may be triggered by local plasma sheet dynamics, rather than solar wind compression. Local dynamics in the sheet is also the reason why, in Figure 3D, when the observation point is in the plasma sheet, the correlation with the solar wind and magnetosheath regions is close to zero. Even if, in global terms, magnetic reconnection in the plasma sheet were triggered by magnetopause dynamics, in any region of the plasma sheet v_x may flow sunward or anti-sunward, depending on the location of the reconnection site, and thus would be uncorrelated with the velocity in the solar wind or in the magnetosheath. The lobe magnetic field is expected to be directly driven by the solar wind dynamic pressure, and thus by v_x. As the dynamic pressure increases, the lobe flare angle decreases, and vice versa. As the flare angle decreases, the lobe field gets compressed. Figures 4B,C show that effect, as expected.

Similar consideration broadly apply to the B_x DOI results shown in Figure 4. There is, however, a significant difference between panel (A) in Figures 3, 4. When the observation point is in the solar wind, high correlations are obtained in large parts of the magnetosphere for v_x, while B_x correlations are much lower. This can be attributed to the fact that B_x in the solar wind is not a major driver of magnetospheric dynamics, unlike the solar wind speed and solar wind dynamic pressure. The geo-effective component of the interplanetary magnetic field (IMF) is the B_z component, which controls reconnection at the magnetopause and thus the dominant energy input into the magnetosphere.

3.2. Magnetospheric Applications II: Tsyganenko Model

The Tsyganenko models are a family of empirical, static terrestrial magnetic field models (Tsyganenko, 1987, 1989, 1995, 2002a,b; Tsyganenko et al., 2003; Tsyganenko and Sitnov, 2005). The successive model versions (available at http://geo.phys.spbu.ru/~tsyganenko/modeling.html) reflect increasing knowledge of the magnetospheric systems and are based on an increasing amount of data from all regions in the magnetosphere.

The models are based on a mathematical description of the magnetosphere, which includes contributions from major magnetospheric current sources, such as the Chapman-Ferraro current, the ring current, the cross-tail current sheet and large-scale field-aligned currents. Terms are added to account for the magnetopause and for partial penetration of the IMF into the magnetosphere. The most recent versions can also take into account the dipole tilt, the dawn-dusk asymmetry, and allow for open magnetospheric configurations. The parameters of the models are derived from a regression to magnetic field observations, and keyed to magnetic indices and/or solar wind parameters. The model requires the user to specify a date and time for the dipole orientation. The other model parameters, either an index, such as the Kp, or solar wind variables, are to be given by the user. In more recent models, Tsyganenko also provides yearly input data files for his models. From these inputs, an approximation of the magnetosphere is created for the specified date and time. Notice that the Tsyganenko models are static, and only provides a snapshot of the magnetosphere. However, since the parameters are time dependent the model can be used in a quasi-dynamic mode.

Several versions of the Tsyganenko model have been tested over the years against observations and physics-based, MHD models (Thomsen et al., 1996; Huang et al., 2006; Woodfield et al., 2007). While the Tsyganenko models do not account for the Earth's internal magnetic field, methods are provided to add the internal field model as described in the above cited literature.

In order to simulate the evolution of the magnetosphere with the chosen Tsyganenko model, we create snapshots of the magnetosphere at different times. The time May 8th, 2004, 09:00 UTC is taken as t₀, the same time as the OpenGGCM simulations presented in section 3.1. The model is “evolved” by using a time series of the required input parameters, which are obtained from the OMNIWeb database (King and Papitashvili, 2005).

We use two versions of the Tsyganenko model, the T96 model (Tsyganenko, 1996) and the TA15 model (Tsyganenko and Andreeva, 2015). We generate the Tsyganenko ensembles in the same way as the OpenGGCM examples, by using a distribution of v_x values as described in section 3.1.

Before we analyse the results of the T96 ensemble, we show the magnetospheric configuration computed by the model using the original solar wind data. In Figure 6, the first row of figures shows the results of the “reference” simulation, e.g., the simulation without any perturbed inputs, at time t₀ + 85 min, for B_x (panel A) and B_z (panel B).

FIGURE 6

Figure 6. Results from the T96 Tsyganenko model. The top row (A,B) shows the reference magnetic field (B_x and B_z component respectively) at time t₀ + 85 min, with a negative IMF. The middle row (C,D) shows the ensemble mean of the same magnetic field components at the same time t₀ + 85 min. We clipped the magnetic field values to |50nT|, in order to make variations in the tail better visible. The last row (E,F) shows the logarithm of the absolute difference between the reference simulation and the mean of the ensemble.

Unlike the OpenGGCM, the T96 model cannot model reconnection, although some approximation of reconnection is included in later Tsyganenko models (Tsyganenko, 2002a,b). Also, the day-side magnetospheric structure is only approximated with respect to physics-based models, and bow shock and magnetosheath are not clearly distinguishable. In Figure 6, the second row shows the average of the ensemble at the same time of the reference simulation, for B_x (panel C) and B_z (panel D). The results are similar to the reference simulation, as shown by the logarithm of the absolute difference between the reference and ensemble mean (e.g., Figures 6E,F). The only significant difference is located at the magnetopause, which is expected since varying the solar wind velocity changes the standoff distance.

Next we analyse the DOI maps of the T96 model. Figure 7 shows the DOI maps for the B_x and B_z field components at time t₀ + 85 min. Although the T96 model is parameterized by the solar wind velocity, it only models the magnetic field in the magnetosphere. Because of this, we are only able to analyze the DOI maps of the magnetic field components. The observation points are placed in the northern lobe, in proximity to the current sheet, at the dayside magnetosphere, and in the southern lobe. Like in the OpenGGCM case, the DOI maps reflect the general regions of the magnetosphere as reproduced by the T96 model. However, the correlation only takes values of ±1 in the magnetosphere, and zero in the solar wind. The latter is simply a consequence of the fact that the model does not predict the IMF, which is therefore independent of the v_x variations of the ensemble. The former is due to the fact that the model has no intrinsic time dependence. Any variations of the solar wind affect the entire magnetosphere instantly and in proportion to the variation. Thus, after normalization, only the sign matters, i.e., whether a given change at the observation point leads to a positive or negative change at a different point.

FIGURE 7

Figure 7. Each row displays the DOI maps for B_x and B_z, from an ensemble of Tsyganenko model T96 simulations with observation point in the plasma sheet (A,B), dayside magnetosphere (C,D) and southern lobe tail (E,F), respectively.

Now we focus on the results of the TA15 model. Figure 8 shows the reference simulation and ensemble mean for the B_x and B_z fields, with superimposed field lines, at time t₀ + 85 min, together with the difference between reference and ensemble mean. We observe that the reproduced dayside magnetosphere structure is improved compared to the T96 model, at the expense of unrealistically high magnetic field values in the inflowing solar wind, and correspondingly distorted magnetic field lines. These artificial boundary conditions in the Sunwards boundary are used to obtain an “open” magnetosphere which blends with the inflowing solar wind, without seeming to form a nightside magnetosheath. Notice also the high values at these artificial boundary conditions in the difference plot, indicating that there is a high variability in their values.

FIGURE 8

Figure 8. The reference and mean magnetic field (B_x: (A,C) and B_z: (B,D) respectively) at time t₀ + 85 min from the Tsyganenko TA15 model are displayed in the top two rows of figures. Notice the non-realistic high magnetic field values in the inflowing solar wind (at x/R_E > 10). We assume these unrealistic values are necessary for the model to construct the day-side magnetosphere. The values of the magnetic field have been cut off in the top two rows of figures at |50nT|, to make sure the variations in the tail are visible. The last row (E,F) of figures shows the logarithm of the absolute error between the reference simulation and the mean of the ensemble.

From the DOI maps in Figure 9 (with observation points at the same positions as Figure 7), we can confirm that the modeled IMF is used to construct the internal magnetospheric solution. While in the T96 model the solar wind B_x and B_z values were uncorrelated with the magnetospheric values, here the absolute value (i.e., ignoring the sign) of the correlation is very high: the solar wind input strictly determines the inner magnetospheric solution, making the correlation practically unitary. This could be because of the deterministic analytical formula used to construct the magnetic field, where everything is exactly determined on a global scale.

FIGURE 9

Figure 9. DOI maps for B_x (A,C,E) And B_z (B,D,F), computed from an ensemble of Tsyganenko TA15 simulations at t₀ + 85 min with observation points at the same position as Figure 7.

Note that the correlations reported are spatial and not temporal, therefore no causality is implied. High correlation between the IMF and magnetospheric fields point to the fact that, in an ensemble generated by perturbing the solar wind input, the model is built in such a way that variations in the magnetic field are highly correlated through the system, apparently without highlighting the boundary regions that we were able to spot in the DOI maps for the OpenGGCM and Tsyganenko T96 simulations.

A last remark on the DOI analysis applied to the Tsyganenko models is the following. The analysis helps us understand and visualize how the different models are built, with regards to the relationship between the solar input and the magnetospheric solution. DA analysis then proves useful here as a model investigation tool.

It also highlights that caution should be used when deciding to apply DA techniques to a particular model, depending on the objectives of the investigation. The Tsyganenko models were built to provide time-independent, empirical-based insights into the structure of the magnetosphere at a particular instant in time. They do not aim at representing the state of the pristine solar wind, which is used only to better the magnetospheric solution (hence the somehow unrealistic solar wind patterns identified in Figures 8, 9). Also, they do not intend to reproduce temporal dynamics in the magnetosphere. These factors result in DOI maps where the absolute value of the correlation is always either 1 or −1. When using DOI techniques with the purposes of identifying useful locations for satellite placement, these are not useful results: we are interested in the value of the DOI, not in the sign. Hence, caution should be used before using empirical, time-independent models for this particular purpose: more significant information will possibly be acquired from their physics-based, time-dependent counterparts. This consideration does not intend to diminish the importance of empirical, time-independent models for other scientific objectives such us, most importantly, quick forecasting.

3.3. Heliospheric Application: PLUTO

In this section, we study the propagation of a Coronal Mass Ejection in a solar meridional plane, which is defined by the rotation axis of the Sun and a radial vector in the equatorial plane. In all the runs of the ensemble, the computational domain is 1R_⊙ ≤ r ≤ 216R_⊙ and 0 ≤ θ ≤ π in spherical coordinates, where R_⊙ is the solar radius and θ is the polar angle (or colatitude), corotating with the Sun. Assuming axisymmetry around the solar rotation axis, we may limit our analysis to 2.5D (pseudo 3D) simulations. The grid resolution is uniform in both directions, 384 × 384 cells, which is sufficient to capture the structure of the background solar wind while keeping the computational cost and output size manageable.

We simulate the background solar wind using a simple adiabatic model with effective polytropic index Γ = 1.13 (Keppens and Goedbloed, 2000). We also assume a time-independent dipole background magnetic field:

\begin{array}{l} B_{r} = - 2 B_{o} cos θ / r^{3}, & (13) \end{array}

\begin{array}{l} B_{θ} = - B_{o} sin θ / r^{3}, & (14) \end{array}

where B_r, B_θ are the r, θ components of the magnetic field in spherical coordinates and B_o a constant used to scale the field to B = 1.1G on the solar equator. We impose the density distribution ρ as a function of the latitude θ at the inner boundary to achieve a “dead zone” of low velocity near the equator and a fast solar wind near the poles simultaneously (see Keppens and Goedbloed, 2000 and Chané et al., 2008). The differential rotation of the Sun is also taken into account, following Chané et al. (2005); this is achieved by imposing a varying azimuthal velocity v_ϕ = v_ϕ(θ) at the inflow boundary.

Once the simulation reaches a steady state, roughly after ~ 2.5 days or t = 10 in normalized units, the radial velocity at 1 AU is ~ 300 km/s near the equator and ~ 850 km/s near the poles. This is consistent with the large-scale bimodal solar wind structure that is typically observed during solar minimum (McComas et al., 1998).

We create two ensembles of 100 simulations each. In the first ensemble, the velocity of the CME in each case is randomly selected from a Gaussian distribution with mean μ = 900 km/s and standard deviation σ = 25 km/s. The resulting values are typical of strong CME events. In the second ensemble, the spatial extent of the boundary conditions that launch the CME varies as well, along with the velocity of the CME as described above. The half-width of this region is also randomly selected from a Gaussian distribution with μ = 10° and σ = 0.5°. All other parameters remain the same in every run.

The values of the CME widths that are used here are comparable to observed events. The choice of parameters in the second ensemble is less constrained by observations and leads to the appearance of very small values of variance. We thus find large areas where the DOI ~ 1, since the simulations in the ensemble do not differ significantly. This was confirmed by creating and analyzing a third ensemble, where the width of the CME is chosen from a Gaussian with μ = 20° and σ = 2.0°.

Figure 10 shows the evolution of the radial velocity average over the whole ensemble. Up to t = 11, i.e., before the CME is launched, all runs are identical. The CME is initialized similar to the simplified approach of Keppens and Goedbloed (2000), such that the boundary conditions on the solar surface are modified to represent a change of mass flux. In our case, we modify the boundary conditions at R=1R_⊙, in a given region around θ = 80°. A tracer (a passive scalar only present as an advected quantity within the flow, without effect on the plasma) is also injected with the CME, to facilitate monitoring its propagation. In the middle and right panel of Figure 10 we show the ejection of the CME and its propagation. The CME front can be clearly distinguished at t = 12.

FIGURE 10

Figure 10. Injection and propagation of a CME via the average radial velocity in the ensemble (in km/s). From (A–C): relaxed solar wind (t = 10), injection and propagation of the CME (t = 11, t = 12). The leading edge (front shock) is prominent in the last panel, marked by a black arrow.

To apply the RA technique, as explained in section 2, we select a point of interest and perform the analysis based on (a) the plasma density, or (b) the radial velocity. We present results at t=14, when the CME has reached a distance of ~ 150R_⊙, for two detection points (at R = 90 and R = 150R_⊙, θ = 80°). At times earlier than t = 10 (when the solar wind reaches a steady state and the CME is injected), the DOI is zero, since the observation point is disconnected from the rest of the domain before the CME reaches it.

The propagation of the CME can be monitored in the MHD simulations easily via, e.g., a tracer (or the radial velocity). The DOI map, when the tracer is used as a criterion, follows closely the CME propagation pattern observed in the MHD runs. However, this is of limited use, besides testing, as the tracer (in our case) does not represent a real physical quantity.

The DOI map for the first ensemble, where we perturb only the radial velocity of the CME, is shown in Figure 11. The regions where information from the CME front has not yet arrived have DOI = 0, as shown in the radial velocity DOI map (Figure 11). When only one parameter is modified (first ensemble), the density DOI map shows a very large area of the domain saturated with correlation ≃1. This is probably due to variations in density of the background solar wind induced by the propagation of the CME. The regions with absolute value of the DOI ≃ 1 that are located far from the CME propagation front (at small or large angles θ) are the areas of high radial velocity in Figure 10, where the information on the perturbation introduced when triggering the CME has already propagated. The density and radial velocity of all ensemble members are modified in a similar way, hence the large |DOI| values.

FIGURE 11

Figure 11. DOI maps for the density (A,B) and the radial velocity (C,D), from an ensemble of PLUTO simulations. The first column has the observation point at R = 90R_⊙ and the second column at R = 150R_⊙ (marked by the black dot). In this set we only perturb the radial velocity of the CME. The structure of the CME can be seen quite clearly in (A,B), where the leading edge is evident and marked by a black arrow in all panels.

In the second ensemble, where the width of the CME is also modified, the DOI map of the density shows smaller correlation values (compare especially panels B,D in the two figures) compared to the previous ensemble, because the differences between the runs of the ensemble are now larger (see Figure 12). This results in smaller regions where the DOI is close to unity compared to the first case.

FIGURE 12

Figure 12. DOI maps for the density (A,B) and the radial velocity (C,D), computed from PLUTO simulation ensembles. The first column has a detection point at R = 90R_⊙ and the second column at R = 150R_⊙ (marked by the black dot). In this set we perturb two parameters, the radial velocity and the size of the CME.

The last ensemble, where the CME width and its perturbation are larger compared to the second case, is shown in Figure 13. The DOI pattern is qualitatively similar to Figure 12, but due to the larger values in the size of the CME and its perturbation, the regions with high DOI values (meaning the regions affected by CME propagation in at least one member of the ensemble) are slightly larger as well.

FIGURE 13

Figure 13. DOI maps from ensembles of PLUTO simulations for the density (A,B) and the radial velocity (C,D). The first column has a detection point at R = 90R_⊙ and the second column at R = 150R_⊙. In this set we perturb two parameters, the radial velocity and the size of the CME, with a larger perturbation in the size with respect to the second data set.

Additional analysis, not shown here, was carried out on subsets of the ensembles to ensure the ensemble size is sufficient. We found that in this case convergence was achieved if at least 25 ensemble members were used (as described in section 2); however, this number may differ in other cases, depending on the specifics of the ensemble.

The DOI analysis applied to the simulations performed with PLUTO are indicative of the versatility of the method. In all PLUTO ensembles, we can monitor the influence of the CME during its propagation and the response of the system via the DOI. Moreover, we can identify certain CME components, such as the leading edge, from the DOI maps. Differences in the response of the system due to the choice of the perturbation or parameters are captured as well. The resolution used here was sufficient to capture the CME injection and propagation within reasonable computational cost; the typical run time for simulating a member of the ensemble was of the order of ~10′ min on 28 cores.

However, some limitations of the model must be considered. The axisymmetric assumption simplifies the problem and allows to reduce computation costs, but with the drawback of not accounting for the three-dimensional CME structure. The limited angular resolution imposes a weak constraint on both the perturbed and unperturbed size of the CME that we can simulate. Runs with a higher resolution can remove this constraint at additional computational cost. Simulations in 3D will be part of future work in order to capture the full system, where also differences in the polar direction can be examined. Finally, a more realistic model for the background magnetic field should be used, rather than a simple static dipole. We focused mainly on calculating the DOI at different times and locations, but a similar approach can be used to estimate the arrival time of the CME, as described in Owens et al. (2020).

4. Summary and Conclusions

In this paper, we apply the Representer analysis and the Domain of Influence analysis to two fundamental components of the Sun-Earth connection: the interaction between the solar wind and the terrestrial magnetosphere, simulated with the OpenGGCM MHD code and with the empirical Tsyganenko models, and the propagation of CMEs in the background solar wind, simulated with the MHD PLUTO model.

In each case an ensemble is generated by appropriately perturbing initial/boundary conditions. Subsequently, the DOI analysis is applied over the ensemble. Localization methods, which can be used to reduce spurious correlations in the estimated prior covariance matrix (Anderson, 2007; Bishop and Hodyss, 2007; Sakov and Bertino, 2011), are not used at this stage.

Primarily, the DOI analysis is a first step in the application of Data Assimilation techniques to a model, and can be applied before assimilation itself to gain insight on the system and on the model. However, the DOI analysis can also be used to gain physical insight, and to devise optimized observation systems, as discussed below.

Our main results are as follows.

First, we have demonstrated that DOI analysis can provide useful information on the most appropriate locations for future observation points, such as solar wind and magnetospheric monitors. Large absolute values of the DOI, calculated with respect to an observation point, means that observations at that location would provide significant information of that field in the specific, large |DOI| area, but less so in areas with lower |DOI|. This can be used in two different ways. On one hand, DOI analysis can help to find observations points that are connected to large |DOI| areas, in order to increase the amount of information brought in by a single new observation. On the other hand, the same information can be used for a different objective. Given a particular location, one can ask where observations need to be obtained to improve knowledge of that area. A useful example here is the plasma sheet in the OpenGGCM analysis, section 3.1. Figures 3, 4 show that |DOI| values in the plasma sheet are consistently low, notwithstanding the field which is examined (v_x or B_x) and the location of the observation point. |DOI| values in the plasma sheet are low even if the observation point is in the plasma sheet itself: |DOI| values, which are of course 1 at the observation point itself, quickly become smaller even a small distance away. Since the plasma sheet is a location of particular importance for space weather forecasting, or basic research for that matter, single s/c in the plasma sheet are of limited use, and rather a constellation of satellites, such as proposed in Angelopoulos et al. (1998) and Raeder and Angelopoulos (1998) would be necessary.

Second, we have used the DOI analysis to improve our knowledge of the models we use, and in particular to investigate whether these models are appropriate for the implementation of Data Assimilation. The DOI analysis for the Tsyganenko models in section 3.2 powerfully highlights the model evolution from version T96 to version TA15. In version T96, the magnetosphere is a closed system, and solar wind conditions are not correlated (DOI ~ 0 in Figure 7) with the magnetospheric region. In version TA15, the magnetosphere opens up to solar wind driving, and the correlation between the solar wind region and the magnetosphere becomes very high (Figure 9). One should remember that the Tsyganenko models are supposed to be used to investigate the magnetospheric system, and the solar wind configuration is artfully modified as to give the best representation of the magnetosphere under the specified conditions.

One common aspect of the two versions of the Tsyganenko models is that (with the exception of the solar wind region in T96) the DOI values are always either 1 or −1, for all fields and regions examined. These results appear less realistic than the OpenGGCM results obtained in section 3.1, where DOI values have larger variability. The Tsyganenko models differ with respect to OpenGGCM in two fundamental aspects, in that they (a) empirically reconstruct the magnetospheric magnetic field from an array of observations and (b) that they are not time-dependent. Either of these two aspects can contribute to the unrealistically high correlations we observe. Investigations on other models, and specifically on empirical, time-dependent models, will possibly help disentangle the role of these two aspects. At this stage of the investigation, we advance the hypothesis that time-dependent models may be better suited than time-independent models as background models for Data Assimilation techniques.

Third, with this analysis we have highlighted a possible path for future, targeted improvements of global heliospheric models used, among other things, for simulations of CME propagation in the heliosphere. It has long been known that one of the critical aspects of the simulation of CME arrival time is the estimation of the physical parameters to use as initial conditions in the simulations. While some parameters can be easily estimated from remote sensing, others are more difficult to determine properly and their variability affects the accuracy of the forecast (Falkenberg et al., 2010). In this paper, we have shown that DOI analysis could constitute an important stage of a model analysis effort aimed at clarifying which aspects of a model should be prioritized in order to obtain more accurate simulations of CME propagation.

In this study, as a first step, we show DOI maps obtained from the correlations of a single variable calculated between the variable at the observation point and the same variable in the domain under investigation. As demonstrated in Skandrani et al. (2014), cross-correlations can be used to find the influence of one variable upon another.

The results of a DOI cross-correlation analysis can then be used to determine which quantities and areas in a simulation are most relevant in determining a certain observational quantity (such as the radial velocity of a CME in the case of CME propagation simulations). This analysis can then guide modelers on deciding which aspects of a model could be improved for more realistic results. It could help understanding, for example, if CME propagation in a model is mainly controlled by the background magnetic field configuration or by the properties of the CME itself at launch. In the first case, modeling efforts could be directed into accurate high resolution representation of the magnetic field configuration in the lower corona. In the second case, instead, modeling improvements could be focused on extracting better estimates of CME launch parameters (e.g., CME density, velocity, internal magnetic field configuration with respect to the background wind) from available observations.

The spatial correlations provided by DOI can also be of particular interest in evaluating the effect of actual measurements done at positions different from the traditional L1, such as, for example, missions planned for L5 or missions closer to the Sun.

Future work will extend this study to include temporal and cross correlations between different field components. This will further increase our knowledge of the models used to simulate such critical space weather processes.

The DOI analysis presented here can also be combined with an Observing System Simulation Experiment (OSSE), an approach already used in ionospheric and solar dynamo studies (Hsu et al., 2018; Dikpati, 2017) to help provide a cost-effective approach to the evaluation of the potential impact of new observations. OSSE requires that DA is already implemented and uses independently simulated “data” that are ingested into a different model or a different instance of the same model. The effect of DA can then be investigated, albeit with caveats, since the “data” are not real. DOI analysis would obviate the need to have DA implemented, which can be very costly. Instead, only ensemble runs with an unmodified model are required, and can provide a measure of the usefulness of a model and the available data for a specific situation.

Data Availability Statement

The datasets generated for this study are available on reasonable request to the corresponding author.

Author Contributions

MI, BL, and DM created the ensembles using OpenGGCM, Tsyganenko, and PLUTO, respectively, as they appear in the main body of the article, processed the data and presented the results of the analysis, and prepared the manuscript. JR provided the OpenGGCM and contributed to the interpretation of the magnetosphere model results. JR, GL, and SP provided the support during the analysis and the preparation of the article. All authors contributed to the article and approved the submitted version.

Funding

DM aknowledges support from AFRL (Air Force Research Laboratory)/USAF (US Air Force) project (AFRL Award No. FA9550-14-1-0375, 2014-2019) and partial support by UK STFC (Science and Technology Facilities Council) Consolidated Grant ST/S000240/1 (UCL-MSSL, University College London-Mullard Space Science Laboratory, Solar System). MI's work was supported by an FWO (Fonds voor Wetenschappelijk Onderzoek–Vlaanderen) post-doctoral fellowship. BL, MI, GL, and JR acknowledge funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 776262 (AIDA, Artificial Intelligence for Data Analysis, www.aida-space.eu). JR also acknowledges support through AFOSR grant FA9550-18-1-0483 and from the NASA/THEMIS mission through a subcontract from UC Berkeley. SP and GL acknowledge funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 870405. These results were also obtained in the framework of the projects C14/19/089 (C1 project Internal Funds KU Leuven), G.0A23.16N (FWO-Vlaanderen), C 90347 (ESA Prodex), and Belspo (Belgian Science Policy) BRAIN project BR/165/A2/CCSOM.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The PLUTO simulations were performed using allocated time on the clusters Genius and Breniac. The computational resources and services used in this work were provided by the VSC (Flemish Supercomputer Center), funded by the Research Foundation Flanders (FWO) and the Flemish Government–Department EWI. The OpenGGCM simulations were performed on the supercomputer Marconi-Broadwell (Cineca, Italy) under a PRACE allocation. We acknowledge the NASA National Space Science Data Center, the Space Physics Data Facility, and the ACE Principal Investigator, Edward C. Stone of the California Institute of Technology, for usage of ACE data. We acknowledge use of NASA/GSFC's Space Physics Data Facility's OMNIWeb service, and OMNI data.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fspas.2020.571286/full#supplementary-material

Supplementary Video 1. Temporal variation of b_z, reference run.

Supplementary Video 2. Evolution of the ensemble mean.

Supplementary Video 3. Evolution when |v_x| ~ 363 km/s.

Supplementary Video 4. Evolution for |v_x| ~ 583 km/s.

Supplementary Video 5. DOI for Bx, observation point in the plasmasheet.

Supplementary Video 6. DOI for Vx, observation point in the plasmasheet.

Supplementary Video 7. DOI for Bx, observation point in the magnethosheath.

Supplementary Video 8. DOI for Vx, observation point in the magnethosheath.

1. Dynamic representation of the open magnetosphere scenario using OpenGGCM (shown in Supplementary Video 1), showing the temporal variation of the b_z component and reconnection events.

2. Global evolution of the ensemble mean (Supplementary Video 2), where the modified magnetopause-magnetotail reconnection pattern is visible.

3. Evolution of the members of the ensemble using different values for the v_x component. Two different values are tested, |v_x| ~ 363 km/s (Supplementary Video 3) and |v_x| ~ 583 km/s (Supplementary Video 4), resulting in two different magnetosheath patterns.

4. The DOI movies for the Bx and Vx with observation points in the plasma sheet (Supplementary Videos 5, 6).

5. The DOI movies for the Bx and Vx with observation points in the magnetosheath (Supplementary Videos 7, 8).

References

Allen, J., Sauer, H., Frank, L., and Reiff, P. (1989). Effects of the march 1989 solar activity. EOS Trans. Am. Geophys. Union 70, 1479–1488. doi: 10.1029/89EO00409