1 Introduction
The stochastic differential equation (SDE) is a mathematical framework employed to study dynamical systems subjected to both deterministic and stochastic influences. It combines the deterministic part expressed through ordinary differential equations with the random forcing term formulated by Gaussian white noise, making itself invaluable across diverse disciplines (Øksendal, 1987; Särkkä and Solin, 2019). For example, SDEs are used to capture the unpredictable nature of stock prices in financial mathematics and describe the erratic movement of particles in fluids, as studied in physics (Øksendal, 1987; Rüschendorf, 2023; Seifert, 2012). Furthermore, beyond modeling, SDEs also find applications in inverse problems, including linear inverse models (LIMs) (Kwasniok, 2022; Penland, 1989; Penland and Magorian, 1993; Penland and Matrosova, 1994).
LIMs serve as mathematical tools that extract the linear dynamics and random forcing behavior of the underlying complex non-linear stochastic process based on finite sampling data. This approach allows researchers to study the time evolution of deviations from a stable equilibrium point, infer system dynamics, and quantify uncertainties that are difficult to measure directly (Penland and Magorian, 1993; Penland and Matrosova, 1994; Penland and Sardeshmukh, 1995). More precisely, consider a dynamical system of the following form:
where is the system state at time , is the unknown system, and represents the normalized Gaussian vector. The stationary LIM (ST-LIM) approximates Equation 1 using a Markov system, given by
where A and Q describe the constant first-order deterministic dynamics and the covariance of noise, respectively (Penland, 1989; Penland and Matrosova, 1994). This is referred to as the Markov approximation or assumption. In the stationary state, the correlation function of Equation 2 follows an exponential form, where the exponent corresponds to . The value of can be obtained if the values of are known at the origin and at some lag . Subsequently, the constant diffusion matrix can be obtained through the fluctuation–dissipation relation (FDR). Therefore, for stationary finite observations, first-order dynamics and diffusion for the underlying stochastic process can be estimated with the Markov approximation. Moreover, to verify the effectiveness of the linear model, the tau-test has been widely applied (Penland and Sardeshmukh, 1995; Penland, 2019). This test estimates the dynamics over a range of lags and is considered successful if the estimated dynamics remain independent of .
Due to its mathematical simplicity and applicability, the ST-LIM has been widely applied to climate science to study large-scale climate events, including El Niño–Southern Oscillation (ENSO) (Alexander et al., 2008; Johnson et al., 2000a; Newman et al., 2011b; Penland and Magorian, 1993; Penland and Matrosova, 1994; Penland and Matrosova, 2006; Perkins and Hakim 2020). However, it can fail drastically in reconstructing the Markov system, even if the observational dataset is sampled directly from the system itself, due to the Nyquist issue (Johnson et al., 2000b; Penland and Sardeshmukh, 1995; Penland, 2019). This occurs when the lag is close to the multiples of the Nyquist lags, which are half the imaginary parts of the eigenvalues of the first-order dynamics. Moreover, its assumption of stationarity can lead to significant limitations in practice. For example, ST-LIM cannot resolve the periodic feature of cyclostationary observations or analyze the seasonality of global-scale physical phenomena (Penland and Sardeshmukh, 1995).
The interactions between seasonal cycles and climate modes have been increasingly recognized as a central aspect of climate variability research in recent years (Stuecker et al., 2015; Stuecker et al., 2013; Zhao et al., 2024). Previous studies have addressed seasonality in climate science through different strategies (Stuecker, 2023). One approach imposes periodicity on the data analysis, including the cyclostationary empirical orthogonal function, analyzing the time-dependent physically meaningful modes (Hamlington et al., 2011; Kim, 2002; Kim et al., 2015). Another example involves dividing observational data by the calendar month and analyzing each segment separately—for instance, by applying the Markov approximation (Xue et al., 2000). On the other hand, mathematical modeling with periodicity offers an alternative, but the periodicity is often simplified as a sinusoidal fluctuation to reduce parameter estimation demands and overall model complexity (Johnson et al., 2000b; OrtizBeviá, 1997). Meanwhile, some studies assume that cyclostationary dependence is primarily driven by stochastic random forcing rather than by deterministic dynamics (Alexander et al., 2008; Newman et al., 2011a; Penland, 1996). However, none of these approaches fully integrate deterministic dynamics with external random forcing, motivating the development of cyclostationary LIMs (CS-LIMs) (Shin et al., 2021).
The original CS-LIM, a variant of LIMs, was introduced recently in the climate science community, bridging a critical gap in analyzing and modeling the seasonal cycle of a climate phenomenon (Shin et al., 2021). This method is designed to capture the first-order temporal structure of the deterministic and stochastic components of a cyclostationary process (i.e., the unknown system in Equation 1 is periodic), and it has been applied to study the seasonal variation in ENSO using geophysical fields such as monthly sea surface temperature (SST) time-series data (Shin et al., 2021). Compared to the analyses based on the ST-LIM framework, the original CS-LIM has shown an improved forecast skill and a more accurate ENSO characterization (Kido et al., 2023; Shin et al., 2021; Vimont et al., 2022). In recent years, CS-LIM has been increasingly recognized as an important mathematical tool in climate science. However, its mathematical basis is worth further exploration. Enhancing the theoretical rigor of CS-LIM would not only solidify its mathematical foundation but also extend its applicability to broader areas in climate research or other applied fields of science.
From a mathematical perspective, under the cyclostationary condition, the original CS-LIM first approximates the complex non-linear stochastic system with a periodic Markov system of the following form:
where and are periodic families of dynamical and diffusion matrices, respectively. To estimate and , the original CS-LIM divides the full period into several intervals, applies the ST-LIM interval-wise to extract the first-order dynamics, and then estimates diffusion by enforcing the cyclostationary version of FDR (Shin et al., 2021). Therefore, unlike the ST-LIM, which reconstructs the linear dynamics and diffusion of a Markov system, the original CS-LIM does not reconstruct the periodic linear dynamics and diffusion for a process satisfying Equation 3; rather, it estimates them. Even so, this estimation is sufficient for most applications. The detailed formulations of Equations 2, 3 will be provided in the later sections.
In this article, we study CS-LIMs from a mathematical perspective by providing a theoretical background and examining the validity of approximation used in the models. In particular, we refine the original CS-LIM to the -CS-LIM by improving the finite difference stencils to remove an artificial phase shift. However, because of the interval-wise application of ST-LIM, it is still vulnerable to the Nyquist issue. As an alternative, we present -CS-LIM, an analytic inverse model corresponding to Equation 3. The time-dependent dynamics are estimated using the first-order right-derivative of the correlation function at 0-lag, and hence, -CS-LIM is unaffected by the Nyquist issue. We also note that it corresponds to the pointwise Markov approximation, in contrast to the interval-wise version.
Notably, the -CS-LIM and -CS-LIM are closely related. In practical implementation, the former utilizes an exponential fitting, while the latter uses a linear fitting to the observed correlation function to estimate the first-order dynamics of the underlying system using finite cyclostationary data. The exponential fitting approaches the linear fitting when the sampling interval and the lag are sufficiently small and the number of intervals and the sampling size are sufficiently large. Therefore, in such a limit, -CS-LIM converges to -CS-LIM, in the sense that the difference between their estimated results can be arbitrarily small. On the other hand, for , they can be designed to serve different purposes. The -CS-LIM constructs dynamical matrices, each of which best represents the integrated effect of the underlying dynamics within each interval, leading to its robustness against noise. In addition, the tau-test provides a quantitative measure of the validity of the linearity assumption within each interval. On the other hand, as a derivative-based method, the -CS-LIM estimates the Jacobian of the underlying system, representing the instantaneous dynamical information.
The remainder of this study is structured as follows: in Section 2, we review the mathematical background of the ST-LIM and discuss the Nyquist issue based on Penland (2019), extending the analysis by examining it from the perspective of perturbation theory. In Section 3, we present the formulation of CS-LIMs, including the original CS-LIM by Shin et al. (2021), -CS-LIM, and -CS-LIM, along with their numerical issues and the relationships among various LIMs. The proofs of the key equations are presented in the Supplementary Material. Then, the numerical experiments are presented in Section 4 to demonstrate the effectiveness of -CS-LIM and -CS-LIM. Finally, in Section 5, we apply the CS-LIMs to real-world cyclostationary data to discuss the ENSO mechanism.
Before proceeding to the next section, we briefly explain the convention of our notation. Vectors are assumed to be column vectors unless stated otherwise. We use bold font for continuous-time forward formulations and regular font for inverse problems. Moreover, true values are denoted in bold, while model outputs are in regular font in the numerical experiments or applications. When referring to time-series data, we mean a discrete-time sequence of -dimensional vectors indexed by with an equal sampling interval represented by (i.e., ). Furthermore, we assume either stationarity or cyclostationarity throughout this article.
2 Review of the ST-LIM
To represent the underlying unknown system from finite observations, the ST-LIM framework constructs a best-fit linear system within the class of Ornstein–Uhlenbeck (OU) processes given by the following equation:
where is the real-valued dynamical matrix and is the positive definite diffusion matrix (Penland, 1989; Penland and Magorian, 1993). The noise term is a normalized Gaussian random vector with zero mean, and
where the bracket and denote the expectation and Dirac delta function, respectively. Given this setup, the evolution of probability distribution is described by the well-known Fokker–Planck equation as follows:
where is the Fokker–Planck operator (Øksendal, 1987; Särkkä and Solin, 2019). To ensure a stationary distribution, each eigenvalue of must have a negative real part, thereby making it a damping operator. The balance between the dissipative dynamics and the external excitation random forcing is given by the fluctuation-dissipation relation (FDR) as follows:
Here, denotes covariance. The correlation function admits an explicit form for ,
allowing us to formally express the dynamical matrix in terms of the matrix logarithm of the Green function , given by ; thus, for any lag , we obtain
In practical applications, given a finite set of observational data , where follows a continuous distribution and is governed by an unknown complex system, the ST-LIM uses the observed correlation function , computed as
and Equations 6, 8 to determine the estimated first-order dynamics and diffusion matrix . Consequently, it yields the approximate system of the form
to represent the underlying system. This approach is referred to as the Markov assumption or Markov approximation.
Though the ST-LIM algorithm appears straightforward, the choice of is critical. It cannot be arbitrary due to the multi-valued nature of the matrix logarithm and the potential numerical instabilities arising from the eigenstructure of the Green function at multiples of Nyquist lags. This issue becomes especially critical for larger values, particularly in the tau-test, where is varied over a broader range (Penland, 2019).
2.1 Multi-valued logarithm
In theory, to ensure that the logarithm (Equation 8) correctly estimates , it is essential to select the appropriate branch cut rather than always using the principal logarithm. Let and , respectively, be the matrices consisting of the left eigenvectors and right eigenvectors corresponding to the eigenvalue of such that (Penland and Matrosova, 1994; Risken, 1989). As , we have the spectral decomposition as follows:
and
where . Suppose that has a conjugate eigenpair ; for clarity, we drop the subscript . Then, the computation of the matrix logarithm reduces to the complex logarithm of eigenvalues , which can take values from the set k = 0, 1, 2,...}.
If the principal logarithm, whose branch cut is the negative real line, is used, a jump discontinuity will appear when first crosses , and Equations 9a, b will not return the correct phase argument for . Furthermore, as the eigenpair of merges into repeated eigenvalues at , an ambiguity arises in choosing and numbering the eigenvectors. This critical value is known as the Nyquist lag (Penland, 2019). To ensure that Equations 9a, b hold for all , with an initial , one must track the phase of each eigenvalue and the corresponding eigenvectors. This is particularly important when approaches , where k > 0, as the discontinuities in the logarithm or repeated eigenvalues may occur. In such cases, it is necessary to adjust the branch cut or appropriately select the eigenvectors. With an appropriate eigen-adjustment, Equations 9a, b hold for each , allowing to be consistently constructed from . The number of Nyquist lags corresponds to the number of complex eigenpairs, so the initial point should be selected to be less than the smallest Nyquist lag.
2.2 Nyquist issue
The Nyquist issue arises from the multivalued nature of the complex logarithm and the instability of eigenvectors of at the Nyquist lag (Penland, 2019; Penland and Sardeshmukh, 1995). Let denote an observed Green function computed from finite observations of a Markov system, and let , , and , be defined in the same manner as before. We can express this as follows:
where denotes the residual matrix. As the eigenvalue varies continuously with respect to , the repeated eigenvalues can split into two distinct real eigenvalues at a Nyquist lag whenever . This typically does not create significant issues if is sufficiently small, which is expected when the observation time is sufficiently long. In such cases, we can track the eigenvalues of as a function of discrete-valued and number them so that each eigenvalue is “numerically” continuous in . However, at the Nyquist lag, the eigenvectors become discontinuous at due to the repeated eigenvalues and undergo a drastic change once perturbed. In particular, the eigenvectors , , , and may have complex entries, while is real-valued. Therefore, Equations 9a, b are not stable at the Nyquist lags and their multiples, where repeated eigenvalues should ideally be present. More concerningly, at odd multiples of the Nyquist lags, the perturbation further introduces complex entries into , contradicting the real-valued observations (Culver, 1966). In practice, is inevitably noisy due to finite observation, causing potential failures in the tau-test whenever approaches one of the multiples of the Nyquist lags. This issue becomes increasingly unavoidable as increases (Penland, 2019).
To demonstrate the Nyquist issue, we consider the linear system (Equation 4), where
, and . This linear system, similar to example 1 in Penland (2019), consists of two non-interacting rotation submatrices, with the corresponding Nyquist lags being 0.1 and 0.14. We compute the dynamical matrices over different values of using Equations 9a, b under four different conditions: whether the Green function is subjected to residual noise or not and whether the eigen-adjustment is applied or not. Then, the resulting output matrices are compared with the ground truth matrix using the relative error measured by the -norm, defined as follows:
Figure 1 shows the relative errors as a function of . Without eigen-adjustment, the relative errors of the estimated dynamical matrices diverge after the smallest Nyquist lag of 0.1, regardless of the presence of residual noise, and do not recover as increases. With eigen-adjustment, the ground truth can be estimated if the Green function is free from noise, which is equivalent to an ideal scenario of flawless and infinite observations. On the other hand, when the noisy Green function is noisy, the Nyquist issue persists at multiples of the Nyquist lags.
3 CS-LIM
Although the ST-LIM is effective for stationary observational data, its stationary assumption renders it incapable of capturing the periodic characteristics of cyclostationary data. In the CS-LIM framework, we model the underlying unknown periodic system using the periodic OU process as follows:
where and are periodic dynamical and (positive definite) diffusion matrices, respectively, with the same period. Without loss of generality, we may assume that the period equals 1. The time evolution of the probability distribution is given by Equation 5, with and being time-dependent. To achieve a cyclostationary steady state, all Floquet multipliers (i.e., the eigenvalues of the monodromy matrix ) of Equation 13 must have modulus less than 1 (Chicone, 2006; OrtizBeviá, 1997). In the cyclostationary steady state, the covariance function can be viewed as a function on the circle of perimeter 1, called the phase space, which is parametrized by . By naturally identifying and with functions on through the projection map , the balance equation is now described by the cyclostationary FDR (CS-FDR) as follows:
For convenience, we identify the phase variable with the time variable via without explicitly stating it when there is no confusion.
In general, since for some and , the monodromy matrix and the correlation function do not admit an explicit expression. To better understand the periodic system, as a heuristic example, let us consider , where is the mean state and satisfying describes the variation. In this special case, the monodromy matrix can be explicitly written as , meaning that stability requires all eigenvalues of to have negative real parts and that temporary instability is possible as may take values less than at certain times. Moreover, the correlation function can be expressed as , implying that the integrated effect of the dynamical matrices over the current time and its -lag later is considered.
Following the workflow of the ST-LIM, our goal is to express in terms of the correlation function . However, is, in general, mathematically intractable, making it difficult to derive a logarithmic-like equation similar to Equation 8. Therefore, an alternative approach is necessary.
3.1 e-CS-LIM
To determine the first-order dynamics, Shin et al. (2021) proposed what we now call the interval-wise Markov approximation, forming the basis of the original CS-LIM and the proposed -CS-LIM. For clarity, we briefly outline this method using an example. Consider a set of observations of a physical phenomenon with a diurnal period (1 day), with a sampling interval of 15 min over days. First, by equi-partitioning the phase space into intervals , the data are divided into 24 subsets given by , where the th subset contains data sampled during the th hour of a day. Each presents a similar stage of cyclostationary dependence. For each , we compute the observed correlation function for and using data sampled in the -th hour of a day and minutes later; that is, we compute
where denotes the number of observations in .
The interval-wise Markov approximation plays a crucial role in estimating the dynamics. The -CS-LIM makes the stationary assumption for each hour and attempts to apply Equation 8 to estimate the dynamics. This extends the Markov approximation across all hours, justifying the term interval-wise Markov approximation. However, in contrast to ST-LIM, which uses all available observations to estimate a single dynamical matrix, the -CS-LIM approach may suffer from insufficient sampling due to equi-partitioning. This may further result in noisy in the phase direction , potentially requiring a filter to smooth in before solving the dynamics. This procedure is known as phase averaging (Shin et al., 2021). Once a suitable is obtained, Equation 8 is applied for each to compute the estimated dynamics , followed by the CS-FDR (Equation 14) to compute the estimated diffusion matrix for the th hour. As a result, -CS-LIM formally produces an approximate system of the following form:
whose cyclostationary probability distribution is Gaussian at each . The original CS-LIM follows the same procedure but differs in the numerical implementation, with its output matrices denoted by and .
Although the interval-wise Markov approximation works in practice, its validity requires further clarification. Though does not have an explicit form in general, its first-order right derivative with respect to the lag variable at the origin satisfies the following equation:
Therefore, by integrating with respect to , we obtain
which indicates that for each phase , the correlation function behaves approximately as an exponential function in the lag variable . As a result, applying Equation 8 to each in -CS-LIM provides a reasonable approximation of the underlying deterministic dynamics as long as there are no abrupt phase-dependent variations in the dynamics of the sampling data used in Equation 15. This suggests that -CS-LIM is a hybrid-type model: the dynamics are approximated, while the diffusion process follows an analytic formula. Consequently, it does not analytically reconstruct the periodic linear dynamics and diffusion of the linear stochastic process (Equation 3), as mentioned in Section 1.
From a numerical aspect, the scheme of computing the derivative of observed covariance and the assignment of time coordinates for , , and remain unspecified at this point. For instance, it is unclear whether th covariance corresponds to o’clock, :30 or another reference time.
In the -CS-LIM framework, the time coordinate for is assigned to the midpoint of the th interval. In contrast, the time coordinate for is shifted by half the lag, i.e., , since the data at 0-lag and -lag are used to compute in Equation 15. In general, after sorting, the distance between and and the distance between and can differ. In this case, the computation of requires careful treatment. To address this, for simplicity, we assume to be a multiple of whenever discussing diffusion. This ensures that the time coordinates and (after appropriate rearrangement) either coincide or alternate at uniform intervals of distance . Then, we assign the time coordinate for as , and we use the center difference scheme to compute when and coincide and the forward difference scheme when they alternate.
Contrary to the -CS-LIM, the original CS-LIM framework always adopts centered differencing to compute and a common th time coordinate at the midpoint of the th interval for , , and . This stencil configuration, however, introduces an artificial phase shift in the estimated dynamics. A schematic illustration and the pseudocode for the original CS-LIM and -CS-LIM are presented in Figure 2 and Algorithm 1, respectively.
3.1.1 Phase shift of the original CS-LIM
To demonstrate the phase shift, we consider an ideal case in which the observational data are sampled using the following equation:
where , , , and , over an infinitely long window with a sampling interval . We apply the original CS-LIM and -CS-LIM with (i.e., on the sample points) and . The results are shown in Figure 3.
Regardless of , the dynamics from the original CS-LIM exhibit a notable phase shift of approximately , which further compromises the accuracy of . In contrast, -CS-LIM significantly mitigates the phase shift, even when is partitioned into 10 subsets. However, it is worth noting that for a larger value, variations in the dynamics are smoothed out as a consequence of interval-wise Markov approximation and Equation 18. In addition, the Nyquist issue does not arise in the one-dimensional case.
3.1.2 Nyquist issue for the e-CS-LIM
Due to the interval-wise application of the ST-LIM, the CS-LIM is also subject to the Nyquist issue. Contrary to the ST-LIM, in which the Nyquist issue does not appear in the theoretical computation when the eigen-adjustment is applied (i.e., (adjusted) in Figure 1), the small-o term in Equation 18 introduces a non-trivial residual in Equation 10, causing the Nyquist issue even if flawless infinite observation is possible.
To demonstrate the Nyquist issue, we suppose that the data are sampled infinitely from the periodic linear system (Equation 19), where , is given by Equation 11, and . To examine the dependence on the partitioning of the phase space, we consider and apply the -CS-LIM over different values of .
Figure 4 shows the relative errors between and for certain values of as a function of . Since the dynamical matrix varies in time, the Nyquist issue does not necessarily appear at the multiples of the Nyquist lags of the mean state , but rather near them. In addition, even outside the Nyquist lags, the relative error remains non-zero, which again indicates that the -CS-LIM is not an analytic inverse model for Equation 13. Nevertheless, the relative error is acceptable for application purposes, provided that is not overly large.
3.2 l-CS-LIM
As the -CS-LIM interval-wise approximates the underlying stochastic process using a Markov system and applies Equation 8, we aim to build an (analytic) inverse model for the periodic Markov system (Equation 3). Instead of using the global information of as in the ST-LIM, the -CS-LIM extracts the dynamics by applying Equation 17, the local information of , followed by computing the diffusion matrix using CS-FDR. This suggests that given the observational data, the -CS-LIM outputs the best-fit periodic Markov system for the unknown complex periodic system, provided that the sampling interval and the observation window . In practice, the observed correlation function is obtained using the following equation:
and the partial derivatives of correlation function and are estimated by forward finite difference. The corresponding time coordinates for the estimated time-dependent dynamics and diffusion are shifted to the right by . In other words, , where , , and denote the general reference of the time coordinates for , , and , respectively, as previously stated. As a result, the -CS-LIM formally determines a periodic linear system as follows:
whose cyclostationary distribution is also Gaussian for each . The algorithm of -CS-LIM is summarized in Algorithm 2.
This derivative-based method avoids the multi-valued matrix logarithm and is, therefore, not affected by the Nyquist issue. However, it is more sensitive to noise, and phase averaging is applied to whenever necessary.
3.3 Relationship among LIMs
All LIMs discussed in this article are related. First, we note that the -CS-LIM is indeed an extension of the ST-LIM. If the linear dynamics and diffusion matrix are constant in time, the correlation function is independent of phase variable , and Equation 17 can be written in the following form.
which is equivalent to the right derivative of Equation 7. At the same time, for a general periodic stochastic process, we can view -CS-LIM as approximating the underlying system using a Markov system at each instant as Equations 17, 22 share the same form. Therefore, we may say that -CS-LIM utilizes the pointwise Markov approximation contrary to the interval-wise version adopted in the -CS-LIM.
A natural question is whether the ST-LIM accurately estimates the mean state from the infinite cyclostationary observational data sampled from the periodic Markov system (Equation 3). As the correlation function used in the ST-LIM is the integral of over , it is clear that it does not equal , where is the mean covariance. Nevertheless, if does not vary drastically, the estimated linear dynamics from ST-LIM remain close to the mean state, preserving the dynamics-relevant information of the underlying system.
From a numerical viewpoint, at a given phase , the -CS-LIM applies the forward difference method to the observed correlation function at two consecutive points , which amounts to a linear fit of these two points. On the other hand, the -CS-LIM calculates at the origin and the point at and applies Equation 8, which corresponds to an exponential curve fitting whenever the Nyquist issue does not persist. This justifies the prefixes -CS-LIM and -CS-LIM. As a result of Equation 18, the -CS-LIM converges to the -CS-LIM under the conditions of , , , and . Consequently, the -CS-LIM can be viewed as an instantaneous version of the -CS-LIM.
In practical applications, the difference between the -CS-LIM and -CS-LIM goes beyond the estimation of dynamics. The -CS-LIM, as a derivative-based method, pointwise estimates the Jacobian, which is the first-order approximation of the unknown dynamics. As the state variable moves away from the steady state or as time marches, the higher-order terms in the function’s expansion may become more significant, and the accuracy of linear approximation decreases, lowering the effectiveness of approximating the unknown system using Equation 13. Nevertheless, the Jacobian can still be important in certain practical applications. The -CS-LIM, on the other hand, finds the best-fit linear system within each subinterval in the phase space. Though the -CS-LIM is not an analytic inverse model for a general periodic OU process, it remains effective because for each phase, more sampling can be used in Equation 15 than in Equation 20, and the lag can exceed . As a result, the -CS-LIM is more robust to noise than -CS-LIM. The intercomparison among LIMs is summarized in Table 1.
4 Numerical experiments
As the CS-LIMs compute the dynamical and diffusion matrices using the observed correlation function , they can be regarded as functions that take as input and produce the estimated dynamics and diffusion. As an inverse method, the accuracy of these estimates depends on the quality of (Lien et al., 2024). If computed from observations accurately captures the true correlation function, then both CS-LIMs are expected to effectively estimate both dynamics and diffusion. However, due to the finite number of observations, is inevitably affected by noise, raising concerns about the stability of the algorithms as numerical derivatives are required. Therefore, in this section, we evaluate the performance and sensitivity of both -CS-LIM and -CS-LIM using the synthetic time-series data of Equation 3 under various conditions. For notational convenience, we write and as general references for and as well as and , respectively.
More precisely, for the given and , we use the Euler approach with a timestep of from to to generate auxiliary time-series data; we then subsample it with a sampling step of to obtain the observational data , and we use CS-LIMs to obtain the estimates and . As the accuracy of models depends on the stochastic nature of the dataset, the abovementioned process will be repeated 128 times in each scenario, and the resulting statistics (e.g., median) of relative errors, defined similarly to Equation 12, will be used to evaluate the models. This evaluation process will be applied to each case presented in this section.
4.1 One-dimensional case
To visualize the CS-LIM results, we start with a one-dimensional case represented by Equation 19, where , , , and . For the -CS-LIM, no filter is applied to the observed correlation function , while for the -CS-LIM, a sliding average with a window of length is applied to and .
Figure 5a shows an example of the -CS-LIM results for , , and under a distinct number of intervals . Although a smaller reduces the resolution of the reconstruction, it does not significantly affect the reconstructed dynamics as long as the variation can be effectively captured by . Moreover, a smaller enhances robustness against noise, leading to a more stable estimate of . For , the stochastic influences in estimated dynamics remain limited since the lag can be chosen to be larger than , thereby mitigating numerical instability associated with dividing noisy data by a small number. This is consistent with the theoretical case illustrated in Figure 3. However, in the computation of , the numerical differentiation of noisy results in unfavorable noise amplification in . Although phase averaging of could mitigate this issue, we do not pursue this further for -CS-LIM in our analysis.
Figure 5b shows an example of the -CS-LIM results for under different parameter settings: sampling intervals and a window width of moving averaging . Without phase averaging , noisy causes amplified noisy variations in , especially for the small sampling interval , raising concerns over whether contains excessive stochastic influences. This issue can be effectively addressed by applying phase averaging to ; when , the estimated provides a clean representation of the underlying dynamics.
In contrast, the estimated for appears more robust to the noisy numerical derivatives, though this may be attributed to the idealized nature of synthetic data and the choice of finite difference schemes. To further examine the sensitivity, we introduce artificial perturbations to by adding a small noise matrix and apply the resulting correlation function to lines 12 and 13 in Algorithm 2. The corresponding estimated diffusion then exhibits noisy variation, confirming the sensitivity of to noisy . Nevertheless, applying phase averaging to the noisy correlation function again effectively removes the undesired noisy variations in the estimates. Finally, it is noted that the CS-FDR is maintained since the filtering is applied to rather than or .
Next, we examine the sensitivity of CS-LIMs to various parameters. Figure 6a shows the results of the -CS-LIM for a fixed . As the observation time increases, more accurately reflects the underlying system, and hence, the performance consistently improves. However, the accuracy decreases with increasing due to its smoothing effect on , which is consistent with the analytic case shown in Figure 3. In addition, reducing leads to a modest performance enhancement for the -CS-LIM since more data points are used in Equation 15, but the improvement is limited.
In addition, although the numbers of the observational data for cases and are the same, their performance differs. This discrepancy arises because in the former case, all observational data corresponding to the subinterval contribute to the computation of and , which is equivalent to capturing the integrated effect near and . In contrast, and in the latter are directly at and since .
Figure 6b shows the results for a fixed with varying . As previously discussed, the noisy variations in associated with larger are limited due to a suitable . However, the remains more sensitive to the noisy numerical derivative of covariance, implying the need for phase averaging. As a result, to optimize the performance of the -CS-LIM, a balance must be struck: should be kept as small as possible to minimize noise amplification in , while , optimally chosen as , should remain small enough to prevent excessive smoothing or insufficient resolution that could obscure meaningful variations in and .
Moreover, we analyze the impact of sampling interval and filtering on the -CS-LIM, as shown in Figure 7. First, we see that if no filter is applied, a larger in the calculation of the numerical derivative, though less accurate in the finite difference approximation, is less sensitive to noise, leading to a lower relative error in the noise-dominant cases. Then, we observe that applying phase averaging to effectively reduces the noisy variations in , stabilizing the dynamics estimation, particularly at small or small , with a clear decrease as window size increases. However, in cases with limited noisy variations, particularly at large and larger , excessive filtering can lead to additional error by overly dampening meaningful fluctuations, which is noticeable in the case of and . Thus, optimal phase averaging requires balancing noise reduction and the preservation of essential variations to prevent over-smoothing, particularly when the noise level is low.
4.2 Higher-dimensional cases
From the previous section, we observed that the lag variable and the number of division of the phase space play critical roles in the -CS-LIM performance, and filtering with the optimal window size effectively removes the noisy component and retains the meaningful fluctuation in the -CS-LIM. In this section, we extend the analysis to multi-dimensional cases, setting and for the -CS-LIM and for the -CS-LIM with a sampling interval . The underlying equation is still represented by Equation 19, with and being the same as in the previous section. Each entry of and is initially drawn from the standard normal distribution and then adjusted so that is a damping operator and is positive definite. The Nyquist lag in this setup is significantly larger than , eliminating concerns about the Nyquist issue.
The performance of CS-LIMs across dimension from 1 to 10 is summarized in Figure 8. In general, if the observation timespan is sufficiently long so that well-estimates the underlying true correlation function , both CS-LIMs can effectively capture dynamics and diffusion. Otherwise, the estimation can fail drastically, especially for in higher-dimensional cases, since it is the sum of multiple noisy time series. It should be noted that this study does not aim to compare the two CS-LIM variants directly because the accuracy of CS-LIMs is influenced by numerous factors. For example, the -CS-LIM may struggle to capture fine variations or locate peaks due to its relatively sparse resolution on the phase space, while the -CS-LIM depends on the choice of filtering, and the optimal support or window size of filtering may vary by applications.
Finally, we revisit the Nyquist issue. We keep the underlying system the same (i.e., and remains unchanged) but with given by Equation 11 and . Figure 9 shows the relative errors in the dynamics for an observation time , with parameters and for the -CS-LIM and when and when for the -CS-LIM. The window size is chosen such that the sliding average filter has a support of 0.1, which is equal to the lag in thed -CS-LIM. At a timestep of , -CS-LIM performance appears stable and reliable, but closer examination reveals that extreme spikes in the estimated dynamics occur randomly but frequently. Unlike the case of the stationary LIM, this error is relatively limited, as only certain points are affected by the Nyquist issue, while others remain accurate. The -CS-LIM result, in contrast, exhibits a narrower error range but overall higher relative errors. When is reduced to 0.002, the sensitivity of the -CS-LIM to the Nyquist issue increases, with relative errors now ranging from 8% to 81%, even exceeding the error range in Figure 8 for and , where and are randomly generated. Meanwhile, the -CS-LIM exhibits a stable and narrow error range with significant improvement, implying that the large relative error for the -CS-LIM with mainly arises from the finite difference scheme used in the computation.
5 Real-world example: the ENSO recharge oscillator
The El Niño-Southern Oscillation is a dominant interannual climate mode driven by interactions between the ocean and the atmosphere over the tropical Pacific Ocean (Di Lorenzo et al., 2015). The ENSO oscillates between two primary phases, namely, El Niño and La Niña, characterized by unusual warming and cooling of SSTs in the central and eastern Pacific, respectively (Capotondi et al., 2015; Okumura et al., 2011). The Niño 3.4 SST index, defined as the anomaly of the average SST over the region 5S–5N and 170W–120W, serves as a widely used indicator to monitor and predict El Niño and La Niña events (Capotondi et al., 2015; Okumura, 2019).
In climate modeling, the variability in oceanic variables can be viewed as the superposition of atmospheric variability and subsurface processes, modeled by external stochastic forcing and deterministic dynamics, respectively (Deser et al., 2010; Lou et al., 2020). Consequently, ENSO can be viewed as a periodically driven stochastic process. Previous studies have emphasized the seasonal variations in the ENSO (Cane et al., 1986; Chen and Jin, 2021; Chen and Jin, 2022; Levine and McPhaden 2015; Shin et al., 2021; Webster and Yang 1992), but only a few have quantified the relative roles of the seasonal variations in the predictable SST dynamics and unpredictable stochastic forcing (Shin et al., 2021).
In this study, we revisit one of the simplest low-dimensional linear models of the ENSO, which represents the system using the anomalies of area-averaged SSTs and the depth of the 20°C isotherm (D20) over the equatorial Pacific (Burgers et al., 2005; Jin et al., 2007). Under the stationary assumption, previous research has interpreted the ENSO mechanism as a recharge oscillator, where the SST anomaly and D20 anomaly play the roles of momentum and position, respectively (Burgers et al., 2005).
For our analysis, we obtained monthly SST and D20 data from ORAS5 reanalysis for the period 1979–2021 (Zuo et al., 2019). These data were then averaged over 5S–5N and 170W–120W for SSTs and 5S–5N and 120E–80W for D20. Given that the data may exhibit quasi-stationary behavior due to low-frequency signals such as Pacific decadal variability, we applied a linear detrending procedure as a preprocessing step (Kuo et al., 2023). To study the seasonal dependence of the ENSO, we utilized the -CS-LIM and -CS-LIM, applying them to the z-scores of the preprocessed averaged SST and D20 anomalies.
To reduce sampling uncertainties, we also applied a three-month sliding average to for both CS-LIMs rather than directly smoothing the input data. This approach follows the methodology proposed by Shin et al. (2021). The input data and their distributions are shown in Figure 10.
Using the CS-LIMs, we obtain the time-dependent dynamics and diffusion matrices . As they are defined at discrete points, we apply linear interpolation entry-wise to construct the approximate systems, as described in Equations 16, 21. The covariance matrices reconstructed by the CS-LIM are calculated from the synthetic data generated from the approximate systems over 5,500 years and are used to estimate the cyclostationary distributions and (collectively referred to as ).
To compare with observations, we apply the bivariate Gaussian kernel density estimator to calculate the observed joint probability density function (PDF) for each calendar month. Figures 11, 12 show the cyclostationary distributions , the vector fields associated with the dynamics , and eigenvectors of scaled by the square root of the corresponding eigenvalues to represent the approximate periodic Markov systems.
The results reveal that the estimated dynamics exhibit a clockwise spiral sink for most calendar months, contributing to a rotational pattern in the cyclostationary distribution, except in late August and September. In addition, the principal direction of the diffusion matrices consistently remains in the first quadrant throughout the year, indicating that atmospheric stochastic forcing mainly acts to warm or cool both the surface and sublayer at the same time. The discrepancy , shown by the filled contours in Figures 11, 12, predominantly appears negative on the upper plane but positive on the lower plane throughout the year. This tendency can be mainly attributed to the negative skewness in the z-score of the D20 anomaly shown in Figure 10 as the approximate systems yield a time-dependent Gaussian distribution .
Figure 13 shows the seasonal variation in the eigenvalues of and . The results for the deterministic dynamics from both CS-LIMs are consistent, showing a notable seasonal dependence; although the dynamics overall act as a damping force, they become temporarily unstable in boreal fall. The imaginary part is especially prominent in boreal late spring and summer, contributing to a phase transition in the SST–D20 correlation that shifts from negative to positive. However, in boreal autumn, the dynamics slow down significantly, and two real eigenvalues even appear in October, indicating a non-rotating regime. The Floquet exponents (i.e., the principal logarithms of the Floquet multipliers) of the approximate system determined by the -CS-LIM and -CS-LIM reveal decay rates of 2.01 and 2.41 years with periods of 4.05 and 3.83 years, respectively. These are slightly different from, but consistent with, the ST-LIM result with a decay rate of 2.00 years and a period of 3.08 years (Burgers et al., 2005).
The diffusion terms reveal distinct seasonal patterns between the two methods. The -CS-LIM shows a minimal seasonal variation in diffusion, whereas the -CS-LIM suggests that the random forcing intensifies in two main periods, namely, April to June and October to November. These contrasting seasonal patterns in diffusion may lead to different interpretations of the ENSO mechanism.
The -CS-LIM result suggests a recurring annual cycle in both oceanic deterministic dynamics and atmospheric stochastic forcing. In March and April, while the background oceanic system pulls the SST and D20 back to equilibrium, the unpredictable atmospheric system intends to determine the ENSO phase (either positive, negative, or neutral) of the upcoming year. Once the phase is established in boreal summer, the oceanic system then acts as a driving force to build up the SST anomaly from August to late October. As the atmospheric forcing intensifies in the following months, the SST anomaly can reach extreme levels—either high or low—from November to January, potentially causing the El Niño and La Niña events. As the cycle progresses, weaker atmospheric excitation and stronger oceanic damping from February and April work help store the SST anomaly, gradually returning the system to its equilibrium state.
It is also noteworthy that the time-dependent stochastic forcing also affects the transition of the SST–D20 correlation. Compared with the case where is replaced by the time-independent , the seasonality further speeds up the rotation in from April to June since the principal direction of the covariance matrix rotates toward that of the diffusion matrix, whose intensity reaches the maximum over these months. Though the dynamics play a major role in the transition, the seasonal feature of diffusion is not negligible.
Overall, this recurring annual pattern demonstrates how random forcing drives the system away from its mean state, contributing to the inter-annual variability observed in the ENSO. The combination of lower SST anomalies in March and April, along with stronger atmospheric forcing, makes it challenging to predict whether SST anomalies will increase or decrease in the subsequent months, which is a factor potentially linked to the well-known ENSO prediction barrier (Cane et al., 1986; Levine and McPhaden, 2015; Webster and Yang, 1992). Moreover, the unstable deterministic dynamics and enhanced random forcing in boreal autumn may help explain the increased SST variability in boreal winter, contributing to the observed ENSO phase locking (Chen and Jin, 2021; Chen and Jin, 2022).
In contrast, the nearly constant random forcing in the -CS-LIM results may imply that the seasonal variability of the SST anomaly is mainly driven by the seasonal variation in deterministic oceanic dynamics rather than that in the atmospheric excitation force. Compared with the -CS-LIM, we can say that the seasonal feature of (if any) is transferred to that of because the CS-FDR holds in both the methods.
For each model, we generate a 128-member ensemble of 43 years for SST and D20 anomalies by numerically integrating the system (Equation 13) determined by CS-LIMs with a timestep of years. The 90% confidence interval levels of the SST covariance derived from the ensembles are shown in Figure 13. The -CS-LIM result shows wider confidence intervals than the -CS-LIM results from April to June and October to November due to stronger random forcing during these periods. Despite different interpretations of the seasonality of stochastic forcing, both models capture the seasonal variation in observed SST covariance (without smoothing). Moreover, the reconstructed monthly SST–D20 cross-covariances, D20–D20 auto-covariances, and annual correlation functions, shown in Supplementary Material, are consistent with observations, demonstrating their effectiveness in capturing the dominant modes of a non-linear system. Although this is a simplified model for ENSO studies, we emphasize that CS-LIMs can be applied to higher-dimensional climate variables, offering spatial–temporal coherence in complex systems (Kido et al., 2023; Shin et al., 2021).
6 Conclusion
In this article, we examine the mathematical background of CS-LIMs, a class of data-driven methods that construct periodic OU processes from finite cyclostationary data and present the -CS-LIM and -CS-LIM. The -CS-LIM is developed based on interval-wise approximation, dividing the phase space into intervals where data exhibit similar cyclostationary dependencies. This approach enables the ST-LIM to estimate the linear dynamics by capturing the integrated effect over each interval, making it effective for most applications. In addition, the improved stencil design in the -CS-LIM eliminates the phase shift observed in the original CS-LIM. However, despite these improvements, it remains vulnerable to the Nyquist issue, which also affects the ST-LIM. In contrast, the -CS-LIM serves as an analytic inverse model to the periodic OU process and estimates the time-dependent Jacobian of a complex system in general. As a derivative-based method, it inherently avoids the Nyquist issue, offering an alternative for studying periodic systems.
Numerical experiments on synthetic data show that both CS-LIM variants effectively capture the temporal structure of the underlying periodic systems. With an optimally chosen interval division, the -CS-LIM uses a relatively sufficient amount of data to estimate the dynamics and preserves meaningful variations without amplifying noise or excessive smoothing. On the other hand, although the -CS-LIM is sensitive to noise, its performance significantly improves when optimal phase averaging is applied to the observed correlation function. In cases where the Nyquist lag of the mean dynamics is close to the optimal range of , the -CS-LIM estimation may fail at certain intervals, leading to significant and abrupt derivations from the underlying system. In contrast, the -CS-LIM remains robust, provided that the sampling interval is sufficiently small to ensure accurate finite difference calculations.
We have applied the CS-LIMs to the real-world Niño SST 3.4 index and the area-averaged depth of C isotherm, two oceanic variables commonly used to study the ENSO. Our findings show that in the boreal spring, the strong damping dynamics and stochastic forcing associated with the ENSO make deterministic forecasting challenging, which is related to the ENSO prediction barrier. In autumn, the unstable dynamics amplify the SST anomaly established in summer, increasing the SST variability and covariance in winter and potentially contributing to ENSO phase locking. Moreover, the estimated decay rates and periods obtained by both methods align with the current understanding of the ENSO, and the reproduced SST covariance captures observed seasonal dependency. This ENSO study demonstrates the capabilities of CS-LIMs and highlights their potential to provide a deeper insight into complex climate systems.
In addition to time-series analysis, CS-LIMs can also serve as predictive tools for periodic systems (Shin et al., 2021; Vimont et al., 2022). While the ST-LIM has been used for forecasting, particularly in climate science, recent studies have recognized the importance of accounting for seasonality in predictive models (Alexander et al., 2008; Vimont et al., 2022; Zhao et al., 2024). A valuable direction for future research would be a comparative study evaluating how different assumptions in LIMs affect model prediction skills and how they perform relative to AI-based forecasting methods and full dynamical models. Such a comparison would provide key insights into the strengths and limitations of CS-LIMs and help identify the fundamental elements of model predictability.
Moreover, the CS-LIM framework has broad applicability beyond analysis and prediction. Dynamical mode decomposition, the deterministic counterpart of the ST-LIM, has been widely applied to study chaotic and quasi-cyclostationary systems, integrated with wavelet decomposition, and applied to enhance machine learning techniques (Donge et al., 2023; Hou et al., 2025; Krishnan et al., 2023; Kutz et al., 2016; Naderi et al., 2019; Tu et al., 2014). By building on these ideas, CS-LIMs, as periodic extensions of the ST-LIM, have the potential to improve these applications by explicitly incorporating periodic features into system dynamics. We believe that both -CS-LIM and -CS-LIM will offer new perspectives and advancements in climate sciences and other fields of study.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
JL: conceptualization, formal analysis, methodology, software, and writing–original draft. Y-NK: conceptualization and writing–review and editing. HA: funding acquisition, supervision, and writing–review and editing. SK: investigation and writing–review and editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the (1) Council for Science, Technology, and Innovation (CSTI), the Cross-ministerial Strategic Innovation Promotion Program (SIP), and the 3rd period of SIP “Smart energy management system” grant number JPJ012207 (funding agency: JST) and (2) Grant-in-Aid for Scientific Research (B), 23K25946.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that Generative AI was used in the creation of this manuscript. During the preparation of this manuscript, the authors used ChatGPT-4o in order to improve readability. After using this tool, the authors reviewed and edited the content as needed and took full responsibility for the content of the publication.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcpxs.2025.1563687/full#supplementary-material
References
Alexander, M. A., Matrosova, L., Penland, C., Scott, J. D., and Chang, P. (2008). Forecasting pacific SSTs: linear inverse model predictions of the PDO. J. Clim. 21, 385–402. doi:10.1175/2007JCLI1849.1
CrossRef Full Text | Google Scholar
Burgers, G., Jin, F.-F., and van Oldenborgh, G. J. (2005). The simplest ENSO recharge oscillator. Geophys. Res. Lett. 32. doi:10.1029/2005GL022951
CrossRef Full Text | Google Scholar
Cane, M. A., Zebiak, S. E., and Dolan, S. C. (1986). Experimental forecasts of El Niño. Nature 321, 827–832. doi:10.1038/321827a0
CrossRef Full Text | Google Scholar
Capotondi, A., Wittenberg, A. T., Newman, M., Di Lorenzo, E., Yu, J.-Y., Braconnot, P., et al. (2015). Understanding ENSO diversity. Bull. Am. Meteorological Soc. 96, 921–938. doi:10.1175/bams-d-13-00117.1
CrossRef Full Text | Google Scholar
Chen, H.-C., and Jin, F.-F. (2021). Simulations of ENSO phase-locking in CMIP5 and CMIP6. J. Clim. 34, 5135–5149. doi:10.1175/JCLI-D-20-0874.1
CrossRef Full Text | Google Scholar
Chen, H.-C., and Jin, F.-F. (2022). Dynamics of ENSO phase-locking and its biases in climate models. Geophys. Res. Lett. 49, e2021GL097603. doi:10.1029/2021GL097603
CrossRef Full Text | Google Scholar
Chicone, C. (2006). “Ordinary differential equations with applications,” in Texts in applied mathematics. New York: Springer.
Google Scholar
Culver, W. J. (1966). On the existence and uniqueness of the real logarithm of a matrix. Proc. Am. Math. Soc. 17, 1146–1151. doi:10.1090/s0002-9939-1966-0202740-6
CrossRef Full Text | Google Scholar
Deser, C., Alexander, M. A., Xie, S.-P., and Phillips, A. S. (2010). Sea surface temperature variability: patterns and mechanisms. Annu. Rev. Mar. Sci. 2, 115–143. doi:10.1146/annurev-marine-120408-151453
PubMed Abstract | CrossRef Full Text | Google Scholar
Di Lorenzo, E., Liguori, G., Schneider, N., Furtado, J. C., Anderson, B. T., and Alexander, M. A. (2015). ENSO and meridional modes: a null hypothesis for pacific climate variability. Geophys. Res. Lett. 42, 9440–9448. doi:10.1002/2015GL066281
CrossRef Full Text | Google Scholar
Donge, V. S., Lian, B., Lewis, F. L., and Davoudi, A. (2023). Accelerated reinforcement learning via dynamic mode decomposition. IEEE Trans. Control Netw. Syst. 10, 2022–2034. doi:10.1109/TCNS.2023.3259060
CrossRef Full Text | Google Scholar
Hamlington, B. D., Leben, R. R., Nerem, R. S., Han, W., and Kim, K.-Y. (2011). Reconstructing sea level using cyclostationary empirical orthogonal functions. J. Geophys. Res. 116, C12015. doi:10.102910.1029/2011jc0075292011JC007529
CrossRef Full Text | Google Scholar
Hou, Y., Chen, J., Ma, D., and Chan, P. W. (2025). Steady-state linear response matrix of the lorenz-63 system. Authorea Preprints.
Google Scholar
H Tu, J., W Rowley, C., M Luchtenburg, D., L Brunton, S., and Nathan Kutz, J. (2014). On dynamic mode decomposition: theory and applications. J. Comput. Dyn. 1, 391–421. doi:10.3934/jcd.2014.1.391
CrossRef Full Text | Google Scholar
Jacka, S. D., and Oksendal, B. (1987). Stochastic differential equations: an introduction with applications. J. Am. Stat. Assoc. 82, 948. doi:10.2307/2288814
CrossRef Full Text | Google Scholar
Jin, F.-F., Lin, L., Timmermann, A., and Zhao, J. (2007). Ensemble-mean dynamics of the ENSO recharge oscillator under state-dependent stochastic forcing. Geophys. Res. Lett. 34. doi:10.1029/2006GL027372
CrossRef Full Text | Google Scholar
Johnson, S. D., Battisti, D. S., and Sarachik, E. S. (2000a). Empirically derived markov models and prediction of tropical pacific sea surface temperature anomalies. J. Clim. 13, 3–17. doi:10.1175/1520-0442(2000)013<0003:edmmap>2.0.co;2
CrossRef Full Text | Google Scholar
Johnson, S. D., Battisti, D. S., and Sarachik, E. S. (2000b). Seasonality in an empirically derived markov model of tropical pacific sea surface temperature anomalies. J. Clim. 13, 3327–3335. doi:10.1175/1520-0442(2000)013<3327:siaedm>2.0.co;2
CrossRef Full Text | Google Scholar
Kido, S., Richter, I., Tozuka, T., and Chang, P. (2023). Understanding the interplay between ENSO and related tropical SST variability using linear inverse models. Clim. Dyn. 61, 1029–1048. doi:10.1007/s00382-022-06484-x
CrossRef Full Text | Google Scholar
Kim, K.-Y. (2002). Investigation of ENSO variability using cyclostationary EOFs of observational data. Meteorology Atmos. Phys. 81, 149–168. doi:10.1007/s00703-002-0549-7
CrossRef Full Text | Google Scholar
Kim, K.-Y., Hamlington, B., and Na, H. (2015). Theoretical foundation of cyclostationary EOF analysis for geophysical and climatic variables: concepts and examples. Earth-Science Rev. 150, 201–218. doi:10.1016/j.earscirev.2015.06.003
CrossRef Full Text | Google Scholar
Krishnan, M., Gugercin, S., and Tarazaga, P. A. (2023). A wavelet-based dynamic mode decomposition for modeling mechanical systems from partial observations. Mech. Syst. Signal Process. 187, 109919. doi:10.1016/j.ymssp.2022.109919
CrossRef Full Text | Google Scholar
Kuo, Y.-N., Kim, H., and Lehner, F. (2023). Anthropogenic aerosols contribute to the recent decline in precipitation over the u.s. southwest. Geophys. Res. Lett. 50, e2023GL105389. doi:10.1029/2023gl105389
CrossRef Full Text | Google Scholar
Kutz, J. N., Brunton, S. L., Brunton, B. W., and Proctor, J. L. (2016). Dynamic mode decomposition. Philadelphia, PA: Society for Industrial and Applied Mathematics.
Google Scholar
Kwasniok, F. (2022). Linear inverse modeling of large-scale atmospheric flow using optimal mode decomposition. J. Atmos. Sci. 79, 2181–2204. doi:10.1175/JAS-D-21-0193.1
CrossRef Full Text | Google Scholar
Levine, A. F. Z., and McPhaden, M. J. (2015). The annual cycle in ENSO growth rate as a cause of the spring predictability barrier. Geophys. Res. Lett. 42, 5034–5041. doi:10.1002/2015GL064309
CrossRef Full Text | Google Scholar
Lien, J., Kuo, Y.-N., Ando, H., and Kido, S. (2025). Colored linear inverse model: A data-driven method for studying dynamical systems with temporally correlated stochasticity. Physical Review Research.
Google Scholar
Naderi, M. H., Eivazi, H., and Esfahanian, V. (2019). New method for dynamic mode decomposition of flows over moving structures based on machine learning (hybrid dynamic mode decomposition). Phys. Fluids 31, 127102. doi:10.1063/1.5128341
CrossRef Full Text | Google Scholar
Newman, M., Alexander, M. A., and Scott, J. D. (2011a). An empirical model of tropical ocean dynamics. Clim. Dyn. 37, 1823–1841. doi:10.1007/s00382-011-1034-0
CrossRef Full Text | Google Scholar
Newman, M., Shin, S.-I., and Alexander, M. A. (2011b). Natural variation in ENSO flavors. Geophys. Res. Lett. 38. doi:10.1029/2011GL047658
CrossRef Full Text | Google Scholar
Okumura, Y. M. (2019). ENSO diversity from an atmospheric perspective. Curr. Clim. Change Rep. 5, 245–257. doi:10.1007/s40641-019-00138-7
CrossRef Full Text | Google Scholar
Okumura, Y. M., Ohba, M., Deser, C., and Ueda, H. (2011). A proposed mechanism for the asymmetric duration of El Niño and La Niña. J. Clim. 24, 3822–3829. doi:10.1175/2011JCLI3999.1
CrossRef Full Text | Google Scholar
OrtizBeviá, M. J. (1997). Estimation of the cyclostationary dependence in geophysical data fields. J. Geophys. Res. Atmos. 102, 13473–13486. doi:10.1029/97JD00243
CrossRef Full Text | Google Scholar
Penland, C. (1989). Random forcing and forecasting using principal oscillation pattern analysis. Mon. Weather Rev. 117, 2165–2185. doi:10.1175/1520-0493(1989)117<2165:rfafup>2.0.co;2
CrossRef Full Text | Google Scholar
Penland, C. (1996). A stochastic model of IndoPacific sea surface temperature anomalies. Phys. D. Nonlinear Phenom. 98, 534–558. doi:10.1016/0167-2789(96)00124-8
CrossRef Full Text | Google Scholar
Penland, C. (2019). The Nyquist issue in linear inverse modeling. Mon. Weather Rev. 147, 1341–1349. doi:10.1175/.MWR-D-18-0104.1
CrossRef Full Text | Google Scholar
Penland, C., and Magorian, T. (1993). Prediction of Niño 3 sea surface temperatures using linear inverse modeling. J. Clim. 6, 1067–1076. doi:10.1175/1520-0442(1993)006<1067:ponsst>2.0.co;2
CrossRef Full Text | Google Scholar
Penland, C., and Matrosova, L. (1994). A balance condition for stochastic numerical models with application to the El niño-southern oscillation. J. Clim. 7, 1352–1372. doi:10.1175/1520-0442(1994)007<1352:abcfsn>2.0.co;2
CrossRef Full Text | Google Scholar
Penland, C., and Matrosova, L. (2006). Studies of El Niño and interdecadal variability in tropical sea surface temperatures using a nonnormal filter. J. Clim. 19, 5796–5815. doi:10.1175/JCLI3951.1
CrossRef Full Text | Google Scholar
Penland, C., and Sardeshmukh, P. D. (1995). The optimal growth of tropical sea surface temperature anomalies. J. Clim. 8, 1999–2024. doi:10.1175/1520-0442(1995)008<1999:togots>2.0.co;2
CrossRef Full Text | Google Scholar
Perkins, W. A., and Hakim, G. (2020). Linear inverse modeling for coupled atmosphere-ocean ensemble climate prediction. J. Adv. Model. Earth Syst. 12, e2019MS001778. doi:10.1029/2019MS001778
CrossRef Full Text | Google Scholar
Risken, H. (1989). “The Fokker-Planck equation,” in Methods of solution and applications, vol. 18 (Springer-Verlag, Berlin).
Google Scholar
Rüschendorf, L. (2023). “Stochastic processes and financial mathematics,” in Mathematics study resources. Springer Berlin Heidelberg.
Google Scholar
Särkkä, S., and Solin, A. (2019). “Applied stochastic differential equations,” in Institute of mathematical statistics textbooks. Cambridge University Press.
Google Scholar
Shin, S.-I., Sardeshmukh, P. D., Newman, M., Penland, C., and Alexander, M. A. (2021). Impact of annual cycle on ENSO variability and predictability. J. Clim. 34, 171–193. doi:10.1175/JCLI-D-20-0291.1
CrossRef Full Text | Google Scholar
Stuecker, M. F. (2023). The climate variability trio: stochastic fluctuations, El Niño, and the seasonal cycle. Geosci. Lett. 10, 51. doi:10.1186/s40562-023-00305-7
CrossRef Full Text | Google Scholar
Stuecker, M. F., Jin, F.-F., Timmermann, A., and McGregor, S. (2015). Combination mode dynamics of the anomalous northwest pacific anticyclone. J. Clim. 28, 1093–1111. doi:10.1175/JCLI-D-14-00225.1
CrossRef Full Text | Google Scholar
Stuecker, M. F., Timmermann, A., Jin, F.-F., McGregor, S., and Ren, H.-L. (2013). A combination mode of the annual cycle and the El Niño/Southern Oscillation. Nat. Geosci. 6, 540–544. doi:10.1038/ngeo1826
CrossRef Full Text | Google Scholar
Vimont, D. J., Newman, M., Battisti, D. S., and Shin, S.-I. (2022). The role of seasonality and the ENSO mode in central and east pacific ENSO growth and evolution. J. Clim. 35, 3195–3209. doi:10.1175/JCLI-D-21-0599.1
CrossRef Full Text | Google Scholar
Webster, P. J., and Yang, S. (1992). Monsoon and ENSO: selectively interactive systems. Q. J. R. Meteorological Soc. 118, 877–926. doi:10.1002/qj.49711850705
CrossRef Full Text | Google Scholar
Xue, Y., Leetmaa, A., and Ji, M. (2000). ENSO prediction with markov models: the impact of sea level. J. Clim. 13, 849–871. doi:10.1175/1520-0442(2000)013<0849:epwmmt>2.0.co;2
CrossRef Full Text | Google Scholar
Zhao, S., Jin, F.-F., Stuecker, M. F., Thompson, P. R., Kug, J.-S., McPhaden, M. J., et al. (2024). Explainable El Niño predictability from climate mode interactions. Nature 630, 891–898. doi:10.1038/s41586-024-07534-6
PubMed Abstract | CrossRef Full Text | Google Scholar
Zuo, H., Balmaseda, M. A., Tietsche, S., Mogensen, K., and Mayer, M. (2019). The ECMWF operational ensemble reanalysis–analysis system for ocean and sea ice: a description of the system and assessment. Ocean Sci. 15, 779–808. doi:10.5194/os-15-779-2019
CrossRef Full Text | Google Scholar