ORIGINAL RESEARCH article
Improving the Estimation of Markov Transition Probabilities Using Mechanistic-Empirical Models
- 1International Research Institute of Disaster Science, Tohoku University, Sendai, Japan
- 2POMPLUS Consulting Ltd., Hanoi, Vietnam
- 3Institute of Construction and Infrastructure Management, ETH Zurich, Zurich, Switzerland
- 4Department of Civil Engineering, Osaka University, Osaka, Japan
In many current state-of-the-art bridge management systems, Markov models are used for both the prediction of deterioration and the determination of optimal intervention strategies. Although transition probabilities of Markov models are generally estimated using inspection data, it is not uncommon that there are situations where there are inadequate data available to estimate the transition probabilities. In this article, a methodology is proposed to estimate the transition probabilities from mechanistic-empirical models for reinforced concrete elements. The proposed methodology includes the estimation of the transition probabilities analytically when possible and when not through the use of Bayesian statistics, which requires the formulation of a likelihood function and the use of Markov Chain Monte Carlo simulations. In an example, the difference between the average condition predicted over a 100-year time period with a Markov model developed using the proposed methodology and the condition predicted using mechanistic-empirical models were found to be 54% of that when the state-of-the-art methodology, i.e., a methodology that estimates the transition probabilities using best fit curves based on yearly condition distributions, was used. The variation in accuracy of the Markov model as a function of the number of deterioration paths generated using the mechanistic-empirical models is also shown.
Many state-of-the-art infrastructure management systems are making use of the Markov model for the prediction of deterioration and the determination of the optimal intervention strategies. When using the Markov model, elements of infrastructures are considered to be in discrete states (deterioration condition states) defined using physical characteristics, and the deterioration of elements over time is described as probable transitions between these states over time.
In estimating transition probabilities, there are two basic situations, (1) there are sufficient time-series data, i.e., when sufficient data are available for a minimum of two consecutive time intervals and (2) there are no sufficient time-series data, i.e., when there are no sufficient data available for a minimum of two consecutive time intervals. Needless to say, infrastructure managers should make decisions of interventions for all elements by reliable transition probabilities when the management system uses Markov models. In the first situation, infrastructure managers use statistical methods to estimate the transition probabilities of the Markov models. Many of the models were developed with the assumption that elements could jump no more than one state per time period, such as those used in Klein (1962), Carnahan et al. (1987), Jiang et al. (1988), Madanat and Ben-Akiva (1994), and Thompson et al. (1998). However, others were developed without this assumption, such as those developed by Tsuda et al. (2006) and Kobayashi et al. (2012a,b). The former have the advantage that they are relatively easy to compute. In the former, however, as the deterioration being predicted is relatively rapid with respect to the time periods selected, error becomes non-negligible. For example, if it is assumed that the transition probability from state i to i + 2 in 5 years is 0, the model cannot express rapid deterioration processes. The latter allow to avoid this assumption and have the advantage that resulting transition probabilities lead to more accurate prediction of deterioration. In addition, they can be used when data have been collected at non-uniform intervals.
Despite progressive development in the field of monitoring over the last decades so that data can be collected more frequently and more accurately, the second situation likely occurs in municipalities with relatively small scale and it can be readily imagined that not a few administrative organizations are facing this situation. They are especially pertinent in developing nations, in which there is too little attention paid to managing infrastructures, including a vast number of concrete bridges. Situations such as (1) time-series data cannot be used to estimate the transition probabilities because historical intervention records between inspections are unavailable, (2) time series data are not recorded as electronic data, and (3) time-series data in current criteria of states are not available because criteria of states had been changed recently, come under this situation as well. In the situation where there are no sufficient time-series data, infrastructure managers either rely solely on expert opinion or use expert opinion in conjunction with predictions made using mechanistic-empirical models.
In mechanistic-empirical models, deterioration processes are modeled as physical processes that can be described using the characteristics and properties of the materials of the elements and their chemical and physical response/reaction against factors of the environment. Using mechanistic-empirical models, once the values of endogenous and exogenous factors governing the deterioration process are determined, the condition evolution of the bridge element over time can be estimated without the need for inspection data for a minimum of two consecutive time intervals. Unlike Markov models, condition evolution over time is modeled as being continuous (DuraCrete, 1998; Kirkpatricka et al., 2002). When mechanistic-empirical models are used together with expert opinion, it is usually assumed that an element cannot transition more than one state in one time period.
An improvement to the estimation of transition probabilities to be used in infrastructure management systems, and, therefore, an improvement to infrastructure management, would be to have a methodology to be used to estimate transition probabilities using mechanistic-empirical models for use in the second situation. In addition, it is desired that elements are modeled so that they can transition more than one state in one time period. Such a methodology is proposed in this article. The proposed methodology makes use of two approaches: (1) the analytical approach and (2) the Bayesian approach. The first is used in situations where it is possible to find an analytical solution so that the transition probability can be derived directly from the mechanistic-empirical models. The second is used in situations where it is not possible to find the analytical solution. It makes use of Bayesian statistics, which requires the formulation of a likelihood function of the transition probabilities and the use of Markov Chain Monte Carlo (MCMC) simulation.
The remainder of the article is structured as follows: Sections “Finite State Markov Models and Transition Probabilities” and “Estimating Transition Probabilities Using Mechanistic-Empirical Models” include a background to position the contribution of the proposed method. Specifically Section “Finite State Markov Models and Transition Probabilities” contains an overview of finite state Markov models and transition probabilities, and Section “Estimating Transition Probabilities Using Mechanistic-Empirical Models” contains a literature review regarding methods to estimate transition probabilities using mechanistic-empirical models. Section “Mechanistic-Empirical Models” includes the mechanistic-empirical models to be used in the work presented in this article. In the Section “Relationship between Mechanistic-Empirical Models and Transition Probabilities,” it is explained how transition probabilities are to be estimated analytically using mechanistic-empirical models. In Section “Methodology,” the methodology is described, in which, the steps to formulate the mechanistic-empirical model of deterioration process is given along with a way to convert the condition evolution predicted using this model to the discrete states required by the Markov model. An example of how the method is to be used is given in Section “Example.” In Section “Comparison with the State-of-the-Art,” the Markov models developed using the proposed methodology and a state-of-the-art methodology are compared. In the Section “The Number of Deterioration Paths from the Mechanistic-Empirical Models,” the effect of the number of deterioration paths required to obtain a satisfactory result is shown and discussed. Section “Conclusion” contains the conclusions of the work and recommendations for future research.
Finite State Markov Models and Transition Probabilities
In finite state Markov models, transition of condition states between time point t1 and t2 = t1 + z is expressed with a transition probability matrix whose i* j (i = 1, …, I;j = 1, …, I) element is a transition probability defined as Prob[h(t2) = j|h(t1) = i] = πij. h(t) is a function which denotes a condition state at t. πij is a conditional probability which indicates the occurrence probability of condition state j at t2 with given condition state i observed at t1. In finite state Markov models, it is assumed that the transition probability between time points t1 and t2 is only dependent on the condition state at t1 so as to satisfy the Markov property. Finite state Markov models have been used in the management of deteriorating systems since the 1960s, when there was a rapid development of mechanical and electrical systems (Howard, 1960; Gertsbakh, 2000; Kolowrocki, 2014). They have been used to ensure that to determine optimal intervention strategies, i.e., the strategy to follow to ensure that the costs of executing interventions are balanced with the costs of not executing interventions. Finite state Markov models allowed for the modeling of deterioration processes that could not be perfectly modeled deterministically, to be modeled as stochastic processes. They were, and are, considered to be good models for systems where the transitions from one state to another can be considered to be memoryless. They are less good where this is not the case but are still often used due to both their ease of use and ease of understanding in situations where the assumptions of memoryless do not lead to large deviations in condition evolution prediction from reality. To ensure that a Markov model is developed to give accurate predictions of deterioration using statistical methods, condition state data are required. In general, the longer the time series of inspections, the better.
Finite state Markov models are used in infrastructure management systems to model the deterioration of elements and to determine optimal intervention strategies (AASHTO, 2004; Swei et al., 2015). They are used instead of continuous Markov models due to their convenience, in terms of using visual inspections (it is easier to say that an element is in state 2 than to say if it is in 2.1, 2.2, or 2.3) and in terms of assessing the state of the element that is to trigger an intervention (Howard, 1960; White, 1992; Puterman, 1994). The estimation of the transition probabilities for the Markov models is ideally done using available condition state data that have been collected at uniform time intervals over a long period of time (Lee, 1970). Although somewhat more complicated, when data have been collected at non-uniform intervals of time over a long period of time, the transition probabilities can be estimated using statistical methods, such as survival analysis, maximum likelihood estimation, and Bayesian estimation approaches (Hastings, 1970; Lancaster, 1990; Kobayashi et al., 2012a; Mizutani et al., 2013; Lethanh et al., 2015). When little to no condition state data are available, transition probabilities have been estimated using expert opinion or estimated in various ways to obtain a best fit with the condition states predicted using mechanistic-empirical models (Golroo and Tighe, 2012; Indiana Department of Transportation, 2013).
To simplify the estimation of transition probabilities some researchers and developers of management systems have assumed that it is not possible for an element to move more than one condition state in one time interval, e.g., Jiang et al. (1988), Mishalani and Madanat (2002), and Robelin and Madanat (2007). However, others have explicitly developed models where that is not the case (Tsuda et al., 2006). Being able to estimate transition probabilities with data that have been collected at both uniform and non-uniform time intervals and allowing elements to move more than one condition state per time interval, it has been shown to increase the accuracy of the estimation of the transition probabilities in Markov models (Tsuda et al., 2006; Kobayashi et al., 2012b; Mizutani et al., 2013; Lethanh et al., 2015).
Estimating Transition Probabilities Using Mechanistic-Empirical Models
In situations where there are little to no time-series condition state data estimating transition probabilities so that the results of a Markov model fit those of mechanistic-empirical models, using mechanistic-empirical models is likely to yield more accurate deterioration predictions.
However, fitting can be done in different ways. No research has been conducted using an approach to find an analytical solution so that the transition probability can be derived directly from the mechanistic-empirical models. When this approach is unavailable due to relatively complicated functional forms of the mechanistic-empirical models, an approach is available to estimate the transition probabilities using the predictions of condition of the elements with the Markov model and with the mechanistic-empirical models. Roelfstra et al. (2004), which is perhaps the first work in this area, used a restricted least squares approach to minimize the difference between the predictions of the average condition of the elements with the Markov model and the predictions of condition of the elements with the mechanistic-empirical models. They assumed that there was a maximum transition of one state in one time interval. Overcoming this assumption, Lethanh et al. (2017) used a restricted least squares approach to minimize the difference between the prediction of the probabilities of the elements being in each condition state in each time interval within the investigated time period estimated using the Markov model and those predicted using the mechanistic-empirical models. In other words, they minimized the sum of the differences between the elements of the state vectors, which are estimated using the Markov model and the mechanistic-empirical models, respectively. In this process, however, information of the transitions of the states in each element, i.e., information of the transitions from state i to j of an element, is lost by aggregating this information into the values of the state vectors. To prevent this loss of information, in the proposed methodology, a likelihood function of the transitions of the states in each element is formulated, and the transition probabilities are estimated based on the likelihood function. The methodology used by Lethanh et al. (2017) is used as a reference methodology in this article.
Many mechanistic-empirical models can be used to predict the future condition state of reinforced concrete elements. In this work, a mechanistic-empirical model was selected to predict condition states of the element during the initiation phase of chloride-induced corrosion, and another was selected to predict condition states of the element during the propagation phase. The models used were based on those given in DuraCrete (2000). An illustration of the two phases, along with the ranges of chloride concentrations (kg/m3) and crack widths (mm) used to define the condition states, is given in Figure 1.
Figure 1. Deterioration process of reinforced concrete due to chloride-induced corrosion [adopted from Lethanh et al. (2017)].
Chloride penetration was modeled using Fick’s second law of diffusion (Fick, 1855).
where Ccl is the chloride ion concentration at the depth of the reinforcement (here small notation “cl” denotes the abbreviation for chloride); x is the depth; and Dcl is the chloride diffusion coefficient.
The solution for partial differential equation (Eq. 1) gives the following explicit form to calculate the chloride concentration as a function of the distance of the reinforcement from the concrete surface xcl and time t (DuraCrete, 2000).
where Cs is the surface chloride content and erf[⋅] denotes the error function.
The time required for corrosion to start, that is time t in Eq. 2 was estimated by setting the value of Ccl to be equal to the chloride concentration. Then the value of Ccl is used to determine the entry point into a condition state. In other words, by setting the upper bounds on the values of Ccl to be used to define each discrete condition state i, the time to arrive at that condition state was obtained by solving Eq. 2 with respect to time t and a certain depth of concrete cover from the reinforcement. The value of variables Ccl, Cs, and Dcl in the above equations were considered to be random, with each one being represented with a probabilistic distribution.
After the value of chloride concentration reached a certain lower limit, corrosion of the reinforcement was assumed to start. After it reached a certain higher limit, cracking was assumed to start. The following equations from DuraCrete (2000) were used:
where w(t) is the crack width (mm) over time; β is the parameter that controls the propagation; w0 is the crack width when it is visible (≈0.05 mm); P0 is the amount of loss of re-bar diameter (mm) when the crack width is visible; and P(t) is the amount of loss of re-bar diameter (mm) at time t, which is given by the following equation:
where Vcorr is corrosion rate coefficient (mm/year); wet is the wet period in a year (equal to the ratio between total numbers of rainy day and 365 days); and α is pitting factor that takes non-uniform corrosion of the re-bars into consideration.
Relationship Between Mechanistic-Empirical Models and Transition Probabilities
The relationship between the mechanistic-empirical models and the transition probabilities is explained in this section. The two phases of deterioration as illustrated in Figure 1 are expressed as a function of a set of random variable X, where X represents a vector of parameters such as Dcl, Vcorr, α, and wet. When the mechanistic-empirical model, which is defined as the general notation of deterioration function y = g(t, x), includes a single random variable x with its probability density distribution f (x) and y = g(t, x) is a monotonic increasing function, the relationship between the mechanistic-empirical model and the transition probabilities can be derived. Here, t is elapsed time. y is an indicator of deterioration, and the value of y becomes larger as the reinforced concrete element deteriorates. The inverse function of y is denoted as x = m(y, t).
As x is a random variable, the value of i is also a random variable. The occurrence probability of observing condition state i is given by the following equation:
where e(t) is the probability density function of elapsed time t and (yi−1, yi] denotes the pre-defined range of y for condition state i.
The probability of observing condition state j at any subsequent time t + Δt is defined as follows:
where and are the lower bound and the upper bound of y at t, respectively, that function y = g(t, x) passes both ranges (yi−1, yi] at t and (yj−1, yj] at t + Δt, which are given by the following equations:
Since Markov transition probabilities are constant in the finite state Markov model, it is assumed that t is selected randomly. Consequently, the continuous uniform distribution U(0, tz) is available as the probability density function of elapsed time e(t), and tz is enough large number. The Markov transition probability πij(Δt) is given by the following equation:
where π(i) is the marginal distribution of i, which is given by the following equation:
and π(i, j, Δt) is the marginal distribution of (i, j), which is given by the following equation:
The methodology proposed to estimate the transition probabilities is shown in Figure 2. The first two tasks are required to select the mechanistic-empirical models. Once the models are selected, the values of their parameters are to be defined, as are the condition states to be used to map the continuous values determined using the mechanistic-empirical models to the discrete states used in the Markov model. When transition probabilities can be calculated directly from the mechanistic-empirical models, transition probabilities are determined analytically. When this is not possible, the transition probabilities are to be estimated using the Bayesian approach shown in Figure 2 as the sub-process. The use of the Bayesian approach is explained in more detail in the rest of this section.
Generate Sample Deterioration Paths
In this step, a set of values for the vector X are generated using its parametric inferences (e.g., mean and SD of a probability distribution). The generated set of values of X is then used, together with the mechanistic-empirical models to generate a sample path of deterioration for the concrete element. N paths are generated, and Xn (n = 1, …, N) denotes X used to generate path n. Path n is considered to be independent from the other paths 1, …, n − 1, n + 1, …, N. These paths are then discretized into Z time intervals as condition states at time points on a discrete time axis tz+1 = tz + a (z = 1, …, Z − 1). a is the time interval of the discrete time axis. The condition state of the element associated with path n at time tz is denoted as qn(tz). The sample paths are generated using the following rules:
where is a dummy variable. The total number of sample paths is then expressed as the vector .
At time tz+1, the condition value associated with sample path n, is as follows:
As the same Xn is used to generate path n, the condition states of the element, for time points tz and tz+1, satisfy the following constraint:
Estimate the Transition Probabilities
In this step, Bayesian estimation is used to estimate the transition probabilities. This includes the three sub-steps of
○ specify the initial values of unknown parameters, Π(0), and the prior probability distribution function p(Π) through the specification of its parameters a1, …, aI−1 based on the prior information;
○ define the likelihood function L(Θ, Π) using the obtained data Θ;
○ determine the posterior probability density function p(Π|Θ) as the product of the prior probability density function and the likelihood function in accordance with the Bayes’ theorem.
Here, Π denotes the unknown parameter vector. It is assumed that Π is a random variable and is subjected to the prior probability density function p(Π). Under these conditions and according to Bayes’ theorem (Bayes and Price, 1763), when the observed data Θ are given, the posterior probability density function p(Π|Θ) of the unknown parameters Π is defined as follows:
where Ξ represents the parameter space. At this time, p(Π|Θ) can be expressed as follows:
where the symbol “ ∝” denotes “be proportional to.”
Specify the Prior Probability Distribution
The specification of the prior probability density function requires that the random variables of the prior probability density function have same domains as the unknown parameters. The distribution to be used has to satisfy the condition of the transition probabilities, that is
The Dirichlet distribution is a good example. The function of the distribution is given by the following equation:
where B(⋅) is a beta function; πi = (πi,1, …, πi,I) holds; and αi = (αi,1, …, αi,I) is a parameter vector of the Dirichlet distribution.
Define the Likelihood Function
The definition of the likelihood function requires determining the number of sample paths and number of condition states to be used. In the likelihood function, the transition probabilities from condition states i to j are the model parameters and the data used to estimate the transition probabilities are the sample paths generated using the mechanistic-empirical models. The likelihood function, therefore, consists of the probabilities of having specific values of the transition probabilities, as follows:
Determine the Posterior Probability Density Function
The posterior probability density function p(Π|Θ) is determined through the definition of the unknown parameters. It has the following form:
The unknown parameters are determined Π(c) using MCMC simulation with Gibbs sampling (Geman and Geman, 1984) and the random walk Metropolis Hastings (MH) algorithm (Hastings, 1970), and recorded Π(c). MCMC simulation has been used successfully in this way to estimate the posterior distributions in the past in situations where the multidimensional integration of the objective function of a model was not possible (Robert, 1996).
For each MCMC simulation, the random walk MH algorithm estimates the transition probabilities in each loop c(c = 1, …, C) by comparing a candidate set of transition probabilities with a sample . is the transition probability that is compared with the candidate of transition probabilities. is defined as and is i × j element of . The candidate set of transition probabilities must satisfy the constraint of the Markov model, . Here, represents the state that is most likely to occur. For example, when i = 1, I = 4, and j = 2, and the transition probabilities are (πi1, …, πi4) = (0.1, 0.1, 0.7, 0.1), becomes 3. When is generated so that the sum of and is same as the sum of and , is satisfied. At this time, has to be within the interval [0, U], and holds. To do this, is generated using a truncated normal distribution with mean , on [0, U] and variance μjump, . When is generated, is fixed uniquely as . Thus, the occurrence probability of corresponds to the joint occurrence probability of the set , , as follows:
where is the expected value, μjump is the variance, and is . ϕ(⋅) is a probability density function of a standard normal distribution, and Φ(⋅) is a cumulative distribution function of a standard normal distribution. The probability that a candidate set is accepted is expressed as follows:
where is the parameter set which consists of except and . Equation 23 means that a candidate is accepted with probability 1 if it has a better fit with the paths, and a candidate is accepted with the probability formulated as a ratio of the product of the posterior probability density function and the occurrence probability of the candidate if it does not have a better fit with paths. Using this probability, parameters following the posterior distribution are sampled numerically avoiding falling into local optima.
is sampled as follows:
where is a uniform random number drawn from the uniform distribution whose domain is [0,1]. In Gibbs sampling, can be sampled from the other elements of . In the iterative procedure, the elements of are defined successively in order from j = i to j = I in each i, and is fixed as if j = I.
Evaluate the Results
A Geweke test statistic (Geweke, 1992) is used to evaluate convergence during sampling. The Geweke test statistic indicates difference between the first 10% and the last 50% of sampled transition probabilities. is burn-in. A statistical hypothesis test is conducted using the Geweke test statistic. When the statistics are less than 1.96 (significance level 5%), it is judged that sampling has converged and are samples from the posterior distribution.
The expected transition probabilities, which are used as the estimated values, are then given by the following equation:
The 100(1 − 2κ)% Bayesian credible interval of each parameter, , is calculated by as follows:
Here, the symbol #(⋅) indicates the number of c that satisfies the logical expression in parentheses. indicates that the probability that the expected transition probability lies in is 1 − 2κ.
The length of time required to transition to condition state i − 1 (i = 2, …, I) and i in the deterioration path of s%, , is calculated using following equation:
where and is jth element of state vector at tz calculated by . is defined so that the probability that length of time required to transition to condition state i − 1 (i = 2, …, I) and i is less than is s%. The first term on the right hand side of Eq. 28 is the expected length of time to transition from the initial time point to condition state i. The second is the condition value obtained from the mechanistic-empirical model. The ratio of the difference between and s/100 is added to the difference between and to express the length of time more precisely. For example, when a = 1, , , and s = 30(%), 1 × (30/100 − 0.27)/(0.33 − 0.27) = 0.5 is added to to evaluate the length of time between and . For example, when and , the sum of the first and second term on the right hand side of Eq. 28 is calculated as 20.5.
The estimated transition probabilities are evaluated using the Bayesian credible intervals of expected length of time required to transition between condition states. The Bayesian credible interval of the expected length of time to make transitions between condition states 1 and I is . The 100(1 − 2κ)% Bayesian credible interval of the expected duration is defined as , i.e., the probability that the expected duration lies in is 1 − 2κ, treating the expected length of time as a random variable. The magnitude of the difference can be used to determine how good the estimated values are. If they are good enough, then the estimated transition probabilities are considered correct. If not, then more sample paths are generated using the mechanistic-empirical models and the process is started over again. This process is illustrated in the sub-process of Figure 2.
and are defined as sample order statistics as follows:
Here, the symbol #(⋅) indicates the number of c which satisfies the logical expression in parentheses. As shown in Figure 2, using a threshold value R that is established in advance and depends on the desired amount of accuracy. If r is smaller than R, the algorithm is controlled to be stopped. On the other hand, if r is greater than R, the algorithm returns to the sample generation phase and redefine N.
The methodology was tested by using it to estimate the transition probabilities for a reinforced concrete element. The mechanistic-empirical models used for the initiation phase and the propagation phase were given by the following equations:
In these equations, x1,1, x2,1, and x2,2 were considered as random variables. As information about distributions of the parameters Cs, xcl, Dcl, Vcorr, α, wet, w0, β, and P0 in Eqs 2–4 was not available, they are summarized as Cs = Cs, d = xcl, x1,1 = Dcl, x2,1 = w0 − βP0, and x2,2 = β*Vcorr*α*wet in Eqs 32 and 33, and it is assumed that x1,1, x2,1, and x2,2 are distributed based on normal distributions. The values of these variables used are shown in Table 1.
The condition values obtained from Eqs 32 and 33 were mapped to condition state as shown in Table 2.
Estimation Based on Relationship between Mechanistic-Empirical Models and Transition Probabilities
When the mechanistic-empirical models in Eqs 32 and 33 are used, the transition probabilities cannot be estimated analytically based on their relationship shown in Section “Relationship between Mechanistic-Empirical Models and Transition Probabilities.” The transition probabilities π11 and π12, however, can be estimated with numerical integration (e.g., Monte Carlo simulation) based on the relationship between the initiation model and the transition probabilities because the initiation model includes a single random variable x1,1. Here, as an example, π11(1) is estimated. It can be expressed, using probability density functions of a truncated normal distribution and a continuous uniform distribution, as follows:
where H(⋅; hmean, hsd, hl, hu) is the cumulative distribution function of the truncated normal distribution with mean hmean, SD hsd, and support [hl, hu]. In addition, π(1, 1, 1) can be formulated as follows:
As π(1, 1, 1) = 0.0754 and π(1) = 0.0858 by computation using Monte Carlo simulation to solve integration in Eqs 35 and 36, π11(1) is 0.879. Estimation of the transition probabilities with the proposed Bayesian approach can be regarded as solving the integrations by generating sample deterioration paths and statistically estimating the transition probabilities instead of using numerical integration. As estimated result can be evaluated using posterior distributions and Bayesian credible intervals, the proposed Bayesian approach is superior to the numerical integration when the integration cannot be solved analytically.
Generate Sample Deterioration Paths
To estimate the other transition probabilities, a set of sample paths (N = 10,000) was generated for 1-year time intervals over a period of 100 years, i.e., a = 1, Z = 100.
Estimate Transition Probabilities
The estimated transition probabilities are shown in Table 3. Comparing π11(1) between the Bayesian approach and the approach based on relationship between mechanistic-empirical models and transition probabilities, it can be found that π11(1) = 0.879 derived in 7.2 fits well to the result with the proposed Bayesian approach.
To calculate Bayesian credible intervals of the expected length of the transition between condition states, the transition probabilities obtained in every step of the MCMC simulation were used. C and were set to 11,000 and 1,000, respectively. In sampling , the expected duration was calculated from the transition probability matrix Π(c) and Eq. 28.
The suitability of the number of generated paths N were evaluated using
○ a distribution of the length of time to transition from condition state 1–5 with s = 50 (%), T(c),
○ a 90% Bayesian credible interval, (with , ), and
○ R = 1 (year).
The Bayesian credible intervals and Geweke test statistics are shown in Table 3. As r ≤ R and all of the Geweke test statistics are less than 1.96 (significance level 5%), it is concluded that 10,000 deterioration paths were sufficient to estimate the transition probabilities.
By using the estimated values of the Markov transition probabilities, deterioration processes can be expressed stochastically as transition of the condition state vector over time. Figure 3A shows the transition of values of the condition vector calculated based on the transition probabilities estimated with the proposed methodology. The results shown in Figure 3A enable the evaluation of the condition at arbitrary time points, as well as the deterioration paths. Considering the state vectors estimated using the Markov model, probability s that the reinforced concrete element in each condition state can be used as a risk control level. Figure 4 shows deterioration paths with different risk control levels. These can be used to determine the points in time when inspections should be performed or interventions should be executed.
Figure 3. Transition of state vector over time: (A) calculated using the transition probabilities estimated by the proposed methodology; (B) calculated using the transition probabilities estimated by the state-of-the-art methodology; and (C) calculated using the mechanistic-empirical models.
Comparison with the State-of-the-Art
The comparison between the proposed methodology and the state-of-the-art methodology (Lethanh et al., 2017) was done by estimating the transition probabilities using both methodologies from the mechanistic-empirical models shown in Eqs 32 and 33 and Table 1, and measuring the differences (or residuals) between the average condition values predicted by using the Markov models and the condition values predicted using the mechanistic-empirical models. Figure 3B shows the state vectors calculated using transition probabilities estimated by the state-of-the-art methodology shown in Table 4. The condition state distributions for each time interval calculated by the mechanistic-empirical models are shown in Figure 3C, i.e., the condition state distributions in Figure 3C are calculated as ratios of condition states at each time point in the generated sample deterioration paths in 7.3. The values of the log likelihood with the transition probabilities estimated using the proposed methodology and the state-of-the-art methodology were −1.0924E+05 and −1.2697E+05, respectively. The proposed methodology can maximize the likelihood function and fit transition probabilities to generated sample paths better than the state-of-the-art methodology. To discuss the results in detail, the differences of sums of squared residuals between the Markov models and the condition values predicted using the mechanistic-empirical models are given in Table 5. This table shows, for example, that the sum of squared residuals in the period between 11 and 20 years was 4.20E−01. It can be seen that the differences between 11 and 60 years using the models developed with the proposed methodology are smaller than the differences using the models developed with the state-of-the-art methodology. This tendency can be seen in the expected deterioration paths shown in Figure 5 as the path of the proposed methodology is closer to that of data than the state-of-the-art methodology after 10 years elapsed.
Table 5. Sum of the differences between the condition values predicted using the Markov models and those predicted using the mechanistic-empirical models.
From Table 5, it can be also seen that the sums of differences over 100 years were 1.88 condition states and 3.46 condition states using the Markov models developed using the proposed and the state-of-the-art methodologies, respectively. From these results, it can be inferred that the Markov model developed using the state-of-the-art methodology overestimates life expectancy and underestimates the speed of deterioration because the result of the state-of-the-art methodology was influenced excessively by the data between 61 and 100 years to estimate transition probabilities π3j, π4j, and π5j.
The Number of Deterioration Paths from the Mechanistic-Empirical Models
In the proposed methodology, the accuracy of the Markov models depends on the number of deterioration paths N used in the estimation of the transition probabilities. This relationship is shown in Figure 6 and Table 6 for all transition probabilities for values of N ranging between 1,000 and 10,000 at 1,000 step intervals. In all cases, as above, the following values were used a = 1 (year), , and Z = 100. The posterior distributions calculated using sampled parameter π11(1) are shown in Figure 7 for N = 1,000 and N = 10,000 as an example. From Figures 6 and 7 and Table 6, it can be seen that the parameter dispersion decreases from 1.16E−02 to 3.39E−03 as N increases from 1,000 to 10,000. As shown in Table 6, all of the other parameters πij have the same magnitude correlation to the case of π11 as N increases from 1,000 to 10,000. The credible interval of π15 with N = 4,000 is not shown in Table 6 because all sampled π15 were 0.
It can be seen in the previous table and in Figure 8, which shows the 90% Bayesian credible intervals of life expectancies with N from 1,000 to 10,000 at 1,000 step intervals that the Bayesian credible intervals vary as a function of the number of deterioration paths used. With a threshold of R = 1 (year), where , it seen that N is to be at least 3,000 to obtain a satisfactory result. Figure 9 indicates that the required N increases when R decreases. From Figure 9, it can be seen that N has to be greater than 5,000 or more when R = 0.8 (years), and 8,000 or more when R = 0.6 (years), respectively.
In this article, a methodology is presented to estimate the transition probabilities to be used in Markov models from mechanistic-empirical models. The methodology can be used when there is little to no available time-series inspection information, but mechanistic-empirical models are available. The proposed methodology includes the use of analytical solutions where possible and the use of Bayesian statistics where analytical solutions are not possible. When the Bayesian approach is used, the accuracy of the transition probabilities is to be estimated as a function of the number of deterioration paths calculated.
The methodology is demonstrated by using it to estimate the transition probabilities to be used in a Markov model for reinforced concrete bridge elements deteriorating due to chloride-induced corrosion of the reinforcement. It was shown in this example that the proposed methodology was an improvement on the state-of-the-art methodology, as the sum of residuals over 100 years with the proposed methodology was 100*1.88/3.46 = 54% of that of the state-of-the-art methodology, i.e., the proposed methodology gave a 46% point decrease. In addition, the later would have over-estimated the speed of deterioration.
In the work presented in this article, it was assumed that stationary transition probabilities are to be used. When Markov models with stationary transition probabilities are used, certain limitations are imposed which restrict the modeling of the deterioration processes. This is considered to be okay if the sum of residuals is close to 0. If, however, it is found that they do not, the use of non-stationary transition probabilities might provide better results. The use of non-stationary transition probabilities could improve the modeling of deterioration in these circumstances. They could, for example, be used to capture sudden decreases in the states elements. Future work should be focused on the use of non-stationary transition probabilities in bridge management systems. Some preliminary work is that done by Wellalage (2015).
In addition, the proposed methodology has potential of use for many other materials in addition to reinforced concrete, in situations where condition data are not available but mechanistic-empirical models are available. Exactly how this should be done is, for each type of material, a topic for future work.
DM contributed to theoretical formulation, program coding, computation, and preparation of the manuscript. BA, NL, and KK contributed to preparation of the manuscript.
Conflict of Interest Statement
NL is employed by POMPLUS Consulting Ltd. All other authors declare no competing interests.
Bayes, T., and Price, R. (1763). An essay towards solving a problem in the doctrine of chances. By the Late Rev. Mr. Bayes, F. R. S. Communicated by Mr. Price, in a Letter to John Canton, A. M. F. R. S. Philos. Trans. R. Soc. Lond. 53, 370–418. doi: 10.1098/rstl.1763.0044
Carnahan, J. V., Davis, W. J., Shahin, M. Y., Keane, P. L., and Wu, M. I. (1987). Optimal maintenance decisions for pavement management. J. Transp. Eng. 113, 554–572. doi:10.1061/(ASCE)0733-947X(1987)113:5(554)
Kirkpatricka, T. J., Weyers, R. E., Anderson-Cook, C. M., and Sprinkel, M. M. (2002). Probabilistic model for the chloride-induced corrosion service life of bridge decks. Cem. Concr. Res. 32, 1943–1960. doi:10.1016/S0008-8846(02)00905-5
Kobayashi, K., Kaito, K., and Lethanh, N. (2012a). A Bayesian estimation method to improve deterioration prediction for infrastructure system with Markov chain model. Int. J. Archit. Eng. Constr. 1, 1–13. doi:10.7492/IJAEC.2012.001
Kobayashi, K., Kaito, K., and Lethanh, N. (2012b). A statistical deterioration forecasting method using hidden Markov model for infrastructure management. Transp. Res. B Methodol. 46, 544–561. doi:10.1016/j.trb.2011.11.008
Lethanh, N., Hackl, J., and Adey, B. T. (2017). Determination of Markov transition probabilities to be used in bridge management from mechanistic-empirical models. J. Bridge Eng. 22, 4017063. doi:10.1061/(ASCE)BE.1943-5592.0001101
Lethanh, N., Kaito, K., and Kobayashi, K. (2015). Infrastructure deterioration prediction with a Poisson hidden Markov model on time series data. J. Infrastruct. Syst. 21, 1–10. doi:10.1061/(ASCE)IS.1943-555X.0000242
Mishalani, R. G., and Madanat, S. M. (2002). Computation of infrastructure transition probabilities using stochastic duration models. J. Infrastruct. Syst. 8, 139–148. doi:10.1061/(ASCE)1076-0342(2002)8:4(139)
Mizutani, D., Matsuoka, K., and Kaito, K. (2013). Statistical deterioration prediction model considering the heterogeneity in deterioration rates by hierarchical Bayesian estimation. Struct. Eng. Int. 23, 394–401. doi:10.2749/101686613X13627351081515
Robelin, C.-A., and Madanat, S. M. (2007). History-dependent bridge deck maintenance and replacement optimization with Markov decision processes. J. Infrastruct. Syst. 13, 195–201. doi:10.1061/(ASCE)1076-0342(2007)13:3(195)
Robert, C. P. (1996). “Mixtures of distributions: inference and estimation,” in Markov Chain Monte Carlo in Practice, eds W. R. Gillks, S. Richardson, and D. J. Spiegelhalter (London: Chapman and Hall), 442–464.
Roelfstra, G., Hajdin, R., Adey, B. T., and Brühwiler, E. (2004). Condition evolution in bridge management systems and corrosion-induced deterioration. J. Bridge Eng. 9, 268–277. doi:10.1061/(ASCE)1084-0702(2004)9:3(268)
Tsuda, Y., Kaito, K., Aoki, K., and Kobayashi, K. (2006). Estimating Markovian transition probabilities for bridge deterioration forecasting. Struct. Eng. Earthquake Eng. 23, 241s–256s. doi:10.2208/jsceseee.23.241s
Keywords: mechanistic-empirical corrosion models, Markov chain models, reinforced concrete bridges, Bayesian statistics, bridge management
Citation: Mizutani D, Lethanh N, Adey BT and Kaito K (2017) Improving the Estimation of Markov Transition Probabilities Using Mechanistic-Empirical Models. Front. Built Environ. 3:58. doi: 10.3389/fbuil.2017.00058
Received: 13 April 2017; Accepted: 19 September 2017;
Published: 05 October 2017
Edited by:Bruno Briseghella, Fuzhou University, China
Reviewed by:Jesus Miguel Bairan, Universitat Politecnica de Catalunya, Spain
Jian Li, University of Kansas, United States
Copyright: © 2017 Mizutani, Lethanh, Adey and Kaito. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Daijiro Mizutani, firstname.lastname@example.org