Skip to main content

Predicting the natural history of metabolic syndrome with a Markov-system dynamic model: a novel approach



Markov system dynamic (MSD) model has rarely been used in medical studies. The aim of this study was to evaluate the performance of MSD model in prediction of metabolic syndrome (MetS) natural history.


Data gathered by Tehran Lipid & Glucose Study (TLGS) over a 16-year period from a cohort of 12,882 people was used to conduct the analyses. First, transition probabilities (TPs) between 12 components of MetS by Markov as well as control and failure rates of relevant interventions were calculated. Then, the risk of developing each component by 2036 was predicted once by a Markov model and then by a MSD model. Finally, the two models were validated and compared to assess their performance and advantages by using mean differences, mean SE of matrices, fit of the graphs, and Kolmogorov-Smirnov two-sample test as well as R2 index as model fitting index.


Both Markov and MSD models were shown to be adequate for prediction of MetS trends. But the MSD model predictions were closer to the real trends when comparing the output graphs. The MSD model was also, comparatively speaking, more successful in the assessment of mean differences (less overestimation) and SE of the general matrix. Moreover, the Kolmogorov-Smirnov two-sample showed that the MSD model produced equal distributions of real and predicted samples (p = 0.808 for MSD model and p = 0.023 for Markov model). Finally, R2 for the MSD model was higher than Markov model (73% for the Markov model and 85% for the MSD model).


The MSD model showed a more realistic natural history than the Markov model which highlights the importance of paying attention to this method in therapeutic and preventive procedures.

Peer Review reports


The study of natural history of chronic diseases is doubly complex due to their complex nature and multifactorial causality [1,2,3]. Because of this complexity, there are few detailed descriptions about chronic diseases natural history [4]. The aim of a study on natural history is to clarify the factors that affect the overall risk of transition from one stage to another as a diseases progresses (or regresses) [5]. Among the existing studies, some have looked at the natural history of diseases and their pathophysiology from a systems biology and complex and dynamic systems perspective [6]. Other studies, on the other hand, have illustrated the natural histories with complex statistical methods [7,8,9,10]. The most common methods for investigation of dynamic and complex situations and their progression are simulation-based statistical methods. Among these, Markov models, which pay special attention to random changes in processes (stochastic processes), are more important. Markov and system dynamics models clearly belong to two different scientific fields, but similar to the system dynamics models, the Markov models provide a powerful framework for analyzing dynamic systems [11,12,13]. However, Markov model requires a lot of computational capacity when a system becomes complex due to increase in the number of states and transitions. The MSD model is a hybrid model that combines Markov and system dynamics approaches to overcome the limitations of Markov models in modeling complex systems. Indeed, despite the difference between the Markov and the system dynamic models in terms of the stochastic and deterministic nature of the states, due to the important similarity of Markov model with system dynamic model in terms of “state” and “transition”, these two models can be combined with each other or even in some cases converted to each other [14]. This hybrid model have been mainly used in non-medical fields and repairable systems for reliable analysis in a more realistic way [12, 15,16,17]. In fact, in a MSD model the failure and repair or control rate of a system, which are time-varying indexes, are considered for transient availability modeling or analysis of system reliability in calculations [15, 18]. Due to information feedback theory, easiness of tweaking parameters to test different hypotheses and possibility of providing solutions to various problems by adjusting flow rates [16], system dynamics models are a suitable solution to eliminate computational limitations of Markov models.

Metabolic Syndrome (MetS) is a global public health challenge with a plethora of increasing research around its epidemiology and physiological mechanisms [19,20,21,22,23,24]. However, there are few studies on its natural history which have provided contradictory findings [7,8,9,10, 25,26,27]. MetS is a very complicated disorder and one can have a different combination of the syndrome components at any given time, depending on their lifestyle. To be precise, one can be in one of the states of “no component”, “isolated hypertension”, “isolated overweight/obesity”, “isolated hyperglycemia”, “isolated dyslipidemia”, “obesity + hypertension”, “obesity + dyslipidemia”, “obesity + hyperglycemia”, “hypertension + dyslipidemia”, “hypertension + hyperglycemia”, “dyslipidemia + hyperglycemia”, and a set of combinations of three or four components [28, 29]. The MetS is quite dynamic and one can transit from one state to another. This dynamicity of the disorder development in individuals makes it a proper candidate for an MSD approach. Therefore, this study was conducted to evaluate the performance of a MSD model in a context of investigating MetS natural history in a large population-based study.


Study type and participants

This retrospective study was undertaken on 4 waves of Tehran Lipid and Glucose Study (TLGS), ranging from year 1999 to year 2016 [30,31,32]. Data collection and measurement procedures, sampling processes, eligibility criteria for participants, and definitions of MetS criteria in TLGS are published elsewhere [33, 34].

The aim of the study was to evaluate the performance of a MSD model in investigation of MetS natural history (transition between 4 components of MetS, i.e. abdominal obesity, hypertension, hyperglycemia, high triglycerides with low HDL (dyslipidemia) and their combinations (12 states)). To be precise, the investigation encompassed calculating and predicting transition probabilities (TPs) between the mentioned 12 states over a period of 21 years (2015–2036) through a compartmental MSD model. The findings then were compared with those of a Markov model to see which model worked better.

Markov model

At the beginning of this section, a 12-state Markov model was designed and used to describe the natural history of MetS (Fig. 1). A Markov process is a random model for describing a sequence of probable events in which the probability of each event depends only on the present time, not preceding event. In other words, if the status of a process is known at times x1, x2…xn, then it can be said that only the latest information (that is the state of the process at the xn time), is sufficient to predict the future progression of the process (Xn + 1). Accordingly, the Markov dependency that is introduces as Markov properties (Memoryless) is assumed as follow:

$$P\left({X}_{n+1}={x}_{n+1}|{X}_1={x}_1,{X}_2={x}_2,\dots, {X}_n={x}_n\right)=P\left({X}_{n+1}={x}_{n+1}|{X}_n={x}_n\right)$$
Fig. 1
figure 1

A 12-State dynamic transition diagram for MetS natural history

Of course, it should be noted that the Markov dependency to the current state, can also be of a different order than the first, e.g., of the second-order [35]. So, in many practical situations the first-order dependency is sufficient but not always justifiable. However, this is a fundamental assumption in the Markov model which is mainly considered. Markov process can be fully described by its TP function or pij (t) which is the probability that a system is in (j) state at time (t), provided that the process starts from time (t = 0) and state (i). Hence when (i) = x (n-1) [36], then one can write a Markov process as follow:

$${P}_{ij}\ \left({X}_{n+1}= sj|{X}_n= si\right)=P\left({X}_n= sj|{X}_{n-1}= si\right)$$

That, (s) is the total number of states that a system can occupy at any given time. In our model, the time criteria for calculating the TPs in each phase across all 12 states was triennial and the final matrix of TPs was equal to the average of total values of TPs in all periods [9]. Also, the final state (i.e. MetS), was considered as an absorbing state. An absorbing state is a state in which no transition to any subsequent state will take place [37].

Based on the number of states in our model, a 12 × 12 transition matrix was used to calculate the TPs.

$$\mathrm{P}=\left[\begin{array}{ccc}{P}_{11}& \cdots & {P}_{112}\\ {}\vdots & \ddots & \vdots \\ {}{P}_{121}& \cdots & {P}_{1212}\end{array}\right]$$

MSD model design

In order to design a MSD model, control rate (CR) and failure rate (FR) indices were first calculated (section A in Additional file 1). FR and CR indices are used to evaluate the reliability of system models. They are also used to evaluate the effects of various interventions in a system model (SD). The interventions in our study were medicinal (i.e. self-reported consumption of different medications to control blood pressure, blood lipids, and blood sugar levels) and lifestyle-based interventions (i.e. TLGS phase II to reduce risk factors for non-communicable diseases in some participants [38]). A SD model performs the risk prediction process by using these two indices in a pre-fabricated model that is the product of actions-reactions between the states in a Markov model. FR (CR) indicates any progress (regress) from less (more) components towards more (less) components across the natural history of MetS.

In the CR calculation, both for lifestyle interventions and medicinal therapies, patients who were on the MetS state were not included in the calculation. Also, since no medicinal intervention was needed for healthy individuals, people with no-components state were not included in the CR calculation for medicinal interventions. Mean values of CR and FR were considered as the final values.

After calculating the CR and FR indices, based on our Markov diagram (Fig.1), causal loop and stock and flow diagrams were drawn for formulation of the SD model separately for no component, 1-component, 2-component and MetS (Figs. 2, 3 and 4). In fact, at this stage, in order to perform simulations, qualitative models (causal loop diagram) were transformed into quantitative models (stock and flow diagram).

Fig. 2
figure 2

Causal loop and stock-flow diagrams for the no-component state

Fig. 3
figure 3

Causal loop and stock-flow diagrams for 1 and 2-component states

Fig. 4
figure 4

Causal loop and stock-flow diagrams for MetS state

In these SD diagrams, the dynamic processes of transitions from each component to the preceding component (i.e. recovery or transition probability backward (TPO-B)), to the next component (i.e. disease progression or transition probability forward (TPO-F)), and lack of transition (stoppage) are shown as in-transition probabilities (TPIs) under the influences of CR and FR. B sign in the diagrams is indicative of a balancing cycle and R sign refers to a reinforcement cycle. To all of these transitions, which are in fact longitudinal (vertical) transitions, the types of lateral (horizontal) transitions must be added. Lateral transitions (Width TP = TPW) are the conversion (replacing) of each of the 1-components to other 1-components and also each of the 2-components to the other 2-components, which include a total of 21 transitions (6 transitions in 1-component and 15 in 2-component). Therefore, lateral transitions were defined in the form of TPW index, which is not affected by CR and FR. The final MSD model, which has become a quite complicated model of MetS and its components, is shown in Additional file 1 (refer to Fig. 1 in Section C of the Additional file 1).

Statistical analysis

Predictions with Markov model

In a Markov model, Pij(n) is the probability of transition from state (i) to state (j) in nth step. To calculate the matrix pn (the transition matrix of step (n)), one must multiply the matrix (P) n times by itself, which the (Pij) element in the matrix pn will be the same of Pij(n) [39]. For Risk prediction, the time horizon, based on the minimum average time of transition of individuals directly from no component to the MetS (which was approximately 2 years) in the follow-up periods, for 7 periods (3-year) (2015–2036) calculated and presented (refer to Section B of the Additional file 1). Given that transition probabilities were non-homogeneous or time dependent in our model, instead of transition probability, transition rate was used in which transitions are calculated as per unit time or instantaneous, which is equivalent to rate. Hence, the predictive rate was used in the Y axis as proportion of individuals who developed the MetS from various states over time period per total person-time.

Markov model validation

To validate Markov model performance, values of parameters in the fourth wave of the TLGS were predicted using the data from three preceding waves. The predicted values then were compared with the actual data using measures of mean differences and mean standard error of matrices. As a visual evaluation, a graph of the proportionality of the predicted data values with the empirical data in terms of trends was drawn in general.

Verification and prediction with the final MSD model

In order to validate the MSD model, the same validation steps in the case of the Markov model were repeated. Finally, for risk prediction by continuing the existing conditions such as Markov model, Time horizon, for 7 periods (3-year) (2015–2036) was calculated and presented. Since the SD model is actually a differential equations system whose its order depends on the number of variables, in the risk prediction section, to calculate the value (N) of each state, the following differential equation was designed. As an example, the equation is designed for the no component state and applies to other states as well.

$$\mathrm{No}\ \mathrm{component}\ \left(\mathrm{nc}\right)\ \mathrm{equation}=\frac{d\ (nc)}{d\ (t)}=\left({\upalpha}_{\mathrm{inflow}\ \left(\mathrm{t}0\right)}\times {\mathrm{nc}}_{\left(\mathrm{t}0\right)}\right)-\left({\upalpha}_{\mathrm{outflow}\ \left(\mathrm{t}\mathrm{n}\right)}\times {\mathrm{nc}}_{\left(\mathrm{t}\mathrm{n}\right)}\right)$$

On the other hand, based on the above differential equation, the integral equation based on the 3-year interval was written as follow:

$$nc={\int}_{\mathrm{t}0}^{\mathrm{t}3}\left[\left(\mathrm{CR}\times \mathrm{TP}\times {\sum}_{\mathrm{N}}\mathrm{n}\mathrm{c}\left(\mathrm{t}0\right)\right)+\left(\sum \mathrm{CR}\times \mathrm{TP}\times {\sum}_{\mathrm{N}}\mathrm{otherstonc}\left(\mathrm{t}\mathrm{n}\right)\right)-\right(\mathrm{FR}\times \mathrm{TP}\times {\sum}_{\mathrm{N}}\mathrm{n}\mathrm{c}\mathrm{toothers}\left(\mathrm{t}\mathrm{n}\right)\Big]$$

Finally, the integral equation for calculating the values of TPO-F and TPO-B, which are longitudinal transitions (from no component to MetS), and TPI, which are considered as insider transitions and are described in the previous sections, are as follows:

$$TPO-F=\int \left[{N}_{\left( origin\ state\right)}\times {TP}_{\left( origin\ \mathrm{state}\ \mathrm{to}\ \mathrm{next}\ \mathrm{state}\right)}\times {FR}_{\left( origin\ \mathrm{state}\right)}\right]$$
$$TPO-B\ and\ TPI=\int \left[{N}_{\left( origin\ state\right)}\times {TP}_{\left( origin\ \mathrm{state}\ \mathrm{to}\ \mathrm{next}\ \mathrm{state}\right)}\times {CR}_{\left( origin\ \mathrm{state}\right)}\right]$$

The integral equation of lateral transitions was also written as follows:

$$TPW=\int \left[{N}_{\left( origin\ state\right)}\times {TP}_{\left( origin\ \mathrm{state}\ \mathrm{to}\ \mathrm{next}\ \mathrm{state}\right)}\right]$$

Evaluation of models’ performance

A triple approach was used to compare the performance of Markov and MSD models. At first, mean standard error of matrices, mean of the differences, and fit of the graphs were used to compare the two models’ outputs. Then, using Kolmogorov-Smirnov two-sample test, closeness of predicted and empirical (goodness of fit) samples distributions in both Markov and MSD models was compared. Finally, in the third approach, the value of the R2 index was calculated based on a simple linear regression model between the actual and predicted values. As a result, the model with a smaller mean SE, a smaller mean difference, a more appropriate graph, and also a higher goodness of fit as well as higher R2 was selected as the desired one. Also, for quantification of uncertainty in predicted model’s performance assessment in both models, standard errors for estimated transitions (predictive rates) as a measure of the accuracy of the resulting estimates that provide ability to objectively assess the quality of the reported estimates, was calculated. To estimation of the standard error associated with each transition, our approach was to use a bootstrap method [40] with 1000 iterations and combine results.

Additional analysis

Mean and percentage were used for descriptive analysis of baseline and follow-up waves of the TLGS. Also, Cochran’s Q test was used to examine the significance of revealed trends in data. 0.05 was set as the significant value. IBM SPSS Statistics software for Windows version 24 (IBM Corp, Armonk, NY), excel 2016, and R-4.0.3 (“msm [41]“and “markovchain [42]“packages) were used for data analyses. The maximum likelihood method was used for parameter estimation in methods that have been implemented within “markovchain” and “msm” packages.

Ethical considerations

As this study was conducted on the TLGS data, it is ethically subject to the ethical considerations observed in the TLGS project. The study was also ethically approved by National Committee of Ethics in Iranian Biomedical Research (code# IR.SUMS.REC.1398.835).


Demographic variables description

56.16% (7235) of participants in TLGS sample (12,882) were female. At baseline, the mean of participants’ age was 31.34 ± 17.3 years (median age = 29 years).

States description

Table 1 shows the status and trend of changes in 12 states of MetS during study periods. In general, the highest prevalence in baseline belonged to isolated dyslipidemia. In terms of difference between baseline and final stage values, states of “no component”, “isolated dyslipidemia”, “obesity + dyslipidemia”, “hypertension + dyslipidemia”, and “dyslipidemia + hyperglycemia” all had decreasing trends and the highest decrease was related to “hypertension + dyslipidemia”. Other combinatorial states had increasing trends and the highest increase belonged to “obesity + hyperglycemia” state.

Table 1 Longitudinal change of MetS states among participants over the study period

TPs values

The overall TPs matrix is given in Table 2. Over the study period (4 follow-up periods), probability of direct and non-stop transition from “no component” to MetS was 8.6%. The highest transition probability from “no component” to other states belonged to “isolated abdominal obesity”. Among isolated components, the highest TP towards MetS was related to hyperglycemia and hypertension, respectively. Among the composite components, the highest TP towards MetS belonged to obesity & hypertension which was the highest value of TP toward MetS among all components and their combinations (41.1%), which was generally introduced as the main initiator. The diagonal row in the matrix (Table 2) indicates the probability of remaining in same state, no transition, over time. In overall, people with MetS had the highest probability (60.2%) of no transition over time.

Table 2 Matrix of transition probabilities (%)

Markov model

Markov predictions show that as the time continues over the years, the probability of transition towards MetS for all isolated states would first experience an upward trend until the sixth year, and then all the states would have a same probability of transition. Among the isolated states, people in “no component” state will experience the highest increase in the upward trend towards MetS and those with hyperglycemia will experience the least increase, before reaching to a constant probability (steady state) (Fig. 5). In other words, the highest rate of MetS seems to occur among people with no component state. At the same time, the highest rate of progression toward the MetS was related to hyperglycemia and the lowest was associated with no component.

Fig. 5
figure 5

Risk of progression towards MetS for isolated components in Markov model

In the case of composite states, the predictions showed that except for “obesity + hypertension” and “obesity & hyperglycemia” which would have a decreasing or a constant trend of transition towards MetS, other states would all first have an increasing trend for 6 years and then would flat-out. Before reaching to the steady probability, the highest progression toward the MetS was associated with obesity & hyperglycemia, and the highest progression was associated with dyslipidemia & hyperglycemia (Fig. 6). In other words, the highest rate of MetS among all the composite states, until reaching to the steady level, seems to occur in people with “dyslipidemia + hyperglycemia”.

Fig. 6
figure 6

Risk of progression towards MetS for composite components in Markov model

Validation of the Markov model

In general, the mean of differences was 0.0562 and the mean SE of the predicted matrix from the actual matrix for the fourth period of TLGS was 0.003684. Also, the trend analysis showed that the fit between values in empirical and predicted data was favorable. Moreover, in terms of closeness of values, with an overestimation of about 5.62%, the estimated values were relatively desirable (Figs. 2 and 3 in section C Additional file 1). Generally, the evaluation was suggestive of relative adequacy of the Markov model in risk prediction.

MSD model

The overall CR and FR indices are presented in Table 3 (detailed tables can be found in Additional file 1 section D). The SD model was built to examine the progression of each component towards the MetS (separately for isolated components and composite components). For this purpose, the CR and FR values along with transition probabilities were entered into the final MSD model (Fig. 1 in Additional file 1 section C) and a risk prediction process was simulated.

Table 3 Control and failure rates in metabolic syndrome interventions

According to the MSD modeling outputs, among the isolated components, the highest progression rate towards MetS was related to hyperglycemia and obesity, respectively. The trends of other components had also a small upward slope and the lowest rate of progression belonged to dyslipidemia (Fig. 7). In the case of composite components, the rate of “obesity + hyperglycemia” progression towards MetS was higher than others composites (Fig. 8). But, in overall, progression slope of composites was greater than that of isolated components (except for obesity and hyperglycemia). The lowest progression slope was related to “hypertension + dyslipidemia”.

Fig. 7
figure 7

Risk of progression towards MetS for isolated components in MSD model

Fig. 8
figure 8

Risk of progression towards MetS for composite components in MSD model

Validation of the MSD model for risk prediction

The mean difference between the predicted values and the empirical values was 0.04911 and the mean SE of the predicted matrix from the real matrix in the fourth period of TLGS was 0.002056. Also, the trend analysis showed that the fit between values in empirical and predicted data was desirable. Moreover, in terms of proximity of values, with an overestimation of about 4.9%, the estimated values were desirable (Figs. 4 and 5 in section C of Additional file 1). Overall, the evaluation indicated that the MSD model performance, in terms of risk prediction, was satisfactory.

Evaluation of models’ performance

According to the evaluation outputs, both the Markov and MSD models were shown to be desirable models for risk prediction. But, according to greater proximity of predictions made by the MSD model to the real (empirical) conditions (i.e. fitter graphs of values proportionality, lower mean difference (less overestimation), lower SE of the general matrix, and also significance of the MSD model test (p = 0.808 for MSD model and p = 0.023 for Markov model) which is indicative of equal distribution of real and predicted samples in the MSD model) and Finally, a higher R2 for the MSD model (73% for the Markov model and 85% for the MSD model), the MSD model was shown to be a more desirable model for predictions. Also, uncertainty quantifications given in section E in Additional file 1.


In this study, a MSD model was designed to model the natural history of MetS, i.e. progression from its components. The model then was compared with a Markov model to evaluate their performance. The findings showed that both the Markov and MSD models were adequate enough to predict the secular trends of the MetS. But based on the greater proximity of the predictions made by the MSD model to the real data gathered in TLGS, the MSD model was introduced as the desirable model.

The MSD model has a systemic approach and adopts a comprehensive and integrated view to the processes that lead up to MetS. For instance, a MSD model enriches one’s understanding of the natural history of MetS by integration of the effectiveness of MetS-driven therapeutic and life-style interventions (i.e. control and failure rates) into the model. It also enriches the understanding by being open and inclusive to dynamicity of MetS components and the fact that one can shuffle back and forth between simple and complex components of the MetS over time. Therefore, the authors thought that a MSD model is a good match to sheer complexity of the MetS and might outperform other models, e.g. Markov model, in terms of risk predictions.

The risk prediction by Markov model in our study showed that all states/components first showed an upward trend towards MetS until the ninth year. Then, all the trends levels off at a same risk value. This pattern of trends (only the trends and not the time until leveling-off) was seen in other studies [7,8,9,10]. However, in the risk prediction process with the MSD model, which, there was no similar evidence, assuming that the existing conditions continued, the progression of all states toward the MetS (with differences between various states in different conditions) was upward which was completely different from the process observed in Markov modeling, both in our study and in other studies [7,8,9,10]. To be specific, the predictions made by the MSD model is a component-specific prediction that is not comparable with the general trends reported in other studies. In fact, in other studies, the general trend of MetS is drawn and described, while in our study, the trend of each components is described as part of the natural history of the disease. In other word, the trends shown by Markov model mainly refers to progression of the disease as a whole and lifetime, but the MSD model reveals the progression and dynamicity of each component in the natural history towards MetS. To clarify it more, it seems that since the Markov model does not systemically consider the natural history of MetS, interactions between components over time, non-linear knock-on effects of changes in each component on other components, and influence of external factors (e.g. interventions) on the natural history of the MetS are not considered in the modelling. As a result, rather than seeing the natural history of progress of components and states as a whole, the natural history of each component or state is examined and predicted separately. In this case, the real contribution of each component or state in the occurrence of MetS and its trend is probably not seen in full, and the rate of progress and change in progress are not accurately calculated. Importantly, in this study, our aim was to model the development trend of MetS components as different compounds as the natural history of MetS, rather than the development of MetS over time as a whole, that usually can be seen in other studies. Clinically and mechanically, as seen in this study, the highest rate of progression has been from no component to isolated components and from isolated components to composite components and finally to MetS, which illustrate a cumulative and ascending process over time.

In addition to Markov model, there are a few other prospective risk prediction models that have been used to predict the process of MetS development. For example, one study used a biomarker-base model [43] and the risk of MetS based on age and gender was predicted for 5 to 10 years into future. In another study, Framingham Risk Score (FRS) model was used [44] and an upward trend and an irregular trend was predicted for high and low risk people, respectively. Retrospective studies have also been another way to investigate the developmental process of MetS. For instance, it was shown that overall prevalence of MetS over a 15-year period had an upward trend, but incidence of MetS from different components had an irregular and relatively upward trend among children & adolescents [33] and adults [45]. The differences between our findings and those of others in this section are probably related to different study methodologies.

Accordingly, lack of similar studies in terms of methodology was a challenge for our study. In fact, although we showed that MSD outperformed Markov model (and probably other models) in revealing the developmental process of MetS, but unless it is widely used in various medical fields, a clear-cut judgment on functional advantages and strength of MSD model would be avoided. This is where we cordially invite researchers to work on in future.


The natural history of many chronic diseases, e.g. MetS, develops through multistate and dynamic paths. This chronicity, state-multiplicity, and dynamicity therefore calls for systemic approaches in order to understand and control these health problems. In this study, a MSD model showed to outperform a commonly used Markov model in revealing the developmental process of MetS over time. Our findings, therefore, invite researchers to adopt MSD models in investigation of chronic and complex health problems and test its practicalities.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.


  1. Vaillant GE. Twelve-year follow-up of New York narcotic addicts. N Engl J Med. 1966;275(23):1282–8.

    PubMed  CAS  Google Scholar 

  2. Moore JE. The natural history of chronic illness. J Chronic Dis. 1955;1(3):335–7.

    PubMed  CAS  Google Scholar 

  3. Bynum B. A history of chronic diseases. Lancet. 2015;385(9963):105–6.

    Google Scholar 

  4. The Natural History of Disease. N Engl J Med. 1949;240(11):442–3.

  5. Jewell NP. Natural history of diseases: statistical designs and issues. Clin Pharmacol Ther. 2016;100(4):353–61.

    PubMed  Google Scholar 

  6. Rozendaal YJW, Wang Y, Hilbers PAJ, van Riel NAW. Computational modelling of energy balance in individuals with metabolic syndrome. BMC Syst Biol. 2019;13(1):24.

    PubMed  PubMed Central  Google Scholar 

  7. Chen X, Chen Q, Chen L, Zhang P, Xiao J, Wang S. Description and prediction of the development of metabolic syndrome in Dongying City: a longitudinal analysis using the Markov model. BMC Public Health. 2014;14:1033.

    PubMed  PubMed Central  Google Scholar 

  8. Hwang L-C, Bai C-H, You S-L, Sun C-A, Chen C-J. Description and prediction of the development of metabolic syndrome: a longitudinal analysis using a markov model approach. PLoS One. 2013;8(6):e67436–e.

    PubMed  PubMed Central  CAS  Google Scholar 

  9. Tang X, Liu Q. Prediction of the development of metabolic syndrome by the Markov model based on a longitudinal study in Dalian City. BMC Public Health. 2018;18(1):707.

    PubMed  PubMed Central  Google Scholar 

  10. Jia X, Chen Q, Wu P, Liu M, Chen X, Xiao J, et al. Dynamic development of metabolic syndrome and its risk prediction in Chinese population: a longitudinal study using Markov model. Diabetol Metab Syndr. 2018;10(1):24.

    PubMed  PubMed Central  Google Scholar 

  11. Howard RA. Dynamic Probabilistic Systems. Enabled: Dover Publications; Illustrated edition (May 4, 2012); 2012.

  12. Rao MS, Naikan V. A hybrid Markov system dynamics approach for availability analysis of degraded systems; 2011.

    Google Scholar 

  13. Kirkwood JR. Markov processes 1, editor: CRC press; 2015.

    Google Scholar 

  14. Pratap KJ, Mohapatra RKR. System dynamics model for markov processes england system dynamics'91; 1991. p. 10–29.

    Google Scholar 

  15. Rao M, Naikan V. A novel Markov system dynamics framework for reliability analysis of systems. Econ Qual Contr. 2009;24:101–16.

    Google Scholar 

  16. Naikan MSRVNA. A system dynamics model for transient availability modeling of repairable redundant systems. Int J Performability Eng. 2015;11(3):203–11.

    Google Scholar 

  17. Schütte C, Sarich M. A critical appraisal of Markov state models. Eur Phys J Spec Top. 2015;224(12):2445–62.

    Google Scholar 

  18. Rao MS, Naikan VNA. A managerial tool for reliability analysis using a novel Markov system dynamics (MSD) approach. Int J Manage Sci Engin Manage. 2009;4(3):230–40.

    Google Scholar 

  19. Saklayen MG. The global epidemic of the metabolic syndrome. Curr Hypertens Rep. 2018;20(2):12.

    PubMed  PubMed Central  Google Scholar 

  20. Schiffer TA, Lundberg JO, Weitzberg E, Carlström M. Modulation of mitochondria and NADPH oxidase function by the nitrate-nitrite-NO pathway in metabolic disease with focus on type 2 diabetes. Biochim Biophys Acta Mol basis Dis. 1866;2020(8):165811.

    Google Scholar 

  21. Jepsen S, Suvan J, Deschner J. The association of periodontal diseases with metabolic syndrome and obesity. Periodontol. 2020;83(1):125–53.

    Google Scholar 

  22. Shaikh S, Dahani A, Arain SR, Khan F. Metabolic syndrome in young rheumatoid arthritis patients. J Ayub Med Coll Abbottabad. 2020;32(3):318–22.

    PubMed  Google Scholar 

  23. Carnethon MR, Loria CM, Hill JO, Sidney S, Savage PJ, Liu K. Risk factors for the metabolic syndrome: the coronary artery risk development in young adults (CARDIA) study, 1985-2001. Diabetes Care. 2004;27(11):2707–15.

    PubMed  Google Scholar 

  24. Choi SH, Yun KE, Choi HJ. Relationships between serum total bilirubin levels and metabolic syndrome in Korean adults. Nutr Metab Cardiovasc Dis. 2013;23(1):31–7.

    PubMed  CAS  Google Scholar 

  25. Scuteri A, Morrell CH, Najjar SS, Muller D, Andres R, Ferrucci L, et al. Longitudinal paths to the metabolic syndrome: can the incidence of the metabolic syndrome be predicted? The Baltimore longitudinal study of aging. J Gerontol A Biol Sci Med Sci. 2009;64(5):590–8.

    PubMed  Google Scholar 

  26. Cheung BM, Wat NM, Tam S, Thomas GN, Leung GM, Cheng CH, et al. Components of the metabolic syndrome predictive of its development: a 6-year longitudinal study in Hong Kong Chinese. Clin Endocrinol. 2008;68(5):730–7.

    CAS  Google Scholar 

  27. Tao LX, Wang W, Zhu HP, Huo D, Zhou T, Pan L, et al. Risk profiles for metabolic syndrome and its transition patterns for the elderly in Beijing, 1992-2009. Endocrine. 2014;47(1):161–8.

    PubMed  CAS  Google Scholar 

  28. Vanlancker T, Schaubroeck E, Vyncke K, Cadenas-Sanchez C, Breidenassel C, González-Gross M, et al. Comparison of definitions for the metabolic syndrome in adolescents. The HELENA study. Eur J Pediatr. 2017;176(2):241–52.

    PubMed  Google Scholar 

  29. Beilby J. Definition of metabolic syndrome: report of the National Heart, Lung, and Blood Institute/American Heart Association conference on scientific issues related to definition. Clin Biochem Rev. 2004;25(3):195–8.

    PubMed Central  Google Scholar 

  30. Azizi F. Tehran Lipid and Glucose Study: A National Legacy. Int J Endocrinol Metab. 2018;16(4 Suppl):e84774–e.

    PubMed  PubMed Central  Google Scholar 

  31. Azizi F, Zadeh-Vakili A, Takyar M. Review of Rationale, Design, and Initial Findings: Tehran Lipid and Glucose Study. Int J Endocrinol Metab. 2018;16(4 Suppl):e84777–e.

    PubMed  PubMed Central  Google Scholar 

  32. Azizi F. Tehran lipid and glucose study: a legacy for prospective community-based research. Arch Iran Med. 2014;17(6):392–3.

    PubMed  Google Scholar 

  33. Bagheri P, Khalil D, Seif M, Khedmati Morasae E, Bahramali E, Azizi F, et al. The dynamics of metabolic syndrome development from its isolated components among iranian children and adolescents: findings from 17 years of the Tehran lipid and glucose study (TLGS). Diabetes Metab Syndr. 2020;15(1):99–108.

    PubMed  Google Scholar 

  34. Bagheri P, Khalili D, Seif M, Rezaianzadeh A. Dynamic behavior of metabolic syndrome progression: a comprehensive systematic review on recent discoveries. BMC Endocr Disord. 2021;21(1):54.

    PubMed  PubMed Central  Google Scholar 

  35. Kedem B. Sufficient statistics associated with a two-state second-order Markov chain. Biometrika. 1976;63(1):127–32.

    Google Scholar 

  36. Rao MS, Naikan VNA. A Markov System Dynamics Approach for Repairable Systems Reliability Modeling. Int J Reliab Qual Saf Eng. 2016;23(01):1650004.

    Google Scholar 

  37. Carswell CI. Essentials of Pharmacoeconomics. PharmacoEconomics. 2008;26(12):1065.

    Google Scholar 

  38. Azizi F, Ghanbarian A, Momenan AA, Hadaegh F, Mirmiran P, Hedayati M, et al. Prevention of non-communicable disease in a population in nutrition transition: Tehran lipid and glucose study phase II. Trials. 2009;10:5.

    PubMed  PubMed Central  Google Scholar 

  39. Banimahd SA, Khalili D. Drought Class Transition Analysis by Markov Chains and Log-Linear Models: Approach for Early Drought Warning. Iran J Watershed Manage Sci Engin. 2014;8(24):0.

    Google Scholar 

  40. Zoubir A, Iskander D. Bootstrap methods and applications. Signal Process Magazine IEEE. 2007;24:10–9.

    Google Scholar 

  41. Jackson C. Multi-state modelling with R: the msm package. U.K: MRC Biostatistics Unit Cambridge; 2019.

    Google Scholar 

  42. Spedicato G, Signorelli M, editors. The markovchain Package: A Package for Easily Handling Discrete Markov Chains in R. Computer Science; 2013.

  43. Zhang W, Chen Q, Yuan Z, Liu J, Du Z, Tang F, et al. A routine biomarker-based risk prediction model for metabolic syndrome in urban Han Chinese population. BMC Public Health. 2015;15(1):64.

    PubMed  PubMed Central  Google Scholar 

  44. Yousefzadeh G, Shokoohi M, Najafipour H, Shadkamfarokhi M. Applying the Framingham risk score for prediction of metabolic syndrome: the Kerman coronary artery disease risk study, Iran. ARYA Atheroscler. 2015;11(3):179–85.

    PubMed  PubMed Central  Google Scholar 

  45. Khalili D, Bagheri P, Seif M, Rezaianzadeh A, Khedmati Morasae E, Bahramali E, et al. The dynamics of metabolic syndrome development from its isolated components among Iranian adults: findings from 17 years of the Tehran lipid and glucose study (TLGS). J Diabetes Metab Disord. 2021;20(1):95–105.

    PubMed  CAS  Google Scholar 

Download references


This article is the result of Pezhman Bagheri’s PhD. Thesis in Epidemiology with registration code: SUMS.98/19936. We have to express our sincere thanks to all the personnel of The Shahid Beheshti University of Medical Sciences (SBMU) Research Institute for Endocrine Sciences for respectable cooperation in data collection phase that lead to the outcome of this project.


This study was financially supported by the Vice-Chancellor for Research and Technology of Shiraz University of Medical Sciences (SUMS), which is worthy of thanks and appreciation.

Author information

Authors and Affiliations



P.B. developed the theory and performed the analysis, performed the literature search, assessed the literature, extracted data, wrote the manuscript with support from D.K., A.R. developed the theoretical framework, encouraged and supervised the findings of this work, design and implementation of the research, verified the analytical methods., M.S. developed the theoretical framework, verified the analytical methods., E.KH. developed the theoretical framework, design and implementation of the research, verified the analytical methods. E.B. was a contributor in developed the theory and design, F.A. was a major contributor in licensing access and use of TLGS data. All authors discussed the results and contributed to the final manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Pezhman Bagheri.

Ethics declarations

Ethics approval and consent to participate

As this study was conducted on the TLGS data, it is ethically subject to the ethical considerations observed in the TLGS project. Then, informed consent for study participation was obtained from all subjects in TLGS previously. The study was also ethically approved by National Committee of Ethics in Iranian Biomedical Research (code# IR.SUMS.REC.1398.835). Also, all methods were performed in accordance with the relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Rezaianzadeh, A., Morasae, E.K., Khalili, D. et al. Predicting the natural history of metabolic syndrome with a Markov-system dynamic model: a novel approach. BMC Med Res Methodol 21, 260 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Metabolic syndrome
  • Markov-system dynamics
  • Natural history