- Research article
- Open Access
- Published:

# Examining the BMI-mortality relationship using fractional polynomials

*BMC Medical Research Methodology*
**volume 11**, Article number: 175 (2011)

## Abstract

### Background

Many previous studies estimating the relationship between body mass index (BMI) and mortality impose assumptions regarding the functional form for BMI and result in conflicting findings. This study investigated a flexible data driven modelling approach to determine the nonlinear and asymmetric functional form for BMI used to examine the relationship between mortality and obesity. This approach was then compared against other commonly used regression models.

### Methods

This study used data from the National Health Interview Survey, between 1997 and 2000. Respondents were linked to the National Death Index with mortality follow-up through 2005. We estimated 5-year all-cause mortality for adults over age 18 using the logistic regression model adjusting for BMI, age and smoking status. All analyses were stratified by sex. The multivariable fractional polynomials (MFP) procedure was employed to determine the best fitting functional form for BMI and evaluated against the model that includes linear and quadratic terms for BMI and the model that groups BMI into standard weight status categories using a deviance difference test. Estimated BMI-mortality curves across models were then compared graphically.

### Results

The best fitting adjustment model contained the powers -1 and -2 for BMI. The relationship between 5-year mortality and BMI when estimated using the MFP approach exhibited a J-shaped pattern for women and a U-shaped pattern for men. A deviance difference test showed a statistically significant improvement in model fit compared to other BMI functions. We found important differences between the MFP model and other commonly used models with regard to the shape and nadir of the BMI-mortality curve and mortality estimates.

### Conclusions

The MFP approach provides a robust alternative to categorization or conventional linear-quadratic models for BMI, which limit the number of curve shapes. The approach is potentially useful in estimating the relationship between the full spectrum of BMI values and other health outcomes, or costs.

## Background

Obesity and its impact on health and the healthcare system is one of the most important public health issues Western society faces. Many studies have measured the detrimental effect of obesity on life expectancy by estimating the relationship between mortality and body mass index (BMI) [weight (kg)/height^{2} (m^{2})]. However, the evidence is mixed as to the exact relationship. While some studies have concluded no relation [1], an inverse relation [2] or a direct relation [3], the majority of studies have identified a U-shaped relation [4–10] or a J-shaped relation [11–15]. One reason for these differences is the wide array of datasets used in the analyses. However, the results in these studies are also sensitive to assumptions regarding the estimation sample and functional form for BMI.

The vast majority of studies have employed a non-parametric approach, by treating BMI as a categorical variable. The mortality risk of individuals in different BMI groups is computed relative to a reference BMI category. World Health Organization (WHO) BMI classifications [16] are typically used. For example, numerous studies have computed the excess mortality due to being overweight (BMI = 25-30) and obese (BMI ≥ 30) [17–22]. Categorizing continuous variables has been a popular approach, particularly because of *a priori* knowledge that the relationship between the two measures is nonlinear. However, Royston, et al. (2006) [23] pointed out a number of disadvantages with this approach. The most significant drawback is the loss of information and power through what is equivalent to rounding. For example, studies of excess mortality using WHO BMI classification implicitly assume all individuals that are in the normal category (BMI = 18.5-25) exhibit the same mortality risk. However, the normal category is heterogenous and includes individuals who are healthy and those who are chronically ill. Categorization is particularly problematic if groups are large. In addition, the number of cutoff points and where to place cutoff points is arbitrary. Finally, results are not necessarily robust across the choice of reference category.

A number of studies have used other approaches that maintain BMI as a continuous variable. Estimation of the BMI-mortality curve using a continuous measure is problematic because the relationship is nonlinear and the distribution of BMI is right skewed. Schauer et al. (2010) [24] included linear and squared terms to account for nonlinearities, but truncated their sample to respondents with a BMI of 25 or greater to address the skewness in BMI. Durazo et al. (1998) [6] transformed BMI into a normally distributed variable using Tukey's "ladder of powers" method. Gronniger et al. (2006) [25] treated BMI non-parametrically without categorization.

The purpose of this study was to investigate a flexible approach to modelling the nonlinear and asymmetric relationship between adult mortality and obesity measured using BMI. We implemented the multivariable fractional polynomials (MFP) method [26, 27] and maintained BMI as a continuous variable. Instead of imposing a specific functional form, the MFP method allows the data to determine the best fitting functional form for BMI and other adjustment variables. We hypothesized that this method would provide the ability to capture the relationship between mortality and BMI in a compact, parsimonious model. The MFP performance and results were then compared against other commonly used regression models that estimate the BMI-mortality relationship.

## Methods

The data used in this study were from the NHIS, publicly available through the Centers for Disease Control and Prevention (CDC). The NHIS is a nationally representative cross-sectional household survey covering the non-institutionalized civilian population in the United States (U.S.) and is conducted annually [28]. Households and non-institutional sample units with special living arrangements (e.g. dormitories, boarding houses) were randomly sampled. For each unit sampled, a randomly selected adult and child (if present) were used to collect core health information. Beginning in 1997, individuals aged 65 and above who were black, Hispanic or Asian were oversampled. We combined data from 1997 through 2000. Our sample was linked to the National Death Index (NDI) with mortality follow-up through December 31, 2005 ensuring all respondents were tracked for at least five years after completing the NHIS. Self-reported weight and height measurements without shoes were used to construct BMI. Smoking history was a dichotomous variable indicating whether an individual has ever smoked. Individuals below the age of 18 and under 18.5 BMI were excluded from our analysis. We also excluded 4,599 respondents who have missing BMI measurements or have a BMI of over 99.99, whose observations were truncated, and 229 observations with unknown smoking status. The final sample contained 117,961 respondents.

We investigated the relationship between mortality and obesity through BMI using the logistic regression model, stratified by gender and adjusting for age and smoking history. The logistic model was chosen over the Cox proportional hazards model because the proportional hazards condition did not hold for the BMI fractional polynomial (FP) terms. We used 5-year all-cause mortality as the dependent outcome because of the very low incidence of death annually. To check for robustness, we also estimated all models using 3-year mortality as the outcome, which produced similar results. Analyses were stratified by gender because the biological process by which men and women gain and maintain weight is different [29]. We adjusted for smoking status because it confounded the BMI-mortality relationship, which if ignored may result in overestimation of the BMI associated with minimum mortality [30]. Sample adult weights from the NHIS, which denoted the inverse probability of inclusion into the sample were used within the logistic regression model to correct for potential biases resulting from the NHIS sampling design. Because data were pooled, sampling weights were divided by the number of years to generate a sample that is representative of the U.S. population on average from 1997-2000.

We maintained BMI as a continuous variable in our analysis. To account for the nonlinear and asymmetric relationship between BMI and mortality, we first applied the fractional polynomials [31, 32] method. To allow for flexibility in fitting a curve with a single turning point, we considered second degree polynomial transformations for BMI. We used the closed test procedure [33] which first determined the best fitting second degree polynomial by choosing power transformations from the set {-2, -1, -0.5, 0, 0.5, 1, 2, 3}, where 0 denotes the log transformation. The best fitting second-degree FP was then compared against the null model using a deviance difference test with four degrees of freedom to determine whether BMI should be included in the model. If the first test was statistically significant, a second deviance difference test with three degrees of freedom was applied to compare the best fitting second degree FP against the linear model. If the second test was significant, a final deviance difference test with two degrees of freedom compared the best fitting second degree FP with the best fitting first degree FP. If the final test was significant, the second degree FP was included, otherwise the first degree FP was chosen. To prevent collinearity and model overfit, the best fitting first degree polynomial was chosen for age. The selection of powers for BMI and age was computed simultaneously using the multivariable fractional polynomials (MFP) method [26, 27], which combined backward elimination to select the best fitting model. The regression model we estimated was

where *π*
_{
i
} was the 5-year death probability for individual *i*, *p1* and *p2* were the fractional powers for BMI, and *q1* was the fractional power for age. The MFP method also scaled and centered variables in model selection process to improve numerical stability and to provide a model intercept that was easier to interpret. A nominal p-value of 0.05 was used to test all hypotheses. To evaluate the validity of the FP model for BMI, we graphically compared three models with the main FP model defined by (1). First, we estimated the model which categorized BMI into 30 narrow bins (1 bin for each BMI unit between 18 and 40, for every two BMI units between 40 and 54 and a single bin for BMI above 54) while also adjusting for age and smoking status. We then estimated separate FP models after omitting subjects with early death (< 1 year from baseline) and extreme BMI values (> 50).

Interactions between adjustment variables were tested to address the possibility of differences in the BMI-mortality curve across the age distribution, and by smoking history. The multivariable fractional polynomial interaction (MFPI) algorithm [26] was used to assess interactions, which first determined the best fitting polynomial functions for BMI and age using MFP and then tested for significant interactions between fractionally transformed variables and smoking history using a deviance difference test. We then verified interactions found by the MFPI algorithm graphically using Lowess smoothed curves. The use of FPs when fitting models using BMI as a continuous variable avoided inclusion of spurious interactions in a strictly linear model.

The BMI associated with minimum mortality was calculated by first estimating the final MFP model (including interaction terms) using logistic regression. To derive the optimal BMI, we set the first derivative of the estimated FP model equal to 0 and solved for BMI. As an example, the optimal BMI for the model with linear and quadratic BMI terms without interactions is $-{\widehat{\beta}}_{1}\u2215\left(2{\widehat{\beta}}_{2}\right)$, where ${\widehat{\beta}}_{1}$ and ${\widehat{\beta}}_{2}$ are the logistic regression coefficients for the linear and quadratic BMI terms, respectively. Confidence intervals were based on standard errors computed using the delta method.

We compared the BMI-mortality curves derived using the MFP method with the continuous BMI model containing linear and quadratic BMI terms and the categorical model based on WHO BMI classifications. Models were compared on the basis of model fit, the shape of the BMI-mortality curve, the magnitude and uncertainty in the BMI associated with minimum mortality and mortality estimates. All statistical models were fit using the STATA Statistical Software (Version 11; College Station, TX). The STATA procedure MFP was used to determine the functional forms for age and BMI and the procedure NLCOM was used to calculate estimates and confidence intervals for the BMI associated with minimum mortality.

## Results

Descriptive statistics for the NHIS sample used in this study are presented in Table 1. Our sample included 52,549 men and 65,412 women with a mean age of 45.49 and 47.13 years, respectively. The percentage of respondents who died within five years was 6.37% among men and 5.55% among women. The number of deaths per thousand individuals were similar across annual survey cohorts for both the male and female samples. The majority of the sample was in the normal and overweight BMI ranges. Ever smokers made up 54.93% of the male sample and 40.47% of the female sample.

### Model Fit

The best fitting model for BMI identified by the MFP procedure included the terms ${\overline{BMI}}^{-2}$ and ${\overline{BMI}}^{-2}*\mathsf{\text{ln}}\left(\overline{BMI}\right)$ for the male sample and the terms ${\overline{BMI}}^{-2}$ and ${\overline{BMI}}^{-1}$ for the female sample, where $\overline{BMI}=BMI\u221510$. In both samples, the best fitting model included a squared term for age. The main adjustment model contained smoking status, two BMI and one age polynomial terms. For men, the transformed model significantly improved model fit relative to the untransformed model (Deviance Difference = 231.79, *p-value* < 0.001), the linear-quadratic model (Deviance Difference = 105.62, *p-value* < 0.001) and the categorical model (Deviance Difference = 81.63, *p-value* < 0.001). Similar improvements in model fit were found in the female sample for the FP model relative to the untransformed model (Deviance Difference = 173.04, *p-value* < 0.001), the linear-quadratic model (Deviance Difference = 131.93, *p-value* < 0.001) and the categorical model (Deviance Difference = 82.11, *p-value* < 0.001). The FP model for BMI produced a similar BMI-mortality curve compared to the model categorizing BMI into narrow bins for both genders. Also, the BMI-mortality curves produced by the main FP model and the models omitting early deaths and extreme BMI values were similar in shape (Figure 1).

After finding the best fit for the main model, the age-smoking history (Deviance Difference = 15.88, *p-value* < 0.001) and BMI-age interactions (Deviance Difference = 35.31, *p-value* < 0.001) were both identified as statistically significant in the female sample. The final model was selected using forward selection. After including the age-BMI interactions, the age-smoking history interaction remained significant (Deviance Difference = 15.44, *p-value* < 0.001). Logistic regression results for the FP model, including significant interactions are in Table 2. We did not find statistically significant interactions in the male sample.

Overfit of the model may result in spurious interactions. We assessed interactions identified as significant graphically by performing Lowess smoothing on 5-year of death transformed into logits. Figure 2 shows the differential effect of age across smoking status and BMI, respectively, in the female sample. Differences in the slope of running smoothed lines across the age distribution confirmed the interactions identified as significant.

### BMI-Mortality Curve

Fitted curves showing the relationship between the 5-year probability of death and the associated 95% confidence interval as a function of BMI are in Figure 3. We present curves for male and female non-smokers at ages 40, 50 and 65, respectively. For BMI 18.5 and above, the estimated relationship between BMI and mortality was J-shaped for women, but U-shaped for men. At all ages, the female curve was lower and less steep at both tails of the BMI distribution. Increases in age were associated with increases in mortality, however, fitted curves have the same shape across the age distribution. For both genders, the 5-year probability of death increased exponentially with BMI. Death probabilities increased rapidly starting at BMI = 40. The wide confidence intervals at the right tail of the BMI distribution stemmed from the low proportion of extremely obese (BMI ≥ 40) individuals in the population. Similar curve shapes were found for those who have ever smoked (Figure 4).

The top panel of Figure 5 compares the MFP and linear-quadratic BMI models for male and female never smokers, respectively, at age 50. For both genders, the BMI-mortality curve produced by the linear-quadratic model was J-shaped. The linear-quadratic overestimated mortality at the right tail of the BMI distribution and underestimated mortality in the 31-50 range for men and 30-52 BMI range for women when compared to the MFP model. In the male sample, the MFP model produced higher mortality estimates for subjects at the low end of the normal category.

The bottom panel of Figure 5 compares the MFP and categorical models for age 50 never smokers. The categorical approach matched MFP estimates closely in the overweight category for men. There was a lower degree of correspondence in BMI-mortality curves for all other BMI classifications. The categorical model also underestimated mortality at both tails of the BMI distribution.

### BMI Associated with Minimum Mortality

For the MFP model, the BMI associated with minimum mortality was 26.97 (95% Confidence Interval [CI], 26.41 to 27.54) (Figure 5) in the male sample. Because we did not identify significant interactions with BMI, the optimal BMI was constant for both smokers and non-smokers and across all ages. In contrast, the BMI associated with minimum mortality for women increased with age. At age 50, the optimal BMI was 22.34 (95% CI, 20.10 to 24.57). The optimal BMI ranged from 19.25 (95% CI, 13.31 to 25.18) at age 18 to 26.86 (95% CI, 25.70 to 28.02) at age 85. The BMI associated with minimum mortality was lower for men at all ages and for women age 56 and below when compared to the linear-quadratic model. The optimal BMI in the linear-quadratic model was 31.84 (95% CI, 30.34 to 33.34) for men and 23.04 (95% CI, 18.15 to 27.93) for women. With the categorical model, minimum mortality was associated with the overweight category for both genders.

### Mortality Estimates

Mortality estimates for a 50-year old individual across models are in Table 3. At the optimal BMI, the MFP model produced lower mortality estimates compared to the linear-quadratic model, but higher mortality estimates compared to the categorical model for men. For women, the MFP model produced lower mortality estimates compared to both the linear-quadratic and categorical models. Based on the MFP model, 5-year mortality for a 50-year old never smoker with BMI = 50 was 3.11 times greater than the minimum in the male sample and 3.54 times greater in the female sample. Mortality for an individual with BMI = 50 relative to the minimum was smaller in both the linear-quadratic and categorical models. For male never smokers, adjusted mortality was greater by a factor of 2.89 in the linear-quadratic model and a factor of 2.10 in the categorical model. For female never smokers, adjusted mortality was higher by a factor of 2.39 in the linear-quadratic model and a factor of 2.43 in the categorical model.

## Discussion

This paper outlined and applied a flexible method to modelling the nonlinear and asymmetric relationship between BMI and mortality. Using the MFP approach, we found that the BMI-mortality relation was J-shaped for women and U-shaped for men among individuals with a BMI of 18.5 and over. We also identified the nadir of the BMI-mortality curve to exist in the overweight range for the average U.S. male and the normal range for the average U.S female. However, differences in death probabilities around the nadir were small. The results in this paper with regard to the shape and nadir [2, 10, 25, 34, 35] of the BMI-mortality curve are consistent with prior findings. With regard to the nadir, most studies have found the nadir is in the normal BMI category, however, minimum mortality associated with the overweight category has been found in a number of other studies. For example, using NHANES I data, Durazo et al. (1997) [34] reported the BMI of minimum mortality to be 24.8 for white men and 24.3 for white women. However, for black men and women, the BMI of minimum mortality was 27.1 and 26.8, respectively. The downward slope at the low end of the BMI distribution for men stemmed from the fact that the normal BMI category consists of a mix of healthy lean and chronically ill, which confounded the relationship between mortality and obesity. This result was consistent with other studies [36–38] that report an inverse relation between low BMI individuals and mortality irrespective of the length of follow-up.

The MFP approach provides a robust alternative to other commonly used methods for addressing the nonlinear and asymmetric relationship between BMI and mortality by allowing the data itself to determine the functional form for BMI and other adjustment variables. The closed test algorithm used in this study determined the best fitting model from a predefined set of candidate models based on power transformations of BMI. Improvements in model fit were found relative to other commonly used models including the linear-quadratic BMI model, which imposes symmetry on the BMI-mortality curve. This in conjunction with low variation in mortality among those in the 21-30 BMI range and high mortality among those with BMI over 40 resulted in an overly flat curve in the center. Allowing the estimated curve to have a flexible shape made the MFP model more sensitive to variation in death rates in the data. Improvements in model fit were generally accompanied by smaller estimates of the BMI associated with minimum mortality and narrower confidence intervals. While the differences in optimal BMI were small for the representative 50-year old female, there was a large discrepancy in the male sample with the nadir extending into the class I obese range (BMI 30.0-34.9).

The other common approach to modelling the nonlinear functional form for BMI is a nonparametric approach incorporating categorical variables defined by WHO BMI classifications. Assessing the risk of a BMI category relative to the normal category is a convenient method to account for the nonlinear form, but assumes mortality is uniform across a BMI category, which is problematic when a category is heterogenous. In particular, the normal category consists of a mix of healthy and sick lean. Studies employing the categorical approach also typically take the mortality risk of obese individuals beyond a given threshold as constant. We addressed this difficulty by allowing the data from across the entire BMI distribution to predict mortality risk at extreme obesity levels, where fewer observations exist. Because our results showed that mortality increased exponentially for extremely obese individuals, categorization can drastically underestimate mortality at the right tail of the BMI distribution. The use of wide BMI categories are also inadequate for the purposes of prediction. Categorizing BMI using finer intervals can alleviate some of these difficulties, but the decision of which categories to add is generally an arbitrary choice. Moreover, the additional categories increase the variance of parameter estimates, particularly in high BMI categories where the sample size is small.

Our study also employed methods that differentiate the effect of BMI across gender and age. The identification of distinct FP terms across gender samples point to important differences in the BMI-mortality relationship. Further differences were identified within the female sample through interactions. The finding of significant interactions between age and BMI is not common in literature, but has been identified in a select number of studies [39, 40]. The inclusion of age-BMI interactions resulted in optimal BMI estimates that varied with age and predicted the nadir to exist in the overweight range for women at age 71 and above.

This study has at least two notable limitations. The logistic regression model with 5-year mortality as the dependent outcome was chosen over the Cox survival model because the assumption of proportional hazards failed to hold. However, the disadvantage of the logistic regression model is that full information regarding the individual's exact time of death was not used. Second, while the primary goal of this study was to compare approaches to modelling the BMI-mortality relation using three important covariables, the complete case approach to addressing missing data and the omission of other potentially important explanatory variables may have introduced biases in parameter estimates.

## Conclusions

The MFP method identified improvements in model fit compared to other commonly employed models that estimate the BMI-mortality relationship, and is a robust method to determine the functional form for BMI. Using the MFP method, we found that the shape of the BMI-mortality curve was different across gender, but consistent with other previous studies. Specifically, the relation was U-shaped for men and J-shaped for women. We also identified important differences in shape and nadir of the BMI-mortality curve and mortality estimates compared to other commonly used models. Understanding the relation between obesity and BMI is important from a policy perspective, for addressing issues such as determining the efficacy of approaches designed to reduce obesity and in communicating with the public about the importance of obesity as a public health issue. Flexible methods, such as those employed in this study, are central in achieving reliability in measures used relevant analyses and are also potentially useful in estimating the relationship between the full spectrum of BMI values and other health outcomes or costs.

## References

- 1.
Stevens J, Keil JE, Rust PF, Verdugo RR, Davis CE, Tyroler HA, Gazes PC: Body mass index and body girths as predictors of mortality in black and white men. Am J Epidemiol. 1992, 135 (10): 1137-1146.

- 2.
Diehr P, Bild DE, Harris TB, Duxbury A, Siscovick D, Rossi M: Body mass index and mortality in nonsmoking older adults: the Cardiovascular Health Study. Am J Public Health. 1998, 88 (4): 623-629. 10.2105/AJPH.88.4.623.

- 3.
Chaturvedi N, Fuller JH: Mortality risk by body weight and weight change in people with NIDDM. The WHO Multinational Study of Vascular Disease in Diabetes. Diabetes Care. 1995, 18 (6): 766-774. 10.2337/diacare.18.6.766.

- 4.
Allison DB, Faith MS, Heo M, Kotler DP: Hypothesis concerning the U-shaped relation between body mass index and mortality. Am J Epidemiol. 1997, 146 (4): 339-349.

- 5.
Allison DB, Zhu SK, Plankey M, Faith MS, Heo M: Differential associations of body mass index and adiposity with all-cause mortality among men in the first and second National Health and Nutrition Examination Surveys (NHANES I and NHANES II) follow-up studies. Int J Obes Relat Metab Disord. 2002, 26 (3): 410-416. 10.1038/sj.ijo.0801925.

- 6.
Durazo-Arvizu R, U nA, McGee DL, Cooper RS, Liao Y, Luke A: Mortality and Optimal Body Mass Index in a Sample of the US Population. Am J Epidemiol. 1998, 147 (8): 739-749.

- 7.
Hanson RL, McCance DR, Jacobsson LT, Narayan KM, Nelson RG, Pettitt DJ, Bennett PH, Knowler WC: The U-shaped association between body mass index and mortality: relationship with weight gain in a Native American population. J Clin Epidemiol. 1995, 48 (7): 903-916. 10.1016/0895-4356(94)00217-E.

- 8.
Heitmann BL, Erikson H, Ellsinger BM, Mikkelsen KL, Larsson B: Mortality associated with body fat, fat-free mass and body mass index among 60-year-old swedish men-a 22-year follow-up. The study of men born in 1913. Int J Obes Relat Metab Disord. 2000, 24 (1): 33-37. 10.1038/sj.ijo.0801082.

- 9.
Hozawa A, Okamura T, Oki I, Murakami Y, Kadowaki T, Nakamura K, Miyamatsu N, Hayakawa T, Kita Y, Nakamura Y, et al: Relationship between BMI and all-cause mortality in Japan: NIPPON DATA80. Obesity (Silver Spring). 2008, 16 (7): 1714-1717. 10.1038/oby.2008.237.

- 10.
Orpana HM, Berthelot J-M, Kaplan MS, Feeny DH, McFarland B, Ross NA: BMI and Mortality: Results From a National Longitudinal Study of Canadian Adults. Obesity. 2009, 18 (1): 214-218.

- 11.
Calle EE, Thun MJ, Petrelli JM, Rodriguez C, Heath CW: Body-mass index and mortality in a prospective cohort of U.S. adults. N Engl J Med. 1999, 341 (15): 1097-1105. 10.1056/NEJM199910073411501.

- 12.
Folsom AR, Kaye SA, Sellers TA, Hong CP, Cerhan JR, Potter JD, Prineas RJ: Body fat distribution and 5-year risk of death in older women. JAMA. 1993, 269 (4): 483-487. 10.1001/jama.1993.03500040049035.

- 13.
Hu FB, Willett WC, Li T, Stampfer MJ, Colditz GA, Manson JE: Adiposity as compared with physical activity in predicting mortality among women. N Engl J Med. 2004, 351 (26): 2694-2703. 10.1056/NEJMoa042135.

- 14.
Jee SH, Sull JW, Park J, Lee S-Y, Ohrr H, Guallar E, Samet JM: Body-mass index and mortality in Korean men and women. N Engl J Med. 2006, 355 (8): 779-787. 10.1056/NEJMoa054017.

- 15.
Manson JE, Willett WC, Stampfer MJ, Colditz GA, Hunter DJ, Hankinson SE, Hennekens CH, Speizer FE: Body weight and mortality among women. N Engl J Med. 1995, 333 (11): 677-685. 10.1056/NEJM199509143331101.

- 16.
World Health Organization - BMI Classification. [http://apps.who.int/bmi/index.jsp?introPage=intro_3.html]

- 17.
Allison DB, Fontaine KR, Manson JE, Stevens J, VanItallie TB: Annual Deaths Attributable to Obesity in the United States. JAMA. 1999, 282 (16): 1530-1538. 10.1001/jama.282.16.1530.

- 18.
Bender R, Trautner C, Spraul M, Berger M: Assessment of Excess Mortality in Obesity. Am J Epidemiol. 1998, 147 (1): 42-48.

- 19.
Flegal KM, Graubard BI: Estimates of excess deaths associated with body mass index and other anthropometric variables. Am J Clin Nutr. 2009, 89 (4): 1213-1219. 10.3945/ajcn.2008.26698.

- 20.
Flegal KM, Graubard BI, Williamson DF, Gail MH: Excess Deaths Associated With Underweight, Overweight, and Obesity. JAMA. 2005, 293 (15): 1861-1867. 10.1001/jama.293.15.1861.

- 21.
Flegal KM, Graubard BI, Williamson DF, Gail MH: Cause-Specific Excess Deaths Associated With Underweight, Overweight, and Obesity. JAMA. 2007, 298 (17): 2028-2037. 10.1001/jama.298.17.2028.

- 22.
Monteverde M, Noronha K, Palloni A, Novak B: Obesity and Excess Mortality Among the Elderly in the United States and Mexico. Demography. 2010, 47 (1): 79-96. 10.1353/dem.0.0085.

- 23.
Royston P, Altman DG, Sauerbrei W: Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med. 2006, 25 (1): 127-141. 10.1002/sim.2331.

- 24.
Schauer DP, Arterburn DE, Livingston EH, Fischer D, Eckman MH: Decision modeling to estimate the impact of gastric bypass surgery on life expectancy for the treatment of morbid obesity. Arch Surg. 2010, 145 (1): 57-62. 10.1001/archsurg.2009.240.

- 25.
Gronniger JT: A Semiparametric Analysis of the Relationship of Body Mass Index to Mortality. Am J Public Health. 2006, 96 (1): 173-178. 10.2105/AJPH.2004.045823.

- 26.
Royston P, Sauerbrei W: A new approach to modelling interactions between treatment and continuous covariates in clinical trials by using fractional polynomials. Stat Med. 2004, 23 (16): 2509-2525. 10.1002/sim.1815.

- 27.
Sauerbrei W, Royston P: Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials. Journal Of The Royal Statistical Society Series A. 1999, 162 (1): 71-94. 10.1111/1467-985X.00122.

- 28.
NHIS: 1997 National Health Interview Survey (NHIS) Public Use Data Release, NHIS Survey Description. 2000, Hyattsville, MD: National Center for Health Statistics

- 29.
Kramer FM, Jeffery RW, Forster JL, Snell MK: Long-term follow-up of behavioral treatment for obesity: patterns of weight regain among men and women. Int J Obes. 1989, 13 (2): 123-136.

- 30.
Durazo-Arvizu RA, Cooper RS: Issues related to modeling the body mass index-mortality association: the shape of the association and the effects of smoking status. Int J Obes (Lond). 2008, 32 (Suppl 3): S52-55.

- 31.
Royston P, Ambler G, Sauerbrei W: The use of fractional polynomials to model continuous risk variables in epidemiology. Int J Epidemiol. 1999, 28 (5): 964-974. 10.1093/ije/28.5.964.

- 32.
Royston P, Altman DG: Regression Using Fractional Polynomials of Continuous Covariates: Parsimonious Parametric Modelling. Journal of the Royal Statistical Society Series C (Applied Statistics). 1994, 43 (3): 429-467. 10.2307/2986270.

- 33.
Sauerbrei W, Meier-Hirmer C, Benner A, Royston P: Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs. Computational Statistics & Data Analysis. 2006, 50 (12): 3464-3485. 10.1016/j.csda.2005.07.015.

- 34.
Durazo-arvizu R, McGee D, Li Z, Cooper R: Establishing the nadir of the body mass index-mortality relationship: a case study. J Am Stat Assoc. 1997, 92 (440): 312-319. 1

- 35.
Berrington de Gonzalez A, Hartge P, Cerhan JR, Flint AJ, Hannan L, MacInnis RJ, Moore SC, Tobias GS, Anton-Culver H, Freeman LB, et al: Body-mass index and mortality among 1.46 million white adults. N Engl J Med. 2010, 363 (23): 2211-2219. 10.1056/NEJMoa1000367.

- 36.
Thorogood M, Appleby PN, Key TJ, Mann J: Relation between body mass index and mortality in an unusually slim cohort. J Epidemiol Community Health. 2003, 57 (2): 130-133. 10.1136/jech.57.2.130.

- 37.
Whitlock G, Lewington S, Sherliker P, Clarke R, Emberson J, Halsey J, Qizilbash N, Collins R, Peto R: Body-mass index and cause-specific mortality in 900 000 adults: collaborative analyses of 57 prospective studies. Lancet. 2009, 373 (9669): 1083-1096. 10.1016/S0140-6736(09)60318-4.

- 38.
Troiano RP, Frongillo EA, Sobal J, Levitsky DA: The relationship between body weight and mortality: a quantitative analysis of combined information from existing studies. Int J Obes Relat Metab Disord. 1996, 20 (1): 63-75.

- 39.
Diehr P, O'Meara ES, Fitzpatrick A, Newman AB, Kuller L, Burke G: Weight, mortality, years of healthy life, and active life expectancy in older adults. J Am Geriatr Soc. 2008, 56 (1): 76-83. 10.1111/j.1532-5415.2007.01500.x.

- 40.
Stevens J, Cai J, Pamuk ER, Williamson DF, Thun MJ, Wood JL: The effect of age on the association between body-mass index and mortality. N Engl J Med. 1998, 338 (1): 1-7. 10.1056/NEJM199801013380101.

### Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/11/175/prepub

## Acknowledgements

The authors are grateful for helpful comments and suggestions from Patrick Royston and Ralf Bender.

This paper was written for the Bariatric Obesity Outcome Modeling Collaborative. Members are listed at the end of the manuscript. This work was supported by the HQ Air Force Surgeon General under Award No. FA7014-08-2-0002. Opinions, interpretations, conclusions and recommendations are those of the author and are not necessarily endorsed by the U.S. Air Force. Dr. Wong was supported by the Department of Veterans Affairs, Health Services Research and Development Post-Doctoral Fellowship TPP 61-024.

The Bariatric Outcomes and Obesity Modeling (BOOM) Project is a multidisciplinary research collaboration investigating obesity health services. Primary collaborators include: Franklin Skip Carr and Larry Belenke (Ventura Healthcare Systems LLC); David R. Flum MD, MPH, Andrew Wright MD, Allison Devlin Rhodes MS, Kara MacLeod MPH, Katrina Golub MPH, Erin Machinchick, C. Bradley Kramer MPA, Hao He PhD (Surgical Outcomes Research Center, University of Washington); Sean D. Sullivan PhD, Louis P. Garrison, Jr. PhD, Rafael Alfonso- Cristancho MD, MSc, Bruce C. M. Wang PhD, Edwin S. Wong PhD (Pharmaceutical Outcomes Research and Policy Program, University of Washington and Department of Veterans Affairs); David E. Arterburn MD, MPH, Malia Oliver, Renee Hawkes (Center for Health Studies, Group Health Cooperative); and Louis Martin, MD, MS (Samaritan Physicians).

## Author information

### Affiliations

### Corresponding author

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors' contributions

ESW conceived of the study, participated in its design, performed statistical analyses and drafted the manuscript. BCMW, LPG, RAC and DEA participated in the design of the study and helped to draft the manuscript. DRF and SDS participated in the design of the study, helped to draft the manuscript and acquired funding for the project. All authors read and approved the final manuscript.

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## About this article

### Cite this article

Wong, E.S., Wang, B.C., Garrison, L.P. *et al.* Examining the BMI-mortality relationship using fractional polynomials.
*BMC Med Res Methodol* **11, **175 (2011). https://doi.org/10.1186/1471-2288-11-175

Received:

Accepted:

Published:

### Keywords

- Body Mass Index
- Body Mass Index Category
- Fractional Polynomial
- Body Mass Index Range
- Body Mass Index Distribution