Estimating uncertainty of alcoholattributable fractions for infectious and chronic diseases
 Gerrit Gmel^{1, 2}Email author,
 Kevin D Shield^{1, 3},
 Hannah Frick^{4},
 Tara Kehoe^{1, 5},
 Gerhard Gmel^{1, 6, 7, 8} and
 Jürgen Rehm^{1, 3, 9}
https://doi.org/10.1186/147122881148
© Gmel et al; licensee BioMed Central Ltd. 2011
Received: 14 September 2010
Accepted: 17 April 2011
Published: 17 April 2011
Abstract
Background
Alcohol is a major risk factor for burden of disease and injuries globally. This paper presents a systematic method to compute the 95% confidence intervals of alcoholattributable fractions (AAFs) with exposure and risk relations stemming from different sources.
Methods
The computation was based on previous work done on modelling drinking prevalence using the gamma distribution and the inherent properties of this distribution. The Monte Carlo approach was applied to derive the variance for each AAF by generating random sets of all the parameters. A large number of random samples were thus created for each AAF to estimate variances. The derivation of the distributions of the different parameters is presented as well as sensitivity analyses which give an estimation of the number of samples required to determine the variance with predetermined precision, and to determine which parameter had the most impact on the variance of the AAFs.
Results
The analysis of the five Asian regions showed that 150 000 samples gave a sufficiently accurate estimation of the 95% confidence intervals for each disease. The relative risk functions accounted for most of the variance in the majority of cases.
Conclusions
Within reasonable computation time, the method yielded very accurate values for variances of AAFs.
Background
Alcohol consumption is a major risk factor for burden of disease and injuries globally [1, 2] as demonstrated by the Comparative Risk Analyses (CRA) within the Global Burden of Disease and Injury (GBD) Studies [2, 3]. To estimate the impact of alcohol consumption on infectious and chronic diseases, alcohol attributable fractions (AAFs) are calculated [4] and applied to the number of deaths or number of incident cases [5].
Up until now, confidence intervals (CIs) have not been presented in the CRA for the estimates of alcoholattributable health harms. While there are methods to calculate uncertainty around AAFs when both exposure and risk relations are derived from the same cohort [6, 7], no such methods exist for the case where both exposure and risk relations stem from two different metaanalyses (for general concerns and considerations see [5, 8, 9] and the Discussion section below). This article aims to fill this gap, and for the first time will present a method to calculate CIs for the new AAFs modelling methodology used in the 2005 CRA study for chronic diseases by region, sex and age (see [4] for a description of the AAF modelling methodology and [10] for a comparison of the new AAF modelling methodology with previous methods.)
Alcohol is related to many disease categories [5]. Since globally morbidity and mortality can only be reliably estimated for broad disease or injury categories, the GBD is restricted to 126 distinct broad disease or injury categories http://www.globalburden.org/GBD_Study_Operations_Manual_Jan_20_2009.pdf, of which 31 are causally related to alcohol [5]. We will first be using exposure measures and relative risks for disease categories from the 2005 CRA study for which a metaanalysis providing a continuous relative risk function exists to estimate AAFs [5], and then will explain the methodology to construct CIs for these AAFs. This paper will focus on the Asian regions as an illustration of our results. Asia presents an interesting mix of low income and high income regions and allows us to illustrate succinctly our methodology.
Methods
This method has two main steps: (1) calculation of the AAFs, and (2) calculation of the variance for the AAFs. Information from multiple sources, all of which carries a certain degree of uncertainty, is required in order to calculate the AAFs. This information is outlined below.
Definition of regions
Regions were defined in accordance with the 2005 GBD study [11]. Countries were grouped into regions which were defined by their geographical location and epidemiological profile which includes child and adult mortality levels and major causes of death. Neither income nor population of the countries in a region had an impact on the grouping. For the purpose of illustrating the method, we restricted the analysis to the five Asian regions containing the countries listed below:

Asia Pacific, High Income: Brunei Darussalam, Japan, Republic of Korea, Singapore

Asia Central: Armenia, Azerbaijan, Georgia, Kazakhstan, Kyrgyzstan, Mongolia, Tajikistan, Turkmenistan, Uzbekistan

Asia East: China, Democratic People's Republic of Korea, Hong Kong, Macao, Taiwan

Asia South: Afghanistan, Bangladesh, Bhutan, India, Nepal, Pakistan

Asia Southeast: Cambodia, Christmas Island, Cocos Island, Indonesia, Lao People's Democratic Republic, Malaysia, Maldives, Mauritius, Myanmar, Philippines, Reunion, Seychelles, Sri Lanka, Thailand, Timor Leste, Vietnam
Population estimates for each region by country for 2005 were based on estimates obtained from the 2008 revisions of the United Nations Population Division [12].
Definition of age categories
Three age categories were used in the CRA study: 15  34, 35  64 and 65 or greater; we limited our study to the age category of 15 to 34 years. Ages were clustered so that results would be comparable with the 2005 GBD study.
Measures of alcohol consumption
Adult per capita consumption is calculated by adding together the estimated recorded and unrecorded alcohol consumption [13, 14]. The variance of an estimate of recorded consumption was based on estimates from different sources (for example, government data, industry data, Food and Agriculture Organization), which are usually quite similar. The main sources for determining unrecorded consumption are home production, alcohol intended for industrial, technical, and medical uses, and illegal production or importation of alcohol. The variance of an estimate of unrecorded consumption is larger in comparison to that of recorded consumption and there are usually only sparse sources for information on unrecorded consumption which is often based on limited empirical evidence [14, 15]. Since uncertainty of unrecorded adult per capita consumption is not provided in the 2005 CRA study, we assumed the standard deviation of unrecorded adult per capita consumption was proportionally five times larger than the standard deviation for recorded adult per capita consumption. The prevalence of lifetime abstainers and former drinkers was estimated from a populationweighted average of surveys in the respective regions by sex and age. Using the proportion of current drinkers we calculated the per capita consumption of alcohol per current drinker, which was used in modelling alcohol consumption. The variance of prevalence can be estimated using a binomial distribution, as illustrated below in the Statistical procedures section.
Modelling alcohol consumption
Using comparable studies, involving 1001 distributions from 66 countries by sex and age, it can be shown that the distribution of alcohol consumption for the drinking population is modelled best using the gamma distribution [4]. It is well known that population surveys underestimate true consumption, and thus data from surveys have to be triangulated with estimates of adult per capita consumption, which are often based on sales data [4, 13]. To be conservative, we assumed that 80% of this registrybased estimate reflected the true adult per capita consumption; this level was chosen to account for the alcohol wasted and not consumed (for example, broken bottles and quantities left over in glasses) and for the underestimation of true consumption in medical epidemiological studies, which were used in the metaanalyses that estimated the relative risk functions. A regression of the abovementioned studies showed a strong relationship between mean and standard deviation (for men and women, the explained variance of standard deviation was greater than 90%). This relationship allows us to compute the standard deviation of an upshifted distribution very easily. Finally, this method relies on the assumption that the proportion of alcohol consumed by the various sex and age groups derived from surveys is accurate [4].
Measures of relative risk
Step 1: Calculation of the AAF
This step requires calculating the daily consumption AAF estimates, and will be outlined below.
Alcoholattributable fractions (AAFs)
where P_{abs} is the proportion of lifetime abstainers, P_{form} is the proportion of former drinkers among the population, and RR_{form} is the relative risk of the latter proportion. P(x) represents the prevalence of drinking at level × (in grams per day, modelled by a gamma function), and RR(x) is the relative risk at this level compared to lifetime abstainers. In the CRAs, AAFs are usually calculated separately by sex, age, and sometimes by ethnic groups. In our study of Asian regions, AAFs were computed by region (see below), sex and age.
We did not use this mathematical expression in its original form when estimating the AAFs for several reasons. Firstly, a person whose daily consumption exceeds 150 grams per day is highly unlikely to consume this amount over a long period of time. Therefore, to be conservative, the average daily consumption was truncated at 150 grams per day. Secondly, when there is truncation at 150 grams per day, the gamma distribution needs to be normalized by adding a coefficient in front of the probability density function to ensure that the area under this function will integrate to 1 between 0 and 150 grams of alcohol per day.
Step 2: Calculation of the variance of the AAF
This step requires calculating the variance of the AAF estimates with risk data, and will be outlined below.
In order to derive 95% CIs for AAFs, two paths can be taken. The first one consists of deriving the expression for the variance of the AAF by taking into account all the errors of the parameters on which it depends, and subsequently computing the CI for the AAF. This approach, although mathematically accurate, is too complex in our case. Indeed, the AAF depends on the relative risk function, the prevalence of former drinkers and abstainers, and the distribution of consumption among drinkers. Since errors in these values and functions are nontrivial, it is virtually impossible to compute the variance of AAFs algebraically.
The second approach is simpler, but less accurate, and requires more computation. A number (we will call it N for simplicity) of random sets of the lowest level parameters (the parameters from which all other values are derived) are generated, namely the coefficients of the relative risk functions, the adult per capita consumption and the prevalence of former drinkers and lifetime abstainers. Each random set of lowest level parameters will then yield an AAF value for a total of N AAFs for each region, sex, age group and disease. The variance of the N AAFs will approach the true variance as N increases. This corresponds to calculating the variance of an AAF using a Monte Carlotype method [18].
In order to generate these random samples for each lower level parameter, the distribution, mean and variance of each parameter must be known. The following paragraphs elucidate the methods used to determine the properties of each parameter.
Statistical procedures
The simulations were implemented in R (version: 2.10.1, refer to "Additional file 1: Example of R  code for simulations" for an example of the code) and the numerical errors inherent in any computational program were neglected (for example, the error (uncertainty) which is added by using numerical integration in calculating the AAFs was not taken into consideration for our variance calculations). The random normal generation of adult per capita consumption for the drinking population sometimes yields values that are negative or zero, which are factually impossible. In these instances, the value was set to 0.001 to symbolize very low consumption. Mathematically, a zero mean consumption would transform the gamma distribution into a Dirac distribution located at 0. In addition, drinkers would have a consumption of 0 grams per day which is not compatible with the definition of current drinkers.
The generation of adult per capita consumption assumes a random normal distribution as we have no information about an alternative distribution. Very low per capita consumption occasionally obtained by the random normal generation caused some additional trouble during the computation. The method used in R to numerically integrate a function results in errors and incorrect results if the corresponding function is either constant or approximately constant. When a gamma distribution has mean values that approach 0, it is spread very little (according to the linear relation between mean and standard deviation). This makes the distribution approximately constant after the initial spike close to the origin. These functions cannot be integrated and R produces an error message. As this problem occurs only when consumption levels are estimated to be very low, the assumption was made that under such circumstances the AAF calculated with this set of parameters would also be 0. This method assumes that former drinkers are not at an elevated risk for the given disease.
The generation of θ parameters is more difficult. Estimates of θ are different for each region, sex and age group since θ depends on the mean and variance of each gamma distribution. The generation of θ was performed in 2 steps: first, we generated a random sample of adult per capita consumption values from a normal distribution using the mean and standard deviation of this distribution, and second we generated a random sample of the prevalence of lifetime abstainers and former drinkers.
The effective sample size of each survey used to estimate the proportion of lifetime abstainers and former drinkers was assumed to be 1000 (to reflect an average sample size for surveys of 6000 per population, assuming that the three agesex categories have equal cell size). Using these values, it is possible to calculate the corresponding proportion of drinkers, the mean consumption per sexage category and, finally, θ, which is then simply given by .
To account for the error of the final relative risk functions, N instances of each betacoefficient were generated based on the covariance matrix. Each of these N relative risk functions obtained with one instance of each betacoefficient was then assigned to one set of parameters defining the population (mean adult per capita consumption, proportion of abstainers and proportion of former drinkers). The relative risk functions were assumed to be the same for all regions and age groups.
As previously mentioned, each random set of lowest level parameters described above were then used to calculate an AAF value for a total of N AAFs for each region, sex, age group and disease. The variance of the N AAFs was used as the true variance of the AAF estimates.
Main analysis, sensitivity analyses, and evaluation of the impact of each variable on the variance
As an example of this method, we calculated the AAFs for males aged 15 to 34 in the Asian regions; however, the abovedescribed methods can also be used to calculate the AAFs for females. In addition, to demonstrate that partial AAFs and variances for these AAFs can be calculated for different consumption levels, we estimated the AAFs for cardiovascular diseases, ischemic stroke and diabetes for males aged 15 to 34 who are low consumers of alcohol (0 to 39.9 grams of alcohol per day), moderate consumers of alcohol (40 to 59.9 grams of alcohol per day) and heavy consumers of alcohol (60 to 150 grams of alcohol per day).
In order to accurately estimate the variance of an AAF we need to determine how many samples are required. Too few samples could lead to inaccurate results, while increasing the number of samples increases computing times and may require a larger amount of storage. Additionally, after a large number of iterations, the gain in accuracy is very small and does not provide new substantial information. Therefore, in order to determine the optimal number of random samples needed to calculate the variance of an AAF, a sensitivity analysis was performed. Since our samples are randomly generated, each set of samples is independent and allows us to collect a large amount of data relatively quickly. To decrease computation time, the code was adapted to generate 150 sets, each containing 1000 AAF estimates for each region (by sex and age and disease). The variance of each set of 1000 AAFs can then be averaged to estimate the variance of larger sets. By systematically increasing the number of sets used to calculate the average variance, we estimated the number of samples required for the variance to settle.
Next, we carried out an analysis to estimate the impact of each component on the final variance using the same sets of randomly generated variables, but in different arrangements. For the purposes of this analysis, only 1000 sets of lowest level parameters (see above for a definition) were generated.
To calculate the impact on the variance of each parameter, the AAFs were calculated for a set of parameters in which only the parameter tested was randomly generated while the other parameters were held constant. The variance obtained from the generated AAFs then represented the variance induced by the error of this single variable. Since the AAF function is nonlinear, the variances obtained cannot simply be added together to obtain the total variance. To simplify the interpretation of the results, each contribution was normalized so that the sum equalled the total variance obtained as a result of the computation explained in the previous paragraphs. For the purpose of our analysis, the computations of the proportion of total variance explained by different variables were restricted to men in the five Asian regions defined above.
To compare in terms of dose response and magnitude the AAFs calculated using the new methodology by Rehm and colleagues to the method used in the 2004 CRA study [19], we calculated the partial AAFs for cardiovascular diseases and diabetes of multiple drinking categories for men in the five above defined Asian regions. The drinking categories were defined as
1) 0 to < 0.25 grams per day, 2) 0.25 to < 20 grams per day, 3) 20 to < 40 grams per day, and 4) 40+ grams per day. The relative risks used in the 2004 CRA study for cardiovascular diseases and diabetes were obtained from Gutjahr et al., [20], Reynolds et al., [21], Carrao et al., [22], and Corrao et al., [23].
Considerations of computing time
As R is a singlecore program, splitting up the code into different parts (for example, by sex and age) allows a user to take advantage of the multicore architecture of modern central processing units. Additionally, when dealing with large data sets, R slows down considerably. The splitting of the program into different subprograms by age, sex and region allows a user to reduce the size of the data sets, and therefore to speed up the computations.
Results and Discussion
Per capita alcohol estimates (in grams per day)
Region  Men  S.E.  

1  Asia, Pacific (high income)  13.51  1.57 
2  Asia, Central  10.62  1.73 
3  Asia, East  9.88  1.42 
4  Asia, South  3.80  0.80 
5  Asia, Southeast  5.21  0.79 
Prevalence of current drinkers, former drinkers and lifetime abstainers by region, and age
Region  Sex  lifetime abstainer  S.E.  former drinker  S.E.  current drinker  S.E. 

1  M  0.05  0.01  0.07  0.01  0.87  0.01 
2  M  0.23  0.01  0.13  0.01  0.64  0.02 
3  M  0.13  0.01  0.15  0.01  0.72  0.01 
4  M  0.73  0.01  0.11  0.01  0.17  0.01 
5  M  0.56  0.02  0.17  0.01  0.27  0.01 
AAFs and 95% confidence intervals for the 5 Asian regions considering only the male population aged 15  34 years
Asia, Pacific (High Income)  Asia, Central  Asia, East  Asia, South  Asia, Southeast  

DISEASE  AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound 
Oral Cavity and Pharynx Cancer  43.00%  35.50%  50.40%  41.80%  32.60%  51.00%  25.70%  19.60%  31.80%  16.40%  10.20%  22.60%  22.70%  16.50%  28.90% 
Oesophagus Cancer  25.90%  20.60%  31.30%  25.60%  18.70%  32.60%  14.90%  11.30%  18.50%  9.30%  5.80%  12.80%  13.50%  9.80%  17.20% 
Colon Cancer  4.40%  1.60%  7.10%  4.70%  2.00%  7.40%  4.70%  2.70%  6.70%  2.70%  1.50%  4.00%  4.90%  2.80%  7.00% 
Rectum Cancer  7.40%  5.10%  9.60%  7.40%  5.00%  9.80%  6.10%  4.20%  8.00%  3.40%  2.20%  4.70%  5.90%  3.90%  8.00% 
Liver Cancer  13.40%  8.40%  18.40%  12.60%  7.50%  17.60%  9.10%  6.00%  12.10%  4.90%  2.90%  6.90%  8.10%  5.30%  10.90% 
Larynx Cancer  27.70%  21.70%  33.60%  27.30%  19.70%  34.90%  15.80%  11.90%  19.80%  9.90%  6.10%  13.80%  14.30%  10.30%  18.30% 
Coronary Heart Disease  13.80%  36.40%  8.80%  5.40%  24.10%  13.30%  7.50%  16.60%  1.60%  0.40%  4.20%  5.10%  0.40%  6.80%  6.00% 
Epilepsy  24.90%  17.80%  32.10%  24.60%  16.00%  33.30%  14.50%  10.30%  18.70%  8.90%  4.90%  13.00%  13.00%  8.80%  17.20% 
Conduction Disorder and other Dysrhythmias  11.70%  7.30%  16.10%  11.40%  6.70%  16.10%  8.10%  5.40%  10.70%  4.60%  2.70%  6.40%  7.50%  4.90%  10.00% 
Pancreatitis  19.90%  11.80%  28.00%  27.00%  12.80%  41.10%  7.80%  5.10%  10.50%  10.20%  3.40%  17.00%  11.00%  6.30%  15.70% 
Lower Respiratory Infections  9.80%  2.50%  17.00%  9.60%  2.30%  16.90%  7.20%  3.40%  10.90%  4.10%  1.60%  6.50%  6.80%  3.50%  10.10% 
Hemorrhagic Stroke  Morbidity  15.80%  12.20%  19.30%  15.70%  10.80%  20.70%  11.20%  4.70%  17.80%  6.70%  2.10%  11.30%  10.70%  2.90%  18.50% 
Hemorrhagic Stroke  Mortality  14.20%  9.00%  19.50%  14.20%  8.00%  20.50%  10.60%  3.80%  17.40%  6.20%  1.50%  11.00%  10.10%  2.20%  18.10% 
Ischemic Stroke  Morbidity  4.70%  11.80%  2.40%  1.80%  4.40%  8.00%  2.00%  11.40%  7.40%  3.00%  1.90%  7.80%  4.10%  5.00%  13.10% 
Ischemic Stroke  4.30%  11.30%  2.70%  2.30%  3.80%  8.40%  2.00%  11.40%  7.40%  3.10%  1.80%  8.00%  4.20%  4.80%  13.20% 
Tuberculosis  22.10%  12.50%  31.70%  23.20%  13.00%  33.40%  9.90%  5.30%  14.40%  8.40%  4.00%  12.80%  11.80%  6.50%  17.10% 
Diabetes Mellitus  5.30%  17.80%  7.30%  0.20%  11.50%  11.20%  2.50%  10.30%  5.20%  1.40%  2.80%  5.60%  1.50%  5.40%  8.30% 
Hypertension  17.50%  12.30%  22.70%  16.50%  10.30%  22.70%  8.40%  5.50%  11.40%  4.70%  2.20%  7.30%  6.80%  4.00%  9.50% 
Liver Cirrhosis  Morbidity  34.20%  24.50%  43.90%  35.10%  22.60%  47.60%  19.80%  9.20%  30.50%  13.80%  4.60%  23.10%  18.80%  6.20%  31.40% 
Liver Cirrhosis  57.00%  45.20%  68.90%  61.40%  45.80%  77.00%  32.10%  21.30%  43.00%  30.30%  15.10%  45.60%  34.00%  20.20%  47.80% 
AAFs and 95% confidence intervals for the 5 Asian regions based on consumption patterns considering only the male population aged 15  34 years
Asia, Pacific (High Income)  Asia, Central  Asia, East  Asia, South  Asia, Southeast  

AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound  AAF  lower bound  upper bound  
Low consumption (0 to 39.9 grams of alcohol per day)  Coronary Heart Disease  13.91%  23.33%  4.49%  7.08%  13.80%  0.37%  10.88%  19.18%  2.58%  1.55%  2.97%  0.14%  4.18%  6.43%  1.93% 
Conduction Disorder and other Dysrhythmias  5.58%  3.76%  7.40%  3.39%  1.95%  4.82%  3.86%  2.18%  5.55%  0.81%  0.38%  1.25%  1.92%  1.31%  2.53%  
Ischemic Stroke  Morbidity  9.24%  13.37%  5.12%  4.48%  7.51%  1.45%  8.13%  11.65%  4.62%  0.99%  1.61%  0.38%  2.84%  3.81%  1.88%  
Ischemic Stroke  Mortality  9.27%  13.45%  5.09%  4.46%  7.55%  1.38%  8.23%  11.77%  4.69%  0.99%  1.58%  0.39%  2.85%  3.82%  1.88%  
Diabetes Mellitus  7.19%  12.98%  1.40%  3.84%  8.06%  0.38%  5.49%  10.65%  0.32%  0.87%  1.76%  0.03%  2.26%  3.71%  0.81%  
Hypertension  8.82%  6.98%  10.67%  5.44%  3.95%  6.94%  6.15%  4.43%  7.86%  1.33%  0.78%  1.88%  3.10%  2.37%  3.84%  
Moderate consumption (40 to 60 grams of alcohol per day)  Coronary Heart Disease  1.08%  4.02%  1.85%  0.88%  3.29%  1.52%  0.40%  3.88%  3.09%  0.21%  1.20%  0.77%  0.37%  1.70%  0.96% 
Conduction Disorder and other Dysrhythmias  2.45%  1.41%  3.50%  2.05%  1.05%  3.05%  0.89%  0.29%  2.07%  0.51%  0.16%  0.86%  0.85%  0.39%  1.31%  
Ischemic Stroke  Morbidity  0.55%  0.02%  1.12%  0.47%  0.01%  0.95%  0.19%  0.48%  0.85%  0.12%  0.07%  0.30%  0.19%  0.06%  0.44%  
Ischemic Stroke  Mortality  0.69%  0.18%  1.20%  0.59%  0.14%  1.03%  0.24%  0.36%  0.84%  0.14%  0.03%  0.31%  0.24%  0.01%  0.46%  
Diabetes Mellitus  0.45%  2.12%  1.21%  0.36%  1.72%  1.00%  0.17%  2.16%  1.81%  0.09%  0.64%  0.47%  0.15%  0.91%  0.61%  
Hypertension  4.15%  2.71%  5.59%  3.48%  2.00%  4.96%  1.52%  0.15%  3.18%  0.87%  0.38%  1.37%  1.45%  0.85%  2.06%  
Heavy consumption (60+ grams of alcohol per day)  Coronary Heart Disease  0.22%  3.00%  3.43%  0.60%  2.47%  3.67%  0.01%  5.64%  5.61%  0.17%  3.27%  3.60%  0.08%  3.09%  3.25% 
Conduction Disorder and other Dysrhythmias  3.71%  2.11%  5.31%  5.21%  3.46%  6.96%  0.61%  2.11%  3.33%  1.42%  0.11%  2.96%  1.35%  0.07%  2.77%  
Ischemic Stroke  Morbidity  1.91%  1.07%  2.75%  2.82%  1.91%  3.73%  0.29%  1.19%  1.77%  0.76%  0.09%  1.61%  0.69%  0.07%  1.45%  
Ischemic Stroke  Mortality  2.17%  1.26%  3.07%  3.19%  2.17%  4.21%  0.33%  1.25%  1.91%  0.86%  0.07%  1.79%  0.78%  0.02%  1.59%  
Diabetes Mellitus  1.34%  0.81%  3.50%  2.31%  0.22%  4.40%  0.15%  3.72%  4.02%  0.63%  1.84%  3.11%  0.49%  1.72%  2.70%  
Hypertension  6.70%  4.04%  9.36%  9.41%  6.40%  12.42%  1.11%  3.31%  5.53%  2.66%  0.04%  5.28%  2.49%  0.15%  4.84% 
Impact on total variance of each parameter
Examples of changes in variance as the sample size increases
Comparison of AAFs using new and old methodologies
AAFs for the 5 Asian regions based on methods used in the 2004 CRA study and methods used in the 2005 CRA study for males aged 15  34 years
Asia, Pacific (High Income)  Asia, Central  Asia, East  Asia, South  Asia, Southeast  

New Method  Old Method  New Method  Old Method  New Method  Old Method  New Method  Old Method  New Method  Old Method  
Abstainers, Former drinkers or very light drinkers  Coronary Heart Disease  0.71%  0.00%  1.63%  0.00%  2.93%  0.00%  1.95%  0.00%  3.68%  0.00% 
Ischemic Stroke  Morbidity  1.27%  0.00%  2.61%  0.00%  4.68%  0.00%  3.05%  0.00%  5.71%  0.00%  
Ischemic Stroke  Mortality  1.26%  0.00%  2.61%  0.00%  4.67%  0.00%  3.05%  0.00%  5.71%  0.00%  
Diabetes Mellitus  0.66%  0.00%  1.43%  0.00%  2.57%  0.00%  1.68%  0.00%  3.19%  0.00%  
Hypertension  0.09%  0.00%  0.04%  0.00%  0.10%  0.00%  0.01%  0.00%  0.03%  0.00%  
Drinking Category I (0.25 to < 40 grams of alcohol per day)  Coronary Heart Disease  18.18%  20.30%  8.74%  9.52%  15.26%  17.50%  1.89%  2.04%  5.30%  5.84% 
Ischemic Stroke  Morbidity  13.22%  8.46%  6.08%  4.03%  12.36%  7.90%  1.33%  0.90%  3.96%  2.61%  
Ischemic Stroke  Mortality  13.25%  8.46%  6.07%  4.03%  12.46%  7.90%  1.32%  0.90%  3.96%  2.61%  
Diabetes Mellitus  11.05%  4.19%  5.44%  1.91%  9.55%  4.34%  1.20%  0.43%  3.37%  1.32%  
Hypertension  6.03%  7.47%  4.11%  4.35%  2.93%  5.61%  1.01%  1.04%  2.10%  2.59%  
Drinking Category II (40 to < 60 grams of alcohol per day)  Coronary Heart Disease  1.08%  4.81%  0.88%  2.63%  0.40%  4.18%  0.21%  0.60%  0.37%  1.53% 
Ischemic Stroke  Morbidity  0.55%  2.30%  0.47%  0.64%  0.19%  3.27%  0.12%  0.13%  0.19%  0.72%  
Ischemic Stroke  Mortality  0.69%  2.30%  0.59%  0.64%  0.24%  3.27%  0.14%  0.13%  0.24%  0.72%  
Diabetes Mellitus  0.45%  7.08%  0.36%  4.42%  0.17%  4.98%  0.09%  1.01%  0.15%  2.25%  
Hypertension  4.15%  0.86%  3.48%  1.89%  1.52%  2.08%  0.87%  0.49%  1.45%  0.35%  
Drinking Category III (60+ grams of alcohol per day)  Coronary Heart Disease  0.22%  3.38%  0.60%  1.49%  0.01%  3.66%  0.17%  0.33%  0.08%  1.07% 
Ischemic Stroke  Morbidity  1.91%  2.13%  2.82%  0.06%  0.29%  3.43%  0.76%  0.07%  0.69%  0.64%  
Ischemic Stroke  Mortality  2.17%  2.13%  3.19%  0.06%  0.33%  3.43%  0.86%  0.07%  0.78%  0.64%  
Diabetes Mellitus  1.34%  5.20%  2.31%  3.79%  0.15%  3.98%  0.63%  0.91%  0.49%  1.68%  
Hypertension  6.70%  3.95%  9.41%  7.53%  1.11%  2.25%  2.66%  2.11%  2.49%  1.53% 
Conclusions
In this paper we have presented a method to estimate uncertainty around AAFs and illustrated our results using data for men aged 15 to 34 years in several Asian regions. The use of 60 000 to 70 000 Monte Carlo samples yields stable variance estimates in most cases, but we propose the use of 150 000 samples to ensure stable CIs. Uncertainties about risk relations and about total per capita consumption were identified as the main contributors to variances of the AAFs for men aged 15 to 34 years in the five Asian regions. These variances indicate that for some disease categories the doseresponse curve from alcohol has not been sufficiently researched. The observed large variances may result from an insufficient number of underlying articles describing doseresponse or from the nonlinear nature of doseresponse relationships. As some of the nonlinear nature may be caused by other dimensions of alcohol consumption (for example, irregular heavy drinking occasions in the case of ischemic diseases) [24, 25], it will not be enough to just conduct more epidemiological studies into the impact of average volume of alcohol consumption on the incidence of diseases (for an overview see [5]). Instead, other relevant dimensions of alcohol consumption, which could play a role in confounding the average volume of alcohol consumption, should be included in the design of cohort studies, and then should be statistically controlled for by using, for example, metaregression techniques [26].
One limitation of our approach was the use of adjusted relative risks in determining AAFs. The relative risk formulas we used were developed for risks only adjusted for age (see [8, 9, 27]). Two arguments can be made to justify the use of these formulas. Firstly, in risk analyses, such as the CRA for the GBD Studies [28], almost all of the underlying studies for the different risk factors report only adjusted risks. Relying on unadjusted risks would severely bias the estimated risk functions as only a small proportion of generally older studies could be included. Secondly, for alcohol in particular, most of the analyses show no marked differences after adjustment for the usual risk factors tested (see [5], and the metaanalyses cited there). The need for adjustment to the relative risks may change when other dimensions of alcohol consumption, such as irregular heavy drinking occasions, are considered (see above).
Another limitation of the new methodology is the nature of the relative risks that are used in the CRA study. As there is likely to be undercoverage of alcohol consumption in the medical epidemiological studies upon which the relative risks are based, modelling 100% of adult per capita consumption will lead to biased results. Accordingly, as coverage of alcohol consumption in these studies is likely greater than 70% [10], we modelled alcohol consumption as 80% of adult per capita consumption. This adjustment leads to lower estimates of alcoholattributable health harms [10]. Additionally, we modelled average daily alcohol consumption from 0 to 150 grams a day, using 150 grams as a maximum level. In very rare cases people may drink more than 150 grams per day; however, it is unlikely that this level of consumption would be maintained over an extended period of time [29]. An upper limit of alcohol consumption in grams per day may lead to an underestimation of the effects of alcohol in terms of total harms, especially where alcohol at low doses has a positive effect and at high doses has a negative effect, such as with cardiovascular diseases, ischemic stroke and diabetes. Such instances are limited, however, as the risk ratios used to model the effects of alcohol were fractional polynomials allowing us to accurately characterize curvilinear risk relationships. Additionally, alcohol starts to have a negative effect well below a consumption level of 150 grams per day and, thus, limiting our consumption models to 150 grams per day does not have a substantial effect on the AAFs. Furthermore, as the upper limit of sustainable alcohol consumption probably differs depending on the sex of the drinker, more research is needed to define these limits.
Our new methodology is capable of being adjusted to take into account different parameters of alcohol consumption [10]. For example, this method can easily be modified for future research that focuses on the effects of specific alcohol consumption patterns on the burden of disease. In summary, future iterations of the CRA, or similar studies, should include CIs, as our methodology offers a feasible way to estimate the uncertainty of attributable fractions for all burdens of disease.
Declarations
Acknowledgements
Financial support for this study was provided to the last author listed above by the National Institute for Alcohol Abuse and Alcoholism (NIAAA) with contract # HHSN267200700041C to conduct the study titled "Alcohol and Drugattributable Burden of Disease and Injury in the US". In addition, the last author received a salary and infrastructure support from the Ontario Ministry of Health and LongTerm Care.
Authors’ Affiliations
References
 WHO: Global Health Risks. Mortality and burden of disease attributable to selected major risks. 2009, Geneva, SwitzerlandGoogle Scholar
 Rehm J, Mathers C, Popova S, Thavorncharoensap M, Teerawattananon Y, Patra J: Global burden of disease and injury and economic cost attributable to alcohol use and alcoholuse disorders. Lancet. 2009, 373: 22232233. 10.1016/S01406736(09)607467.View ArticlePubMedGoogle Scholar
 Rehm J, Room R, Monteiro M, Gmel G, Graham K, Rehn N, Sempos CT, Frick U, Jernigan D: Alcohol Use. Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors. Edited by: Ezzati M, Lopez AD, Rodgers A, Murray CJL. 2004, Geneva: WHO, 1: 9591109.Google Scholar
 Rehm J, Kehoe T, Gmel G, Stinson F, Grant B, Gmel G: Statistical modelling of volume of alcohol exposure for epidemiological studies of population health: the example of the US. Population Health Metrics. 2010, 8: 310.1186/1478795483.View ArticlePubMedPubMed CentralGoogle Scholar
 Rehm J, Baliunas D, Borges GLG, Graham K, Irving HM, Kehoe T, Parry CD, Patra J, Popova L, Poznyak V, Roerecke M, Room R, Samokhvalov AV, Taylor B: The relation between different dimensions of alcohol consumption and burden of disease  an overview. Addiction. 2010, 105: 817843. 10.1111/j.13600443.2010.02899.x.View ArticlePubMedPubMed CentralGoogle Scholar
 Hilebrandt M, Bender R, Gehrmann U, Bletter M: Calculating confidence intervals for impact numbers. BMC Medical Research Methodology. 2006, 6: 3210.1186/14712288632.View ArticleGoogle Scholar
 Stata Corporation: Statistical Software: Release 11.0. 2009, College Station, TXGoogle Scholar
 Rockhill B, Newman B: Use and misuse of population attributable fractions. Am J Public Health. 1998, 88: 1519. 10.2105/AJPH.88.1.15.View ArticlePubMedPubMed CentralGoogle Scholar
 Korn EL, Graubard BI: Analysis of Health Surveys. 1999, New York: John Wiley & Sons IncView ArticleGoogle Scholar
 Shield K, Kehoe T, Taylor B, Patra J, Rehm J: Alcoholattributable burden of disease and injury in Canada, 2004. Int J Public Health. 2011,Google Scholar
 Institute for Health Metrics and Evaluation: GBD operations manual final draft. [http://www.who.int/healthinfo/global_burden_disease/GBD_2005_study/en/index.html]
 United Nations Populations Division: World populations prospects  the 2008 revision. 2010, New YorkGoogle Scholar
 Rehm J, Klotsche J, Patra J: Comparative quantification of alcohol exposure as risk factor for global burden of disease. Int J Methods Psychiatr Res. 2007, 16: 6676. 10.1002/mpr.204.View ArticlePubMedGoogle Scholar
 Rehm J, Rehn N, Room R, Monteiro M, Gmel G, Jernigan D, Frick U: The global distribution of average volume of alcohol consumption and patterns of drinking. Eur Addict Res. 2003, 9: 147156. 10.1159/000072221.View ArticlePubMedGoogle Scholar
 Rehm J, Kanteres F, Lachenmeier DW: Unrecorded consumption, quality of alcohol and health consequences. Drug Alcohol Rev. 2010, 29: 426436. 10.1111/j.14653362.2009.00140.x.View ArticlePubMedGoogle Scholar
 Rehm J, Taylor B, Mohapatra S, Irving H, Baliunas D, Patra J, Roerecke M: Alcohol as a risk factor for liver cirrhosis  a systematic review and metaanalysis. Drug and Alcohol Review. 2010, 29: 437445. 10.1111/j.14653362.2009.00153.x.View ArticlePubMedGoogle Scholar
 Patra J, Taylor B, Irving H, Roerecke M, Baliunas D, Mohapatra S, Rehm J: Alcohol consumption and the risk of morbidity and mortality from different stroke types  A systematic review and metaanalysis. BMC Public Health. 2010, 10: 25810.1186/1471245810258.View ArticlePubMedPubMed CentralGoogle Scholar
 Kalos M, Whitlock P: Monte Carlo Methods, Second, revised and enlarged edition. 2008, Weinheim: WileyVCH Verlag GmBH & CoGoogle Scholar
 Rehm J, Patra J, Popova S: Alcoholattributable mortality and potential years of life lost in Canada 2001: implications for prevention and policy. Addiction. 2006, 101: 373384. 10.1111/j.13600443.2005.01338.x.View ArticlePubMedGoogle Scholar
 Gutjahr E, Gmel G, Rehm J: The relation between average alcohol consumption and disease: an overview. Eur Addict Res. 2001, 7: 117127. 10.1159/000050729.View ArticlePubMedGoogle Scholar
 Reynolds K, Lewis B, Nolen J, Kinney G, Sathya B, He J: Alcohol consumption and risk of stroke: a metaanalysis. JAMA. 2003, 289: 579588. 10.1001/jama.289.5.579.View ArticlePubMedGoogle Scholar
 Corrao G, Rubbiati L, Bagnardi V, Zambon A, Poikolainen K: Alcohol and coronary heart disease: A metaanalysis. Addiction. 2000, 95: 15051523. 10.1046/j.13600443.2000.951015056.x.View ArticlePubMedGoogle Scholar
 Corrao G, Bagnardi V, Zambon A, Arico S: Exploring the doseresponse relationship between alcohol consumption and the risk of several alcoholrelated conditions: a metaanalysis. Addiction. 1999, 94: 15511573. 10.1046/j.13600443.1999.9410155111.x.View ArticlePubMedGoogle Scholar
 Puddey IB, Rakic V, Dimmitt SB, Beilin LJ: Influence of pattern of drinking on cardiovascular disease and cardiovascular risk factors  a review. Addiction. 1999, 94: 649663. 10.1046/j.13600443.1999.9456493.x.View ArticlePubMedGoogle Scholar
 Roerecke M, Rehm J: Irregular heavy drinking occasions and risk of ischemic heart disease: a systematic review and metaanalysis. American Journal of Epidemiology. 2010, 171: 633644. 10.1093/aje/kwp451.View ArticlePubMedGoogle Scholar
 Bagnardi V, Zambon A, Quatto P, Corrao G: Flexible metaregression functions for modeling aggregate doseresponse data, with an application to alcohol and mortality. Am J Epidemiol. 2004, 159: 10771086. 10.1093/aje/kwh142.View ArticlePubMedGoogle Scholar
 Flegal KM, Williamson DF, Graubard BI: Using adjusted relative risks to calculate attributable fractions. Am J Public Health. 2006, 96: 39810.2105/AJPH.2005.079731.View ArticlePubMedPubMed CentralGoogle Scholar
 Ezzati M, Lopez A, Rodgers A, Murray CJL: Comparative quantification of health risks. Global and regional burden of disease attributable to selected major risk factors. 2004, Geneva: WHOGoogle Scholar
 Kremer G: Alkoholprobleme im Allgemeinkrankenhaus. Früherkennung und Kurzintervention bei Patientinnen und Patienten mit Alkoholproblemen in der somatischen Medizin. Ph.D. Thesis. 2001, The University of Bielefeld, Faculty of Health SciencesGoogle Scholar
 The prepublication history for this paper can be accessed here:http://www.biomedcentral.com/14712288/11/48/prepub
Prepublication history
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.