Skip to main content
  • Research article
  • Open access
  • Published:

Clustering of continuous and binary outcomes at the general practice level in individually randomised studies in primary care - a review of 10 years of primary care trials



In randomised controlled trials, the assumption of independence of individual observations is fundamental to the design, analysis and interpretation of studies. However, in individually randomised trials in primary care, this assumption may be violated because patients are naturally clustered within primary care practices. Ignoring clustering may lead to a loss of power or, in some cases, type I error.


Clustering can be quantified by intra-cluster correlation (ICC), a measure of the similarity between individuals within a cluster with respect to a particular outcome. We reviewed 17 trials undertaken by the Department of Primary Care at the University of Southampton over the last ten years. We calculated the ICC for the primary and secondary outcomes in each trial at the practice level and determined whether ignoring practice-level clustering still gave valid inferences. Where multiple studies collected the same outcome measure, the median ICC was calculated for that outcome.


The median intra-cluster correlation (ICC) for all outcomes was 0.016, with interquartile range 0.00–0.03.

The median ICC for symptom severity was 0.02 (interquartile range (IQR) 0.01 to 0.07) and for reconsultation with new or worsening symptoms was 0.01 (IQR 0.00, 0.07). For HADS anxiety the ICC was 0.04 (IQR 0.02, 0.05) and for HADS depression was 0.02 (IQR 0.00, 0.05). The median ICC for EQ. 5D-3 L was 0.01 (IQR 0.01, 0.04).


There is evidence of clustering in individually randomised trials primary care. The non-zero ICC suggests that, depending on study design, clustering may not be ignorable. It is important that this is fully considered at the study design phase.

Peer Review reports


The past decade has seen a steady increase in the use of the cluster randomised trial design [1]. In cluster-randomised trials, the unit of randomisation is the cluster, such as a hospital or school, rather than the individual participant. This design is often employed when an intervention is aimed at a health practitioner level rather than an individual level or where individual randomisation is not possible [2]. Cluster randomised studies generally have lower power than individually randomised trials because there may be a correlation between the responses from participants within the same cluster. This may be due to the fact that background characteristics of participants are more similar within each cluster, and in addition, cluster-level characteristics such as the effectiveness of the practitioner may differ between clusters [2].

However, clustering may also occur in individually randomised trials, for example, the natural clustering of participants within centre in a multi-centre trial [3]. Lee and Thompson [3] reviewed individually randomised trials published in the BMJ in 2002 and found that 38/42 (90%) of them had some form of clustering. Of these, only four correctly accounted for clustering in the analysis. A failure to account for clustering in the analysis of a trial will give an unbiased estimate of the treatment effect but standard errors can be too small, leading to the type I error being too large [3, 4].

The presence of patients clustered within general practioners (GPs) or GP practices in an individually randomised trial does not necessarily imply that type I error will occur [5, 6]. Kahan and Morris [7] set out the two conditions for clustering to be ignorable:

  1. 1)

    The intra-cluster correlation (ICC), which represents the degree of similarity in the responses of individuals from the same cluster, must be zero; or

  2. 2)

    The correlation of patient assignments within clusters must be zero.

In primary care, we may expect a non-zero ICC for many outcomes. Patients are naturally clustered within GPs and practices. There may be a treatment effect of the practitioner, as the definition of “usual care” in some interventions may vary between GPs or practices. The effectiveness of some practitioners at delivering an intervention may result in clustering of patient outcomes. Individuals who choose the same GP or practice may also naturally be more similar to one another than to patients at another practice. In practice, the true value of the ICC is often not known in advance and the conservative assumption may be to assume that it will be non-zero [7].

The intervention design may also lead to the correlation of patient assignments within clusters being non-zero. This may occur in partially nested trials, for example where the intervention is delivered by a clinician, who only participates in one arm of the design [8, 9]. This correlation may also be non-zero if clusters are used in the randomisation process, for example, block randomisation, randomisation that balances on patient factors or trials where outcomes are measured at several time points [7]. Block randomisation within GP practices leads to a negative correlation between treatment assignments because for each patient assigned to one treatment arm, future patients are less likely to be assigned to that same treatment. Randomisation that stratifies on patient factors leads to correlation between treatment assignments because, for example, for each patient with high baseline severity assigned to one treatment arm, a patient with low baseline severity is less likely to be assigned to that treatment arm. These designs are common in primary care trials, particularly block randomisation and the stratification of randomisation on key patient factors.

Since both these conditions may frequently be met in primary care trials, it is possible that clustering may be non-ignorable. Moreover, even if the clustering is ignorable, but the estimates of the ICCs are high, analyses that do not adjust for clustering might lead to a loss of power [7]. It is therefore important that we have robust estimates of the ICCs that are likely to apply for different outcomes and contexts. This can help us to decide firstly whether one of the conditions for clustering being ignorable may be met and whether unadjusted analyses might lead to a potential loss of power. Estimates of the ICC may be obtained from various sources, such as previous trials, databases of ICCs or from overall patterns of ICCs. Increasingly, cluster randomised trials include estimates of the ICC, at least for the primary outcome, in their published trial reports. However, a single trial may not include the setting or outcome of interest, and precision of the ICC is often not included. Moreover, there have not been any papers which have examined ICCs for individually randomised trials.

As noted above, primary care is a logical place to examine estimates of these ICCs. This study reviewed the data from all individually randomised trials carried out in the Primary Care Research Group at the University of Southampton over the last 10 years in order to provide robust estimates of ICC values which may help to inform future studies.


This was a secondary analysis of all individually randomised trials undertaken in the Primary Care Research Group at the University of Southampton. An audit of all trials undertaken since 2005/6 was conducted and the Principal Investigators of all identified studies were contacted to request permission to access the study data. Feasibility studies were not included as they are not designed to provide robust measures of effect size [10] and estimates of ICC from these trials may not be reliable. Ethics approval was not required for this study as analysis was based on existing datasets from previously conducted studies, all of which had the appropriate approvals in place. All data was anonymised and contained no potentially identifiable data.

For each included trial, the ICC was estimated for continuous and binary outcomes using a mixed model in Stata with GP and/or practice included as a random effect. Whilst random effects models are not appropriate for all studies, for the level of inference that was desired in all these studies it was the best approach [11].

Whether to include GP, practice or both was determined based upon the study design as set out in the study protocol and upon the data available. Using Stata version 14, it is possible to directly calculate the ICC from the stored estimates after fitting a mixed model using the command ‘estat ICC’ [12, 13]. The ICC may be expressed as:

$$ \rho ={s}_b^2/\left({s}_b^2+{s}_w^2\right) $$

where \( {s}_b^2 \) is the between cluster component of variance and \( {s}_w^2 \) is the within-cluster component of variance [14]. A one-way random effects model may be written as

$$ {Y}_{jk}=\alpha +{e}_{jk}+{\mu}_j $$

where Yjk is the observation for the kth individual in the jth cluster, α is a constant, μj are cluster level effects and ejk are individual residual effects, and μj and ejk are assumed to be normally distributed [14]. The parameters \( {s}_b^2 \) and \( {s}_w^2 \) required to calculate the ICC can be estimated from the model using restricted maximum likelihood and substituted into eq. (1).

For binary outcomes, a logistic regression model was used with a random effect for GP practice. In this case, the ICC (ρl) on the logistic scale can be expressed as the proportion of the total outcome variance that is due to between-cluster variation:

$$ {\rho}_l={s}_b^2/\left({s}_b^2+{\pi}^{\frac{2}{3}}\right) $$

where \( {s}_b^2 \) is the between cluster component of variance, as in the continuous model above [14].

For each outcome, the ICC was estimated and results presented for models both with and without controlling for baseline covariates. Overall, the median ICC, interquartile range (IQR) and range for all studies and all outcomes were calculated. Where multiple studies have collected the same outcome measure, the median, IQR and range for that outcome measure were calculated.

One advantage of using mixed effects models to calculate the ICC is that covariates can be easily included. The initial analyses were performed without including any covariates. We then adjusted for sociodemographic characteristics and any potential confounders that had been included in the original analysis. This better represents the kind of analysis that would be undertaken in practice.

In order to illustrate the potential effect of clustering on the sample size, we can calculate the design effect in different situations that might arise in practice. The design effect is an adjustment made to the sample size to account for clustering in the design of a study. It is defined as the ratio of the variance of the estimator, e.g., treatment effect, when the centre effect is taken into account and the variance of the estimator assuming a simple random sample. It has been shown [8] that in a multi-centre study with two treatment arms, the design effect can be approximated by Deff = 1 + (S − 1)ρ, where ρ represents the ICC as defined in [1] and S is defined as

$$ S=\frac{n_1{n}_2}{N}{\sum}_{j=1}^Q{\left(\frac{m_{1j}}{n_1}-\frac{m_{2j}}{n_2}\right)}^2 $$

where mj = m1j + m2j is the number of people in cluster j, \( {n}_i={\sum}_{j=1}^Q{m}_{ij} \) is the number of people in treatment group i, and N = n1 + n2 is the total number of people in the study. The S statistic is a measure of how balanced the two randomised treatment groups are within centres. If the treatment arms are perfectly balanced (ie. an equal number of patients in both treatment arms) for every centre then S = 0, and the design effect is 1-ρ. If S < 1 (slightly unbalanced numbers of people between treatment arms in each centre), the design effect is less than 1 and the trial is overpowered. If S = 1 (somewhat unbalanced treatment arms in each centre), the design effect is equal to 1. If S > 1 (unbalanced treatment arms in each centre), then design effect> 1 and the trial is underpowered. Given that the true value of S, like the true value of the ICC, may not be known in advance, an assumed value of S > 1 may represent a conservative assumption for sample size calculations.


Sample characteristics

A total of 17 trials had available data and were included. Although the initial plan had been to look at both GP and Practice level effects, data was only available at the GP level in two studies and therefore the analyses were limited to practice level effects.

Table 1 sets out the characteristics of included studies. There was some variability in the average cluster size, with the number of participants recruited per practice ranging from 2 to 90. This often reflected the study question; large clusters often occurred in studies aiming to recruit to large trials to answer public health questions, whilst smaller clusters tended to occur in studies of rarer conditions or with more restrictive inclusion criteria. Studies contributed between 2 and 6 outcome measures to the analyses.

Table 1 Study characteristics

The design of each was also evaluated to determine whether the clustering was ignorable (Table 2). To give an example, the GRACE trial of antibiotics for lower respiratory tract infection had an ICC greater than zero and had a non-zero correlation of patient assignments within GP practices due to the use of blocked randomisation. Therefore in this study, the clustering would not be ignorable. In contrast, the InternetDoctor trial of a self-management intervention for respiratory tract infections also had a non-zero ICC. However, this study used simple randomisation and therefore the correlation of patient assignment within GP practices was zero. As such clustering was ignorable for this study. In total, six studies used simple randomisation, six studies used block randomisation, three studies used randomisation stratified on patient factors and two studies used both blocked and stratified randomisation. In three studies, the ICC was zero, and in the remaining 14, the ICC was non-zero. As such in eight studies the clustering was non-ignorable, and in nine studies the clustering was ignorable.

Table 2 Study design – Was clustering non-ignorable?

Clustering by GP practice

In total there were 55 outcome measures from the 17 studies for which an ICC could be calculated. Some outcomes were only collected at follow up, giving 52 outcome measures for which an ICC could be reported after controlling for baseline value of the outcome measure. The median ICC was 0.016 (IQR 0, 0.03) unadjusted and 0.011 (IQR 0, 0.026) adjusted for baseline covariates (Table 3). This suggests there is evidence clustering at the practice level. A table setting out these results split by outcome type (continuous/binary) is provided in Additional file 1.

Table 3 Distribution of ICCs across studies clustered by GP practice

Seven outcome measures were collected in more than one study. The median ICCs for these measures are summarised in Table 4. These suggest modest levels of clustering, largely in line with what was observed overall, with adjusted ICCs ranging from 0.00 to 0.04.

Table 4 ICC for outcome measures collected in multiple studies

Effect of clustering on sample size

Depending on the balance of the treatment arms within GP practices, there may be a loss or gain of power, and the ICC will influence how large this loss or gain of power is. Table 5 presents the required sample size for various values of the ICC and S statistic likely to arise in practice. For values of S < 1, where there is slight imbalance between treatment groups within GP practices, a trial would be very slightly overpowered with an ICC of 0.01. For values of S > 1, the trial would slightly underpowered. If ICC = 0.1 (corresponding to the top 5% of the studies in our sample), then these effects would be more pronounced. However, as the true value of S may not be knowable in advance, an S value > 1 may represent a more conservative sample size assumption.

Table 5 Sample size required for various values of ICC and S


This paper presents the intra-cluster correlation coefficients from 17 individually randomised studies set in primary care. The median ICC of 0.01 was in line with previous estimates of the ICC from cluster-randomised trials, demonstrating that there is clustering of outcomes. This effect may be important in sample size calculations and the planning of analyses for individually randomised trials in primary care, depending on how the intervention will be delivered and the way in which randomisation is stratified. These are important considerations for researchers at the design phase as clustering may not be ignorable and the conservative assumption may be that the ICC is greater than 0.

Comparison with other studies

A number of other studies have investigated intra-cluster correlation coefficients from epidemiological population based surveys [31,32,33,34,35,36]. Several authors have also looked at patterns of ICCs from studies in primary care. Adams et al. [37] undertook an analysis of cluster randomised studies and surveys in primary care. They investigated 1039 ICCs from 31 studies and found an overall ICC of 0.01, with an interquartile range 0 to 0.03. Campbell et al. [38] also investigated the factors that affect the magnitude of ICCs in various settings. Based on 145 ICCs from cluster randomised trials in primary care, they estimated a median ICC of 0.045 with range 0 to 0.28. The MRC Trial of the Assessment and Management of Older People in the Community [39], reported a median ICC was 0.016 (IQR 0.008–0.020, range 0.001–0.055) from 54 ICCs from various measures. Although these studies have concentrated on cluster randomised trials, our estimate for individually randomised trials, a median ICC of 0.01 (IQR 0.00, 0.03), is very similar.

Strengths and limitations of the study

To our knowledge this is the first study to focus on providing estimates of the clustering in individually randomised trials in primary care. These results will be helpful to statisticians and researchers working in the design and analysis of trials in this field.

The data is limited to 17 studies run from the University of Southampton. The included studies recruited from centres across the UK and internationally, however, they were based with PIs from a single centre. This may impact on the generalisability of these results. We compared the broad areas of research represented by these trials to those reported in observational studies of GP consultations. Using International Classification of Primary Care (ICPC-2), which comprises 726 codes within 17 chapters representing bodily systems/topic areas such as musculoskeletal or circulatory conditions [40], Salisbury et al. coded 308 consultations in 22 GP practices in Bristol and North Somerset [41]. They found that the 5 most common medical reasons for consultation were Musculoskeletal (19%), Skin (8.2%), Digestive (8.2%) Respiratory (7.8%) and Psychological (7.6%). The studies included in this sample do broadly represent these conditions with Musculoskeletal (12%), Digestive (6%), Psychological (6%), Skin studies were not represented at all and Respiratory studies were over-represented (65%). It is undeniable that the included studies reflect research interests of the department, and therefore are not necessarily represented in this study in proportion to how often they are encountered in general practice. However, we feel that these estimates represent a useful starting point for those conducting research in this area and hope that other researchers will also publish estimates to allow the literature to expand to include a representative sample of ICCs.

The studies considered here were all analysed with a random effects model. Where the level of inference is the individual, a generalised estimating equation (GEE) model may be more appropriate. We have not considered the implications for these types of models and further research is needed to consider the implications, if any, for GEE models of clustering at the practice level.

We were only able to calculate the ICCs for continuous and binary outcomes, using readily available software. The calculation of ICCs for time-to-event outcomes is an area on ongoing methodological research [42].


The implications of this study are that the amount of clustering in individually randomised trials may be similar to that of cluster-randomised trials and may not be ignorable. It is good practice to consider at the planning stages how best to take clustering into account both in the design and the analysis of trials. Where appropriate, the distribution of the ICCs presented here can be used to assist in future sample size calculations and analysis plans.


In the 17 individually randomised studies in primary care, there was evidence of clustering at the practice level, with a median ICC of 0.01 (IQR 0.00 to 0.03). This is in line with ICCs previously reported in cluster-randomised trials in primary care, indicating that the amount of clustering by GP practice in individually randomised trials is at a similar level to that in cluster-randomised trials, in which GP practices are randomised rather than individual participants. This may have implications for sample size calculations. Further data from other primary care studies is required to improve generalisability.

Availability of data and materials

The datasets analysed during the current study are not publicly available because they contain individual-level data which were obtained from third parties. Several of the participating studies may require data sharing agreements and current agreements are for the purposes of this analysis only. Please contact the corresponding author and any requests will be considered by the relevant data holder involved in this collaboration.



Generalised estimating eq.


General practioner


Hospital anxiety and depression scale


Intra-cluster correlation


Interquartile range


Patient enablement instrument


  1. Moberg J, Kramer M. A brief history of the cluster randomised trial design. J Roy Soc Med. 2015;108(5):192–8.

    Article  Google Scholar 

  2. Eldridge S, Kerry SM. A practical guide to cluster randomised trials in health services research. Chichester: Wiley; 2012.

    Book  Google Scholar 

  3. Lee KJ, Thompson SG. Clustering by health professional in individually randomised trials. Brit Med J. 2005;330(7483):142–4.

    Article  Google Scholar 

  4. Bland M. Cluster designs: a personal view. Talk at Public Health Laboratory Service Disease Surveillance Centre on 11 Oct 2001. 2001 Available from:

  5. Kahan BC, Forbes G, Ali Y, Jairath V, Bremner S, Harhay MO, et al. Increased risk of type I errors in cluster randomised trials with small or medium numbers of clusters: a review, reanalysis, and simulation study. Trials. 2016;17.

  6. Parzen M, Lipsitz SR, Dear KBG. Does clustering affect the usual test statistics of no treatment effect in a randomized clinical trial? Biom J. 1999;40(4):385–402.

    Article  Google Scholar 

  7. Kahan BC, Morris TP. Assessing potential sources of clustering in individually randomised trials. BMC Med Res Methodol. 2013;13(1):58.

    Article  Google Scholar 

  8. Vierron E, Giraudeau B. Design effect in multicenter studies: gain or loss of power? BMC Med Res Methodol. 2009;9:39.

    Article  Google Scholar 

  9. Candlish J, Teare MD, Dimairo M, Flight L, Mandefield L, Walters SJ. Appropriate statistical methods for analysing partially nested randomised controlled trials with continuous outcomes: a simulation study. BMC Med Res Methodol. 2018;18(1):105.

    Article  Google Scholar 

  10. Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, et al. CONSORT 2010 statement: extension to randomised pilot and feasibility trials. BMJ Brit Med J. 2016;355.

  11. Lee KJ, Thompson SG. The use of random effects models to allow for clustering in individually randomized trials. Clin Trials. 2005;2(2):163–73.

    Article  Google Scholar 

  12. StataCorp. Stata Statistical Software: Release 14. College Station: StataCorp LP; 2015.

    Google Scholar 

  13. StataCorp. Stata 14 Base Reference Manual. College Station: Stata Press; 2015.

    Google Scholar 

  14. Eldridge SM, Ukoumunne OC, Carlin JB. The intra-cluster correlation coefficient in cluster randomized trials: a review of definitions. Int Stat Rev. 2009;77(3):378–94.

    Article  Google Scholar 

  15. Little P, Stuart B, Moore M, Coenen S, Butler CC, Godycki-Cwirko M, et al. Amoxicillin for acute lower-respiratory-tract infection in primary care when pneumonia is not suspected: a 12-country, randomised, placebo-controlled trial. Lancet Infect Dis. 2013;13(2):123–9.

    Article  CAS  Google Scholar 

  16. Little P, Stuart B, Andreou P, McDermott L, Joseph J, Mullee M, et al. Primary care randomised controlled trial of a tailored interactive website for the self-management of respiratory infections (Internet Doctor). BMJ Open. 2016;6(4).

  17. Geraghty AWA, Essery R, Kirby S, Stuart B, Turner D, Little P, et al. Internet-based vestibular rehabilitation for older adults with chronic dizziness: a randomized controlled trial in primary care. Ann Fam Med. 2017;15(3):209–16.

    Article  Google Scholar 

  18. Little P, Moore M, Kelly J, Williamson I, Leydon G, McDermott L, et al. Ibuprofen, paracetamol, and steam for patients with respiratory tract infections in primary care: pragmatic randomised factorial trial. BMJ Brit Med J. 2013;347.

  19. Little P, Stuart B, Hobbs FDR, Kelly J, Smith ER, Bradbury KJ, et al. An internet-based intervention with brief nurse support to manage obesity in primary care (POWeR plus ): a pragmatic, parallel-group, randomised controlled trial. Lancet Diabetes Endo. 2016;4(10):821–8.

    Article  Google Scholar 

  20. Little P, Stuart B, Hobbs FDR, Moore M, Barnett J, Popoola D, et al. An internet-delivered handwashing intervention to modify infl uenza-like illness and respiratory infection transmission (PRIMIT): a primary care randomised trial. Lancet. 2015;386(10004):1631–9.

    Article  Google Scholar 

  21. Little P, Stuart B, Mullee M, Thomas T, Johnson S, Leydon G, et al. Effectiveness of steam inhalation and nasal irrigation for chronic or recurrent sinus symptoms in primary care: a pragmatic randomized controlled trial. Can Med Assoc J. 2016;188(13):940–9.

    Article  Google Scholar 

  22. Little P, Stuart B, Wingrove Z, Mullee M, Thomas T, Johnson S, et al. A randomised controlled factorial trial of probiotic capsules and xylitol chewing gum for the symptomatic management of pharyngitis. Can Med Assoc J. 2017;189(50):1543–50.

    Article  Google Scholar 

  23. Williamson I, Vennik J, Harnden A, Voysey M, Perera R, Kelly S, et al. Effect of nasal balloon autoinflation in children with otitis media with effusion in primary care: an open randomized controlled trial. Can Med Assoc J. 2015;187(13):961–9.

    Article  Google Scholar 

  24. Williamson I, Benge S, Barton S, Petrou S, Letley L, Fasey N, et al. Topical intranasal corticosteroids in 4-11 year old children with persistent bilateral otitis media with effusion in primary care: double blind randomised placebo controlled trial. Clin Otolaryngol. 2010;35(2):125–33.

    Article  Google Scholar 

  25. Little P, Lewith G, Webley F, Evans M, Beattie A, Middleton K, et al. Randomised controlled trial of Alexander technique lessons, exercise, and massage (ATEAM) for chronic and recurrent back pain. Brit Med J. 2008;337(7667).

  26. Williamson IG, Rumsby K, Benge S, Moore M, Smith PW, Cross M, et al. Antibiotics and topical nasal steroid for treatment of acute maxillary sinusitis - a randomized controlled trial. J Am Med Assoc. 2007;298(21):2487–96.

    Article  CAS  Google Scholar 

  27. Everitt HA, Little PS, Smith PWF. A randomised controlled trial of management strategies for acute infective conjunctivitis in general practice. Brit Med J. 2006;333(7563):321–4.

    Article  Google Scholar 

  28. Little P, Rumsby K, Kelly J, Watson L, Moore M, Warner G, et al. Information leaflet and antibiotic prescribing strategies for acute lower respiratory tract infection - a randomized controlled trial. J Am Med Assoc. 2005;293(24):3029–35.

    Article  CAS  Google Scholar 

  29. Kendrick T, Chatwin J, Dowrick C, Tylee A, Morriss R, Peveler R, et al. Randomised controlled trial to determine the clinical effectiveness and cost-effectiveness of selective serotonin reuptake inhibitors plus supportive care, versus supportive care alone, for mild to moderate depression with somatic symptoms in primary care: the THREAD (THREshold for AntiDepressant response) study. Health Technol Asses. 2009;13(22):1.

    Article  Google Scholar 

  30. Everitt H, Moss-Morris R, Sibelli A, Tapp L, Coleman N, Yardley L, et al. Management of irritable bowel syndrome in primary care: the results of an exploratory randomised controlled trial of mebeverine, methylcellulose, placebo and a self-management website. BMC Gastroenterol. 2013;13.

  31. Hannan PJ, Murray DM, Jacobs DR, Mcgovern PG. Parameters to aid in the design and analysis of community trials - Intraclass correlations from the Minnesota-heart-health-program. Epidemiology. 1994;5(1):88–95.

    Article  CAS  Google Scholar 

  32. Kelder SH, Jacobs DR, Jeffery RW, Mcgovern PG, Forster JL. The worksite component of variance - design effects and the healthy worker project. Health Educ Res. 1993;8(4):555–66.

    Article  CAS  Google Scholar 

  33. Murray DM, Catellier DJ, Hannan PJ, Treuth MS, Stevens J, Schmitz KH, et al. School-level intraclass correlation for physical activity in adolescent girls. Med Sci Sport Exer. 2004;36(5):876–82.

    Article  Google Scholar 

  34. Reading R, Harvey I, McLean M, Team CFHS. Cluster randomised trials in maternal and child health: implications for power and sample size. Arch Dis Child. 2000;82(1):79–83.

    Article  CAS  Google Scholar 

  35. Smeeth L, Ng ESW. Intraclass correlation coefficients for cluster randomized trials in primary care: data from the MRC trial of the assessment and Management of Older People in the community. Control Clin Trials. 2002;23(4):409–21.

    Article  Google Scholar 

  36. Verma V, Le T. An analysis of sampling errors for the demographic and health surveys. Int Stat Rev. 1996;64(3):265–94.

    Article  CAS  Google Scholar 

  37. Adams G, Gulliford MC, Ukoumunne OC, Eldridge S, Chinn S, Campbell MJ. Patterns of intra-cluster correlation from primary care research to inform study design and analysis. J Clin Epidemiol. 2004;57(8):785–94.

    Article  Google Scholar 

  38. Campbell MK, Fayers PM, Grimshaw JM. Determinants of the intracluster correlation coefficient in cluster randomized trials: the case of implementation research. Clin Trials. 2005;2(2):99–107.

    Article  Google Scholar 

  39. Fletcher AE, Jones DA, Bulpitt CJ, Tulloch AJ. The MRC trial of assessment and management of older people in the community: objectives, design and interventions [ISRCTN23494848]. BMC Health Serv Res. 2002;2.

  40. Okkes I, Becker H, Bernstein R, Lamberts H. The march 2002 update of the electronic version of ICPC-2: a step forward to the use of ICD-10 as a nomenclature and a terminology for ICPC-2. Fam Pract. 2002;19(5):543–6.

    Article  CAS  Google Scholar 

  41. Salisbury C, Procter S, Stewart K, Bowen L, Purdy S, Ridd M, et al. The content of general practice consultations: cross-sectional study based on video recordings. Br J Gen Pract. 2013;63(616):e751.

    Article  Google Scholar 

  42. Kalia S, Klar N, Donner A. On the estimation of intracluster correlation for time-to-event outcomes in cluster randomized trials. Stat Med. 2016;35(30):5551–60.

    Article  Google Scholar 

Download references


The authors would like to thank Professor Mark Mullee for his support in securing the funding for this study and for commenting on earlier drafts of this manuscript but hope he now enjoys a wonderful retirement.

The authors also wish to acknowledge the principal investigators and participants who contributed data towards this review.

An early draft of the results of this study was presented at the Society for Academic Primary Care Annual Meeting in July 2017.


This paper presents independent research funded by the National Institute for Health Research School for Primary Care Research (NIHR SPCR grant number 320). The views expressed are those of the author(s) and not necessarily those of the NIHR, the NHS or the Department of Health. The funder played no role in any of the design of the study, collection, analysis, and interpretation of data, writing the manuscript.

Author information

Authors and Affiliations



BS devised the study with input from MM and PL, acquired the data and wrote the manuscript. TB performed analysis of the data and revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Beth Stuart.

Ethics declarations

Ethics approval and consent to participate

Ethics approval was not required for this study as analysis was based on existing datasets from previously conducted studies, all of which had the appropriate approvals in place.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Distribution of ICCs across studies clustered by GP practice for continuous and binary outcomes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Stuart, B., Becque, T., Moore, M. et al. Clustering of continuous and binary outcomes at the general practice level in individually randomised studies in primary care - a review of 10 years of primary care trials. BMC Med Res Methodol 20, 83 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: