Skip to main content

Strategies for assessing the impact of loss to follow-up on estimates of neurodevelopmental impairment in a very preterm cohort at 2 years of age



Loss to follow-up is a major challenge for very preterm (VPT) cohorts; attrition is associated with social disadvantage and parents with impaired children may participate less in research. We investigated the impact of loss to follow-up on the estimated prevalence of neurodevelopmental impairment in a VPT cohort using different methodological approaches.


This study includes births < 32 weeks of gestational age (GA) from 4 regions in the UK and Portugal participating in a European birth cohort (N = 1737 survivors). Data on maternal characteristics, pregnancy complications, neonatal outcomes and neighborhood deprivation were collected at baseline. Neurodevelopment was assessed at 2 years of corrected age (CA) using standardized parent-report measures. We applied (1) multiple imputation (MI) and (2) inverse probability weighting (IPW) to estimate the impact of non-response on the prevalence of moderate to severe neurodevelopmental impairment and assessed violations of the missing at random (MAR) assumption using the delta method.


54.2% of children were followed-up. Follow-up was less likely when mothers were younger, multiparous, foreign-born, did not breastfeed and came from deprived areas. The prevalence of neurodevelopmental impairment was 18.4% (95% confidence interval (CI):15.9–21.1) and increased to 20.4% (95%CI: 17.3–23.4) and 20.0% (95%CI:16.9–23.1) for MI and IPW models, respectively. Simulating strong violations of MAR (children with impairments being 50% less likely to be followed-up) raised estimates to 23.6 (95%CI:20.1–27.1)


In a VPT cohort with high loss to follow-up, correcting for attrition yielded modest increased estimates of neurodevelopmental impairment at 2 years CA; estimates were relatively robust to violations of the MAR assumption.

Peer Review reports


Very preterm birth (VPT, < 32 weeks of gestation) is associated with increased infant mortality and morbidity. Survivors have higher risks of poor physical health, neurodevelopmental impairment and psychological disorders than children born at term [1,2,3]. Many countries have constituted longitudinal very preterm birth cohorts using population-based designs, to evaluate the longer-term health burden and to investigate the determinants of adverse long-term outcomes [4,5,6]. One problem facing these cohorts is loss to follow-up which varies from 25 to 50% in most cohorts, but can be up to 70% [4,5,6,7].

There are many reasons for loss to follow-up, including difficulties tracing the location of families who move, lack of time due to other family obligations or work, financial barriers and not wanting to be reminded of the circumstances of the child’s birth. Studies have shown that patients who are lost to follow-up differ from those who remain in the study, with most finding that they have lower socioeconomic status [8,9,10,11]. Some studies have also found that children who are not included are more likely to be impaired [11, 12], although having data on the full sample to investigate this bias is uncommon. Despite limited empirical evidence, there are reasons to be concerned about attrition related to the child’s health condition since the time and psychosocial consequences of raising an impaired child may make families less mobile or willing to participate in research [13]. Loss to follow-up can undermine the representativeness of estimates and introduce selection biases when the factors affecting follow-up are associated with health and developmental outcomes [14, 15].

Most studies of very preterm cohorts include a description of the characteristics of children lost to follow-up, but several analytical strategies are available for going beyond a qualitative assessment and adjusting for attrition, notably inverse probability weighting (IPW) and multiple imputation (MI) [16,17,18,19]. IPW creates a pseudo-population where individuals are weighted to represent the inverse of the probability of follow-up conditional on baseline covariates; individuals who are under-represented in the follow-up sample will be assigned a larger weight and those over-represented will have a lower weight. MI replaces each missing value with a set of plausible values based on the distribution conditional on the observed data.

These methods assume that all possible variables associated with the missingness are included in the model so that data are missing at random (MAR). Otherwise, data are missing not at random (MNAR). Checking for the MAR assumption is difficult in general because the data needed to perform such checks are missing. Sensitivity analysis can be performed in these cases. For example, we can impute under MAR, shifts to the imputations by an amount delta to account for the imperfect MAR assumption, and re-analyze the data. If the result does not change under the values for delta, then we may conclude that the estimate is robust to violations of the MAR assumption. This method is called the delta method [20, 21].

In this study, we assess how the estimated prevalence of neurodevelopmental impairment at 2 years of corrected age (CA) among very preterm infants varies under two statistical methods (IPW and MI) and study the robustness to the MAR condition.


Study design

This study uses data from four regions of the European Perinatal Intensive Care in Europe (EPICE) cohort. EPICE is a population-based prospective cohort of infants born before 32 weeks of gestation between March 2011 and July 2012 in 19 regions in 11 countries in Europe [22, 23]. Clinical information was collected from medical records during the neonatal hospitalization and at 2 years CA using a parental questionnaire. Four regions were selected because of the availability of small area-based socioeconomic data on children in the sample: Yorkshire & Humber and East Midlands regions from United Kingdom and Lisbon and Northern regions from Portugal.

Study population

We included very preterm infants discharged alive from the neonatal hospitalization (N = 1763) and excluded children who died between discharge and 2 years (N = 7) and those with severe congenital anomalies (N = 12). For the analysis of neurodevelopment, we excluded children who were deaf or blind, because of the difficulty of assessing our primary outcome in this population (N = 7).


Moderate to severe neurodevelopmental impairment was derived from standardized questions in the parent-report questionnaire completed when the child was 2 years CA, as reported previously [24]. Briefly, this measure includes gross motor impairment based on the following questions: (1a) unable to walk without assistance or aids, (1b) unable to sit without support, (1c) unable to hold head up without support and/or (2) non-verbal cognitive (NVC) impairment based on the scale of the Parent Report of Children’s Abilities-Revised questions (PARCA-R), a parent validated screening tool [25]. The PARCA-R includes 34 items scored 0/1 from which a total NVC score is derived. Based on data from a UK term-born cohort, NVC scores < 22, corresponding to scores < 2.5th percentile were classified as moderate to severe NVC impairment [26].


We identified demographic and clinical factors likely to influence both the probability of follow-up and our outcome based on the literature and analyses in the EPICE cohort. These factors included maternal characteristics (age, foreign birth (Portugal)/ethnicity (UK), parity and previous cesarean section) and pregnancy and neonatal characteristics (gestational age, multiple pregnancy, small for gestational age (birthweight <10th percentile [27]), pregnancy complications (antepartum hemorrhage after 20 weeks and preterm premature rupture of membranes (PPROM), transfer in utero, presentation (breech/vertex/other), sex, Apgar score at 5 min, surfactant, respiratory support (any mechanical ventilation or nasal continuous positive airway pressure), severe neonatal morbidity (intraventricular hemorrhage grades III or IV, cystic periventricular leukomalacia, retinopathy of prematurity (grade III or more), severe necrotizing enterocolitis (defined as surgery or, peritoneal drainage)), bronchopulmonary dysplasia (defined as oxygen or respiratory support at 36 weeks post menstrual age), surgery and breastfeeding at discharge.

Socioeconomic data

Parental socioeconomic characteristics were not collected at baseline because this information is not systematically or comparably recorded in medical records in Europe; however, information on residential postal code could be linked with small-area measures of socioeconomic deprivation in both Portugal and the UK. In Portugal, we used the European Deprivation Index built from the 2011 European Union-Statistics on Income and Living Conditions survey [28, 29]. In the UK, we used the 2015 Department for Communities and Local Government measure of deprivation, based on data collected in 2012/13 [30].

Analysis strategy

First, we described loss to follow-up, defined as parental non-response to the two-year questionnaire for children surviving to 2 years CA. Numbers and percentages of missing values for perinatal and socioeconomic data overall and, among responders and non-responders were also reported. For responders, numbers and percentages of missing values for the outcome were also given. Then, we assessed the association of each predictor with loss to follow-up and neurodevelopmental impairment using a logistic regression adjusted for the region of birth.

The next step was to estimate the prevalence of neurodevelopmental impairment taking into consideration loss to follow-up using two different techniques. The first method was MI by chained equations [21, 31, 32]. Each missing value was replaced by 100 synthetic draws [33], and the prevalence estimates for the outcome were pooled according to Rubin’s rules. Variables that were potential predictors of missingness of the outcome as well as the outcome itself were included in MI models. Note that the MI approach imputes outcomes for children who are lost to follow-up as well as children who were followed-up, but had missing outcomes. Standard MI assumes a MAR mechanism, so we undertook a sensitivity analysis using the delta method. This consists in modifying the initial imputation model under MAR by adding a fixed quantity delta to the linear predictor before imputing data. The delta represents the difference in log-odds of having the outcome for children with missing values for the outcome compared with children with observed values. We tested a range of delta values, from 0.8 to 1.5, based on prior knowledge of the outcome to determine confidence in the prevalence estimates under the MAR assumption.

The second method for correcting for attrition was IPW where a weight was generated based on the inverse probability of follow-up [34, 35]. The probability of follow-up was estimated with a multivariate logistic regression using variables associated with follow-up. A P-value inferior or equal to 0.2 was used to include the possibility that some variables might be associated with the follow-up conditional on others. Missing data on predictors were imputed in a second version of the IPW method to improve the accuracy of results [36].

All calculations were performed with Stata 14.0 (Stata Corp. 2015. Stata Statistical Software: Release 14. College Station, TX: StataCorp LP). Stata coding of the delta method (see Additional file 1: Appendix for code) was informed by mice (version 3.13.0) in R [37].


At two-years, parental questionnaires were not returned for 45.8% (n = 796) children, varying from 32.5% in Portugal and 52.9% in the UK (Table 1). Among responders, neurodevelopmental impairment, our principal outcome, was missing for 11.1% of children meaning the prevalence of the outcome was computed on only 48.2% of children surviving at 2 years old. These missing values were mainly due to cases in UK regions (85 of the 104 missing observations). Table 1 also illustrates the impact of having missing data on perinatal variables. The complete case dataset represented 85.5% of the total sample, leading to a 14.5% loss of data with a distribution of 17.7% for non-responders and 11.7% for responders to the 2 year follow-up. The deprivation index had few missing observations (4.2%); it was well correlated with individual measures of socioeconomic disadvantage that were collected at 2 years and therefore available for the follow-up sample only (Table S1).

Table 1 Loss to follow-up and missing data at two years of corrected age in a very preterm cohort

Table 2 shows the perinatal characteristics associated with follow-up and impairment. Neonatal morbidities and postnatal care were not associated with non-response except for children having received surfactant and those breastfeeding at discharge, who were more likely to be followed-up. In contrast, maternal factors such as younger age, having more than one child, foreign-birth/ethnicity, having had a previous cesarean section and PPROM during pregnancy were associated with non-response. Unlike the association with follow-up, neonatal morbidities and care were most strongly associated with the presence of neurodevelopmental impairment. Males were more likely to have an impairment as were children with more than three siblings. Living in the most deprived neighborhoods was strongly associated with loss to follow-up, but the association with neurodevelopmental impairment was not statistically significant. Having a missing outcome among responders was associated with being foreign-born, but not other baseline characteristics or small area deprivation (Table S2).

Table 2 Maternal, pregnancy and perinatal characteristics associated with loss to follow-up and the presence or absence of neurodevelopmental impairment at 2 years of corrected age (N = 1737)

Table 3 compares the prevalence of neurodevelopmental impairment before and after correction for loss to follow-up. All three approaches, IPW using the complete-case dataset, IPW using the imputed dataset, and multiple imputation, provided higher prevalence estimates, with relative increases of about 10% over the crude prevalence (range from 5.4 to 10.9%). This table also provides results separately for responders and non-responders, derived from the MI models: 18.6% (95CI (16.0%; 21.2%)) for responders versus vs 23.0% (95CI (17.4; 28.7) for non-responders.

Table 3 Estimated prevalence of neurodevelopmental impairment after corrections for loss to follow-up using inverse probability weighting (IPW) and multiple imputation (MI)

Table 4 presents the sensitivity analyses simulating MNAR mechanism with values ranging from 0.8 to 1.5 which produced prevalence estimates from 19.0% (95% CI: 16.2 21.8) to 23.6% (95% CI: 20.1 27.1) versus 20.4% (95% CI: 17.3 23.4) under MAR. The UK regions varied from 20.5% (95% CI: 16.6 24.3) to 26.1% (95% CI: 21.3 30.8) compared to 21.9% (95% CI: 17.9 26.0), Portugal regions from 16.3% (95% CI: 12.5 20.0) to 19.0% (95% CI: 15.0 23.0) compared to 17.4% (95% CI: 13.4 21.4).

Table 4 Estimated prevalence of neurodevelopmental impairment under different MNAR scenarios


Loss to follow-up in this sample of children born very preterm was associated with most maternal sociodemographic factors (younger age, foreign born, multiparous) as well as area-based deprivation scores, but with few clinical and neonatal variables. Using statistical methods to account for loss to follow-up led to a modest relative increase, between 5.4 and 10.9%, in the estimated prevalence of moderate to severe neurodevelopment, with marginal differences in individual country estimates. Results were consistent using MI and IPW techniques. The estimated prevalence of moderate and severe neurodevelopmental impairment was 20.4% (95% CI: 17.3–23.4) and 20.0% (95% CI: 16.9–23.1) for MI and IPW models, respectively, versus the crude prevalence of 18.4% (95% CI 15.9–21.2). Sensitivity analyses adopting the extreme assumption that the prevalence of neurodevelopment delay was 1.5 times greater for those lost to follow-up or with a missing outcome gave an estimated prevalence of 23.6% (95% CI: 20.1–27.1). The relatively small differences between the more tempered scenarios (e.g., in the range 0.8 to 1.2) suggest that the prevalence estimate is robust to small to medium violations of the MAR assumption.

In our sample, the response rate was 54.2% which is at the lower end reported in other very preterm cohorts in early childhood which range from about 50 to 90% [6, 7, 12, 38, 39, 40]. Follow-up rates tend to be higher for studies from neonatal networks when compared to population-based studies and those that have regular contact with families after discharge. Our results on the factors associated with loss to follow-up are consistent with studies on very preterm birth [5, 6, 7, 39, 40] and other cohorts [8, 9, 41, 42] which find that non–responders have lower socioeconomic status, as measured by maternal education, social deprivation scores, migrant status and young maternal age. These studies also report differences in exposures associated with socioeconomic status, such as breastfeeding and tobacco use [7, 12, 40]. These factors, in particular, maternal age, migration, maternal education and breastfeeding have all been associated with neurodevelopmental outcomes after VPT birth [43, 44].

As in our study, fewer and less consistent associations have been observed between follow-up and perinatal variables which are the strongest predictors of poor outcome for VPT children [44], as we saw in this study; many of the studies cited above report similar neonatal characteristics and morbidities for responders and non-responders. Higher birthweight has been related to lower follow-up rates in some neonatal network studies which may reflect a closer connection between families and the NICU for higher risk low-birthweight infants [38, 39]. In contrast, in some studies, families with more impaired children were less likely to attend clinical assessments or responded by postal questionnaires only versus full participation. This attrition mechanism, i.e. loss to follow-up resulting from the child’s health status, is of particular concern for VPT research.

Both IPW and MI are appropriate statistical approaches to correct estimators with baseline information and to produce standard errors that consider the uncertainty caused by missing data [16, 17, 19] and have been recommended by research advisory boards [18]. In the presence of loss to follow-up, complete-case analysis will be unbiased when data are missing completely at random or if the outcome is not included in the missingness mechanism once accounting for all remaining variables. In our analysis, these two approaches yielded similar prevalence estimates after correction, however, multiple imputation has several advantages over weighting. First, it provides an estimate of the level of impairment among non-responders conditional on fact that the imputation model was well specified. MI also allows for sensitivity analyses of the MAR assumption based on the delta method. Finally, another feature, which we could not use in this study, is the time invariance of MI, meaning that if some children are included in future follow-up waves of a cohort, their data from these time points can be used to improve precision at earlier time points.

It is difficult to compare our results to the literature because this bias is rarely quantified. In the French Epipage 2 study of very preterm births < 32 weeks of gestation, MI was used to adjust for non-response at two-years; these adjustments were generally concordant with ours: estimates of cerebral palsy prevalence rose from 4.6 to 4.8% (4% increase) and, among children without severe motor impairments, the percentage with Ages and Stages Questionnaire (ASQ) scores below the screening threshold increased from 42.0 to 47.8% (a 14% increase) [40]. The larger adjustment for the neurodevelopmental measure, compared to cerebral palsy, may reflect the stronger impact of social factors on neurodevelopment than on severe gross motor impairment [45, 46]. Corrections for attrition may also have more impact on other developmental outcomes that are more socially patterned, such as language capabilities [47] or cognition in later childhood. Having continuous as opposed to dichotomous measurements of neurodevelopment, such as parent reported scores or standardized clinical measures of cognition may also lead to different results.

Despite the importance of socioeconomic factors in determining participation in follow-up, measures of social status are often not included in baseline data because of the absence of such information in medical records. We therefore carried out our analysis in two regions which had a measure of socioeconomic status in addition to other sociodemographic or behavioral variables associated with social status which are more often available in medical records (maternal age, country of birth, parity, breastfeeding). However, although neighborhood deprivation was strongly related to follow-up it was not related to neurodevelopment in this sample.

Strengths and limitations

Strengths of this study are the population-based design with verification of completeness at inclusion and availability of detailed perinatal data and socioeconomic deprivation based on small geographic zones. We excluded children with severe congenital anomalies or who were blind and deaf to avoid missing not at random mechanisms because these impairments may preclude parents from responding to questions not relevant to their children’s situation; Other reasons for loss to follow-up that are not observable from our data might still exist. For instance, our measure of socioeconomic status was area-based and may not fully capture social differences based on individual measures, despite good correlation with social variables in the follow-up sample. Unfortunately, among non-responders, we were not able to describe why parents did not respond to the 2 year questionnaire (explicit refusals, lack of time, postal errors, moved away etc.).


Despite high attrition, adjustment for loss to follow-up in our sample led to a modest approximately 10% relative increase in estimates of neurodevelopmental impairment. Simulation exercises showed these to be relatively insensitive to MAR violations. These results are likely applicable to other cohorts, given the similarity between the factors affecting follow-up in our data and those reported in the literature. Given the wide availability of IPW and MI techniques in most software packages, systematically addressing loss to follow-up in VPT cohorts using these methods should become standard practice to provide more accurate estimates, draw attention to this important issue and build up the evidence-base on the impact of attrition in varying contexts for multiple key developmental and health outcomes.

Availability of data and materials

The datasets analysed in the current study are not publicly available because of data protection guarantees included in the ethics authorization, but are available from the corresponding author on reasonable request, in accordance with data access rules.



Bronchopulmonary Dysplasia


Corrected age


Continuous positive airway pressure


Cystic periventricular leukomalacia (cPVL)


European Perinatal Intensive Care in Europe


Gestational age


Inverse probability weighting


Intraventricular hemorrhage (IVH)


Multiple imputation


Severe necrotizing enterocolitis


Non-verbal cognition


Parent Report of Children’s Abilities-Revised questions


Preterm premature rupture of membranes


Retinopathy of prematurity


Very preterm


  1. Allotey J, Zamora J, Cheong-See F, Kalidindi M, Arroyo-Manzano D, Asztalos E, et al. Cognitive, motor, behavioural and academic performances of children born preterm: a meta-analysis and systematic review involving 64 061 children. BJOG. 2018;125(1):16–25.

  2. Bilgin A, Mendonca M, Wolke D. Preterm Birth/Low Birth Weight and Markers Reflective of Wealth in Adulthood: A Meta-analysis. Pediatrics. 2018;142(1):e20173625.

    Article  Google Scholar 

  3. Twilhaar ES, Wade RM, de Kieviet JF, van Goudoever JB, van Elburg RM, Oosterlaan J. Cognitive Outcomes of Children Born Extremely or Very Preterm Since the 1990s and Associated Risk Factors: A Meta-analysis and Meta-regression. JAMA Pediatr. 2018;172(4):361–7.

    Article  Google Scholar 

  4. Ancel PY, Goffinet F, Kuhn P, et al. Survival and morbidity of preterm children born at 22 through 34 weeks' gestation in France in 2011: results of the EPIPAGE-2 cohort study. JAMA Pediatr. 2015;169(3):230–8.

    Article  PubMed  Google Scholar 

  5. Costeloe KL, Hennessy EM, Haider S, Stacey F, Marlow N, Draper ES. Short term outcomes after extreme preterm birth in England: comparison of two birth cohorts in 1995 and 2006 (the EPICure studies). BMJ. 2012;345(dec04 3):e7976.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Serenius F, Kallen K, Blennow M, et al. Neurodevelopmental outcome in extremely preterm infants at 2.5 years after active perinatal care in Sweden. JAMA. 2013;309(17):1810–20.

    Article  CAS  Google Scholar 

  7. MacBean V, Drysdale SB, Zivanovic S, Peacock JL, Greenough A. Participant retention in follow-up studies of prematurely born children. BMC Public Health. 2019;19(1):1233.

    Article  Google Scholar 

  8. Saiepour N, Ware R, Najman J, Baker P, Clavarino A, Williams G. Do participants with different patterns of loss to follow-up have different characteristics? A multi-wave longitudinal study. J Epidemiol. 2016;26(1):45–9.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Ware RS, Williams GM, Aird RL. Participants who left a multiple-wave cohort study had similar baseline characteristics to participants who returned. Ann Epidemiol. 2006;16(11):820–3.

    Article  PubMed  Google Scholar 

  10. Schindler T, Koller-Smith L, Lui K, et al. Causes of death in very preterm infants cared for in neonatal intensive care units: a population-based retrospective cohort study. BMC Pediatr. 2017;17(1):59.

    Article  Google Scholar 

  11. Wolke D, Sohne B, Ohrt B, Riegel K. Follow-up of preterm children: important to document dropouts. Lancet. 1995;345(8947):447.

    Article  CAS  Google Scholar 

  12. Moore T, Hennessy EM, Myles J, et al. Neurological and developmental outcome in extremely preterm children born in England in 1995 and 2006: the EPICure studies. BMJ. 2012;345:e7961.

    Article  Google Scholar 

  13. Marques SCS, Doetsch J, Abate G, et al. Understanding participation in European cohort studies of preterm children: the views of parents, healthcare professionals and researchers. BMC Med Res Methodol. 2021;21(1):19.

    Article  Google Scholar 

  14. Nohr EA, Frydenberg M, Henriksen TB, Olsen J. Does low participation in cohort studies induce bias? Epidemiology. 2006;17(4):413–8.

    Article  PubMed  Google Scholar 

  15. Hille ET, Elbertse L, Gravenhorst JB, Brand R, Verloove-Vanhorick SP, Dutch P-CSG. Nonresponse bias in a follow-up study of 19-year-old adolescents born as preterm infants. Pediatrics. 2005;116(5):e662–6.

    Article  CAS  PubMed  Google Scholar 

  16. Hayati Rezvan P, Lee KJ, Simpson JA. The rise of multiple imputation: a review of the reporting and implementation of the method in medical research. BMC Med Res Methodol. 2015;15:30.

    Article  Google Scholar 

  17. Pedersen AB, Mikkelsen EM, Cronin-Fenton D, Kristensen N, Pham TM, Pedersen L, et al. Missing data and multiple imputation in clinical epidemiological research. Clin Epidemiol. 2017;9:157–66.

  18. National Research Council Panel on Handling Missing Data in Clinical Trials. The Prevention and Treatment of Missing Data in Clinical Trials. Washington (DC): National Academies Press (US); 2010. Available from:

  19. Harel O, Mitchell EM, Perkins NJ, et al. Multiple Imputation for Incomplete Data in Epidemiologic Studies. Am J Epidemiol. 2018;187(3):576–84.

    Article  Google Scholar 

  20. van Buuren S. Flexible imputation of missing data. Boca Raton: Chapman & Hall/CRC Press; 2012.

    Book  Google Scholar 

  21. van Buuren S, Boshuizen HC, Knook DL. Multiple imputation of missing blood pressure covariates in survival analysis. Stat Med. 1999;18(6):681–94.

    Article  Google Scholar 

  22. Zeitlin J, Maier RF, Cuttini M, et al. Cohort Profile: Effective Perinatal Intensive Care in Europe (EPICE) very preterm birth cohort. Int J Epidemiol. 2020;49(2):372–86.

    Article  Google Scholar 

  23. Zeitlin J, Manktelow BN, Piedvache A, et al. Use of evidence based practices to improve survival without severe morbidity for very preterm infants: results from the EPICE population based cohort. BMJ. 2016;354:i2976.

    Article  Google Scholar 

  24. Draper ES, Zeitlin J, Manktelow BN on behalf of the EPICE group, et al. EPICE cohort: two-year neurodevelopmental outcomes after very preterm birth Archives of Disease in Childhood - Fetal and Neonatal Edition. 2020;105:350-6.

  25. Johnson S, Wolke D, Marlow N. Preterm infant parenting study G. developmental assessment of preterm infants at 2 years: validity of parent reports. Dev Med Child Neurol. 2008;50(1):58–62.

    Article  PubMed  Google Scholar 

  26. Johnson S, Evans TA, Draper ES, Field DJ, Manktelow BN, Marlow N, et al. Neurodevelopmental outcomes following late and moderate prematurity: a population-based cohort study. Arch Dis Child Fetal Neonatal Ed. 2015;100(4):F301–8.

  27. Zeitlin J, Bonamy AE, Piedvache A, et al. Variation in term birthweight across European countries affects the prevalence of small for gestational age among very preterm infants. Acta Paediatr. 2017;106(9):1447–55.

    Article  PubMed  Google Scholar 

  28. Ribeiro AI, Mayer A, Miranda A, Pina MF. The Portuguese Version of the European Deprivation Index: An Instrument to Study Health Inequalities. Acta Medica Port. 2017;30(1):17–25.

    Article  Google Scholar 

  29. Ribeiro AI, Launay L, Guillaume E, Launoy G, Barros H. The Portuguese version of the European deprivation index: development and association with all-cause mortality. PLoS One. 2018;13(12):e0208320.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Smith T, Noble M, Noble S, Wright G, McLennan D, Plunkett E. The English indices of deprivation 2015. Department for Communities and Local Government. London; 2015.

  31. Sterne JA, White IR, Carlin JB, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338(jun29 1):b2393.

    Article  PubMed  PubMed Central  Google Scholar 

  32. van Buuren S, Groothuis-Oudshoorn K. mice: Multiple Imputation by Chined Equations in R. Journal of Statistical Software, Foundation for Open Access Statistics. 2011;45(i03).

  33. Graham JW, Olchowski AE, Gilreath TD. How many imputations are really needed? Some practical clarifications of multiple imputation theory. Prev Sci. 2007;8(3):206–13.

    Article  PubMed  Google Scholar 

  34. Rezvan PH, White IR, Lee KJ, Carlin JB, Simpson JA. Evaluation of a weighting approach for performing sensitivity analysis after multiple imputation. BMC Med Res Methodol. 2015;15:83.

    Article  Google Scholar 

  35. Seaman SR, White IR. Review of inverse probability weighting for dealing with missing data. Stat Methods Med Res. 2013;22(3):278–95.

    Article  PubMed  Google Scholar 

  36. Seaman SR, White IR, Copas AJ, Li L. Combining multiple imputation and inverse-probability weighting. Biometrics. 2012;68(1):129–37.

    Article  PubMed  PubMed Central  Google Scholar 

  37. mice/mice.impute.mnar.norm. R at master · amices/mice GitHub < R>. Accessed 19 Mar 2021.

  38. Lowe JR, Fuller JF, Do BT, Vohr BR, Das A, Hintz SR, et al. Behavioral problems are associated with cognitive and language scores in toddlers born extremely preterm. Early Hum Dev. 2019;128:48–54.

  39. Schlapbach LJ, Adams M, Proietti E, et al. Outcome at two years of age in a Swiss national cohort of extremely preterm infants born between 2000 and 2008. BMC Pediatr. 2012;12:198.

    Article  Google Scholar 

  40. Pierrat V, Marchand-Martin L, Arnaud C, et al. Neurodevelopmental outcome at 2 years for preterm children born at 22 to 34 weeks' gestation in France in 2011: EPIPAGE-2 cohort study. BMJ. 2017;358:j3448.

    Article  Google Scholar 

  41. Dettori JR. Loss to follow-up. Evid Based Spine Care J. 2011;2(1):7–10.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Cameron CM, Osborne JM, Spinks AB, Davey TM, Sipe N, McClure RJ. Impact of participant attrition on child injury outcome estimates: a longitudinal birth cohort study in Australia. BMJ Open. 2017;7(6):e015584.

    Article  Google Scholar 

  43. Herich LC, Cuttini M, Croci I, Franco F, di Lallo D, Baronciani D, et al. Maternal education is associated with disparities in breastfeeding at time of discharge but not at initiation of enteral feeding in the neonatal intensive care unit. J Pediatr. 2017;182:59–65 e57.

  44. Linsell L, Malouf R, Morris J, Kurinczuk JJ, Marlow N. Prognostic factors for poor cognitive development in children born very preterm or with very low birth weight: a systematic review. JAMA Pediatr. 2015;169(12):1162–72.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Wong HS, Edwards P. Nature or nurture: a systematic review of the effect of socio-economic status on the developmental and cognitive outcomes of children born preterm. Matern Child Health J. 2013;17(9):1689–700.

    Article  PubMed  Google Scholar 

  46. Beaino G, Khoshnood B, Kaminski M, et al. Predictors of cerebral palsy in very preterm infants: the EPIPAGE prospective population-based cohort study. Dev Med Child Neurol. 2010;52(6):e119–25.

    Article  PubMed  Google Scholar 

  47. Sentenac M, Johnson S, Charkaluk ML, Sëppanen AV, Aden U, Cuttini M, et al. Maternal education and language development at 2 years corrected age in children born very preterm: results from a European population-based cohort study. J Epidemiol Community Health. 2020;74(4):346–53.

Download references


EPICE Research group: BELGIUM: Flanders (E Martens, G Martens, P Van Reempts); DENMARK: Eastern Region (K Boerch, A Hasselager, LD Huusom, O Pryds, T Weber); ESTONIA (L Toome, H Varendi); FRANCE: Burgundy, Ile-de France and Northern Region (PY Ancel, B Blondel, A Burguet, PH Jarreau, P Truffert); GERMANY: Hesse (RF Maier, B Misselwitz, S Schmidt), Saarland (L Gortner); ITALY: Emilia Romagna (D Baronciani, G Gargano), Lazio (R Agostino, D DiLallo, F Franco), Marche (V Carnielli), M Cuttini, I Croci; NETHERLANDS: Eastern & Central (C Koopman-Esseboom, A van Heijst, J Nijman); POLAND: Wielkopolska (J Gadzinowski, J Mazela); PORTUGAL: Lisbon and Tagus Valley (LM Graça, MC Machado), Northern region (Carina Rodrigues, T Rodrigues), H Barros; SWEDEN: Stockholm (AK Bonamy, M Norman, E Wilson); UK: East Midlands and Yorkshire and Humber (E Boyle, ES Draper, BN Manktelow), Northern Region (AC Fenton, DWA Milligan); INSERM, Paris (J Zeitlin, M Bonet, A Piedvache).


STROBE guidelines were followed for reporting the background, methods and results of this observational cohort study.


The research leading to these results received funding from the European Union’s Seventh Framework Programme (FP7/2007–2013) under grant agreement No. 259882 and from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 733280.

Author information

Authors and Affiliations




AP and JZ conceived of the study and wrote the first draft. SVB provided methodological advice, while AIR provided help with area-based SES analyses. SVB, HB, AIR, ED interpreted the results and reviewed and revised the manuscript. HB and ED supervised the study in their countries. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jennifer Zeitlin.

Ethics declarations

Ethics approval and consent to participate

Both regions obtained approval from local ethics boards according to national legislations prior to the start of data collection. The EPICE study was also approved by the French Advisory Committee on Use of Health Data in Medical Research (CCTIRS) and the French National Commission for Data Protection and Liberties (CNIL). Informed consent to participate in the cohort was obtained from all mothers included in the follow-up study.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Individual socio-demographic characteristics associated with small area-based deprivation scores, sample at birth (for maternal age) and followed-up when children were 2 years of corrected age (other characteristics). Table S2. Baseline characteristics associated with missing neurodevelopmental impairment (N = 951)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Piedvache, A., van Buuren, S., Barros, H. et al. Strategies for assessing the impact of loss to follow-up on estimates of neurodevelopmental impairment in a very preterm cohort at 2 years of age. BMC Med Res Methodol 21, 118 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: