- Research article
- Open Access
- Open Peer Review
Benefits of extensive recruitment effort persist during follow-ups and are consistent across age group and survey method. The TRAILS study
BMC Medical Research Methodology volume 12, Article number: 93 (2012)
Extensive recruitment effort at baseline increases representativeness of study populations by decreasing non-response and associated bias. First, it is not known to what extent increased attrition occurs during subsequent measurement waves among subjects who were hard-to-recruit at baseline and what characteristics the hard-to-recruit dropouts have compared to the hard-to-recruit retainers. Second, it is unknown whether characteristics of hard-to-recruit responders in a prospective population based cohort study are similar across age group and survey method.
First, we compared first wave (T1) easy-to-recruit with hard-to-recruit responders of the TRacking Adolescents’ Individual Lives Survey (TRAILS), a prospective population based cohort study of Dutch (pre)adolescents (at first wave: n = 2230, mean age = 11.09 (SD 0.56), 50.8% girls), with regard to response rates at subsequent measurement waves. Second, easy-to-recruit and hard-to-recruit participants at the fourth TRAILS measurement wave (n = 1881, mean age = 19.1 (SD 0.60), 52.3% girls) were compared with fourth wave non-responders and earlier stage drop-outs on family composition, socioeconomic position (SEP), intelligence (IQ), education, sociometric status, substance use, and psychopathology.
First, over 60% of the hard-to-recruit responders at the first wave were retained in the sample eight years later at the fourth measurement wave. Hard-to-recruit dropouts did not differ from hard-to-recruit retainers. Second, extensive recruitment efforts for the web based survey convinced a population of nineteen year olds with similar characteristics as the hard-to-recruit eleven year olds that were persuaded to participate in a school-based survey. Some characteristics associated with being hard-to-recruit (as compared to being easy-to-recruit) were more pronounced among non-responders, resembling the baseline situation (De Winter et al.2005).
First, extensive recruitment effort at the first assessment wave of a prospective population based cohort study has long lasting positive effects. Second, characteristics of hard-to-recruit responders are largely consistent across age groups and survey methods.
The first purpose of the present study was to investigate if extensive recruitment efforts at the start of a prospective population based cohort study pay off in the long term. A large literature on the short term effects of extensive recruitment effort shows that such efforts can increase representativeness of the study population by decreasing non-response bias (see for instance Kessler et al.  or Nakash et al. ). However, it is not known to what extent increased attrition occurs during subsequent measurement waves among subjects who were hard-to-recruit and what characteristics the hard-to-recruit dropouts have compared to the hard-to-recruit retainers. The second purpose of the study was to investigate whether characteristics of hard-to-recruit participants vary depending on age of the sample and survey method. More specifically, does additional recruitment effort convince the same type of individuals in 11-year-old preadolescents who need parental consent to participate in a school-based survey as in 19-year-olds who do not need parental consent to participate in a web-based follow-up?
It is well known that non-response at baseline can lead to response bias in cohort studies. Non-responders are more frequently males, of lower socio-economic status, of non-western ethnicity, and have poorer academic achievement and more health problems than responders [3–6]. Although some researchers suggest that the effects of response bias are overestimated , others have shown that non-response at baseline is a threat for external validity .
Different strategies have been described to reduce response bias, such as repeated mailings following initial non-response [9, 10] and the use of alternative, shortened versions of measurement instruments . In our own study, the TRacking Adolescents Individual Lives’ Survey (TRAILS), extra recruitment effort at the first measurement wave consisted of one or two house visits after no response to both an initial and a reminder letter had been received, and offering a two-month reflection period if the initial participation request was at an inconvenient time . Different studies have shown that recruitment efforts lead to a more representative sample in terms of sex, age, race, socio-economic status and health [5, 11, 12]. Although the representativeness increases, the quality of the data has been shown to decrease with extra recruitment effort, because of more missing values and errors in data from late compared to early responders [10, 12].
Attrition, or drop-out, is largely predicted by the same variables as non-response. Males [13, 14], as well as participants with low socio-economic status , non-western ethnicity [13, 15–17], low academic achievement [3, 15, 17, 18] and physical and mental health problems [13, 16–19] are particularly likely to drop-out from longitudinal studies. The observation that non-response is predicted by the same variables as attrition makes it plausible that participants for whom extra recruitment effort was done at inclusion are more likely to drop-out of longitudinal studies than those who were easy-to-recruit at inclusion. As far as we know, this has never been investigated. The first purpose of our study was to investigate how extensive recruitment effort at the first wave was related to attrition over an eight year follow-up period in the longitudinal study of adolescents TRAILS.
Sample and survey characteristics related to success of extra recruitment effort
The second aim of our study was to identify factors that predicted attrition. Factors associated with non-response at the first wave (T1) have been described in detail by De Winter and colleagues . At that time, the study population was about 11 years old and hence needed parental consent to participate in the study. The measurements took place at school. At the fourth assessment wave (T4), the study population was about 19 years old and did not need parental consent anymore, and a web-based survey method was used. Just like at T1, extra recruitment efforts were made at T4 to recruit initial non-responders. This gives us the opportunity to compare factors related to being easy or hard-to-recruit at these two assessment waves .
School-based surveys usually lead to higher response rates [13, 14] compared to mail-based surveys, and obtaining written parental consent has been reported to be harder for boys, students with lower grades, students with non-Western ethnicity and less sociable children [20, 21]. Age has also been associated with non-response and attrition. Adolescents and older adults are generally harder to include than children and young to middle-aged adults [4, 6, 11, 17]. However, very little is known about how the effect of extensive recruitment efforts relate to sample and survey characteristics. In other words, the second purpose of this study was to investigate how extra recruitment effort in a web-based follow-up in 19-year olds affected attrition rates compared to extra recruitment effort in a school-based survey in 11-year olds in the same sample.
The TRacking Adolescents’ Individual Lives Survey (TRAILS) is a prospective cohort study of Dutch (pre)adolescents, with the aim to chart and explain the development of mental (ill)health from preadolescence into adulthood . The present study involves data from all four assessment waves of TRAILS, which ran from March 2001 to July 2002 (T1), September 2003 to December 2004 (T2), September 2005 to August 2008 (T3), and October 2008 to September 2010 (T4), respectively. The study was approved by the Dutch Central Committee on Research Involving Human Subjects.
TRAILS participants were selected from five municipalities in the North of the Netherlands, including both urban and rural areas. Children born between 1 October 1989 and 30 September 1991 were eligible for inclusion, providing that their schools were willing to cooperate and that they met the study’s inclusion criteria . Over 90% of the schools accommodating 2935 eligible children agreed to participate in the study.
Initially, 66% of parents and children agreed to participate (T1-easy-to-recruit). As parents were a source of information in TRAILS (see below), an ‘opt-in’ parental consent was necessary. Parents who refused to participate were asked permission to contact them again after 2 months, in order to minimise the number of refusals for temporary reasons. Parents with an unlisted telephone number were requested to contact the research team and pass on their number. If parents did not react to the initial letter, or to the reminder sent a few weeks later, a staff member paid a personal visit to their house. After two home visits, a letter was left with a reply card and a prepaid envelop. These extra recruitment efforts convinced 145 initial non-responders (T1-hard-to-recruit) and raised the final response rate to 76% (N = 2230, mean age = 11.09 years, SD = 0.56, 50.8% girls).
The extended efforts resulted in the recruitment of more vulnerable children and thus partially prevented a non-response bias regarding the prevalence of psychopathology . Teacher reports, which were available for 40.7% of the non-responders, further revealed that the non-responders were more likely to be boys, to have a low socioeconomic background, and to perform poorly at school. Non-responders did not differ from responders regarding associations between sociodemographic variables and mental health outcomes .
Of the 2230 baseline participants, 96.4% (N = 2149, 51.0% girls) participated in the first follow-up assessment (T2). Mean age at T2 was 13.56 years (SD = 0.53). The response at the third wave was 81.4% (N = 1816, 52.3% girls). Mean age at T3 was 16.27 years (SD = 0.73). No extra efforts were undertaken to raise the response rates at T2 and T3.
At T4 the adolescents had reached the age of 18 or 19, and no parental consent was needed for participation anymore. At this wave, a custom research company (CRC) was hired to recruit and assess participants. The CRC was asked to recruit all respondents that had participated at T1 and at T2 or T3 and had not definitely refused further participation. The TRAILS research team sent information about the upcoming fourth wave, thereby explaining that the CRC would be responsible for the logistics. After participants had given informed consent, the CRC sent logon information for a web-based questionnaire. A gift certificate of 10 euro was included. Adolescents who did not respond to the questionnaire within 2–3 weeks, were contacted by telephone with the request to participate in (parts of) this wave. When they still did not respond after several reminders, or when adolescents could not be reached by telephone, a CRC employee paid one or two home visits, both announced and unannounced. The CRC realized a response rate of 72% (N = 1610). These responders are hereafter called ‘T4-easy-to-recruit’.
Participants who had not completed any assessments with the CRC, were contacted by the TRAILS research team. The TRAILS team approached these initial non-responders to evaluate the recruitment methods of the CRC, and to try to convince them to participate. The TRAILS research team also contacted T1 participants who had refused participation at both T2 and T3. Willingness to participate at T4 of initial non-responders was assessed and when they seemed willing, information about the fourth wave was sent, including a paper questionnaire and a gift certificate (10 Euro). The TRAILS team gave individuals who did not wish to fill out the full questionnaire the option to fill out a shortened version of the survey. The term web-based survey method should therefore be read as web or mail-based survey method throughout this paper. These extensive recruitment efforts lead to inclusion of 271 extra participants (T4-hard-to-recruit). The recruitment efforts increased the response rate of T4 to 84.3% (total n = 1881, mean age 19.1 (SD 0.60), 52.3% girls).
In short, T1-easy-to-recruit participants responded immediately; T1-hard-to-recruit-participants responded after several phone calls (until contact), one or two house visits and/or a two months reflection period. T4-easy-to-recruit participants responded to the CRC, which in some cases included reminders and one or two house visits; T4-hard-to-recruit participants responded only after the extra recruitment efforts of the TRAILS research team.
To be able to answer our second research question, we compared four groups: a) T4-easy-to-recruit responders; b) T4-hard- to-recruit responders; c) T4-non-responders, who participated in T3 but at T4 responded to neither the CRC nor the TRAILS team; and d) drop-outs since T2 or T3, who participated in T1 (and T2), but not in T3 and T4.
TRAILS has biological, psychological, and social information from multiple sources, i.e. adolescents, their parents, their teachers and their peers. Huisman et al. gave an overview of all measurements of the first three waves . The fourth wave was comparable to the earlier waves with a few adaptations. For example, a structured diagnostic interview [23–25] and a life stress interview  were administered; the Amsterdam Neuropsychological Tasks [27, 28] were readministered; the adult version replaced the adolescent version of a number of questionnaires; and a number of age appropriate questions were added. For this paper, we used the following variables that we hypothesized to predict attrition:
Sociodemographic characteristics were assessed during an interview with one of the parents (usually the mother), administered at T1. The parent reported on whether the (biological) parents were divorced, the number of siblings, and whether the participant belonged to a single parent family. Educational level, occupational level  and socioeconomic position (SEP)  of the parents were also assessed at T1. Intelligence quotient (IQ) of TRAILS participants was estimated at T1 using the Vocabulary and Block Design subtests from the Revised Wechsler Intelligence Scales for children [27, 31, 32].
The position in the educational system of all respondents at T2 and T3 was established by means of the so-called ‘educational ladder,’ developed by Bosker, Van der Velden, and Hofman . This measure incorporates two aspects of a student’s position in the educational system, namely (1) the level of education (in the Dutch secondary educational system four tracks are distinguished corresponding to the level of difficulty), and (2) the progress within education. The scale ranges from 1 to 7 at T2 and 2 to 10 at T3. A score of 10 reflects the final exam of the highest track of secondary education. A score of 7 means that it will take three years until the final exam of the highest track can be obtained. Because the distances between the tracks can be considered as approximately similar, it is possible to scale them on an interval scale. Moving up a grade within the same track results in winning one point, whereas repeating a grade within the same track as well as streaming down to a lower track without repeating results in retaining the same score.
Sociometric status of participants was assessed by means of peer nominations at T1 and T2. In classes with at least 10 TRAILS participants, children were asked to indicate whom they liked (peer acceptance), disliked (peer rejection), who bullied them (bullying), whom they bullied (victimization) and who helped them (helping). Children could nominate an unlimited number of same-gender and cross-gender classmates [34–37].
Alcohol, cigarettes and cannabis use was assessed at T2, T3 and T4 by self-report questionnaires. Participants were asked to report whether they had ever used alcohol, cigarettes or cannabis (lifetime use), when they had started using it (age of onset) and the frequency of use. Although the validity and reliability of self-reports on substance use has been a subject of debate, previous research has concluded that, when anonymity is assured, self-report measures of substance use have acceptable validity and reliability [38, 39].
Externalizing and internalizing problems were assessed at T1, T2 and T3 by the Dutch version of the Child Behavior Checklist (CBCL) and the self-report version of this questionnaire, the Youth Self-Report [40, 41]. At T4, the Adult Self-Report (ASR,) was administered. These questionnaires contain a list of behavioural and emotional problems, which parents or the participant themselves can rate as 0 = not true, 1 = somewhat or sometimes true, or 2 = very or often true in the past 6 months. The broad-band dimension of Externalizing Problems encompasses the narrow-band scales Aggressive Behaviour and Rule-Breaking Behaviour. The dimension of Internalizing Problems included the scales Anxious/Depressed, Withdrawn/Depressed, and Somatic Complaints . A Total Problem Score scale was constructed as the sum of all problem behaviours, that is, internalizing and externalizing problems as well as thought problems, attention problems and social problems.
Additionally, the Composite International Diagnostic Interview (CIDI, [23–25]) was administered at T4. The CIDI is a comprehensive, fully-structured interview designed to be used by trained lay interviewers for the assessment of mental disorders according to the definitions and criteria of ICD-10 and DSM-IV. It is intended for use in epidemiological and cross-cultural studies as well as for clinical and research purposes. The diagnostic section of the interview is based on the World Health Organization's CIDI [23–25]. Diagnoses were grouped into internalizing behaviour diagnoses, including anxiety and depressive disorders; and externalizing behaviour diagnoses, including substance abuse, conduct disorder and oppositional defiant disorder. A sum score of total problem behaviour diagnoses was calculated, including all internalizing and externalizing behaviour diagnoses, bipolar disorders and attention deficit hyperactivity disorder.
To investigate whether the extra recruitment effort at T1 had a long-lasting effect, we used a logistic regression analysis with ‘being hard-to-recruit at T1’ as independent variable predicting response in the following measurement waves. To find out whether T1-hard-to-recruit-retainers (those that stayed in the cohort) were different from the T1-hard-to-recruit-dropouts (those that dropped out at T2,T3 or T4), the T1-easy-to-recruit- retainers or the T1-easy-to-recruit-dropouts, we performed single and multivariate multinomial regression analyses to provide estimates (odds ratio’s, including 95% confidence intervals) of the included predictors for each of the following categories: T4-responders that were T1-hard-to-recruit (‘T1-hard-to-recruit retainers’), T4-non-responders that were T1-hard-to-recruit (‘T1-hard-to-recruit dropouts’), T4-responders that were T1-easy-to-recruit and T4-non-responders that were T1-easy-to-recruit. To be able to show differences between T1-hard-to-recruit retainers and T1-hard-to-recruit dropouts, the T1-hard-to-recruit retainers were used as reference category, rather than the T1-easy-to-recruit retainers, which is the largest group. The following predictors were included in both the single and multivariate analyses: family composition, SEP, IQ, education, sociometric status, substance use, and psychopathology. The multivariate models were constructed using backward stepwise selection using likelihood ratio tests. P values were set at 0.1 to prevent relevant predictors from being excluded from the final model. Non-nested models (eg. when comparing the effects of parental education with a composite measure for socioeconomic status, which also includes parental education) were evaluated using Akaike’s (AIC) and Bayesian (BIC) information criteria.
For our second research question, we first used single multinomial regression analysis to provide estimates (odds ratio’s, including 95% confidence intervals) of the included predictors for each of the following categories: T4-easy-to-recruit, T4-hard-to-recruit, T4-non-responders and drop-outs since T2 or T3. Included predictors are family composition, SEP, IQ, education, sociometric status, substance use, and psychopathology. For predictors that were measured at T4 only, binary logistic regression was used. Then, to find out which predictors related most strongly to participation at T4, we performed a stepwise multivariate multinomial regression analysis using the same method as described above. In addition, we investigated possible interaction effects of predictors and T1 recruitment status on participation at T4.
The reporting of this observational study followed guidelines from the STROBE statement .
An overview of sample characteristics at each of the four measurement waves can be found in Table 1. At eight year follow-up, the response rate was 84%. With an initial response rate of 76%, this implies that 64% of the eligible children still participated in TRAILS eight years later.
Effects of extensive recruitment efforts eight years later
The first question in the present study was whether extensive recruitment effort at the first assessment wave (age 11) resulted in a more diverse sample eight years later, during the fourth assessment wave (age 19). Table 2 shows the response rates at T2, T3 and T4 of T1-easy-to-recruit responders and T1-hard-to-recruit responders, respectively. Of the T1-hard-to-recruit responders, 61% were still in the cohort at T4. As expected, attrition rates were significantly higher among T1-hard-to-recruit participants than among T1-easy-to-recruit participants, at all successive measurement waves (Table 2). This notwithstanding, over half of T1-hard-to-recruit participants were easy-to-recruit at T4 (Figure 1). Among the T1-hard-to-recruit participants we found no significant differences at T4 between retainers and drop-outs in sociodemographic variables, peer status or psychiatric symptoms (Table 3). This indicates no selective attrition of the most vulnerable T1-hard-to-recruit participants along the four measurement waves. In addition, T1-hard-to-recruit-retainers differ significantly from T1-easy-to-recruit retainers, indicating that the increased generalisability that was generated by the extra recruitment efforts at T1 is maintained throughout the waves.
Effects of extensive recruitment efforts at age 19
Table 4 shows sociodemographic variables and outcome measures for the 4 groups (T4-easy-to-recruit, T4-hard-to-recruit, T4-non-responders, and drop-outs since T2 or T3). Similar to T1 , T4-hard-to-recruit responders seem a relatively vulnerable group of adolescents: like T4-non-responders and T2/T3-drop-outs, they had a lower IQ, their parents were more often divorced, and they more often came from families with a low socioeconomic position. This suggests that extensive recruitment efforts to prevent attrition at age 19 increased the representativeness of our sample, like it did eight years earlier. Like at T1 , the socioeconomic position of T4-non-responders and T2/T3-drop-outs was lower than the socioeconomic position of the T4-hard-to-recruit responders (Table 4). Regarding IQ and parental divorce, drop-outs since T2 or T3 were equally likely to have a low IQ or divorced parents as T4-hard-to-recruit participants, while T4 non-responders were more likely to have a low IQ or divorced parents (Table 4). The same can be concluded for educational position. T4-easy-to-recruit participants had attained the highest educational positions at both T2 and T3, whereas T4-non-responders had attained the lowest educational positions at both waves.
At T1, being nominated as popular by peers predicted being a responder, whereas being rejected predicted being hard-to-recruit . Peer acceptance at T1 did not predict participation anymore at T4, whereas being rejected by peers, as well as bullying, at T1 still predicted being hard-to-recruit at T4 (Table 5). Hard-to-recruit participants, non-responders, and drop-outs did not differ with respect to being rejected at T1. Thus, sociometric status at T1 differentially predicted participation in a school-based survey at age 11 compared to participation in a web-based survey at age 19.
Peer acceptance at T2 predicted being a non-responder at T4, while there was no association with being hard-to-recruit or a dropout since T2/T3. Bullying or being a victim of bullying behaviour both predicted being T4-hard-to-recruit, whereas being nominated as a helper predicted being T4-easy-to-recruit.
Respondents who were easy-to-recruit at T4 were less likely to have used cigarettes or cannabis at T2 than T4-hard-to-recruit participants and T4-non-responders (Table 6). T4-hard-to-recruit participants were more likely than all other groups to have used cannabis at T2, but not at later waves.
In terms of externalising problems, the parents of T4-hard-to-recruit participants reported more externalising problems from T1 up to T3 (Table 7). Differences in parent-reported externalising problems between T4 non-responders and drop-outs since T2 or T3 seemed to have diminished over time, whereas differences in self-reported externalising problems emerged at T3 and remained at T4. Hard-to-recruit participants were also more likely to receive a lifetime externalising diagnosis in the CIDI interview at T4. Notably, T4-easy- and hard-to-recruit participants did not differ with regard to self-reported externalising problems at T4. Furthermore, T4-easy-to-recruit participants reported more internalising problems both at T1 and at T3 (Table 7).
In the current analysis, with T4-easy-to-recruit participants as reference category, we cannot show whether T4-non-responders differ significantly from T4-hard-to-recruit participants. Results from the analysis with T4-hard-to-recruit participants as reference category show that T4-non-responders significantly more often have a low educated mother, low family income, low SEP, low IQ and lower educational position compared to T4-hard-to-recruit responders. In terms of psychopathology, substance use and other sociodemographic variables, the differences were not statistically significant (results not shown but available upon request).
Finally, the multiple regression analysis shows that being T1-hard-to-recruit most strongly predicts recruitment status at T4, and furthermore that being male, from non-Western origin, having a low educated mother, low family income, low IQ and having internalising and externalising problems remain statistically significant risk factors for being T4-hard-to-recruit in a multivariate model (Table 8). Analyses including interaction terms yielded strong main effects of both recruitment status and predictors; their interaction however yielded negligible effects in the opposite direction. These interaction results might be unreliable resulting from the small numbers in the various categories.
Main findings regarding effects of recruitment efforts eight years later
The response rate after eight years follow up is 84%; among the T1 hard-to-recruit participants we found no significant differences between participants and non-participants at T4 in demographic variables, peer status or psychiatric symptoms. This indicates there is no selective attrition of the most vulnerable T1-hard-to-recruit participants along the four measurement waves. We may conclude that extensive recruitment effort does not only increase the representativeness of the sample at initial assessment waves [5, 11, 12], but also eight years later. This is an important finding. We encourage other researchers to investigate retention rates of easy-to-recruit and hard-to-recruit participants in their longitudinal samples to examine the robustness of these findings.
A response rate of 84% at eight year follow-up can be considered high. Although response rates in some other studies are unequalled , reported response rates are usually similar [15, 18, 45] or lower in population-based cohorts [13, 14, 17, 19]. Two population-based studies have reported eight year follow-up rates [19, 45]. In the Great Smoky Mountains Study (GSMS), the initial inclusion rate was 80%, and the participation rate after eight years follow-up ranged from 77-83% in three different cohorts , giving a total response rate of about 62-66%. Total response rates of the Avon Longitudinal Study of Parents And Children (ALSPAC) seem somewhat lower, that is, 54% after eight years follow-up . The total response rate in TRAILS was 64% after eight years. Total response rates in population studies in which participants with a certain psychiatric disorder are oversampled are usually remarkably lower. For example, the Netherlands Study of Depression and Anxiety or NESDA achieved a two-year follow-up response rate of 87%, but the initial response rates were low. Less than 50% of individuals recruited through primary care or from other cohort studies, and 57% of patients recruited via specialized mental health care settings enrolled in the study , giving a total response of about 44%.
Main findings regarding effects of extensive recruitment efforts at age 19
Like at T1 , we can conclude that, although differences between participants (T4-easy and hard-to-recruit) and non-participants (T4-non-responders and drop-outs since T2/T3) on sociodemographic variables decreased, they did not disappear with extensive recruitment efforts. This conclusion parallels conclusions from other studies that sociodemographic variables predict being hard-to-recruit [11, 12] and non-response [3, 13–18].
As far as we know, the association between peer nominations for sociometric status and response or attrition has not been studied in other samples than TRAILS . At T1, being nominated as popular by peers predicted being a T1- responder, whereas being rejected predicted being T1-hard-to-recruit . However, peer acceptance at T1 did not predict recruitment status at T4, while peer rejection, as well as bullying and being bullied still predicted being T4-hard-to-recruit. We might speculate that popular children felt encouraged to participate in a school-based survey, whereas this type of positive peer pressure did not influence their decision to participate eight years later in a web-based survey. Peer rejection, bullying and being bullied at T1 however remain important predictors for being hard-to-recruit, also 8 years later in a web-based survey. It would be interesting to investigate how peer acceptance or rejection predicted participation rates in cohort studies that used simultaneous school and web-based surveys in the same age groups [13, 14].
Substance use has been shown to be a predictor of being hard-to-recruit, being a non-responder or dropping out at follow-up [12, 13, 18, 47]. Indeed, hard to recruit respondents were more likely to have used alcohol, cannabis and cigarettes. The fact that T4-hard-to-recruit responders reported more cannabis use at T2 suggests that the extensive recruitment efforts at T4 increased representativeness of the whole sample.
The finding that parent-reported problems decreased over time while self-reported problems seemed to emerge could be related to the decreasing knowledge the parent has of the behaviour of the child as the child grows older. That easy- and hard-to-recruit participants did not differ with regard to self-reported externalizing problems at T4 might indicate that the effect of extensive recruitment efforts at T4 increased the number of participants high on externalizing behaviours, like it did at T1 . Indeed, subjects high on externalizing problems have been shown to be less likely to respond to single recruitment efforts [11, 14] and more likely to drop-out from longitudinal studies [13, 15, 19]. Extensive recruitment efforts at age 11 also decreased differences between participants and non-participants on internalizing problems : teachers reported more internalizing problems for T1-hard-to-recruit participants than for T1-easy-to-recruit participants. At age 19, there seems to be a different trend. Easy-to-recruit participants at T4 reported more internalizing problems both at T1 and at T3 (Table 4). This might have been a report bias as these differences were not apparent in parent-reported internalizing problems, nor were T4 easy-to-recruit participants more likely to have received a lifetime internalizing diagnosis in the CIDI interview at T4. Results from other studies are inconsistent with respect to internalizing problems as well; whereas most found that internalizing problems did not predict response [6, 11, 13, 15, 16], others showed that individuals with internalizing problems were less likely to participate  or more likely to drop-out at follow-up .
Overall, we conclude that the extra recruitment efforts of the TRAILS research team have increased the number of vulnerable adolescents participating in the fourth wave over and above the recruitment efforts of the CRC, resulting in a similarly diverse sample that was reached by the extensive recruitments efforts at T1, giving confidence in estimated associations in TRAILS studies.
In spite of intensive recruitment efforts we were not able to contact all T4-non-responding TRAILS participants. This means we have no information about their current (mental) health status, substance use or educational level. Also, at T2 and T3, we did not contact non-responders to collect reasons for non-response or information regarding their current (mental) health status and other measures. Therefore, information on factors predicting non-response at T3 and T4 is derived from earlier measurement waves in which the non-responders still participated.
Furthermore, the measurement of sociometric status was only possible in classrooms with at least 10 TRAILS participants . This lead to a much smaller number of participants for these measures (at T1 N = 1065; at T2 N = 1023 for the peer nominations).
Implications of the findings
The results that are presented here have implications in two fields. First, when setting up a longitudinal study, researchers might want to put extra effort in recruiting initial non-responders as we have shown this pays off in the short and long term. It results in enrolling a more representative sample at baseline, and ensures increased generalisability even after eight years and four assessment waves later, when over 60% of those who were hard-to-recruit at baseline are still in the sample. We found that there are no significant differences between T1-hard-to-recruit dropouts and T1-hard-to-recruit retainers in terms of sociodemographic variables, peer status or psychiatric symptoms, indicating we did not lose the most vulnerable T1-hard-to-recruit participants and the increased generalisability of the sample is maintained.
Second, the results of this paper might have implications for the analysis of longitudinal data, wherein researchers are commonly confronted with missing data. Missing values can be dealt with by multiple imputation, which has been shown to cause less bias compared to complete case analysis, single imputation or the missing indicator method . Based on the results presented in this paper, ‘drop out’ could be modelled, which might aid researchers in decisions they need to make when imputing data for missing participants or participants with missing data.
First, we conclude that extensive recruitment efforts at the first assessment wave of a population-based cohort still pays off eight years later. Over 60% of T1 hard-to-recruit responders who were persuaded to participate by extensive recruitment efforts still participated in the study four assessment waves later. This is an important conclusion, especially for researchers who are designing a population-based cohort study and have to decide whether or not to invest in recruiting initial non-responders.
Second, we conclude that the effects of extensive recruitment effort are largely similar in different age groups using different survey methods. Differences between easy and hard-to-recruit responders at the first assessment wave, when the mean age was 11 and a school-based assessment method was used, were very similar to the differences between easy and hard-to-recruit responders at the fourth wave, when the mean age was 19 and a web-based survey method was used. At both measurement waves, differences between responders and non-responders decreased after inclusion of hard-to-recruit participants.
TRAILS data of the T1 and T2 measurement waves are deposited in DANS-KNAW and can be accessed at http://www.dans.knaw.nl.
Kessler RC, Little RJ, Groves RM: Advances in strategies for minimizing and adjusting for survey nonresponse. Epidemiol Rev. 1995, 2: 192-204.
Nakash RA, Hutton JL, Jorstad-Stein EC, Gates S, Lamb SE: Maximising response to postal questionnaires–a systematic review of randomised trials in health research. BMC Med Res Methodol. 2006, 6: 5-10.1186/1471-2288-6-5.
Pérez RG, Ezpeleta L, Domenech JM: Features associated with the non-participation and drop out by socially-at-risk children and adolescents in mental-health epidemiological studies. Soc Psychiatry Psychiatr Epidemiol. 2007, 42 (3): 251-258. 10.1007/s00127-006-0155-y.
Liese AD, Liu L, Davis C, Standiford D, Waitzfelder B, Dabelea D, Bell R, Williams D, Imperatore G, Lawrence JM: Participation in pediatric egidemiologic research: The SEARCH for Diabetes in Youth Study experience. Contemp Clin Trials. 2008, 29 (6): 829-836. 10.1016/j.cct.2008.05.008.
de Winter AF, Oldehinkel AJ, Veenstra R, Brunnekreef JA, Verhulst FC, Ormel J: Evaluation of non-response bias in mental health determinants and outcomes in a large sample of pre-adolescents. Eur J Epidemiol. 2005, 20: 173-181. 10.1007/s10654-004-4948-6. 0393–2990; 0393–2990; 2
Van Der Veen WJ, Van Der Meer K, Penninx BW: Screening for depression and anxiety: correlates of non-response and cohort attrition in the Netherlands study of depression and anxiety (NESDA). Int J Methods Psychiatr Res. 2009, 18 (4): 229-239.
Gerrits MH, EJCGvd O, Voogt R: An evaluation of nonresponse bias in peer, self, and teacher ratings of children's psychosocial adjustment. J Child Psychol Psychiatry Allied Disciplines. 2001, 42 (5): 593-602. 10.1111/1469-7610.00755.
Hamilton C, Fuchs D, Fuchs LS, Roberts H: Rates of classroom participation and the validity of sociometry. School Psychology Review. 2000, 29 (2): 251-266.
Kolonel LN, Henderson BE, Hankin JH, Nomura AM, Wilkens LR, Pike MC, Stram DO, Monroe KR, Earle ME, Nagamine FS: A multiethnic cohort in Hawaii and Los Angeles: baseline characteristics. Am J Epidemiol. 2000, 151 (4): 346-357. 10.1093/oxfordjournals.aje.a010213.
Tate AR, Jones M, Hull L, Fear NT, Rona R, Wessely S, Hotopf M: How many mailouts? Could attempts to increase the response rate in the Iraq war cohort study be counterproductive?. BMC Med Res Methodol. 2007, 7: 51-10.1186/1471-2288-7-51.
Baines AD, Partin MR, Davern M, Rockwood TH: Mixed-mode administration reduced bias and enhanced poststratification adjustments in a health behavior survey. J Clin Epidemiol. 2007, 60 (12): 1246-1255. 10.1016/j.jclinepi.2007.02.011.
Steffen AD, Kolonel LN, Nomura AM, Nagamine FS, Monroe KR, Wilkensi LR: The effect of multiple mailings on recruitment: The Multiethnic Cohort. Cancer Epidemiol Biomarkers Prev. 2008, 17 (2): 447-454. 10.1158/1055-9965.EPI-07-2576.
Bjertness E, Sagatun A, Green K, Lien L, Sogaard AJ, Selmer R: Response rates and selection problems, with emphasis on mental health variables and DNA sampling, in large population-based, cross-sectional and longitudinal studies of adolescents in Norway. BMC Public Health. 2010, 10: 602-
Frojd SA, Kaltiala-Heino R, Marttunen MJ: Does problem behaviour affect attrition from a cohort study on adolescent mental health?. Eur J Public Health. 2011, 21 (3): 306-310. 10.1093/eurpub/ckq078.
Badawi MA, Eaton WW, Myllyluoma J, Weimer LG, Gallo J: Psychopathology and attrition in the Baltimore ECA 15-year follow-up 1981–1996. Soc Psychiatry Psychiatr Epidemiol. 1999, 34 (2): 91-98. 10.1007/s001270050117.
Bucholz KK, Shayka JJ, Marion SL, Lewis CE, Pribor EF, Rubio DM: Is a history of alcohol problems or of psychiatric disorder associated with attrition at 11-year follow-up?. Ann Epidemiol. 1996, 6 (3): 228-234. 10.1016/1047-2797(96)00002-6.
de Graaf R, Bijl RV, Smit F, Ravelli A, Vollebergh WA: Psychiatric and sociodemographic predictors of attrition in a longitudinal study: The Netherlands Mental Health Survey and Incidence Study (NEMESIS). Am J Epidemiol. 2000, 152 (11): 1039-1047. 10.1093/aje/152.11.1039.
Eerola M, Huurre T, Aro H: The problem of attrition in a Finnish longitudinal survey on depression. Eur J Epidemiol. 2005, 20 (1): 113-120. 10.1007/s10654-004-1657-0.
Wolke D, Waylen A, Samara M, Steer C, Goodman R, Ford T, Lamberts K: Selective drop-out in longitudinal studies and non-biased prediction of behaviour disorders. Br J Psychiatry. 2009, 195 (3): 249-256. 10.1192/bjp.bp.108.053751.
Noll RB, Zeller MH, Vannatta K, Bukowski WM, Davies WH: Potential bias in classroom research: Comparison of children with permission and those who do not receive permission to participate. J Clin Child Psychol. 1997, 26 (1): 36-42. 10.1207/s15374424jccp2601_4.
Unger JB, Gallaher P, Palmer PH, Baezconde-Garbanati L, Trinidad DR, Cen S, Johnson CA: No news is bad news - Characteristics of adolescents who provide neither parental consent nor refusal for participation in school-based survey research. Eval Rev. 2004, 28 (1): 52-63. 10.1177/0193841X03254421.
Huisman M, Oldehinkel AJ, de WA, Minderaa RB, de BA, Huizink AC, Verhulst FC, Ormel J: Cohort profile: the Dutch 'TRacking Adolescents' Individual Lives' Survey'; TRAILS. Int J Epidemiol. 2008, 37: 1227-1235. 10.1093/ije/dym273. 1464–3685; 0300–5771; 6
World Health Organization: Composite International Diagnostic Interview. 1990, Geneva, Switserland: World Health Organization
Haro JM, Arbabzadeh-Bouchez S, Brugha TS, de GG, Guyer ME, Jin R, Lepine JP, Mazzi F, Reneses B, Vilagut G, Sampson NA, Kessler RC: Concordance of the Composite International Diagnostic Interview Version 3.0 (CIDI 3.0) with standardized clinical assessments in the WHO World Mental Health surveys. Int J Methods Psychiatr Res. 2006, 15: 167-180. 10.1002/mpr.196. 1049–8931; 1049–8931; 4
Kessler RC, Abelson J, Demler O, Escobar JI, Gibbon M, Guyer ME, Howes MJ, Jin R, Vega WA, Walters EE, Wang P, Zaslavsky A, Zheng H: Clinical calibration of DSM-IV diagnoses in the World Mental Health (WMH) version of the World Health Organization (WHO) Composite International Diagnostic Interview (WMHCIDI). Int J Methods Psychiatr Res. 2004, 13: 122-139. 10.1002/mpr.169. 1049–8931; 1049–8931; 2
Kendler KS, Karkowski LM, Prescott CA: Stressful life events and major depression: risk period, long-term contextual threat, and diagnostic specificity. J Nerv Ment Dis. 1998, 186: 661-669. 10.1097/00005053-199811000-00001. 0022–3018; 11
Brunnekreef AJ, de Sonneville LM, Althaus M, Minderaa RB, Oldehinkel AJ, Verhulst FC, Ormel J: Information processing profiles of internalizing and externalizing behavior problems: evidence from a population-based sample of preadolescents. J Child Psychol Psychiatry. 2007, 48: 185-193. 10.1111/j.1469-7610.2006.01695.x. 0021–9630; 0021–9630; 2
De Sonneville LMJ: Amsterdam Neuropsychological Tasks: A computer-aided assessment program. Cognitive ergonomics, clinical assessment, and computer-assisted learning: Computers in psychology. Edited by: Brinker BPLM, Peek PJ, Brand AN, Maarse FJ, Mulder LJM. 1999, Lisse, The Netherlands: Swets & Zeitlinger, 187-203. 6
Ganzeboom HBG, Treiman DJ: Internationally comparable measures of occupational status for the 1988 International Standard Classification of Occupations. Soc Sci Res. 1996, 25: 201-239. 10.1006/ssre.1996.0010.
Vollebergh WA, ten Have M, Dekovic M, Oosterwegel A, Pels T, Veenstra R, de Winter A, Ormel H, Verhulst F: Mental health in immigrant children in the Netherlands. Soc Psychiatry Psychiatr Epidemiol. 2005, 40 (6): 489-496. 10.1007/s00127-005-0906-1.
Wechsler D: Wechsler intelligence scale for children-revised manual. 1974, New York: The Psychological Corporation
Van Haasen PP, De Bruyn EEJ, Pijl YJ, Poortinga YH, Spelberg HC, Van der Steene G: WISC-R, Wechsler Intelligence Scale for Children-Revised. 1986, Lisse: Swets & Zeitlinger, Dutch
Bosker RJ, Velden RKW, Hofman WHA: Een generatie geselecteerd. Deel 1: De loopbanen. 1985, Groningen, The Netherlands: RION
Kupersmidt JB, Coie JD: Preadolescent peer status, aggression, and school adjustment as predictors of externalizing problems in adolescence. Child Dev. 1990, 61: 1350-1362. 10.2307/1130747.
Dijkstra JK, Lindenberg S, Veenstra R: Same-gender and cross-gender peer acceptance and peer rejection and their relation to bullying and helping among preadolescents: comparing predictions from gender-homophily and goal-framing approaches. Dev Psychol. 2007, 43 (6): 1377-1389.
Dijkstra JK, Lindenberg S, Veenstra R: Beyond the class norm: bullying behavior of popular adolescents and its relation to peer acceptance and rejection. J Abnorm Child Psychol. 2008, 36 (8): 1289-1299. 10.1007/s10802-008-9251-7.
Oldehinkel AJ, Rosmalen JG, Veenstra R, Dijkstra JK, Ormel J: Being admired or being liked: classroom social status and depressive problems in early adolescent girls and boys. J Abnorm Child Psychol. 2007, 35: 417-427. 10.1007/s10802-007-9100-0. 0091–0627; 0091–0627; 3
Murray DM, Perry CL: The measurement of substance use among adolescents: when is the 'bogus pipeline' method needed?. Addict Behav. 1987, 12: 225-233. 10.1016/0306-4603(87)90032-3. 0306–4603; 0306–4603; 3
Creemers HE, Korhonen T, Kaprio J, Vollebergh WA, Ormel J, Verhulst FC, Huizink AC: The role of temperament in the relationship between early onset of tobacco and cannabis use: the TRAILS study. Drug Alcohol Depend. 2009, 104: 113-118. 10.1016/j.drugalcdep.2009.04.010. 1879–0046; 0376–8716; 1–2
Achenbach TM: Manual for the child behavior checklist/4–18 and 1991 profile. 1991, Burlington, VT: University of Vermont, Vermont
Achenbach TM: Manual of the youth self-report and 1991 profile. 1991, Burlington, VT: University of Vermont, Vermont
Achenbach TM, Rescorla LA: Manual for the ASEBA Adult Forms & Profiles. 2003, Burlington, VT: University of Vermont, Research Center for Children, Youth & Families
Von Elm E, Altman DG, Egger M, Pocock SJ, Gotzsche PC, Vandenbroucke JP: Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ. 2007, 335: 806-808. 10.1136/bmj.39335.541782.AD. 1468–5833; 0959–535; 7624
Moffitt TE, Caspi A, Taylor A, Kokaua J, Milne BJ, Polanczyk G, Poulton R: How common are common mental disorders? Evidence that lifetime prevalence rates are doubled by prospective versus retrospective ascertainment. Psychol Med. 2010, 40 (6): 899-909. 10.1017/S0033291709991036.
Copeland W, Shanahan L, Costello EJ, Angold A: Cumulative prevalence of psychiatric disorders by young adulthood: a prospective cohort analysis from the Great Smoky Mountains Study. J Am Acad Child Adolesc Psychiatry. 2011, 50 (3): 252-261. 10.1016/j.jaac.2010.12.014.
Lamers F, Hoogendoorn A, Smit J, van Dyck R, Zitman FG, Nolen WA, Penninx BW: Sociodemographic and psychiatric determinants of attrition in the Netherlands Study of Depression and Anxiety (NESDA). Compr Psychiatry. 2012, 53 (1): 63-70. 10.1016/j.comppsych.2011.01.011.
Vinther-Larsen M, Riegels M, Rod MH, Schiotz M, Curtis T, Gronbaek M: The Danish Youth Cohort: characteristics of participants and non-participants and determinants of attrition. Scand J Public Health. 2010, 38 (6): 648-656. 10.1177/1403494810374222.
Knol MJ, Janssen KJ, Donders AR, Egberts AC, Heerdink ER, Grobbee DE, Moons KG, Geerlings MI: Unpredictable bias when using the missing indicator method or complete case analysis for missing confounder values: an empirical example. J Clin Epidemiol. 2010, 63: 728-736. 10.1016/j.jclinepi.2009.08.028. 1878–5921; 0895–4356; 7
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/12/93/prepub
This research is part of the TRacking Adolescents' Individual Lives Survey (TRAILS). Participating centers of TRAILS include various departments of the University Medical Center and University of Groningen, the Erasmus University Medical Center Rotterdam, the University of Utrecht, the Radboud Medical Center Nijmegen, and the Parnassia Bavo group, all in the Netherlands. TRAILS has been financially supported by various grants from the Netherlands Organization for Scientific Research NWO (Medical Research Council program grant GB-MW 940-38-011; ZonMW Brainpower grant 100-001-004; ZonMw Risk Behavior and Dependence grants 60-60600-98-018 and 60-60600-97-118; ZonMw Culture and Health grant 261-98-710; Social Sciences Council medium-sized investment grants GB-MaGW 480-01-006, GB-MaGW 480-07-001, and GB-MaGW 481-08-013; Social Sciences Council project grants GB-MaGW 457-03-018, GB-MaGW 452-04-314, and GB-MaGW 452-06-004; NWO large-sized investment grant 175.010.2003.005); the Sophia Foundation for Medical Research (projects 301 and 393), the Dutch Ministry of Justice (WODC), the European Science Foundation (EuroSTRESS project FP-006), and the participating universities. We are grateful to all adolescents, their parents and teachers who participated in this research and to everyone who worked on this project and made it possible.
FCV is a contributing author of the Achenbach System of Empirically Based Assessment, from which he receives remuneration. All other authors declare that they had no competing interests.
FJ and EN are responsible for the development and design of the study. DR performed the statistical analysis in close collaboration with FJ and EN. FJ and EN wrote the first draft. JO and AJO were involved in the interpretation of the results. RV is responsible for the construction of the measure of educational position. JO, AJO, RV and FCV provided critical comments on earlier versions of the paper. All authors had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis. All authors approved the final submitted version.
Esther Nederhof, Frederike Jörg contributed equally to this work.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.