Bias and heteroscedastic memory error in self-reported health behavior: an investigation using covariance structure analysis

Background Frequent use of self-reports for investigating recent and past behavior in medical research requires statistical techniques capable of analyzing complex sources of bias associated with this methodology. In particular, although decreasing accuracy of recalling more distant past events is commonplace, the bias due to differential in memory errors resulting from it has rarely been modeled statistically. Methods Covariance structure analysis was used to estimate the recall error of self-reported number of sexual partners for past periods of varying duration and its implication for the bias. Results Results indicated increasing levels of inaccuracy for reports about more distant past. Considerable positive bias was found for a small fraction of respondents who reported ten or more partners in the last year, last two years and last five years. This is consistent with the effect of heteroscedastic random error where the majority of partners had been acquired in the more distant past and therefore were recalled less accurately than the partners acquired more recently to the time of interviewing. Conclusions Memory errors of this type depend on the salience of the events recalled and are likely to be present in many areas of health research based on self-reported behavior.


Background
Scientific research using self-reports for past events is increasingly frequent in areas which need to quantify certain aspects of health behavior whose nature is essentially private and therefore not directly observable, as well as amenable to memory error. Examples range from food consumption to health beliefs and sexual behavior. The latter is the cornerstone for various models of prevalence, incidence and risky behavior related to sexually transmit-ted diseases among which AIDS has received particular attention. This paper presents a new statistical approach to investigate sources of errors in self-reported behavior, using an example with number of sexual partners.
The accuracy of self-reported number of sexual partners plays a prominent role in the discussions on validity of this method. [1,2] The accuracy is usually expected to increase as the defined time period asked about is closer to the time of inquiry. This is based on numerous examples of recall difficulties in sex research, notwithstanding many other methodological factors affecting self-reported behavior in this context. [3] Several studies of the cognitive processes relevant in the survey context pointed to the difficulty of converting episodic memory to an incidence. [4][5][6] While the former is a typical way of storing information on sexual partners in individual memory, the latter is what scientific research usually aims at. For example, to locate a sexual partner in time, it is often necessary for the respondent to reconstruct the order of many other clues, e.g. the partner's name and most vivid physical and personal characteristics, where did they meet, what was he/she doing at that time, what happened before and after, which year was that. This personal perception of time -defined in terms of ordering personally salient characteristics of events -is very different from the physical time which is independent of the salience clues, e.g. the number of partners in the last year. Under time pressure to respond, guessing may be used as a strategy [7], thus adding to the inaccuracy of answers.
In a large national sex survey in Britain, the issue of recall accuracy might have been related to a large decrease in variances for the number of partners reported ever, in the last five years, in the last two years and in the last year. [8] It was demonstrated with the same data how a few large outliers can affect the mean of the number of partners distribution. [2] This paper builds on these findings and relates the accuracy to the issue of bias in self-reported number of partners. Methodological interest here is in estimating the variance components of number of partners' reports due to true variation on one hand and potential sources of bias on the other. Among the latter, gender, recency of the period asked about, large number of partners and age of the respondent have been mentioned. [8] Gender-specific factors such as underreporting for women versus overreporting for men and likely underrepresentation of women with extremely high number of partners such as commercial sex workers in a large national survey [2] are not examined here as the analysis concentrated on young men aged 26-35 years.
A recent comprehensive review of sources of rater bias [9] provides an excellent systematization of the field but does not deal with self-reports. Hoyt's concept of dyadic variance due to rater's unique perception of targets cannot be applied to the task of reporting the number of sexual partners for varying past periods because there is only one observation per rater-target dyad. When the ratings are based on counts of observed behaviors, explicit nature of this task leaves little scope for different interpretations of the target as opposed to inferential attribute type rating system. [10] It is important to underline that the behavior of interest here was explicitly defined at the beginning of the self-report questionnaire. [8] This article is meant to reach both statistically minded and less statistically oriented audience. Because of the latter, mathematical expressions are kept to minimum. The first part of the article briefly describes the survey methodology and statistical methods used to deal with the bias in self-reported number of sexual partners, including the model specification and variable selection criteria. The second part presents results of model selection and bias estimation analyses. Finally, the results are discussed in the light of possible memory distortions such as telescoping effect.

Subjects
The National Survey of Sexual Attitudes and Lifestyles was undertaken in 1990-91 with primary aims to provide reliable data for modeling the spread of HIV. A random sample of 18876 men and women aged 16-59 living in Great Britain was interviewed and invited to complete a questionnaire containing more sensitive questions. One quarter of them were randomly assigned to fill in the long version of the questionnaire in the sampling phase of the survey. Overall response rate was 63.3%, while acceptance rate among eligible respondents who could be interviewed reached 71.5%.
Multistage cluster sampling was applied to stratified lists of areas in order to arrive at a systematic probability sample of addresses within electoral wards as primary sampling units. The second stage was sampling addresses within wards and the third stage was selecting an individual randomly from each address. As the probability of selecting a survey participant was inversely related to the number of eligible individuals at that address, the household weight was assigned to each respondent to adjust for the differences in selection probability. Regional differences in response rate were also accounted for by introducing another set of weights inversely proportional to the regional response rates. The final weight was calculated by multiplying the two weights and scaling their sum to equal the total sample size. All the analyses in this work used final weights.
The final sample achieved was broadly representative of the target population in terms of gender, age, ethnicity, marital status and socioeconomic status. [8] Internal consistency of answers to different questions and external validation of some self-reports indicated good quality of the data. Item non-response did not exceed 5% for most of the questions (for further details see [11]) and the impact of non-response to the mean of the number of partners' distribution was likely to be small. [12] The methodology of the survey is discussed in more details elsewhere. [13] All men aged 26-35 years among the respondents who filled in the long version of the questionnaire and had at least one heterosexual partner were included in this analysis, thus excluding 2.8% of the former who had not had sex by the time of the survey. The long questionnaire contained details on family conditions, sexual attitudes and lifestyle thought to be related to the number of sexual partners.
The age span for this analysis was chosen to minimize the factors related to sexual behavior which might be specific to one birth cohort and absent in the others such as those related to HIV/AIDS, while providing an opportunity to explore the recall accuracy for a variety of sexual lifestyles. Thus age 26-35, the typical age for marriage which is meant to be an exclusive type of sexual partnership, was chosen. As some of the influences on the number of partners were known to be non-linear (e.g. age) and genderspecific, stratified analysis on this basis seemed a more realistic choice than to try to accommodate all these complex effects within one model.

Statistical methods
This analysis is principally interested in estimating accuracy and bias for the number of partners reported in the last year, last two years, last five years, and the period more than five years ago. The latter was calculated as the difference between the lifetime number of partners and the number reported in the last five years as there was no explicit question about the number of partners beyond the last five years period but there was a question about the number of partners ever. This approach allows a clear time ordering of the periods referred to by the questions on number of partners and at the same time preserves the format used in the questionnaire.
Covariance structure analysis [14][15][16] was used to explore the relationship between reported and estimated true number of partners for the varying length of the past covered by the questions on the number of partners. The analysis is also known as simultaneous equations modeling, path analysis and 'causal modeling' in psychometric, econometric and social sciences but here a generic name is preferred to help recognizing its similarity with the covariance structures analyzed in mixed and/or generalized linear models. Covariance structure analysis (CSA) can model the error structure of both observed and latent variables, as well as the dependency structure due to fractional overlapping of number of partners reported in the last five years, the last two years and the last year. A detailed presentation of CSA is beyond the scope of this work but can be found in many specialized texts [14][15][16] and some more recent multivariate statistics textbooks. [17,18] It is important to underline that CSA model is fitted to the observed covariance matrix. A bit of matrix algebra is needed to state this formally. If S, M and W are observed, CSA model-based and weight matrix respectively, then the CSA fit criterion F, for generalized least squares estimation, is given by which in the case of maximum likelihood estimation can be written as where n is the number of observed variables and "det" stands for matrix determinant. [19] The chi-square statistic for CSA models is defined as the optimum value of the above discrepancy function F multiplied by sample size minus one. [19] It compares the model-based (estimated) covariances with those computed from the observed data i.e. unconstrained covariance matrix. Thus the comparison is formalized as the likelihood ratio test between the two. The degrees of freedom for this test equal the number of constraints imposed on the parameter estimates due to the model proposed. If the latter holds, the chi-square is non-significant and rejects the alternative hypothesis that the similarity between the data-based and model-based covariances is spurious.
The analysis presented used instrumental variables, widely applied in econometrics [20,21], to obtain unbiased regression estimates for the dependent variables of interest, which in turn can be used for further analysis. Instrumental variables are often known confounders such as age, gender and education, whose measurement errors are assumed to be uncorrelated between themselves and with the other variables in the model. Here instrumental variables were used to predict 'true' number of partners and compare it with corresponding self-report. Bias estimates were defined as differences between the two.
There were three basic steps in bias calculation. First, the CSA model parameters were obtained and used to calculate factor scores which represent 'true' number of partners for each respondent. Secondly, as these were given on an arbitrary scale, they were rescaled to the mean and variance of the cube root of reported number of partners (see later in the text about the reasons for this transformation). Thirdly, the difference between reported and rescaled values of 'true' number of partners was taken. Then the mean difference and its standard error was calculated for a com-parison of various groups of interest. It is important to underline that factor scores can be justified as posterior expectations of the factors. [22] Consequently, bias can be viewed in terms of a posterior means distribution of residuals, i.e. observed minus predicted or expected on the basis of a CSA model.

Model specification
The system of simultaneously estimated regression equations can be represented as following. Let subscripts 0, 5, 2, and 1 denote temporal sequence of the number of partners for more than five years ago, last five years, last two years and last year, respectively. Let uppercase R and T denote reported and true number of partners respectively. Let lowercase e and d denote error terms for observed and estimated true values respectively. Let uppercase I denote the instrumental variables in the model (indexed from 1 to n). Then eight equations R 0 = a 0 T 0 + e 0 (3.1) R 5 = a 5 T 5 + e 5 (3.2) R 2 = a 2 T 2 + e 2 (3.3) T 0 = g 01 I 1 + g 02 I 2 + ...g 0n I n + d 0 (4.1) T 5 = g 51 I 1 + g 52 I 2 + ...g 5n I n + d 5 (4.2) T 2 = g 21 I 1 + g 22 I 2 + ...g 2n I n + d 2 (4.3) T 1 = g 11 I 1 + g 12 I 2 + ...g 1n I n + d 1 While all of the above assumptions are standard for many covariance structure models [14,15], they are likely to be violated to a degree in observational studies due to confounding variables omitted from the model and model misspecification. [23] The critical issue is the practical impact of the violations regarding the key model parameters of interest which can sometimes be estimated in CSA (an example is given later in the text) or through sensitivity analysis.
It should be noted that the assumptions about the distribution of the error terms in the model leave the covariance matrix of the measurement errors to be estimated, as these are the parameters of interest in this case. The assumption of other influences in the model can be tested by releasing particular restrictions and comparing the restricted and non-restricted model in terms of likelihood ratio under multivariate normality assumption or some other goodness of fit measure [24][25][26][27], or simply by univariate t-test for the parameter value when its variance is also estimated by maximum likelihood. [15] Strictly inferential approach is usually tenuous given the number of simplifying assumptions in a complex covariance structure model, particularly for the univariate t-test. [28] With many combinations of the influences between the variables tested and particularly in post hoc model modifications based on prior CSA analysis with the same data, the problem of finding a spuriously significant parameter becomes acute and makes hypothesis testing even more dif- ficult. [29] However, some logical ordering of the effects in the model such as clear temporal sequence in this case and/or theoretical grounds can reduce this problem by focusing only on a priori justified subset of the relationships between the variables analyzed.

Transformations and estimation method
Because of some respondents claiming large number of partners, the positive extreme of the distribution is heavy tailed. This adds more of the same problem already present with the lower end as majority of men had few partners. Power transformations indicated the cubic root transformation for number of partners, length of last living-in relationship and age of first sexual intercourse as it reduced relative multivariate kurtosis [30] to the value of 1.47. Two main estimation options were considered: accounting for violation of multivariate normality assumption needed for maximum likelihood (ML) or using asymptotic distribution free (ADF) estimates [30] which do not require this assumption. While the latter are prone to considerable bias unless the sample size is pretty large -of the order of few thousands at least -the former were shown to be reasonably robust with sample sizes of few hundreds. The robustness of CSA parameters and goodness of fit indices is not elaborated here as it is not the central theme of this work but it is discussed in other works. [26,[30][31][32][33][34][35][36][37][38] The strategy adopted was to use both ML and ADF estimates and compare the results.
As there were few cases with missing values on any of the variables included in the CSA, these were excluded from the analysis. SAS software was used. [19] Instrumental variables' selection It was known from a previous analysis that a large number of reported sexual partners was associated with respondents' age, gender, marital status, age of first sexual intercourse and social class in a multivariate model. [8] Another analysis showed that in addition to the above effects, smoking and alcohol consumption in young men were associated with increased number of partners in last five years.
[39] Attending STD clinic, bodymass index, educational level, living with both natural parents until the age of 16, the length of last living-in relationship and sexual attitudes from the survey questions 40 and 41 were added to this selection of instrumental variables to be considered. Although many other factors have been shown to influence the number of sexual partners in other publications not cited here, an empirical rather than a purely theoretical basis guided the instrumental variables' selection in this work. It is important to stress that the aim of this work focused on the error part rather than on the structural relationship. Notwithstanding the importance of the latter both for estimating the error components and in its own right, the selection of instrumental variables was not aimed at explaining specific contributions of a wide range of social and psychological factors influencing the number of sexual partners.
Several combinations of instrumental variables were tried to check their impact on the CSA parameters of interest. Likelihood ratio test was used for the final selection of these variables in CSA.

Results
Basic descriptive statistics for the number of partners showed that more partners were reported for the more distant periods of time (table 1).
The latter were also characterized by sharp increase in variability. Essential information about other variables used in the model is presented in table 2.

Covariance structure model
Latent variables (circled on Figure 1) represent the true values estimated in the model. They are measured by observed variables i.e. reported number of partners for different time periods (in rectangles on Figure 1). The lack of accuracy is expressed as measurement errors affecting the number of partners reported. The two-digit subscripts for regression coefficients marked "b" for the temporal se- quence of 'true' number of partners indicate the time frame of dependent (first digit) and independent variable (second digit).
The differences between ML and ADF estimates were in the second decimal place for few parameters, so it did not have any practical impact on the model interpretation. Therefore only ML estimates are reported here (table 3). The chi-square values for overall goodness of fit test was 13.3 with nine degrees of freedom and associated probability of 0.15, thus indicating the acceptability of the model. Root mean square error approximation [25] was 0.03 with 90% confidence interval upper bound of 0.06, which is considered a good fit. Goodness of fit index and its adjusted version [15] were 1.0 and 0.97 respectively. Probability of close fit [25,27] was 0.82. This alternative measure of fit assumes approximate rather than exact model fit. The latter is unrealistic to expect even if no structural misspecifications are present as measurement errors due to sampling variation would still indicate some discrepancy between the population and data based esti-mates. Normalized residuals did not exceed the range from -2 to 2, thus showing good fit on the level of individual variables in the model. All these indices confirmed the acceptability of the model.
The magnitudes of error variances for reported number of partners were ordered as expected -the largest one for most distant past was more than six times the one for the last year (table 3). The correlations between measurement errors were considerably larger between adjacent time periods as compared to those further apart (Figure 1). High correlation of .77 was found between measurement errors for the number of partners in the last two years and in the last year. This was expected as the carry-over effect should be large for the large overlap of one year period between the questions on number of partners. This effect was somewhat smaller for the number of partners in the last two and in the last five years. When the time periods for the number of partners did not overlap (the last five years versus before that), the correlation between the measurement errors was .23.

Figure 1 Correlations among measurement errors
Among exogenous variables, the length of the last livingin relationship was found to influence the estimated true number of partner for all the questions analyzed. The age of first sexual intercourse was a highly significant determinant of the number of partners more then five years ago and in the last five years but not afterwards (table 3 and Figure 1). For men aged 26-35, being married at the time of the interview and not living with both natural parents until the age of 16 was associated with increased number of partners reported in the last year.
The length of last living-in relationship was a significant predictor of number of partners for any past period. Longer live-in relationships were associated with fewer partners in the last five years and before that and larger number of partners in the last two years and last year (cf. the change of sign for the parameters g7 and g8 with respect to g5 and g6 in table 3). This change in the direction of influence paralleled the change in percentage of those reporting more than one partner while in living-in relationship which rose from 6% in the last five years to 18% in the last two years and 23% in the last year. Thus the living-in relationships in more distant past could have had the opposite effect of the more recent ones after adjusting for the other influences in the model.
The earliest sexual lifestyle in a temporal sequence can be regarded as baseline. No effect between the baseline and the number of partners for non-adjacent time periods was found significant (details not shown).  measurement error estimate for the reported age of first sex was almost five times less than the smallest error term for the reported number of partners. Therefore it seems reasonable to conclude that both overall goodness of fit and specific error terms estimates give no support for the hypothesized relationship between error terms of reported number of partners and age of first sex with these data. Even if such remote hypothesis were accepted, the shift in estimates of error terms' variances and covariances would have probably been fairly small and of no practical importance in this case.

Bias analysis
The largest estimated bias was found among the men who reported large number of partners in the last five years, last two years and last year ( Figure 2). The confidence intervals for those reporting ten or more partners in the last two years and last year were notably widened. It should be noted that only a small fraction of men aged 26-35 reported ten or more partners in last five years. Hence overall overreporting in this stratum is likely to be small for most practical purposes. A small percentage of men reporting ten or more partners in the last five years (8.2%), last two years (2.2%) or last year (0.2%) were estimated to have overreported between 3 and 4 partners on average, while overreporting for those who reported three to nine partners did not exceed one (Figure 2). On the other hand, two thirds of all men who reported one or two partners in last five years were likely to underreport slightly.
Estimated bias was very small with respect to age, age of first sex, visiting STD clinic, education, social class, marital status, length of living-in relationship, reporting a partner outside living-in relationship, smoking and alcohol consumption after adjusting for the effect of the number of partners reported (details not shown). An interaction effect between number of partners on one hand and age of first sex and visiting an STD clinic on the other was also investigated. Negative moral connotation of STD and common British misconception that a person below the age of consent is himself/herself committing an offence [8] led to the hypothesis that social desirability of the answers to these questions could have biased the reports on number of partners. Self-reporting of socially disapproved behavior such as drug use has been shown to be biased downwards as a function of the time elapsed in repeated surveys. [40] However, no support for the above hypotheses was found with these data (details not shown) as the number of partners effect was far too dominant.

Discussion
Overreporting the number of partners was particularly large among men who reported five or more partners for more recent periods (Figure 2). This may be a result of socalled telescoping effect where events from the past are shifted towards more recent period during the recall process due to the compression of the time scale or because of heteroscedastic random measurement error due to the increase of error variance for more distant events. [41] In the latter case, more accurately recalled recent events result in reported event density such as locating sexual partners in time being skewed towards the present, i.e. the time of interviewing. Thus the net effect is overreporting the number of partners in last five years due to the redistribution of some partners acquired more than five years ago within the last five years period. This is consistent with Figure 2 and the estimates of measurement error variances for varying periods in the past (table 3). The overall telescoping effect depends on both the rate of increase in error variance and true distribution of events. [41] As most of the partners were acquired in more distant past period when the recall was less accurate, this could explain the bias for men who reported three or more partners in the last five years, last two years and the last year ( Figure 2). The bias clearly increases with the number of partners. Men with one or two partners for any period in the past were unlikely to have had serious difficulties in recalling the numbers accurately, so no heteroscedasticity effect could have contributed to the bias estimates. This again supports the view that the shift of recalled density for the number of partners towards the date of the interview can be explained in terms of heteroscedastic random error.
The bias estimation involved several simplifying assumptions. They seem reasonable in a rough and ready analysis with these data but may be too simplistic in other situations, particularly with small samples. The key problem is that the interval scale property of the population factor scores cannot be guaranteed by their empirical normal distribution in one sample -only ordinal scale can be assumed. [22] However, with reasonably large sample the asymptotic equivalence of the scales should be satisfactorily approximated. Another problem is quantification of complex CSA model uncertainty which factor scores are conditional upon. No simple confidence limits can be provided as there are many parameters, possibly estimated with variable precision, so that power considerations become fairly complicated. [42] Bayesian approach seems a natural framework for including this type of knowledge into prior distribution specification. However, many CSA

Figure 2
Bias estimates by period and reported number of partners Note: Words "year", "years" and "partners" abbreviated with "yr", "yrs" and "p.", respectively. Vertical axis (number of partners bias) is given on the cubic root scale.
model parameters would considerably increase the computational burden in this case, which is a likely reason why so far only limited use of Bayesian statistics has been found in this area. [43] Bootstrapping CSA parameters [14] and calculating corresponding factor scores may be an alternative way of obtaining more realistic confidence intervals for bias scores. Gibbs sampling can also be used to construct the confidence limits of posterior means for residual scores of interest. [44] Other important developments in this field include an adaptation of CSA to deal with count data [45] and to treat counts as ordinal scale variables in CSA. [46] The limitations of the use of instrumental variables were often understated in the literature, principally that for many realistic data in the econometrics, social sciences and psychology it is extremely difficult to ascertain the independence of the instrumental and endogenous variables in the model [21,48] because of confounding. Therefore at least a theoretical justification or preferably some empirical evidence for this assumption should be provided. [47] Testing robustness of the model obtained could be done by assessment of goodness of fit and parameter modifications when some model assumptions are altered, which may require fixing some parameters on the basis of the previous model in order to identify and estimate the others. This strategy is as reasonable as the certainty about the former but provides a simple measure of shift in the parameters of interest under various violations of model assumptions which may arise in practice. Alternatively, a full sensitivity analysis [48] may be used.
For the model presented, long term health problems and social class are examples of potential confounders both on theoretical and empirical grounds [8] which may influence selected instrumental variables and the number of partners reported. However, their addition to the model hardly changed the estimates of error terms variances and covariances. While it is impossible to ascertain that the same holds for the unknown confounders, this is an empirical issue which can be further explored by examining the robustness of the model with the data from other studies of a similar kind.
Colinearity among predictors may also influence the choice of instrumental variables. Generalized inverse of a Hessian matrix may be a simple computational solution for this problem and is available in many statistical software for CSA but there are other strategies if this one fails. [49] However, a sensible selection of instrumental variables on sound theoretical basis should precede more complicated ways of dealing with this issue.
Memory error and personal salience of the partners were reported as important sources of bias for the number of partners in recent period of time in various studies. [1,3,8] Personal salience is likely to be more influential for more distant past events. For example, first sexual intercourse is a relatively vivid memory for most respondents as an event of personal importance despite the time elapsed, while some more recent partnerships which bear little relevance to the current emotional life may have faded away. Personal salience is usually culturally and/or ethnically contextualized, e.g. machismo or male adolescents overreporting of number of sexual partners could be a way of asserting their perceived social role. [3] However, this survey was conducted at the time of great concern for and media attention to AIDS epidemic in Great Britain. In addition, the age range 26-35 years does not include adolescents.
Large number of partners is difficult to report accurately, particularly if the time frame is long. The difficulty arises from the effort to convert episodic memory which operates with discrete time intervals into continuous time scale from which incidence is derived. [4][5][6]41] Motivation to search through complex patterns of sexual behavior for people with large number of partners is likely to be lower in the general population in comparison with some special groups at risk of AIDS such as gay men who may perceive the study as personally relevant. [3] The finding that overreporting is particularly large among men who reported highest number of partners in last year highlights the need for caution in interpreting mean values for a heterogeneous population. It also calls for some more focused research into sexual behavior of people with large number of partners.
The potential of CSA in the context of reliability analysis has been recognized [9,41,50,51] but is still largely of illustrative nature. When the contrast between exploratory and confirmatory models [52] is seen in relative rather than in absolute terms, i.e. dependent on the amount of knowledge accumulated in a particular area of research, a more realistic expectation of the interpretational gains based on these models may help assessing their performance in a wider range of applications. In this context, it is important to emphasize that CSA estimates of the true values and measurement errors of self-reports are not based on any new evidence with respect to the data at hand. The estimated bias of self-report calculated in this way should be seen as a possible scenario which is supported by the data analyzed, rather than as any sort of direct validation by some additional independent information. The latter provides much stronger evidence for generalizability of the findings than the former. However, the feasibility of gathering new data may well be limited. CSA can highlight the likely sources of bias which can then become a matter of a separate study. From this point of view, empirical cross-validation in terms of new data and statistical modeling of the existing data can be seen as complemen-tary rather than mutually exclusive ways of addressing the issue of bias in self-reports.

Conclusions
CSA provided evidence of the impact of the telescoping effect on bias in self-reported number of sexual partners. Considerable positive bias was found for a small fraction of respondents who reported ten or more partners in the last year, last two years and last five years. This is consistent with the effect of heteroscedastic random error where majority of partners had been acquired in more distant past and therefore recalled less accurately then the partners acquired more recently to the time of interviewing. Memory errors of this type depend on the salience of the events recalled and are likely to be present in many areas of health research based on self-reported behavior.