 Debate
Measures and models for causal inference in cross-sectional studies: arguments for the appropriateness of the prevalence odds ratio and related logistic regression
BMC Medical Research Methodology volume 10, Article number: 66 (2010)
Abstract
Background
Several papers have discussed which effect measures are appropriate to capture the contrast between exposure groups in cross-sectional studies, and which related multivariate models are suitable. Although some have favored the Prevalence Ratio over the Prevalence Odds Ratio, thus suggesting the use of log-binomial or robust Poisson instead of logistic regression models, this debate is still far from settled and requires close scrutiny.
Discussion
In order to evaluate how accurately true causal parameters such as the Incidence Density Ratio (IDR) or the Cumulative Incidence Ratio (CIR) are effectively estimated, this paper presents a series of scenarios in which a researcher happens to find a preset ratio of prevalences in a given cross-sectional study. Results show that, provided essential and non-waivable conditions for causal inference are met, the CIR is most often inestimable whether through the Prevalence Ratio or the Prevalence Odds Ratio, and that the latter is the measure that consistently yields an appropriate estimate of the Incidence Density Ratio.
Summary
Multivariate regression models should be avoided when assumptions for causal inference from cross-sectional data do not hold. Nevertheless, if these assumptions are met, it is the logistic regression model that is best suited for this task as it provides a suitable estimate of the Incidence Density Ratio.
Background
Mainstream books devoted to organizing knowledge on epidemiological methods used to emphasize the study of the distribution of health events according to person, time and place [1, 2]. Following a period when vital statistics were the main data sources for this aim, cross-sectional studies began to play an important role in this field by providing aggregate and group-specific prevalence estimates.
Acknowledging the limitations of prevalence vis-à-vis incidence estimations in some circumstances, from the 1980s a particular literature focused on cross-sectional data as a means to indirectly obtain estimates of incidence rates [3–8]. Simultaneously, following a growing interest of the epidemiological community in causal inference [9, 10], cross-sectional studies were not only accepted as a way of estimating prevalences but, given certain conditions, also as a suitable design for investigating causal relationships. Since the 1990s, papers recognizing the analytical role of epidemiological surveys have been concerned with two issues: (i) which measure would be appropriate for capturing the contrast between exposure groups, whether the prevalence ratio (PR) or the prevalence odds ratio (POR); and (ii) which model should be used for estimating these quantities in the multivariate context.
The first issue raised a debate that split into two camps. Whereas Strömberg [11, 12] favored the POR, Lee & Chia [13, 14], Lee [15], Axelson et al. [16, 17] and Thompson et al. [18] argued that the estimator was difficult to interpret and communicate; was very discrepant from the PR when outcomes were common; and that the conditions under which the POR estimates the incidence density ratio (IDR), namely stationarity and equal duration of disease, were hardly met. Reservations were also expressed on the grounds that the POR was a numerical mimic of other effect measures, and was misinterpreted as a cumulative incidence ratio (CIR) in the presence of common outcomes.
Although Pearce [19], again favoring the POR, revisited the dispute only a few years ago, the debate per se seemed to have died away, being outshone by the second contention, namely, which methods would be best to model prevalence data. Since surveys most often involve frequent events, several authors called up the 'rare disease assumption' [20, 21] and argued that the POR obtained by a logistic regression model would thus overestimate the PR. As an extension, statistical models other than logistic regression have been proposed to estimate the PR, namely, the log-binomial, Poisson or Cox models with robust variance estimates [18, 22–26]. Several studies have taken this on and have been effectively using these models to handle data arising from cross-sectional studies while envisaging control for confounding variables. For instance, a preliminary and tentative literature search in Medline (October 8, 2009) using the keywords ("prevalence ratio*"[All Fields] OR "cross-sectional"[All Fields]) AND (Poisson[All Fields] OR "log-binomial"[All Fields]) AND "humans"[MeSH Terms] found 444 references of this kind. Conspicuously, the numbers increased from 20 papers in 1990–1994 to 262 in the 2005–2009 period.
It is our contention that this perspective is essentially misguided and that a discussion on the best model only makes sense if preceded by a thorough debate about what is actually sought with a cross-sectional study: estimating the magnitude of a condition in a population or making causal inference? The purpose of this paper is thus to revisit a dispute that, though hardly new by any means, is far from settled. In fact, our motivation has been this growing literature that utilizes multivariate models for the PR to address relationships between a 'dependent' and several 'independent' variables, yet in our view without a clear-cut supporting rationale behind it.
Discussion
To address our contention, this section is organized as follows. The first subsection provides a brief review of the purposes of cross-sectional studies, with a particular eye on identifying the specific situations when measures of association based on information gathered from cross-sectional studies are in fact wanted and/or required, and when multivariate modeling procedures are hence due. The ensuing subsection dwells on the relation between prevalence and incidence ratio measures. It starts by presenting the necessary conditions whereby a cross-sectional study may be able to provide measures of effect representing causal parameters. Next, in preparation for the proposed scenarios and discussions that follow, an outline is provided of the formal relations between measures originating from cross-sectional approaches (PR and POR) and their respective relations to causal parameters, namely the cumulative incidence ratio (CIR) and the incidence density ratio (IDR). We then create several scenarios that pragmatically assume that data come from surveys (as in so many real instances) and enquire whether and how the measures actually obtained (PR or POR) effectively inform about either the CIR or the IDR. The next subsection offers the rationale for choosing a suitable multivariable model, followed by a subsection building upon the arguments provided previously and proposing a decision tree for analyzing cross-sectional data. Finally, in the light of our own insights, the last subsection of the Discussion revisits some of the points mentioned in the Background section for and against the measures and models used for causal inference in cross-sectional studies.
Purposes of cross-sectional studies
Before delving into a debate about which multivariate model to use, a key issue in deciding between contrast measures (PR or POR) concerns the underlying reason for actually wanting to obtain ratios involving estimates in cross-sectional studies. To answer this question one needs to summon up the two main purposes for carrying out health surveys.
The most usual and uncontroversial is to provide overall and group-specific prevalences of a particular health event in a given population, usually with a view to helping organize resources and guide decision-making processes. In this particular case, estimating a PR (or more precisely, a single-value estimate contrasting the prevalences obtained in two population strata) may not be of much interest. Concretely, what would a health officer make of being informed that the PR equals 2.0 in a particular population? This value could signify, for example, that the magnitude of an event of interest in two different strata is 20% and 10%, but may also relate to prevalences of 0.02% and 0.01%, respectively. Clearly, these are very different scenarios in terms of relevance and resulting health actions, which an omnibus measure such as this PR of 2.0 simply fails to portray.
This also reminds us that when causal inference is not desired or possible (cf. subsection "Structuring conditions" further on) the knowledge regarding tangible magnitudes of specific prevalences is much more informative than their ratio. It follows that statistical modeling to 'adjust for several variables' may also provide little help here and is unwarranted in such circumstances. An exception is when it is unfeasible to directly calculate estimates of prevalence due to data sparseness within subgroups, especially if there are specific target groups to be identified for further action. In order to get around this problem, it is possible to resort to multivariate modeling to predict or project prevalences (probabilities) using, for instance, a logistic model and thereafter applying the antilogit function $\Pr(P) = \left(1 + \exp\left(-\left(\beta_0 + \sum_{i=1}^{k} \beta_i X_i\right)\right)\right)^{-1}$, with $k$ variables describing patterns of characteristics in these subgroups. Still, it is important to realize that in this particular situation one is neither modeling nor interested in any effect (ratio) measure, but rather in the actual probabilities/prevalences occurring in (or rather, projected for) certain subgroups of interest.
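As an illustration of the projection step just described, the following minimal Python sketch applies the antilogit to a linear predictor. The function name `predicted_prevalence` and the coefficient values are hypothetical and chosen only for illustration; they do not come from any fitted model in this paper.

```python
import math

def predicted_prevalence(beta0, betas, xs):
    """Antilogit of the linear predictor: 1 / (1 + exp(-(beta0 + sum(beta_i * x_i))))."""
    linear_predictor = beta0 + sum(b * x for b, x in zip(betas, xs))
    return 1.0 / (1.0 + math.exp(-linear_predictor))

# Hypothetical coefficients (assumption, not from the paper):
# an intercept plus two binary subgroup characteristics.
beta0 = -2.0
betas = [0.8, 0.5]

# Projected prevalence for the subgroup with both characteristics present
# and for the reference subgroup with both absent.
p_both = predicted_prevalence(beta0, betas, [1, 1])
p_none = predicted_prevalence(beta0, betas, [0, 0])
print(f"both present: {p_both:.3f}; both absent: {p_none:.3f}")
```

The quantities of interest here are the projected prevalences themselves, not any ratio between them.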
Another purpose of surveys is to uncover causal relationships. Although longitudinal study designs are better suited for this aim, cross-sectional studies have often been used to answer causal questions, mostly for pragmatic reasons such as unavailability of incidence data, reducing the cost and duration of a study, and sometimes because of ethical constraints, as for instance, when a detected exposure unequivocally needs immediate intervention, rendering a 'neutral' follow up unsustainable.
Now, if the purpose of a survey is to address a causal relation, one of the key issues required for dealing with observational data concerns the modeling procedure. Yet, ahead of engaging in modeling the 'natural estimator' yielded in a survey (e.g., PR for some), one has to step back and ask what is really being achieved by controlling for several covariables. Specifically in the context depicted here, the question is why a researcher would want to obtain the average prevalence ratio accounting for the other variables in the model. One answer would be to 'recover' a counterfactual estimation that contrasts the exposed with 'themselves if unexposed' regarding the outcome of interest, which is only empirically achieved by comparing the exposed with the actual unexposed, once the effects of other relevant factors are explained away. This is evidently (and not surprisingly to most readers) a way to deal with confounding within the perspective of what has been labeled the potential-outcome model [27, 28]. If this is indeed a reasonable model, what would the required estimator(s) then be? Would a ratio of two prevalences (which, inter alia, may conflate incidence and duration of the event) suffice or would its ultimate aim be to estimate risk ratios (CIR) or rate ratios (IDR), given that both are recognizably 'true' causal parameters and largely recommended as the appropriate quantities to be attained [9, 29]? If the latter, an essential task is to scrutinize when and how a survey is effectively able to produce measures that are capable of representing either of these two causal parameters.
Relating prevalence and incidence ratio measures
Structuring conditions
At the outset, five conditions are necessary for a cross-sectional approach to be able to investigate an etiological hypothesis, and without which any attempt to relate an ensuing estimator to either the CIR or IDR breaks down. For one, the population must be in steady state over the study period (stationary). In this case, within any given period of time, the size of the population needs to be constant across the exposure groups, as well as with regard to any other covariable used in the modeling process. Secondly, no selective survival is allowable, i.e., the probability of withdrawal or death from the outcome under study or from other related causes may not be different across exposure groups. Thirdly, the mean duration of the outcome must be the same regardless of exposure group, that is, the exposure may not differentially influence the survival or recovery probabilities. Fourthly, no reverse causality is allowed, i.e., the outcome being modeled may not reciprocally cause (influence) the exposure status in any way. Lastly, the temporal directionality from the exposure to the outcome must be sustainable, either theoretically (e.g., if a lifelong attribute is studied as the exposure for a recent outcome event) or by means of a thorough data collection procedure that assures the exposure as an antecedent of the outcome (e.g., in a study on the effects on child birth, recalling at birth a past exposure during pregnancy) [9, 30].
Once these criteria are exhaustively met, the next step consists of inspecting the conditions whereby the estimates obtained in cross-sectional studies capture causal parameters or, in contrast and most importantly, under which circumstances they fall apart.
Formal relations between measures
Recalling Kleinbaum et al. [9] and the notation therein, let $R_{(t_0,t),i} = CI_{(t_0,t),i}$ be the risk or the cumulative incidence of an outcome of interest (e.g., disease, illness) occurring in stratum $i$ within a time interval $\Delta t = (t_0, t)$; $ID_i$ be the respective incidence density; $\overline{T}_i$ be the mean duration of the outcome; and $P_i$ the point prevalence measured (obtained) through a cross-sectional approach.
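Dropping the subscript $(t_0, t)$ for ease of notation, define
$$CI_i = 1 - \exp(-ID_i \times \Delta t) \qquad (1)$$
and thus conversely
$$ID_i = \frac{-\ln(1 - CI_i)}{\Delta t} = \frac{\ln\left(1/(1 - CI_i)\right)}{\Delta t} \qquad (2)$$
Since, generically, the relation between incidence density and prevalence is
$$P_i = \frac{ID_i \times \overline{T}_i}{ID_i \times \overline{T}_i + 1} \qquad (3)$$
the ensuing prevalence ratio is
$$PR = \frac{P_1}{P_0} = \frac{ID_1 \overline{T}_1 / (ID_1 \overline{T}_1 + 1)}{ID_0 \overline{T}_0 / (ID_0 \overline{T}_0 + 1)} \qquad (4)$$
Hence, if according to equation (2) one has $ID_0 = [\ln(1/(1 - CI_0))]/\Delta t$ for the unexposed and $ID_1 = [\ln(1/(1 - CI_1))]/\Delta t$ for the exposed, further substituting these quantities into equation (3), the ensuing prevalences are, respectively,
$$P_0 = \frac{\ln\left(1/(1 - CI_0)\right) \overline{T}_0}{\ln\left(1/(1 - CI_0)\right) \overline{T}_0 + \Delta t} \qquad (5)$$
$$P_1 = \frac{\ln\left(1/(1 - CI_1)\right) \overline{T}_1}{\ln\left(1/(1 - CI_1)\right) \overline{T}_1 + \Delta t} \qquad (6)$$
and, thus, the prevalence ratio (PR) may also be written as a function of the underlying risks ($CI_i$) and outcome durations $\overline{T}_i$ as
$$PR = \frac{\ln\left(1/(1 - CI_1)\right) \overline{T}_1 \left[\ln\left(1/(1 - CI_0)\right) \overline{T}_0 + \Delta t\right]}{\ln\left(1/(1 - CI_0)\right) \overline{T}_0 \left[\ln\left(1/(1 - CI_1)\right) \overline{T}_1 + \Delta t\right]} \qquad (7)$$
From equation (3), $ID_i$ may be expressed as a function of an estimated prevalence
$$ID_i = \frac{P_i}{\overline{T}_i (1 - P_i)} \qquad (8)$$
and, therefore,
$$IDR = \frac{ID_1}{ID_0} = \frac{P_1 / \left[\overline{T}_1 (1 - P_1)\right]}{P_0 / \left[\overline{T}_0 (1 - P_0)\right]} \qquad (9)$$
Note that the IDR given in equation (9) may also be written as a function of the prevalence odds ratio
$$IDR = \frac{P_1 (1 - P_0)}{P_0 (1 - P_1)} \times \frac{\overline{T}_0}{\overline{T}_1} = POR \times \frac{\overline{T}_0}{\overline{T}_1} \qquad (10)$$
and when the outcome durations are the same in both strata (exposed and non-exposed), the IDR equals the POR.
The $CI_i$ may also be expressed from the prevalence $P_i$. Solving equation (5) or (6), generically for stratum $i$, one has
$$CI_i = 1 - \exp\left(-\Delta t \times \frac{P_i}{\overline{T}_i (1 - P_i)}\right) \qquad (11)$$
and therefore,
$$CIR = \frac{CI_1}{CI_0} = \frac{1 - \exp\left(-\Delta t \times P_1 / \left[\overline{T}_1 (1 - P_1)\right]\right)}{1 - \exp\left(-\Delta t \times P_0 / \left[\overline{T}_0 (1 - P_0)\right]\right)} \qquad (12)$$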
All results in the following subsection (including figures) were obtained through an ad hoc Stata® program (epiconv ado-file) based on the above relations. The routine may be obtained from one of the authors (MER) on request.
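For readers without access to that routine, the relations between prevalence, incidence density and cumulative incidence can be sketched in a few lines of Python. This is not the epiconv program; the function names are hypothetical, and the sketch assumes a steady-state population with constant incidence densities. The prevalences (0.5 and 0.25) are those used in the figures; the mean duration and risk period of 1.0 are illustrative choices.

```python
import math

def incidence_density(p, t_bar):
    """Incidence density from point prevalence p and mean duration t_bar: ID = P / (T * (1 - P))."""
    return p / (t_bar * (1.0 - p))

def cumulative_incidence(id_, delta_t):
    """Cumulative incidence over delta_t under a constant incidence density: CI = 1 - exp(-ID * dt)."""
    return 1.0 - math.exp(-id_ * delta_t)

def prevalence(id_, t_bar):
    """Steady-state point prevalence: P = ID * T / (ID * T + 1)."""
    return id_ * t_bar / (id_ * t_bar + 1.0)

def effect_measures(p1, p0, t1, t0, delta_t):
    """PR, POR, IDR and CIR implied by two prevalences, two mean durations and a risk period."""
    id1 = incidence_density(p1, t1)
    id0 = incidence_density(p0, t0)
    return {
        "PR": p1 / p0,
        "POR": (p1 * (1.0 - p0)) / (p0 * (1.0 - p1)),
        "IDR": id1 / id0,
        "CIR": cumulative_incidence(id1, delta_t) / cumulative_incidence(id0, delta_t),
    }

# Prevalences as in the figures; durations and risk period are illustrative assumptions.
measures = effect_measures(p1=0.5, p0=0.25, t1=1.0, t0=1.0, delta_t=1.0)
for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

With equal durations the IDR returned by `effect_measures` coincides with the POR, whereas the CIR depends on the chosen `delta_t`.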
Exploring several scenarios
The scenarios that follow assume that data are collected through cross-sectional approaches and that the ratios of effectively measured prevalences between exposed and unexposed are always 2.0. Bearing a causal outlook, the fundamental issue concerns the interpretation in terms of the experience of the population under investigation. In the light of some or even all of the conditions presented (cf. subsection "Structuring conditions"), how are the estimates to be read? What do they signify in terms of CI and ID, and by extension, their related effect measures CIR and IDR?
The scenarios portrayed in Table 1 vary according to (i) whether the outcomes of interest are rare or frequent events; (ii) whether their average durations are long or short; and (iii) whether these durations are equal or unequal across exposure groups. The time units and prevalences specified in the scenarios are described in the table footnotes and are jointly meaningful. Note that given the specified values ($P_1$ and $P_0$; $\overline{T}_1$ and $\overline{T}_0$; and $\Delta t$), the IDR and CIR are specifically obtained through equations (9) and (12), respectively.
Overall, the PR estimate is only consistent with the IDR and the CIR in very restricted situations. Table 1 shows that there is only proximity when the outcome is rare and its duration long and alike across exposure strata (scenario 1). In a still more constrained condition, when the outcome is rare, short and of equal duration (scenario 3), only the IDR is numerically compatible with the PR, whereas the CIR is already quite far off (9.5% attenuation).
The shortcoming of the PR vis-à-vis the CIR is plainly depicted in Figure 1, which shows how the latter varies according to the duration of outcome, given fixed prevalences among exposure groups and time period ($\Delta t$) concerning the risks involved in the projected CIR. Bearing in mind that the straight dotted line in the centre of the figure indicates the 'constant' PR = 2.0 that would be uncovered in several cross-sectional studies, note that the CIR is met in just a very narrow range of outcome durations, strictly reaching equality only when $\overline{T}_1 = \overline{T}_0 = 0.69$. Note that arrows are indicative of the combinations shown in scenarios 5 and 7 of Table 1.
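The dependence of the projected CIR on outcome duration can be retraced with a short Python sketch. It assumes the prevalences used in the figures ($P_1$ = 0.5, $P_0$ = 0.25), a common mean duration in both strata, and a risk period of $\Delta t$ = 1, which is an assumption made here because the value underlying Figure 1 is not restated in the text. The function name is hypothetical.

```python
import math

def cir_from_prevalences(p1, p0, t_bar, delta_t):
    """CIR implied by prevalences p1 and p0, a common mean duration t_bar and a risk period delta_t."""
    id1 = p1 / (t_bar * (1.0 - p1))
    id0 = p0 / (t_bar * (1.0 - p0))
    return (1.0 - math.exp(-id1 * delta_t)) / (1.0 - math.exp(-id0 * delta_t))

P1, P0 = 0.5, 0.25   # prevalences used in the figures (PR = 2.0)
DELTA_T = 1.0        # assumed risk period (not restated in the text for Figure 1)

for t_bar in (0.25, 0.5, 0.69, 1.0, 2.0, 4.0):
    print(f"T = {t_bar:4.2f}  CIR = {cir_from_prevalences(P1, P0, t_bar, DELTA_T):.3f}")

# Bisection for the duration at which the CIR equals the PR.
# With these inputs the CIR increases with t_bar, so a simple bracket suffices.
lo, hi = 0.1, 5.0
for _ in range(60):
    mid = (lo + hi) / 2.0
    if cir_from_prevalences(P1, P0, mid, DELTA_T) < P1 / P0:
        lo = mid
    else:
        hi = mid
t_equal = (lo + hi) / 2.0
print(f"CIR equals PR at a mean duration of about {t_equal:.2f}")
```

Under these assumptions the crossing point agrees with the duration of 0.69 quoted above; any other risk period would move it.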
Although less common, one could contend that rather than standing for the CIR, the PR may capture the IDR instead. The dotted line at the top of Figure 1 shows that the projected IDRs are quite far from the PR. In this scenario the latter blatantly underestimates the former throughout the $\overline{T}_i$ range (PR = 2.0 against IDR = 3.0, i.e., the IDR is 50% higher), clearly not supporting the above proposition. Although one may argue that the PR tends to converge to the IDR as the outcome event gets rarer, the question remains as to 'how rare should an outcome be' in order to enable one to accept the PR as a consistent proxy for the causal effect parameter.
The discrepancy between the PR and the CIR may be shown more pointedly from yet another angle. Figure 2 extends Figure 1 by showing how the CIR departs from the PR, not only with regard to the outcome duration ($\overline{T}_i$), but also according to the risk period of follow up ($\Delta t$) to which the PR refers. Placed again within a plausible survey context ($P_1 = 0.5$ and $P_0 = 0.25$), Figure 2 portrays 10 $\Delta t$ scenarios. As before, on the whole, the detected PRs (= 2.0) imply an immense gamut of CIR estimates. In situations where $\Delta t$ assumes relatively short cumulative risk periods, the CIRs corresponding to the PR = 2 go to extremes, with the PR under- or overestimating them as $\overline{T}_i$ progressively increases. As the $\Delta t$ risk period increases, the PR tends to gradually overestimate the CIR. In the 'extreme' $\Delta t = 5$ condition (last graph in Figure 2), all surveys detecting a PR = 2.0 would overestimate the CIR whatever the value of $\overline{T}_i$.
Almost all scenarios depicted in Figure 2 suggest that there are certain combinations wherein the curves cross the lower dotted line demarcating the detected PR. This shows that there is always a prospect of finding a PR estimate that is close to the 'true' CIR in a survey nested into a particular fixed population follow up (cohort), e.g., $\overline{T}_i = 0.4$ if $\Delta t = 0.5$; or $\overline{T}_i = 1.1$ if $\Delta t = 1.5$; or $\overline{T}_i = 1.7$ if $\Delta t = 2.5$; or $\overline{T}_i = 2.0$ if $\Delta t = 3.0$; or $\overline{T}_i = 2.75$ if $\Delta t = 4.0$. However, even if one knows something about the outcome's duration ($\overline{T}_i = \overline{T}_1 = \overline{T}_0$), it is never possible to specify to which $\Delta t$ risk period the PR really relates. Having carried out a survey and estimated the contrast between two prevalences (exposed and unexposed), the researcher will always be in the dark as to which time period the risks account for and thus, by extension, the ensuing CIR purportedly emulated by the PR.
At this point one should recall an essential question posed before. In a survey, what would the interpretation of a prevalence contrast be in terms of a CIR (relative risk) on detecting, for instance, a PR = 2.0 as a proxy for a CIR = 2.0 in a scenario akin to scenario 5 of Table 1? One line of thought would be to picture this survey accommodated within another study carried out on a fixed population (cohort) in steady state, installed at $t_0$ but already followed for $\Delta t = 1.45$ (e.g., years), and governed by constant forces of morbidity of $ID_0 = 0.333$ and $ID_1 = 1.0$ (i.e., IDR = 3). The detected PR = 2.0 would then represent a CIR of 2.0, the estimate that would have been found at $t_{1.45}$ if the closed and intact population were effectively followed up for the specified $\Delta t$ (1.45 years). This situation is signaled with an arrow in Figure 3.
On close scrutiny, though, this quite attractive appraisal is untenable. From the stance of a relative risk (CIR) interpretation, a researcher uncovering a PR = 2.0 cannot know which $\Delta t$ is at issue. The assumption that $\Delta t = 1.45$ is empirically unrecognizable, as is thus the interpretation itself. Counter to a common view, a detected prevalence estimate may not be referred to any risk estimate. The key point is that prevalences have little bearing on the CIR, which goes to show that, beyond any numerical discrepancy, the interpretation of the PR in terms of a CIR also implies a conceptual misunderstanding.
Turning to another scenario, what is being measured given the conditions portrayed in Figure 3 when, for instance, a researcher faces a PR = 2.0? Let us focus, for instance, on the particular moment $t_5$ when $\Delta t = 5$. Here, the PR would be attempting to report a situation depicting an underlying force of morbidity of IDR = 3, wherein subjects installed at $t_0$ were followed through $t_5$, and upon which the obtained CIR would be 1.225 (with $CI_0 = 0.811$, $CI_1 = 0.993$ and $\overline{T}_1 = \overline{T}_0 = 1$). This is very different from the PR = 2.0 obtained in the survey at this point in time. Another key issue to emphasize, therefore, is that it is necessary to specify which scenario a PR estimate is referred to, something that is unfeasible in most real life circumstances. Prevalence ratios may well be calculated in a particular cross-sectional study, but within a causal model framework, in general, their interpretation is neither that of risk ratios (CIR) nor of rate ratios (IDR).
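The figures quoted for the two moments discussed above ($\Delta t$ = 1.45 and $\Delta t$ = 5) can be retraced with the following Python sketch. It assumes constant incidence densities and the values given in the text ($P_1$ = 0.5, $P_0$ = 0.25, mean duration of 1 in both strata); the function name `risks` is hypothetical.

```python
import math

P1, P0, T_BAR = 0.5, 0.25, 1.0

# Incidence densities implied by the prevalences: ID = P / (T * (1 - P)).
id1 = P1 / (T_BAR * (1.0 - P1))   # exposed
id0 = P0 / (T_BAR * (1.0 - P0))   # unexposed

def risks(delta_t):
    """Cumulative incidences and their ratio after following a closed cohort for delta_t."""
    ci1 = 1.0 - math.exp(-id1 * delta_t)
    ci0 = 1.0 - math.exp(-id0 * delta_t)
    return ci1, ci0, ci1 / ci0

for delta_t in (1.45, 5.0):
    ci1, ci0, cir = risks(delta_t)
    print(f"dt = {delta_t}: CI1 = {ci1:.3f}, CI0 = {ci0:.3f}, CIR = {cir:.3f}, "
          f"PR = {P1 / P0:.1f}, IDR = {id1 / id0:.1f}")
```

The same pair of prevalences is thus compatible with a CIR close to 2.0 or close to 1.2, depending only on a follow-up period that the survey itself cannot reveal.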
The auspicious news, though, is that the directly calculated POR consistently estimates the IDR (equation (10)) if the supporting pillars effectively hold. Given the prevalences used in all 3 figures displayed in this subsection, POR = $[P_1 \times (1 - P_0)]/[P_0 \times (1 - P_1)]$ = [0.5 × (1 − 0.25)]/[0.25 × (1 − 0.5)] = 3.0, which is consistent with the underlying IDR (top dotted lines). Examining Figure 2 in particular, the equality stands whatever $\overline{T}_i$ and $\Delta t$ are involved.
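The same point can be made in the forward direction. The Python sketch below fixes a 'true' IDR of 3, generates steady-state prevalences for a range of baseline rates and common mean durations, and compares the resulting PR and POR. The baseline rates and durations are hypothetical values chosen to span rare to common outcomes, and the function names are illustrative.

```python
def steady_state_prevalence(incidence_density, mean_duration):
    """P = ID * T / (ID * T + 1), valid for a stationary population."""
    product = incidence_density * mean_duration
    return product / (product + 1.0)

def prevalence_odds_ratio(p1, p0):
    """Cross-product ratio of prevalent and non-prevalent cases."""
    return (p1 * (1.0 - p0)) / (p0 * (1.0 - p1))

TRUE_IDR = 3.0
results = []
for id0 in (0.001, 0.05, 1.0 / 3.0, 2.0):   # hypothetical baseline rates, rare to common
    for t_bar in (0.25, 1.0, 4.0):          # hypothetical mean durations, equal in both strata
        p0 = steady_state_prevalence(id0, t_bar)
        p1 = steady_state_prevalence(TRUE_IDR * id0, t_bar)
        pr, por = p1 / p0, prevalence_odds_ratio(p1, p0)
        results.append((id0, t_bar, pr, por))
        print(f"ID0 = {id0:.3f}  T = {t_bar:4.2f}  PR = {pr:.3f}  POR = {por:.3f}")
```

Across all combinations the POR recovers the IDR, while the PR drifts from nearly 3 for rare outcomes towards 1 for common ones.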
Choosing a suitable multivariable model
Considering the arguments so far, one has to question the option to unconditionally take the analysis a step further and, without any scrutiny, model the PR. If its meaning in terms of both the CIR and IDR is indefensible in most situations, what would thus be inferable from a non-causal estimate accounting (controlling) for several variables? Apart from very exceptional (and eccentric) circumstances in which a cross-sectional approach is used to study a rare outcome, when the PR tends to the IDR, the best answer perhaps should be an uncompromising "not much"! Whichever modeling procedure is used, whether robust Poisson or Cox, or log-binomial models, the PR rarely accomplishes its purpose of providing a meaningful estimate within the potential outcomes model framework. On the other hand, if the conditions for causal inference from cross-sectional data are fulfilled and the POR effectively and consistently estimates the IDR, the logistic regression model will provide an unbiased estimate, independently of any "rare disease assumption". It is the 'natural' choice once the potential outcome model holds.
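To make the link between the logistic model and the POR concrete, the following Python sketch fits logit(P) = b0 + b1 × exposure to a 2×2 table by Newton-Raphson and compares exp(b1) with the cross-product ratio. The counts are hypothetical, chosen to reproduce the prevalences of 0.5 and 0.25 used in the scenarios; the function name is illustrative, and in practice any standard logistic regression routine would be used instead.

```python
import math

def fit_logistic_2x2(a, b, c, d, iterations=25):
    """Newton-Raphson fit of logit(P) = b0 + b1 * exposed on a 2x2 table.

    a, b: prevalent and non-prevalent cases among the exposed;
    c, d: prevalent and non-prevalent cases among the unexposed.
    """
    n1, n0 = a + b, c + d
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        p1 = 1.0 / (1.0 + math.exp(-(b0 + b1)))   # fitted prevalence, exposed
        p0 = 1.0 / (1.0 + math.exp(-b0))          # fitted prevalence, unexposed
        g0 = (a - n1 * p1) + (c - n0 * p0)        # score for b0
        g1 = a - n1 * p1                          # score for b1
        w1 = n1 * p1 * (1.0 - p1)
        w0 = n0 * p0 * (1.0 - p0)
        det = (w1 + w0) * w1 - w1 * w1            # determinant of the information matrix
        b0 += (w1 * g0 - w1 * g1) / det
        b1 += (-w1 * g0 + (w1 + w0) * g1) / det
    return b0, b1

# Hypothetical survey counts: prevalence 0.5 among exposed, 0.25 among unexposed.
a, b, c, d = 500, 500, 250, 750
b0, b1 = fit_logistic_2x2(a, b, c, d)
cross_product_ratio = (a * d) / (b * c)
print(f"exp(b1) = {math.exp(b1):.3f}; cross-product ratio = {cross_product_ratio:.3f}")
```

With a single binary exposure the exponentiated coefficient reproduces the crude POR; adding covariables yields the adjusted counterpart of the same quantity.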
An exception whereby models like the robust Poisson, Cox, or log-binomial may be suitable for modeling data arising from a cross-sectional approach is when one is able to retrospectively reconstitute the entire empirical experience of a fixed population. This may be the case if it is possible to retrieve information by recall, not at all different from a retrospective non-concurrent cohort study in which one is recovering the history of a fixed population by way of health service records. This recall would be informing about what happened 'backwards in time', although incident cases taking place at some earlier point would only be counted at the moment of interviewing, irrespective of when cases occurred between time of inception ($t_0$) and time of interview ($t_1$).
One contention is that some subjects, whether exposed or non-exposed, would initiate this potential 'closed population' follow up, but would eventually not reach $t_1$ (for instance, in a sample of women interviewed at birth about an outcome event occurring during pregnancy). However, if this missingness is only conditional on exposure status and not on the outcome (i.e., data are missing at random, whether completely at random, MCAR, or just at random, MAR [31]), the proportions excluded should be balanced across exposure-outcome combinations, so that the frequency of outcome cases remains the same within exposure groups. Accordingly, the subjects accrued and observed at $t_1$ do not stand for prevalent and non-prevalent cases (by exposure group), but rather provide unbiased information on incidence as in the 'complete' cohort that would have been installed before any withdrawal took place (as abortion would be in the example above).
Thus, if the sample is complete or non-differential missingness holds, the design may be characterized as a classical 'fixed population' retrospective cohort, with exposed and non-exposed cohorts installed at $t_0$, and outcomes occurring along $\Delta t$ and being eventually measured and counted at $t_1$. This typifies incident cases and thus the causal measure of interest should be the CIR rather than the PR. To emphasize, although the approach is cross-sectional (at $t_1$), information concerns a cohort moving through time. Also note that, besides an identifiable inception moment ($t_0$), there is also a clearly identifiable follow-up period ($\Delta t$), as there should be to effectively obtain a CIR.
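The role of non-differential missingness can be illustrated with simple arithmetic. The Python sketch below starts from a hypothetical complete cohort (the counts and retention probabilities are illustrative assumptions only) and compares the CIR observed at $t_1$ when losses depend on exposure alone with the CIR observed when losses also depend on the outcome.

```python
# Hypothetical complete cohort installed at t0 (illustrative counts only).
FULL_COHORT = {
    "exposed": {"cases": 300, "noncases": 700},      # risk 0.30
    "unexposed": {"cases": 150, "noncases": 850},    # risk 0.15
}

def observed_cir(retention):
    """CIR among subjects still observed at t1.

    retention[group][status] is the probability that a subject of that
    exposure group and outcome status is still available at t1.
    """
    risks = {}
    for group, counts in FULL_COHORT.items():
        kept = {status: n * retention[group][status] for status, n in counts.items()}
        risks[group] = kept["cases"] / (kept["cases"] + kept["noncases"])
    return risks["exposed"] / risks["unexposed"]

complete = {g: {"cases": 1.0, "noncases": 1.0} for g in FULL_COHORT}
exposure_only = {                                    # losses depend on exposure alone
    "exposed": {"cases": 0.8, "noncases": 0.8},
    "unexposed": {"cases": 0.9, "noncases": 0.9},
}
outcome_dependent = {                                # losses also depend on the outcome
    "exposed": {"cases": 0.6, "noncases": 0.9},
    "unexposed": {"cases": 0.9, "noncases": 0.9},
}

cir_complete = observed_cir(complete)
cir_exposure_only = observed_cir(exposure_only)
cir_outcome_dependent = observed_cir(outcome_dependent)
print(f"complete: {cir_complete:.3f}; exposure-only losses: {cir_exposure_only:.3f}; "
      f"outcome-dependent losses: {cir_outcome_dependent:.3f}")
```

Losses that depend on exposure alone leave the within-group risks, and hence the CIR, untouched; once losses depend on the outcome, the observed CIR departs from that of the complete cohort.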
Decision tree for analyzing cross-sectional data
Building on the arguments provided so far, Figure 4 proposes a set of steps to be followed on deciding which measure to use when data are collected through a cross-sectional approach. If the purpose is genuinely to study prevalences in population subgroups, then simple uni- or bivariate analysis will suffice. As presented in the section on the purposes of cross-sectional studies, sometimes probability prediction models will also be useful.
Yet, given the aim is to use a cross-sectional approach to assess causality and the outcome is rare, whichever estimator is chosen will be adequate. Alas, rare outcome events are hardly ever studied in surveys. Outcomes are usually common and in this case, a crucial first step is to ask whether the structuring assumptions previously outlined are actually met. If not, there is little one can do and for practical reasons one has to clearly opt for a descriptive perspective at the most (although this is not less important from a public health perspective).
If all conditions are met (the population is in steady state/stationary; there is no selective survival; mean duration is the same in both exposure groups; there is no reverse causality; and temporal directionality from the exposure to the outcome is sustainable), the researcher then has to figure out if there is enough information available to recognize the time frame of the underlying population, including several time-related references such as $t_0$ and $\Delta t$.
If there is information on this time frame, the next step is to identify which variable is under focus as the exposure of interest in the analysis. If the researcher is fairly confident that the studied exposure refers to the beginning of the presumed de jure reconstituted follow up window and the individual risk periods are definable, then the analysis is analogous to a retrospective cohort study. For instance, if birth weight is the exposure of interest (vis-à-vis a childhood development benchmark as outcome, such as sitting unaided) and is retrieved from mothers of children one year of age, the researcher may be quite confident that s/he is capturing information regarding an exposure at the inception of the cohort that is being 'reconstituted' (birth) and that a recognizable 'closed' follow up period is demarcated (1 year). That being so, the measure of interest is the CIR and the robust Poisson, Cox or log-binomial models are perhaps the most suitable ensuing multivariable models.
Another possibility is that the exposure of interest may not be informative of an event occurring at $t_0$ and started somewhere along the reconstructed follow up period ($\Delta t$). Given that the timing of exposure occurrence $t_e$ lies within $\Delta t$ ($t_0 < t_e < t_1$), individual follow up times may no longer be equal and as a consequence some subjects would not be observed for the whole risk period. Since this is equivalent to an unequal person-time apportionment in a prospective study design, studying risk ($CI_i$) is no longer possible. The same applies if the exposure of interest changes status along $\Delta t$ and/or if the potential time of follow up varies across subjects.
Strictly, these scenarios are akin to that obtained in the vast majority of cross-sectional studies whereby data refer to a dynamic population and the time frame of an underlying followed up population is simply not recoverable. In all these situations, provided the structuring assumptions are tenable, the analysis proceeds as in a density sampling case-control study wherein non-prevalent cases are proportional to the respective person-time quantities and ultimately sustain an unbiased estimate of the IDR through the calculated cross-product ratio [30, 32]. Hence, as previously mentioned, the logistic model may be the most suitable related multivariable model.
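The sequence of decisions described in this subsection can be summarized as a small Python function. This is only one reading of the steps laid out in the text, not a transcription of Figure 4; the function name, argument names and returned labels are hypothetical.

```python
def choose_analysis(causal_aim, rare_outcome=False, conditions_met=False,
                    time_frame_known=False, exposure_at_inception=False):
    """Return (target measure, suggested analysis) following the steps described in the text.

    conditions_met stands for all five structuring conditions holding jointly;
    exposure_at_inception means the exposure refers to t0 and individual
    risk periods are definable.
    """
    if not causal_aim:
        return ("prevalences", "uni- or bivariate analysis; prediction models if subgroups are sparse")
    if rare_outcome:
        return ("PR or POR", "either estimator is adequate")
    if not conditions_met:
        return ("prevalences", "descriptive analysis only")
    if time_frame_known and exposure_at_inception:
        return ("CIR", "robust Poisson, Cox or log-binomial model")
    return ("IDR via POR", "logistic regression model")

# Birth weight recalled by mothers of one-year-olds, outcome 'sitting unaided':
# exposure at inception (birth) and a closed one-year follow up.
birth_weight_study = choose_analysis(causal_aim=True, conditions_met=True,
                                     time_frame_known=True, exposure_at_inception=True)

# Typical survey of a dynamic population with a common outcome and no recoverable time frame.
typical_survey = choose_analysis(causal_aim=True, conditions_met=True)

print(birth_weight_study)
print(typical_survey)
```

The ordering of the checks mirrors the text: the purpose of the study comes first, and the choice of model is the last step rather than the first.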
Putting it all together
In the light of our own contentions, it is worth re-examining the dispute 'between camps' alluded to in the introduction. As mentioned, several arguments against the POR have been raised [13–18]. One is that the POR is difficult to interpret and communicate. According to Pearce [19], Strömberg [12] and our own arguments, the POR should not be difficult to interpret once it is understood that the POR is an acceptable measure to estimate the IDR, which, in turn, also implies understanding what the quantities represented by the prevalent and non-prevalent cases stand for (akin to a density sampling case-control study [20, 33]).
Another argument against the POR was that it is very discrepant from the PR when outcomes are common. We stand with Pearce [19] in contending that "the fact that the two methods give different results when the disease is common [...] does not tell us which measure is more appropriate to use". We go beyond this assertion by putting forward that it is in fact a positive aspect that the POR is discrepant from the PR when the outcome is common since, except for very restrictive and most unrealistic circumstances, the latter does not stand for much as a causal parameter, whereas the former may be meaningful (namely, standing for the IDR) given certain conditions, e.g., stationarity and equal duration of disease. Although, according to Thompson et al. [18], these conditions are hardly met and would thus disfavor the POR, it is by no means clear how the PR would itself survive this criticism, since it is also strongly affected by any violation of these assumptions.
Yet another argument against the POR within the context of common outcomes is that the measure is at times misinterpreted as a cumulative incidence ratio (CIR). In our view, it does not seem fair to blame the measure rather than those who misinterpret it. Moreover, we believe that the same risk of misinterpretation holds for the PR.
The remark made by some authors such as Lee [15] that the only usefulness of the POR is to mimic other ratio measures should not be taken as a criticism but, on the contrary, as a positive facet. If anything, it is auspicious that the cross-product ratio generated by way of a cross-sectional approach is able to provide a contrast of incidence measures under some conditions, which is unlikely to occur with the PR. By extension, coefficients obtained in a logistic regression should be regarded as multivariate cross-product ratios that, given a causal framework, stand for unbiased estimates of IDRs.
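The link between the logistic coefficient and the cross-product ratio can be illustrated in the simplest, unadjusted case. The sketch below is an illustration only and assumes a single binary exposure with no covariates; the 2x2 counts are hypothetical and the Newton-Raphson routine is a hand-written stand-in for any standard logistic regression software. It shows that the exponentiated coefficient reproduces the crude cross-product ratio; whether that ratio stands for an IDR still depends on the causal assumptions discussed above.

```python
import math


def cross_product_ratio(a, b, c, d):
    """(a*d)/(b*c); a, b = exposed cases / non-cases, c, d = unexposed."""
    return (a * d) / (b * c)


def fit_logistic_binary_exposure(a, b, c, d, iterations=25):
    """Maximum-likelihood fit of logit(P) = b0 + b1 * exposure (Newton-Raphson)."""
    groups = [(1.0, a, a + b), (0.0, c, c + d)]  # (exposure, cases, total)
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        u0 = u1 = i00 = i01 = i11 = 0.0
        for x, cases, total in groups:
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            w = total * p * (1.0 - p)      # binomial variance weight
            resid = cases - total * p      # observed minus expected cases
            u0 += resid                    # score for the intercept
            u1 += x * resid                # score for the exposure term
            i00 += w
            i01 += x * w
            i11 += x * x * w
        det = i00 * i11 - i01 * i01        # determinant of the 2x2 information
        b0 += (i11 * u0 - i01 * u1) / det
        b1 += (i00 * u1 - i01 * u0) / det
    return b0, b1


# Hypothetical cross-sectional table: 300 exposed (120 prevalent cases),
# 700 unexposed (140 prevalent cases).
a, b, c, d = 120, 180, 140, 560
b0, b1 = fit_logistic_binary_exposure(a, b, c, d)
print("cross-product ratio:", cross_product_ratio(a, b, c, d))
print("exp(b1) from logistic fit:", math.exp(b1))
```

With further covariates in the model the same coefficient becomes a conditional (adjusted) cross-product ratio, which is the multivariate case referred to above.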
Beyond the points debated so far, Thompson et al. [18] additionally argue that, given the absence of longitudinal data and the inability to make proper causal inferences when cross-sectional data are used, it is best to use the PR because this overtly signals its "limited inferential value" and thus warns the reader, ensuring "truth in advertising". Again, it is our stand that this perspective is essentially ill-advised with regard to etiological inference. First and foremost, conditions for causal inference are either present (assumable) or not. If not, there is not much reason for estimating an effect measure and labelling it as of 'limited inferential value'. The discussion on the best estimator and related model only makes sense if preceded by a thorough debate about what is actually sought with a cross-sectional study and what the ensuing best measures for inference are. If the assumptions for etiological inference do not hold, there is no reason for modeling data in order to control for confounding. Dealing with confounders has no meaning outside causal reasoning and is only justified under a counterfactual logic [28, 34].
The growing literature on alternative multivariable models to estimate effects arising from cross-sectional data, instead of aiding the understanding and development of epidemiological research, may have cast more shadow than light on the matter. This is not because the intricacies of the proposed models are incorrect, but because the importance of actual epidemiological model building has been largely sidestepped. The conditions and appropriateness for modeling are taken for granted, yet, as this paper attempts to remind, considering theoretically based putative relations between the events of interest is crucial to the process. Outcomes and exposures, as well as other elements involved in the causal system (confounders, effect modifiers, mediators, colliders [30, 35]), require assessment not only on matters of substance and meaning, but also with regard to their temporal relations within the time frame of any given study. Only then are decisions to be taken in favor of any statistical model. And given the recognizably restricted situations where causal modeling is indeed obtainable from cross-sectional data, the POR (or rather, the calculated cross-product ratio as an estimate of the IDR) ought to be the most widely indicated estimator.
Summary
During the last two decades the choice between the Prevalence Ratio (PR) and the Prevalence Odds Ratio (POR) to contrast exposure groups in cross-sectional studies has been the subject of debate. In the last 10 years, this debate has become more focused on the choice between different regression models for estimating the PR. In this paper we used different scenarios to illustrate and sustain our point of view concerning two issues related to the analysis of cross-sectional data. Firstly, when conditions for causal inference in cross-sectional studies do not hold, crude or subgroup prevalences are the quantities to be presented. Secondly, when the assumptions for causal inference are met, two additional aspects need to be considered: (i) the cumulative incidence ratio (CIR) is not properly estimated using either the PR or the POR; (ii) the only measure that provides an unbiased estimate of the incidence density ratio (IDR) is the POR. An exception to these two statements is the case of rare diseases, which are not usually the subject of surveys. Based on these facts, we sustain that multivariate modeling should be restricted to scenarios where assumptions for causal inference from cross-sectional studies effectively hold, and that for such cases the logistic regression model remains the appropriate choice to capture the incidence contrast between exposure groups.
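Points (i) and (ii) and the rare-disease exception can be put side by side in a small numerical sketch. It is not a reproduction of the scenarios used in this paper and rests on simplifying assumptions introduced here: constant incidence densities, a closed cohort with no competing risks so that CI = 1 - exp(-ID x t), and a stationary population with equal mean duration in both groups so that P/(1 - P) = ID x D. The function name, the risk period and every numeric input are hypothetical.

```python
import math


def contrasts(id_unexposed, idr, mean_duration, risk_period):
    """Return the CIR, PR and POR implied by constant incidence densities."""
    id_exposed = idr * id_unexposed
    # Cumulative incidence over the risk period in a closed cohort.
    ci1 = 1.0 - math.exp(-id_exposed * risk_period)
    ci0 = 1.0 - math.exp(-id_unexposed * risk_period)
    # Stationary prevalence from prevalence odds = ID * mean duration.
    odds1 = id_exposed * mean_duration
    odds0 = id_unexposed * mean_duration
    p1 = odds1 / (1.0 + odds1)
    p0 = odds0 / (1.0 + odds0)
    por = (p1 / (1.0 - p1)) / (p0 / (1.0 - p0))
    return {"CIR": ci1 / ci0, "PR": p1 / p0, "POR": por}


# Hypothetical inputs: IDR = 2, mean duration = 2, risk period = 2 (same time unit).
rare = contrasts(id_unexposed=0.0005, idr=2.0, mean_duration=2.0, risk_period=2.0)
common = contrasts(id_unexposed=0.3, idr=2.0, mean_duration=2.0, risk_period=2.0)
for label, result in (("rare", rare), ("common", common)):
    print(label, {name: round(value, 3) for name, value in result.items()})
```

Under these assumptions all three measures nearly coincide with the IDR when the outcome is rare. When it is common, the POR still equals the IDR while neither the PR nor the POR matches the CIR.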
Abbreviations
CI: Cumulative incidence
CIR: Cumulative incidence ratio
ID: Incidence density
IDR: Incidence density ratio
OR: Odds ratio
P: Prevalence
POR: Prevalence odds ratio
PR: Prevalence ratio
RR: Risk ratio.
References
MacMahon B, Pugh T: Epidemiology: principles and methods. 1970, Boston: Little, Brown
Lilienfeld AM, Lilienfeld DE: Foundations of Epidemiology. 1980, New York: Oxford University Press, 2
Freeman J, Hutchison GB: Prevalence, incidence and duration. Am J Epidemiol. 1980, 112: 707-723.
Keiding N: Age-specific Incidence and Prevalence: a Statistical Perspective. Journal of the Royal Statistical Society. 1991, 154: 371-412. 10.2307/2983150.
Brunet RC, Struchiner CJ: A non-parametric method for the reconstruction of age- and time-dependent incidence from the prevalence data of irreversible diseases with differential mortality. Theor Popul Biol. 1999, 56: 76-90. 10.1006/tpbi.1999.1415.
Brunet RC, Struchiner CJ: Rate estimation from prevalence information on a simple epidemiologic model for health interventions. Theor Popul Biol. 1996, 50: 209-226. 10.1006/tpbi.1996.0029.
Marschner IC: A method for assessing age-time disease incidence using serial prevalence data. Biometrics. 1997, 53: 1384-1398. 10.2307/2533505.
Marschner IC: Fitting a multiplicative incidence model to age- and time-specific prevalence data. Biometrics. 1996, 52: 492-499. 10.2307/2532889.
Kleinbaum DG, Kupper LL, Morgenstern H: Epidemiologic Research: Principles and Quantitative Methods. 1982, New York: Van Nostrand Reinhold Company
Rothman KJ: Modern Epidemiology. 1986, Boston: Little, Brown and Co
Strömberg U: Prevalence odds ratio v prevalence ratio. Occup Environ Med. 1994, 51: 143-144. 10.1136/oem.51.2.143.
Strömberg U: Prevalence odds ratio v prevalence ratio - some further comments. Occup Environ Med. 1995, 52: 143. 10.1136/oem.52.2.143.
Lee J, Chia KS: Estimation of prevalence rate ratios for cross sectional data: an example in occupational epidemiology. Br J Ind Med. 1993, 50: 861-862.
Lee J, Chia KS: Use of the prevalence ratio v the prevalence odds ratio as a measure of risk in cross sectional studies. Occup Environ Med. 1994, 51: 841. 10.1136/oem.51.12.841.
Lee J: Odds ratio or relative risk for cross-sectional data? Int J Epidemiol. 1994, 23: 201-203. 10.1093/ije/23.1.201.
Axelson O, Fredriksson M, Ekberg K: Use of the prevalence ratio v the prevalence odds ratio as a measure of risk in cross sectional studies. Occup Environ Med. 1994, 51: 574. 10.1136/oem.51.8.574.
Axelson O, Fredriksson M, Ekberg K: Use of the prevalence ratio v the prevalence odds ratio in view of confounding in cross sectional studies. Occup Environ Med. 1995, 52: 494. 10.1136/oem.52.7.494.
Thompson ML, Myers JE, Kriebel D: Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done? Occup Environ Med. 1998, 55: 272-277. 10.1136/oem.55.4.272.
Pearce N: Effect measures in prevalence studies. Environ Health Perspect. 2004, 112: 1047-1050. 10.1289/ehp.6927.
Greenland S, Thomas DC: On the need for the rare disease assumption in case-control studies. Am J Epidemiol. 1982, 116: 547-553.
Rodrigues L, Kirkwood BR: Case-control design in the study of common diseases: updates on the demise of the rare disease assumption and the choice of sampling schemes for controls. Int J Epidemiol. 1990, 19: 205-213. 10.1093/ije/19.1.205.
Skov T, Deddens J, Petersen MR, Endahl L: Prevalence proportion ratios: estimation and hypothesis testing. Int J Epidemiol. 1998, 27: 91-95. 10.1093/ije/27.1.91.
Barros AJ, Hirakata VN: Alternatives for logistic regression in cross-sectional studies: an empirical comparison of models that directly estimate the prevalence ratio. BMC Med Res Methodol. 2003, 3: 21. 10.1186/1471-2288-3-21.
Petersen MR, Deddens JA: A comparison of two methods for estimating prevalence ratios. BMC Med Res Methodol. 2008, 8: 9. 10.1186/1471-2288-8-9.
Santos CA, Fiaccone RL, Oliveira NF, Cunha S, Barreto ML, do Carmo MB, Moncayo AL, Rodrigues LC, Cooper PJ, Amorim LD: Estimating adjusted prevalence ratio in clustered cross-sectional epidemiological data. BMC Med Res Methodol. 2008, 8: 80. 10.1186/1471-2288-8-80.
Behrens T, Taeger D, Wellmann J, Keil U: Different methods to calculate effect estimates in cross-sectional studies. A comparison between prevalence odds ratio and prevalence ratio. Meth Inform Med. 2004, 43: 505-509.
Little RJ, Rubin DB: Causal effects in clinical and epidemiological studies via potential outcomes: concepts and analytical approaches. Annu Rev Public Health. 2000, 21: 121-145. 10.1146/annurev.publhealth.21.1.121.
Greenland S, Brumback B: An overview of relations among causal modelling methods. Int J Epidemiol. 2002, 31: 1030-1037. 10.1093/ije/31.5.1030.
Greenland S: Interpretation and choice of effect measures in epidemiologic analyses. Am J Epidemiol. 1987, 125: 761-768.
Rothman KJ, Greenland S, Lash TL: Modern Epidemiology. 2008, Philadelphia, PA: Lippincott Williams & Wilkins, 3
Little RLA, Rubin DB: Statistical Analysis of Missing Data. 1992, Hoboken, NJ: John Wiley and Sons, Inc, 2
Wacholder S, McLaughlin J, Silverman D, Mandel J: Selection of controls in case-control studies. III. Design options. Am J Epidemiol. 1992, 135: 1042-1050.
Miettinen O: Estimability and estimation in case-reference studies. Am J Epidemiol. 1976, 103: 226-235.
Rothman KJ, Greenland S: Causation and causal inference in epidemiology. Am J Public Health. 2005, 95 (Suppl 1): S144-S150. 10.2105/AJPH.2004.059204.
Glymour MM: Using causal diagrams to understand problems in social epidemiology. Methods in social epidemiology. Edited by: Oakes JM, Kaufman JS. 2006, San Francisco, CA: Jossey-Bass, 393-428.
Prepublication history
The prepublication history for this paper can be accessed here: http://www.biomedcentral.com/14712288/10/66/prepub
Acknowledgements
MER was partially supported by the National Council for Scientific and Technological Development (CNPq), process no. 306909/20065 and 301221/20090. ESFC was partially supported by CNPq, process no. 302269/20088.
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
Both authors (MER and ESFC) made substantial contributions to conception, analysis and interpretation of the data. Following discussions on the outline of the paper both authors made separate contributions to the first drafting of the manuscript and, thereafter, interactively participated in critically revising the content up to the final (submitted) version. MER was responsible for setting up the scenarios and generating the computer programs (routines) for the analysis and entailing displays (tables and figures). The authors read and approved the final manuscript.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Reichenheim, M.E., Coutinho, E.S. Measures and models for causal inference in cross-sectional studies: arguments for the appropriateness of the prevalence odds ratio and related logistic regression. BMC Med Res Methodol 10, 66 (2010). https://doi.org/10.1186/1471-2288-10-66
DOI: https://doi.org/10.1186/1471-2288-10-66
Keywords
 Causal Inference
 Exposure Group
 Prevalence Ratio
 Risk Period
 Outcome Duration