 Research article
 Open Access
Treatment-seeking behaviour in low- and middle-income countries estimated using a Bayesian model
BMC Medical Research Methodology volume 17, Article number: 67 (2017)
Abstract
Background
Seeking treatment in formal healthcare for uncomplicated infections is vital to combating disease in low- and middle-income countries (LMICs). Healthcare treatment-seeking behaviour varies within and between communities and is modified by socioeconomic, demographic, and physical factors. As a result, it remains a challenge to quantify healthcare treatment-seeking behaviour using a metric that is comparable across communities. Here, we present an application for transforming individual categorical responses (actions related to fever) to a continuous probabilistic estimate of fever treatment for one country in sub-Saharan Africa (SSA).
Methods
Using nationally representative household survey data from the 2013 Demographic and Health Survey (DHS) in Namibia, individual-level responses (n = 1138) were linked to theoretical estimates of travel time to the nearest public or private health facility. Bayesian Item Response Theory (IRT) models were fitted via Markov Chain Monte Carlo (MCMC) simulation to estimate parameters related to fever treatment and estimate probability of treatment for children under five years. Different models were implemented to evaluate computational needs and the effect of including predictor variables such as rurality. The mean treatment rates were then estimated at regional level.
Results
Modelling results suggested probability of fever treatment was highest in regions with relatively high incidence of malaria historically. The minimum predicted threshold probability of seeking treatment was 0.3 (model 1: 0.340; 95% CI 0.155–0.597), suggesting that even in populations at large distances from facilities, there was still a 30% chance of an individual seeking treatment for fever. Individual-level predictions of treatment for a subset of the data (n = 247) agreed closely with the observed responses (AUC = 0.978), with a sensitivity of 96.7% and a specificity of 75.3%.
Conclusion
We have shown how individual responses in national surveys can be transformed to probabilistic measures comparable at population level. Our analysis of household survey data on fever suggested a 30% baseline threshold for fever treatment in Namibia. However, this threshold level is likely to vary by country or endemicity. Although our focus was on fever treatment, the methodology outlined can be extended to multiple health seeking behaviours captured in routine national survey data and to other infectious diseases.
Background
Delay in seeking treatment for ill health in low- and middle-income countries (LMICs) affects disease progression, management and outcomes [1–3]. Most infectious diseases in LMICs are preventable by using cost-effective interventions and treatable at peripheral health facilities [4]. However, weak health systems affect the delivery of most interventions [5], and socioeconomic and physical barriers that modify health-seeking behaviour compound this, leading to underutilisation of health facilities [6]. Encouraging appropriate treatment-seeking behaviour for uncomplicated infections is vital to further reduce disease burden in these countries or for successful elimination. For malaria, for example, the current World Health Organization (WHO) recommendation is for treatment to be sought in the formal healthcare sector within 24 hours of the onset of fever and other malaria-related symptoms [7]. This is because patients who seek treatment through the formal sector are likely to receive an appropriate diagnosis and effective management [8]. However, many factors influence population treatment-seeking behaviour, including, but not limited to, the availability of healthcare providers, proximity or travel time to healthcare facilities, condition severity and perception, and the sociodemographic profile of the population at risk [9].
Studies on treatment-seeking behaviour can be grouped into two categories of approach. The first is a qualitative description of steps undertaken by the population in different settings [10–12], while the second is a quantitative association between determinants (factors) and choice of health service use [13–18]. Although these approaches are used widely in biomedical research, they usually do not examine latent (i.e. theoretical) characteristics such as individual-level traits to estimate variation at population level. In addition, comparability is not guaranteed simply by using the same questionnaire because of differential item functioning, i.e. the varying behavioural response to the same question depending on the respondent [19]. Such variation can then be translated to spatially explicit applications that can be combined with existing spatial data on populations [20] and disease incidence to inform and optimise targeting of community-based interventions.
Model-based geostatistical methods have already been used to predict and estimate disease incidence at fine spatial resolution [21, 22]. This has been aided by public health intelligence data that are increasingly becoming available across space and time from geolocated nationally representative household surveys. These include the Malaria Indicator Surveys (MIS) [23], Demographic and Health Surveys (DHS) [24], and Multiple Indicator Cluster Surveys (MICS) [25]. These nationally representative household surveys also collect information on self-reported health behaviour such as fever management [14]. However, how can responses concerning fever treatment from household surveys be compared across populations with varying access, demographics, cultures, and disease burdens? Item response theory (IRT) has been widely used to examine survey items (questions) and person characteristics in psychology and education [26–28]. In education, for example, it has been used to estimate person-level traits (such as ability) or item-level difficulty in an examination [29–31]. IRT concepts can be extended to health, as applied previously in delirium screening [32], longitudinal data analysis [33], and interpreting medical codes from patient records [34]. IRT approaches are essentially probit models with additional regression effects used to aid estimation of item characteristics [35]. Extending this to a Bayesian framework has the advantage of incorporating uncertainty in estimating latent traits, and prior distributions can be imposed on the Bayesian probability model to capture many aspects of data not included in descriptive or quantitative frequentist approaches [36]. Although imputation techniques can be used to handle missing data, this was beyond the scope of this manuscript.
Here, the aim was to demonstrate the application of IRT to fever treatment-seeking modelling using data from a low malaria transmission setting, the Namibia 2013 DHS. We analyse fever treatment-seeking behaviour at a national level and derive response characteristic curves based on travel times to the nearest facilities. The rest of this paper is organised as follows. Section 2 provides an overview of household survey data in LMICs and the proposed modelling approach. We then present treatment-seeking behaviour model outputs in section 3, including evaluation of model performance. The paper concludes with a brief discussion in sections 4 and 5.
Methods
Data characteristics in low- and middle-income countries
Distance or proximity to healthcare provider is an important parameter in the choice of treatment by patients in many LMICs [37–39]. In these countries, the majority of people access facilities by walking. Therefore, it is preferable to use a facility close to the place of residence because it is less costly compared to travelling greater distances requiring motorised transport [40]. Other factors that influence utilisation patterns include: age, gender, healthcare costs, socioeconomic status, residence (urban or rural), familiarity with health personnel, fever severity, and quantity as well as quality of services at peripheral facilities [41, 42]. In some cases, however, the phenomenon of bypassing the nearest healthcare facility can be encountered, even for mild fever conditions [43, 44]. Empirical data are not always available to model such nuances and we therefore assume use of the nearest facility in this case study.
Estimation of travel times to the nearest formal healthcare treatment provider
Estimating travel times between population centres and formal healthcare providers has already been considered in previous research [14]. In brief, this requires a combination of mode of travel (walking or motorised) and an impedance surface that is constructed based on multiple data layers, including the various land use and land cover characteristics, elevation, and roads [45]. Travel time to nearest healthcare facility is a useful measure because it is relatively easy to estimate and to relate travel times in different settings compared to estimating the actual physical distance. The approach in Alegana et al. [14] shows how travel times for Namibia were derived.
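As a minimal sketch of the general idea, and not the procedure used in [14], travel time to the nearest facility can be computed as a least-cost path over a gridded impedance surface. The toy grid, the crossing times, the 4-neighbour movement rule and the function name `travel_time_to_nearest` are all illustrative assumptions.

```python
import heapq

def travel_time_to_nearest(friction, facilities):
    """Least-cost travel time (minutes) from every cell to its nearest facility.

    friction[r][c] is the assumed time to cross cell (r, c); a move between
    4-neighbours costs the mean of the two cells' crossing times.
    """
    rows, cols = len(friction), len(friction[0])
    best = [[float("inf")] * cols for _ in range(rows)]
    heap = []
    for r, c in facilities:          # multi-source Dijkstra: every facility starts at 0
        best[r][c] = 0.0
        heapq.heappush(heap, (0.0, r, c))
    while heap:
        t, r, c = heapq.heappop(heap)
        if t > best[r][c]:
            continue                 # stale queue entry, a shorter route was found
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                step = 0.5 * (friction[r][c] + friction[nr][nc])
                if t + step < best[nr][nc]:
                    best[nr][nc] = t + step
                    heapq.heappush(heap, (t + step, nr, nc))
    return best

# Hypothetical surface: 10 min per cell along a road (top row), 30 min off-road.
grid = [[10, 10, 10],
        [30, 30, 30],
        [30, 30, 30]]
times = travel_time_to_nearest(grid, facilities=[(0, 0)])
```

In this toy case the cheapest route to the far corner follows the road first and then leaves it, which is the behaviour an impedance surface is meant to capture.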
Quantification of formal healthcare use based on nationally representative household surveys
To estimate the utilisation of healthcare facilities, this study used the reported use of formal healthcare for fever treatment from the DHS. These surveys are conducted in 90 countries worldwide, including 44 in SSA, providing information on reproductive health, fertility, population demographics and general health status, nutrition, household characteristics, socioeconomic status and infant and child mortality rates [46]. The surveys are based on a random two-stage cluster sampling design in which clusters are usually first sampled within a region on a probability-proportional-to-size basis and thereafter, within each cluster, households are sampled randomly [47, 48]. Cluster sizes usually vary, but are typically approximately 15 to 30 households. The household survey provides information on health and the sociodemographic profile of consenting participants, including their treatment-seeking behaviour for conditions such as malaria-associated fever.
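The two-stage design described above can be sketched as follows. This is an illustration only: the cluster sizes are hypothetical, systematic selection is one common way of achieving probability proportional to size, and the actual DHS sampling protocol [47, 48] includes stratification and weighting steps that are not shown.

```python
import random

def pps_systematic(sizes, n_clusters, rng):
    """Stage 1: pick cluster indices with probability proportional to size
    by stepping at a fixed interval along the cumulative size scale."""
    total = sum(sizes)
    step = total / n_clusters
    start = rng.random() * step
    chosen, cum, idx = [], 0.0, 0
    for k in range(n_clusters):
        point = start + k * step
        # advance until the cumulative size first exceeds the selection point
        while idx < len(sizes) - 1 and cum + sizes[idx] <= point:
            cum += sizes[idx]
            idx += 1
        chosen.append(idx)
    return chosen

def two_stage_sample(sizes, n_clusters, households_per_cluster, seed=0):
    """Stage 2: simple random sample of household ids within each chosen cluster."""
    rng = random.Random(seed)
    clusters = pps_systematic(sizes, n_clusters, rng)
    return {c: sorted(rng.sample(range(sizes[c]), households_per_cluster))
            for c in clusters}

cluster_sizes = [120, 80, 200, 150, 60, 90, 300, 100]   # hypothetical household counts
sample = two_stage_sample(cluster_sizes, n_clusters=3, households_per_cluster=20)
```

Because every hypothetical cluster is smaller than the sampling interval, no cluster can be selected twice in this example.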
A notable feature of the fever treatment variable in the DHS is the decay in treatment with increasing travel time to the nearest facility (Fig. 1). The geographical barrier to utilisation, manifested as a distance decay, is a well-known phenomenon in studies of healthcare utilisation [37, 38, 49] and occurs when usage of health facilities declines with increasing distance [50, 51]. This feature motivates the use of probit models to characterise treatment-seeking behaviour (section 2.4). Another feature of utilisation is that, even for patients in close proximity to healthcare facilities, the treatment rate for fever does not reach 100%, as some mild conditions self-resolve, are treated through informal care, or may be treated at a more distant facility [9]. Household survey data usually contain detailed information on other factors that could affect utilisation of healthcare facilities. These explanatory variables can be grouped largely into socioeconomic and demographic characteristics and have been used selectively in quantitative studies of healthcare utilisation [3, 10, 17, 52, 53].
Application of Bayesian probit models to healthcare utilisation research
Item response modelling was proposed in the 1960s [54–56] and is commonly applied to studies in education and psychology to estimate item characteristics [28]. The first applications of IRT used maximum likelihood estimation [57, 58]. Bayesian extensions were proposed for one- and two-parameter models [59] and extended to the three-parameter logistic model [60]. Fitting via Gibbs sampling became popular using data augmentation (DAG) techniques in the 1990s, particularly for application to the normal-ogive models [61–63]. Fu et al. [64] provided some extensions to the three-parameter model following Sahu’s DAG approach [63] and compared Gibbs sampling to BILOG-MG software [65] using likelihood estimation. There have also been other innovations in parameter estimation [62], including extension to a multilevel approach [26–28] and comparison with maximum likelihood methods [66]. Here, a unidimensional three-parameter model with a hierarchical structure was used, its parameters estimated, and prior sensitivity checked by comparing model goodness-of-fit statistics. The main objective was to estimate the probability of a positive response to choice of treatment for persons with fever associated with malaria at a household level.
In general, let \( Y_{ij} \) represent a dichotomous response variable of an individual \( j \) (\( j = 1, \ldots, N \)) on a set of questions (items) \( i \) (\( i = 1, \ldots, n \)) on use of public healthcare for treatment. \( Y_{ij} = 1 \) represents a positive response on one item (e.g. public healthcare use), while \( Y_{ij} = 0 \) represents a negative response (e.g. non-public healthcare use). The probability of \( Y_{ij} = 1 \) can then be written following [64] as:
where \( \theta_j = (\theta_{j1}, \ldots, \theta_{jk}, \ldots, \theta_{jm}) \), with \( -\infty < \theta_{jk} < +\infty \) for dimensions \( k = 1, \ldots, m \), represents the person traits (i.e. the ability parameter). \( a_{ik} \) represents the item discrimination parameters, which separate individuals by individual-level traits and are positive (\( a_{ik} > 0 \)). \( b_{ik} \) (\( -\infty < b_{ik} < \infty \)) represents the item difficulty (or location) parameters, which for multiple items represent the relationship between items and the underlying individual traits (see Appendix for a full glossary of symbols). Lastly, \( c_i \) (\( 0 < c_i < 1 \)) represents the threshold (i.e. minimum) probability for the item in question (fever treatment). This specification of threshold probability is important to this application because the estimated probability is never equal to one when \( \theta_j \) is zero, due to several individual characteristics. Hence, probability of treatment is constrained to be greater than zero and less than one. In many applications in psychology and education, the ability parameter, for example, is modelled as a latent characteristic independent of survey observations [67, 68]. In this application, a predictor variable was introduced on the individual traits parameter in terms of travel time to the nearest health facility. This parameterisation also enables the introduction of other variables such as residence (urban or rural), socioeconomic status, or educational levels. Thus, Eq. 1 can be simplified to:
Where
with \( \beta_{Qj} \) representing the coefficients of the predictor variables \( X_{Qj} \) that explain differences in ability.
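For orientation, a standard unidimensional (m = 1) three-parameter normal-ogive form that is consistent with the symbols defined in this section is sketched below. It is an assumption based on the general model class in [64], not a reproduction of the authors' Eq. 1 and Eq. 2: the sign convention inside \( \Phi \), the intercept \( \alpha \) and the summation index \( q \) are illustrative, and \( \Phi \) denotes the standard normal cumulative distribution function.

```latex
P(Y_{ij} = 1 \mid \theta_j, a_i, b_i, c_i)
  = c_i + (1 - c_i)\,\Phi\!\left(a_i \theta_j - b_i\right),
\qquad
\theta_j = \alpha + \sum_{q=1}^{Q} \beta_{q} X_{qj}
```

Under this form the probability tends to \( c_i \) as the linear predictor decreases, which is why \( c_i \) acts as a lower threshold on the response curve.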
The likelihood and posterior specification
In general, let f(θ, a, b, c) denote the joint prior density of the collection of unknown parameters. The posterior can then be expressed as the product of the likelihood and the prior distributions for the unknown parameters, given as:
where f(θ, a, b, c) = f(θ)f(a)f(b)f(c) and the posterior density we wish to evaluate is
where D is a proportionality constant and
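The structure of this section, a Bernoulli likelihood multiplied by independent priors, can be illustrated with a short sketch of an unnormalised log-posterior for a single item. Everything specific here is an assumption made for illustration: the response function is the standard three-parameter normal-ogive form with travel time as the only predictor, the prior settings (unit-scale half-normal on a, variance 10 on b, Beta(2, 2) on c, standard normal on α and β) are placeholders, and the data are invented.

```python
import math

def phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def prob_treatment(t, a, b, c, alpha, beta):
    """Assumed response curve: c + (1 - c) * Phi(a * theta - b), theta = alpha + beta * t."""
    theta = alpha + beta * t
    return c + (1.0 - c) * phi(a * theta - b)

def log_likelihood(y, times, a, b, c, alpha, beta):
    """Bernoulli log-likelihood of 0/1 responses y given travel times."""
    ll = 0.0
    for yi, ti in zip(y, times):
        p = prob_treatment(ti, a, b, c, alpha, beta)
        p = min(max(p, 1e-12), 1.0 - 1e-12)      # guard against log(0)
        ll += math.log(p) if yi == 1 else math.log(1.0 - p)
    return ll

def log_prior(a, b, c, alpha, beta, kappa=2.0, tau=2.0):
    """Sum of independent log-priors, f(theta) f(a) f(b) f(c), up to a constant."""
    if a <= 0.0 or not 0.0 < c < 1.0:
        return float("-inf")                     # outside the prior support
    lp = -0.5 * a * a                            # half-normal on a (assumed scale 1)
    lp += -0.5 * b * b / 10.0                    # normal on b (assumed variance 10)
    lp += (kappa - 1.0) * math.log(c) + (tau - 1.0) * math.log(1.0 - c)
    lp += -0.5 * alpha * alpha - 0.5 * beta * beta
    return lp

def log_posterior(y, times, a, b, c, alpha, beta):
    """Unnormalised log-posterior: log-prior plus log-likelihood."""
    lp = log_prior(a, b, c, alpha, beta)
    if lp == float("-inf"):
        return lp
    return lp + log_likelihood(y, times, a, b, c, alpha, beta)

y = [1, 1, 0, 1, 0]                    # hypothetical responses (1 = sought treatment)
times = [0.2, 0.5, 3.0, 1.0, 4.0]      # hypothetical travel times in hours
example_lp = log_posterior(y, times, a=1.0, b=0.0, c=0.3, alpha=0.5, beta=-0.5)
```

An MCMC sampler such as the one in JAGS explores this surface without needing the normalising constant, which is why only proportionality is required.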
Goodness-of-fit statistics, prior specification and Markov chain Monte Carlo implementation
The same notation was used for the item discrimination parameter, with \( a_i > 0 \), where a half-normal or truncated normal prior was used such that \( a_i \sim N(\mu_a, \sigma_a^2)\,I(a_i > 0) \) and \( I(\cdot) \) is an indicator function. The rationale for this specification is to ensure that the parameter estimate is positive. The probability threshold parameter was constrained to \( c \in (0, 1) \) using a beta distribution such that \( \pi(c_k; \kappa, \tau) \propto c_k^{\kappa - 1} (1 - c_k)^{\tau - 1} \) for suitable parameter values \( \kappa_{c_k} \) and \( \tau_{c_k} \). The recommended procedure is to select these parameters with reference to the prior mean \( E(c) = \kappa / (\kappa + \tau) \), and weakly informative priors may be used for the parameters of the beta distribution.
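One way to apply the relation E(c) = κ/(κ + τ) when choosing the beta prior is sketched below. The helper name `beta_params`, the target mean of 0.3 and the concentration of 4 are hypothetical choices for illustration and are not the values used in the study.

```python
def beta_params(mean, concentration):
    """Return (kappa, tau) such that kappa / (kappa + tau) equals `mean`
    and kappa + tau equals `concentration` (smaller values give a weaker prior)."""
    if not 0.0 < mean < 1.0 or concentration <= 0.0:
        raise ValueError("mean must lie in (0, 1) and concentration must be positive")
    kappa = mean * concentration
    tau = (1.0 - mean) * concentration
    return kappa, tau

kappa, tau = beta_params(mean=0.3, concentration=4.0)   # hypothetical settings
prior_mean = kappa / (kappa + tau)
prior_var = kappa * tau / ((kappa + tau) ** 2 * (kappa + tau + 1.0))
```

Holding the mean fixed and lowering the concentration widens the prior, which is one simple route to a weakly informative specification.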
Two different specifications were used for the difficulty parameter. The first was a normal prior \( b_i \sim N(0, 10) \) (model 1) and the second a truncated normal (model 2), \( b_i \sim N(\mu_b, \sigma_b^2)\,I(b_i > 0) \), restricting \( b_i \) to be positive. Thus, the difference between model 1 and model 2 was only in the prior specification for the b parameter. Figure 2a represents the overall parameter structure for model 1 and model 2. The rationale for using different priors for b was to evaluate the effect of constraining item difficulty to positivity (\( b_i > 0 \)) compared to allowing for a flexible Gaussian density.
Lastly, the individual-trait parameter \( \theta_j \) was modelled in a hierarchical approach following Fox and Glas [30] such that the joint distribution of the \( \theta_j \) parameters follows a multivariate normal distribution. Thus, in general, α and β are the intercept terms and regression coefficients, respectively, modelled as independent effects in model 1 and model 2 (Fig. 2a). In extending the model to a multilevel representation, time to the nearest facility could then be used to explain individual traits. Normal priors (e.g. \( \alpha \sim N(0, 1) \)) were used for α and β in Fig. 2a. Secondly, this was extended to a random intercept in model 3 (Fig. 2b) and lastly, to a random slope and intercept model including residence (urban or rural) as a centering variable (model 4, Fig. 2c). For model 4, the random slope and intercept were modelled jointly as:
with a Wishart (multivariate scaled \( \chi^2 \)) distribution (Barnard et al. 2000) with density \( f(\Sigma) \propto |\Sigma|^{-(\nu + d + 1)/2} \exp\left\{ -\tfrac{1}{2} \mathrm{tr}\left( \Lambda \Sigma^{-1} \right) \right\} \), where d is the dimension of the matrix and ν the degrees of freedom, specified for the covariance matrix Σ. Thus, the inverse is specified as \( \Sigma^{-1} \sim \mathrm{Wishart}(\Omega, p) \), where Ω is a scale matrix, usually the identity, and p is the degrees of freedom, equal to the number of random components. Alternative approaches could employ a scaled inverse-Wishart distribution because of the large standard errors associated with large variances in the use of the inverse-Wishart prior [69].
Validation was considered via a randomly selected subset of the data (n = 247 of the 1138 children, approximately 22%), with the remaining observations (n = 891, approximately 78%) used in model estimation. Model 1 was then applied to the validation set and the predicted probability of treatment transformed to a binary outcome. A receiver operating characteristic (ROC) curve was then used to derive the specificity and sensitivity of predictions when compared to observed responses from survey data. For estimation, different model specifications were also used to check the sensitivity to different prior specifications (i.e. models differing only in prior structure) and to complexity. Model outputs were evaluated and compared via goodness-of-fit statistics, for example, the Deviance Information Criterion (DIC). The DIC summarises model fit based on a combination of model deviance and complexity (effective number of parameters) [70, 71]. This is defined as:
\( \mathrm{DIC} = \overline{D} + p_D \)
where \( \overline{D} = E_{\theta \mid y}[D] \) is the mean deviance for \( D = -2 \log\{P(y \mid \theta)\} \), with \( \widehat{D} = D(\overline{\theta}) \) the deviance evaluated at the posterior mean of the parameters,
and complexity (effective number of parameters) given by \( p_D = \overline{D} - \widehat{D} \). The two parameters were monitored in the MCMC implementation using five chains in JAGS version 4.2.0 and the rjags package in R version 3.3.1 [72]. A combination of the Gelman-Rubin [73] and Raftery-Lewis [74] diagnostics was used to check for convergence. For the former, we checked for a potential scale reduction factor of <1.05, while the latter provided estimates of burn-in and thinning factors given an accuracy of 0.0005 at quantile 0.025 and a coverage probability of 0.975.
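The DIC arithmetic can be illustrated with a few lines of code, assuming the standard definition in which the criterion is the mean deviance plus the effective number of parameters. The deviance draws below are invented, and the function name `dic` is hypothetical; in practice these quantities would come from the MCMC output rather than being typed in.

```python
def dic(deviance_samples, deviance_at_posterior_mean):
    """Return (DIC, pD, mean deviance) from posterior deviance draws.

    pD is the mean deviance minus the deviance at the posterior mean,
    and DIC is the mean deviance plus pD.
    """
    d_bar = sum(deviance_samples) / len(deviance_samples)
    p_d = d_bar - deviance_at_posterior_mean
    return d_bar + p_d, p_d, d_bar

draws = [3612.0, 3610.5, 3614.2, 3611.3]     # hypothetical deviance draws
dic_value, p_d, d_bar = dic(draws, deviance_at_posterior_mean=3609.0)
```

Lower values indicate a better trade-off between fit and complexity, so a more complex model must reduce the mean deviance by more than its added pD to be preferred.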
Results
We used the Namibia 2013 DHS data to estimate the probability of fever treatment in the formal sector (reported fever treatment in public and private sectors) for children under five years. There were 4818 children under five years enumerated, of which 1138 (23.6%) reported at least one fever episode in the preceding fortnight. Of those that reported a fever episode, 726 (63.8%) sought treatment in the formal sector (public and private sector excluding traditional healers). Overall, the proportion of children with reported fever was fairly homogeneous across all the regions surveyed but varied by estimated travel times. Estimation of probability of treatment focussed on children reporting fever (n = 1138) rather than all children examined in the cross-sectional survey.
In terms of computation, the Gelman-Rubin statistic was less than or equal to 1.05 for all the parameters monitored in the MCMC implementation. However, the Raftery-Lewis method showed that a minimum of 55,318 iterations was required to achieve an accuracy of 0.0005 at a coverage probability of 0.999 with the quantile at 0.05. More than 100,000 iterations with a burn-in of 50,000 were implemented. Table 1 shows the DIC estimates and the effective number of parameters from the four models implemented. Comparison between model 1 (M1 DIC 3615.9) and model 2 (M2 DIC 3685.1) suggests that using truncated normal priors for the b parameter did not improve model fit. DIC also increased for models 3 and 4, in line with the increase in model complexity from including a random intercept and slope. This also increased computational demands for M3 and M4, which required at least 250,000 iterations with a longer burn-in (slow convergence). The difference in DIC estimates also suggested that the models were sensitive to changes in model structure. Based on a binary classification of predicted probability at the individual level from model 1, the area under the curve (AUC) was 0.978 with a sensitivity of 96.7% and a specificity of 75.3% (155 true positive, 21 false positive, 64 true negative, and 7 false negative).
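The classification measures follow directly from the four counts quoted above, as the sketch below shows (the function name `classification_summary` is hypothetical). With the counts as printed, specificity reproduces the reported 75.3%, whereas sensitivity works out at about 95.7%, slightly below the reported 96.7%; the source of this small difference cannot be determined from the text.

```python
def classification_summary(tp, fp, tn, fn):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)      # share of observed positives predicted positive
    specificity = tn / (tn + fp)      # share of observed negatives predicted negative
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return sensitivity, specificity, accuracy

# Counts as reported for the validation subset.
sens, spec, acc = classification_summary(tp=155, fp=21, tn=64, fn=7)
n_validation = 155 + 21 + 64 + 7
```

The four counts sum to 247, matching the size of the validation subset.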
Table 2 shows posterior estimates of the parameters along with 95% equal-tailed credible intervals. A plot of fever-response curves based on the fitted parameters is shown in Fig. 3a along with a scatterplot of α and β parameters from model 4 (Fig. 3b), the posterior density of parameters (Fig. 3c) and the ROC plot (Fig. 3d). Different mean combinations of parameters a, b, and c resulted in response characteristics based on travel time to the nearest health facility (Fig. 3a). Parameter estimates could be compared and interpreted jointly in this manner because they apply to one item (on estimating fever treatment). Comparison between model 1 and model 2 suggested that constraining the b parameter did not have a major impact on mean estimates of the individual-level traits, a, or the threshold parameter c. Overall, model 4 had larger person discriminant parameter estimates (mean and median) compared to all the other model specifications. The correlation between mean estimates for α and β as estimated from the model was weak (mean 0.011, median 0.006; scatterplot in Fig. 3b). The combination of correlation and DIC estimates suggested that a fixed, independent prior specification was the better choice. It also imposes less computational demand. The threshold probability was >0.3 for all model estimates, suggesting this as the lower limit probability of use of the nearest facility for fever treatment in the four models implemented from the 2013 Namibia DHS.
Table 3 shows the estimated mean probabilities for malaria-related fever treatment at a regional level in Namibia with associated credible intervals and population estimates. Population estimates are useful in estimating fever treatment burden based on the probability estimate at regional level. For malaria, the probability of fever treatment among febrile cases was highest in endemic areas in Zambezi and Kavango (mean probability in Zambezi 0.546 (95% Credible Interval (CI): 0.369–0.671)) compared to Kunene, with less than one case per 1000 population and a mean probability of 0.433 (95% CI: 0.364–0.614). Overall mean probability of fever treatment was greater than 0.5 in areas with malaria incidence >1 per 1000 population.
Discussion
Characterising treatment-seeking behaviour in LMICs is valuable because it varies by geographic location, type of disease and severity, person characteristics including age and gender, as well as health-system-based factors such as availability and cost, among other enabling factors [9, 75, 76]. Here, the focus was on the estimation of latent parameters of a survey question on fever and estimating the probability of seeking treatment based on a dichotomous response. We used data from a nationally representative household survey from the DHS in one country to estimate fever treatment latent characteristics using a Bayesian IRT approach. By using this method, we estimated the parameters of fever response curves that characterise geographical decay in the use of formal health care based on travel time to the nearest facility. The method is particularly appealing because of the joint estimation of IRT parameters related to fever treatment with uncertainties incorporated in prior distributions and the ability to extract the full posterior distribution compared to point estimates from maximum likelihood approaches [26, 61]. This is important because estimates from such probabilistic modelling can then be applied in estimating numbers of symptomatic infections (treatment burden) when such probabilistic estimates are transformed into gridded metrics that vary spatially [77, 78]. The modelling approach can also be extended to other items in household surveys to further understand human behaviour response to health conditions.
The lower limit probability estimated here, related to the threshold parameter (e.g. from Table 2 model 1: 0.340; 95% CI 0.155–0.597), for Namibia suggests that even at large distances from health facilities, there was still a 30% chance of individuals seeking fever treatment. We suggest that this is an important property in treatment-seeking behaviour for individuals living far from health facilities in Namibia, although this threshold may be different by country or endemicity and was not explored further in this analysis. In this study, estimates of probability of fever treatment at the regional level showed that the mean probability was highest in regions with relatively high incidence of malaria historically (Table 3). Another operational application of the probability response characteristics curves, derived from the latent parameters in Fig. 3a, could be in identifying areas where community health workers could be deployed [79, 80]. This, however, requires definition of a cut-off probability (y-axis on Fig. 3a), currently not established for malaria transmission settings, to delineate areas with limited access. Constraining the b parameter (item parameter) did not influence estimates of the individual-level traits and the threshold parameters. This is primarily because only one item was used in this application, resulting in similar parameter estimates for the location parameter.
In extending the model to a multilevel framework, travel times were used as predictors. Comparison between constant intercept and slope model parameters with a random parameter model showed that the former resulted in shorter MCMC runs and better model fit compared to the latter (i.e., the random slope and intercept), which experienced slow convergence as the number of effective parameters increased exponentially. We are not discouraging use of a more complex modelling approach while estimating IRT parameters, but this highlights the increasing computational demands and efficiency related to increased complexity.
MCMC techniques were used to estimate and jointly interpret IRT parameters. The three-parameter logistic model [60] was particularly useful compared to the two-parameter model [59] because the third parameter c represents the threshold probability on the fever response curve, ensuring that probability is always greater than or equal to zero. Despite the known benefits of IRT in other fields [28], this approach has seldom been applied to modelling human behavioural aspects of treatment-seeking behaviour. The current study was confined to patients’ responses to a fever question in household survey data and how latent (rather than observed) properties can be quantified in relation to patient behaviour and travel time. Dichotomous responses are common in many health surveys in LMICs and methods used here can be extended to other health conditions. Although we did not have to deal with missing data (NAs), several data imputation techniques can be used for non-ignorable NAs [81]. These may arise from a lack of response, a refusal to participate, or simply an unobserved variable for survey items. When NAs are imputed into the data matrix, for example, these do not usually contribute to likelihood estimation [82] of the ability parameter, and the higher the number of missing values, the more likely it is that uncertainty in the parameter estimate will increase.
There exist some additional limitations aside from those related to computational speed and efficiency. While fever in the Namibia 2013 DHS was associated with malaria treatment, the survey data did not include a laboratory confirmation of malaria infection [83]. Moreover, the sampling methodology for children with fever in the DHS may be inferior because the survey is not powered for fever detection [47]. Most current surveys, however, incorporate rapid diagnostic tests (RDTs), and future identification of febrile cases could include laboratory results as a pre-processing step in identifying malaria-related fever cases. In addition, although prior specifications introduce a measure of uncertainty in a hierarchical way, assumptions in generating input data, such as use of the nearest facility, may not be sufficient in understanding treatment-seeking behaviour. It has been shown in separate population surveys that patients may bypass the nearest health centre due to various individual or supply-based factors such as quality [84]. While an obvious recommendation is to include such effects, increasing model complexity to capture such differences may have an impact on computational efficiency, as seen in model 3 and model 4. More importantly, identifying measures of quality of care in public or private health sectors can be challenging [40].
Conclusion
In the context of fever treatment, we have demonstrated that there is potential to use nationally representative household data to provide a probabilistic measure of treatment using a Bayesian method. Our estimates of threshold probability apply to one low malaria transmission country and may be different in other countries with varying malaria endemicity. Future studies will aim to conduct such comparative analysis between and within countries via spatially varying parameters. The methodology can be extended to multiple human behavioural questions (items) related to health and demographics in the routine national survey data.
Abbreviations
AUC: Area under the curve
CHW: Community health worker
DAG: Data augmentation
DHS: Demographic and Health Surveys
DIC: Deviance information criterion
iCCM: Integrated community case management
IRT: Item response theory
LMICs: Low- and middle-income countries
MCMC: Markov chain Monte Carlo
MICS: Multiple Indicator Cluster Surveys
MIS: Malaria Indicator Surveys
ROC: Receiver operating characteristic
SSA: Sub-Saharan Africa
WHO: World Health Organization
References
 1.
Rudan I, El Arifeen S, Bhutta ZA, Black RE, Brooks A, Chan KY, Chopra M, Duke T, Marsh D, Pio A, et al. Setting Research Priorities to Reduce Global Mortality from Childhood Pneumonia by 2015. PLoS Med. 2011;8(9):e1001099.
 2.
Herbert HK, Lee ACC, Chandran A, Rudan I, Baqui AH. Care Seeking for Neonatal Illness in Low- and Middle-Income Countries: A Systematic Review. PLoS Med. 2012;9(3):e1001183.
 3.
Colvin CJ, Smith HJ, Swartz A, Ahs JW, de Heer J, Opiyo N, Kim JC, Marraccini T, George A. Understanding care-seeking for child illness in sub-Saharan Africa: a systematic review and conceptual framework based on qualitative research of household recognition and response to child diarrhoea, pneumonia and malaria. Soc Sci Med. 2013;86:66–78.
 4.
World Health Organizastion: Integrated communitybased interventions. In., vol. TDR/BL 11.10. Geneva: World Health Organization; 2009.
 5.
The malE. R. A. Consultative Group on Health Systems Operational R. A Research Agenda for Malaria Eradication: Health Systems and Operational Research. PLoS Med. 2011;8(1):e1000397.
 6.
Bhutta ZA, Sommerfeld J, Lassi ZS, Salam RA, Das JK. Global burden, distribution, and interventions for infectious diseases of poverty. Infect Dis Poverty. 2014;3:21.
 7.
World Health Organization. Guidelines for the treatment of malaria. 3rd ed. Geneva: WHO; 2015.
 8.
Landier J, Parker DM, Thu AM, Carrara VI, Lwin KM, Bonnington CA, Pukrittayakamee S, Delmas G, Nosten FH. The role of early detection and treatment in malaria elimination. Malar J. 2016;15(1):1–8.
 9.
Littrell M, Gatakaa H, Evance I, Poyer S, Njogu J, Solomon T, Munroe E, Chapman S, Goodman C, Hanson K, et al. Monitoring fever treatment behaviour and equitable access to effective medicines in the context of initiatives to improve ACT access: baseline results and implications for programming in six African countries. Malar J. 2011;10(1):327.
 10.
Sundararajan R, Mwanga-Amumpaire J, Adrama H, Tumuhairwe J, Mbabazi S, Mworozi K, Carroll R, Bangsberg D, Boum Y II, Ware NC. Sociocultural and Structural Factors Contributing to Delays in Treatment for Children with Severe Malaria: A Qualitative Study in Southwestern Uganda. Am J Trop Med Hyg. 2015;92(5):933–40.
 11.
O’Neill S, Gryseels C, Dierickx S, Mwesigwa J, Okebe J, d’Alessandro U, Grietens KP. Foul wind, spirits and witchcraft: illness conceptions and health-seeking behaviour for malaria in the Gambia. Malar J. 2015;14(1):167.
 12.
Hadley M. Does increase in utilisation rates alone indicate the success of a user fee removal policy? A qualitative case study from Zambia. Health Policy. 2011;103(2–3):244–54.
 13.
Battle KE, Bisanzio D, Gibson HS, Bhatt S, Cameron E, Weiss DJ, Mappin B, Dalrymple U, Howes RE, Hay SI, et al. Treatment-seeking rates in malaria endemic countries. Malar J. 2016;15(1):1–11.
 14.
Alegana V, Wright J, Petrina U, Noor A, Snow R, Atkinson P. Spatial modelling of healthcare utilisation for treatment of fever in Namibia. Int J Health Geogr. 2012;11(1):6.
 15.
Wasunna B, Okiro EA, Webster J, Todd J, Snow RW, Jones C. The Impact of a Community Awareness Strategy on Caregiver Treatment Seeking Behaviour and Use of Artemether-Lumefantrine for Febrile Children in Rural Kenya. PLoS ONE. 2015;10(7):e0130305.
 16.
Aung T, Lwin MM, Sudhinaraset M, Wei C. Rural and urban disparities in health-seeking for fever in Myanmar: findings from a probability-based household survey. Malar J. 2016;15(1):386.
 17.
Matovu F, Nanyiti A, Rutebemberwa E. Household health care-seeking costs: experiences from a randomized, controlled trial of community-based malaria and pneumonia treatment among under-fives in eastern Uganda. Malar J. 2014;13:222.
 18.
Manongi R, Mtei F, Mtove G, Nadjm B, Muro F, Alegana V, Noor AM, Todd J, Reyburn H. Inpatient child mortality by travel time to hospital in a rural area of Tanzania. Trop Med Int Health. 2014;19(5):555–62.
 19.
Hays RD, Morales LS, Reise SP. Item response theory and health outcomes measurement in the 21st century. Med Care. 2000;38(9 Suppl):II28–42.
 20.
The WorldPop project [http://www.worldpop.org.uk/]. Accessed Nov 2016.
 21.
Dalrymple U, Mappin B, Gething P. Malaria mapping: understanding the global endemicity of falciparum and vivax malaria. BMC Med. 2015;13(1):140.
 22.
Alegana VA, Atkinson PM, Pezzulo C, Sorichetta A, Weiss D, Bird T, Erbach-Schoenberg E, Tatem AJ. Fine resolution mapping of population age-structures for health and development applications. J R Soc Interface. 2015;12(105). doi:10.1098/rsif.2015.0073.
 23.
Roll Back Malaria Monitoring and Evaluation Reference Group. A guide to Malaria Indicator Surveys (MIS). Geneva: Roll Back Malaria Monitoring and Evaluation Reference Group; 2008.
 24.
Measure DHS. Demographic and Health Surveys. 2011. [http://www.measuredhs.com]. Accessed Sept 2016.
 25.
United Nations Children Fund (UNICEF). Monitoring the situation of women and children: Multiple Indicator Cluster Survey manual 2005. New York: UNICEF; 2006.
 26.
Matteucci M. An Investigation of Parameter Recovery in MCMC Estimation for the Additive IRT Model. Commun Stat Theory Methods. 2014;43(4):751–70.
 27.
Sheng Y, Wikle CK. Bayesian Multidimensional IRT Models With a Hierarchical Structure. Educ Psychol Meas. 2008;68(3):413–30.
 28.
Sheng Y. Review of Bayesian Item Response Modeling: Theory and Applications, by Jean-Paul Fox. Struct Equ Model Multidiscip J. 2015;22(3):484–8.
 29.
Béguin AA, Glas CAW. MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika. 2001;66(4):541–61.
 30.
Fox JP, Glas CAW. Bayesian estimation of a multilevel IRT model using Gibbs sampling. Psychometrika. 2001;66(2):271–88.
 31.
Fox JP, Wyrick C. A Mixed Effects Randomized Item Response Model. J Educ Behav Stat. 2008;33(4):389–415.
 32.
Yang FM, Jones RN, Inouye SK, Tommet D, Crane PK, Rudolph JL, Ngo LH, Marcantonio ER. Selecting optimal screening items for delirium: an application of item response theory. BMC Med Res Methodol. 2013;13(1):8.
 33.
Gorter R, Fox JP, Twisk JWR. Why item response theory should be used for longitudinal questionnaire data analysis in medical research. BMC Med Res Methodol. 2015;15(1):1–12.
 34.
Dregan A, Grieve A, van Staa T, Gulliford MC. Potential application of item-response theory to interpretation of medical codes in electronic patient records. BMC Med Res Methodol. 2011;11(1):168.
 35.
Sheng Y, Wikle CK. Comparing Multiunidimensional and Unidimensional Item Response Theory Models. Educ Psychol Meas. 2007;67(6):899–919.
 36.
Banerjee S, Carling PB, Gelfand AE. Hierarchical modeling and analysis for spatial data. London: Chapman & Hall/CRC; 2004.
 37.
Stock R. Distance and the utilization of health facilities in rural Nigeria. Soc Sci Med. 1983;17(9):563–70.
 38.
Buor D. Analysing the primacy of distance in the utilization of health services in the Ahafo-Ano South district, Ghana. Int J Health Plann Manage. 2003;18(4):293–311.
 39.
Noor AM, Amin AA, Gething PW, Atkinson PM, Hay SI, Snow RW. Modelling distances travelled to government health services in Kenya. Trop Med Int Health. 2006;11(2):188–96.
 40.
Basu S, Andrews J, Kishore S, Panjabi R, Stuckler D. Comparative Performance of Private and Public Healthcare Systems in Low- and Middle-Income Countries: A Systematic Review. PLoS Med. 2012;9(6). doi:10.1371/journal.pmed.1001244.
 41.
Tanser F, Hosegood V, Benzler J, Solarsh G. New approaches to spatially analyse primary health care usage patterns in rural South Africa. Trop Med Int Health. 2001;6(10):826–38.
 42.
Noor AM, Rage IA, Moonen B, Snow RW. Health service providers in Somalia: their readiness to provide malaria case-management. Malar J. 2009;8:100.
 43.
Leonard K, Mliga GR, Mariam DH. Bypassing health centers in Tanzania: Revealed preferences for observable and unobservable quality. New York: Columbia University Academic Commons; 2002.
 44.
Akin JS, Hutchinson P. Healthcare facility choice and the phenomenon of bypassing. Health Policy Plan. 1999;14(2):135–51.
 45.
Ray N, Ebener S. AccessMod 3.0: computing geographic coverage and accessibility to health care services using anisotropic movement of patients. Int J Health Geogr. 2008;7(1):63.
 46.
Corsi DJ, Neuman M, Finlay JE, Subramanian S. Demographic and health surveys: a profile. Int J Epidemiol. 2012;41(6):1602–13.
 47.
Aliaga A, Ren R. Optimal sample sizes for two-stage cluster sampling in demographic and health surveys. In: DHS Working Papers No 30. Calverton: ORC Macro; 2006.
 48.
Le TN, Verma VK. An analysis of sampling designs and sampling errors of the demographic and health surveys. In: DHS Analytical Reports No 3. Calverton: Macro International; 1997.
 49.
McLaren ZM, Ardington C, Leibbrandt M. Distance decay and persistent health care disparities in South Africa. BMC Health Serv Res. 2014;14(1):1–9.
 50.
Bailey CT, Gatrell CA. Interactive Spatial Data Analysis. Essex, England: Longman Scientific & Technical; 1995.
 51.
Cromley EK, McLafferty SL. GIS and public health. New York: Guilford Press; 2002.
 52.
Febir LG, Asante KP, Afari-Asiedu S, Abokyi LN, Kwarteng A, Ogutu B, Gyapong M, Owusu-Agyei S. Seeking treatment for uncomplicated malaria: experiences from the Kintampo districts of Ghana. Malar J. 2016;15(1):1–11.
 53.
Chibwana AI, Mathanga DP, Chinkhumba J, Campbell Jr CH. Sociocultural predictors of health-seeking behaviour for febrile under-five children in Mwanza-Neno district, Malawi. Malar J. 2009;8:219.
 54.
Rasch G. On general laws and the meaning of the measurement in psychology. In: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics: 1961. London: University of California Press; 1961. p. 321–34.
 55.
McDonald RP. Numerical methods for polynomial models in nonlinear factor analysis. Psychometrika. 1967;32(1):77–112.
 56.
Lord FM, Novick MR, Birnbaum A. Statistical Theories of Mental Test Scores. MA: Addison-Wesley; 1968.
 57.
Darrell Bock R. Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika. 1972;37(1):29–51.
 58.
Andersen EB. The Numerical Solution of a Set of Conditional Estimation Equations. J R Stat Soc Ser B Methodol. 1972;34(1):42–54.
 59.
Swaminathan H, Gifford JA. Bayesian estimation in the two-parameter logistic model. Psychometrika. 1985;50(3):349–64.
 60.
Swaminathan H, Gifford JA. Bayesian estimation in the three-parameter logistic model. Psychometrika. 1986;51(4):589–601.
 61.
Albert JH. Bayesian Estimation of Normal Ogive Item Response Curves Using Gibbs Sampling. J Educ Stat. 1992;17(3):251–69.
 62.
Braeken J, Tuerlinckx F. Investigating latent constructs with item response models: A MATLAB IRTm toolbox. Behav Res Methods. 2009;41(4):1127–37.
 63.
Sahu SK. Bayesian Estimation and Model Choice in Item Response Models. J Stat Comput Simul. 2002;72(3):217–32.
 64.
Fu ZH, Tao J, Shi NZ. Bayesian estimation in the multidimensional three-parameter logistic model. J Stat Comput Simul. 2009;79(6):819–35.
 65.
Rupp AA. Item Response Modeling With BILOG-MG and MULTILOG for Windows. Int J Test. 2003;3(4):365–84.
 66.
Skrondal A, Rabe-Hesketh S. Prediction in multilevel generalized linear models. J R Stat Soc Ser A (Stat Methodol). 2009;172(3):659–87.
 67.
Hattie J. Methodology Review: Assessing Unidimensionality of Tests and Items. Appl Psychol Meas. 1985;9(2):139–64.
 68.
de la Torre J, Douglas JA. Higher-order latent trait models for cognitive diagnosis. Psychometrika. 2004;69(3):333–53.
 69.
Barnard J, McCulloch R, Meng XL. Modelling covariance matrices in terms of standard deviations and correlations, with application to shrinkage. Stat Sin. 2000;10(4):1281–311.
 70.
Spiegelhalter DJ, Best NG, Carlin BP, Van Der Linde A. Bayesian measures of model complexity and fit. J R Stat Soc Ser B (Stat Methodol). 2002;64(4):583–639.
 71.
Plummer M. Penalized loss functions for Bayesian model comparison. Biostatistics. 2008;9(3):523–39.
 72.
Plummer M. rjags: Bayesian Graphical Models using MCMC. 2016.
 73.
Gelman A, Rubin DB. Inference from Iterative Simulation Using Multiple Sequences. Stat Sci. 1992;7(4):457–72.
 74.
Raftery AE, Lewis SM. Implementing MCMC. In: Gilks WR, Spiegelhalter DJ, Richardson S, editors. Markov Chain Monte Carlo in Practice. London: Chapman and Hall; 1996. p. 115–30.
 75.
Kizito J, Kayendeke M, Nabirye C, Staedke SG, Chandler CI. Improving access to health care for malaria in Africa: a review of literature on what attracts patients. Malar J. 2012;11:55.
 76.
Lagarde M, Palmer N. The impact of user fees on access to health services in low- and middle-income countries. Cochrane Database Syst Rev. 2011;(4):CD009094.
 77.
Chen I, Clarke SE, Gosling R, Hamainza B, Killeen G, Magill A, O’Meara W, Price RN, Riley EM. "Asymptomatic" Malaria: A Chronic and Debilitating Infection That Should Be Treated. PLoS Med. 2016;13(1):e1001942.
 78.
Sturrock HJW, Hsiang MS, Cohen JM, Smith DL, Greenhouse B, Bousema T, Gosling RD. Targeting Asymptomatic Malaria Infections: Active Surveillance in Control and Elimination. PLoS Med. 2013;10(6):e1001467.
 79.
Ferrer BE, Webster J, Bruce J, Narh-Bana SA, Narh CT, Allotey NK, Glover R, Bart-Plange C, Sagoe-Moses I, Malm K, et al. Integrated community case management and community-based health planning and services: a cross-sectional study on the effectiveness of the national implementation for the treatment of malaria, diarrhoea and pneumonia. Malar J. 2016;15(1):340.
 80.
Kirigia J, Asbu E. Technical and scale efficiency of public community hospitals in Eritrea: an exploratory study. Heal Econ Rev. 2013;3(1):6.
 81.
Little R, Rubin D. Statistical Analysis with Missing Data. 2nd ed. New York: John Wiley & Sons, Incorporated; 2002.
 82.
Little RJA, Rubin DB. On Jointly Estimating Parameters and Missing Data by Maximizing the Complete-Data Likelihood. Am Stat. 1983;37(3):218–20.
 83.
Roll Back Malaria, Measure Evaluation, USAID, et al. Guidelines for core population-based indicators. Calverton: MEASURE Evaluation; 2009.
 84.
DiLiberto DD, Staedke SG, Nankya F, Maiteki-Sebuguzi C, Taaka L, Nayiga S, Kamya MR, Haaland A, Chandler CI. Behind the scenes of the PRIME intervention: designing a complex intervention to improve malaria care at public health centres in Uganda. Glob Health Action. 2015;8:29067.
 85.
Ministry of Health and Social Services. Namibia Health Facility Census, 2009. Windhoek: Ministry of Health and Social Services and ICF Macro; 2010. p. 585.
 86.
Alegana VA, Atkinson PM, Lourenço C, Ruktanonchai NW, Bosco C, Erbach-Schoenberg E, Didier B, Pindolia D, Le Menach A, Katokele S, et al. Advances in mapping malaria for elimination: fine resolution modelling of Plasmodium falciparum incidence. Scientific Reports. 2016;6:29628.
Acknowledgements
We would like to thank Professor Sujit Sahu (University of Southampton) and Dr Linus Bengtsson (Flowminder co-director) for comments on an earlier version of the manuscript.
Funding
Andrew J Tatem is supported by a Wellcome Trust Sustaining Health Grant [grant number 106866/Z/15/Z] and Bill and Melinda Gates Foundation [grant numbers OPP1106427, 1032350, OPP1134076].
Availability of data and materials
DHS data are available in the public domain at http://dhsprogram.com/data/availabledatasets.cfm.
Authors’ contributions
VA, PMA, and AJT were responsible for study design, analysis, interpretation, and production of final manuscript. CP and JW contributed to data assembly and management, interpretation and production of final manuscript. All authors have read and approved the final version of the manuscript.
Competing interests
The authors declare that they have no competing interests.
Consent for publication
Not Applicable.
Ethics approval and consent to participate
University of Southampton (17263).
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Parameter notations
j Individual/person
i Item/survey question
k Dimension for items
q Dimension for dependent variables
a Discrimination parameter
b Difficulty parameter on items
c Probability threshold parameter
θ Individual trait/ability parameter
P(Y) Probability that event Y occurs
I(⋅) Indicator function for event in sample space
E(X) Expectation for random parameter X
μ Mean
DIC Deviance Information Criterion
\( \overline{D} \) Mean deviance
[{(⋅)}] Order of brackets
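To make the notation above concrete, the following is a minimal sketch of how a, b, c and θ could combine into a response probability for individual j on item i. It assumes a three-parameter logistic form, P(Y = 1 | θ) = c + (1 − c) / (1 + exp(−a(θ − b))); the link function and parameterisation actually used in the fitted models may differ. The function names and the values of a and b are hypothetical, and c = 0.34 simply echoes the model 1 threshold reported in the abstract.

```python
import math


def logistic(x: float) -> float:
    """Numerically stable logistic function 1 / (1 + exp(-x))."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def prob_3pl(theta: float, a: float, b: float, c: float) -> float:
    """P(Y = 1 | theta) under an ASSUMED three-parameter logistic form.

    theta: individual trait parameter
    a:     discrimination parameter
    b:     difficulty parameter
    c:     probability threshold (lower asymptote)
    """
    if not 0.0 <= c <= 1.0:
        raise ValueError("c must lie in [0, 1]")
    return c + (1.0 - c) * logistic(a * (theta - b))


# Hypothetical illustration: a and b are made-up values, not estimates
# from the article; c echoes the reported model 1 threshold.
a_i, b_i, c_i = 1.2, 0.0, 0.34
for theta_j in (-3.0, 0.0, 3.0):
    print(theta_j, round(prob_3pl(theta_j, a_i, b_i, c_i), 3))
```

Under this assumed form the probability never falls below c, however low the trait value, which is one way to read the threshold parameter as a floor on the probability of seeking treatment.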
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Keywords
 Bayesian hierarchical model
 Treatment-seeking behaviour
 Item response theory
 Markov Chain Monte Carlo