Ensemble-based methods for forecasting census in hospital units
 Devin C Koestler^{1},
 Hernando Ombao^{2} and
 Jesse Bender^{3}
DOI: 10.1186/1471-2288-13-67
© Koestler et al.; licensee BioMed Central Ltd. 2013
Received: 6 September 2012
Accepted: 22 May 2013
Published: 30 May 2013
Abstract
Background
The ability to accurately forecast census counts in hospital departments has considerable implications for hospital resource allocation. In recent years several different methods have been proposed for forecasting census counts; however, many of these approaches do not use available patient-specific information.
Methods
In this paper we present an ensemble-based methodology for forecasting the census under a framework that simultaneously incorporates both (i) arrival trends over time and (ii) patient-specific baseline and time-varying information. The proposed model for predicting census has three components, namely: current census count, number of daily arrivals, and number of daily departures. To model the number of daily arrivals, we use a seasonality-adjusted Poisson Autoregressive (PAR) model where the parameter estimates are obtained via conditional maximum likelihood. The number of daily departures is predicted by modeling the probability of departure from the census using logistic regression models that are adjusted for the amount of time spent in the census and incorporate both baseline and time-varying patient-specific covariate information. We illustrate our approach using neonatal intensive care unit (NICU) data collected at Women & Infants Hospital, Providence RI, which consist of 1001 consecutive NICU admissions between April 1st 2008 and March 31st 2009.
Results
Our results demonstrate statistically significant improvements in prediction accuracy for 3-, 5-, and 7-day census forecasts and increased precision of our forecasting model compared to a forecasting approach that ignores patient-specific information.
Conclusions
Forecasting models that utilize patient-specific baseline and time-varying information make the most of data typically available and have the capacity to substantially improve census forecasts.
Background
In a period of heightened economic burden, efficient and effective allocation of hospital resources is an issue of principal importance. The ability to accurately forecast the number of patient arrivals, as well as predict census counts in hospital departments, has considerable implications for hospital resource allocation, both at the micro and macro level. For example, short-term census forecasts have the potential to improve inpatient bed allocation, reduce diversions, better align hospital ancillary services, and reduce the incidence of over- and understaffing [1]. More importantly, accurate census forecasts can inform scaling up of operations during high census periods, potentially leading to improved patient outcomes [2]. Since staffing levels in hospital units are driven by the census capacity as well as the acuity of in-unit patients, forecasting methods that incorporate both patient-level severity of illness (which may evolve considerably throughout a patient's stay) and long-term census trends are necessary for informing accurate census predictions; this is a main highlight of this paper.
Several methodologies have been developed for forecasting arrival and census counts in various hospital departments [3–8]. Jones et al. (2008) evaluated the use of seasonal autoregressive integrated moving average (ARIMA), time series regression, exponential smoothing, and artificial neural network models to forecast daily patient volumes at three diverse hospital emergency departments. The time series methods considered in that analysis provided improved in-sample model goodness of fit, assessed via mean absolute prediction error (MAPE), relative to a multiple linear regression approach, considered the benchmark model for forecasting emergency department patient volumes. Additionally, Sun et al. (2009) evaluated the use of autoregressive integrated moving average models, adjusted to incorporate various environmental variables, to forecast counts of daily patient attendances in the emergency department of an acute care regional general hospital. In addition to univariate time series approaches to forecasting emergency department patient volumes, multivariate time series models have also been utilized and have been shown to reliably forecast emergency department patient census.
The primary limitation of the above methods is that they do not incorporate patient-level information to make predictions. Here, we propose an ensemble-based method for short-term census forecasts under a framework that simultaneously incorporates (i) hospital unit arrival trends over time and (ii) patient-specific baseline and time-varying information. Such approaches represent the future of census forecasting as hospital departments around the country move toward more efficient methods for collecting and processing patient-level information upon admission and through the duration of stay. The proposed method is applied to neonatal intensive care unit (NICU) data collected at Women & Infants Hospital, Providence RI, which consist of 1001 consecutive NICU admissions between April 1st 2008 and March 31st 2009. In order to illustrate the potential for improved census forecasts that results from incorporating baseline and time-varying patient information, our proposed approach is compared to a forecasting method that ignores patient-specific information.
Methods
To elucidate our methodology, we differentiate between arrival, departure, and census counts for the NICU. We define the arrival count on a particular day as the number of patients admitted to the NICU during a 24-hour period. Similarly, the departure count for a particular day is defined as the number of patients who depart the NICU as a result of a healthy discharge during a 24-hour period. By healthy discharge we refer to cases where a patient was discharged from the NICU as a result of adequate physiological health, as determined by clinical criteria. Lastly, we define the daily census count as the number of patients residing in the NICU at the end of the day (11:59 pm).
Our approach to modeling the NICU census follows [9] where the NICU census C(t+k) at time (t+k) can be concisely expressed as a function of several different components. More specifically, C(t+k) where k≥1, is a function of:
 1. C(t): NICU census at time (t).
 2. {A(t+1),A(t+2),…,A(t+k)}: number of arrivals on each successive day from time (t+1) up to time (t+k).
 3. {D(t+1),D(t+2),…,D(t+k)}: number of departures from the NICU census on each successive day from time (t+1) up to time (t+k).
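The three components listed above combine through simple bookkeeping: the census k days ahead equals the current census plus the arrivals, minus the departures, over the intervening days. A minimal Python sketch of that identity follows; the function name `census_ahead` and the example counts are illustrative assumptions, not values from the NICU data.

```python
from typing import Sequence


def census_ahead(current_census: int,
                 arrivals: Sequence[int],
                 departures: Sequence[int]) -> int:
    """Return C(t+k) given C(t), A(t+1..t+k), and D(t+1..t+k)."""
    if len(arrivals) != len(departures):
        raise ValueError("arrivals and departures must cover the same k days")
    # C(t+k) = C(t) + sum_j A(t+j) - sum_j D(t+j), for j = 1, ..., k
    return current_census + sum(arrivals) - sum(departures)


# Hypothetical 3-day example starting from a census of 60 patients.
print(census_ahead(60, arrivals=[4, 2, 5], departures=[3, 3, 1]))  # 64
```

In practice the arrivals and departures are unknown at time t and must be predicted, which is what the remainder of this section addresses.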
Remark 1
One key assumption we make is that the number of arrivals A(t) is independent of the number of departures D(t) at time (t). In other words, the number of patients arriving in the NICU census at some time (t) does not provide information about the number of patients departing the NICU census at that same time. As will be described in further detail, the assumption of independence between arrivals and departures at some time (t) has important implications when computing the errors associated with our census predictions. In general, this is a sensible assumption when NICUs are operating below their maximum capacity, which was most often the case for the data presented here.
Remark 2
We note that the number of departures D(t+k) at time (t+k) is not independent of the number of arrivals A(t+k−i) at time (t+k−i), where i=1,2,…. That is, there is an upper bound on the number of departures at time (t+k) based on the cumulative number of arrivals from the preceding days.
Remark 3
In (2), the estimate for the census at time (t+k) is given as a point estimate, which is subject to uncertainty given the error in the estimation of the number of arrivals and departures. Stochastic or “ensemble-based” forecasting is used to account for this uncertainty through the use of multiple forecasts created with an individual forecast model. For example, we generate an ensemble of predictions of C(t+k), denoted $\{\hat{C}^{(r)}(t+k), r=1,2,\dots,M\}$, where r represents a single realization and M is an upper bound on the number of realizations, and summarize over the ensemble to obtain more accurate census forecasts. We describe this procedure in further detail in Section “Ensemble-based forecasting and prediction intervals for census forecasts”.
Since obtaining census forecasts at some time (t+k) is contingent upon predictions for the number of arrivals as well as the number of departures from (t+1) up to and including time (t+k), we describe our proposed methodology for predicting the number of arrivals and departures in the paragraphs that follow.
Predicting the number of arrivals
We model daily arrivals using the Poisson Autoregressive (PAR) model [10]. This choice is inspired by several studies that have demonstrated that daily arrival patterns in various hospital departments can be modeled as a Poisson process [11–13]. Moreover, this model incorporates the correlation between day-to-day arrival counts. The model specification is as follows: let {A(t),t=1,2,…,T} denote a time series of arrival counts. We define $\mathcal{F}_{t}$ as any covariate information, including previous arrival counts, available to the observer up to time t. Under the PAR model, denote the conditional expected arrival to be $\mathbb{E}[A(t) \mid \mathcal{F}_{t}]=\mu_{t}$, t=1,2,…,T.
where $\hat{\boldsymbol{\beta}}$ and $\hat{\varphi}$ denote the conditional maximum likelihood estimates for β and ϕ respectively, where we treat the first $\dot{p}$ arrivals, $\{A(1),A(2),\dots,A(\dot{p})\}$, as fixed.
Predicting the number of departures
To predict the number of departures from a group of patients residing in the NICU, it is ideal to incorporate both patient-specific baseline covariate information (i.e., information collected upon admission to the NICU) and any covariate information collected throughout their stay in the NICU. For the data considered here, only birth weight and gestational age were obtained for each child upon NICU admission. Although both are known to be useful for predicting length of stay, physiologic information collected throughout their time in the NICU may dictate when patients are released and thus, largely impact overall length of stay [15]. In addition, we also seek a framework for predicting the number of departures to reflect the fact that the probability of a patient leaving k days from some time point (t) should be conditional on how many days that patient has already spent in the NICU.
Some notation
Suppose S_{i} represents the number of days the i^{th} patient has spent in the NICU and X_{i} represents a vector of baseline covariates collected on the i^{th} patient. Moreover, define $\mathbf{X}_{i}^{(t)}$ as a vector of additional covariates obtained up to and including (t) days after admission to the NICU for the i^{th} patient. Thus $\mathbf{Z}_{i}^{(t)}=[\mathbf{X},\mathbf{X}^{(t)}]_{i}$ is a vector of baseline covariates plus any additional covariate information that is obtained for the i^{th} patient up to and including (t) days after admission to the NICU. Further, let H_{s}(t) represent the number of patients in the NICU at time (t) who have stayed (s) days in the NICU and suppose that $Y_{i}^{(k,s)}$ is an indicator of whether or not patient i leaves the NICU in k days, given that they have already spent (s) days in the NICU. More specifically:
Modeling the probability of departure
where β^{(k,s)} is a vector of model parameters specific to the model above; the superscript k reflects the fact that we are modeling the probability of departure from the NICU prior to or on k days from some time point, and the superscript s denotes that this model is conditional on those who have spent s prior days in the NICU. In the above expression, g(.) is an appropriate link function (e.g., logit, probit, or complementary log-log, since Y^{(k,s)} is binary). Thus, $\hat{\pi}^{(k)}(\mathbf{Z}_{i}^{(s)}, S_{i}=s)$ represents the estimated probability of departing the NICU prior to or on k days from time (t) for the i^{th} patient who has spent s days in the NICU.
Predicting the number of departures
 1. $\hat{Y}_{i}^{(k,s)} = \hat{\pi}^{(k)}(\mathbf{Z}_{i}^{(s)}, S_{i}=s)$ or,
 2. $\hat{Y}_{i}^{(k,s)}$ is a sample from a $\text{Bernoulli}\left(\hat{\pi}^{(k)}(\mathbf{Z}_{i}^{(s)}, S_{i}=s)\right)$.
Due to sparseness of data for large R and also model feasibility, we propose setting R to a fixed value that remains constant across forecasts at different times. In summary, there are two main stages in our approach to predicting the number of departures among N patients residing in the census at time t. In the first stage, we predict the number of departures among patients who have spent S days in the census, where S={0,1,…,R} using model (7) and in the second stage we sum over all predictions obtained in the first stage to obtain an estimate of the expected number of departures from the census.
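The two-stage idea (a departure probability for each patient from the model for that patient's stratum, then a sum over all patients) can be sketched as follows. This is only an illustration under stated assumptions: a logit link, hypothetical coefficient vectors in `coefs_by_stratum`, and pooling of stays longer than R days into the last stratum. None of the numbers come from the fitted models.

```python
import math
from typing import Dict, List, Sequence


def departure_probability(coefs: Sequence[float],
                          covariates: Sequence[float]) -> float:
    """Inverse logit of intercept + linear predictor (logit link assumed)."""
    eta = coefs[0] + sum(b * z for b, z in zip(coefs[1:], covariates))
    return 1.0 / (1.0 + math.exp(-eta))


def expected_departures(patients: List[dict],
                        coefs_by_stratum: Dict[int, Sequence[float]],
                        max_stratum: int) -> float:
    """Sum per-patient departure probabilities over the current census."""
    total = 0.0
    for patient in patients:
        # Assumption: stays longer than R days share the stratum-R model.
        s = min(patient["days_in_unit"], max_stratum)
        total += departure_probability(coefs_by_stratum[s],
                                       patient["covariates"])
    return total


# Hypothetical census of three patients; coefficients are placeholders.
coefs = {0: [-2.0, 0.0005], 1: [-1.5, 0.0005], 2: [-1.0, 0.0005]}
census = [
    {"days_in_unit": 0, "covariates": [2400.0]},
    {"days_in_unit": 1, "covariates": [3100.0]},
    {"days_in_unit": 9, "covariates": [1200.0]},  # pooled into stratum 2
]
print(expected_departures(census, coefs, max_stratum=2))
```

Replacing the sum of probabilities with a sum of Bernoulli draws gives the sampled version of the same quantity.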
It is important to note that formula (8) provides an estimate of the expected number of departures k days from time t only among the N patients residing in the census at time t. Given that the number of departures k days from time t also depends on subjects that arrive between times t and t+k, reliable estimates of the number of departures must also consider these subjects. Since these subjects are not directly observed, we hereafter refer to them as pseudo-subjects.
We estimate the probability of departure k days from time t for each pseudo-subject by taking a random sample, with replacement, of baseline covariates from our available data, $\mathbf{x}_{i}^{\star}$. Using $\mathbf{x}_{i}^{\star}$, we compute the probability of departure, $\hat{\pi}^{(k)}(\mathbf{x}_{i}^{\star}, s_{i}=s)$, prior to or on day (t+k), for the i^{th} pseudo-subject. Putting this into a broader context, if we predict the number of arrivals on day t+1 to be $\hat{\mu}_{t+1}$, that is $\hat{A}(t+1)=\hat{\mu}_{t+1}$, then we sample $\hat{\mu}_{t+1}$ observations, with replacement, from our available data and use the baseline covariates from those $\hat{\mu}_{t+1}$ observations to predict the number of departures prior to or on day t+k from that group. This process is repeated for day t+2 up to day t+k−1, with $\hat{A}(t+2),\dots,\hat{A}(t+k-1)$ forming the basis of how many pseudo-subjects are considered at each time point.
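The resampling step for pseudo-subjects can be illustrated with a short Python sketch. The helper name `sample_pseudo_subjects`, the record layout, and the use of integer arrival counts (for instance, rounded predicted means) are assumptions made for the illustration.

```python
import random
from typing import List, Sequence


def sample_pseudo_subjects(baseline_pool: Sequence[dict],
                           predicted_arrivals: Sequence[int],
                           rng: random.Random) -> List[List[dict]]:
    """Draw baseline covariates, with replacement, for each future day.

    predicted_arrivals[j] is the (integer) predicted number of arrivals on
    day t+j+1; one cohort of pseudo-subjects is returned per day.
    """
    return [rng.choices(baseline_pool, k=n) for n in predicted_arrivals]


# Hypothetical pool of observed baseline covariates.
pool = [
    {"BWEIGHT": 2335.0, "GESTAGE": 34.0},
    {"BWEIGHT": 1180.0, "GESTAGE": 28.0},
    {"BWEIGHT": 3410.0, "GESTAGE": 39.0},
]
cohorts = sample_pseudo_subjects(pool, [3, 2], random.Random(2013))
print([len(cohort) for cohort in cohorts])  # [3, 2]
```

Each sampled record would then be passed to the baseline (zero days in the unit) departure model with the appropriate remaining horizon.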
As noted, forecasting census counts becomes increasingly complex, with increasing levels of uncertainty, as one considers longer forecasts. While the above examples present the census forecasts as point predictions, it is often the case that better prediction performance can be achieved by summarizing over an ensemble of such forecasts (e.g., the mean or median over the ensemble) [16]. In the section that follows, we present an ensemble-based method that can be simultaneously used to (i) obtain more reliable census forecasts and (ii) obtain prediction intervals for our census forecasts.
Ensemble-based forecasting and prediction intervals for census forecasts
without loss of generality, assuming a logistic regression model for the probability of departure. It follows then that $\hat{\boldsymbol{\beta}}^{(k,s)} \stackrel{.}{\sim} \text{MVN}(\boldsymbol{\beta}^{(k,s)}, \mathbf{V}_{\beta^{(k,s)}})$, where $\hat{\boldsymbol{\beta}}^{(k,s)}$ represents the maximum likelihood estimate of β^{(k,s)} and $\mathbf{V}_{\beta^{(k,s)}}$ represents the inverse Fisher Information matrix for β^{(k,s)}.
and r denotes a single realization. More precisely, we obtain ${\hat{C}}^{\left(r\right)}(t+k)$ using the following procedure:
 1. Denote $\boldsymbol{\lambda}^{(r)}$ as a sample from a multivariate normal distribution with mean $\hat{\boldsymbol{\lambda}}$ and variance-covariance $\hat{\mathbf{V}}_{\lambda}$, and $\boldsymbol{\beta}^{(r)(k,s)}$ as a sample from a multivariate normal distribution with mean $\hat{\boldsymbol{\beta}}^{(k,s)}$ and variance-covariance $\hat{\mathbf{V}}_{\beta^{(k,s)}}$, for s=1,2,…,R.
 2. Conditional on $\boldsymbol{\lambda}^{(r)}$ and $\boldsymbol{\beta}^{(r)(k,s)}$, compute $\hat{\mu}_{t+j}^{(r)}=\mu(\boldsymbol{\lambda}^{(r)})$ and $\hat{\pi}_{i}^{(r)(k)}=\pi^{(k)}(\mathbf{z}_{i}^{(s)}, s_{i}=s \mid \boldsymbol{\beta}^{(r)(k,s)})$ for j=1,2,…,k, i=1,2,…,H_{s}(t), and s=1,2,…,R.
 3. Based on $\hat{\mu}_{t+j}^{(r)}$ and $\hat{\pi}_{i}^{(r)(k)}$, obtain samples $\hat{A}^{(r)}(t+j)$ and $\hat{Y}_{i}^{(r)(k,s)}$ from a $\text{Poisson}(\hat{\mu}_{t+j}^{(r)})$ and a $\text{Bernoulli}(\hat{\pi}_{i}^{(r)(k)})$ respectively, for j=1,2,…,k, i=1,2,…,H_{s}(t), and s=1,2,…,R.
 4. Based on $\hat{A}^{(r)}(t+j)$, j=1,2,…,k−1, sample $\hat{Y}_{i}^{\star(r)(k-j,0)}$ for each of the pseudo-subjects.
 5. Compute $\hat{D}^{(r)}(t+j)$, for j=1,2,…,k.
 6. Conditional on $\hat{A}^{(r)}(t+j)$ and $\hat{D}^{(r)}(t+j)$, for j=1,2,…,k, compute $\hat{C}^{(r)}(t+k)$.
End
where M is an upper bound prespecified by the user (e.g., M=1000). Our census forecasts are then obtained by summarizing over the ensemble of predictions, $\hat{C}^{(r)}(t+k), r=1,2,\dots,M$, using, for instance, the mean or median. Furthermore, this approach can also be used for the construction of 95% prediction intervals by computing the associated percentiles of the ensemble of predictions.
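A deliberately simplified Python sketch of the ensemble idea is given below. It is not the full procedure: parameter uncertainty (the multivariate normal draws) and departures among pseudo-subjects are omitted, the arrival means and departure probabilities are treated as known, and all names and numbers are hypothetical. It only shows how Poisson and Bernoulli draws are repeated M times and then summarized by the median and the 2.5th and 97.5th percentiles.

```python
import math
import random
from statistics import median
from typing import List, Sequence, Tuple


def draw_poisson(mu: float, rng: random.Random) -> int:
    """Knuth's multiplication method; adequate for small daily means."""
    limit, count, product = math.exp(-mu), 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def ensemble_forecast(current_census: int,
                      arrival_means: Sequence[float],
                      departure_probs: Sequence[float],
                      n_realizations: int = 1000,
                      seed: int = 0) -> Tuple[float, int, int]:
    """Return (median, lower, upper) of simulated C(t+k) over the ensemble."""
    rng = random.Random(seed)
    draws: List[int] = []
    for _ in range(n_realizations):
        # Arrivals on days t+1, ..., t+k: one Poisson draw per day.
        arrivals = sum(draw_poisson(mu, rng) for mu in arrival_means)
        # Departures within k days: one Bernoulli draw per current patient.
        departures = sum(rng.random() < p for p in departure_probs)
        draws.append(current_census + arrivals - departures)
    draws.sort()
    lower = draws[int(0.025 * (n_realizations - 1))]
    upper = draws[int(0.975 * (n_realizations - 1))]
    return median(draws), lower, upper


# Hypothetical 3-day forecast: 60 patients, each with a 0.2 chance of leaving.
print(ensemble_forecast(60, [3.0, 3.0, 3.0], [0.2] * 60, n_realizations=1000))
```

The width of the returned interval grows with the forecast horizon because more Poisson draws are accumulated, which matches the intuition that longer forecasts carry more uncertainty.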
Up to now, we have considered the components of the census forecasting model, namely the arrivals and the departures, as separate entities. To provide further intuition, we provide an example of our proposed census forecasting model in Additional file 1.
Results
Our census forecasting model was applied to data collected by the third author, a neonatologist at the Department of Pediatrics, Women & Infants Hospital, 101 Dudley St., Providence, RI, with the goal of obtaining accurate short-term census forecasts. The Department of Pediatrics at Women & Infants Hospital maintains several patient databases designed for quality monitoring. One of these, the Risk-Adjusted Length of Stay Database, has prospectively collected data on neonatal severity of illness in the first week of life. Maintained meticulously from April 2008 through June 2010, it includes 2660 consecutive admissions that stayed in the NICU longer than 24 hours. It includes fields for patient-level information, such as demographic data, hospital stay, and severity of illness indices.
The patient-level data used for this analysis consisted of 1001 consecutive NICU admissions, born between April 1, 2008 and March 31, 2009, that had complete data at the time of analysis. With IRB approval, the medical records of all newborns born between April 1, 2008 and March 31, 2009 and admitted to the NICU were obtained retrospectively for data extraction. All newborns admitted to the NICU at Women & Infants Hospital within 24 hours of birth were recruited and ranged in morbidity from minimal to severe. Newborns were excluded if they (i) died prior to NICU admission; (ii) were admitted for preterminal comfort care (defined as neither intubation nor cardiorespiratory resuscitation); or (iii) had a major congenital anomaly. All procedures and study materials were approved by the Institutional Review Board at Women & Infants Hospital, Providence RI.
Summary statistics for the patient-level data (N = 1001)
Variable   Variable type   Mean     Median   Standard deviation   Range
LOS        Discrete        20.1     10.0     25.4                 (1, 227)
BWEIGHT    Continuous      2418.0   2335.0   939.8                (360, 5493)
GESTAGE    Continuous      34.6     34.0     4.0                  (23, 42)
SNAPDOL1   Discrete        8.2      0.0      13.1                 (0, 103)
SNAPDOL3   Discrete        5.0      0.0      10.4                 (0, 86)
MAIN1      Continuous      484.8    417.0    314.0                (0, 2139)
MAIN7      Continuous      661.2    607.0    361.3                (0, 2486)
Arrival data used in the development of the census forecasting models consisted of the number of daily admissions to the NICU at Women & Infants Hospital from January 1, 2008 through May 20, 2009. Collectively, the patient-level data and the arrival data were used to construct a NICU census forecasting model for predicting the census counts at 1, 3, 5, and 7 days in advance. All analyses were carried out using R version 2.11.
Fitting the seasonality adjusted PAR model for predicting arrivals
We first estimated the autocorrelation function (ACF) to determine the presence of seasonality and weekly trends in our data. We observed a modest semiannual trend in our data with peaks in the number of arrivals in the spring and fall months. Since our data consist of approximately a year and a half of daily admissions counts, our empirical estimate of the frequency of the seasonality trend was ($\hat{\omega}=3$), which constitutes 1 cycle per 26 weeks. We used the Bayesian Information Criterion (BIC) [14] to determine the optimal order ($\dot{p}$) of our PAR models. More specifically, we adjusted for seasonality and fit several PAR(p) models, varying the order (p), and selected the model that resulted in the lowest BIC. Based on this approach, we estimated the optimal order to be ($\dot{p}=7$). Fitting an order (7) seasonality-adjusted PAR model, we computed within-sample predictions and found the mean-squared error (MSE) to be 3.48, which is marginally better than using predictions based on the estimated mean number of daily admissions (MSE = 3.55).
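The order-selection step can be illustrated with the standard BIC formula, BIC = −2·logL + q·log(n), where q is the number of estimated parameters and n the number of observations. In the Python sketch below, the log-likelihood values, the count of non-autoregressive parameters (`n_fixed_params`), and the use of a common n across orders are all hypothetical simplifications; they are not the fitted values from this analysis.

```python
import math
from typing import Dict


def bic(log_likelihood: float, n_params: int, n_obs: int) -> float:
    """Bayesian Information Criterion; lower values are preferred."""
    return -2.0 * log_likelihood + n_params * math.log(n_obs)


def select_order(loglik_by_order: Dict[int, float],
                 n_obs: int,
                 n_fixed_params: int) -> int:
    """Return the autoregressive order p with the smallest BIC.

    Each PAR(p) candidate is assumed to have n_fixed_params parameters
    (for example an intercept and seasonal terms) plus p lag coefficients.
    """
    scores = {p: bic(ll, n_fixed_params + p, n_obs)
              for p, ll in loglik_by_order.items()}
    return min(scores, key=scores.get)


# Hypothetical maximized conditional log-likelihoods for three orders.
candidates = {1: -1050.0, 7: -1020.0, 14: -1012.0}
print(select_order(candidates, n_obs=500, n_fixed_params=3))
```

The penalty term is what prevents the highest order from winning automatically: order 14 has the largest log-likelihood in this toy example but pays for seven additional parameters.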
Predicting departures
We considered conditional logistic regression models to obtain an estimate for the expected number of departures. In order to obtain prospective census forecasts at 1, 3, 5, and 7 days in advance, we fit a series of conditional logistic regression models which were collectively used to obtain an estimate of the number of departures. In particular, the logistic regression models that were used to inform the expected number of departures were stratified by the number of days a patient occupied the NICU, which in turn had implications for what covariate information was available for those patients. For instance, at baseline, the only covariate information for a patient was their birth weight and gestational age. However, SNAPDOL1 and MAIN1 scores were available for patients who occupied the NICU for at least one day. Furthermore, SNAPDOL3 and MAIN7 scores were available for patients who occupied the NICU for at least 3 and 7 days respectively. As mentioned, the strata were defined such that S∈{1,2,…,≥10}, where S represents the number of days a patient had occupied the NICU at some time point. The upper bound for S was selected as such due to concerns regarding data sparseness as well as for computational and model feasibility. Given the high volume of models this framework requires, results from our individual model fits are omitted, but available upon request.
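The dependence of available covariates on time spent in the NICU can be written down directly from the description above. In this Python sketch the variable names follow the summary table, while the function names and the constant `MAX_STRATUM` are illustrative; whether every fitted model actually used all covariates available in its stratum is not stated here, so the list should be read as an upper limit on the candidate predictors.

```python
from typing import List

MAX_STRATUM = 10  # stays of 10 or more days share one stratum


def stratum(days_in_unit: int) -> int:
    """Map days already spent in the NICU to a model stratum."""
    return min(days_in_unit, MAX_STRATUM)


def available_covariates(days_in_unit: int) -> List[str]:
    """List the covariates that have been recorded after the given stay."""
    covariates = ["BWEIGHT", "GESTAGE"]        # collected at admission
    if days_in_unit >= 1:
        covariates += ["SNAPDOL1", "MAIN1"]    # available after day 1
    if days_in_unit >= 3:
        covariates.append("SNAPDOL3")          # available after day 3
    if days_in_unit >= 7:
        covariates.append("MAIN7")             # available after day 7
    return covariates


print(stratum(23), available_covariates(4))
# 10 ['BWEIGHT', 'GESTAGE', 'SNAPDOL1', 'MAIN1', 'SNAPDOL3']
```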
Predicting census
Discussion
In this paper we presented a novel methodology for forecasting the census in a Neonatal Intensive Care Unit (NICU). Our model explicitly incorporates both arrival trends over time in the NICU and patient-level clinical information. The census forecast is computed from the current census count, predicted arrivals, and predicted departures. We proposed an ensemble-based procedure for obtaining census predictions that used a seasonality-adjusted PAR model to model arrival trends in the NICU and stratified conditional logistic regression models incorporating baseline and time-dependent covariates for modeling the probability of departure. Our model is efficient because it integrates arrival trends over time as well as patient-level information.
We note that the ideal data set to train and test our forecasting model is one that consists of several years' worth of patient-level and arrival data. The former is crucial to the development of accurate and reliable models for predicting the probability of departure, while the latter is integral to attainment of a model that can predict the number of census arrivals with a high degree of accuracy. The arrival data used in this analysis consisted of only approximately a year and a half of admission information, thus we are limited in our ability to ascertain long-term seasonality trends. Our finding of a half-year periodicity in arrivals to the NICU (peaks in the spring and fall months) is consistent with numerous studies, which have reported seasonality trends in delivery rates [19–22]. Plausible explanations for this finding include temperature or photoperiod (affecting hormonal concentrations, sperm quality or sexual activity), seasonal variation in pregnancy loss, or cultural factors [23]. Despite the evidence for seasonal patterns in delivery rates, the nature of seasonality effects tends to vary across different racial and ethnic groups, maternal education levels, and marital status of the underlying population [22]. Since the PAR model described in Section “Predicting the number of arrivals” can be generalized to account for K-many seasonality effects using the sum of both cosine and sine terms with different frequencies, differing arrival patterns across different study populations can be easily accommodated under this general framework.
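As an illustration of how several seasonal effects could enter a model as regressors, the Python sketch below builds one cosine and sine pair per frequency. The parameterization (frequency expressed as cycles over the `n_days` of observation) and the function name are assumptions made for the example; the exact form used in the PAR model is not reproduced here.

```python
import math
from typing import List, Sequence


def seasonal_terms(t: int, n_days: int,
                   frequencies: Sequence[int]) -> List[float]:
    """Return [cos, sin] regressors for day t, one pair per frequency.

    A frequency omega means omega full cycles over the n_days observed.
    """
    terms: List[float] = []
    for omega in frequencies:
        angle = 2.0 * math.pi * omega * t / n_days
        terms += [math.cos(angle), math.sin(angle)]
    return terms


# Hypothetical design row for day 100 of 546 with K = 2 seasonal effects.
row = seasonal_terms(100, 546, frequencies=[3, 78])
print(len(row))  # 4
```

With K frequencies the design matrix gains 2K columns, so a population with a different seasonal pattern only requires a different list of frequencies.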
The arrival data used in this analysis, and perhaps the nature of NICU arrival data in general, presented challenges in the formulation of an efficient and reliable model for predicting the number of NICU admissions. This feature is a likely candidate to explain the modest overestimation that was observed in the validation of our forecasting model, which became more pronounced for increasing lengths of forecasts. The primary reason for this is that census forecasts beyond 1 day rely heavily on accurate predictions of the number of arrivals for intermediary days; thus, bias in intermediary predictions can give rise to considerable under- or overestimation in downstream census forecasts. We note that the observed tendency of our proposed approach to overestimate the census, particularly at the later time points (i.e., 5- and 7-day forecasts), could be used to recalibrate our census forecasting procedure in a manner similar to [24]; however, arrival, census, and patient-level data over a longer span of time than what were used here would be needed to effectively implement such an approach.
Our use of a conditional logistic regression framework for predicting the number of departures was motivated by two principal issues. First, as a result of our general forecasting framework, our interest was primarily focused on the expected number of departures for a cohort of patients currently residing in the census. Thus, treating each patient within a cohort as independent, the expected number of departures for a given cohort can be efficiently estimated by summing the individual predictions for departure for each patient. The idea of predicting the probabilities of departure as opposed to length of stay predictions lends itself nicely to a logistic regression framework. An alternative approach involves using length-of-stay distributions within a queueing theory analysis. However, unlike the framework described here, such an approach would not facilitate the attainment of the subject-specific probabilities of departure, which are of interest to clinicians. Secondly, it is also of clinical interest to have estimates of the probability that a patient leaves the NICU some k days into the future, which can be conveniently extracted using our proposed approach. While our approach was based on predicting the probability of “healthy discharge” from the NICU, “non-healthy discharge” (Pediatric Intensive Care Unit (PICU) or death in the NICU) represented a relatively small proportion of the study population considered here (3% and 2%, respectively). For this reason, and because including such subjects would necessitate a framework that simultaneously models the probability of leaving the NICU in a variety of different ways (i.e., healthy discharge, PICU, death), adding considerable statistical and computational complexity, we opted to exclude these subjects from the development and validation of our proposed methodology.
The results obtained from validating our forecasting model demonstrated that the ability to accurately forecast the NICU census was largely a function of the length of forecasts, a common feature of nearly all forecasting models. Furthermore, comparing the estimated MAPE and the correlations between predicted and observed census for our forecasting model with those for an approach based solely on census trends over time suggests that incorporating patient-specific information has the capacity to improve census predictions, especially for the longer forecasts (i.e., 3-, 5-, and 7-day forecasts). Additionally, the narrower widths of the 95% prediction intervals for our forecasting model relative to the forecasting model based only on census trends over time suggest that more precise predictions can be obtained using our forecasting model.
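The two validation summaries mentioned above, prediction error and interval width, can be computed as in the following Python sketch. It assumes that the mean absolute prediction error is the average absolute difference between predicted and observed census; the exact formula used in the validation is not shown in this text, and all numbers below are hypothetical.

```python
from typing import Sequence


def mean_absolute_prediction_error(predicted: Sequence[float],
                                   observed: Sequence[float]) -> float:
    """Average of |predicted - observed| over the forecast dates."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - o) for p, o in zip(predicted, observed)) / len(observed)


def mean_interval_width(lower: Sequence[float],
                        upper: Sequence[float]) -> float:
    """Average width of the prediction intervals over the forecast dates."""
    if len(lower) != len(upper) or not lower:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(u - l for l, u in zip(lower, upper)) / len(lower)


# Hypothetical 3-day-ahead forecasts on three validation dates.
print(mean_absolute_prediction_error([62, 58, 65], [60, 60, 61]))
print(mean_interval_width([55, 52, 57], [69, 64, 73]))
```

Computing both quantities for each competing model on the same validation dates gives the kind of side-by-side comparison described in the paragraph above.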
Conclusions
In summary, census forecasting models that utilize (i) arrival trends over time and (ii) patient-specific baseline and time-varying information make the most of data that are typically available in the NICU and, as demonstrated, have the potential to be a useful tool for prospectively estimating the NICU census.
Abbreviations
 NICU: Neonatal intensive care unit
 PAR: Poisson autoregressive
 MAPE: Mean absolute prediction error
 MSE: Mean squared error
 LOS: Length of stay.
References
 1. Littig SJ, Isken MW: Short term hospital occupancy prediction. Health Care Manag Sci. 2007, 10: 47-66. 10.1007/s10729-006-9000-9.
 2. Iwashyna TJ, Kramer AA, Kahn JM: Intensive care unit occupancy and patient outcomes. Crit Care Med. 2009, 37 (5): 1545-1557. 10.1097/CCM.0b013e31819fe8f8. [http://dx.doi.org/10.1097/CCM.0b013e31819fe8f8]
 3. Boyle J, Wallis M, Jessup M, Crilly J, Lind J, Miller P, Fitzgerald G: Regression forecasting of patient admission data. Conf Proc IEEE Eng Med Biol Soc. 2008, 2008: 3819-3822. [http://dx.doi.org/10.1109/IEMBS.2008.4650041]
 4. Flottemesch TJ, Gordon BD, Jones SS: Advanced statistics: developing a formal model of emergency department census and defining operational efficiency. Acad Emerg Med. 2007, 14 (9): 799-809. [http://dx.doi.org/10.1111/j.1553-2712.2007.tb02356.x]
 5. Jones SS, Evans RS, Allen TL, Thomas A, Haug PJ, Welch SJ, Snow GL: A multivariate time series approach to modeling and forecasting demand in the emergency department. J Biomed Inform. 2009, 42: 123-139. 10.1016/j.jbi.2008.05.003. [http://dx.doi.org/10.1016/j.jbi.2008.05.003]
 6. Jones SS, Thomas A, Evans RS, Welch SJ, Haug PJ, Snow GL: Forecasting daily patient volumes in the emergency department. Acad Emerg Med. 2008, 15 (2): 159-170. 10.1111/j.1553-2712.2007.00032.x. [http://dx.doi.org/10.1111/j.1553-2712.2007.00032.x]
 7. Schweigler LM, Desmond JS, McCarthy ML, Bukowski KJ, Ionides EL, Younger JG: Forecasting models of emergency department crowding. Acad Emerg Med. 2009, 16 (4): 301-308. 10.1111/j.1553-2712.2009.00356.x. [http://dx.doi.org/10.1111/j.1553-2712.2009.00356.x]
 8. Sun Y, Heng BH, Seow YT, Seow E: Forecasting daily attendances at an emergency department to aid resource planning. BMC Emerg Med. 2009, 9: 1. 10.1186/1471-227X-9-1. [http://dx.doi.org/10.1186/1471-227X-9-1]
 9. Swain RW, Kilpatrick KE, III JJM: Implementation of a model for census prediction and control. J Am Stat Assoc. 1977, 12 (4): 380-395.
 10. Kedem B, Fokianos K: Regression Models for Time Series Analysis. 2002, New York: John Wiley and Sons, Inc.
 11. Gorunescu F, McClean SI, Millard PH: Using a queueing model to help plan bed allocation in a department of geriatric medicine. Health Care Manag Sci. 2002, 5 (4): 307-312. 10.1023/A:1020342509099.
 12. Groothuis S, Hasman A, van Pol PEJ, Lencer NHMK, Janssen JJJ, Jans JDMJ, Stappers JLM, Dassen WRM, Doevendans PAFM, van Merode GG: Predicting capacities required in cardiology units for heart failure patients via simulation. Comput Methods Programs Biomed. 2004, 74 (2): 129-141. 10.1016/S0169-2607(03)00080-4. [http://dx.doi.org/10.1016/S0169-2607(03)00080-4]
 13. Millard PH, Mackay M, Vasilakis C, Christodoulou G: Measuring and modelling surgical bed usage. Ann R Coll Surg Engl. 2000, 82 (2): 75-82.
 14. Schwarz G: Estimating the dimension of a model. Ann Stat. 1978, 6 (2): 461-464. 10.1214/aos/1176344136.
 15. Bender J, Koestler D, Ombao H, McCourt M, Alskinis B, Heber W, Etheridge D, Peipert J, Rubin L, Padbury J: Neonatal Intensive Care Unit: predictive models for length of stay. J Perinatol. 2012, In press.
 16. Taylor JW, Buizza R: Neural network load forecasting with weather ensemble predictions. IEEE Trans Power Syst. 2002, 17: 626-632. 10.1109/TPWRS.2002.800906.
 17. Richardson DK, Corcoran JD, Escobar GJ, Lee SK: SNAP-II and SNAPPE-II: Simplified newborn illness severity and mortality risk scores. J Pediatr. 2001, 138: 92-100. 10.1067/mpd.2001.109608.
 18. Verma A, Weir A, Drummond J, Mitchell BF: Performance profile of an outcome measure: morbidity assessment index for newborns. J Epidemiol Community Health. 2005, 59 (5): 420-426. 10.1136/jech.2003.019109. [http://dx.doi.org/10.1136/jech.2003.019109]
 Enabudoso EJ, Okpighe AC, Gharoro EP, Okpere EE: Delivery rate in Benin City, Nigeria: are there seasonal variations?. Niger J Clin Pract. 2011, 14 (2): 129131. 10.4103/11193077.83999. [http://dx.doi.org/10.4103/11193077.83999]View ArticlePubMedGoogle Scholar
 Kokangul A, Ozkan A, Akcan S, Ozcan K, Narli M: Statistical analysis of patients’ characteristics in Neonatal Intensive Care Units. J Med Syst. 2009, 34 (4): 471478.View ArticlePubMedGoogle Scholar
 Darrow LA, Strickland MJ, Klein M, Waller LA, Flanders WD, Correa A, Marcus M, Tolbert PE: Seasonality of birth and implications for temporal studies of preterm birth. Epidemiology. 2009, 20 (5): 699706. 10.1097/EDE.0b013e3181a66e96. [http://dx.doi.org/10.1097/EDE.0b013e3181a66e96]View ArticlePubMedPubMed CentralGoogle Scholar
 Lam DA, Miron JA: Seasonality of births in human populations. Soc Biol. 1991, 38 (1–2): 5178.PubMedGoogle Scholar
 Bobak M, Gjonca A: The seasonality of live birth is strongly influenced by sociodemographic factors. Hum Reprod. 2001, 16 (7): 15121517. 10.1093/humrep/16.7.1512.View ArticlePubMedGoogle Scholar
 Gneiting T, Raftery AE, WestveldIII AH, Goldman T: Calibrated probabilistic forecasting using ensemble model output statistics and minimum CRPS estimation. Mon Weather Rev. 2004, 133: 10981118.View ArticleGoogle Scholar
Prepublication history
The prepublication history for this paper can be accessed here: http://www.biomedcentral.com/14712288/13/67/prepub
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.