Simple estimators of the intensity of seasonal occurrence

Brookhart, M Alan; Rothman, Kenneth J

doi:10.1186/1471-2288-8-67

Technical advance
Open access
Published: 22 October 2008

Simple estimators of the intensity of seasonal occurrence

M Alan Brookhart¹ &
Kenneth J Rothman²

BMC Medical Research Methodology volume 8, Article number: 67 (2008) Cite this article

4373 Accesses
34 Citations
Metrics details

Abstract

Background

Edwards's method is a widely used approach for fitting a sine curve to a time-series of monthly frequencies. From this fitted curve, estimates of the seasonal intensity of occurrence (i.e., peak-to-low ratio of the fitted curve) can be generated.

Methods

We discuss various approaches to the estimation of seasonal intensity assuming Edwards's periodic model, including maximum likelihood estimation (MLE), least squares, weighted least squares, and a new closed-form estimator based on a second-order moment statistic and non-transformed data. Through an extensive Monte Carlo simulation study, we compare the finite sample performance characteristics of the estimators discussed in this paper. Finally, all estimators and confidence interval procedures discussed are compared in a re-analysis of data on the seasonality of monocytic leukemia.

Results

We find that Edwards's estimator is substantially biased, particularly for small numbers of events and very large or small amounts of seasonality. For the common setting of rare events and moderate seasonality, the new estimator proposed in this paper yields less finite sample bias and better mean squared error than either the MLE or weighted least squares. For large studies and strong seasonality, MLE or weighted least squares appears to be the optimal analytic method among those considered.

Conclusion

Edwards's estimator of the seasonal relative risk can exhibit substantial finite sample bias. The alternative estimators considered in this paper should be preferred.

Peer Review reports

Background

In a classic paper, Edwards [1] describes a geometrically motivated, moment-based method to fit a sine curve to a time series of square-root transformed monthly frequencies. From this basic framework, he derived both a test of the null hypothesis of no seasonality and an estimator of the intensity of seasonal occurrence (i.e., the peak-to-low ratio of the fitted sine curve). Owing to its intuitive appeal and computational simplicity, Edwards's and related methods have been widely used in epidemiology in studies of seasonality, e.g., [2–7].

Although there has been considerable discussion of the hypothesis testing procedure described by Edwards and a variety of alternative tests have been proposed [8–12], there has been relatively little discussion of the properties of Edwards's estimator of the intensity of seasonal occurrence. St. Leger discusses some computational difficulties involved with maximum likelihood estimation of the parameters in Edwards's model [13]. Nam compared the performance of the MLE with a moment-based "locally reasonable" estimator, similar to Edwards's estimator, and concluded that the MLE was preferable when the seasonal trend was strong [14].

In this paper, we review various approaches to the estimation of the intensity of seasonal occurrence, including Edwards's methods, least squares, weighted least squares, and the MLE. We then propose a new closed-form moment estimator of the peak-to-low ratio based on non-transformed data and a second-order moment statistic. Through an extensive Monte-Carlo simulation study, we compare the finite sample performance of the estimators discussed in this paper across a variety of data generating distributions, including some that involve overdispersion and autocorrelation of the outcome and thus depart from the assumed model. All estimators and confidence interval procedures discussed in this paper are applied in a reanalysis of data on the seasonal incidence of monocytic leukemia.

Methods

Data and Probability Model

Edwards's approach is used to study the seasonality of rare events that arise from an underlying non-homogeneous Poisson process with a rate given by the periodic function

λ(t) = μ{1 + αcos(2πt + θ)},

where μ is the total number of expected events in the year, t is the time in years, θ is the phase angle, and α is the hemi-amplitude of the periodic process.

We consider the situation in which the year is divided into k equally-sized intervals and aggregate data are available on the frequency of events occurring in each interval across T years. We denote the observed frequencies with N _i, i = 1, ..., k.

Edwards's probability model for these data is a discrete approximation to the non-homogeneous Poisson process and models the observed counts as independent Poisson random variables with mean given by the periodic function

m_{i} = \frac{n}{k} {1 + α \cos [\frac{2 π}{k} (i - ϕ - 0.5)]},

(1)

where i is the interval (e.g., quarter, month, week), and φ + 0.5 is the time of peak incidence. The parameter n is the total expected number of events across all years, i.e., n = μT.

In this paper, we focus on the estimation of the peak-to-low ratio of the process, also termed the intensity of seasonal occurrence or seasonal relative risk, and is given by

R = \frac{1 + α}{1 - α} .

Edwards's Method

Edwards derives an estimator for α by first computing the distance from the origin to a re-scaled center of gravity of k point masses of weight $\sqrt{N_{i}}$ placed on the rim of a unit circle at angles θ _i= 2πi/k, i = 1, ..., k. Using a first-order Taylor series expansion he derives an expected value for this quantity that depends on the true α. Setting the distance from the origin to the center of gravity equal to its expected distance and solving for α, Edwards derives a moment-based estimator for a given by

{\hat{α}}_{E} = \frac{4 \sqrt{{(\sum_{i = 1}^{k} \sqrt{N_{i}} \sin (θ_{i}))}^{2} + {(\sum_{i = 1}^{k} \sqrt{N_{i}} \cos (θ_{i}))}^{2}}}{\sum_{i = 1}^{k} \sqrt{N_{i}}}

Using the fact that the variance of $\sqrt{N_{i}}$ is approximately $\frac{1}{4}$ , he shows that the approximate variance for $\hat{α}$ _Eis 2/N where N = ΣN _i. Edwards estimates R by replacing α with $\hat{α}$ _E, i.e.,

{\hat{R}}_{E} = \frac{1 + {\hat{α}}_{E}}{1 - {\hat{α}}_{E}} .

In the subsequent sections, we borrow this geometric framework todevelop alternative estimators of α and R.

Moment-based Estimation of a Using Non-transformed Data

We consider two new estimators of α. Instead of basing these on square-root transformed data, we use the data in their original scale. The first estimator of α that we consider depends on the distance from the origin to the center of gravity of k masses of weight N _ieach placed on the rim of a unit circle in direction θ _i= 2πi/k, i.e.,

D = \sqrt{{(\frac{1}{k} \sum_{i = 1}^{k} N_{i} \sin (θ_{i}))}^{2} + {(\frac{1}{k} \sum_{i = 1}^{k} N_{i} \cos (θ_{i}))}^{2}} .

Let D _y= $\frac{1}{k} \sum_{i = 1}^{k} N_{i}$ sin(θ _i) be the vertical component and D _x= $\frac{1}{k} \sum_{i = 1}^{k} N_{i}$ cos(θ _i) be the horizontal component of the distance from origin to the center of gravity of the k masses. Let N = ΣN _i. From the exact expressions for E[D _x|N] and E[D _y|N] (see Additional file 1), a first-order approximation for E[D|N] is given by:

E [D | N] \approx \sqrt{E {[D_{x} | N]}^{2} + E {[D_{y} | N]}^{2}} = \frac{N α}{2 k} .

Setting D equal to E[D|N] and solving for α yields the following moment-based estimator for α:

{\hat{α}}_{D} = \frac{2 k D}{N} .

(2)

This estimator is the same as Nam's locally reasonable estimator [14]. It can also be derived from least-squares estimation of the parameters of the periodic model:

N _i= β ₀ + β ₁ sin(θ _i) + β ₂ cos(θ _i) + ε _i,

from which R is estimated as:

{\hat{R}}_{L S} = \frac{{\hat{β}}_{0} + \sqrt{{\hat{β}}_{1}^{2} + {\hat{β}}_{2}^{2}}}{{\hat{β}}_{0} - \sqrt{{\hat{β}}_{1}^{2} + {\hat{β}}_{2}^{2}}} .

This relation also suggests a two-step weighted least-squares estimator of R. In the first step, least squares is used to estimate the parameters in (3) and then predicted values of each ${\hat{N}}_{i}$ are generated. In the second step, the parameters of (3) are estimated using weighted least squares with weights given by w _i= 1/ ${\hat{N}}_{i}$ . The optimality of these weights assumes that the variance of N _iis equal to the expected value of N _i. This procedure could be iterated until the estimates and weights converge.

The second estimator of α that we consider is based on the second-order moment statistic D ². This statistic is appealing because we can express the expected value of E[D ²|N] exactly, whereas E[D|N] is only available to a first-order approximation. Using the exact expressions for $E [D_{y}^{2} | N]$ and $E [D_{x}^{2} | N]$ (see Additional file 1), we see that

E [D^{2} | N] = E [D_{x}^{2} | N] + E [D_{y}^{2} | N] = \frac{N}{k^{2}} {1 - \frac{α^{2}}{4} + \frac{α^{2} N}{4}} .

Solving this expression for α yields the estimator

{\hat{α}}^{*} = 2 \sqrt{\frac{D^{2} k^{2} - N}{N (N - 1)}} .

When D ² is less than N/k ² (the expected value of E[D ²|N] at α = 0), this estimator results in invalid (imaginary) estimates of α. To remedy this, we propose the following modified estimator

{\hat{α}}_{D 2} = 2 \sqrt{\frac{D^{2} k^{2} - N f}{N (N - 1)}},

where

f = \frac{D^{2} k^{2} / N}{1 + D^{2} k^{2} / N} .

This modification insures that the quantity inside the square root is always greater than or equal to zero. For small values of D ², ${\hat{α}}_{D 2} \approx {\hat{α}}_{D}$ . As D ² increases, f converges to 1 and $\hat{α}$ _D2corresponds to the estimator using the exact expression for E[D ²|N].

Given an estimate of α, R can be estimated by substituting $\hat{α}$ into the formula that relates R to α:

\hat{R} = \frac{1 + \hat{α}}{1 - \hat{α}} .

Ratio estimators such as $\hat{R}$ are known to be biased upwards, particularly with sparse data. Later we discuss a bias-correction term for this estimate of R.

Confidence Intervals for R

Constructing confidence intervals for R is problematic because the null value lies on the boundary of the points of support for R. Frangakis and Varadhan recently proposed an approach for computing exact confidence limits for the seasonal relative risk derived from simulation and maximum likelihood estimation of parameters in a circular normal probability model.[19] Their approach can be adapted to estimate confidence intervals for any of the moment estimators proposed in this paper.

The approach involves finding the roots of the function h(R) = | $\hat{R}$ - R| - q(R; α), where q(R; α) is the 1 - α quantile of | $\hat{R}$ - R|. Note that q(R; α) depends on a particular estimator, although we do not make this explicit in the notation. The lower confidence limit is either zero or the value of the smaller root, whichever is larger. The upper confidence limit is the value of the larger root. Since q cannot be expressed in closed form, it is estimated via simulation. For a given value of R, data are simulated from the probability model and | $\hat{R}$ - R| is computed for each simulated data set. In the simulation, the parameter φ can be held fixed at its estimated value. The value of q(R; α) is then estimated by taking the empirical 1 - α quantile of the simulated values of q. The roots of h can be found by using an iterative algorithm.

For the estimators considered in this paper, it is possible that the function h will have only one root. This situation occurs when the number of events is small and/or the seasonality is strong enough so that no upper bound can be placed on the strength of seasonality (the fitted trough of the sine curve is close to zero). When only a single root is found, we set the upper confidence limit to infinity.

While this approach yields confidence intervals that are correct under the assumed probability model, it is computationally intensive and requires specialized software. We also consider a simple ad hoc approach for the estimation of approximate confidence limits for R. This approach is based on a normal approximation to the sampling distribution of log( $\hat{R}$ ). We enforce the boundary constraint by truncating the lower confidence limit at one. This procedure yields a lower limit given by:

{\hat{R}}_{L} = \max (\exp [\ln (\hat{R}) - Z_{1 - α / 2} \hat{SE} (\ln (\hat{R}))], 1) .

The upper limit is unbounded and given by

{\hat{R}}_{U} = \exp [\ln (\hat{R}) - Z_{1 - α / 2} \hat{SE} (\ln (\hat{R}))] .

A first-order Taylor series approximation for the standard error for the sampling distribution of log( $\hat{R}$ ) is given by

\hat{SE} (\ln (\hat{R})) \approx \frac{2 \sqrt{V \hat{A} R} (\hat{α})}{(1 + \hat{α}) (1 - \hat{α})} .

For all estimators, $V \hat{A} R (\hat{α}) \approx 2 / N$ .

Simulation Study

We compared the various estimators discussed in this paper in a comprehensive Monte Carlo simulation study. Initially, we set k = 12 (corresponding to monthly observations) with n = 150, n = 500, and n = 2500. For each setting of k and n, we simulated data for values of R ranging from 1.05 to 3.05 in increments of 0.25.

For each simulated data set, we evaluate the following five estimators of R:

1. $\hat{R}$ _E: an estimate of $\hat{R}$ using Edwards's estimator of α,

2. $\hat{R}$ _LS: an estimate of R using least squares,

3. $\hat{R}$ _D2: an estimate of R using $\hat{α}$ _D2,

4. $\hat{R}$ _WLS: an estimate of R using weighted least squares,

5. $\hat{R}$ _MLE: an estimate of R using the maximum likelihood estimate of α.

We consider various perturbations of these baseline parameters in sensitivity analyses. First, we set k = 52 (corresponding to weekly observations) with n = 1000, n = 5000 and n = 10000. We also simulated data under two different probability models that departed from the assumed model: 1) a negative binomial model with the mean given by Edwards's model (1), but in which the counts were overdispersed with variance given by VAR[N _i] = 1.5E[N _i]; and 2) a model that generated data with a marginal mean given by Edwards's model, but in which the counts were strongly autocorrelated and overdispersed. We created autocorrelation and overdispersion among the observations by simulating N ₁ using Edwards's model, and then generating each N _i, i = 2, ..., k by simulating Q _ifrom Edwards model and then letting N _i= Q _i+ 0.1{E[N _i-1] - N _i-1}.

Additionally, we use the simulation results to evaluate the adequacy of the ad hoc confidence interval procedure suggested in section 2.4. For each simulated data set, we compute a 95% confidence interval for $\hat{R}$ _D2, $\hat{R}$ _WLS, and $\hat{R}$ _MLEand record the relative frequency of estimated confidence intervals that contain the true parameter.

Computation

All simulations were performed in SAS V9.1 running on a Windows XP platform using software created by the authors. The maximum likelihood estimates were found using PROC NLMIXED in which the likelihood function (conditional on N) is maximized using a Newton-Raphson algorithm with a line search and boundary constraint (see Additional file 2 for example program). For the Monte Carlo simulation study, the true parameter value was used as the starting point for the maximization routine. The weighted least-squares estimates were obtained in a two-step procedure using PROC GENMOD.

Results

In table 1, we report the bias and MSE from the baseline simulation. For all values of n and R, the new estimator $\hat{R}$ _D2had the smallest bias of all those considered. For n = 150, $\hat{R}$ _D2also had the smallest MSE for all values of R. For n = 500 and n = 2500, $\hat{R}$ _D2had minimal or close to minimal MSE for smaller values of R (R < 1.85); however, for large values of R, $\hat{R}$ _WLSand $\hat{R}$ _MLEwere better from the MSE perspective. The MSE of the estimator $\hat{R}$ _LSwas similar, but sometimes slightly larger, than that of $\hat{R}$ _WLS. Edwards's estimator was the most biased and had the largest MSE for all values of n and R. All estimators evaluated were biased upwards for values of R close to unity, a consequence of the behavior of the estimators near the boundary.

Table 1 Estimated bias and MSE for each estimator from the baseline simuala-tion for n = 150, 500, and 2500 based on 1,000 simulated datasets

Full size table

In the sensitivity analyses in which we generated overdispersed and auto-correlated data, the same essential patterns prevailed. The bias of $\hat{R}$ _D2was minimal for all values of R in each scenario. Edwards's estimator was the most biased and had the largest MSE for all values of n and R. In these simulations that depart from the assumed model, the MSE of $\hat{R}$ _WLSwas better than $\hat{R}$ _MLEfor certain values of R and n. This result is likely due to the fact that the MLE is not based on the probability model used to generate the data. In table 2, we report the MSE of $\hat{R}$ _D2relative to $\hat{R}$ _WLSfor the overdispersed and auto correlated data-generating distributions, respectively. In these figures, relative MSEs below 1 indicate that $\hat{R}$ _D2is preferable from the MSE perspective. Both figures reveal that the relative MSE increases with R. For small values of R and n, $\hat{R}$ _D2is preferable. For larger values of R, $\hat{R}$ _WLSwas preferable. These results were the most pronounced in the setting of autocorrelated data. The MSE of $\hat{R}$ _D2was never more than 13% greater than $\hat{R}$ _WLS; however, it was nearly half as much for small values of R. For the simulations in which k = 52, the estimator $\hat{R}$ _D2continued to be the least biased, but there was little difference in MSE between $\hat{R}$ _D2, $\hat{R}$ _WLS, and $\hat{R}$ _MLEin terms of MSE across all values of R.

Table 2 Relative mean squared error of $\hat{R}$ _D2to $\hat{R}$ _WLS

Full size table

In table 3, we report the estimated coverage probabilities for the ad hoc confidence intervals computed for the estimators $\hat{R}$ _D2, $\hat{R}$ _LS, and $\hat{R}$ _WLS. The actual coverage probabilities are close to correct, usually within one to two percentage points of the nominal 95%. The coverage probabilities for these confidence intervals in the setting of autocorrelation and overdispersion was substantially lower, with actual coverage probabilities ranging from 87% to 96%.

Table 3 Percentage of estimated ad hoc 95% confidence intervals that cover the true parameter

Full size table

As a side note, the algorithm that we used to find the MLE experienced convergence problems close to R = 1. For R = 1.05, the MLE failed to converge in roughly 20% of the simulated data sets. This problem diminished as R increased. For R = 1.5 the MLE was located for 95% of the simulated data sets. This is likely to be a result of near non-identifiability of φ when the seasonality is weak. More computationally-intensive approaches, such as a grid search, might alleviate this problem; however, in the context of a simulation study, we required an approach that could converge rapidly. For all results discussed below, we excluded simulated data sets for which the MLE was not found. We found that the simulation results for the non-missing estimators were largely unaffected by the inclusion/exclusion of the simulations for which the MLE was not located.

Example: Seasonality of Monocytic Leukemia

We compared the estimators proposed in this paper with the MLE and the estimator of Edwards through a re-analysis of data on the seasonal incidence of monocytic leukemia in England and Wales from 1974–1998 (N = 2311, k = 12) with monthly counts given as (203, 203, 197, 206, 204, 216, 165, 161, 177, 179, 200, 200). We used data from the Office of National Statistics as reported by Eatough [7]. In Table 4, we report the point estimate and approximate 95% confidence limits corresponding to each of the five estimators considered in the simulation study. We also present the confidence limits computed using the method of Frangakis and Varadhan [19]. These confidence intervals could not be computed for the MLE because the convergence problems experienced by the maximization algorithm made the computation of q infeasible.

Table 4 Estimated peak-to-low ratio and 95% CI for the seasonal incidence of monocytic leukemia in England and Wales (1974–98) using four different estimators and two confidence interval procedures

Full size table

The different estimators do not lead to substantively different interpretations of the data. Nevertheless, consistent with the results of the simulation, the estimators $\hat{R}$ _D2are smaller than R _LSand Edwards estimator. Given the large number of events and the fact that the data exhibit only moderate seasonality, the simulation study suggests that Edwards estimator should be only moderately biased for these data. The confidence intervals computed by the ad hoc confidence interval procedure were nearly identical to those of Frangakis and Varadhan.

Discussion

In this paper we have proposed a new estimator of the peak-to-low ratio of a periodic process and compared it to several alternative estimators, including Edwards's estimator, the MLE, and weighted least squares. Studies employing Edwards's method often involve very rare events and moderate seasonality. For these studies, the estimator proposed in this paper appears to be optimal. It has less bias and a smaller MSE than any of the estimators considered, including the MLE and weighted least squares. Weighted least squares was preferable from a MSE perspective in the setting of frequent outcomes or strong seasonality. We speculate that the simple estimator proposed in this paper improves upon the estimator of Edwards and the other moment-based estimator because it is based on an exact rather than an approximate expression for the distance from the origin to the center of gravity. We further speculate that the bias and inefficiency in the MLE is due to the small event rates considered in this paper.

The ad hoc confidence interval procedure that we evaluated performed reasonably well for data generated from Edwards's probability model. If more precise confidence intervals are needed, the computationally-intensive approach proposed by Frangakis and Varadhan can be employed [19]. Users should be aware that both of the confidence intervals considered in this paper are model based. If the underlying model is wrong, for example, in the setting of strongly autocorrelated or overdispersed data, the true coverage probabilities may differ from the nominal 95%.

Because ratio estimators are known to be biased upwards, particularly with sparse data, we also considered a bias-corrected estimator based on the expected value of a second-order Taylor series expansion of (1 + $\hat{α}$ )/(1 - $\hat{α}$ ) around α given by

\begin{matrix} E [\frac{1 + \hat{α}}{1 - \hat{α}}] \approx \frac{1 + α}{1 - α} + \frac{2 VAR [\hat{α}]}{{(1 - α)}^{3}} \\ = \frac{1 + α}{1 - α} (1 + \frac{2 VAR [\hat{α}]}{(1 + α) {(1 - α)}^{2}}) . \end{matrix}

This approximation led to the following bias-corrected estimator of R:

\hat{R} = \frac{1 + \hat{α}}{1 - \hat{α}} {(1 + \frac{2 V \hat{A} R [\hat{α}]}{(1 + \hat{α}) {(1 - \hat{α})}^{2}})}^{- 1} .

(4)

We found that estimators based on this correction factor tended to be somewhat over-corrected, possibly because they are based on an approximation of the variance of $\hat{α}$ .

One important limitation of the estimators proposed in this paper is that they are based on the assumption of a single cyclical effect (harmonic) that can be well approximated by a sine curve. For more complex data, with multiple periodic components or a linear trend, alternative statistical methods should be used. For such data there exist more complex harmonic models [20, 12], spectral methods [21], and various periodic regression models. Also, we outline an approach to estimating seasonal intensity using a periodic generalized linear model that assumes a log link and a Poisson distributed outcome (see Additional file 3). This approach is based on a different model for the mean, i.e., that the log of the expected value of the counts is a sinusoidal function. However, it allows for the inclusion of covariates and extends naturally to variably-sized intervals through use of a Poisson offset.

Edwards's method has been widely used in epidemiology in studies of seasonality. In this paper we have shown that Edwards's estimator of the seasonal relative risk can be substantially biased. The estimator proposed in this paper represents a straightforward modification of Edwards's estimator. Like that of Edwards, it is a simple estimator that is available in closed form. For modest seasonality and small numbers of events, this estimator appears to have the best finite sample performance characteristics of those estimators considered.

For more frequent events or stronger seasonality, the weighted least-squares approach discussed in this paper is preferable and is easily implemented using standard statistical software.

References

Edwards JH: The recognition and estimation of cyclical trends. Ann Hum Genet. 1961, 25: 83-86. 10.1111/j.1469-1809.1961.tb01501.x.
Article CAS PubMed Google Scholar
Yamaguchi S, Dunga A, Broadhead RL, Brabin B: Epidemiology of measles in Blantyre, Malawi: analyses of passive surveillance data from 1996 to 1998. Epidemiology and Infection. 2002, 129 (2): 361-369. 10.1017/S0950268802007458.
Article CAS PubMed PubMed Central Google Scholar
Mamoulakis C, Antypas S: Cryptorchidism: seasonal variations in Greece do not support the theory of light. Andrologia. 2002, 34 (3): 194-203. 10.1046/j.1439-0272.2002.00492.x.
Article PubMed Google Scholar
Ajdacic-Gross V, Wang J, Bopp M, Eich D, Rossler W, Gutzwiller F: Are seasonalities in suicides dependant on suicide method?. A reappraisal. Social Science and Medicine. 2003, 57 (7): 1173-1181. 10.1016/S0277-9536(02)00493-8.
Article PubMed Google Scholar
Anderka M, Declercq E, Wendy S: A time to be born. Am J Public Health. 2000, 90 (1): 124-126. 10.2105/AJPH.90.1.124.
Article CAS PubMed PubMed Central Google Scholar
Seretakis D, Lagiou P, Lipworth L, Signorello LB, Rothman KJ, Trichopoulos D: Changing seasonality of mortality from coronary heart disease. JAMA. 1997, 278 (12): 1012-1014. 10.1001/jama.278.12.1012.
Article CAS PubMed Google Scholar
Eatough JP: Evidence of seasonality in the diagnosis of monocytic leukaemia. Brit J Cancer. 2002, 87 (5): 509-510. 10.1038/sj.bjc.6600497.
Article CAS PubMed PubMed Central Google Scholar
Hewitt D, Milner J, Cisma A, Pakula A: On Edwards's criterion of seasonality and a non-parametric alternative. Brit J Prev Soc Med. 1971, 25: 174-176.
CAS Google Scholar
Roger JH: A significance test for cyclic trends in incidence data. Biometrika. 1977, 64: 152-155. 10.1093/biomet/64.1.152.
Article Google Scholar
Rogerson P: A generalization of Hewitt's test for seasonality. Int J Epidemiol. 1996, 25: 644-648. 10.1093/ije/25.3.644.
Article CAS PubMed Google Scholar
Walter S, Elwood J: A test for seasonality of events with a variable population at risk. Br J Prev Soc Med. 1975, 29: 18-21.
CAS PubMed PubMed Central Google Scholar
Jones RH, Ford PM, Hamman RF: Seasonality comparisons among groups using incidence data. Biometrics. 1988, 44: 1131-1144. 10.2307/2531741.
Article CAS PubMed Google Scholar
St Leger AS: Comparison of two tests for seasonality in epidemiological data. Appl Statist. 1976, 25 (3): 280-286. 10.2307/2347236.
Article Google Scholar
Nam J: Efficient method for identification of cyclic trends in incidence data. Communications in Statistics-Theory and Methods. 1983, 12 (9): 1053-1068. 10.1080/03610928308828515.
Article Google Scholar
Rothman KJ: Episheet: Spreadsheets for the analysis of epidemiologic data. [http://www.drugepi.info/links/downloads/episheet.xls]
Ihaka R, Gentleman RR: A Language for Data Analysis and Graphics. Journal of Computational and Graphical Statistics. 1996, 5: 299-314. 10.2307/1390807.
Google Scholar
R Development Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2003, ISBN 3-900051-00-3,, [http://www.R-project.org]
Google Scholar
Savitzky A, Golay MJE: Smoothing and differentiation of data by simplified least squares procedures. Anal Chem. 1964, 36: 1627-1639. 10.1021/ac60214a047.
Article CAS Google Scholar
Frangakis CE, Varadhan R: Confidence intervals for seasonal relative risk with null boundary values. Epidemiology. 2002, 13: 734-737. 10.1097/00001648-200211000-00022.
Article PubMed Google Scholar
Pocock SJ: Harmonic analysis applied to seasonal variations in sickness absence. App Stat. 1974, 103-120. 10.2307/2346992.
Google Scholar
Chatfield C: The analysis of time series: an introduction. 1996, St Edmundsbury Press Ltd, Suffolk, Fifth
Google Scholar

Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/8/67/prepub

Download references

Acknowledgements

The authors are grateful for the helpful comments of Tim Lash and Claus Dethlefsen. M. Alan Brookhart is supported by a career development grant from the National Institute on Aging (AG-027400).

Author information

Authors and Affiliations

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital & Harvard Medical School Boston, MA, USA
M Alan Brookhart
RTI Health Solutions Research Triangle Park, NC, USA
Kenneth J Rothman

Authors

M Alan Brookhart
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth J Rothman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M Alan Brookhart.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

KR conceived the project. Both authors contributed equally to evaluation and development of statistical methodology. MB carried out programming, simulation, and data analysis. MB drafted the manuscript. Both authors read and approved the final manuscript.

Electronic supplementary material

12874_2007_303_MOESM1_ESM.pdf

Additional file 1: Derivations. The file provides mathematical derivations of several expressions referenced in the paper. (PDF 26 KB)

12874_2007_303_MOESM2_ESM.pdf

Additional file 2: SAS Program. This file provides the SAS program used to located themaximum likelihood estimate of Edwards's model. (PDF 13 KB)

12874_2007_303_MOESM3_ESM.pdf

Additional file 3: Periodic generalized linear model approach to estimating seasonal intensity. The file outlines an approach to estimating the peak-to-low ratio using a periodic generalized linear model. (PDF 22 KB)

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Brookhart, M.A., Rothman, K.J. Simple estimators of the intensity of seasonal occurrence. BMC Med Res Methodol 8, 67 (2008). https://doi.org/10.1186/1471-2288-8-67

Download citation

Received: 21 December 2007
Accepted: 22 October 2008
Published: 22 October 2008
DOI: https://doi.org/10.1186/1471-2288-8-67

Simple estimators of the intensity of seasonal occurrence

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Data and Probability Model

Edwards's Method

Moment-based Estimation of a Using Non-transformed Data

Confidence Intervals for R

Simulation Study

Computation

Results

Example: Seasonality of Monocytic Leukemia

Discussion

References

Pre-publication history

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

12874_2007_303_MOESM1_ESM.pdf

12874_2007_303_MOESM2_ESM.pdf

12874_2007_303_MOESM3_ESM.pdf

Rights and permissions

About this article

Cite this article

Keywords

BMC Medical Research Methodology

Contact us

Simple estimators of the intensity of seasonal occurrence

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Data and Probability Model

Edwards's Method

Moment-based Estimation of a Using Non-transformed Data

Confidence Intervals for R

Simulation Study

Computation

Results

Example: Seasonality of Monocytic Leukemia

Discussion

References

Pre-publication history

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

12874_2007_303_MOESM1_ESM.pdf

12874_2007_303_MOESM2_ESM.pdf

12874_2007_303_MOESM3_ESM.pdf

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Research Methodology

Contact us