 Research article
 Open Access
 Published:
Direct modelling of age standardized marginal relative survival through incorporation of timedependent weights
BMC Medical Research Methodology volume 21, Article number: 84 (2021)
Abstract
Background
When quantifying the probability of survival in cancer patients using cancer registration data, it is common to estimate marginal relative survival, which under assumptions can be interpreted as marginal net survival. Net survival is a hypothetical construct giving the probability of being alive if it was only possible to die of the cancer under study, enabling comparisons between populations with differential mortality rates due to causes other the cancer under study. Marginal relative survival can be estimated nonparametrically (Pohar Perme estimator) or in a modeling framework. In a modeling framework, even when just interested in marginal relative survival it is necessary to model covariates that affect the expected mortality rates (e.g. age, sex and calendar year). The marginal relative survival function is then obtained through regression standardization. Given that these covariates will generally have nonproportional effects, the model can become complex before other exposure variables are even considered.
Methods
We propose a flexible parametric model incorporating restricted cubic splines that directly estimates marginal relative survival and thus removes the need to model covariates that affect the expected mortality rates. In order to do this the likelihood needs to incorporate the marginal expected mortality rates at each event time taking account of informative censoring. In addition timedependent weights are incorporated into the likelihood. An approximation is proposed through splitting the time scale into intervals, which enables the marginal relative survival model to be fitted using standard software. Additional weights can be incorporated when standardizing to an external reference population.
Results
The methods are illustrated using national cancer registry data. In addition, a simulation study is performed to compare different estimators; a nonparametric approach, regressionstandardization and the new marginal relative model. The simulations study shows the new approach is unbiased and has good relative precision compared to the nonparametric estimator.
Conclusion
The approach enables estimation of standardized marginal relative survival without the need to model covariates that affect expected mortality rates and thus reduces the chance of model misspecification.
Background
When quantifying the probability of survival in cancer patients using cancer registration data it is common to estimate marginal relative survival, which under assumptions can be interpreted as marginal net survival [1, 2]. Net survival is a hypothetical construct giving the probability of being alive if it was only possible to die of the cancer under study. In these studies cause of death information is either unavailable or, more commonly, deemed unreliable [3, 4] and so expected mortality rates are incorporated so that mortality in excess of that expected can be estimated.
Relative survival is likely to vary by demographic factors such as age and sex, and disease severity, but is often reported as a marginal estimate giving the average survival in the study population. To ensure fair comparisons between population groups agestandardization is generally performed [5] where a common age distribution is forced on the groups being compared. The arguments for standardization could also apply to other covariates that vary between groups being compared.
Marginal relative survival can be estimated nonparametrically using the Pohar Perme estimator [6]. In order to agestandardize when using the Pohar Perme estimator it is usual to obtain separate estimates for different age groups, with the marginal estimate obtained through a weighted average of the age group specific effects. An alternative is to up or down weight individuals relative to a reference population [7, 8]. Standardization can also be performed using a modeling approach [9–11], which allows estimation of conditional effects and well as marginal estimates, through regression standardization [2, 12].
In a standard survival model, i.e. not in the relative survival framework,if one is just interested in an estimate of marginal survival, then it is not necessary to model any covariates. This means that for a reasonable parametric model with no covariates the estimated survival function should be in good agreement with the nonparametric KaplanMeier curve. However, in the relative survival framework there will be a difference between the estimated relative survival from a model with no covariates and a nonparametric (Pohar Perme) estimate. This is due to need to incorporate expected mortality rates that vary between individuals (by age, sex, calendar year etc) in the relative survival model. Thus, red when using a relative survival model to estimate, marginal relative survival, then covariates that affect expected mortality rates should be modelled so that the marginal effect can then be estimated through regression standardization [2, 12].
The need to model covariates that affect expected mortality rates, even when interest only lies in marginal relative survival, increases the likelihood of model misspecification, particularly as it is very common to have to model timedependent effects (nonproportional excess hazards). In this paper we propose a model that directly estimates marginal relative survival and thus removes the need to model variables that reflect the variation in expected mortality rates. In the “Methods” section we define the conditional and marginal models and describe how incorporation of individual level weights allows external age standardization and enable covariates to be incorporated. The “Results” section includes a simulation study evaluating statistical properties of different estimators and illustrates the methods using an example of individuals diagnosed with melanoma. The paper is concluded with a discussion.
Methods
In the relative survival framework it is assumed that the overall allcause mortality rate, h(tX_{i}), for an individual with covariate pattern X_{i}, is the sum of the expected mortality rate, h^{∗}(tX_{i}), and the excess mortality rate, λ(tX_{i}).
For simplicity it is assumed that covariates, X_{i}, are the same for the expected and excess mortality rates, but this can be relaxed. Expected mortality rates are stratified by age, sex, calendar year and potentially other demographic covariates. The relative survival for covariate pattern X_{i} is,
The marginal relative survival, R_{m}(tX), is the expectation over covariates, X, i.e. E_{X}[R(tX)]. This can be estimated in a modelling framework when incorporating these covariates, X, by averaging the individual estimates, \(\widehat {R}(t\pmb {X}_{i})\).
The marginal excess mortality rate function, λ_{m}(tX) can be obtained through the usual transformation from survival to hazard function, \(h(t) = \frac {d \ln [S(t)]}{dt}\).
and can be estimated in a parametric modeling framework when modeling covariates X by,
A conditional model with no covariates for the excess mortality rate
Consider the conditional model in Eq. (1) without including covariates X_{i} for the excess mortality rate.
This assumes that the excess mortality rate, λ(t), is the same for all individuals. This would mean that allcause mortality rate would vary between individuals only due to variation in the expected mortality rates and not variation in excess mortality rates. This is different from that defined in Eq. (2) where the individual excess mortality rates vary between individuals.
Likelihood
We adopt a fully parametric approach, so will model how excess mortality rates vary over time from diagnosis and by covariates. For an observed allcause survival/censoring time, t_{i} and event indicator for death due to any cause, d_{i}, the loglikelihood contribution of the i^{th} individual with covariate pattern, X_{i}, for a relative survival model is
where Λ(tX_{i},β,γ) is the cumulative excess mortality function with parameters, β, modelling covariate effects and, γ, modeling the effect of time from diagnosis [11, 13].
For a marginal model with no covariates the marginal excess mortality rate, λ_{m}(tX), as defined in Eq. (2), needs to be directly estimated; X thus denotes covariates that can impact on both expected and excess mortality rates. Rather than incorporate, h^{∗}(t_{i}X_{i}), the individual expected hazard for the i^{th} individual at time t_{i}, a suitable estimate of the marginal expected mortality rate needs to be incorporated. A naive way to do this would be including the mean of h^{∗}(t_{i}X_{i}) among those at risk at time t_{i}. However, net survival is defined in the hypothetical world where it is not possible to die from other causes, but it is estimated in the real world where individuals may die from both their cancer and from other causes. This means that with increasing time from diagnosis individuals with a higher risk of dying from other causes will be underrepresented. This should be taken into account in both the likelihood and when estimating the mean expected mortality rate. A similar idea to that proposed by Pohar Perme et al. [6] for the nonparametric estimator is used by upweighting by the inverse of the expected survival, S^{∗}(tX_{i}), where,
and defining the individual level, timedependent weights \(w_{i}^{*}(t)\) as
The mean expected mortality rate at time t_{i} incorporating weights, \(w_{i}^{*}(t_{i})\), is
where \(\mathcal {R}(t_{i})\) is the set of those at risk at time t_{i}.
The weighted marginal expected mortality rates, \(\bar {h}^{*}(t_{i})\) can then be incorporated into the weighted likelihood rather than h^{∗}(t_{i}X_{i}) together with weights, \(w_{i}^{*}(t_{i})\),
Note that Eq. (4) only needs to be calculated at event times and is not needed for individuals with censored times. The integral in Eq. (5) will generally not be analytically tractable. A numerical integration method, such as Gaussian quadrature, could be incorporated into the estimation process. However, we choose to split the timescale into a number of intervals and assume that the weight is constant within each interval. The likelihood then becomes,
where M_{i} is the number of intervals for the i^{th} subject. An advantage of this approach is that after splitting the timescale and calculating the weights, standard parametric relative survival models can be used. This requires the software to incorporate both weights and left truncation into the likelihood. There needs to be a choice of how finely to split the timescale. As the weights depend upon expected mortality rates, the weights will vary continuously, so a choice needs to be made at what point within the interval to calculate the weight. We use the midpoint of the interval [14]. More time intervals will result in greater precision, but increase computational time. The choice of the number of time intervals is investigated in the example in the “Results” section. The weights vary within individuals and leads to withinsubject correlation. Therefore, a cluster robust sandwich estimator of the variance is used [15]. This is similar to other methods that use timedependent weights, such as the Fine and Gray subhazard model [16] or the parametric equivalent [17].
External agestandardization
In order to compare estimates of marginal relative survival between different population groups it is necessary to agestandardize to the same age distribution. In the nonparametric setting the usual approach is to estimate marginal relative survival separately within age groups and then obtain a weighted average of the agespecific estimates, with weights equal to the proportion within each age group in the reference population. In a modelling framework regression standardization is performed with each individual up or downweighted using the ratio of the proportion in the age group to which the individual belongs and the proportion in the reference age group [2]. A similar idea can be used within the marginal model that enables externally agestandardized estimates to be obtained without the need to model, or stratify by, age.
Let \(p^{a}_{i}\) be the proportion in the age group to which the i^{th} individual belongs and \(p^{R}_{i}\) be the corresponding proportion in the reference population. Weights can be defined to upweight or downweight individual relative to the reference population.
These weights can then be combined with the inverse expected survival weights,
These weights are the same as those defined by Sasieni and Brentnall [7] for use in nonparametric relative survival estimators. The weights need to be used when calculating \(\bar {h}^{*}(t_{i})\) by substituting \(w_{i}^{*}(t)\) for w_{i}(t) in Eq. (4) and in the likelihood in Eq. (6). It is common just to standardize by age, but the approach is applicable when standardizing over multiple covariates.
Modeling covariates
When the aim is to make contrasts between different population groups then covariates can be added to the marginal model. For example, these covariate could be different regions/countries, socioeconomic groups, timeperiods or sexes. As the age distribution may vary between the groups being compared it is important to agestandardize, but now the weights defined in Eq. (7) should be calculated separately within subgroups. In addition, \(\bar {h}^{*}(t_{i})\) should be calculated separately within each subgroup.
Choice of parametric model
The likelihood defined in Eq. (6) could be used for a variety of parametric models. Here we use flexible parametric survival models on the logcumulative excess hazard scale [9] that incorporate restricted cubic splines to model the effect of time from diagnosis. An advantage of modeling on the logcumulative excess hazard scale is that the it provides an analytical form for the cumulative excess hazard that is required for the likelihood in Eq. (6). The model for the log cumulative excess hazard, Λ(tk_{0},γ), where k_{0} is a vector of knots and γ the associated splines parameters, is
where s(ln(t)k_{0},γ) is a restricted cubic spline function of log time. The number of parameters to model the baseline is determined by the number of knots for the restricted cubic spine function with the number of parameters (including the intercept) being equal to the number of knots. Simulation studies have shown that the models give negligible bias when estimating survival functions across a wide range of scenarios [18, 19]. For more details on these models see Royston and Lambert [13].
After fitting a model the estimated marginal relative survival and marginal excess mortality functions can be estimated,
In Appendix I, we describe how a semiparametric marginal model could be fitted through estimation of a separate parameter for each event type using Poisson regression and how this is a equivalent to the Pohar Perme nonparametric estimate when not modeling covariates in order to demonstrate how the methods are related. However, we do not advocate this approach due to the computational intensity of splitting the time scale at unique time points and estimating a separate parameter for each time interval.
Simulation study
A small simulation study was performed to quantify any potential bias in the different methods, the coverage probabilities and the variation in the estimates. We simulate two scenarios where there is substantial variation in relative survival by age, a situation which leads to higher bias for some methods [2]. In Scenario 1 the variation the excess hazard ratio decreases over time and in Scenario 2 we assume the effect of age is proportional over time. The simulation strategy is outlined below.

1
The sample size of each dataset was set at 1000.

2
All subjects were assumed to be male and diagnosed in one calendar year, 2009.

3
Age at diagnosis was sampled from a Normal distribution with mean 66 and standard deviation 13.

4
Times of death due to cancer was generated with the baseline survival (at the mean age) having a Weibull distribution with shape paramater, γ=0.5, and scale parameter, λ=0.2. The hazard function for the Weibull model is λγt^{γ−1}.

5
The effect of age at diagnosis (agediag) varied between the 2 scenarios. Scenario 1 agediag was assumed to be nonproportional over time with an excess hazard ratio of exp(0.1−0.01(agediag−66)t). The timedependent excess hazard ratios and relative survival for selected ages are shown in Fig. 1. Note that the excess hazard ratios get closer to the null as time increases. Scenario 2 agediag was assumed to be nonproportional over time with an excess hazard ratio of exp(0.03(agediag−66)). The excess hazard ratios and relative survival for selected ages are shown in Fig. 1.

6
Times to death from other causes were generated from exponential distributions with mortality rates obtained from expected population mortality rates in England in 2009 [20]. A different rate was used for each year of follow to account for the fact that attained age was increasing. If the simulated value was greater than 1 for any yearly interval then the individual was assumed to be alive at the start of the next interval. If the value was less than one, it was assumed that the individual died in the interval.

7
The observed time to death was taken as the minimum of the cancer specific time to death and the other cause time to death.

8
The following analyses were performed on each simulated data set. All flexible parametric models used 6 knots to model the effect of followup time.

a
The nonparametric Pohar Perme method to estimate marginal relative survival [6].

b
A conditional flexible parametric relative survival model with nocovariates, see Eq. (3).

c
A conditional flexible parametric relative survival model fitting a linear effect of age and assuming proportional hazards, see Eq. (1). The marginal relative survival was estimated using regression standardization.

d
A conditional flexible parametric relative survival model with the effect of age modelled using restricted cubic splines (4 knots) and allowing for nonproportional hazards with an interaction between the age splines and spines variables to model the effect of followup time, see Eq. (1). Marginal relative survival was then estimated using regression standardization.

e
A flexible parametric marginal model to directly estimate marginal relative survival. The timescale was split every 0.2 years to incorporate the weights.

a

9
1000 datasets were generated. In estimation of the coverage this gives a standard error of \(100\sqrt (0.95*0.05/1000) = 0.69\) percentage points.

10
For each method the bias, the mean square error (MSE), the coverage and relative percentage increase in precision compared to the nonparametric Pohar Perme estimator at 1, 5 and 10 years post diagnosis was reported.
Results
Simulation study
The results of the simulation study for scenarios 1 and 2 are shown in Tables 1 and 2 respectively. In both scenarios bias is negligible for the Pohar Perme method, the marginal model and regression standardization when fitting a model with timedependent effects for age with coverage close to 95%. Regression standardization when assuming proportional hazards gives a bias of 0.4, 0.9 and 2.0 percentage points at 1, 5 and 10 years respectively for Scenario 1, illustrating the potential issue of fitting an incorrect model. In Scenario 2 bias was lower as the variation in excess mortality reduced as a function of followup time [2].
Fitting a conditional survival model with no covariates gives a bias of 2.6, 4.9 and 4.6 percentage points at 1, 5 and 10 years respectively for Scenario 1, with bias of 0.0, 2.7 and 4.3 percentage points for Scenario 2.
The marginal relative survival model has a lower MSE at all times when compared to the Pohar Perme method for both scenarios. Regression standardization of the model allowing a nonproportional effect of age has similar MSE to the marginal model at 1 and 5 years, but a lower MSE at 10 years.
Regression standardization with nonproportional hazards and the marginal model had similar relative increases in precision when compared to the Pohar Perme estimator at 1 and 5 years. At 10 years, the marginal model had higher relative precision than the Pohar Perme method, but the increase was notably greater when using regression standardization.
Five (0.5%) of the models with timedependent effect of age in Scenario 1 and 183 (18.3%) in Scenario 2 failed to converge indicating a potential problem in using complex models to model the nonproportional effects of age when there are very few individuals still at risk towards the end of followup.
In summary, the simulation study shows the marginal model has negligible bias and greater precision than the nonparametric Pohar Perme estimator.
Application to melanoma example
The methods are illustrated using data on 4,744 patients diagnosed with melanoma (a type of skin cancer) in a Northern European country between 1985–1994. The data is from a national cancer registry and so attempts to capture all diagnosed cases of Melanoma in the country. These data are distributed with the Stata strs package [21]. The code to implement the methods are shown in Appendix II with datasets freely available.
Figure 2a shows the nonparametric Pohar Perme estimator together with the estimated modeled marginal relative survival. These are internally age standardized estimates as the expectation is over the observed age distribution. Here a model with 6 knots (i.e. five restricted cubic spline variables) were used to model the marginal log cumulative excess hazard. The timescale was split every 0.2 years when incorporating the weights. Agreement can be seen to be very good. A feature of the Pohar Perme estimator is that it is highly variable towards the end of followup. This is due to the impact of the weights, with the small number of older individuals who are still alive having high influence. Also shown on this figure is the conditional model with nocovariates which shows a clear difference when compared to the other methods.
Figure 2b shows the externally agestandardized marginal relative survival using Pohar Perme and the marginal relative survival model. The age groups were <45,45−54,55−64,65−74,75+ years of age with standardization weights as defined in the International Cancer Survival Standard (ICSS) [5]. In order to agestandardize using the PoharPerme estimator separate estimates for each age group were obtained and then a weighted average was calculated. The marginal relative survival model does not need to stratify or model the effect of age due to the incorporation of the weights defined in Eq. (7). The Figure shows that, in this example, external agestandardization makes little difference when compared to the internally agestandardized estimates in Fig. 2a as the reference agedistribution was similar to that observed in the study population.
In Fig. 2 the timescale was split every 0.2 years with the weights taken at the midpoint of each interval and 6 knots (5 spline variables) were used to model the baseline. Table 3 shows the estimated survival at 1, 5 and 10 years using a variety of time splits, at 0.05, 0.1, 0.2, 0.5, 1 and 2.5 years, and various number of knots (4, 6, 8 and 10). There is very little difference in any of the estimates at 1year with the largest difference, 0.002. With increasing time there is more variation, but for timesplits from 0.05 to 0.2 years yields a difference of 0.002 or less at 10 years. Larger timesplits lead to larger differences, which is not surprising as the weights are forced to be constant within each interval.
If the mean expected mortality defined in Eq. (4) is calculated separately for males and females then the effect of sex can be modelled. When doing so one will generally want to agestandardize as the age distributions of the groups being compared could differ. There is a choice here about what age distribution to standardize to. This could be the ICSS weights used above, which would allow comparisons with other studies using the same age standard. Alternatively, if measures more relevant to the study population are required then this could be the joint age distribution of male and females or the age distribution of one of the groups. Here the age distribution of males will be used to illustrate some useful interpretations.
It could be simplistically assumed that the marginal excess mortality rates are proportional. Fitting such a model yields a marginal excess mortality rate ratio for females compared to males of 0.70 (95% CI: 0.59 to 0.83). The assumption of proportional marginal hazards can be relaxed by incorporating an interaction between the effect of being female and the effect of time. This was done using a spline term with 4 knots, i.e. three additional parameters. The excess mortality rates for both the proportional hazards model and nonproportional model are shown in Fig. 3a with the corresponding marginal relative survival functions in Fig. 3b. The curves for each sex are very similar as the proportionality assumption appears reasonable here; (a decrease in loglikelihood of 0.71 with 3 degrees of freedom).
As we have standardized each of these estimates to the age distribution of males the curves give the factual estimates for males, i.e. given their own age distribution and the counterfactual effect for males if they had the same relative survival as females. This is because the agedistribution for males is being applied to the effect of females.
Figure 4 shows the difference in the two marginal relative survival functions with 95% confidence intervals. As the age distribution of males has been used as the age reference, this gives an estimate how much the marginal relative survival would increase if males had the same relative survival as females.
Discussion
We have demonstrated how marginal estimates of relative survival can be directly modeled through calculation of appropriately weighted mean expected mortality rates and introducing weights into the likelihood. We have also shown how additional weights can be included so that agestandardized marginal relative survival is directly estimated without the need to model or stratify by age. In addition we have shown that contrasts of marginal relative survival can be directly estimated from the marginal model. Although the relative survival framework has almost exclusively used within cancer, the methods described here could be utilized in other disease areas [22, 23].
When making comparisons between population groups there is generally a need to agestandardize as in most situations there will be a difference in the age distribution and there is usually a strong relationship between age and relative survival. Introducing individual level relative weights enables external agestandardization without the need to stratify or model age. Rutherford et al recently advocated the use of individual relative weights for the nonparametric Pohar Perme estimator for age standardization when data were sparse [8]. An important choice when age standardizing is in the age distribution used. Using ICSS weights allows comparisons with many other studies using the same weights, but using the agedistribution observed in the study population may be more useful for generalizing to a particular population. As separate effects do not need to be calculated by age group the use of narrower categories of age for the relative weights can be used. For example, in practice the oldest age group when standardizing for most cancer types is 75+. There can be substantial variation in relative survival within this age group and the age distribution within this age group may vary between population groups. Using the weighting approach proposed here one could, for example, apply the five year age groups presented by Corazziari et al. [5].
If just an overall summary of marginal relative survival is needed then the nonparametric Pohar Perme estimate could be used. An advantage of the marginal model is that it has negligible bias, but higher relative precision than then nonparametric approach. Our approach is particularly useful for comparisons between different population groups.
A limitation of the method is that the user has to choose the number of knots for the restricted cubic splines. Previous work has demonstrated that estimates are insensitive to the number of knots, as long as their are enough to capture the underlying shape of the hazard functions [18, 19, 24]. However, a sensitivity analysis, such as that displayed in Table 3 can be reassuring to both the analyst and others. The marginal model approximates the integral defined in the likelihood by splitting the timescale (Eq. 6). The sensitivity analysis shown here shows that as long as the width of the intervals is at 0.2 years or less there is very little impact on the estimates, but smaller intervals would lead to improved accuracy. Although more accurate estimation could be obtained through calculating the integral numerically for each individual in the estimation process, we prefer the split time approach as it enables standard software to be used together with a host of useful postestimation prediction commands.
In observational studies with timetoevent outcomes it is common to use inverse probability weighting (IPW) to adjust for confounders [25]. However, a conditional relative survival model without covariates gives a biased estimate of the marginal survival function (Fig. 2) and thus the IPW approach in not directly transferable to the relative survival framework. However, applying the IPW weights to the marginal model will enable the IPW approach to be used in the relative survival framework and this is being further investigated in ongoing work.
When there is interest in more detailed comparisons between population groups then a conditional model should be used. For example, to quantify how differences in relative survival vary by age. However, when there is only interest in the marginal relative survival and associated contrasts, the model proposed here simplifies the modelling process, has negligible bias and increased precision over the non parametric Pohar Perme estimate.
Conclusion
When only marginal effects are of interest the model described here is particularly useful as it less prone to model misspecification as covariates that are associated with expected mortality rates do not need to be incorporated into the model.
Software
Stata code to setup data and fit the models for the melanoma example is shown in Appendix II of the supplementary material.
Availability of data and materials
The melanoma data used in this example is available in the strs Stata package (see reference 16). Code to fit the models is available in the supplementary material.
Declarations
Abbreviations
 ICSS:

International cancer survival standard
 IPW:

Inverse probability weighting
 MSE:

Mean squared error
References
 1
Pavlic K, Pohar Perme M. Using pseudoobservations for estimation in relative survival. Biostatistics. 2018; 20(3):384–99. https://doi.org/10.1093/biostatistics/kxy008.
 2
Lambert PC, Dickman PW, Rutherford MJ. Comparison of approaches to estimating agestandardized net survival. BMC Med Res Methodol. 2015; 15:64. https://doi.org/10.1186/s1287401500573.
 3
Johnson CJ, Hahn CG, Fink AK, German RR. Variability in cancer death certificate accuracy by characteristics of death certifiers. Am J Forensic Med Pathol. 2012; 33(2):137–42. https://doi.org/10.1097/PAF.0b013e318219877e.
 4
Schaffar R, Rapiti E, Rachet B, Woods L. Accuracy of cause of death data routinely recorded in a populationbased cancer registry: impact on causespecific survival and validation using the Geneva Cancer Registry. BMC Cancer. 2013; 13:609. https://doi.org/10.1186/1471240713609.
 5
Corazziari I, Quinn M, Capocaccia R. Standard cancer patient population for age standardising survival ratios. Eur J Cancer. 2004; 40(15):2307–16. https://doi.org/10.1016/j.ejca.2004.07.002.
 6
Pohar Perme M, Stare J, Estève J. On estimation in relative survival. Biometrics. 2012; 68:113–20. https://doi.org/10.1111/j.15410420.2011.01640.x.
 7
Sasieni P, Brentnall AR. On standardized relative survival. Biometrics. 2016; 73:473–82. https://doi.org/10.1111/biom.12578.
 8
Rutherford MJ, Dickman PW, Coviello E, Lambert PC. Estimation of agestandardized net survival, even when agespecific data are sparse. Cancer Epidemiol. 2020; 67:101745. https://doi.org/10.1016/j.canep.2020.101745.
 9
Nelson CP, Lambert PC, Squire IB, Jones DR. Flexible parametric models for relative survival, with application in coronary heart disease. Stat Med. 2007; 26(30):5486–98. https://doi.org/10.1002/sim.3064.
 10
Dickman PW, Sloggett A, Hills M, Hakulinen T. Regression models for relative survival. Stat Med. 2004; 23(1):51–64. https://doi.org/10.1002/sim.1597.
 11
Estève J, Benhamou E, Croasdale M, Raymond L. Relative survival and the estimation of net survival: elements for further discussion. Stat Med. 1990; 9(5):529–38.
 12
Syriopoulou E, Rutherford MJ, Lambert PC. Marginal measures and causal effects using the relative survival framework. Int J Epidemiol. 2020; 49:619–28. https://doi.org/10.1093/ije/dyz268.
 13
Royston P, Lambert PC. Flexible Parametric Survival Analysis in Stata: Beyond the Cox Model: Stata Press; 2011.
 14
Seppä K, Hakulinen T, Läärä E, Pitkäniemi J. Comparing net survival estimators of cancer patients. Stat Med. 2016; 35(11):1866–79. https://doi.org/10.1002/sim.6833.
 15
Lin DY. On fitting Cox’s proportional hazards models to survey data. Biometrika. 2000; 87:37–47. https://doi.org/10.1093/biomet/87.1.37.
 16
Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999; 446:496–509.
 17
Lambert PC, Wilkes SR, Crowther MJ. Flexible parametric modelling of the causespecific cumulative incidence function. Stat Med. 2017; 36:1429–46. https://doi.org/10.1002/sim.7208.
 18
Rutherford MJ, Crowther MJ, Lambert PC. The use of restricted cubic splines to approximate complex hazard functions in the analysis of timetoevent data: a simulation study. J Stat Comput Simul. 2015; 85:777–93. https://doi.org/10.1080/00949655.2013.845890.
 19
Bower H, Crowther MJ, Rutherford MJ, Andersson TML, Clements M, Liu XR, Dickman PW, Lambert PC. Capturing simple and complex timedependent effects using flexible parametric survival models. Commun Stat  Simul Comput. 2019. https://doi.org/10.1080/03610918.2019.1634201.
 20
Lifetables. Cancer Survival Group UK. London School of Hygeine and Tropical Medicine. https://csg.lshtm.ac.uk/lifetables/. Accessed 21 Dec 2020.
 21
Dickman PW, Coviello E. Estimating and modelling relative survival. Stata J. 2015; 15(1):186–215.
 22
Nelson CP, Lambert PC, Squire IB, Jones DR. Relative survival: what can cardiovascular disease learn from cancer?Eur Heart J. 2008; 29(7):941–7. https://doi.org/10.1093/eurheartj/ehn079.
 23
Bhaskaran K, Hamouda O, Sannes M, Boufassa F, Johnson AM, Lambert PC, Porter K, CASCADE Collaboration. Changes in the risk of death after HIV seroconversion compared with mortality in the general population. JAMA. 2008; 300(1):51–59. https://doi.org/10.1001/jama.300.1.51.
 24
Syriopoulou E, Mozumder SI, Rutherford MJ, Lambert PC. Robustness of individual and marginal modelbased estimates: A sensitivity analysis of flexible parametric models. Cancer Epidemiol. 2018; 58:17–24. https://doi.org/10.1016/j.canep.2018.10.017.
 25
Austin PC. The use of propensity score methods with survival or timetoevent outcomes: reporting measures of effect similar to those used in randomized experiments. Stat Med. 2014; 33:1242–58. https://doi.org/10.1002/sim.5984.
Acknowledgments
Not applicable.
Funding
This work was partly funded by Cancerfonden and Vetenskapsrådet. The funders had no role in the design of the study or collection, analysis, and interpretation of data or writing of the manuscript. Open access funding provided by Karolinska Institute.
Author information
Affiliations
Contributions
PCL, ES, MR conceived the idea for the study. PCL developed and implemented the statistical analysis, which included multiple discussions with ES and MR. PCL drafted the manuscript with input from ES and MR and all authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1
Poisson regression and the Pohar Perme estimator.
Additional file 2
Stata implementation of the models.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Lambert, P.C., Syriopoulou, E. & Rutherford, M.R. Direct modelling of age standardized marginal relative survival through incorporation of timedependent weights. BMC Med Res Methodol 21, 84 (2021). https://doi.org/10.1186/s12874021012661
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12874021012661
Keywords
 Net survival
 Regression standardization
 Relative survival