 Research article
 Open Access
Missing not at random in end of life care studies: multiple imputation and sensitivity analysis on data from the ACTION study
BMC Medical Research Methodology volume 21, Article number: 13 (2021)
Abstract
Background
Missing data are common in end-of-life care studies, but there has been relatively little exploration of which method is best for dealing with them and, in particular, of whether the missing at random (MAR) assumption is valid or missing not at random (MNAR) mechanisms should be assumed. In this paper we investigated this issue through a sensitivity analysis within the ACTION study, a multicenter cluster randomized controlled trial testing advance care planning in patients with advanced lung or colorectal cancer.
Methods
Multiple imputation procedures under MAR and MNAR assumptions were implemented. Possible violation of the MAR assumption was addressed with reference to variables measuring quality of life and symptoms. The MNAR model assumed that patients with worse health were more likely to have missing questionnaires, making a distinction between single missing items, which were assumed to satisfy the MAR assumption, and missing values due to a completely missing questionnaire, for which an MNAR mechanism was hypothesized. We explored the sensitivity to possible departures from MAR of gender differences in key indicators and of simple correlations.
Results
Up to 39% of follow-up data were missing. Results under MAR suggested that missingness was related to poorer health status. Correlations between variables, although very small, changed according to the imputation method, as did the differences in scores by gender, indicating a certain sensitivity of the results to violation of the MAR assumption.
Conclusions
The findings confirmed the importance of undertaking this kind of analysis in end-of-life care studies.
Background
Missing data are common in palliative and end-of-life care studies, where 20–50% of participants withdraw early, mostly because of deterioration and/or death [1]. A systematic review and meta-analysis of randomized controlled palliative intervention trials found about one quarter of primary endpoint data missing [2]. In the literature, several methods have been proposed for dealing with missing data, but there has been relatively little exploration of which is best in the palliative care setting [3]. The appropriateness of the chosen method is strongly related to the nature of the mechanism generating missing data. Missing data can be categorized as: missing completely at random (MCAR), when the probability of an observation being missing depends on neither observed nor unobserved data; missing at random (MAR), when the probability of an observation being missing depends only on the observed data; and missing not at random (MNAR), when the probability of an observation being missing also depends on unobserved data [4].
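The practical difference between the three mechanisms can be illustrated with a small simulation. The following is a minimal sketch on invented data (not ACTION data); the function name `simulate`, the score model and the missingness probabilities are assumptions chosen only for illustration. It compares the mean of the observed values with the true mean under each mechanism.

```python
import random
import statistics

def simulate(n=20000, seed=1):
    """Simulate a score y and an always-observed covariate x, then delete
    values of y under three missingness mechanisms (illustrative only)."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [50 + 10 * xi + rng.gauss(0, 10) for xi in x]  # y is correlated with x

    def observed_mean(p_missing):
        # keep y[i] with probability 1 - p_missing(i)
        kept = [yi for i, yi in enumerate(y) if rng.random() >= p_missing(i)]
        return statistics.fmean(kept)

    return {
        "true": statistics.fmean(y),
        # MCAR: the same probability of missingness for everyone
        "MCAR": observed_mean(lambda i: 0.35),
        # MAR: missingness depends only on the observed covariate x
        "MAR": observed_mean(lambda i: 0.6 if x[i] < 0 else 0.1),
        # MNAR: missingness depends on the (possibly unobserved) value of y
        "MNAR": observed_mean(lambda i: 0.6 if y[i] < 50 else 0.1),
    }

if __name__ == "__main__":
    for name, value in simulate().items():
        print(f"{name:5s} mean of available y: {value:.2f}")
```

In this toy setting the observed mean is close to the true mean only under MCAR. Under MAR the distortion could in principle be corrected using x, whereas under MNAR the observed data alone carry no information about the size of the distortion, which is why a sensitivity analysis is needed.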
Complete-case analysis, which consists of performing the analysis on the subset of subjects with complete information, was the most common approach to treating missing data in randomized clinical trials as recently as 2013 [5, 6]. However, unless the missingness mechanism is MCAR, this may lead to biased results. When the MCAR assumption is not valid, alternative strategies can be adopted to deal with missing data: inverse probability weighting, doubly robust inverse probability weighting, maximum likelihood estimation, and multiple imputation (MI). Among them, MI is widely recognised as the most appropriate one in many fields [7]. MI creates several complete versions of the data by replacing each missing value with more than one plausible value. Each of the resulting complete datasets is then analyzed with standard statistical methods and the results are pooled for final inference using Rubin’s combination rules, to obtain a point estimate and a measure of precision which accounts for uncertainty due to missing information [4, 8]. There are several ways to implement MI under MAR and MNAR [9,10,11,12]. One of these is Multivariate Imputation by Chained Equations (MICE), which relies on the MAR assumption [9, 10] but can be modified to account for MNAR mechanisms [13].
Missing data in end-of-life care studies can arise because one or more questionnaire items are missing, or because all the questionnaire items are missing (missing form). While missing items may be due to simple omissions in completing the questionnaire, reasons for missing a whole questionnaire may relate to a sudden change in the patient’s health status or to the patient’s sensitivity to specific issues which may not be adequately measured by the collected variables. In these cases, the missingness mechanism may be MNAR [14,15,16,17,18]. Although it is known that performing MI under the assumption of MAR when the actual missingness mechanism is MNAR may produce biased estimates [4], this issue is not widely appreciated in dealing with missing data in palliative care studies [3]. Even if it is not possible to distinguish between MNAR and MAR patterns using observed data, the robustness of the MAR assumption can be investigated through sensitivity analyses [4, 14]: if the results obtained under MAR and under specific MNAR assumptions are similar, one can conclude that the presence of unobserved factors does not affect the conclusions.
The aim of this study was to compare the performance of different MI methods within the ACTION (Advance care planning – an innovative palliative care intervention to improve quality of life in oncology) study by treating missing items and missing questionnaires separately [15]. Using a preliminary dataset, we handled the missing data by applying a MICE procedure under the standard assumption of MAR and also under MNAR by using a pattern-mixture model approach [4, 16], which distinguished between missing values due to missing items and those due to missing whole questionnaires. We focused on target analyses to evaluate the sensitivity of the results to the use of the two procedures.
Methods
Data
The ACTION study is a phase III multicenter cluster randomised controlled trial, following the CONSORT guidelines, on the effects of advance care planning (ACP) in patients with advanced lung or colorectal cancer. The ACTION Respecting Choices ACP intervention involves trained healthcare professionals who assist patients and their relatives in reflecting on the patient’s goals, values and beliefs and in discussing their healthcare wishes. The intervention has the potential to improve current and future healthcare decision making, provide patients with a sense of control, and improve quality of life. In total, 22 hospitals in 6 countries were randomised to be intervention or control sites, with up to 1360 patients participating. At the intervention sites, patients are offered interviews with a trained ACP facilitator, whereas in the control sites, patients receive usual care. All participating patients are asked to complete questionnaires at baseline, and again after 2.5 and 4.5 months; the questionnaires assess quality of life, and the extent to which care as received is aligned with their preferences, their evaluation of decision-making processes and quality of end-of-life care (see additional file 2) [15].
Ethical approval was obtained from the Research Ethics Committee (REC) of the coordinating centre (‘Medische Ethische Toetsings Commissie Erasmus MC’), as well as RECs in all participating countries. The trial was registered in the International Clinical Trials Registry Platform (ISRCTN63110516) per 10/3/2014.
Within the ACTION study it was decided to perform a preliminary analysis on a first subsample of the enrolled subjects, with the aim of exploring methods for dealing with missing data. Our analysis was based on the records of 487 patients, representing 36% of the final expected sample, containing information collected through questionnaires at baseline and at 2.5 months, for a total of 121 variables. Baseline data included personal information (gender, age, marital status, living with a partner or alone, living in a private household or in an institution/care facility, children, years of education, religiosity, ethnic group), information on diagnosis (small cell or non-small cell lung cancer, colon cancer, rectal cancer), cancer stage (III or IV lung cancer; metachronous metastases or IV colorectal cancer), WHO performance status (a measure of how well a person with cancer is able to carry on ordinary daily activities), and current treatment (chemotherapy, radiation therapy, immunotherapy, targeted therapy). Using baseline and follow-up questionnaires, we calculated scores for quality of life and symptoms (primary endpoints) and for shared decision-making, satisfaction with care and coping with illness (secondary endpoints) (Table 1). All the scores, except the one measuring patient involvement, which was built ad hoc for the present study, are validated and widely used in the context of psychometrics [17,18,19,20,21,22]. They are continuous variables ranging between 0 and 100. Since an intermediate analysis of the treatment effect was not planned by protocol, this preliminary analysis was blinded with respect to treatment arm and country of the participants, as well as with respect to their survival.
Because few patients were expected to die during the first 2.5 months of follow-up, for the sake of simplicity we assumed in our analysis that all patients were still alive at the completion of the second questionnaire, meaning that no form was missing because of the patient’s death (see Discussion).
Statistical methods
In our analysis we first applied the MICE procedure under the MAR assumption. The procedure was applied to all the variables with missing data, including variables on personal information and on diagnosis and treatments. Then we performed a sensitivity analysis assuming alternative MNAR mechanisms, within a pattern-mixture model approach [4, 16]. MI under MNAR required the specification of additional assumptions on the missingness mechanism and the modification of the MICE algorithm.
According to the main purpose of the ACTION study, the sensitivity analysis focused on relevant outcomes of quality of life and symptoms scores. We estimated means and 90% confidence intervals of these outcomes on the overall sample and by gender, and their correlations with other scores (i.e. shared decision making, satisfaction with care and coping with illness), under the different MI approaches. Differences among MI procedures were evaluated by comparing point estimates and their confidence intervals in a descriptive way. If, under all the MNAR models, the direction of the explored relationships was the same as that under MAR and the confidence intervals largely overlapped, we concluded that the results were robust to violation of the MAR assumption. Because at this stage of the analysis we were not interested in performing statistical tests, but only in describing the results, we preferred to report 90% confidence intervals instead of 95% ones in order to discourage their use as a surrogate for a p-value = 0.05 threshold [23].
Multiple imputation by chained equations under MAR assumption
Let Y = (Y_{1},…,Y_{J}) be the n × J matrix of the data, where Y_{j} = (y_{1j}, y_{2j},…, y_{nj})^{T} is the vector of length n representing the values assumed by the jth variable in the n subjects. Let R = (R_{1},…, R_{J}) be an n × J matrix, with R_{j} = (r_{1j}, r_{2j}, …, r_{nj})^{T} the vector of the missingness indicators for the jth variable (r_{ij} = 0 if y_{ij} is missing and r_{ij} = 1 otherwise). Let us denote the observed entries and the missing entries of Y as Y_{obs} and Y_{mis}, respectively. Analogously, let Y_{j obs} and Y_{j mis} be the observed and the missing entries of the vector Y_{j}. Under the MAR assumption of conditional independence between Y_{mis} and R given the observed data Y_{obs}, ƒ(Y_{mis} | Y_{obs}, R) = ƒ(Y_{mis} | Y_{obs}), the MICE algorithm required the specification of the univariate conditional distributions f(Y_{j} | Y_{-j}, θ_{j}) in the form of regression models for the variables with missing values, where Y_{-j} = (Y_{1}, …, Y_{j-1}, Y_{j+1}, …, Y_{J}) and θ_{j} was a vector of unknown regression parameters [9]. In our analysis, in order to avoid collinearity and related computational problems while still assuring good prediction, a preliminary selection of the predictors to be included in the regressions was performed (in practice, some of the elements of θ_{j} were set to zero). In particular, we regressed each variable Y_{j} with missing entries on each of the other variables, completed by a preliminary imputation drawn from their empirical marginal distributions. Then, we calculated Akaike’s information criterion (AIC) for each regression and selected as predictors for Y_{j} the 15 variables which led to the smallest AIC values [24]. Considering the number of observations and the large percentage of missing values in most of the follow-up scores, 15 was considered an appropriate number of covariates [10].
The marginal regressions used in this preliminary procedure and in the following imputation algorithms varied according to the nature of the outcome variable (linear regression for continuous variables, logistic regression for factor variables with 2 levels, multinomial logistic regression for factor variables with more than 2 levels, and proportional odds models for ordered variables). Finally, for each incomplete variable, the set of the selected predictors was enriched by including the indicators of gender, age, type of cancer and WHO performance status. The selected predictors for each incomplete variable are reported in Figure A1 of the additional file 1.
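The AIC-based screening of predictors described above can be sketched as follows. This is a simplified illustration and not the code used in the study: it assumes continuous, already completed columns and simple linear regressions only, and the names `aic_simple_linear`, `select_predictors` and the toy data are hypothetical.

```python
import math
import random

def aic_simple_linear(x, y):
    """Gaussian AIC (up to an additive constant) of the regression y ~ 1 + x."""
    n = len(y)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    rss = sum((b - my - slope * (a - mx)) ** 2 for a, b in zip(x, y))
    # three estimated parameters: intercept, slope, residual variance
    return n * math.log(rss / n) + 2 * 3

def select_predictors(data, target, k):
    """Regress `target` on each other column in turn and keep the k columns
    whose univariate regressions have the smallest AIC."""
    scores = {name: aic_simple_linear(col, data[target])
              for name, col in data.items() if name != target}
    return sorted(scores, key=scores.get)[:k]

# Toy data: y depends strongly on a, moderately on b, and not at all on c.
_rng = random.Random(0)
a = [_rng.gauss(0, 1) for _ in range(200)]
b = [_rng.gauss(0, 1) for _ in range(200)]
c = [_rng.gauss(0, 1) for _ in range(200)]
y = [2 * ai + bi + _rng.gauss(0, 1) for ai, bi in zip(a, b)]
toy = {"y": y, "a": a, "b": b, "c": c}

if __name__ == "__main__":
    print(select_predictors(toy, "y", k=2))
```

In the analysis itself k was 15 and the regression family depended on the type of the variable being imputed; the sketch only shows the ranking logic.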
Once the conditional distributions f(Y_{j} | Y_{-j}, θ_{j}) were defined and non-informative priors on θ_{j} assumed, the MICE algorithm consisted of the following steps. First, we randomly drew an initial imputation \( \hat{Y}_1,\dots,\hat{Y}_J \) for the missing values in Y_{mis}, by sampling from the empirical marginal distributions of the variables with missing entries (step 0). Then, for the first variable Y_{j} with missing entries:

1. we sampled \( \hat{\theta}_j \) from the posterior distribution \( f\left(\theta_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}\right) \),

2. we drew \( \hat{Y}_{j\,\mathrm{mis}} \) from the posterior predictive distribution \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j\right) \).
We repeated steps 1 and 2 sequentially for each variable with missing entries in the dataset, and we repeated the entire procedure, excluding step 0, until algorithm convergence (100 iterations) [25]. At the end, we obtained a complete dataset. We created several complete versions of the data by repeating the procedure M times, choosing M according to the rule of thumb based on the average percentage rate of missingness [26].
The MICE procedure was implemented by using the mice library of the R software [27, 28].
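To make the structure of the algorithm concrete, the following is a minimal sketch of a chained-equations loop for a toy problem with only two continuous variables, each imputed from the other by Bayesian simple linear regression under a non-informative prior. It is an illustration written in Python, not the R implementation used in the study; the function names (`bayes_linreg_impute`, `chained_equations`, `toy_data`) and the data are assumptions.

```python
import math
import random

def bayes_linreg_impute(x, y, missing, rng):
    """One update of y given x (x holds observed and currently imputed values).
    Step 1: draw the regression parameters from their posterior, using the
            rows where y is observed.
    Step 2: draw the missing y from the posterior predictive distribution."""
    obs = [i for i in range(len(y)) if not missing[i]]
    n = len(obs)
    mx = sum(x[i] for i in obs) / n
    my = sum(y[i] for i in obs) / n
    sxx = sum((x[i] - mx) ** 2 for i in obs)
    slope_hat = sum((x[i] - mx) * (y[i] - my) for i in obs) / sxx
    rss = sum((y[i] - my - slope_hat * (x[i] - mx)) ** 2 for i in obs)
    # sigma^2 | data ~ RSS / chi2(n - 2); chi2(df) is Gamma(df / 2, scale 2)
    sigma = math.sqrt(rss / rng.gammavariate((n - 2) / 2, 2))
    # with x centred, intercept and slope are independent given sigma
    intercept = rng.gauss(my, sigma / math.sqrt(n))
    slope = rng.gauss(slope_hat, sigma / math.sqrt(sxx))
    for i in range(len(y)):
        if missing[i]:
            y[i] = intercept + slope * (x[i] - mx) + rng.gauss(0, sigma)

def chained_equations(columns, n_iter=20, seed=0):
    """Return one completed copy of a two-column data set (None = missing)."""
    rng = random.Random(seed)
    names = list(columns)
    miss = {v: [val is None for val in columns[v]] for v in names}
    data = {}
    for v in names:  # step 0: fill in from the empirical marginal distribution
        observed = [val for val in columns[v] if val is not None]
        data[v] = [val if val is not None else rng.choice(observed)
                   for val in columns[v]]
    for _ in range(n_iter):  # cycle over the incomplete variables
        for v in names:
            if any(miss[v]):
                other = next(u for u in names if u != v)  # two-variable toy case
                bayes_linreg_impute(data[other], data[v], miss[v], rng)
    return data

def toy_data(n=300, seed=1):
    rng = random.Random(seed)
    y1 = [rng.gauss(0, 1) for _ in range(n)]
    y2 = [3 + 2 * a + rng.gauss(0, 1) for a in y1]
    y1 = [None if rng.random() < 0.1 else a for a in y1]
    y2 = [None if rng.random() < 0.3 else b for b in y2]
    return {"y1": y1, "y2": y2}

if __name__ == "__main__":
    # M completed data sets are obtained by repeating the run with new seeds
    imputations = [chained_equations(toy_data(), seed=m) for m in range(5)]
    print(len(imputations), "completed data sets")
```

With more than two variables each regression would include several predictors and a model type suited to the variable being imputed, but the cycle of parameter draw followed by predictive draw is the same.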
Multiple imputation by chained equations under MNAR assumption
Let us suppose that the MAR assumption is not valid. This, in general, implies that the posterior predictive distribution of Y_{j mis} at step 2 of the MICE algorithm depends on R. In order to account for this dependency, according to the pattern-mixture model approach [4, 16], the MICE algorithm can be modified by defining distinct posterior predictive distributions depending on the missing data patterns in R. For example, let us assume that we are in the simple case in which the value of Y_{j mis} depends on the observed data and on the fact that Y_{j} is missing, but not on the fact that other variables are missing. In this case, we would have two distinct posterior predictive distributions, one for R_{j} = 0 and one for R_{j} = 1:
\( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j=0}\right) \) and \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j=1}\right) \),
and step 2 of the MICE algorithm can be modified as follows. After having sampled \( \hat{\theta}_j^{R_j=1} \) from the posterior distribution \( f\left(\theta_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}\right) \), we generate \( \hat{\theta}_j^{R_j=0} \) from the conditional distribution \( f\left(\theta_j^{R_j=0} \mid \hat{Y}_{-j}, \hat{\theta}_j^{R_j=1}\right) \), defined a priori according to an assumed hypothesis on the MNAR mechanism. Finally, we sample \( \hat{Y}_{j\,\mathrm{mis}} \) from the posterior predictive distribution \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j=0}\right) \). More generally, when the value of Y_{j mis} also depends on the fact that other variables are missing, there are in principle distinct predictive distributions for each pattern of missing data [29].
Partition of the missing values and modified MICE in the ACTION study
In our analysis, we explored the violation of the MAR assumption by making a distinction between two types of missing values: items missing because the patient did not reply to some items in a questionnaire, and items missing because the form was completely missing (this could happen for the second questionnaire, but not for the baseline one). A violation of the MAR assumption could be hypothesised for the second type. In particular, we introduced two matrices of missingness indicators, R^{I}, which indicated the missing values of the first type, and R^{F}, which indicated the missing values of the second type. The two matrices described a partition of the missing values; for simplicity we called M^{I} the first element of the partition (the collection of missing data defined by R^{I}) and M^{F} the second (the collection of missing data defined by R^{F}). Then, we assumed a possible MNAR mechanism for the missing values in M^{F}:
with ƒ(Y_{mis} | Y_{obs}, R^{F}) possibly different from ƒ(Y_{mis} | Y_{obs}).
Two distinct posterior predictive distributions were defined for each Y_{j} with missing values in M^{F}, one for R^{F}_{j} = 0 and one for R^{F}_{j} = 1:
\( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=0}\right) \) and \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=1}\right). \)
The described violation of the MAR assumption required a modification of the MICE algorithm that relied on a model for \( f\left(\theta_j^{R_j^F=0} \mid \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=1}\right) \). The modified MICE algorithm that we implemented for our analysis was the following. After having randomly drawn an initial imputation \( \hat{Y}_1,\dots,\hat{Y}_J \) for the missing values by sampling from the empirical marginal distributions of the variables with missing entries (step 0), we imputed the missing values for the first variable Y_{j} with missing entries, according to the following steps:

1. we sampled \( \hat{\theta}_j \) from the posterior distribution \( f\left(\theta_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}\right) \),

2. we drew \( \hat{Y}_{j\,\mathrm{mis}} \) from the posterior predictive distribution \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j\right) \),

3. if Y_{j} had missing entries in M^{F}, we generated \( \hat{\theta}_j^{R_j^F=0} \) from the conditional distribution \( f\left(\theta_j^{R_j^F=0} \mid \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=1}\right) \), defined a priori according to our hypothesis on the MNAR mechanism (see below), and we sampled \( \hat{Y}_{j\,\mathrm{mis}} \) from the posterior predictive distribution \( f\left(Y_j \mid Y_{j\,\mathrm{obs}}, \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=0}\right) \).
We repeated steps 1 to 3 for all variables with missing entries sequentially, until algorithm convergence (100 iterations). We repeated the whole procedure M times in order to create M complete versions of the data set. In running the algorithm, we ordered the variables according to the number of missing values (from the variable with the fewest missing entries to the one with the most).
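The distinction between the two kinds of missing entries can be sketched as follows for a single variable. This is a deliberately simplified illustration, not the study code: it replaces the posterior draw of step 1 with plug-in least-squares estimates, it represents the MNAR parameters as the MAR parameters with a shifted intercept (an assumption), and the names `fit_ols`, `impute_variable` and the toy values are hypothetical.

```python
import random

def fit_ols(x, y):
    """Least-squares intercept, slope and residual standard deviation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    intercept = my - slope * mx
    rss = sum((b - intercept - slope * a) ** 2 for a, b in zip(x, y))
    return intercept, slope, (rss / (n - 2)) ** 0.5

def impute_variable(y, x, status, shift, seed=0):
    """Complete y given a fully available predictor x.
    status[i] is "obs" (observed), "item" (single missing item, treated as MAR)
    or "form" (whole questionnaire missing, treated as possibly MNAR)."""
    rng = random.Random(seed)
    obs = [i for i, s in enumerate(status) if s == "obs"]
    # simplification: plug-in estimates instead of a posterior draw (step 1)
    intercept, slope, sigma = fit_ols([x[i] for i in obs], [y[i] for i in obs])
    completed = list(y)
    for i, s in enumerate(status):
        if s == "obs":
            continue
        draw = intercept + slope * x[i] + rng.gauss(0, sigma)  # MAR draw (step 2)
        if s == "form":
            draw += shift  # step 3: predictive distribution with shifted intercept
        completed[i] = draw
    return completed

# Toy example: a score where higher is better, so the shift is negative.
x = [0, 1, 2, 3, 4, 5, 6, 7]
y = [70.0, 66.0, 61.0, 58.0, None, None, 45.0, None]
status = ["obs", "obs", "obs", "obs", "item", "form", "obs", "form"]

if __name__ == "__main__":
    print(impute_variable(y, x, status, shift=0.0))   # MAR for every entry
    print(impute_variable(y, x, status, shift=-5.0))  # MNAR for missing forms
```

With the same seed, the two calls differ only in the entries flagged as missing forms, which are moved by exactly the chosen shift, while single missing items are imputed identically under both assumptions.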
Assumptions on the MNAR mechanism
We assumed that the MAR assumption was violated with reference to the six primary endpoints expressing the patient’s health status, as measured by the second questionnaire: the follow-up scores for pain (PA), dyspnoea (DY), emotional functioning (EF), physical functioning (PF), fatigue (FA) and quality of life (QOL). Under the general assumption that reasons for missing the second questionnaire as a whole may relate to sudden changes in the patient’s health status or to sensitivity to specific issues not adequately measured by the other variables [4, 14, 30,31,32,33], patients with a poorer health status were those for whom we expected a larger probability of a missing form. Our definition of the MNAR mechanisms reflected these hypotheses.
Even if in principle the model for \( f\left(\theta_j^{R_j^F=0} \mid \hat{Y}_{-j}, \hat{\theta}_j^{R_j^F=1}\right) \) could be highly complex, we specified two simple alternative models [13, 34, 35]. The first model assumed a constant shift k in the expected value of each of the six variables between observed and missing observations belonging to M^{F}. We considered four different shifts k_{1}, …, k_{4}, defined as the equispaced values between 0 and ½ of the interquartile range of the variable. The second model allowed the shift k to vary among individuals according to their WHO performance status. Let Y_{WHO} be the variable expressing the WHO performance status, and sd(Y_{j obs}) the standard deviation of one of the six primary endpoints, calculated on the observed values; the shift k was defined as k = Y_{WHO} × sd(Y_{j obs})/4. The shift was always assumed to be in the direction of worsening the patient’s health conditions (subtracted for QOL, PF, EF and added for the other scores).
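The two shift models can be written down in a few lines. The sketch below rests on stated assumptions: the four constant shifts are taken as k_i = i × (IQR/2)/4, so that zero (the MAR case) is excluded and the largest shift equals half the interquartile range; sd(·) is taken to be the sample standard deviation; and the quartiles use the default method of Python's `statistics.quantiles`. The text does not specify these details, and the function names are hypothetical.

```python
import statistics

def constant_shifts(observed, n_shifts=4):
    """First model: equispaced shifts up to half the interquartile range.
    Assumption: k_i = i * (IQR / 2) / n_shifts for i = 1, ..., n_shifts."""
    q1, _, q3 = statistics.quantiles(observed, n=4)
    half_iqr = (q3 - q1) / 2
    return [half_iqr * i / n_shifts for i in range(1, n_shifts + 1)]

def who_shift(who_status, observed):
    """Second model: individual shift k = Y_WHO * sd(Y_j obs) / 4."""
    return who_status * statistics.stdev(observed) / 4

def signed_shift(shift, higher_is_better):
    """Direct the shift toward worse health: subtract it for functioning and
    quality-of-life scores (QOL, PF, EF), add it for symptom scores (PA, DY, FA)."""
    return -shift if higher_is_better else shift

if __name__ == "__main__":
    qol_observed = [25.0, 33.3, 41.7, 50.0, 58.3, 66.7, 75.0, 83.3]  # invented values
    print([round(k, 2) for k in constant_shifts(qol_observed)])
    print(round(signed_shift(who_shift(2, qol_observed), higher_is_better=True), 2))
```

Under the second model a patient with WHO performance status 0 receives no shift, so the departure from MAR grows with the baseline performance status.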
Analysis of data
We implemented the algorithm for the MNAR model by modifying the existing mice function of the mice R library. We used the built-in passive imputation method in order to incorporate at each step of the imputation algorithm the transformations required by the MNAR assumptions [27, 28, 34].
At the end of the MI procedure (MICE or modified MICE), each of the M complete datasets was analysed applying standard statistical methods and the results were combined according to Rubin’s rules [4]. Indicating with Q the unknown parameter of interest, for example the average quality of life score, let \( \hat{Q}_m \) be the point estimate of Q and \( \hat{U}_m \) the estimate of its variance, arising from the analysis on the mth dataset (m = 1, …, M). The combined estimate of Q was equal to \( \overline{Q}=\sum_{m=1}^{M}\frac{\hat{Q}_m}{M} \) and its estimated variance was T = U + (1 + 1/M) B, where \( U=\sum_{m=1}^{M}\frac{\hat{U}_m}{M} \) and \( B=\frac{\sum_{m=1}^{M}\left(\hat{Q}_m-\overline{Q}\right)^2}{M-1}. \)
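The combination rules translate directly into code. The following minimal sketch implements the formulas for the pooled estimate and its total variance as given above; the function name `pool_rubin` and the example numbers are illustrative assumptions.

```python
def pool_rubin(estimates, variances):
    """Combine M point estimates and their variances with Rubin's rules.
    Returns the pooled estimate and its total variance T = U + (1 + 1/M) * B."""
    m = len(estimates)
    q_bar = sum(estimates) / m                               # pooled point estimate
    u_bar = sum(variances) / m                               # within-imputation variance U
    b = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)   # between-imputation variance B
    total = u_bar + (1 + 1 / m) * b
    return q_bar, total

if __name__ == "__main__":
    # invented estimates of a mean score from M = 3 completed data sets
    q_bar, total = pool_rubin([61.0, 62.0, 63.0], [1.0, 1.0, 1.0])
    print(q_bar, total, total ** 0.5)  # estimate, variance, standard error
```

The between-imputation term B is what carries the extra uncertainty due to the missing values: if all completed data sets gave the same estimate, T would reduce to the average within-imputation variance U.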
Results
In Tables 2 and 3, we report descriptive statistics and percentage of missing values for sociodemographic, diagnosis and treatment variables measured at baseline, and for baseline and follow-up scores of quality of life and symptoms, shared decision-making, satisfaction with care and coping with illness. Between 1 and 15% of sociodemographic data, clinical data and baseline scores were missing. Missing values at follow-up were around 36–39%, mostly due to missing forms (170/487; 35%).
In the additional file 1, we reported the missing data pattern (figure A1), as well as the selected predictors for each univariate conditional regression used in the imputation procedures (figure A2).
Table 4 shows means and 90% confidence intervals of quality of life scores and symptoms at follow-up, calculated from the observed data and after MI under MAR and MNAR assumptions setting M = 40, which approximately corresponded to the maximum percentage rate of missingness [26]. When compared with the values calculated on the observed data, the results under MAR suggested that missingness was related to poor health status and lower quality of life, with the means after MICE moving in the direction of worse health, i.e. decreasing for QOL, PF, EF and increasing for PA, DY and FA. Assuming MNAR mechanisms, the discrepancy between imputed values and observed values was markedly greater.
Table 5 shows the mean scores by gender, calculated on the observed data, under the MAR model and under the MNAR model that assumed a degree of departure from MAR dependent on the WHO performance status. Under both imputation approaches, there was a clear difference between genders, with males reporting a better health state than females. The size of the difference between genders increased under the MNAR model.
All correlations of secondary endpoints (patient involvement, overall quality of care, active coping, denial) with primary endpoints related to symptoms (FA, PA, DY) were negative, whereas those with primary endpoints related to functioning (QOL, PF, EF) were positive, although very low. As an example, in Fig. 1 we show the correlation coefficients between QOL and the four selected secondary endpoints. Overall quality of care showed the highest correlations with primary endpoints. Although in general correlations after MICE were robust to violations of the MAR assumption, sometimes a certain discrepancy was observed between the different imputation methods (see additional file 1, Figures A3–A7). In particular, under the MNAR scenarios all correlations involving denial were weaker than under MAR. A similar behaviour was also observed for patient involvement.
Discussion
Reviews of randomized controlled trials indicate wide variation in how missing data are dealt with. Relatively few studies use MI (8%), and, when carried out, it is mainly under the MAR assumption; few studies assume MNAR mechanisms [5, 6, 36]. As usual in end-of-life care studies, the proportion of missing values in the subset of the ACTION study data analysed in the present paper was high, particularly in relation to quality of life and symptoms outcomes. The percentage of missing forms at follow-up was around 37%, thus higher than the 23% reported in a recent meta-analysis of randomized controlled trials on palliative interventions [2].
Although the availability of many variables should make the MAR assumption reasonable, quality of life is a multidimensional and complex concept and it is difficult to exclude the presence of relevant unmeasured factors, particularly in end of life care studies where reasons for missing values may relate to the patient’s health status or their sensitivity to specific issues [14, 30,31,32,33]. Because the violation of the MAR assumption can significantly affect final results, particularly when the proportion of missing values is high, we performed a sensitivity analysis to evaluate if the results obtained assuming MAR were robust to the presence of MNAR. In particular, we focused on means and correlations between primary and secondary outcomes or between primary outcomes and individual characteristics (gender).
Performing MI under both MAR and MNAR assumptions produced different estimates of the average score of the primary outcomes. The results under MAR suggested that missingness was related to poor health status and lower quality of life. This is consistent with the common-sense idea that more critical patients could have greater problems in filling in questionnaires. As expected, assuming MNAR mechanisms which explicitly assigned higher probabilities of a missing form to those patients who experienced worse health status, the discrepancy between imputed values and observed values became markedly greater. Males showed better health than females. After imputation, the difference became even more marked, particularly under the MNAR models. The marginal correlations between primary and secondary endpoints were consistently negative for symptoms and positive for functioning and quality of life. As an example, quality of life was positively associated with patient involvement, overall quality of care, active coping and denial. In contrast, pain was negatively correlated with the same variables. In agreement with clinical expectations, overall quality of care had the strongest correlations with the primary outcomes. The correlations of the primary outcomes with denial appeared to be weaker under the MNAR scenarios than when assuming MAR. These findings suggest that the provisional indicators which we focused on in the present analysis were not always robust to violation of the MAR assumption.
Our approach has two distinctive features. First, we integrated the MNAR model within the MICE algorithm as implemented in the mice function through the built-in passive imputation method [34], so that imputation error was correctly propagated [37]. This is different from following a two-step procedure that first imputes each outcome under the MAR assumption, and then modifies the imputed values according to a specific model for MNAR [13, 31, 33, 38, 39]. Second, our algorithm accounts for the different nature of the missing values in the data set: missing entries satisfying the MAR assumption (15.3% of the total number of missing values) and missing entries, due to a missing form, for which an MNAR mechanism could be hypothesised (84.7%).
In the literature, several algorithms have been proposed that modify MICE to account for violation of the MAR assumption [13, 29, 34, 35, 38]. Recently, Tompsett and colleagues [29] proposed an approach, called Not-At-Random Fully Conditional Specification (NARFCS), which generalizes the MICE algorithm to MNAR by including in each univariate conditional regression model the missingness indicator of the variable to be imputed (the coefficient of which cannot be estimated from the data and is the object of the sensitivity analysis), as well as the missingness indicators of the other incomplete variables. In this way, it is possible to allow for the presence of correlations between each variable and the missingness indicators of the others. Unlike NARFCS, our approach assumed that these correlations were equal to zero, because it included a missing form indicator only in the univariate conditional models used for the imputation of quality of life and symptoms scores (i.e. the variables affected by MNAR). However, in a sensitivity analysis, we included the missing form indicator in each univariate conditional distribution, thus allowing for the possible correlation between having a missing form and all the variables in the data set. The results of this sensitivity analysis were very similar to those reported in the paper (see additional file 1, Tables A1–A2).
Recently, imputation algorithms based on nonlinear models, such as classification and regression trees or random forests, have become widespread. These approaches make it possible to deal with complex interactions and nonlinearities in the prediction models. Additionally, they do not require preselection of predictors and can also be used when the number of covariates is larger than the number of observations [40, 41]. However, the performance of these methods is not well established. It depends on the presence and relevance of possible interaction effects and on the correlation structure of the data, and it could be quite poor when data are highly skewed [42,43,44]. It would be interesting to investigate the robustness to departures from the MAR assumption of multiple imputation approaches based on recursive partitioning. However, to the best of our knowledge, there are no studies dealing with such algorithms under the MNAR assumption, and extending the pattern-mixture model to this context deserves ad hoc investigation.
Limitations
We fixed the number of imputations to 40 following a rule proposed by White and colleagues [26]. Since in this preliminary analysis we focused on different quantities, all surrogate in respect to the primary endpoint of the ACTION study, we did not perform a detailed investigation in order to determine an optimal number of imputations, which in principle should be based on the Fraction of Missing Information evaluated for the parameter of interest [26] or on criteria aimed to assure results stability over repetitions of the MI procedure [45].
The present results could be sensitive to the specification of the prediction models. Models with a larger number of predictors, with predictors selected by different methods (e.g. LASSO), or including interactions could prove more or less robust to violation of the MAR assumption, leading to different conclusions. This point will be addressed when the mixed-pattern approach proposed in this paper is applied to the complete ACTION data set. As one of the main limitations of the present study, we did not account for the cluster randomized design of the study, because information on treatment assignment and cluster variables (country and hospital) was blinded to us [46, 47]. Similarly, patients' survival at follow-up was not available, so we had no information about possible truncation by death. At this stage of the study, truncation by death was likely a minor problem, since the analyses were carried out with a follow-up of 2.5 months from recruitment and most of the enrolled patients had a good WHO performance status at baseline (the median survival for both small and non-small cell lung cancer in both stage III and IV is estimated to be around 7–8 months [48, 49], and 2.6 and 1.7 years for colorectal cancer patients with and without metastasectomy, respectively [50]). Truncation by death is, however, a very relevant issue that should be addressed in the future in order to obtain reliable estimates of the treatment effect [51,52,53,54]. Moreover, MI on the complete ACTION dataset will be performed separately by treatment arm [46].
Conclusions
When imputing missing data in end-of-life care studies, sensitivity analyses for departures from MAR should be performed. We proposed a modification of the MICE algorithm which accounts for the presence in the data set of two kinds of missing data: missing entries satisfying the MAR assumption, and missing entries due to a missing form, for which an MNAR mechanism could be hypothesised. We found that the results obtained after imputing the missing values through MICE were not always robust to possible violations of the MAR assumption.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
ACP: Advance care planning
AIC: Akaike’s information criterion
DY: Dyspnoea
EF: Emotional functioning
FA: Fatigue
MAR: Missing at random
MCAR: Missing completely at random
MI: Multiple imputation
MICE: Multivariate Imputation by Chained Equations
MNAR: Missing not at random
NARFCS: Not-At-Random Fully Conditional Specification
PA: Pain
PF: Physical functioning
QOL: Quality of life
REC: Research Ethics Committee
References
Palmer JL. Analysis of missing data in palliative care studies. J Pain Symptom Manag. 2004;28:612–8.
Hussain JA, White IR, Langan D, Johnson MJ, Currow DC, Torgerson DJ, et al. Missing data in randomized controlled trials testing palliative interventions pose a significant risk of bias and loss of power: a systematic review and meta-analyses. J Clin Epidemiol. 2015;74:57–65.
Hussain JA, Bland M, Langan D, Johnson MJ, Currow DC, White IR. Quality of missing data reporting and handling in palliative care trials demonstrates that further development of the CONSORT missing data reporting guidance is required: a systematic review. J Clin Epidemiol. 2017;88:81–91.
Rubin DB. Multiple imputation for nonresponse in surveys. New York: Wiley; 1987.
Sullivan TR, Yelland LN, Lee KJ, Ryan P, Salter AB. Treatment of missing data in follow-up studies of randomised controlled trials: a systematic review of the literature. Clin Trials. 2017;14:387–95. https://doi.org/10.1177/1740774517703319.
Bell ML, Fiero M, Horton NJ, Hsu CH. Handling missing data in RCTs; a review of the top medical journals. BMC Med Res Methodol. 2014;14:118.
Sterne JA, White IR, Carlin JB, Spratt M, Royston P, Kenward MG, Wood AM, Carpenter JR. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338:b2393. https://doi.org/10.1136/bmj.b2393.
Little RJA, Rubin DB. Statistical analysis with missing data. New Jersey: Wiley; 2002.
Raghunathan TE, Lepkowski JM, van Hoewyk J, Solenberger P. A multivariate technique for multiply imputing missing values using a sequence of regression models. Surv Methodol. 2001;27:85–95.
Van Buuren S. Flexible imputation of missing data. Boca Raton: Chapman & Hall; 2018.
Jones M, Mishra GD, Dobson A. Analytical results in longitudinal studies depended on target of inference and assumed mechanism of attrition. J Clin Epidemiol. 2015;68:1165–75.
Hu B, Li L, Greene T. Joint multiple imputation for longitudinal outcomes and clinical events that truncate longitudinal follow-up. Stat Med. 2016;35:2991–3006.
Moreno-Betancur M, Chavance M. Sensitivity analysis of incomplete longitudinal data departing from the missing at random assumption: methodology and application in a clinical trial with dropouts. Stat Methods Med Res. 2016;25:1471–89.
Preston NJ, Fayers P, Walters SJ, Pilling M, Grande GE, Short V, et al. Recommendations for managing missing data, attrition and response shift in palliative and end-of-life care research: part of the MORECare research method guidance on statistical issues. Palliat Med. 2013;27:899–907.
Post WJ, Buijs C, Stolk RP, de Vries EG, le Cessie S. The analysis of longitudinal quality of life measures with informative dropout: a pattern mixture approach. Qual Life Res. 2010;19:137–48.
Fielding S, Fayers PM, McDonald A, McPherson G, Campbell MK, RECORD Study Group. Simple imputation methods were inadequate for missing not at random (MNAR) quality of life data. Health Qual Life Outcomes. 2008;6:57. https://doi.org/10.1186/1477-7525-6-57.
Pauler DK, McCoy S, Moinpour C. Pattern mixture models for longitudinal quality of life studies in advanced stage disease. Stat Med. 2003;22:795–809.
Diehr P, Johnson LL. Accounting for missing data in end-of-life research. J Palliat Med. 2005;8(Suppl 1):S50–7.
Rietjens JA, Korfage IJ, Dunleavy L, Preston NJ, Jabbarian LJ, Christensen CA, et al. Advance care planning - a multicentre cluster randomised clinical trial: the research protocol of the ACTION study. BMC Cancer. 2016;8(16):264.
Ratitch B, O'Kelly M, Tosiello R. Missing data in clinical trials: from clinical assumptions to statistical analysis using pattern mixture models. Pharm Stat. 2013;12:337–47.
Groenvold M, Petersen MA, Aaronson NK, Arraras JI, Blazeby JM, Bottomley A, Fayers PM, de Graeff A, Hammerlid E, Kaasa S, Sprangers MA, Bjorner JB, EORTC Quality of Life Group. The development of the EORTC QLQ-C15-PAL: a shortened questionnaire for cancer patients in palliative care. Eur J Cancer. 2006;42:55–64.
Fayers PM, Aaronson NK, Bjordal K, Groenvold M, Curran D, Bottomley A, on behalf of the EORTC Quality of Life Group. EORTC QLQ-C30 scoring manual. 3rd ed. Brussels: EORTC; 2001.
Arora NK, Weaver KE, Clayman ML, Oakley-Girvan I, Potosky AL. Physicians’ decision-making style and psychosocial outcomes among cancer survivors. Patient Educ Couns. 2009;77:404–12. https://doi.org/10.1016/j.pec.2009.10.004.
Brédart A, Bottomley A, Blazeby JM, Conroy T, Coens C, D'Haese S, et al. An international prospective study of the EORTC cancer in-patient satisfaction with care measure (EORTC IN-PATSAT32). Eur J Cancer. 2005;41:2120–31.
Carver CS, Scheier MF, Weintraub JK. Assessing coping strategies: a theoretically based approach. J Pers Soc Psychol. 1989;56:267–83.
Stanton AL, Kirk SB, Cameron CL, Danoff-Burg S. Coping through emotional approach: scale construction and validation. J Pers Soc Psychol. 2000;78:1150–69.
Greenland S, Senn SJ, Rothman KJ, Carlin JB, Poole C, Goodman SN, Altman DG. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016;31:337–50.
Li F, Baccini M, Mealli F, Zell ER, Frangakis CE, Rubin DB. Multiple imputation by ordered monotone blocks with application to the anthrax vaccine research program. J Comput Graphical Stat. 2014;23:877–92.
Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences. Stat Sci. 1992;7:457–511.
White IR, Royston P, Wood AM. Multiple imputation using chained equations: issues and guidance for practice. Stat Med. 2011;30:377–99.
Van Buuren S, Groothuis-Oudshoorn K. mice: multivariate imputation by chained equations in R. J Stat Softw. 2011;45:1–67.
R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2013.
Tompsett DM, Leacy F, Moreno-Betancur M, Heron J, White IR. On the use of the not-at-random fully conditional specification (NARFCS) procedure in practice. Stat Med. 2018;37:2338–53.
Resseguier N, Giorgi R, Paoletti X. Sensitivity analysis when data are missing not-at-random. Epidemiology. 2011;22:282.
Leacy FP, Floyd S, Yates TA, White IR. Analyses of sensitivity to the missing-at-random assumption using multiple imputation with delta adjustment: application to a tuberculosis/HIV prevalence survey with incomplete HIV-status data. Am J Epidemiol. 2017;185:304–15.
Keene ON, Roger JH, Hartley BF, Kenward MG. Missing data sensitivity analysis for recurrent event data using controlled imputation. Pharmaceut Stat. 2014;13:258–64. https://doi.org/10.1002/pst.1624.
Little RJA. Selection and pattern-mixture models. In: Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G, editors. Longitudinal data analysis, chapter 18. Boca Raton: CRC Press; 2009. p. 409–31.
Van Buuren S, Boshuizen HC, Knook DL. Multiple imputation of missing blood pressure covariates in survival analysis. Stat Med. 1999;18:681–94.
Carpenter JR, Kenward MG. Multiple imputation and its application. 1st ed. Chichester: Wiley; 2013.
Burgette LF, Reiter JP. Multiple imputation for missing data via sequential regression trees. Am J Epidemiol. 2010;172:1070–6.
Tang F, Ishwaran H. Random forest missing data algorithms. Stat Anal Data Min. 2017;10:363–77.
Doove LL, Van Buuren S, Dusseldorp E. Recursive partitioning for missing data imputation in the presence of interaction effects. Comput Stat Data Anal. 2014;72:92–104.
Hong S, Lynn HS. Accuracy of random-forest-based imputation of missing data in the presence of non-normality, non-linearity, and interaction. BMC Med Res Methodol. 2020;20:199.
Shah AD, Bartlett JW, Carpenter J, Nicholas O, Hemingway H. Comparison of random forest and parametric imputation models for imputing missing data using MICE: a CALIBER study. Am J Epidemiol. 2014;179:764–74.
Lu K. Number of imputations needed to stabilize estimated treatment difference in longitudinal data analysis. Stat Methods Med Res. 2017;26:674–90.
Sullivan TR, White IR, Salter AB, Ryan P, Lee KJ. Should multiple imputation be the method of choice for handling missing data in randomized trials? Stat Methods Med Res. 2018:962280216683570.
Speidel M, Drechsler J, Jolani S. R package hmi: a convenient tool for hierarchical multiple imputation and beyond. IAB-Discussion Paper. 2018. 16/2018.
Wang S, Tang J, Sun T, Zheng X, Li J, Sun H, Zhou X, Zhou C, Zhang H, Cheng Z, Ma H, Sun H. Survival changes in patients with small cell lung cancer and disparities between different sexes, socioeconomic statuses and ages. Sci Rep. 2017;7:1339. https://doi.org/10.1038/s41598-017-01571-0.
Simmons CP, Koinis F, Fallon MT, Fearon KC, Bowden J, Solheim TS, Gronberg BH, McMillan DC, Gioulbasanis I, Laird BJ. Prognosis in advanced lung cancer - a prospective study examining key clinicopathological factors. Lung Cancer. 2015;88:304–9. https://doi.org/10.1016/j.lungcan.2015.03.020.
Siebenhüner AR, Güller U, Warschkow R. Population-based SEER analysis of survival in colorectal cancer patients with or without resection of lung and liver metastases. BMC Cancer. 2020;20:246. https://doi.org/10.1186/s12885-020-6710-1.
Yang F, Small DS. Using post-outcome measurement information in censoring-by-death problems. J Royal Stat Soc Series B. 2016;78:299–318.
Tchetgen Tchetgen EJ. Identification and estimation of survivor average causal effects. Stat Med. 2014;33:3601–780.
Wen L, Terrera GM, Seaman SR. Methods for handling longitudinal outcome processes truncated by dropout and death. Biostatistics. 2018;19:407–25.
Yang F, Ding P. Using survival information in truncation by death problems without the monotonicity assumption. Biometrics. 2018. https://doi.org/10.1111/biom.12883.
Acknowledgements
We thank all participating patients and relatives, facilitators, trainers, hospital staff and the Advisory Board for their valuable contribution to this project. We thank Bud Hammes and Linda Briggs for their advice throughout the project. Membership of the ACTION consortium: Agnes van der Heide, Ida J. Korfage, Judith A.C. Rietjens, Lea J. Jabbarian, Suzanne Polinder, Hans van Delden, Marijke Kars, Marieke Zwakman, Luc Deliens, Mariëtte N. Verkissen, Kim Eecloo, Kristof Faes, Kristian Pollock, Jane Seymour, Glenys Caswell, Andrew Wilcock, Louise Bramley, Sheila Payne, Nancy Preston, Lesley Dunleavy, Eleanor Sowerby, Guido Miccinesi, Francesco Bulli, Francesca Ingravallo, Giulia Carreras, Alessandro Toccafondi, Giuseppe Gorini, Urška Lunder, Branka Červ, Anja Simonič, Alenka Mimić, Hana Kodba Čeh, Polona Ozbič, Mogens Groenvold, Caroline Arnfeldt, Anna Thit Johnsen.
Funding
This publication is based on the ACTION project, conducted by a collaboration of research teams from Belgium, Denmark, Italy, the Netherlands, Slovenia, and the UK.
The project is funded by the 7th Framework Programme for Research and Technological Development (FP7) (Proposal No. 602541–2). The funder was not involved in any aspects of study design, data analysis, interpretation of results or drafting the paper.
Author information
Contributions
GC conceived the study, built the scores, developed the statistical models, analyzed the data and wrote the first version of the manuscript. GM built the scores, interpreted the data and contributed in writing the manuscript. MB conceived the study, developed the statistical models and wrote the first version of the manuscript. AW, NP, DN, LD, MG, UL, AH contributed in writing the manuscript. The authors read and approved the final manuscript.
Ethics declarations
Ethics approval and consent to participate
Ethical approval for the ACTION project was obtained from the Research Ethics Committee (REC) of the coordinating centre (‘Medische Ethische Toetsings Commissie Erasmus MC’), as well as RECs in all participating countries. Written informed consent to participate was obtained from all participants to the study.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1. Appendix with supplementary material.
Additional file 2. ACTION study patient questionnaire.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Carreras, G., Miccinesi, G., Wilcock, A. et al. Missing not at random in end of life care studies: multiple imputation and sensitivity analysis on data from the ACTION study. BMC Med Res Methodol 21, 13 (2021). https://doi.org/10.1186/s12874-020-01180-y
DOI: https://doi.org/10.1186/s12874-020-01180-y
Keywords
 Missing data
 MAR
 MNAR
 Advance care planning
 Oncology
 Quality of life