The ability of different imputation methods for missing values in mental measurement questionnaires

Xu, Xueying; Xia, Leizhen; Zhang, Qimeng; Wu, Shaoning; Wu, Mingcheng; Liu, Hongbo

doi:10.1186/s12874-020-00932-0

Research article
Open access
Published: 27 February 2020

The ability of different imputation methods for missing values in mental measurement questionnaires

Xueying Xu¹,
Leizhen Xia¹,
Qimeng Zhang¹,
Shaoning Wu¹,
Mingcheng Wu¹ &
…
Hongbo Liu¹

BMC Medical Research Methodology volume 20, Article number: 42 (2020) Cite this article

6535 Accesses
22 Citations
Metrics details

Abstract

Background

Incomplete data are of particular important influence in mental measurement questionnaires. Most experts, however, mostly focus on clinical trials and cohort studies and generally pay less attention to this deficiency. We aim is to compare the accuracy of four common methods for handling items missing from different psychology questionnaires according to the items non-response rates.

Method

All data were drawn from the previous studies including the self-acceptance scale (SAQ), the activities of daily living scale (ADL) and self-esteem scale (RSES). SAQ and ADL dataset, simulation group, were used to compare and assess the ability of four imputation methods which are direct deletion, mode imputation, Hot-deck (HD) imputation and multiple imputation (MI) by absolute deviation, the root mean square error and average relative error in missing proportions of 5, 10, 15 and 20%. RSES dataset, validation group, was used to test the application of imputation methods. All analyses were finished by SAS 9.4.

Results

The biases obtained by MI are the smallest under various missing proportions. HD imputation approach performed the lowest absolute deviation of standard deviation values. But they got the similar results and the performances of them are obviously better than direct deletion and mode imputation. In a real world situation, the respondents’ average score in complete data set was 28.22 ± 4.63, which are not much different from imputed datasets. The direction of the influence of the five factors on self-esteem was consistent, although there were some differences in the size and range of OR values in logistic regression model.

Conclusion

MI shows the best performance while it demands slightly more data analytic capacity and skills of programming. And HD could be considered to impute missing values in psychological investigation when MI cannot be performed due to limited circumstances.

Peer Review reports

Background

Mental health is mostly measured by one or more psychometric scales that commonly include a set of functional items. It is inevitable to lead to missing items in psychological measurement, in the process of completing scales, due to the existence of many factors [1]. Any missing item will result in the loss of the whole scale score because items are collected to calculate subscale scores and total scale scores. Therefore, incomplete data are of particular important influence in mental measurement questionnaires (psychologic instruments). While researchers generally pay less attention to this deficiency. Many studies fail to mention the existence of missing and the methods of handing it [2, 3]. Others merely alluded to the default method or typically discard of samples with missing value to obtain a complete dataset [4]. These methods, known as listwise deletion or complete case analysis, reduce even more sample size available for analysis [5]. It results in reducing the statistical power (by reduced sample size) and causing greater selection biases if observations with missing values are only excluded from the analysis [6]. In recent years, many statistical methods have been proposed to impute missing values. The most common method consists in imputing a missing value by the average response of the other items or mean imputation in total scores of scale. Such a method is clearly recommended in scoring manuals of widely used quality of life questionnaires such as SF-36 and QLQ-C 30 [7, 8] though it might be inadequate especially when the rate of missing data is high [9,10,11].

Incomplete items of psychometric scales are a commonly encountered scenario in cross-sectional observational studies. Most experts, however, mostly focus on clinical trials and cohort studies or longitudinal studies during more than 20 years of research on missing values. There are few studies on the missing values in psychological tests in cross-sectional studies. Although a few articles cover this topic, they are limited to one aspect. For example, Shrive et al. has compared imputation techniques only based on the Zung depression scale, only considered one simulation (one distribution of the missing data) for each scenario and used a dichotomous criterion (“diseased” or “not diseased” according to the score) for comparisons. It did not make comparisons between the scale scores [11]. These results were hardly to reasonably extending to other questionnaires constructed according to classical test theory.

Therefore, based on the data from three cross-sectional surveys, we explored the actual effects of different missing rates from different populations and different scales considering the real-world evidence from the practical application point. Our aim to compare the accuracy and precision of four common methods for handling items missing from different psychology questionnaires according to the types of missingness and the item non-response rates, in order to provide advice to scientific researchers on the choice of appropriate imputation methods in their future work.

Method

Data sources

Considering the wider population and applicability of the scale, three common scales were selected from different age groups as the simulated imputation datasets, which are the self-acceptance scale (SAQ) of college students, the activities of daily living scale (ADL) of elderly people and self-esteem scale (RSES) of middle school students [12,13,14]. In enrolled dataset, samples with missing values are typically discarded to obtain a complete dataset. The SAQ dataset included 742 individuals with complete age, gender, the characteristics of parents, etc. The ADL dataset included 1242 elders with age, gender, the characteristics of daily living, etc. There are 3513 middle school students in RSES dataset also with complete age, gender, the characteristics of parents, etc. SAQ and ADL dataset, simulation group, were used to compare and assess the ability of different imputation methods. RSES dataset, validation group, was used to test the application of imputation methods. All datasets were complete on all required variables.

Simulation of missing data

The mechanism of the missingness is important when imputing missing values. Missing item scores can be categorized into three types by Little and Rubin: when the missing data is independent to the actual or potential study variables, the losses are thought to be missing completely at random (MCAR) [15]. If the missingness due to issues related to the biological, psychological, social and/or cultural diversity of subjects, or depends on known or observed covariates, the non-issuance of the response is due to random causes (MAR). And the item nonresponse is classified as missing not at random (MNAR) if the probability of an item being missing depends on the true answer [16, 17]. In real world data, there is no way to verify that the data is MAR or MNAR though MCAR can be confirmed by Litter’s MCAR. Therefore, it is difficult to determine the missing data mechanism. Most scholars suppose that the missing of questionnaire data is at MAR, which use the relationships with other variables. In addition, most current imputation methods assume MAR in order to avoiding biased results. So the explanatory variables were assigned to missing under a MAR missing data mechanism in this study.

Imputation methods

Four imputation methods are considered in this study. Among them, (1) the direct deletion method is to delete all subjects with missing values and conduct statistical analysis based on a complete dataset. It is the most common and simplest approach which was used in statistical software. (2) Mode imputation is one of the most naive and easiest methods for imputing missing values for categorical variables. The mode of the non-missing values of each variable was used to impute the missing values. (3) Hot-deck (HD) imputation refers to selecting the corresponding variable value of the observation most “similar” to the missing observation as the filling value of the missing observation. Generally, it is divided into two methods: sequential hot platform filling method and random hot platform filling method [18]. The most “similar” observation in sequential hot platform filling is selected in some order in the filling class. Random hot platform filling is randomly selected from the filling class. This research selects random Hot-deck imputation. (4) Multiple imputation (MI), which aims to produce a range of values that “approximate” the missing response [19]. MI uses a set of external covariates to generate a range of plausible values for each missing value (based on correlations between the covariates and the item to be replaced). The algorithm works by iteratively imputing the missing values based on the fitted conditional models until a stopping criterion is satisfied.

Performance evaluation of imputation algorithms

Comparison of different imputation methods is performed as follows:

(1)
Absolute deviation. It is the absolute value of the difference of results between two data points of complete dataset and imputation dataset.
(2)
The root mean square error (RMSE) [20].
$$ RMSE=\sqrt{\frac{\sum \limits_{i=1}^n{\left({y}_{ij}-{y}_{i0}\right)}^2}{n},} $$

where n = the number of simulated imputation in each missingness proportion,

y_ij = statistics of ith imputation using imputation method j in each missingness proportion,

y_i0 = statistics of ith imputation using complete dataset in each missingness proportion.

Higher RMSE indicates larger differences between datasets imputed with the test methods. A narrower range of RMSE values indicates more stability in imputation method. Likewise, a wider range of RMSE values for each combination indicates less stability and therefore reliability in imputation method.

(3)
Average relative error.
$$ \mathrm{average}\ \mathrm{relative}\ \mathrm{error}=\frac{\sum \limits_{i=1}^n\left(\frac{y_{i0}-{y}_{ij}}{y_{i0}}\right)}{n}, $$

where n = the number of simulated imputation in each missingness proportion,

y_ij = statistics of ith imputation using imputation method j in each missingness proportion,

y_i0 = statistics of ith imputation using complete dataset in each missingness proportion.

The vertical axis plots the percentage relative error for continuous variables and percentage misclassification error for categorical variables, while the horizontal axis groups the results according to the proportion of missing values. Each boxplot represents the error measure over 50 random replications.

Statistical analysis

SAQ and ADL dataset were used as simulation groups. The missing rates in all datasets were set at 5, 10, 15, and 20% under a MAR missing data mechanism, respectively. And We repeated 50 times to simulate a MAR missing data and fill the missing values at each missing rate by four imputation methods before absolute deviation and RMSE of mean, standard deviation, correlation coefficient were calculated. If the results of all methods were similar, average relative error of these statistics will be continued to calculate or they will be computed except those imputation methods with less effective than others obviously in order to determine the preferred methods. RSES dataset, validation group, was analyzed the performance of the extensionality in a supposed real world situation by simulation different nonresponse rates one time. All analyses were finished by SAS 9.4.

Result

Results in simulated situations

The results of the comparisons of imputation methods for the analysis the absolute deviation of mean, standard deviation and correlation coefficient are shown in Table 1. The mean biases obtained by direct deletion are the biggest (0.583, 1.080, 1.453, and 1.586 in SAQ) and mode imputation is the most unstable under various missing proportions. MI has the best result, while the HD is not much different from it. HD imputation performed the lowest absolute deviation of standard deviation for each condition. And the performances of them are obviously better than direct deletion and mode imputation, especially in the high percentage of missing values of both scales.

Table 1 The absolute deviation for four imputation methods

Full size table

As shown in Fig. 1, the RMSE of mean values tended to increase with higher missingness rates under the same imputation approach. The RMSEs of mean and standard deviation in direct deletion and mode imputation of both two scales are higher obviously than other methods. In addition, except for the direct deletion technique, the differences between correlation coefficient calculated with imputed values and those “original values” were very small for other three methods in all the simulation scenarios. Next, direct deletion will be excluded in calculating the average relative error considering the less effective obviously in absolute deviation and RMSE of almost all statistics.

In ADL, the average relative error of mean and standard deviation by mode imputation are obviously greater than HD and MI methods. While in SAQ, the different of mean between three methods are not significant, but the average relative error of standard deviation by mode imputation is also bigger. The average relative error of mean in HD and average relative error of standard deviation in MI are slightly larger but not significant in both two scales. For correlation coefficient, the results of three imputation methods are similar (Fig. 2). In the end, we chose HD and MI to the next step because of their small deviation and stability.

Results in real world situation

This portion evaluated the effect of HD and MI on the self-esteem scale in a real world situation. The complete data set included 3513 students from seventh grade to twentieth grade in the study using RSES. The average age was 16.3 years old and 1810 (51.5%) interviewees were male. 54.3% of adolescents were first-borns or only child (n = 1906) and the majority of them are from countryside (n = 3361). The parents of most students worked outside of their hometown for more than 5 years (father 56.0%, mother 52.2%).

The average scores of RSES among middle school students in complete dataset, HD and MI under various missing proportions were showed in Fig. 3. The respondents’ average score in complete data set was 28.22 ± 4.63, which are not much different from imputed datasets even at the missingness proportion of 20%. The biggest deviation of score is 0.04 in HD imputation dataset of missingness rate of 10%, which is inconsistent with the previous result that the higher missing rates are the more errors get. It reveals that using HD and MI methods to interpolate missing values of scales has a good extension.

Based on the general analysis rules and methods of cross-sectional studies and mental health surveys, we further conduct descriptive study and hypothesis testing on the different characteristics of the RSES data. Compared with complete dataset, the two imputation methods also got the semblable result at different missing proportions (Table 2). The differences in all results are no more than 0.2 and the biggest deviation of score is 0.2 in HD imputation dataset of missingness rate of 20%. Overall, the average score of MI is more accurate than the result of HD imputation. However, we found that the variability of MI is significantly increased and the range of difference of standard deviation from the complete data is 0.06–3.44, which is far greater than the result of HD (0.01–0.29). Although the increase in variability did not affect the conclusion of comparison between different features in this study, this bias should be avoided as much as possible in actual work, which is due to the systematic error caused by MI method.

Table 2 The result of differences test of complete and impute data for RSES

Full size table

Logistic regression model was adopted to explore the relationship between variables in most investigations [21, 22]. In order to better describe and verify the possible bias of the analysis of the imputed data, we also used logistic regression to analyse the relationship between self-esteem and other variables. Self-esteem is divided into two categories (RSES score < 30 or RSES score ≥ 30) and gender, grade, academic record, residence and communication with relatives taking care of you as the independent variable. The logistic regression model was constructed in the complete dataset and the imputed datasets, respectively. The analysis results are shown in Table 3. The direction of the influence of the five factors on self-esteem was consistent, although there were some differences in the range of OR values between the data sets of original and imputation. The OR value of MI dataset is more similar to complete dataset than HD, but the largest difference of OR value is only 0.03 in the missing rate of 5% of the HD dataset.

Table 3 Influence on self-esteem in complete and impute data by multiple logistic regression models

Full size table

Discussion

In this study, an extensive simulation study was performed in three psychology scales to compare the performance of four well-known methods of missing value imputation for missing data at random. To answer the question of whether to impute and to reflect the wide applicability of imputation methods in psychological research, we offer important points of the percentage of missing values and different scales that are mostly used to different populations. As expected, the imputation error of all statistics increases as the proportion of missing values increases in all approaches though the variation tends to reduce slightly, which is due to averaging over much more missing observations. Because the questionnaire survey is different from other studies of which results are independent, missing items would affect subscale and total scores. Even if there are not many missing values for independent items, it will lead to a larger proportion of respondents missing since the missing values of the items are distributed among different participants. There are, for example, only 564 individuals with the complete total scale scores in 1001 samples, when the percentage of missing items set to 20% in the dataset. Therefore, we recommend effective impute missing values to reduce the bias of missing on the questionnaire result.

In addition, the choice of imputation method does seem to drastically affect our results no matter in which psychological questionnaire. Both direct deletion and mode-imputation methods are simple to implement and have low computational requirements, but the performances of them are not satisfactory compared with other methods regardless of the different missing proportions or scales. This finding is in line with the conclusion from the literatures [23]. If the missingness is not related with the outcome, a direct deletion analysis will provide unbiased estimates [24]. Due to the particularity of the questionnaire, it may result in biased calculations in describing the distribution of test results or inferring relevant factors. It may, however, be reasonable to use mode-imputation analyses when computing correlation coefficient because the risk of bias is low and the precision could be bracketed with HD and MI. Therefore, we used both HD and MI to verify the accuracy and extensionality of the imputation method comparing with complete dataset when assumed real-world conditions.

The process of routine survey analysis was simulated in validation group. In general, the results of the MI and HD datasets are analogous to those of the original data. MI appeared to produce minimally biased estimates in describing the average and correlation analysis, but it can easily cause systematic errors in describing tendency of dispersion. The results of HD imputation are more stable no matter average level, hypothesis testing, or correlation analysis, which is similar to the results of the two questionnaires in the simulation dataset [25].

MI is a relatively sophisticated imputation method. It is mainly performed by the Markov Chain Monte Carlo (MCMC) procedure and requires a preliminary evaluation of missing data. The use of MI method demands more data analytic capacity and skills of programming. However, for a psychologist or clinician, it is difficult to implement it. Additionally, the biggest problem is that multiple data sets are generated in MI method, after that the statistics obtained by multiple imputation are the integration of many group statistics. So the display of some results and the use of analytical methods have significant limitations. MI may not display all the information of regular statistical analyses. For example, only the regression coefficients of each variable can be integrated and statistical inference can be given when multiple linear regression is performed with multiple imputation. While, there is no evaluation of the stability of the regression equation. And some statistical analysis methods, such as survival analysis, are currently difficult to multiple impute directly using software. Therefore, in practical work, MI imputation has limitations. In our study, the results of HD imputation are also similar to those of the complete dataset, especially in the variability of the data, which is more consistent than MI. It is not easy to produce false negative results due to less increased variability after data imputation by HD. In addition, the basic principle of the HD method is simple and a dataset is convenient to do statistical analyses after imputation. In this case, we recommend HD to fill the missing data because of its ease of implementation and the results can be accepted compared with MI in the psychological questionnaire.

Conclusion

We would be wise to think about the missing-data problem in psychological scales prior to making decisions. MI shows the best performance while it demands slightly more data analytic capacity and skills of programming. And HD could be considered to impute missing values in psychological investigation when MI cannot be performed due to limited circumstances.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ADL:: Activities of daily living scale
HD:: Hot-deck imputation
MAR:: Missing at random
MCAR:: Missing completely at random
MI:: Multiple imputation
MNAR:: Missing not at random
RMSE:: The root mean square error
RSES:: Self-esteem scale
SAQ:: Self-acceptance scale

References

Yanagida T, Kubinger KD, Rasch D. Planning a study for testing the Rasch model given missing values due to the use of test-booklets. J Appl Meas. 2015;16(4):432–42.
PubMed Google Scholar
Baba S, Katsumata Y, Okamoto Y, Kawaguchi Y, Hanaoka M, Kawasumi H, et al. Reliability of the SF-36 in Japanese patients with systemic lupus erythematosus and its associations with disease activity and damage: a two-consecutive year prospective study. Lupus. 2018;27(3):407–16. https://doi.org/10.1177/0961203317725586.
Article CAS PubMed Google Scholar
Janani K, Jain M, Vargese J, Srinivasan V, Harika K, Michael T, et al. Health-related quality of life in liver cirrhosis patients using SF-36 and CLDQ questionnaires. Clin Exp Hepatol. 2018;4(4):232–9. https://doi.org/10.5114/ceh.2018.80124.
Article CAS PubMed PubMed Central Google Scholar
Erez G, Selman L, Murtagh FE. Measuring health-related quality of life in patients with conservatively managed stage 5 chronic kidney disease: limitations of the medical outcomes study short form 36: SF-36. Qual Life Res. 2016;25(11):2799–809. https://doi.org/10.1007/s11136-016-1313-7.
Article PubMed PubMed Central Google Scholar
Bunevicius A. Reliability and validity of the SF-36 health survey questionnaire in patients with brain tumors: a cross-sectional study. Health Qual Life Outcomes. 2017;15(1):92. https://doi.org/10.1186/s12955-017-0665-1.
Article PubMed PubMed Central Google Scholar
Peyre H, Leplege A, Coste J. Missing data methods for dealing with missing items in quality of life questionnaires. A comparison by simulation of personal mean score, full information maximum likelihood, multiple imputation, and hot deck techniques applied to the SF-36 in the French 2003 decennial health survey. Qual Life Res. 2011;20(2):287–300. https://doi.org/10.1007/s11136-010-9740-3.
Article PubMed Google Scholar
Aaronson NK, Ahmedzai S, Bergman B, Bullinger M, Cull A, Duez NJ, et al. The European Organization for Research and Treatment of Cancer QLQ-C30: a quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst. 1993;85(5):365–76.
Article CAS PubMed Google Scholar
Ware JE Jr, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30(6):473–83.
Article PubMed Google Scholar
Fielding S, Fayers PM, McDonald A, McPherson G, Campbell MK. Simple imputation methods were inadequate for missing not at random (MNAR) quality of life data. Health Qual Life Outcomes. 2008;6:57. https://doi.org/10.1186/1477-7525-6-57.
Article PubMed PubMed Central Google Scholar
Molenberghs G, Thijs H, Jansen I, Beunckens C, Kenward MG, Mallinckrodt C, et al. Analyzing incomplete longitudinal clinical trial data. Biostatistics. 2004;5(3):445–64. https://doi.org/10.1093/biostatistics/5.3.445.
Article PubMed Google Scholar
Shrive FM, Stuart H, Quan H, Ghali WA. Dealing with missing data in a multi-question depression scale: a comparison of imputation methods. BMC Med Res Methodol. 2006;6:57. https://doi.org/10.1186/1471-2288-6-57.
Article PubMed PubMed Central Google Scholar
Springer KW, Hauser RM. An assessment of the construct validity of RyV’s Scales of psychological well-being: method, mode, and measurement effects. Soc Sci Res. 2006;35:1080–102.
Article Google Scholar
Lawton MP, Brody EM. Assessment of older people: self-maintaining and instrumental activities of daily living. Gerontologist. 1969;9(3):179–86.
Article CAS PubMed Google Scholar
Rosenberg M. Society and the adolescent self-image. Princeton: Princeton University Press; 1965.
Book Google Scholar
Rubin DB. Inference and missing data. Biometrika. 1976;63:581–92.
Article Google Scholar
Little R, Rubin DB. Statistical analysis with missing data. Hoboken: Wiley; 2014.
Google Scholar
Gelman A, Hill J. Data analysis using regression and multilevel/hierarchical models. Cambridge: Cambridge University Press; 2006.
Book Google Scholar
Sande I. Hot deck imputation procedures, incomplete data in samples surveys. New York: Academic Press; 1983.
Google Scholar
Rubin D. Multiple imputation for nonreponse in surveys. New York: Wiley; 1987.
Book Google Scholar
To KT, Fry RC, Reif DM. Characterizing the effects of missing data and evaluating imputation methods for chemical prioritization applications using ToxPi. BioData Min. 2018;11:10. https://doi.org/10.1186/s13040-018-0169-5.
Article PubMed PubMed Central Google Scholar
Cheng SH, Shih CC, Lee IH, Hou YW, Chen KC, Chen KT, et al. A study on the sleep quality of incoming university students. Psychiatry Res. 2012;197(3):270–4. https://doi.org/10.1016/j.psychres.2011.08.011.
Article PubMed Google Scholar
Stoltzfus JC. Logistic regression: a brief primer. Acad Emerg Med. 2011;18(10):1099–104. https://doi.org/10.1111/j.1553-2712.2011.01185.x.
Article PubMed Google Scholar
Read SH, Lewis SC, Halbesma N, Wild SH. Measuring the association between body mass index and all-cause mortality in the presence of missing data: analyses from the Scottish National Diabetes Register. Am J Epidemiol. 2017;185(8):641–9. https://doi.org/10.1093/aje/kww162.
Article PubMed Google Scholar
van Buuren S. Flexible imputation of missing data. Boca Raton: Taylor & Francis: CRC Press; 2012.
Book Google Scholar
Donders AR, van der Heijden GJ, Stijnen T, Moons KG. Review: a gentle introduction to imputation of missing values. J Clin Epidemiol. 2006;59(10):1087–91. https://doi.org/10.1016/j.jclinepi.2006.01.014.
Article PubMed Google Scholar

Download references

Acknowledgments

Not applicable.

Funding

This research was supported in part by Social Sciences Foundation of Liaoning Province, Grant No. L18ATJ001, in China, awarded to Hongbo Liu. The funders had no role in the study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author information

Authors and Affiliations

Public Health of School, China Medical University, No.77 Puhe Road, 110122, Shenyang, People’s Republic of China
Xueying Xu, Leizhen Xia, Qimeng Zhang, Shaoning Wu, Mingcheng Wu & Hongbo Liu

Authors

Xueying Xu
View author publications
You can also search for this author in PubMed Google Scholar
Leizhen Xia
View author publications
You can also search for this author in PubMed Google Scholar
Qimeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shaoning Wu
View author publications
You can also search for this author in PubMed Google Scholar
Mingcheng Wu
View author publications
You can also search for this author in PubMed Google Scholar
Hongbo Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XYX performed data analysis and drafted the manuscript. HBL conceived and supervised the study and provided reviews of the text. LZX runs software and simulates imputation data. QMZ and SNW involved in the study design, SNW and MCW contribute to data collection procedure and interpretation of data. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hongbo Liu.

Ethics declarations

Ethics approval and consent to participate

All procedures in our study were approved by the Ethical Committee Group of China Medical University (CMU6206–1004). Written informed consent was obtained from each participant, each school and nursing homes.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing of interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Xu, X., Xia, L., Zhang, Q. et al. The ability of different imputation methods for missing values in mental measurement questionnaires. BMC Med Res Methodol 20, 42 (2020). https://doi.org/10.1186/s12874-020-00932-0

Download citation

Received: 16 June 2019
Accepted: 19 February 2020
Published: 27 February 2020
DOI: https://doi.org/10.1186/s12874-020-00932-0

The ability of different imputation methods for missing values in mental measurement questionnaires