The methodological quality of systematic reviews regarding the Core Outcome Set (COS) development

Cao, Hong; Chen, Yan; Yang, Zhihao; Lan, Junjie; Sum-wing Kwong, Joey; Zhang, Rui; Zhao, Huaye; Hu, Linfang; Wang, Jiaxue; Sun, Shuimei; Tan, Songsong; Cao, Jinyong; He, Rui; Zheng, Wenyi; Zhang, Jiaxing

doi:10.1186/s12874-024-02182-w

Research
Open access
Published: 11 March 2024

The methodological quality of systematic reviews regarding the Core Outcome Set (COS) development

Hong Cao^1,2^na1,
Yan Chen^1,2^na1,
Zhihao Yang³,
Junjie Lan²,
Joey Sum-wing Kwong⁴,
Rui Zhang²,
Huaye Zhao²,
Linfang Hu²,
Jiaxue Wang²,
Shuimei Sun²,
Songsong Tan⁵,
Jinyong Cao⁶,
Rui He⁷,
Wenyi Zheng⁷ &
…
Jiaxing Zhang²

BMC Medical Research Methodology volume 24, Article number: 65 (2024) Cite this article

578 Accesses
1 Altmetric
Metrics details

Abstract

Background

The Core Outcome Measures in Effectiveness Trials (COMET) working group proposed core outcome sets (COS) to address the heterogeneity in outcome measures in clinical studies. According to the recommendations of COMET, performing systematic reviews (SRs) usually was the first step for COS development. However, the SRs that serve as a basis for COS are not specifically appraised by organizations such as COMET regarding their quality. Here, we investigated the status of SRs related to development of COS and evaluated their methodological quality.

Methods

We conducted a search on PubMed to identify SRs related to COS development published from inception to May 2022. We qualitatively summarized the disease included in SR topics, and the studies included in the SRs. We evaluated the methodological quality of the SRs using AMSTAR 2.0 and compared the overall quality of SRs with and without protocols using the Mann-Whitney U test.

Results

We included 175 SRs from 23 different countries or regions, and they mainly focused on five diseases: musculoskeletal system or connective tissue disease (n = 19, 10.86%), injury, poisoning, or certain other consequences of external causes (n = 18, 10.29%), digestive system disease (n = 16, 9.14%), nervous system disease (n = 15, 8.57%), and genitourinary system disease (n = 15, 8.57%). Although 88.00% of SRs included randomized controlled trials (RCTs), only a few SRs (23.38%) employed appropriate tools to assess the risk of bias in RCTs. The assessment results on the basis of AMSTAR 2.0 indicated that most SRs (93.71%) were rated as ‘’critically low’’ to ‘’low’’ in terms of overall confidence. The overall confidence of SRs with protocols was significantly higher than that without protocols (P <.001). Compared to the SRs with protocols on Core Outcome Measures in Effectiveness Trials (COMET), SRs with protocols on PROSPERO were of better overall confidence (P = .017).

Conclusion

The overall quality of published SRs regarding COS development was poor. Our findings emphasize the need for researchers to carefully select the disease topic and strictly adhere to the requirements of optimal methodology when conducting a SR for the establishment of a COS.

Peer Review reports

Background

Heterogeneity in outcome measures in clinical studies is widely recognized as an obstacle to evidence synthesis [1,2,3]. In a previous study [4], a survey was performed on 82 randomized controlled trial (RCT) protocols investigating treatment modalities for COVID-19. The study concluded that the significant heterogeneity of primary outcomes and lack of critical outcomes across these COVID-19 studies may lead to a waste of research resources. To address this issue, Jin et al. [5] developed a Core Outcome Set (COS) for different subtypes of laboratory-confirmed COVID-19 cases on the basis of two rounds of Delphi survey and one consensus meeting.

A COS refers to a minimum set of indicators that must be reported in clinical studies in specific health fields. It includes industry-recognized clinical outcomes, outcome indicators, their measurement methods and measurement time points [6]. The establishment and execution of COS can improve the value of clinical research, help researchers to identify reporting bias [1, 7, 8], and reduce the waste of research resource [9], thus facilitating evidence curation and clinical decision-making [10]. The Core Outcome Measures in Effectiveness Trials (COMET) working group which aims to develop, apply, disseminate, and update COS was founded in 2010 by internationally recognized professionals in evidence-based medicine. The COMET organization has developed a free, open-access, searchable platform for knowledge sharing and scholarly exchange. It offers methodological guidance and reporting standards for COS studies. The methods advocated by the COMET handbook [11] for COS production include systematic reviews (SRs), eDelphi, consensus meetings, semi-structured interviews, focus groups, nominal group method, among others.

SR, the best evidence for medical decision-making, is recommended by COMET as the initial step for COS development due to its capabilities of comprehensive search of literature, rigorous evaluation of evidence, and efficient curation of outcomes. The quality of SR has a direct impact on the final COS. Rogozińska et al. [12] evaluated the methods of 93 SRs (90 full reports and 3 summaries) published in the COMET database using A Measurement Tool to Assess Systematic Reviews (AMSTAR) 2.0 (items 1–9), the results of which suggest that future studies should ensure that the methods used to generate the different outcomes and outcome domains are reported transparently. Nevertheless, this study did not provide a “reliability” rating of SRs in accordance with AMSTAR 2.0. Recently, there has been an increase in the number of SRs regarding COS published on PubMed. Therefore, we conducted this overview to investigate the research status of SRs that serve as a basis for COS development and assess the quality of these SRs using complete AMSTAR 2.0.

Methods

Eligibility criteria

We included any study if it was a SR that constructed an outcome pool for the COS development. We excluded SR protocols, SRs without full text or detailed information, and SRs published in languages other than English or Chinese.

Literature search

We conducted a search on PubMed from its earliest records to May 30, 2022 to identify published SRs related to the development of COS. We used Medical Subject Headings and free text terms associated with “core outcome set” or “COS” to search for relevant studies.

Study selection and data extraction

Two investigators (H. C and Y. C), who were trained in research methods and had experiences in SRs related to COS, independently screened the titles and abstracts of the SRs for inclusion and subsequently reviewed the full texts of the selected studies against pre-defined inclusion and exclusion criteria. They also independently extracted the following data from included SRs: (1) general information, such as the first author, year of publication, country, registry number, disease population, interventions, outcomes, and their definitions; and (2) methodological information, such as database searched, tools used for assessing the quality of evidence, types of included studies, the method used to develop the COS, and so on. Any disagreement in the process of study selection and data extraction was resolved by discussion.

Quality assessment

Two investigators (H. C and Y. C) appraised the methodological quality of each included SR independently using the AMSTAR 2.0 [13], and any discrepancies were resolved through discussion. AMSTAR 2.0 comprises 16 items, with Item 2, 4, 7, 9, 11, 13, and 15 being critical items that significantly impact the review’s validity and conclusion. The project’s assessment is either “Yes” or “no” for Item 1, 3, 5, 6, 10, 13, 14, and 16; “Yes”, “Partial Yes” or “No” for Item 2, 4, 7, 8, and 9; or “Yes”, “No”, or “No meta-analysis” for Item 11, 12, and 15. There are four levels of quality for a SR: “high”, “moderate”, “low”, and “critically low” in terms of “no or one non-critical weakness”, “more than one non-critical weakness”, “one critical flaw with or without non-critical weakness”, and “more than one critical flaw with or without non-critical weakness”, respectively.

Statistical analysis

We provided a qualitative summary of the disease categories covered in the SR topics, the study design types included in the SRs, pathways for COS development, and the methodological quality of the SRs. Categorical variables were presented as frequency and percentage, while continuous variables were described using mean (and standard deviation) or median (and interquartile range (IQR)). We used the methodological quality of the SRs as an ordinal variable, categorized as “high”, “moderate”, “low”, or “critically low”. We employed the Mann-Whitney U test to compare the methodological quality of SRs with and without protocols and to evaluate the impact of different registered protocols (PROSPERO [14] vs. COMET) on the methodological quality of SRs. R software (version 3.6.3) was applied for data analysis. P ≤ .05 was considered statistically significant, and all tests were two-sided.

To assess the agreement between the two reviewers in study selection and quality assessment, we calculated the kappa statistic (K). The value of K ranges from 0 to 1, where values of 0 to 0.20, 0.21 to 0.39, 0.40 to 0.59, 0.60 to 0.79, 0.80 to 0.90, and above 0.90 represent no agreement, minimal agreement, weak agreement, moderate agreement, strong agreement, and almost perfect agreement, respectively [15].

Results

Search results and description of included SRs

A total of 755 studies were initially retrieved from PubMed. Following the selection process (see Figs. 1), 175 SRs were included in the final assessment. The level of agreement between the two reviewers for study selection was acceptable (K = 0.89). As presented in Table S1, the included SRs were published between 2006 and 2022, with a marked increase in quantity after 2018. The authors of the SRs were from 23 countries or regions, with 42.86% of authors hailing from the United Kingdom.

Based on the International Classification of Diseases (ICD)-11 [16] (Fig. 2), the topics of the 175 included SRs were focused on 20 different types of diseases. The top five diseases with the highest number of literature were musculoskeletal system or connective tissue disease (n = 19, 10.86%), injury, poisoning, or certain other consequences of external causes (n = 18, 10.29%), digestive system disease (n = 16, 9.14%), nervous system disease (n = 15, 8.57%), and genitourinary system disease (n = 15, 8.57%).

Figure 3 shows that out of the 175 SRs included in the final assessment, 98 SRs (56.00%) included only one type of study. Among these, randomized controlled trials (RCTs) were the most common type (n = 77, 44.00%), followed by observational studies (n = 15, 8.57%), non-randomized controlled trials (n = 1, 0.57%), and qualitative studies (n = 5, 2.86%). Additionally, 162 SRs (92.57%) only included primary research, such as RCTs, quasi-randomized controlled trials, non-RCTs, cohort studies, case-control studies, cross-sectional studies, and qualitative research. Thirteen SRs (7.43%) included both primary research and SR. While 154 SRs (88.00%) selected RCTs as the included studies, only 52 out of 154 (33.12%) SRs assessed the methodological quality of the included RCTs. Among these, only 36 out of 154 (23.38%) SRs used appropriate tools to assess the risk of bias in RCTs.

A total of 42 SRs reported 7 different methodological pathways for COS development (Table S2 and Fig. 4). Among these, most studies (16 out of 42, 38.10%) selected a research approach consisting of a systematic review, semi-structured interview, Delphi survey, and consensus meeting for COS development. Delphi survey and consensus meeting were the most common methods used for COS development, in addition to systematic reviews.

Methodological quality of included SRs

The results of methodological quality assessment of included SRs were shown in accordance with AMSTAR 2.0 [13] (see Fig. 5). The level of agreement between the two reviewers for quality assessment was acceptable (K = 0.81 ~ 0.93 for all items and overall confidence). Since none of the SRs performed meta-analysis, Item 11, Item 12, and Item 15 were not applicable in the final assessment. All SRs provided the Population, Intervention, Comparator group, and Outcome (PICO) and performed study selection and data extraction in duplicate, hence they were rated as “yes” in Item 1, Item 5, and Item 6, respectively. Regarding Item 16, most SRs (166 out of 175, 94.86%) were rated “yes” since there were no competing interests in these studies.

Despite 93 SRs (53.14%) having written protocols, only 90 SRs were rated “yes” in Item 2 because 3 of them did not register or publish the protocols before the studies were initiated. Regarding Item 3, only 49 SRs (28.00%) explained the reason for including RCTs or other observational studies. 155 SRs searched at least 2 databases, and the search strategy was provided along with the justification for publication restrictions. However, out of the 155 SRs, only 2 SRs searched the reference lists or bibliographies of included studies, trial or study registries, and grey literature. As a result, only these 2 SRs were rated “yes” for Item 4 on AMSTAR 2.0, while the remaining 153 were rated “partially yes”.

In terms of Item 7, 133 SRs (76.00%) provided a list of excluded studies and the main reasons and were rated “partial yes”, while 23 SRs (13.14%) explained the exclusion reason for each study and were rated “yes”. Nearly all SRs (172 out of 175, 98.28%) described the PICO and research designs of included studies, but only 104 of them provided the details about the above information and were assessed as “yes” in Item 8. Forty-three SRs (24.57%) used appropriate tools (e.g., Cochrane Risk-of-Bias tool (RoB) 1.0 for randomized trials [17], JADAD scale [18], a criteria list for quality assessment of RCTs [19], Critical Appraisal Skills Program (CASP) checklist [20], The Evidence Project risk of bias tool [21], Oxford Centre for Evidence-Based Medicine Levels of Evidence [22], etc.) to assess the risk of bias in individual studies included in the SRs, hence they were rated “yes” in Item 9.

In Item 10, 150 SRs (85.71%) did not report on the sources of funding for individual studies included in the review and were evaluated as “no”. Due to the discussion about the impact of the risk of bias on the results, 23 SRs (13.14%) were rated “yes” in Item 13. In the presence of heterogeneity, the review authors of 134 SRs (76.57%) investigated sources of any heterogeneity in the results and discussed the impact of this on the results of the review, hence these SRs were rated “yes” in Item 14.

In summary, only 6 SRs (3.43%) were evaluated as “high” in terms of overall confidence according to the AMSTAR 2.0, while most SRs (164 out of 175, 93.71%) were “critically low” to “low”. Table 1 indicates that the overall confidence of SRs with protocols was significantly higher than those without protocols (P < .001). Among the SRs with protocols, 67 were registered on the PROSPERO (an international database of prospectively registered systematic reviews) and 21 on the COMET database. SRs with protocols on the PROSPERO had a higher overall confidence compared to those with protocols on the COMET (P = .017).

Table 1 Overall confidence of systematic reviews with or without protocols

Full size table

Discussion

Main findings and interpretations

In this study, we investigated 175 SRs from 23 countries or regions regarding the development of COS. We found that the research hotspots for COS in SRs were musculoskeletal system or connective tissue disease, injury, poisoning, or certain other consequences of external causes, digestive system disease, nervous system disease, and genitourinary system disease, accounting for nearly half of all the SRs. While RCTs were the most common study design included in most SRs, only a few studies (23.38%) used appropriate tools to assess their risk of bias. In terms of COS development, most studies used SRs, Delphi surveys, and consensus meetings as components of the research pathway. According to the data from COMET website, 75 out of 175 SRs have developed their COS, while the COS for 22 SRs are still under development. Consistent with AMSTAR 2.0, the overall confidence of most SRs (93.71%) was evaluated as ‘’critically low’’ to ‘’low’’. Nevertheless, registering a protocol on a specialized database such as PROSPERO before conducting an SR could help improve the overall quality of the SRs.

The Global Burden of Disease (GBD) 2019 Diseases and Injuries Collaborators [23] conducted an analysis of 369 diseases and injuries across 204 countries and territories. They reported that the top five diseases with the highest disability-adjusted life-years (DALYs) were neonatal disorders, ischemic heart disease, stroke, lower respiratory infections, and diarrheal diseases. However, these diseases were not the focus of the included SRs. Notably, no SR was found regarding the establishment of COS for ischemic heart disease and diarrheal diseases. Additionally, only a few SRs were related to the construction of COS for neonatal disorders (n = 3, 1.71%), stroke (n = 1, 0.57%), and lower respiratory infections (n = 2, 1.14%). Therefore, we suggest that researchers should pay more attention to these diseases with high DALYs and focus on establishing their COS by conducting SRs.

Although assessing the risk of bias in included studies is considered the most critical step in SRs, only one-fourth of the SRs that included RCTs employed appropriate tools to assess the risk of bias in RCTs. The most common tools used in the SRs were the JADAD scale and RoB 1.0 for randomized trials. The JADAD scale, a valid tool to initially assess the quality of an RCT, has the main advantage of requiring a short amount of time to complete. However, in view of its broad assessment without key information on potential confounding factors affecting the validity of findings (such as allocation concealment, industry sponsorship, and conflict of interest), the accuracy and precision of JADAD are inferior to other more exhaustive updated instruments. ROB is the most updated, reliable, valid, and comprehensive tool and considered as the gold standard for assessing potential biases in RCTs. An overview [24] comparing the advantages and disadvantages of the JADAD scale over RoB 2.0 suggested that RoB 2.0 should be primarily considered when assessing the quality of RCTs.

The assessment on the basis of AMSTAR 2.0 revealed that the overall quality of SRs in this field was poor, which could compromise the reliability of COS. Our survey identified several methodological issues, including: (1) more than 50% of SRs did not select appropriate tools to assess the risk of bias in the included studies, account for the risk of bias in the interpretation, report the funding sources of included studies, or explain the reason for the design of included studies; (2) 20–50% of SRs did not register their protocol before starting the studies or discuss the sources of any heterogeneity and their influence on the results; (3) less than 20% of SRs did not use a comprehensive literature search strategy, provide a list of excluded studies with their reasons for exclusion, describe the included studies in adequate detail, or report any potential sources of conflict of interest. Rogozińska et al. [12] previously found that none of the SRs published on the COMET database met all nine methodological expectations for SRs outlined in AMSTAR 2.0, which was similar to our findings. Therefore, we urge researchers in this field to strive to improve the quality of SRs regarding COS by addressing the above-mentioned issues.

Additionally, we discovered that SRs with registered protocols had a significantly higher overall quality than those without. Furthermore, registering protocols on PROSPERO, as opposed to COMET, can be advantageous in improving the quality of SRs. SRs are generally considered the best evidence for addressing health research questions due to their rigorous and methodical approach, which requires adherence to pre-specified methods and analyses. Creating and registering a protocol that outlines the rationale, hypothesis, and planned methods of the SRs in the initial stage of the study ensures the transparency and reproducibility, reducing the risk of selective reporting bias. Protocols guide decisions about including/excluding studies/data in SRs during the research process and reporting outcomes in the final manuscript. As a solution to improve incomplete and biased reporting of SRs, the Preferred Reporting Items for Systematic reviews and Meta-analyses (PRISMA) guideline proposed registering protocols in 2009 [25]. In 2011, PROSPERO, the first international registry for systematic reviews, was launched by the Center for Reviews and Dissemination (CRD) at York University. The establishment of PROSPERO aimed to provide a registry for SRs in health, promote their quality, and reduce redundancy and resource waste. Wanderley et al. [26] also agreed that SRs registered with PROSPERO have the highest level of consistency and credibility. Therefore, we recommend that authors of SRs on COS should register a detailed protocol on PROSPERO before beginning the review.

Recommendations for SRs regarding the COS development

In order to establish a convincing COS, it is essential to perform a SR with adequate methodological quality. To promote the overall quality of SRs, researchers should first register a protocol on specialized databases like PROSPERO before conducting the SR. Secondly, appropriate tools for assessing the risk of bias in the included studies should be selected, with RoB being the optimum choice for SRs that include RCTs. Improving the quality of SRs also requires amelioration of the methodological process in the following aspects: building a comprehensive literature search strategy, providing a detailed exclusion list with reasons, and accounting for the risk of bias in the results. Regarding the selection of the disease topic of SRs for COS development, researchers should focus on diseases with high DALYs that have not been covered in previous SRs regarding COS, such as ischemic heart disease and diarrheal diseases.

Strengths and limitations

This study has several strengths, including its comprehensive and systematic investigation of methodological details about SRs regarding the COS development. Furthermore, the study had a large sample size of 175 SRs published in PubMed from the earliest to 2022, making it relatively representative.

Nonetheless, there are several limitations that should be considered. First, we only searched for eligible SRs published in PubMed and included those published in Chinese and English, potentially missing SRs that were not indexed in PubMed at the time of the search or those published in other languages. Second, despite the assistance of trained assessors, some misinterpretation may still exist in some extracted items from the original article source, which could have affected the results. Finally, the assessment of methodological quality and identification of specific methodological gaps were limited by the insufficient reporting of methodological details in some of the included SRs.

Conclusions

In summary, despite the increasing number of SRs published on COS development, their overall quality was found to be poor. Methodological issues were present in every aspect of design and execution when examining using AMSTAR 2.0. Identifying and understanding these important methodological gaps is critical for future SRs. We have identified several critical methodological issues and provided recommendations for improving the quality of SRs on COS development. Our findings emphasize the need for researchers to carefully select the disease topic and strictly adhere to optimal methodology requirements when conducting SRs to establish COS.

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files.

Abbreviations

COMET:: Core Outcome Measures in Effectiveness Trials
COS:: Core Outcome Set
SRs:: Systematic Reviews
AMSTAR:: A Measure Tool to Assess Systematic
RCT:: Randomized Controlled Trial
COVID-19:: Corona Virus Disease 2019
ICD:: International Classification of Diseases
PICO:: Population, Intervention, Comparator group, and Outcome
RoB:: Cochrane Risk-of-Bias tool
CASP:: Critical Appraisal Skills Program
GBD:: Global Burden of Disease
DALYs:: Disability-Adjusted Life-Years

References

Kirkham JJ, Dwan KM, Altman DG, Gamble C, Dodd S, Smyth R, et al. The impact of outcome reporting bias in randomised controlled trials on a cohort of systematic reviews. BMJ. 2010;340:c365. https://doi.org/10.1136/bmj.c365.
Article PubMed Google Scholar
Williamson PR, Gamble C. Identification and impact of outcome selection bias in meta-analysis. Statist Med. 2005;24:1547–61. https://doi.org/10.1002/sim.2025.
Article MathSciNet CAS Google Scholar
Mayo-Wilson E, Fusco N, Li T, Hong H, Canner JK, Dickersin K, et al. Multiple outcomes and analyses in clinical trials create challenges for interpretation and research synthesis. J Clin Epidemiol. 2017;86:39–50. https://doi.org/10.1016/j.jclinepi.2017.05.007.
Article PubMed Google Scholar
Zhang J, Lu Y, Kwong JS, Li X, Zheng W, He R. Quality assessment of the Chinese clinical trial protocols regarding treatments for coronavirus disease 2019. Front Pharmacol. 2022;11:1330. https://doi.org/10.3389/fphar.2020.01330.
Article CAS Google Scholar
Jin X, Pang B, Zhang J, Liu Q, Yang Z, Feng J, et al. Core Outcome Set for clinical trials on Coronavirus Disease 2019 (COS-COVID). Eng (Beijing). 2020;6(10):1147–52. https://doi.org/10.1016/j.eng.2020.03.002.
Article CAS Google Scholar
Available. at: www.comet-initiative.org.
Hahn S, Williamson PR, Hutton JL, Garner P, Flynn EV. Assessing the potential for bias in meta-analysis due to selective reporting of subgroup analyses within studies. Stat Med. 2000;19(24):3325–36. https://doi.org/10.1002/1097-0258(20001230)19:24%3C3325::aid-sim827%3E3.0.co;2-d.
Article CAS PubMed Google Scholar
Blackwood B, Marshall J, Rose L. Progress on core outcome sets for critical care research. Curr Opin Crit Care. 2015;21(5):439–44. https://doi.org/10.1097/M-CC.0000000000000232.
Article PubMed Google Scholar
Chan AW, Song F, Vickers A, Jefferson T, Dickersin K, Gøtzsche PC, et al. Increasing value and reducing waste: addressing inaccessible research. Lancet. 2014;383(9913):257–66. https://doi.org/10.1-016/S0140-6736(13)62296-5.
Article PubMed PubMed Central Google Scholar
Clarke M, Williamson PR. Core outcome sets and systematic reviews. Syst Rev. 2016;5:11. https://doi.org/10.1186/s13643-016-0188-6.
Article PubMed PubMed Central Google Scholar
Williamson PR, Altman DG, Bagley H, Barnes KL, Blazeby JM, Brookes ST et al. The COMET handbook: version 1.0. Trials 18(Suppl 3):280. https://doi.org/10.1186/s13063-017-1978-4.
Rogozińska E, Gargon E, Olmedo-Requena R, Asour A, Cooper NAM, Vale CL, et al. Methods used to assess outcome consistency in clinical studies: a literature-based evaluation. PLoS ONE. 2020;15(7):e0235485. https://doi.org/10.1371/journal.pone.0235485.
Article CAS PubMed PubMed Central Google Scholar
Shea BJ, Reeves BC, Wells G, Thuku M, Hamel C, Moran J, et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ. 2017;358:j4008. https://doi.org/10.1136/bmj.j4008.
Article PubMed PubMed Central Google Scholar
Davies S. The importance of PROSPERO to the National Institute for Health Research. Syst Rev. 2012;1:5. https://doi.org/10.1186/2046-4053-1-5.
Article PubMed PubMed Central Google Scholar
Mary LM. Interrater reliability: the kappa statistic. Biochemia Media. 2012;22(3):276–82. [Medline: 23092060].
Google Scholar
World Health Organization. International Classification of Diseases 11th Revision (ICD-11). https://icd.who.int/en. Published 2022. [accessed March 1, 2023.].
Higgins JPT, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343. https://doi.org/10.1136/bmj.d5928.
Jadad A, Moore A, Carroll D, Jenkinson C, Reynolds DJ, Gavaghan DJ, et al. Assessing the quality of reports of randomized clinical trials: is blinding necessary? Control Clin Trials. 1996;17:1–12. https://doi.org/10.1016/0197-2456(95)00134-4.
Article CAS PubMed Google Scholar
Verhagen AP, de Vet HC, de Bie RA, Kessels AG, Boers M, Bouter LM, et al. The Delphi list: a criteria list for quality assessment of randomized clinical trials for conducting systematic reviews developed by Delphi consensus. J Clin Epidemiol. 1998;51(12):1235–41. https://doi.org/10.1016/s0895-4356(98)00131-0.
Article CAS PubMed Google Scholar
Critical Appraisal Skills Programme (CASP). CASP qualitative research Checklist: 10 questions to help you make sense of qualitative research. Oxford: Public Health Resource Unit; UK: Milton Keynes Primary Care Trust,; 2002.
Google Scholar
Kennedy CE, Fonner VA, Armstrong KA, Denison JA, Yeh PT, O’Reilly KR, et al. The evidence project risk of bias tool: assessing study rigor for both randomized and non-randomized interve-ntion studies. Syst Rev. 2019;8:3–3. https://doi.org/10.1186/s13643-018-0925-0.
Article PubMed PubMed Central Google Scholar
Oxford Centre for Evidence-Based Medicine. Oxford Centre for Evidence-Based Medicine-levels of evidence. Group. http://www.cebm.net/oxford-centre-evidence-based-medicine-levels-evidence-march-2009. Published 2011. [accessed November 24, 2018.].
GBD 2019 Diseases and Injuries Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the global burden of Disease Study 2019. Lancet. 2020;396(10258):1204–22. https://doi.org/10.1016/S0140-6736(20)30925-9.
Article Google Scholar
Luchini C, Veronese N, Nottegar A, Shin JI, Gentile G, Granziol U, et al. Assessing the quality of studies in meta-research: review/guidelines on the most important quality assessment tools. Pharm Stat. 2021;20:185–95. https://doi.org/10.1002/pst.2068.
Article PubMed Google Scholar
Moher D, Liberati A, Tetzlaff J, Altman DG, PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ. 2009;339:b2535. https://doi.org/10.1136/bmj.b2535.
Article PubMed PubMed Central Google Scholar
Bernardo WM. PRISMA statement and PROSPERO. Int Braz J Urol. 2017;43(3):383–4. https://doi.org/10.1590/S1677-5538.IBJU.2017.03.02.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (No. 72064004), Doctoral Foundation of Guizhou Provincial People’s Hospital (GZSYBS (2019) No.09), the Hospital Pharmaceutical Research Project funded by the China Pharmaceutical Association (CPA-Z05-ZC-2023-002), and Foundation of Health Commission ofGuizhou Province (gzwkj2021-553). We would like to thank the Ouality Control Center of Guizhou.

Funding

This work was supported by National Natural Science Foundation of China (No.72064004), Doctoral Foundation of Guizhou Provincial People’s Hospital (GZSYBS (2019) No.09), the Hospital Pharmaceutical Research Project funded by the China Pharmaceutical Association (CPA-Z05-ZC-2023-002), and Foundation of Health Commission of Guizhou Province (gzwkj2022-224). But the funding did not involve in the study design, the collection, analysis and interpretation of data, writing of the report, or the decision to submit the article for publication.

Author information

Hong Cao and Yan Chen are co-first authors of this article and contribute equally to the article.

Authors and Affiliations

School of Pharmaceutical Sciences, Guizhou University, 2708 South of Huaxi Avenue Road, Guiyang, China
Hong Cao & Yan Chen
Department of Pharmacy, Guizhou Provincial People’s Hospital, No.83 Zhongshandong Road, Guiyang, Guizhou Province, China
Hong Cao, Yan Chen, Junjie Lan, Rui Zhang, Huaye Zhao, Linfang Hu, Jiaxue Wang, Shuimei Sun & Jiaxing Zhang
Department of Health Services Management, Guizhou Medical University, Shansi Building, Huaxi College Town, Guiyang, China
Zhihao Yang
Global Health Nursing, Graduate School of Nursing Science, St. Luke’s International University, 10-1 Akashi-choChuo-Ku 104-0044, Tokyo, Japan
Joey Sum-wing Kwong
Office of Health Insurance Administration, Guizhou Provincial People’s Hospital, No.83 Zhongshandong Road, Guiyang, Guizhou Province, China
Songsong Tan
Department of endoscopy, Guizhou Provincial People’s Hospital, No.83 Zhongshandong Road, Guiyang, Guizhou Province, China
Jinyong Cao
Experimental Cancer Medicine, Department of Laboratory Medicine, Karolinska Institute, Room 601, Novum PI 6, Hälsovägen 7, SE-14157 Huddinge, Stockholm, Sweden
Rui He & Wenyi Zheng

Authors

Hong Cao
View author publications
You can also search for this author in PubMed Google Scholar
Yan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zhihao Yang
View author publications
You can also search for this author in PubMed Google Scholar
Junjie Lan
View author publications
You can also search for this author in PubMed Google Scholar
Joey Sum-wing Kwong
View author publications
You can also search for this author in PubMed Google Scholar
Rui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Huaye Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Linfang Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jiaxue Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shuimei Sun
View author publications
You can also search for this author in PubMed Google Scholar
Songsong Tan
View author publications
You can also search for this author in PubMed Google Scholar
Jinyong Cao
View author publications
You can also search for this author in PubMed Google Scholar
Rui He
View author publications
You can also search for this author in PubMed Google Scholar
Wenyi Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Jiaxing Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Hong Cao, Yan Chen, and Jiaxing Zhang collected the data. Jiaxing Zhang and Songsong Tan involved in statistical analysis. Hong Cao, Yan Chen, Zhihao Yang, and Jiaxing Zhang drafted the manuscript. Jiaxing Zhang, Junjie Lan, Rui He, Wenyi Zheng, and Joey Sum-wing Kwong revised the final manuscript. Rui Zhang, Huaye Zhao, Linfang Hu, Jiaxue Wang, Shuimei Sun, Jinyong Cao conceived and designed the study, performed critical revision of the manuscript for important intellectual content, and approved final version of the manuscript to be published.

Corresponding author

Correspondence to Jiaxing Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Cao, H., Chen, Y., Yang, Z. et al. The methodological quality of systematic reviews regarding the Core Outcome Set (COS) development. BMC Med Res Methodol 24, 65 (2024). https://doi.org/10.1186/s12874-024-02182-w

Download citation

Received: 22 June 2023
Accepted: 16 February 2024
Published: 11 March 2024
DOI: https://doi.org/10.1186/s12874-024-02182-w

The methodological quality of systematic reviews regarding the Core Outcome Set (COS) development

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Eligibility criteria

Literature search

Study selection and data extraction

Quality assessment

Statistical analysis

Results

Search results and description of included SRs

Methodological quality of included SRs

Discussion

Main findings and interpretations

Recommendations for SRs regarding the COS development

Strengths and limitations

Conclusions

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Research Methodology

Contact us