Skip to main content

Table 1 Median Test-retest-reliability (Gwet’s AC1) per Amstar item 1–11 reviewer (R1-R7), for n = 16 SRs

From: Measuring test-retest reliability (TRR) of AMSTAR provides moderate to perfect agreement – a contribution to the discussion of the importance of TRR in relation to the psychometric properties of assessment tools

AMSTAR-Item 1 2 3 4 5 6 7 8 9 10 11 Median (Range)
R1 1 0.72 0.64 0.75 0.9 1 0.86 0.75 0.64 0.63 0.92 0.75 (0.63–1)
R2 1 0.9 0.92 0.63 0.89 0.75 1 0.68 0.84 0.64 1 0.89 (0.63–1)
R3 1 0.8 0.84 0.34 0.63 0.34 0.86 0.75 0.5 −0.02 0.93 0.75 (0.2–1)
R4 1 0.63 0.82 0.63 0.88 1 0.86 0.4 0.82 0.88 0.72 0.82 (0.4–1)
R5 1 0.8 0.8 0.6 0.53 0.86 0.58 0.69 0.82 0.5 −0.02 0.69 (−0.02–1)
R6 1 0.75 0.77 0.53 0.15 0.4 0.93 0.68 1 0.76 0.72 0.75 (0.4–1)
R7 1 0.88 0.92 0.88 0.51 0.77 1 0.68 0.53 0.6 0.75 0.77 (0.51–1)
Median (Range) 1 (1–1) 0.8 (0.63–1) 0.82 (0.64–1) 0.6 (0.53–1) 0.53 (0.15–1) 0.77 (0.34–1) 0.86 (0.58–1) 0.68 (0.4–1) 0.82 (0.5–1) 0.6 (0.2–1) 0.72 (−0.02–1)  
  1. Legend: light gray: moderate agreement, medium dark colored: substantial agreement, dark colored almost perfect agreement