Reliability and criterion validity of self-measured waist, hip, and neck circumferences

Barrios, Pamela; Martin-Biggers, Jennifer; Quick, Virginia; Byrd-Bredbenner, Carol

doi:10.1186/s12874-016-0150-2

Research article
Open access
Published: 04 May 2016

Reliability and criterion validity of self-measured waist, hip, and neck circumferences

Pamela Barrios¹,
Jennifer Martin-Biggers¹,
Virginia Quick¹ &
…
Carol Byrd-Bredbenner¹

BMC Medical Research Methodology volume 16, Article number: 49 (2016) Cite this article

6442 Accesses
23 Citations
2 Altmetric
Metrics details

Abstract

Background

Waist, hip, and neck circumference measurements are cost-effective, non-invasive, useful markers for body fat distribution and disease risk. For epidemiology and intervention studies, including body circumference measurements in self-report surveys could be informative. However, few studies have assessed the test-retest reliability and criterion validity of a self-report tool feasible for use in large scale studies.

Methods

At home, mothers of young children viewed a brief, online instructional video on how to measure their waist, hip, and neck circumferences. Afterwards, they created a homemade paper measuring tape from a downloaded file with scissors and tape, took all measurements in duplicate, and entered them into an online survey. A few weeks later, participants visited an anthropometrics lab where they measured themselves again, and trained technicians (n = 9) measured participants in duplicate using standard equipment and procedures. To assess differences between self- and technician-measured circumferences, duplicate measurements for participant home self-measurements, participant lab self-measurements, and technician measurements each were averaged and Wilcoxon signed-rank tests conducted. Agreement between all possible pairs of measurements were examined using Intraclass Correlations (ICCs) and Bland-Altman plots.

Results

Participants (n = 41; aged 38.05 ± 3.54SD years; 71 % white) were all mothers that had at least one child under the age of 12 yrs. Technical error of measurements for self- and technician- duplicate measurements varied little (0.08 to 0.76 inches) and had very high reliability (≥0.90). Intraclass Correlations (ICC) comparing self vs technician were high (0.97, 0.96, and 0.84 for waist, hip, and neck). Comparison of self-measurements at home vs lab revealed high test-retest reliability (ICC ≥ 0.87). Differences between participant self- and technician measurements were small (i.e., mean difference ranged from −0.13 to 0.06 inches) with nearly all (≥93 %) differences within Bland-Altman limits of agreement and <10 % exceeding the a priori clinically meaningful difference criterion.

Conclusions

This study has demonstrated a simple, inexpensive method for teaching novice mothers of young children to take their own body circumferences resulting in accurate, reliable data. Thus, collecting self-measured and self-reported circumference data in future studies may be a feasible approach in research protocols that has potential to expand our knowledge of body composition beyond that provided by self-reported body mass indexes.

Peer Review reports

Background

Anthropometric data collected by self-report surveys are usually limited to height and weight—measurements that are easy, quick, inexpensive, and tend to have a small degree of reporting error in adults [1–7]. These measures typically are used to calculate body mass index (BMI) for the purpose of classifying individuals as underweight, normal weight, overweight, or obese. However, BMI is an indirect measurement of body adiposity and may result in misclassification [8–10]. For instance, those who have a greater proportion of muscle tissue and bone mass, such as athletes and body builders, weigh more and, thus, likely have a BMI that incorrectly indicates weight status. Individuals who are inactive or who have age-related decreases in muscle and bone mass may have a BMI classified as normal weight despite having elevated body fat levels [8]. Additionally, men tend to have more lean muscle mass and less body fat than women even when both have the same BMI [10, 11]. Another limitation of BMI is that it does not reflect body fat distribution (central trunk vs. hips and thighs), which is associated with metabolic disturbances and cardiovascular risks [12–15].

Waist, hip, and neck circumferences are cost-effective, non-invasive, and informative supplementary measurements that could be included on self-report surveys to enhance the usefulness of BMI by serving as indicators of body fatness and fat distribution [15, 16]. Convincing evidence indicates that waist circumference and waist-hip circumference ratio are strongly associated with cardiovascular disease, type 2 diabetes mellitus, hypertension, sarcopenic obesity, colorectal and post-menopausal breast cancer, and, in older adults, declining quality of life and physical activity levels [17–21]. Some have proposed that waist circumference could replace waist-hip ratio and BMI as a single data point to reflect all-cause mortality risk [22]. Others have called for waist and hip circumferences to be routine metabolic and cardiovascular health clinical measures [13, 23] and used as indicators for weight loss interventions [7, 15].

Neck circumference is a relatively new, economical, and practical measure identified as a useful marker for upper body obesity [24, 25]. It correlates positively with metabolic syndrome risk, cardiovascular risk, and elevated blood pressure in children, and pregnancy-induced hypertension [25–31]. In addition, evidence indicates that it is a stronger indicator of elevated serum triglycerides and decreased serum HDL cholesterol (atherogenic dyslipidemia) than BMI and waist circumference in both sexes, making it a useful, non-invasive diagnostic tool [32].

Including circumferences in self-report surveys is worthwhile only if the values reported are accurate. Dutch, overweight workers who were sent a tape measure and written instructions for measuring their own waist circumference had self-reported values that were highly correlated with researcher-measured waist circumference [16]. Other studies using similar methodology also found technician-measured and self-measured circumferences were highly correlated for waist and hip, did not differ significantly, and had no consistent trend across studies in under- or over-reporting [7, 10, 33–40]. No findings could be located to establish reliability of self-measured neck circumferences.

Training materials have been developed to improve circumference measurement accuracy. For instance, English-speaking adults in Scotland and Belgium were given a measuring tape and asked to measure their own waist and hip circumferences using written instructions or training video instructions; those using the training video reported more accurate waist circumferences measurements [41]. Completing a 25-min computer-based training with a reading grade level of 11.7 in a laboratory setting prior to self-measurement resulted in waist circumferences that did not differ significantly between college students and trained staff [37].

Previously published research comparing precision of self-report vs trained-technician measurements indicate self-report measurements may be sufficiently accurate for epidemiological studies [33–35, 38, 42, 43]. The few research studies available suggest that training, especially video instructions, have the potential to improve self-reported waist measurement accuracy [37, 41]. These findings are promising, but their application remains limited for numerous reasons. For example, the instructions (written and video) provided to study participants are generally unavailable beyond the study participants. Additionally, the participant burden (e.g., training time needed and difficulty level of training materials) is beyond what many individuals are willing or able to invest [37] and the ecological value was sacrificed in many studies because training and self-measurements were conducted in a laboratory setting [37, 38, 44].

A key factor limiting application and replication of existing research is the tape measure used. That is, previous studies have relied on tape measures with special characteristics [10, 44] or one mailed to participants [7, 16, 39, 45]—this limitation makes it costly and logistically-difficult to conduct a large scale survey or promote self-measurement as a strategy for self-monitoring of health. In addition, little is known about the reliability of self-measurements over time in any population group [46]. Another limitation of published studies is the statistical procedures used to compare self- and technician-measurements. Many report only correlation coefficients (e.g., Pearson, kappa, ICC), which demonstrate strength of relationship between two raters, but do not reflect inter-rater agreement (e.g., Bland-Altman plots, also called Tukey Mean Difference plots [47]). Of those reporting Bland-Altman plots, no studies of technician- vs self-measurements could be located that applied the array of reporting standards for Bland-Altman analysis of agreement between measurements taken by technicians vs. self [20, 36]. Thus, to overcome limitations of previous research and ascertain the test-retest reliability and criterion validity of a self-report tool feasible for use in large scale studies, this study compared self-measurements of waist, hip, and neck circumferences taken by novice lay people (i.e., mothers of young children) at home after viewing a brief, simple online instructional video and creating a homemade paper measuring tape from a downloaded pdf file to measurements taken by trained technicians using research-grade equipment and standard procedures.

Methods

The Institutional Review Boards at the authors’ university approved study procedures. All participants gave informed consent.

Sample

Participants were recruited via announcements posted on community websites and distributed through workplace listservs. Recruitment materials invited individuals to learn to accurately measure their neck, waist, and hips and then have these measurements taken by a trained researcher. Participants received $25 for completing the study. To be eligible for this study, participants had to be women, between 18 and 45 years of age, have at least one child under 12 years of age, and not be pregnant within the past year.

Development of study tape measure and video

Tape measures that can be downloaded, printed on home printers, and assembled with scissors and tape are commonly used by online clothing companies to ensure ordered clothing will properly fit purchasers. Development of the tape measure for this study began by collecting and reviewing a wide array of online tape measures and assessing them for measurement accuracy, ease of assembly, and clarity of instructions. Existing tape measures were extensively adapted to create the tape measure used in this study; adaptations included developing by clarifying assembly instructions and improving labeling of cutting lines and pieces to be joined by tape (see Fig. 1).

Development of the video began by writing scripts using consumer-friendly terminology. The scripts included instructions for creating the tape measure and taking neck, waist, and hip circumferences. The scripts were reviewed for technical accuracy by a panel of experts in anthropometric measurements and instructional design (n = 4) and iteratively refined and shortened. The key points addressed in the video are shown in Table 1. Waist was measured at the level of the belly button (umbilicus) [48–50], hips measured at the level of maximum extension of the buttocks [17, 50], and neck at a point halfway between the collar bone and chin in the middle of the neck [25].

Table 1 Key Points Addressed in the Body Measurements Video

Full size table

Before participants were recruited, the tape measure and video were posted online. The tape measure and video underwent formative cognitive testing with women similar to the study participants, but not included in the study reported here, to verify clarity of information, accuracy of interpretation, and application of the information; it was iteratively refined based on formative testing findings. Subsequently, the tape measure and video were pilot-tested with 7 women recruited in the same way as the study sample and having characteristics similar to those in the study sample, but not in the sample, and again refined.

Study design

Participants completed an in-home assessment, including self-measures and an online questionnaire (part 1), followed by a clinical visit (part 2). In part 1, participants viewed the less than 9 min instructional video explaining how to measure their own waist, hip, and neck circumferences using the measuring tape they printed out and assembled. Participants were advised to watch the video carefully and as many times as required until they felt sufficiently confident to take their measurements accurately. They also were instructed to pause the video at each of these points to complete the task before proceeding: assemble the tape measure, measure waist, measure hips, and measure neck. The video provided verbal instructions along with photos of women demonstrating the measuring procedure. Participants were instructed to wear minimal and/or snug-fitting clothing, fast for 4 h and void their bladders before taking any measurements, take measurements at the end of a normal expiration, take all measurements in duplicate to the nearest ½-inch, record measurements immediately after taking them, and then enter the measurements into an online survey after all measurements were completed.

The survey also collected participant name, demographic data, height, and weight and evaluated video clarity and ease of constructing the tape measure. Participants were instructed to retain the tape measure.

In part 2, participants visited a campus anthropometrics lab. At the lab, technicians confirmed participants took their measurements at home using the tape they assembled and brought to the lab. The participant-assembled tape measure was labeled and later analyzed for accuracy of assembly. Participants were instructed to fast 4 h before the visit and to wear light, snug clothing. At the lab, participants were instructed to void their bladders, watch the video, and take their measurements in duplicate in the same way they did at home using a commercial measuring tape like those used in home sewing (the home-assembled paper tape measures were not used in the lab to preserve them for later analysis). Trained-technicians, blind to participants’ self-reported measurements taken at home, observed participants while they took and recorded their self-measurements.

Then, technicians measured participants’ circumferences in duplicate using a Gulick tape measure (Country Technology, Inc., Gays Mills, WI) and standard research methods based on the same anatomic landmarks as the participants were instructed to use. Technicians also measured in duplicate heights without shoes to the nearest ¼-inch using a calibrated wall-mounted stadiometer (QuickMedical, Issaquah, WA) and weights to the nearest ¼-pound with a calibrated digital scale (Tanita model TBF-300WA, Arlington Heights, Illinois). At the conclusion of the session, technicians briefly interviewed participants to explore their perceptions of the clarity and ease of following the instructions in the video and to identify suggestions for improvement.

Prior to data collection, research technicians (n = 9) were trained to complete study measurements accurately. Technicians reviewed standard anthropometric measurement protocol [51], discussed the protocol with the lead technician, viewed live demonstrations of measurements being taken, and then practiced taking measurements until they achieved a high degree of accuracy compared to the lead technician. The coefficient of inter-observer reliability was above 0.96 for all measurements.

Data analysis

Analyses were performed using the SPSS for Windows statistical software package version 21.0 (SPSS Inc., Chicago, IL, USA) and Excel (Microsoft, Seattle, WA, USA). Technical error of measurement was calculated for each set of duplicate measurements to assess intra-observer error and reliability [51–54]. To assess differences between self- and technician-measured circumferences, duplicate measurements for participant home self-measurements, participant lab self-measurements, and technician measurements each were averaged and Wilcoxon signed-rank tests conducted. Agreement between all possible pairs of measurements were examined using Intraclass Correlations (ICCs). Statistical significance was set at P < 0.05.

Home and lab self-measurements were compared to establish test-retest reliability (repeatability of measurements). In-depth comparisons of participant home self-measurements and technician measurements were conducted because home measurements are analogous to those that participants would self-report in surveys and technician measures can be considered the comparative “gold standard” or measure to establish criterion validity [33]. Analysis procedures for Bland-Altman plots incorporated the array of reporting standards for agreement analysis in laboratory research [20]. These plots graphically illustrate the agreement between participant home self-measurements and technician measurements [47, 55, 56]. The plots include the mean difference (also called bias), limits of agreement (LOA, which are 95 % confidence limits for the bias) calculated using the formula for small samples [57], 95 % tolerance limits for upper and lower LOA (also referred to as 95 % confidence limits for the population) [57], and confidence limits for the bias calculated using standard error of the bias [56].

A comparison of the magnitude of measurement errors between study participants (i.e., untrained lay people) and technicians (i.e., health professionals trained in anthropometrics) was conducted to determine whether self-measurements by untrained lay persons using a self-assembled tape measure were sufficiently accurate for research purposes. A mean difference of ≥ ±10 % was set a priori as the clinically meaningful difference between participants and technicians. This difference was set after scrutinizing previous research for guidance. For example, a review article examining the magnitude of measurement error for waist circumferences taken at various anatomical locations (none included umbilicus) reported that intra-observer and inter-observer measurement error ranged from 0.7 to 9.2 cm (0.28 to 3.62 inches at 2.54 cm per inch) and 1.4 to 15 cm (0.55 to 5.90 inches), respectively, with untrained health professionals tending to have greater measurement error than health professionals trained in anthropometrics [46]. Authors of the review paper concluded it was “difficult to draw conclusions on the magnitude of measurement error [46].” Previous research has noted strong inter-observer differences in waist and hip measurements [46, 51, 58], even when observers were health professionals trained in anthropometrics. Additionally, studies rarely report absolute measurement error (e.g., inches different between observers) [46]. Although no reports of error as a percent of body circumferences could be located and a clinically meaningful difference for inter-observer or intra-observer measurements of waist circumference [46], or other body circumferences, could not be gleaned from the literature, Verweij et al. [46] proposed that a 5 % change in waist circumference measurements taken by trained health professionals may be a clinically relevant short-term change for improvements in health conditions positively associated with waist circumference (e.g., cardiovascular disease). The > ±10 % level was identified as the clinically significant level for this study after considering the inter-observer differences in measurements among trained health professionals reported by others [46, 58], Verweij et al’s [46] “realistic” range of waist circumferences (23.6 inches [60 cm] to 53.15 inches [135 cm]), the current lack of guidance with regard to body circumferences, and examination of studies comparing tests for other measures (i.e., blood glucose, vitamin D, total cholesterol, and triglycerides) which deemed values exceeding approximately 7 to 15 % as clinically significant measurement errors [59–62].

Results

Participants (n = 41) were 38.05 ± 3.54SD years, 71 % white, and 78 % had a bachelor’s degree or higher. As shown in Table 2, the technical error of measurement for home self-, participant lab self-, and technician- duplicate measurements indicated very minor differences (i.e., 0.08 to 0.76 inches) and very high reliability (≥0.90). Table 2 also reports means, ranges, and ICC for measurements. All ICCs comparing participant home vs participant lab, participant home vs technician, and participant lab vs technician met the benchmark for near perfect agreement (i.e., the ICCs fell within the 0.81 to 1.0 range) [63–68]. A comparison of the duplicate technician and self-measurements indicated high measurement repeatability because little difference occurred between the paired measurements for any circumference (i.e., mean difference ranged from −0.13 to 0.06 inches).

Table 2 Participant Characteristics and Intra-Class Correlations (ICC) between Participant Self-Measurements and Technician Measurements (N = 41)

Full size table

A comparison of the participant home and participant lab self-measurements was conducted to establish test-retest reliability. The ICCs for these intra-observer measurements were very high (see Table 2). Despite the significant difference between home and lab waist and neck self-measurements, the mean difference was negligible (i.e., 0.95 and 0.38 inches), respectively. The mean difference between home and lab self-measurements equals about 3 % difference for waist and neck circumferences and less than 1 % for hip circumferences which indicate high test-retest reliability.

Figure 2 illustrates the differences between participant home self-measurements and technician waist circumference measurements. The mean difference (bias) indicates that participant waist circumferences were about one-half inch larger than technician measurements; however this measurement did not differ significantly between technician and participant home measurements and did not demonstrate systematic bias. As anticipated, ≥95 % of waist measurement differences fell within the limits of agreement (LOA). A comparison of differences indicated the vast majority (i.e., 93 %) of the participant home waist circumference measurements were ±10 % of technician measurements (i.e., the a priori standard) and thus were not clinically meaningful. The three differences outside the standard differed 12, 12, and 16 %. All three of these cases also had differences outside the standard for one other circumference (1 hip and 2 neck). The upper and lower tolerance limits show the potential agreement expected if similar measures are taken with different samples in the future [56].

The mean difference between home and technician hip measurements was about one-fifth of an inch. Participant home hip measurements did not differ significantly from technician measurements and there was no systematic bias. An examination of the hip measurement differences revealed that 93 % were within the LOA (Fig. 3). As with waist measurements, the vast majority (i.e., 95 %) of the participant home and technician hip circumference measurements were within the a priori standard. The two hip circumferences differences outside the standard differed by 11 and 16 %. Both cases also had differences outside the standard for one other circumference (1 waist and 1 neck).

The mean difference between home and technician neck measurements showed a slight positive systematic bias, with participant measurements being consistently larger than technician measurements by an average of about eight-tenths of an inch (Fig. 4). Home measurements were significantly greater than technician measurements. However, a comparison of the mean differences with the LOA indicate that 95 % of the differences were within the LOA. Most (i.e., 80 %) participant home and technician neck circumference measurements were within the a priori standard indicating these measurements were not clinically meaningful. The neck circumferences differences that were not within the standard (n = 7) differed by 13, 14, 14, 14, 17, 17, and 31 %; all of these values except two differed by 2-inches or less. Two of these cases had measurements outside the standard for one other circumference (both were for waist).

An examination of the tape measures participants made at home indicated that nearly all followed the online instructions and assembled the measuring tapes correctly. Only three participants did not correctly assemble the measuring tape. Their most common error was not taping pieces of the tape measure together at the correct locations; despite this error, measurements from two of these women were very similar to technician measurements whereas the third woman underestimated measures by more than 2 inches. Using a 5-point scale (1 = not easy at all and 5 = very easy), participants rated the ease of making the tape measure 4.6 ± 0.48SD. To further improve ease, participants suggested making the dotted cutting lines darker to help them cut the paper tape straight.

Participants had limited suggestions for refining the instructional video. A few felt more information on how to identify the widest part of their hips was needed beyond the pictures depicting this in the video. A few thought the >9-min video was too long and detailed.

Technician observations of participants when the participants were measuring themselves in the lab indicated that >80 % of mothers completed all self-measurement procedures without errors. Errors observed in some participants were not keeping the tape measure flat, placing the tape measure at incorrect locations on waist or hips, wearing inappropriate clothing or not removing clothing, and incorrectly reading measurements on the tape measure.

Discussion

The aim of this study was to evaluate the test-retest reliability and criterion validity of self-measurements taken by novice lay persons using a self-assembled tape measure after viewing a brief online instructional video. Results indicate that participants were able to accurately assemble the tape measure and demonstrate proficiency in measuring themselves when observed by lab technicians. The low technical error measurements and high reliability for duplicate measurements demonstrates excellent intra-observer accuracy and reliability. The high ICCs between participant home and lab waist, hip, and neck circumferences indicate that participant self-measurements are highly reliable over time, which is congruent with the limited research reporting reliability of self-measurements [10, 36]. The high reliability indicates that measurements individuals take over time can help them accurately track physical changes that may enable them, their health care providers, and researchers to better realize individuals’ increasing or reducing risk for health conditions associated with high waist and neck circumferences and high waist:hip ratios, such as type 2 diabetes mellitus, cardiovascular disease, and metabolic syndrome [17–19].

The high ICCs between participant home and technician (criterion) measurements for all circumferences indicate measurements made by lay people using paper self-assembled tape measures and a brief online training video are comparable to those of trained health professionals using research-grade equipment and, thus, demonstrate good criterion validity. This finding also suggests that it is feasible to cost-effectively gather accurate self-measurements using a flexible, inelastic paper tape measure self-assembled from a pdf downloaded from the internet for large scale consumer surveys and intervention studies where participants are geographically distant from researchers and, thus, cannot easily visit anthropometric labs for measurement by trained technicians.

The mean differences in waist, hip, and neck circumferences between participants and technicians were small (0.95 inches [2.41 cm], 0.28 inches [0.71 cm], and 0.38 inches [0.97 cm], respectively). A comparison of the mean waist circumference differences reported in other studies of self-measurements (mean difference range = −6.70 to 5.98 cm) indicate the findings from the study reported here (i.e., 0.52 inches or 1.32 cm) are well within this range [7, 10, 33–35, 37–44, 49, 69, 70]. Similarly, studies reporting LOA or SD of mean waist circumference differences thereby permitting LOA calculation, the lower limits ranged from −21.01 to −3.19 cm and the upper range spanned 1.46 to 15.42 cm, or an absolute difference of 4.65 to 33.32 cm. The upper and lower LOA and absolute difference for waist circumference in this study also are well within the values reported by others [7, 10, 34, 35, 37, 38, 41–44, 49, 69]. A similar comparison of mean differences in self-reported hip circumferences published by others [7, 33, 35, 38–41, 43, 44, 69, 70] (mean difference range = −5.90 to 1.19 cm; lower LOA range = −26.09 to −2.29 cm; upper LOA range 1.60 to 14.29 cm; absolute difference of LOA = 6.97 to 40.38 cm) to findings in this study indicate comparable results (mean difference = 1.07 cm, LOA = −10.36 to 9.35 cm; absolute difference 19.71 cm). Also, like other studies, there were no significant differences in mean waist and hip circumferences measured at home and in the lab by technicians [38, 41]. No comparable studies could be found for neck circumference, however the limited research available indicates high agreement for this measure among trained observers [71].

The vast majority of waist and hip circumference self vs. technician measurements were within the a priori standard for differences and, hence, not deemed clinically meaningful. Approximately one-sixth of neck circumference self-measurements differed more than 10 % from technician measurements; this finding, along with the positive bias in neck measurements, indicates a need for improvement. An even tighter agreement between participant and technician circumference measurements would further enhance the utility of self-measurements and may be feasible to achieve. For example, if the a priori standard had been set at < ±5 %, the majority of measurements in this study (i.e., 68, 88, and 78 % for waist, hip, and neck circumferences) would meet this standard.

The lower proportion of waist circumferences (68 %) in the < ±5 % agreement range vs. hip and neck circumferences (88 and 78 %) is of interest. This difference likely is because of the many factors affecting waist circumference throughout the day, including posture, time of day variations in height, fasting vs postprandial state [46, 51, 58, 69, 72], as well as the time gap between home measurements and lab measurements (mean 9.02 ± 6.55 days) and likely differences in phase of the menstrual cycle and associated commonly reported abdominal size changes.

It is important to consider that some differences between technician and participant measurements may be due to the dissimilarity in measurement precision each used. To follow best practices, technicians measured to the nearest ¼-inch. The ½-inch precision level was chosen for participants because previous research revealed that the majority laypersons elected to make self-measurements using ½-inch to 1 inch precision [69]. Additionally, consumers frequently have difficulty accurately interpreting markings denoting fractional quantities when performing measurements [73].

For the most accurate waist and hip measurements, experts recommend standing with feet together, arms at the side, wearing little clothing, being in a fasted state, taking measurements at the end of a normal expiration with the abdomen relaxed, and taking measurements twice and averaging measurements repeatedly until they are within 1 cm of each other [17]. Because the participants in this study were taking their own measurements, they could not keep their arms at their sides or feet together. However, the video did instruct them to wear minimal clothing, read the measuring tape after taking a deep breath in and letting it out, put tension on the tape measure by pulling it gently to be sure it sat flat on the skin but not to pull it tight, and take measurements twice. Additionally, the video repeated instructions for measuring each circumference twice and each time directed them to ensure that the tape measure ran straight across their back (waist), buttocks (hips), or neck and encouraged them to use a mirror to check accuracy of tape measurement placement. Although many protocols do not control for posture and fasting [17, 58], an improvement to the video that should be investigated in future research is to instruct individuals to take waist measurements when standing as erect as possible, after a 4 h or longer fast, and while relaxing their abdomen (not “sucking it in”) [70]. However, the similarity of the home self-measurements and technician measurements suggests participants’ abdomens were relaxed when doing self-measurements. Additionally, Yoon recommended enlisting the assistance of a partner when taking self-measurements because she observed this improved the accuracy of measurements [69].

This study has many important strengths. The tape measure and videos underwent formative cognitive testing by experts trained in qualitative data collection methods and subsequently refined to ensure participant comprehension. Technicians were rigorously trained and had excellent inter-rater reliability scores. In addition, this study is one of the first of its type to include intra-observer technical error measurement and reliability [51–54] as well as test-retest reliability data for self-measurements [10, 36]. A major contribution of this study is establishing the reliability and validity of use a self-assembled tape measure from a downloadable pdf file that is suitable for mass distribution via the Internet at virtually no cost—this innovation has the potential to advance research and promote self-monitoring of body size vis-à-vis personal health. Although creating the tape measure does place some participant burden (e.g., they need to have the appropriate resources, including a computer, Internet, printer, tape), participants in this study reported the tape measure was ease to assemble and did not report any problems. This study is among the few of its type to report confidence intervals for waist, hip, and neck circumferences differences and limits of agreement [34, 43, 44]. Importantly, this study provides the recommended reporting data for Bland-Altman analysis of agreement between measurements taken by technicians vs self. Clinically meaningful levels are rarely reported [20, 21, 56]; this study also is the first known to the authors to propose a clinically meaningful difference in agreement for body circumferences.

This study has numerous strengths, however, the results are limited by the size and homogeneity of the sample (i.e., young women who are mostly white and fairly well educated). Future research should expand the study to males and older adults of varying socioeconomic status and race/ethnicity. Additionally, studies should explore possible training effects (e.g., seeing the video a second time) to ascertain whether it was training effects or other factors (e.g., being observed by technicians) contributing to self-measurements in the lab that were somewhat closer to those of the technician than measurements made at home. Furthermore, an investigation of the effect of providing an interpretation of the measurements to consumers (e.g., health conditions associated with a large waist circumference) on promoting consumer discussions with health care providers would provide insight into the health promotion and motivational utility of self-measurements.

Conclusions

This study has demonstrated that a simple, inexpensive method for teaching individuals to take their own body circumferences provides reliable and suitably accurate data. Collecting self-measured and self-reported circumference data in research studies is a feasible addition to research protocols and has the potential to expand our knowledge of body composition beyond that provided by just BMI.

Ethics approval and consent to participate

The Rutgers University Institutional Review Board approved this study and written consent for participation was obtained from all study participants.

Declarations and availability statement

The dataset supporting the conclusions of this article are available from the corresponding author upon request.

References

Quick V, Byrd-Bredbenner C, Shoff S, White A, Lohse B, Horacek T, Kattlemann K, Phillips B, Hoerr S, Greene G. Concordance of self-report and measured height and weight of college students. J Nutr Educ Behav. 2015;47:94–8.
Article PubMed PubMed Central Google Scholar
Stunkard AJ, Albaum JM. The accuracy of self-reported weights. Am J Clin Nutr. 1981;34(8):1593–9.
CAS PubMed Google Scholar
Sherry B, Jefferds M, Grummer-Strawn L. Accuracy of adolescent self-report of height and weight in assessing overweight status: a literature review. Arch Pediatr Adolesc Med. 2007;161:1154–61.
Article PubMed Google Scholar
Nyholm M, Gullberg B, Merlo J, Lundqvist-Persson C, Rastam L, Lindblad U. The validity of obesity based on self-reported weight and height: Implications for population studies. Obesity (Silver Spring). 2007;15:197–208.
Article Google Scholar
Kreatsoulas C, Hassan A, Subramanian S, Fleegler E. Accuracy of self-reported height and weight to determine body mass index among youth. Child Adoesc Behav. 2014;2:126.
Google Scholar
Engstrom J, Paterson S, Doherty A, Trabulsi M, Speer K. Accuracy of self-reported height and weight in women: an integrative review of the literature. J Midwifery Womens Health. 2003;48:338–45.
Article PubMed Google Scholar
Rimm E, Stampfer M, Colditz G, Chute C, Litin L, Willett W. Validity of self-reported waist and hip circumferences in men and women. Epidemiol. 1990;1(6):466–73.
Article CAS Google Scholar
Rothman K. BMI-related errors in the measurement of obesity. Int J Obesity. 2008;32:S56–9.
Article Google Scholar
Romero-Corral A, Somers V, Sierra-Johnson J, Thomas R, Collazo-Clavell M, Korinek J, Allison T, Batsis J, Sert-Kuniyoshi F, Lopez-Jimenez F. Accuracy of body mass index in diagnosing obesity in the adult general population. Int J Obesity. 2008;32:959–66.
Article CAS Google Scholar
Prince S, Janssen I, Tramner J. Self-measured waist circumference in older patients with heart failure: a study of validity and reliability using a MyoTape. J Cardiopulm Rehabil Prev. 2008;28:43–7.
Article PubMed Google Scholar
Gallagher D, Visser M, Sepulveda D, Pierson R, Harris T, Heymsfield S. How useful is body mass index for comparison of body fatness across age, sex, and ethnic groups? Am J Epidemiol. 1995;143:228–39.
Article Google Scholar
Gastaldelli A. Abdominal fat: Does it predict the development of type 2 diabetes? Am J Clin Nutr. 2008;87:1118–9.
CAS PubMed Google Scholar
Despres J-P. Abdominal obesity: the most prevalent cause of the metabolic syndrome and related cardiometabolic risk. Euro Heart J Suppl. 2006;8:B4–12.
Article CAS Google Scholar
Despres J-P. Body fat distribution and risk of cardiovascular disease. Circulation. 2012;126:1301–13.
Article PubMed Google Scholar
Klein S, Allison DB, Heymsfield SB, Kelley DE, Leibel RL, Nonas C, Kahn R. Waist Circumference and Cardiometabolic Risk: A Consensus Statement from Shaping America's Health: Association for Weight Management and Obesity Prevention; NAASO, The Obesity Society; the American Society for Nutrition; and the American Diabetes Association. Obesity. 2007;15(5):1061–7.
Article PubMed Google Scholar
Dekkers J, van Wier M, Hendriksen I, Twisk J, van Mechelen W. Accuracy of self-reported body weight, height and waist circumference in a Dutch overweight working population. BMC Medical Res Methodol. 2008;8(1):69.
Article Google Scholar
World Health Organization. Waist circumference and waist-hip ratio: Report of a WHO expert consultation, 8–11 December 2008. Geneva: WHO; 2011.
Google Scholar
Stenholm S, Harris T, Rantanen T, Visser M, Kritchevsky S, Ferrucci L. Sarcopenic obesity: definition, cause and consequences. Curr Opin Clin Nutr Metabol Care. 2008;11:693–700.
Article Google Scholar
Batsis J, Zbehlik A, Barre L, Mackenzie T, Bartels S. The impact of waist circumference on function and physical activity in older adults: longitudinal observational data from the osteoarthritis initiative. Nutr J. 2014;13:81.
Article PubMed PubMed Central Google Scholar
Chhapala V, Kanwal S, Brar R. Reporting standards for Bland-Altman agreement analysis in laboratory research: A cross sectional survey of current practice. Ann Clin Biochem. 2015;52:382–6.
Article Google Scholar
Dewitte K, Fierens C, Stockl D, Thienpont L. Application of the Bland-Altman plot for interpretation of method- comparison studies: a critical investigation of its practice. Clin Chem. 2002;48:799–801.
CAS PubMed Google Scholar
Seidell J. Waist circumference and waist/hip ratio in relation to all-cause mortality, cancer and sleep apnea. Eur J Clin Nutr. 2010;64:35–41.
Article CAS PubMed Google Scholar
Cameron A, Maglian D, Shaw J, Zimmet P, Carstensen B, Alberti K, Tuomilehto J, Barr E, Pauvaday V, Kowlessur S, et al. The influence of hip circumference on the relationship between abdominal obesity and mortality. Int J Epidemiol. 2012;4:484–94.
Article Google Scholar
Aswathappa J, Garg S, Kutty K, Shankar V. Neck circumference as an anthropometric measure of obesity in diabetics. North Am J Med Sci. 2013;5:28–31.
Article Google Scholar
Hingorjo M, Qureshi M, Mehdi A. Neck circumference as a useful marker of obesity: A comparison with body mass index and waist circumference. J Pakistan Med Assoc. 2012;2012:36–40.
Google Scholar
Androutsos OGE, Moschonis G, Roma-Giannikou E, Chrousos GP, Manios Y, Kanaka-Gantenbein C. Neck circumference: A useful screening tool of cardiovascular risk in children. Pediatr Obes. 2012;7:187–95.
Article CAS PubMed Google Scholar
Nafiu O, Zepeda A, Curcio C, Prasad Y. Association of neck circumference and obesity status with elevated blood pressure in children. J Human Hypertens. 2014;28:263–8.
Article CAS Google Scholar
Cizza G, de Jonge L, Piaggi P, Mattingly M, Zhao X, Lucassen E, Rother K, Sumner A, Csako G. Neck circumference is a predictor of metabolic syndrome and obstructive sleep apnea in short-sleeping obese men and women. Metab Syndr Relat Disord. 2014;12:231–41.
Article CAS PubMed PubMed Central Google Scholar
Arnold T, Schweitzer A, Hoffman H, Onyewu C, Hurtado M, Hoffman E, Klein C. Neck and waist circumference biomarkers of cardiovascular risk in a cohort of predominantly African-American college students: A preliminary study. J Acad Nutr Diet. 2014;114:107–2014.
Article PubMed PubMed Central Google Scholar
Jamar G, Pisani L, Oyama L, Belote C, Masquio D, Furuya V, Carvalho-Ferreira J, Andrade-Silva S, Damaso A, Caranti D. Is the neck circumference an emergent predictor for inflammatory status in obese adults? Int J Clin Pract. 2014;67:217–24.
Article Google Scholar
Ursavas A, Karadag M, Nalci N, Ercan I, Gozu R. Self-reported snoring, maternal obesity and neck circumference as risk factors for pregnancy-induced hypertension and preeclampsia. Respiration. 2008;76:33–9.
Article PubMed Google Scholar
Vallianou N, Evangelopoulos A, Bountziouka V, Vogiatzakis E, Bonou M, Barbetseas J, Avgerinos P, Panagiotakos D. Neck circumference is correlated with triglycerides and inversely related with HDL cholesterol beyond BMI and waist circumference. Diabetes/Metab Res Rev. 2013;29:90–7.
Article CAS Google Scholar
Tehard B, Van Liere MJ, Com Nougue C, Clavel-Chapelon F. Anthropometric measurements and body silhouette of women: validity and perception. J Am Diet Assoc. 2002;102(12):1779–84.
Article CAS PubMed PubMed Central Google Scholar
Ayala A, Nijpels G, Lakerveld J. Validity of self-measured waist circumference in adults at risk of type 2 diabetes and cardiovascular disease. BMC Med. 2014;12:170.
Article Google Scholar
Spencer E, Roddam A, Key T. Accuracy of self-reported waist and hip measurements in 4492 EPIC-Oxford participants. Public Health Nutr. 2004;7:723–7.
Article PubMed Google Scholar
Kushi L, Kaye S, Folsom A, Soler J, Prineas R. Accuracy and reliability of self-measurement of body girths. Am J Epidemiol. 1988;128:740–8.
CAS PubMed Google Scholar
Elliott W. Criterion validity of a computer-based tutorial for teaching waist circumference self-measurement. J Bodyw Mov Ther. 2008;12:133–45.
Article PubMed Google Scholar
Roberts C, Wilder L, Jackson R, Moy T, Becker D. Accuracy of self-measurement of waist and hip circumference in men and women. J Am Diet Assoc. 1997;97:534–6.
Article CAS PubMed Google Scholar
Freudenheim J, Darrow S. Accuracy of self-measurement of body fat distribution by waist, hip, and thigh circumferences. Nutr Cancer. 1991;15:179–86.
Article CAS PubMed Google Scholar
Weaver T, Kushi L, McGovern P, Potter J, Rich S, King R, Whitbeck J, Greenstein J, Sellers T. Validation study of self-reported measures of fat distribution. Int J Obesity. 1996;20:544–650.
Google Scholar
McEneaney DF, Lennie SC. Video instructions improve accuracy of self-measures of waist circumference compared with written instructions. Public Health Nutr. 2011;14(7):1192–9.
Article PubMed Google Scholar
Xie Y, Ho S, Liu Z, Hui S-C. Comparisons of measured and self-reported anthropometric variables and blood pressure in a sample of Hong Kong Female Nurses. Plos One. 2014;9:e107233.
Article PubMed PubMed Central Google Scholar
Reidpath D, Chee-Ho Chea J, Lam F-C, Yasin S, Soyiri I, Allotey P. Validity of self-measured waist and hip circumferences: results from a community study in Malaysia. Nutr J. 2013;12:135.
Article PubMed PubMed Central Google Scholar
Han T, Lean M. Self-reported waist circumference compared with the ‘Waist Watcher’ tape-measure to identify individuals at increased health risk through intra-abdominal fat accumulation. Br J Nutr. 1998;80:81–8.
Article CAS PubMed Google Scholar
Cullum A, McCarthy A, Gunnell D, Davey Smith G, Serne J, Ben-Shlomo Y. Dietary restraint and the mis-reporting of anthropometric measures by middle-aged adults. Int J Obes Rel Metab Dis. 2004;28:426–33.
Article CAS Google Scholar
Verweij L, Terwee C, Proper K, Hulshof C, van Mechelen W. Measurement error of waist circumference: gaps in knowledge. Public Health Nutr. 2012;16:281–8.
Article PubMed Google Scholar
Kozak M, Wnuk A. Including the Tukey mean-difference (Bland-Altman) plot in a statistics course. Teach Stat. 2014;36:83–7.
Article Google Scholar
Ross R, Berentzen T, Bradshaw A, Janssen I, Kahn H, Katmarzyk P, Kuk J, Seidell J, Snider M, Sorensen T, et al. Does the relationship between waist circumference, morbidity and mortality depend on measurement protocol for waist circumference? Obes Rev. 2008;9:321–5.
Article Google Scholar
Bigaard J, Spanggaard I, Thomsen BL, Overvad K, Tjonneland A. Self-reported and technician-measured waist circumferences differ in middle-aged men and women. J Nutr. 2005;135(9):2263–70.
CAS PubMed Google Scholar
MESA (Multi-ethnic Study of Atherosclerosos) Website [https://www.mesa-nhlbi.org/aboutMESAOverviewProtocol.aspx]. Accessed 30 Jan 2016
Lohman TG, Roche AF, Martorell R. Anthropometric Standardization Reference Manual. Abriged ed. Champaing: Human Kinetics Books; 1991.
Google Scholar
Ulijaszek S, Kerr D. Anthropometric measurement error and the assessment of nutritional status. Br J Nutr. 1999;82:165–77.
Article CAS PubMed Google Scholar
Goto R, Mascie-Taylor C. Precision of measurement as a component of human variation. J Physiol Anthropol. 2007;26:253–6.
Article PubMed Google Scholar
Perini T, de Oliveira G, Ornellas J, de Oliveira F. Technical error of measurement in anthropometry, English version. Rev Bras Med Esporte. 2005;11:86–90.
Article Google Scholar
Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Method Med Res. 1999;8(2):135–60.
Article CAS Google Scholar
Woodman R. Bland–Altman beyond the basics: Creating confidence with badly behaved data. Clin Experiment Pharmacol Physiol. 2010;37:141–2.
Article CAS Google Scholar
Ludbrook J. Confidence in Altman-bland plots: A critical review of the method of differences. Clin Experiment Pharmacol Physiol. 2010;37:143–9.
Article CAS Google Scholar
Agarwal S, Misra A, Aggarwal P, Bardia A, Goel R, Vikram N, Wasir J, Hussain N, Ramachandran K, Pandey R. Waist circumference measurement by site, posture, respiratory phase, and meal time: implications for methodology. Obesity. 2009;17:1056–61.
Article PubMed Google Scholar
DuBose J, Inaba K, Branco B, Barmparas G, Lam L, Teixeira P, Belzberg H, Demetriades D. Discrepancies between capillary glucose measurements and traditional laboratory assessments in both shock and non-shock states after trauma. J Surg Res. 2012;178:820–6.
Article CAS PubMed Google Scholar
Kos S, van Meerkerk A. vand der Linden J, Stiphout T, Wulkan R: Validation of a new generation POCT glucose device with emphasis on aspects important for glycemic control in the hospital care. Clin Chem Lab Med. 2012;50:1573–80.
Article CAS PubMed Google Scholar
Abdel-Wareth L, Haq A, Turner A, Khan S, Salem A, Mustafa F, Hussein N, Pallinalakam F, Grundy L, Patras G, et al. Total Vitamin D Assay Comparison of the Roche Diagnostics “Vitamin D Total” Electrochemiluminescence Protein Binding Assay with the Chromsystems HPLC Method in a Population with both D2 and D3 forms of Vitamin D. Nutrients. 2013;5:971–80.
Article CAS PubMed PubMed Central Google Scholar
Coqueiro R, Santos M, Neto J, Queiroz B, Bruggar N, Barbosa A. Validity of a portable glucose, total cholesterol, and triglycerides multi-analyzer in adults. Biolog Res Nurs. 2014;16:288–94.
Article Google Scholar
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159–74.
Article CAS PubMed Google Scholar
Nunnally J, Bernstein I. Psychometric testing. New York: McGraw-Hill; 1994.
Google Scholar
Streiner D, Norman G. Health measurement scales—A practical guide to their development and use. New York: Oxford University Press; 1995.
Google Scholar
Cohen J. Statistical power analysis for the behavioral sciences. Hillsdale: Erblaum; 1988.
Google Scholar
Pedhazur S. Measurement, design, and analysis. Hillsdale: Erlbaum; 1991.
Google Scholar
Cicchetti D. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess. 1994;6:284–90.
Article Google Scholar
Yoon J, Radwin R. The accuracy of consumer-made body measurements for women's mail-order clothing. Hum Factors. 1994;36:557–68.
Google Scholar
Hall T, Young T. A validation study of body fat distribution as determined by self-measurement of waist and hip circumference. Int J Obesity. 1989;13:801–7.
CAS Google Scholar
Laberge R, Vaccani J, Gow R, Gaboury I, Hoey L, Katz S. Inter- and intra-rater reliability of neck circumference measurements in children. Pediatr Pulmonol. 2009;44:64–9.
Article PubMed Google Scholar
Misra A, Wasir J, Vikram N. Waist circumference criteria for the diagnosis of abdominal obesity are not applicable uniformly to all populations and ethnic groups. Nutr. 2005;21:969–76.
Article Google Scholar
Yorkin M, Spaccarotella K, Martin-Biggers J, Lozada C, Hongu N, Quick V, Byrd-Bredbenner C. A Tool to Improve Parental Measurements of Preschool Child Height. Adv Public Health. 2015;2015:Article ID 965371.
Article Google Scholar

Download references

Acknowledgments

None.

Funding

This study was made possible by the United States Department of Agriculture, National Institute of Food and Agriculture, Grant Number 2011-68001-30170.

Author information

Authors and Affiliations

Department of Nutritional Sciences, Rutgers University, 26 Nichol Avenue, New Brunswick, NJ, 08901, USA
Pamela Barrios, Jennifer Martin-Biggers, Virginia Quick & Carol Byrd-Bredbenner

Authors

Pamela Barrios
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Martin-Biggers
View author publications
You can also search for this author in PubMed Google Scholar
Virginia Quick
View author publications
You can also search for this author in PubMed Google Scholar
Carol Byrd-Bredbenner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Virginia Quick.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

PB and JMB carried out the research study, collected data and participated in the draft of the manuscript. VQ assisted with data analysis and drafting of the manuscript. CBB conceived of the study and participated in its design, analysis, and coordination along with drafting the manuscript. All authors have read and approved the final manuscript.

Authors’ information

PM and JMB are graduate students, VQ is research associate, CBB is professor and extension specialist.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Barrios, P., Martin-Biggers, J., Quick, V. et al. Reliability and criterion validity of self-measured waist, hip, and neck circumferences. BMC Med Res Methodol 16, 49 (2016). https://doi.org/10.1186/s12874-016-0150-2

Download citation

Received: 05 December 2015
Accepted: 22 April 2016
Published: 04 May 2016
DOI: https://doi.org/10.1186/s12874-016-0150-2

Reliability and criterion validity of self-measured waist, hip, and neck circumferences