Skip to main content

Detection of sedentary time and bouts using consumer-grade wrist-worn devices: a hidden semi-Markov model

Abstract

Background

Wrist-worn data from commercially available devices has potential to characterize sedentary time for research and for clinical and public health applications. We propose a model that utilizes heart rate in addition to step count data to estimate the proportion of time spent being sedentary and the usual length of sedentary bouts.

Methods

We developed and trained two Hidden semi-Markov models, STEPHEN (STEP and Heart ENcoder) and STEPCODE (STEP enCODEr; a steps-only based model) using consumer-grade Fitbit device data from participants under free living conditions, and validated model performance using two external datasets. We used the median absolute percentage error (MDAPE) to measure the accuracy of the proposed models against research-grade activPAL device data as the referent. Bland-Altman plots summarized the individual-level agreement with activPAL.

Results

In OPTIMISE cohort, STEPHEN’s estimates of the proportion of time spent sedentary had significantly (p < 0.001) better accuracy (MDAPE [IQR] = 0.15 [0.06–0.25] vs. 0.23 [0.13–0.53)]) and agreement (Bias Mean [SD]=-0.03[0.11] vs. 0.14 [0.11]) than the proprietary software, estimated the usual sedentary bout duration more accurately (MDAPE[IQR] = 0.11[0.06–0.26] vs. 0.42[0.32–0.48]), and had better agreement (Bias Mean [SD] = 3.91[5.67] minutes vs. -11.93[5.07] minutes). With the ALLO-Active dataset, STEPHEN and STEPCODE did not improve the estimation of proportion of time spent sedentary, but STEPHEN estimated usual sedentary bout duration more accurately than the proprietary software (MDAPE[IQR] = 0.19[0.03–0.25] vs. 0.36[0.15–0.48]) and had smaller bias (Bias Mean[SD] = 0.70[8.89] minutes vs. -11.35[9.17] minutes).

Conclusions

STEPHEN can characterize the proportion of time spent being sedentary and usual sedentary bout length. The methodology is available as an open access R package available from https://github.com/limfuxing/stephen/. The package includes trained models, but users have the flexibility to train their own models.

Peer Review reports

Background

The global sales and use of wearable devices in health care have been increasing and were expected to reach USD 4.5 billion by 2020 [1]. Among these are consumer-grade wrist-worn activity trackers. In addition to providing health-related information to consumers, these devices have also increasingly been used in research settings as both measurement and behavior change tools [2,3,4,5]. Wrist-worn devices can be more comfortable to wear and lead to greater adherence and longer wear within the research setting [6, 7], compared to research-grade thigh-worn or hip-worn devices. Their suitability to long-term wear and their widespread use indicates that they may have the capacity to generate massive amounts of data, and to provide a cost-effective method for capturing individual-specific activity patterns at fine resolutions.

Most wrist-worn wearables have been designed with a primary focus of capturing physical activity, especially stepping. However, there is now increasing interest among researchers around their potential for the monitoring of sedentary time (defined as any waking behavior characterized by an energy expenditure ≤ 1.5 metabolic equivalents (METs), while in a sitting, reclining or lying) and sedentary patterns, given evidence linking these exposures to increased risk of cardiovascular diseases, cancer, and mental ill-health [8,9,10]]. For example, accumulating sedentary time in long uninterrupted bouts has been shown to be associated adversely with glycemic control [11], while interrupting sedentary time lowers postprandial glucose [8]. The proportion of time spent being sedentary is also important because it has been shown to have implications for cardiometabolic health and disease risk, even after adjusting for moderate-to-vigorous-intensity physical activity amount [12].

Validation studies have shown that commercial activity trackers have a reasonable concordance with gold-standard measurements (hand counting) for capturing stepping at faster walking speeds [13], consistent with their primary use. Nevertheless, their measurement of activities of daily living is more problematic, with limited validity for capturing stepping at slower walking speeds [13] and sedentary time [14]. Being wrist-worn, they are also known to have lower sensitivity and specificity for measuring steps when compared to hip-worn and thigh-worn devices [15, 16]. Steps can falsely register during light or sedentary activities involving wrist movement such as folding laundry or playing video games [17]. These spurious detections during activities that do not involve stepping [18, 19] constitute a minority of steps counted and are possibly overlooked as they have limited consequence to some validity statistics for total step counts [20]. However, they are highly relevant to measuring sedentary time, and have a noticeable impact on measuring sedentary bout duration, an issue that has been reported previously [21,22,23]. To our knowledge, no current algorithms satisfactorily address these issues.

What would be highly desirable are algorithms for consumer-grade devices that can accurately classify physical activities into five widely used categories of differing intensity: sedentary, standing, light physical activity (LPA), moderate physical activities (MPA) and vigorous physical activity (VPA). Notably, in this context, the categories themselves cannot be directly observed. Instead, they would be inferred from parameters reflecting the intensity of the physical activity captured by the wearables, such as heart rate, and either raw acceleration (usually not available), or other device outputs like step counts and estimated metabolic equivalent of task (MET) values. The Hidden semi-Markov Model (HsMM) is a class of stochastic models that can be used to infer the unobserved states that generate the observed variables that reflect those states. Variants of the Hidden Markov Model (HMM) have found prominent applications in meteorology where they have been used to infer the underlying weather states given observed variables such as temperature, air pressure and wind speed [24]. In the context of physical activity, these models have been used to infer the underlying physical activity types (e.g., walking, sitting, and standing) using data from research-grade triaxial accelerometers [25, 26].

Here we describe the development of STEPHEN (STEP and Heart Encoder), a Hidden semi-Markov model to infer the underlying physical activity states using routine output from wrist-worn devices, specifically step counts and heart rate data. We propose to use heart rate data in conjunction with step counts for two reasons. We anticipate that the inclusion of heart rate data would improve detection of sedentary time. Firstly, because physical activities typically increase heart rate, an increase in step counts without significant increase in heart rate may represent misclassified sedentary time involving wrist movements. Secondly, heart rate data may improve the capacity of wrist-worn devices to differentiate sitting from standing events because it increases immediately when standing from sitting/lying position [27].

While STEPHEN infers multiple activity categories, herein we focus on its validity to distinguish sedentary from non-sedentary time and estimating both the proportion of time spent being sedentary and usual sedentary bout duration. These important measures are expected to be poorly captured by current commercial wearables, thus have the potential to be improved. Our focus on sedentary time is also partly driven by the existing data available to us from two studies (OPTIMISE and ALLO-Active), in which participants wore both a common commercial wrist-worn activity tracker (the Fitbit) concurrently with the thigh-worn activPAL, which is highly suitable as a ground truth for sedentary time, but not necessarily for separating between the other physical activity states.

Here, we report our work in developing and evaluating STEPHEN and STEPCODE (STEP encoder) a steps-only based model. We evaluated the accuracy of the models when used to estimate the two key sedentary time metrics (time; usual bout duration) in two external cohorts with different living conditions and using data from different Fitbit devices. The accuracy of the proposed models is compared to proprietary Fitbit software using activPAL as the reference assessment method.

Methods

Participants

This study includes participants from two distinct sources: 44 intervention group participants from the OPTIMISE trial [28] and 10 participants from the ALLO-Active trial [29], all of whom wore Fitbit and activPAL monitors (the referent device) concurrently. The OPTIMISE trial focused on reducing sedentary behavior in adults with type 2 diabetes who work in desk-based environments, through provision of a Fitbit, a height-adjustable sit-stand desktop workstation, and behavioral health coaching [28]. Contexts like this, whereby participants are performing activities such as typing that would traditionally be sedentary behaviors, in an upright position (as light activity), are known to pose particular challenges for classifying sedentary time via wrist-worn devices [30]. The ALLO-Active trial involved a very low-active context, where participants spent significant time hospitalised, as the participants were adults with hematologic malignancies who received allogeneic stem cell transplantation. Here the intervention-group participants received a multifaceted activity program designed to increase purposeful aerobic and resistance exercise and reduce sedentary time through replacement with light exercise [29]. The ALLO-Active intervention targeted less extensive behaviour changes by comparison with OPTIMISE.

Human ethics approval for OPTIMISE were obtained from Alfred Health Human Ethics Committee (Melbourne, Australia), The University of Queensland Institutional Human Research Ethics Committee (Brisbane, Australia), and the University of the Sunshine Coast Human Research Ethics Committee (Sunshine Coast, Australia). The OPTIMISE Your Health trial has been registered with the Australian New Zealand Clinical Trials Registry (ANZCTRN12618001159246; date of registration: 07/03/2018). Ethics approval for the ALLO-Active trial was provided by the Alfred Hospital Human Research Ethics Committee, and the trial was registered with the Australian New Zealand Clinical Trials Registry (registration number: ACTRN12619000741189). All participants provided written informed consent.

Data collection

Both studies used similar protocols for collecting activPAL (PAL technologies, Glasgow, Scotland) data: activPAL4; 24-hour wear protocol; concurrent sleep and wear diary; and devices were attached to the thigh, two-thirds of the way up, along the midline. Devices were set to record data at the standard 30 Hz and were initialised and downloaded with the proprietary software, default settings, default (VANE) algorithm, and exported as events files. OPTIMISE participants wore the activPAL at 0, 3 and 6 months, for 10 days. ALLO-Active trial participants wore the thigh-mounted activPAL at 0, ≈ 1, and ≈ 4 months, for 7 days. ActivPAL measurements from all timepoints were used for validation purposes in this study as long as Fitbit was concurrently worn. OPTIMISE intervention participants received a Fitbit Inspire HR device (Fitbit Inc., San Francisco, CA, USA) while all ALLO-Active trial participants (intervention and usual care) were provided with either a Fitbit Ionic or a Fitbit Versa 2 device (Fitbit, Inc., San Francisco, CA, USA). Both studies encouraged participants to wear the Fitbit continuously throughout the trial period.

Data processing

Data from both studies were processed identically, limiting the data to when both devices were worn, and the participant was awake (see Details below).

The activPAL events files (VANE) provide a precise record of the start time and duration of each event -each bout of sitting / lying (sedentary) or standing, and each stride (two steps) along with the estimated MET-duration of the event. The default method [31] assigns MET-values of 1.25 to sitting and 1.4 for standing, and estimates METs for stepping as a linear function of cadence (steps/min). The activPAL has strong validity relative to direct observation for classifying sedentary time and transitioning between sedentary and non-sedentary states [32]. This device provides the ground truth for this study. Using SAS 9.4 (SAS Institute, Inc., Cary, NC, USA), based on usual practices in the field [33], time in bed (for sleeping purposes) and non-wear were removed (estimated from a combination of self-report and device movement), and data limited to valid days (≥ 10 h wear while awake with evidence of movement: ≥500 steps/day and < 95% of the day in any one activity) [28]. Automated estimation [34] coupled with visual checking of the automated classification against acceleration using actigrams was used to infer missing values for time in bed. Data were also summarized as amounts of each activity per 1-minute (to permit 1:1 matching of these mixed activity minutes with the Fitbit’s single-activity 1-minute epoch data).

Fitbit data was downloaded using the Fitabase software (Fitabase Small Steps Labs, LLC, San Diego, CA). The Fitbit-generated step counts, heart rate and physical activity intensity were extracted at 1-minute resolution (the smallest resolution available for steps and intensity). Non-wear time (i.e., minutes where the associated heart rate was zero or absent) was excluded along with data between midnight and 5AM, where the person may be wearing the Fitbit and awake but their physical activity patterns may not be representative of the rest of the days. No minimum daily wearing rates criteria was applied to the Fitbit data and no further inspection of the heart rate quality signals was performed.

For validation purpose, the Fitbit data was matched to the activPAL minute-level data and only minutes during which both devices were worn were retained. The activPAL event files were also mapped to the periods during which the participant was wearing the Fitbit and only events where Fitbit was worn 100% of the time were retained. Timestamp matching was performed using in-house R codes that utilize lubridate R package [35]. Since activPAL timestamp does not take into account daylight saving shift while Fitbit’s does, we manually shifted back the activPAL timestamp by one hour during time periods where daylight saving was observed. Sedentary behavior metrics were calculated based on these matched data.

Measures of sedentary time

Because of the differences in the data resolution (1-minute epoch for Fitbit data and continuous time up to the nearest 0.1s for activPAL), the classification of sedentary time is slightly different for the two data types.

For methods that use Fitbit data (Fitbit algorithm, STEPHEN and STEPCODE), the classification took place at each minute. In the case of Fitbit’s proprietary algorithm, Fitabase, a cloud-based Fitbit data management system, was used to extract the predicted physical activity intensity output and its lowest predicted intensity state was designated as sedentary. For STEPHEN and STEPCODE, we obtained the predicted physical activity state for each minute (see Model Development below). For activPAL, the classification rule is simpler whereby all of the time intervals associated with ‘sitting’ events are classified as sedentary.

Proportion of time spent being sedentary

For the Fitbit algorithm, STEPHEN and STEPCODE, this was calculated for each participant as the total number of sedentary minutes divided by the total number of minutes when the participant was awake and wearing both devices. For activPAL, this was calculated for each participant as the total time for sedentary (‘sitting’) events divided by the total time for all events.

Usual bout duration of sedentary bouts

Sedentary bouts were identified as recorded to the nearest 0.1s duration from the activPAL events files, and as consecutive sedentary minutes for the Fitbit data.

Usual bout duration, defined as the bout duration above and below which half of all sedentary time is accrued [36], was calculated for each participant using the methods described in [36] using all of their sedentary bouts across the entire study duration.

Fig. 1
figure 1

Schematic illustration of the Step and Heart Rate Autoencoder (STEPHEN) Hidden semi-Markov Model. The observed data are step counts and heart rate and individuals can move between the four hidden states representing sedentary behavior, LPA & Standing, MPA and VPA. The time spent in a particular hidden state before moving to another state (sojourn time) is assumed to follow Gamma distribution whose parameters are estimated empirically from the data

Statistical methods and analyses

Model development

We used Fitbit data from 11 OPTIMISE participants to develop 2 models: [1] Hidden semi-Markov Model using step counts and heart rate data (STEPHEN), and [2] Hidden semi-Markov Model using step counts data only (STEPCODE). While the focus of this paper is on validating STEPHEN, STEPCODE was developed to demonstrate that the improved performance of STEPHEN over proprietary algorithm is due to the combination of using heart data in addition to the Hidden semi-Markov model rather than merely due to the Hidden semi-Markov model alone. In both STEPHEN and STEPCODE models, we assume there are K = 4 unobserved physical activity states that manifest as the observed step counts and heart rate data. Without losing generality, we can think of these states as representing sedentary behavior, standing and LPA, MPA and VPA (see Fig. 1). The decision to lump standing with LPA should not affect the validity of the current models for characterising sedentary behaviour because both standings and LPA will be classified as non-sedentary anyway.

Under each state, we assume that the step count and heart rate data are distributed as Negative Binomial (NB) random variables with state-specific parameters. An individual is allowed to move between states, e.g., from the sedentary state to light physical activity with the amount of time (minutes) spent consecutively under a particular state (sojourn) following a Gamma distribution. Compared to the Exponential sojourn distribution for the standard Hidden Markov model, the Gamma sojourn distribution capture persistent sojourn (bouts) under a particular state after the initial transition [37]. This property enables the models to capture prolonged standing and LPA with stable heart rate after the initial heart increase when transitioning from sedentary behavior. The models were estimated separately for each participant, resulting in 11 × 2 = 22 estimated models (11 STEPHEN models and 11 STEPCODE models). We used the mhsmm R package [38] to estimate the model parameters and coded a user-written function to calculate the conditionally independent, bivariate Negative Binomial probability. A more detailed mathematical description of the three models is provided in the Supplementary Methods.

For both STEPHEN and STEPCODE to allow for potential differences in baseline resting heart rate between the individual in the development and validation cohorts, we adjusted the mean heart rate parameters under each state by subtracting a bias term defined as the difference in average heart rate with zero step counts between the individual whose physical activity states we tried to predict and the individual whose data was used to develop the model. We then obtained the predicted physical activity state under each of eleven models (using the Viterbi algorithm). At this stage, the predicted state for each minute is an integer ranging from 1 to 4 and unlabeled. We then labelled these predicted states using the following rules: the predicted states associated with Hidden semi-Markov states with the lowest mean step count and heart rate parameters were labelled as ‘Sedentary’, the remaining predicted states associated with increasing step counts and heart rate parameters were correspondingly labelled as standing/LPA, MPA and VPA. These labelled predicted states were then ensembled at each timepoint using the ‘majority vote’ rule to produce the final, labelled predicted physical activity state.

Model validation

An extensive internal validation was performed together with two external validations. Instance-level agreement statistics were not calculated due to the different device resolutions (1 min epoch versus continuous to nearest 0.1s). In lieu of a typical confusion matrix, the amount of time the activPAL recorded as spent sitting, standing and stepping were summarized separately for the minutes that the Fitbit and test methods predicted to be sedentary or non-sedentary. The sedentary time measures (proportion of time spent sedentary and usual bout duration) were compared against the activPAL (the ground truth) via two indicators. The main indicator used was absolute percentage errors (APE), which was summarized per individual and reported as median (MDAPE) with interquartile range (IQR). Mean bias (SD) were visualized via Bland-Altman plots [39] and equivalence between each method and activPAL (Δequivalence = ± 0.05 for proportion and ± 5 min for usual sedentary bout length) was tested using one sample t-test for equivalence hypothesis testing in R package TOSTER [40] and proportional bias in the Bland-Altman plots were tested using simple linear regression.

Internal validation

The internal validation was conducted within the 11 OPTIMISE participants used to develop the model.

External validation 1

The first external validation was performed in 33 OPTIMISE participants who were not used for model development, to indicate how well the models are likely to perform in participants with similar demographic characteristics to the participants used to develop the models. The MDAPE for each method was compared via Wilcoxon signed ranks test to determine whether the proposed methods outperformed the proprietary Fitbit method. To investigate whether gender, body mass index (BMI), age and job classification were associated with accuracy, we conducted multiple linear regression with APE as dependent variables.

External validation 2

To assess the performance of the models when used to predict sedentary time in participants wearing different brands of Fitbit and with very different physical activity patterns, we tested the models in 10 participants (5 participants wearing Fitbit Ionic and 5 participants wearing Fitbit Versa 2) from the ALLO-Active Trial [29].

All statistical analyses were performed using R 4.2.2.

Results

Participant characteristics

The mean (SD) for age and BMI were 52.8 (5.2) years and 35.6 (5.4) kg/m2 in the OPTIMISE internal validation sample (n = 11) and 52.2 (7.3) years and 33.7 (5.0) kg/m2, respectively in the OPTIMISE external validation sample (n = 33). ALLO-Active participants were slightly older (57.4 (10.8) years) and leaner (27.2 (5.7) kg/m2). Two OPTIMISE internal validation participants (18.2%) and five OPTIMISE external validation participants (15.2%) were on medication that affects heart rates, e.g., beta blockers while none of the ALLO-Active participants were on these medications. Two OPTIMISE external validation participants (6.1%) also have experienced major cardiovascular events.

Among the OPTIMISE participants, the mean (SD) of Fitbit wear time per day was 15.3 (3.0) hours, resulting in the mean (SD) for total Fitbit wear time of 8806.0 (4979.3) hours over the study period, of which 334.3 (174.9) hours was when activPAL concurrently worn. Among the ALLO-Active participants, the mean (SD) of Fitbit wear time per day was 15.0(3.1) hours, resulting in the mean (SD) for total Fitbit wear time of 8508.0 (6504.1) hours over the study period, of which 166.2 (50.1) hours was when activPAL was concurrently worn. Data from the activPAL (referent) during the concurrent wear time indicate the OPTIMSE participants were more active and less sedentary than the ALLO-Active participants, with mean proportion of sedentary time being 0.56(0.12) vs. 0.67(0.11) and mean usual bout duration of 27.3(4.0) vs. 33.3(10.2) minutes.

Internal validation

STEPHEN, STEPCODE and Fitbit’s proprietary algorithm all showed some degree of agreement with the data from the activPAL (Table 1), with MDAPE (IQR) ranging from 0.14(0.05–0.18) to 0.20(0.10–0.28) for proportion of time spent being sedentary and 0.16(0.03–0.26) to 0.33(0.28–0.39) for usual sedentary bout length (duration). The mean (SD) bias for proportion of time spent being sedentary ranged from − 0.06 (0.11) to 0.13 (0.07) as a proportion, which is equivalent to -57.6 (105.6) to 124.8(67.2) minutes per 16 h awake. The Bland-Altman plots (Supp. Figure 1A-C) show that all methods overestimate the proportion of time being sedentary. In terms of the usual sedentary bout length (duration), STEPHEN had the least bias relative to activPAL(Mean Bias(SD) = 4.40(4.18),Supp. Figure 1E) when compared to Fitbit’s proprietary algorithm (Mean Bias (SD) = -8.68(3.24), Supp. Figure 1D) and STEPCODE (Mean Bias (SD) = -5.14(3.62), Supp. Figure 1F). The 95% limits of agreement were wide for all methods.

Table 1 Mean (SD), median (IQR) Absolute percentage errors and Mean (SD) Bias of Sedentary Behavior Metrics against activPAL

The compositions generated from activPAL’s sitting, standing and stepping distributed among the minutes predicted as sedentary and non-sedentary are presented in Table 2. Predicted sedentary time minutes on average consisted of approximately 46s (76.7%) sitting, 13s (21.7%) standing and 1s (1.7%) stepping for all methods. The composition of minutes predicted to be non-sedentary were more variable between the methods (Table 2). Predicted non-sedentary minutes still contained relatively high amounts of sitting, ranging from 11.3s (18.8%) for STEPHEN (which performed best) to 14.0s (23.3%) for STEPCODE. Put simply, most predicted sedentary time (76.7%) was spent sitting, with most of the misclassified activity being standing, while 76.7% (STEPCODE) to 81.2% (STEPHEN) of the time predicted to be not sitting being spent either standing or stepping.

Table 2 Mean (SE) of sitting, standing and stepping seconds for Sedentary and Non-sedentary minutes

STEPHEN’s non-sedentary minutes contained a higher proportion of either standing or stepping while STEPCODE’s remain very similar to the proprietary algorithm. This suggests that the use of heart rate data improves predictive performance. To better understand how the use of heart rate data results in a better classification of non-sedentary minutes, we examined data from minutes where reclassification from sedentary under Fitbit classification to non-sedentary under STEPHEN occur. More specifically, we focused on a ± 5 min time window around the minutes where the reclassifications occur. The results suggest that except for participant ID 1050, these are the minutes where there is a consistent sudden jump in step counts. It is evident that this sudden jump is consistent because the width of the interval is relatively narrow and either does not include zero or occurs just above zero, (Supp. Figure 2). These consistent changes in the step counts are not accompanied by consistent changes in heart rate (Supp. Figure 3). We interpret this as evidence that the sudden apparent increase in steps is very likely due to wrist movements that are not associated with physical activities, and STEPHEN managed to successfully reclassify Fitbit anomalous non-sedentary minutes as sedentary.

Fig. 2
figure 2

External Validation Using N = 33 OPTIMISE Participants. (A)-(C) Bland-Altman Plots of Proportion of Time Spent Being Sedentary according to: (A) Fitbit, (B) STEPHEN, (C) STEPCODE. (D)-(F) Bland-Altman Plots of Usual Sedentary Bout Length (Duration) according to: (D) Fitbit, (E) STEPHEN, (F) STEPCODE. In all plots the center line indicates the mean bias and the top and bottom lines indicate the limits of agreements (mean ± 1.96 SD). Statistically significant proportional bias was shown with 95% confidence intervals of the slope estimate

Fig. 3
figure 3

External Validation Using N = 10 ALLO-Active Participants. (A)-(C) Bland-Altman Plots of Proportion of Time Spent Being Sedentary according to: (A) Fitbit, (B) STEPHEN, (C) STEPCODE. (D)-(F) Bland-Altman Plots of Usual Sedentary Bout Length (Duration) according to: (D) Fitbit, (E) STEPHEN, (F) STECODE. In all plots the center line indicates the mean bias and the top and bottom lines indicate the limits of agreements (mean ± 1.96 SD). Statistically significant proportional bias was shown with 95% confidence intervals of the slope estimate

External validation 1

In the OPTIMISE external validation sample, the accuracy for proportion of time spent being sedentary and usual bout duration was the best for STEPHEN. The MDAPE (IQR) for proportion of time being sedentary was lowest for STEPHEN, at 0.15(0.06–0.25), followed by 0.20(0.12–0.49) for STEPCODE and 0.23(0.13–0.53) for Fitbit (Table 1). Both STEPHEN and STEPCODE differed significantly in MDAPE relative to the Fitbit (both p < 0.001). The same was seen for usual sedentary bout duration, where the MDAPE (IQR) was 0.11(0.06–0.26) for STEPHEN, 0.32(0.19–0.39) for STEPCODE and 0.42(0.32–0.48) for Fitbit and both methods differed significantly in MDAPE relative to the Fitbit (both p < 0.001).

STEPHEN also resulted in the smallest mean bias for both proportion of time spent being sedentary (Bias Mean(SD) = -0.03 (0.11), equivalent to 28.8 min/16 hours awake) and usual sedentary bout duration (Bias Mean(SD) = 3.91(5.76) minutes, Table 1). Among the three methods (Fitbit, STEPHEN and STEPCODE), only STEPHEN demonstrated equivalence with activPAL (p-values = 0.817 and 0.857 for proportion of time spent being sedentary and usual bout duration, Table 1). In contrast, both Fitbit and STEPCODE have the equivalence hypothesis rejected for both metrics (p-values < 0.001, Table 1).

The Bland-Altman plots revealed that the 95% limits of agreement were rather wide for all methods (Fig. 2) and there were statistically significant proportional biases for the estimate of proportion of sedentary time under Fitbit’s algorithm (p-value < 0.001, Fig. 2A) and STEPCODE (p-value < 0.001, Fig. 2C) but not for STEPHEN (p-value = 0.098, Fig. 2B). The Bland-Altman plots for usual sedentary bout did not exhibit proportional bias (all p-values > 0.10, Fig. 2D-F).

Among minutes predicted as sedentary time, the compositions of activPAL’s sitting, standing and stepping (Table 2) were approximately 45s (75.0%), 14s (23.3%) and 1s (1.7%) for Fitbit’s algorithm and STEPCODE, while STEPHEN’s sedentary minutes contains more sitting, at approximately 49s (81.3%) and less standing, at approximately 10s (16.7%). Among minutes predicted as non-sedentary, the compositions for Fitbit’s algorithms and STEPCODE were approximately 15s (25.0%) sitting, 25s (41.7%) standing and 20s (33.3%) stepping. STEPHEN’s non-sedentary minutes contains slightly less sitting at approximately 14s (23.3%) and more standing (25s, 41.7%).

The accuracy of STEPHEN, as well as STEPCODE and Fitbit’s proprietary algorithm appeared to be quite robust across males and females (Table 3), but showed dependence on age for proportion of time spent being sedentary and on BMI for usual sedentary bouts (Table 3). Although in both cases the effect sizes are rather small. On the other hand, the APE appears to be much smaller (accuracy is much better) for those in Managers/Administrators job category (Table 3), although these are not statistically significant.

Table 3 Regression coefficients and 95% confidence intervals for Investigating Association between Demographic Factors and accuracy as measured using Absolute percentage error (APE)

Similar to the observed proportional bias in Fitbit’s and STEPCODE’s estimate of proportion of time spent being sedentary, the APE was also significantly associated with the amount recorded by the activPAL (regression coefficient = -2.03, 95% CI = [-2.43 ; -1.63], p-value < 0.001 for Fitbit, -1.97, 95% CI = [-2.33 ; -1.61], p-value < 0.001 for STEPCODE and − 0.33 [-0.77;0.12] for STEPHEN).

External validation 2

The second external validation in the more sedentary, less active population (ALLO-Active) showed slightly different results to those seen in the OPTIMISE external validation (Table 1). As in the OPTIMISE external validation, the STEPHEN model had the highest accuracy for estimating usual sedentary bout duration with MDAPE(IQR) = 0.19(0.03–0.25), followed by STEPCODE with MDAPE(IQR) = 0.30(0.13–0.37) and Fitbit’s algorithm with MDAPE(IQR) = 0.36(0.15–0.48). However, the accuracy for proportion of time being sedentary by STEPHEN was slightly worse than Fitbit’s (MDAPE(IQR) = 0.13(0.08–0.22) vs. 0.09(0.05–0.19)). On the other hand, STEPCODE had similar accuracy to Fitbit for this metric with MDAPE(IQR) = 0.09(0.06–0.20). At the instance level, all methods have predicted sedentary minutes consisting of approximately 53s (88.3%) sitting, 6s (10.0%) standing and 1s (1.7%) stepping. For non-sedentary minutes, STEPHEN had approximately 15s (25.0%) sitting, 26s (43.4%) standing and 19s (31.7%) stepping, while Fitbit and STEPCODE are quite similar with approximately 16s (26.7%) sitting, 26s (43.4%) standing and 18s (30.0%) stepping.

Consistent with the MDAPE results, the mean biases (Table 1) showed slightly less overestimation of sedentary time proportion for both Fitbit and STEPCODE than for STEPHEN (Mean (SD) Bias = 0.10(0.06)), and a much lesser mean (SD) bias in usual bout duration for STEPHEN with 0.70(8.89) minutes compared with STEPCODE with − 7.10(9.55) minutes and Fitbit with − 11.35(9.17) minutes. All methods demonstrated equivalence with activPAL for both metrics (all p-values > 0.05, Table 1).

Interestingly, proportional biases were not as apparent in the ALLO-Active population as they had been in the OPTIMISE validation sample (all p-values > 0.10, Fig. 3).

Discussion

We have developed STEPHEN, a Hidden semi-Markov Model for classifying activity from the routine output (minute-level steps and heart rate) of commercially available wrist-worn wearable devices and tested its capacity to detect sedentary time. STEPHEN provided strong sedentary time classification both at a population-level and at an instance-level within an individual. Collectively, the classification improvements were seen in the internal validation, an external validation using OPTIMISE participants, and to a slightly lesser extent, in a very different external population (who by comparison with participants used to train the model were more sedentary, less active, and did not have access to sit-stand workstations). STEPHEN improved sedentary time classification through effective use of heart rate data for removing segments associated with wrist-only movements that result in high step counts but are unaccompanied by proportional change in heart rate. STEPHEN’s sedentary minutes contains a higher proportion of sitting time (as reflected by activPAL), while its non-sedentary minutes contains less sitting and higher proportion of standing or stepping. This finding demonstrates that overall, the use of heart rate data results in better detection of postural change. However, we acknowledge that STEPHEN’s classification is still far from perfect. In particular, there are still sizeable sitting events among minutes classified as non-sedentary. This misclassification is directly related to the fact that increases in heart rate without postural change could occur due to events such as stress, anger and underlying health conditions such as tachycardia. Indeed, this highlights the challenges of accurate detection of postural change ©using wrist-worn devices. Recently, Straczkiewcz et al. developed a novel method that can be used to improve detection of postural change using data from wrist-worn devices [30]. However, their approach relies on raw accelerometer data unavailable to consumers and researchers without the proprietary Application Programming Interface (API), hence potentially limiting its utility.

For over a decade it has been demonstrated that for research-grade devices such as the Actiheart that collect heartrate in a cumbersome way on the chest, energy expenditure can be captured more accurately via combined heart rate and accelerometry than by accelerometery alone or heart rate alone [41]. Our experience with STEPHEN further supports the importance of measuring heart rate data with wearable devices for its utility in distinguishing lower energy expenditure activities such as sedentary behaviors in addition to measuring physical activ–ty - even when using only simple measures taken at the wrist on commercial wearables. The benefit of using heart rate may have particular relevance in settings that involve using sit-stand workstations to perform activities with the same wrist movements sometimes in a seated position (sedentary), and sometimes in an upright position (non-sedentary). Sit-stand workstations are an important context for research applications, as they are a highly effective means of reducing sedentary time especially when part of multicomponent interventions [42].

We developed STEPHEN using data from a particular type of Fitbit device (Fitbit Inspire HR) but the model underlying STEPHEN only requires step counts and heart rate data. Thus, in theory STEPHEN can be trained using data from other wrist-worn devices that produce step counts and heart rate data as part of their output. Its performance, however, would depend on the reliability of the heart rate data. There are several factors that have been shown to have effects on heart rate accuracy, and/or step counts, and other data generated from acceleration, such as wrist circumference, wrist placement, tightness of device and dominant vs. non-dominant hand use [43]. A recent meta-analysis also showed that devices with optical heart rate sensors are generally more accurate for estimating energy expenditure [44]. In the future, we plan to add more features to our models, so that information on these factors, when available, can be used to improve the sedentary time estimates. Incorporating heart-rate variability will be viable in the not too distance future when collection of higher-resolution heart rate data becomes the norm and may also help with classification. Access to raw acceleration data – which are usually not readily available to consumers from commercial wearables without access to proprietary API and possibly collaboration with the relevant industry partner – would also open further possibilities for improved classification based upon acceleration (with or without heart rate). The highly detailed raw data, which typically are available from research-grade wearables (that often lack heart rate data), offers possibilities of capturing a wider range of inputs, including position [45], and the application of machine learning methods to improve classification of sedentary time as well as physical activity [46].

Conclusions

Overall, the findings demonstrate that STEPHEN provides a better, viable alternative to the proprietary algorithm for estimating sedentary times using wrist-worn devices, especially when prior data are available to train the model for specific living conditions. Importantly, the accuracy of STEPHEN was reasonably stable across individuals and characteristics and, in contrast to Fitbit’s proprietary algorithm, does not exhibit proportionality bias. The methods are also timely given that methods for estimating sedentary time using wrist-worn devices lag behind similar methods for hip or thigh-worn devices [47, 48]. We implemented our methods via an open access R package (https://github.com/limfuxing/STEPHEN/) and users can opt to use the built-in trained models or train their models from scratch to suit specific living conditions. Retraining STEPHEN for specific populations is good practice and recommended. Nonetheless, when this cannot be performed for practical reasons, the external validation results support STEPHEN as being preferable to the proprietary output from the Fitbit for measuring sedentary time accumulation and is likely also to be superior for estimating proportion of time spent being sedentary, particularly for intervention-trial contexts.

STEPHEN clearly improves upon the proprietary algorithm for sedentary time and provides initial indications it could be preferred over the proprietary algorithm when data from higher quality devices are unavailable. However, the findings here should not be taken as indicating this approach for consumer wearables is preferable to estimating sedentary time using devices that can produce high quality posture-driven sedentary measures such as activPAL. We have not compared the performance of STEPHEN to research grade wrist-worn devices such as ActiGraph. But it is worth noting that these research grade wrist-worn devices, due to wrist placement are also sensitive to upper extremity movement [45] and hence could also be affected by anomalous step counts due to activities that involve wrist-only movement.

The key strengths of the study were the inclusion of two markedly different study populations over a reasonably long period of observation. Limitations of using the study results (and the algorithm) are firstly that the user will need to apply a further method for separating out time in bed or sleep from sedentary time. STEPHEN (like Fitbit’s proprietary algorithm) does not separate out time in bed for sleep (or sleep) from sedentary time, however, the validation process was limited to data during waking time when both devices were worn. Secondly, while STEPHEN has the potential to estimate physical activities, we have only validated its performance for estimating sedentary behavior. Future extension of STEPHEN for estimating physical activities would need to consider modelling the LPA and standing states separately.

Data availability

No datasets were generated or analysed during the current study.

Abbreviations

APE:

Absolute percentage errors

API:

Application programming interface

BMI:

Body mass index

CI:

Confidence interval

HMM:

Hidden Markov Model

HsMM:

Hidden semi Markov Model

HR:

Heart rate

IQR:

Inter quartile range

LPA:

Light physical activity

MAD:

Mean absolute deviation

MDAPE:

Median absolute percentage error

MET:

Metabolic equivalent of task

MPA:

Moderate physical activity

NB:

Negative Binomial

SD:

Standard deviation

STEPCODE:

STEP enCODEr

STEPHEN:

STEP and Heart ENcoder

VPA:

Vigorous physical activity

References

  1. Dias D, Paulo Silva Cunha J. Wearable Health devices-vital sign monitoring, systems and technologies. Sens (Basel). 2018;18(8). https://doi.org/10.3390/s18082414.

  2. Balbim GM, Marques IG, Marquez DX, Patel D, Sharp LK, Kitsiou S, Nyenhuis SM. Using Fitbit as an mHealth intervention Tool to promote physical activity: potential challenges and solutions. JMIR Mhealth Uhealth. 2021;9(3):e25289. https://doi.org/10.2196/25289.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Mair JL, Hayes LD, Campbell AK, Sculthorpe N. J Meas Phys Behav. 2022;5(1):3–7. https://doi.org/10.1123/jmpb.2021-0012. Should We Use Activity Tracker Data From Smartphones and Wearables to Understand Population Physical Activity Patterns?.

  4. Boudreaux BD, Schenck JA, Chu Z, Schmidt MD. Effectiveness of Fitbit Activity prompts in reducing sitting time and increasing physical activity in University employees: a Randomized Controlled Trial. J Meas Phys Behav. 2022;5(3):178–84. https://doi.org/10.1123/jmpb.2021-0060.

    Article  Google Scholar 

  5. Giurgiu M, Kolb S, Nigg C, Burchartz A, Timm I, Becker M, et al. Assessment of 24-hour physical behaviour in children and adolescents via wearables: a systematic review of free-living validation studies. BMJ Open Sport Exerc Med. 2022;8(2):e001267. https://doi.org/10.1136/bmjsem-2021-001267.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Kamada M, Shiroma EJ, Harris TB, Lee IM. Comparison of physical activity assessed using hip- and wrist-worn accelerometers. Gait Posture. 2016;44:23–8. https://doi.org/10.1016/j.gaitpost.2015.11.005.

    Article  PubMed  Google Scholar 

  7. Kerr J, Marinac CR, Ellis K, Godbole S, Hipp A, Glanz K, et al. Comparison of Accelerometry methods for estimating physical activity. Med Sci Sports Exerc. 2017;49(3):617–24. https://doi.org/10.1249/mss.0000000000001124.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Dunstan DW, Dogra S, Carter SE, Owen N. Sit less and move more for cardiovascular health: emerging insights and opportunities. Nat Rev Cardiol. 2021;18(9):637–48. https://doi.org/10.1038/s41569-021-00547-y.

    Article  PubMed  Google Scholar 

  9. Mahmood S, English DR, MacInnis RJ, Karahalios A, Owen N, Milne RL, et al. Domain-specific physical activity and the risk of colorectal cancer: results from the Melbourne Collaborative Cohort Study. BMC Cancer. 2018;18(1):1063. https://doi.org/10.1186/s12885-018-4961-x.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Hallgren M, Dunstan DW, Owen N. Passive Versus mentally active sedentary behaviors and Depression. Exerc Sport Sci Rev. 2020;48(1):20–7. https://doi.org/10.1249/jes.0000000000000211.

    Article  PubMed  Google Scholar 

  11. Smith JAB, Savikj M, Sethi P, Platt S, Gabriel BM, Hawley JA, et al. Three weeks of interrupting sitting lowers fasting glucose and glycemic variability, but not glucose tolerance, in free-living women and men with obesity. Am J Physiol Endocrinol Metab. 2021;321(2):E203–16. https://doi.org/10.1152/ajpendo.00599.2020.

    Article  CAS  PubMed  Google Scholar 

  12. Brocklebank LA, Falconer CL, Page AS, Perry R, Cooper AR. Accelerometer-measured sedentary time and cardiometabolic biomarkers: a systematic review. Prev Med. 2015;76:92–102. https://doi.org/10.1016/j.ypmed.2015.04.013.

    Article  PubMed  Google Scholar 

  13. Fokkema T, Kooiman TJ, Krijnen WP, Van Der Schans CP, Groot MDE. Reliability and validity of ten consumer activity trackers depend on walking speed. Med Sci Sports Exerc. 2017;49(4):793–800. https://doi.org/10.1249/mss.0000000000001146.

    Article  PubMed  Google Scholar 

  14. Mikkelsen MK, Berg-Beckhoff G, Frederiksen P, Horgan G, O’Driscoll R, Palmeira AL, et al. Estimating physical activity and sedentary behaviour in a free-living environment: a comparative study between Fitbit Charge 2 and actigraph GT3X. PLoS ONE. 2020;15(6):e0234426. https://doi.org/10.1371/journal.pone.0234426.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Montoye AHK, Pivarnik JM, Mudd LM, Biswas S, Pfeiffer KA. Validation and comparison of Accelerometers worn on the hip, thigh, and wrists for measuring physical activity and sedentary behavior. AIMS Public Health. 2016;3(2):298–312. https://doi.org/10.3934/publichealth.2016.2.298.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Bellettiere J, Nakandala S, Tuz-Zahra F, Winkler EAH, Hibbing PR, Healy GN, et al. J Meas Phys Behav. 2022;5(4):215–23. https://doi.org/10.1123/jmpb.2021-0062. CHAP-Adult: A Reliable and Valid Algorithm to Classify Sitting and Measure Sitting Patterns Using Data From Hip-Worn Accelerometers in Adults Aged 35+.

  17. Redenius N, Kim Y, Byun W. Concurrent validity of the Fitbit for assessing sedentary behavior and moderate-to-vigorous physical activity. BMC Med Res Methodol. 2019;19(1):29. https://doi.org/10.1186/s12874-019-0668-1.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Nakagata T, Murakami H, Kawakami R, Tripette J, Nakae S, Yamada Y, et al. Step-count outcomes of 13 different activity trackers: results from laboratory and free-living experiments. Gait Posture. 2022;98:24–33. https://doi.org/10.1016/j.gaitpost.2022.08.004.

    Article  PubMed  Google Scholar 

  19. O’Connell S, ÓLaighin G, Quinlan LR. When a step is not a step! Specificity analysis of five physical activity monitors. PLoS ONE. 2017;12(1):e0169616. https://doi.org/10.1371/journal.pone.0169616.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Paul SS, Tiedemann A, Hassett LM, Ramsay E, Kirkham C, Chagpar S, Sherrington C. Validity of the fitbit activity tracker for measuring steps in community-dwelling older adults. BMJ Open Sport Exerc Med. 2015;1(1):e000013. https://doi.org/10.1136/bmjsem-2015-000013.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Fuller D, Colwell E, Low J, Orychock K, Tobin MA, Simango B, et al. Reliability and validity of commercially available Wearable devices for Measuring steps, Energy expenditure, and Heart Rate: systematic review. JMIR Mhealth Uhealth. 2020;8(9):e18694. https://doi.org/10.2196/18694.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Feehan LM, Geldman J, Sayre EC, Park C, Ezzat AM, Yoo JY, et al. Accuracy of Fitbit devices: systematic review and narrative syntheses of quantitative data. JMIR Mhealth Uhealth. 2018;6(8):e10527. https://doi.org/10.2196/10527.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Evenson KR, Goto MM, Furberg RD. Systematic review of the validity and reliability of consumer-wearable activity trackers. Int J Behav Nutr Phys Act. 2015;12:159. https://doi.org/10.1186/s12966-015-0314-1.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Stoner O, Economou T. An advanced hidden Markov model for hourly rainfall time series. Comput Stat Data Anal. 2020;152:107045. https://doi.org/10.1016/j.csda.2020.107045.

    Article  Google Scholar 

  25. Witowski V, Foraita R, Pitsiladis Y, Pigeot I, Wirsik N. Using hidden markov models to improve quantifying physical activity in accelerometer data - a simulation study. PLoS ONE. 2014;9(12):e114089. https://doi.org/10.1371/journal.pone.0114089.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Kerr J, Patterson RE, Ellis K, Godbole S, Johnson E, Lanckriet G, Staudenmayer J. Objective Assessment of Physical Activity: Classifiers for Public Health. Med Sci Sports Exerc. 2016;48(5):951–7. https://doi.org/10.1249/mss.0000000000000841.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Kavi L, Gammage MD, Grubb BP, Karabin BL. Postural tachycardia syndrome: multiple symptoms, but easily missed. Br J Gen Pract. 2012;62(599):286–7. https://doi.org/10.3399/bjgp12X648963.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Brakenridge CJ, Gardiner PA, Grigg RV, Winkler EAH, Fjeldsoe BS, Schaumberg MA, et al. Sitting less and moving more for improved metabolic and brain health in type 2 diabetes: ‘OPTIMISE your health’ trial protocol. BMC Public Health. 2022;22(1):929. https://doi.org/10.1186/s12889-022-13123-x.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Dillon HT, Saner NJ, Ilsley T, Kliman D, Spencer A, Avery S, et al. Preventing the adverse cardiovascular consequences of allogeneic stem cell transplantation with a multi-faceted exercise intervention: the ALLO-Active trial protocol. BMC Cancer. 2022;22(1):898. https://doi.org/10.1186/s12885-022-09793-w.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Straczkiewicz M, Glynn NW, Zipunnikov V, Harezlak J. Fast and robust algorithm for detecting body posture using wrist-worn accelerometers. J Meas Phys Behav. 2020;3(4):285–93. https://doi.org/10.1123/jmpb.2019-0067.

    Article  Google Scholar 

  31. Tudor-Locke C, Rowe DA. Using cadence to study free-living ambulatory behaviour. Sports Med. 2012;42(5):381–98. https://doi.org/10.2165/11599170-000000000-00000.

    Article  PubMed  Google Scholar 

  32. Lyden K, Keadle SK, Staudenmayer J, Freedson PS. The activPALTM accurately classifies activity intensity categories in healthy adults. Med Sci Sports Exerc. 2017;49(5):1022–8. https://doi.org/10.1249/mss.0000000000001177.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Edwardson CL, Winkler EAH, Bodicoat DH, Yates T, Davies MJ, Dunstan DW, Healy GN. Considerations when using the activPAL monitor in field-based research with adult populations. J Sport Health Sci. 2017;6(2):162–78. https://doi.org/10.1016/j.jshs.2016.02.002.

    Article  PubMed  Google Scholar 

  34. Winkler EA, Bodicoat DH, Healy GN, Bakrania K, Yates T, Owen N, et al. Identifying adults’ valid waking wear time by automated estimation in activPAL data collected with a 24 h wear protocol. Physiol Meas. 2016;37(10):1653–68. https://doi.org/10.1088/0967-3334/37/10/1653.

    Article  PubMed  Google Scholar 

  35. Grolemund G, Wickham H. Dates and Times made Easy with lubridate. J Stat Softw. 2011;40(3):1–25. https://doi.org/10.18637/jss.v040.i03.

    Article  Google Scholar 

  36. Chastin SFM, Winkler EAH, Eakin EG, Gardiner PA, Dunstan DW, Owen N, Healy GN. Sensitivity to change of objectively-derived measures of sedentary behavior. Meas Phys Educ Exerc Sci. 2015;19(3):138–47. https://doi.org/10.1080/1091367X.2015.1050592.

    Article  Google Scholar 

  37. Melnyk I, Banerjee A. A spectral algorithm for inference in Hidden Semi-markov models. J Mach Learn Res. 2017;18(1):1164–202.

    Google Scholar 

  38. O’Connell J, Højsgaard S. Hidden Semi Markov models for multiple Observation sequences: the Mhsmm Package for R. J Stat Softw. 2011;39(4):1–22. https://doi.org/10.18637/jss.v039.i04.

    Article  Google Scholar 

  39. Altman DG, Bland JM. Measurement in Medicine: the analysis of Method Comparison studies. J R Stat Soc Ser Stat Soc. 1983;32(3):307–17. https://doi.org/10.2307/2987937.

    Article  Google Scholar 

  40. Caldwell AR, Natick MA. Exploring Equivalence Testing with the Updated TOSTER R Package. PsyArXiv. 2022. https://doi.org/10.31234/osf.io/ty8de

  41. Villars C, Bergouignan A, Dugas J, Antoun E, Schoeller DA, Roth H, et al. Validity of combining heart rate and uniaxial acceleration to measure free-living physical activity energy expenditure in young men. J Appl Physiol (1985). 2012;113(11):1763–71. https://doi.org/10.1152/japplphysiol.01413.2011.

    Article  CAS  PubMed  Google Scholar 

  42. Pereira MA, Mullane SL, Toledo MJL, Larouche ML, Rydell SA, Vuong B, et al. Efficacy of the ‘Stand and move at work’ multicomponent workplace intervention to reduce sedentary time and improve cardiometabolic risk: a group randomized clinical trial. Int J Behav Nutr Phys Act. 2020;17(1):133. https://doi.org/10.1186/s12966-020-01033-3.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Nelson BW, Low CA, Jacobson N, Areán P, Torous J, Allen NB. Guidelines for wrist-worn consumer wearable assessment of heart rate in biobehavioral research. NPJ Digit Med. 2020;3:90. https://doi.org/10.1038/s41746-020-0297-4.

    Article  PubMed  PubMed Central  Google Scholar 

  44. O’Driscoll R, Turicchi J, Beaulieu K, Scott S, Matu J, Deighton K, et al. How well do activity monitors estimate energy expenditure? A systematic review and meta-analysis of the validity of current technologies. Br J Sports Med. 2020;54(6):332–40. https://doi.org/10.1136/bjsports-2018-099643.

    Article  PubMed  Google Scholar 

  45. Rowlands AV, Yates T, Olds TS, Davies M, Khunti K, Edwardson CL. Wrist-worn accelerometer-brand independent posture classification. Med Sci Sports Exerc. 2016;48(4):748–54. https://doi.org/10.1249/mss.0000000000000813.

    Article  PubMed  Google Scholar 

  46. Gao Z, Liu W, McDonough DJ, Zeng N, Lee JE. The Dilemma of analyzing physical activity and sedentary behavior with wrist Accelerometer Data: challenges and opportunities. J Clin Med. 2021;10(24). https://doi.org/10.3390/jcm10245951.

  47. Nakandala S, Jankowska MM, Tuz-Zahra F, Bellettiere J, Carlson JA, LaCroix AZ, et al. Application of Convolutional Neural Network Algorithms for advancing Sedentary and Activity Bout classification. J Meas Phys Behav. 2021;4(2):102–10. https://doi.org/10.1123/jmpb.2020-0016.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Greenwood-Hickman MA, Nakandala S, Jankowska MM, Rosenberg DE, Tuz-Zahra F, Bellettiere J, et al. The CNN Hip Accelerometer posture (CHAP) method for classifying sitting patterns from hip accelerometers: a validation study. Med Sci Sports Exerc. 2021;53(11):2445–54. https://doi.org/10.1249/mss.0000000000002705.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

GNH was supported by an Australian MRFF-NHMRC Emerging Leadership Investigator Grant (#1193815). All Baker Heart & Diabetes Institute authors acknowledge support from the Victorian Government’s Operational Infrastructure Support (OIS) scheme. All authors declare that they have no conflict of interest to disclose. The authors declare that results of the study are presented clearly, honestly, and without fabrication, falsification, or inappropriate data manipulation.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

AS conceptualised the statistical approaches, performed statistical analyses, wrote and edited the manuscript, responsible for all aspects of the works presented here. CJB conceptualized the OPTIMISE trial and data collection and contributed to writing the manuscript, DHL contributed to writing the manuscript, contributed to statistical analyses, EH, RG and HD conceptualized the AlloActive trial, contributed to writing the manuscript. HDB contributed to conceptualizing the statistical approaches, JAS contributed to conceptualizing the statistical approaches, contributed to manuscript writing, GNH, NO and DWD conceptualized the OPTIMISE trial, contributed to manuscript writing. EEAHW contributed to conceptualizing the statistical approaches, performed data collection, performed statistical analyses, contributed to manuscript writing. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Agus Salim.

Ethics declarations

Ethics approval and consent to participate

Human ethics approval for OPTIMISE were obtained from Alfred Health Human Ethics Committee (Melbourne, Australia), The University of Queensland Institutional Human Research Ethics Committee (Brisbane, Australia), and the University of the Sunshine Coast Human Research Ethics Committee (Sunshine Coast, Australia). The OPTIMISE Your Health trial has been registered with the Australian New Zealand Clinical Trials Registry (ANZCTRN12618001159246; date of registration: 07/03/2018). Ethics approval for the ALLO-Active trial was provided by the Alfred Hospital Human Research Ethics Committee, and the trial was registered with the Australian New Zealand Clinical Trials Registry (registration number: ACTRN12619000741189). All participants provided written informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Salim, A., Brakenridge, C.J., Lekamlage, D.H. et al. Detection of sedentary time and bouts using consumer-grade wrist-worn devices: a hidden semi-Markov model. BMC Med Res Methodol 24, 222 (2024). https://doi.org/10.1186/s12874-024-02311-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12874-024-02311-5

Keywords