Abstract
Background: Wearable devices offer a promising solution for remotely monitoring heart rate (HR) during home-based cardiac rehabilitation. However, evidence regarding their accuracy across varying exercise intensities and patient profiles remains limited, particularly in populations with cardiovascular disease (CVD) such as those with heart failure (HF).
Objective: The objective of this study was to evaluate the accuracy of HR measurements obtained using the Fitbit Inspire 3 during cardiopulmonary exercise testing (CPX) in patients with CVD, including those with HF.
Methods: In this single-center, prospective pilot study, we enrolled 30 patients with CVD undergoing CPX. HR was simultaneously recorded using electrocardiography and the Fitbit Inspire 3 at 1-minute intervals across various CPX phases: rest, exercise below and above the anaerobic threshold (AT), and recovery. The correlation between the two methods was assessed using the Pearson correlation coefficient. Measurement error was quantified by mean absolute error and mean absolute percentage error (MAPE), with a MAPE of ≤10% defined as the threshold for acceptable agreement.
Results: All data points were 630 points per minute. The Fitbit Inspire 3 device demonstrated a strong overall correlation with electrocardiography-derived HR (r=0.90; IQR 0.88‐0.91) and an acceptable MAPE of 5.40% (SD 8.33%). The total error was 14.9% (94/630), with overestimation and underestimation of 37 (5.8%) points and 57 (9%) points, respectively. The rate of HR underestimation reached 19 (16%) points during exercise above the AT, compared to 1 (3%) point at rest. When stratified by HF stage (B vs C), underestimation was more pronounced in patients with HF (14/275, 5% points vs 40/355, 11.2% points).
Conclusions: The Fitbit Inspire 3 provides acceptable validity for HR monitoring during CPX in patients with CVD. However, clinicians should interpret HR data with caution during high-intensity exercise, especially in patients with HF.
doi:10.2196/77911
Keywords
Introduction
Background
Outpatient cardiac rehabilitation (CR) is a class I recommended therapy for patients with cardiovascular disease (CVD) and is regarded as an essential component of treatment []. However, in Japan, the prevalence of heart failure (HF) is rising among older adults [], while participation in outpatient CR remains low []. Among working-age patients with coronary artery disease (CAD), the challenge of balancing work and CR contributes to the low participation rates []. A recent meta-analysis [] demonstrated that home-based CR combined with digital support yielded comparable improvements in quality of life and reductions in hospital readmissions to conventional outpatient CR. These findings highlight the potential significance of remote support in home-based CR programs.
Maintaining appropriate exercise intensity is crucial to prevent symptom exacerbation when prescribing exercise therapy []. Heart rate (HR), closely linked to oxygen consumption [], is a key indicator for setting exercise intensity and is commonly used in outpatient CR []. HR monitoring during outpatient CR is typically performed using electrocardiography (ECG). However, in home-based settings, continuous ECG monitoring is impractical, leading to reliance on manual pulse checks, whose accuracy remains insufficiently validated.
Advances in wearable digital technology have led to the widespread use of devices using photoplethysmography (PPG), such as smartwatches, which are being investigated as potential alternatives to ECG []. A recent scoping review suggested that smartwatches may be useful tools to support and enhance outpatient disease management []. However, measurement accuracy can vary widely between devices [], and HR obtained via wearable devices tends to be underestimated at higher exercise intensities []. Consequently, whether such technology can be reliably applied to patients with CVD, particularly those with HF who may experience reduced peripheral perfusion, warrants careful evaluation.
A recent validation study [] reported that HR measurements using the Apple Watch 7 (Apple Inc) and Galaxy Watch 4 (Samsung Electronics Co, Ltd) during cardiopulmonary exercise testing (CPX) were highly accurate in patients with CAD, indicating their potential utility in this population. However, these devices are relatively expensive (approximately US $350), and their accessibility to individuals with low income, who may already be reluctant to participate in outpatient CR, remains limited []. Moreover, although the study population included stable patients with CAD, the accuracy of measurement in patients with HF was not evaluated.
Objective
We hypothesized that if a more affordable device, such as the Fitbit Inspire 3 (Fitbit LLC; approximately US $80), could yield similar HR measurement accuracy, it would become more widely accessible. Previous research on the Fitbit series in postoperative patients performing low-intensity activities [] showed promising accuracy, and a validation study involving younger patients with CVD demonstrated good performance during stepwise increases in exercise intensity measured by CPX []. However, no studies have specifically focused on patients with HF. Therefore, this study aimed to evaluate the accuracy of HR measurement from the Fitbit Inspire 3 compared to ECG-based measurements during CPX in patients with CVD, including those with HF.
Methods
Study Design and Setting
This single-center, prospective, observational pilot study was conducted at the Itami City Hospital.
Participants
Participants were recruited from patients who received a prescription for CPX at the Department of Cardiology, Itami City Hospital, between August 2024 and March 2025. The inclusion criteria were as follows: (1) aged ≥18 years at the time of consent, (2) documented agreement to undergo CPX, and (3) provision of written informed consent for study participation. The exclusion criteria were as follows: (1) aged <18 years, (2) persistent atrial fibrillation (AF), (3) known silicone allergy, or (4) refusal to participate.
Sample Size Calculation
As a pilot study, a target sample size of 30 participants was established based on previous studies with sample sizes ranging from 10 to 60 [], as well as the estimated number of CPX procedures expected during the study period.
Data Collection and Procedures
CPX was performed using a cycle ergometer equipped with a breath-by-breath gas analyzer (AE-300S; Minato Medical Science). The testing protocol included a 1-minute rest period and 4 minutes of warm-up at 0 W, followed by a symptom-limited ramp protocol with individualized increments of 10‐20 W/min. The recovery phase involved pedaling at 0 W for ≥5 minutes until clinical status stabilized. Ventilatory parameters, including minute ventilation (VE), oxygen uptake (VO₂), and carbon dioxide output (VCO₂), were recorded every 6 seconds.
The respiratory exchange ratio was calculated as VCO₂/VO₂ using breath-by-breath data. Peak VO₂ was defined as the highest VO₂ value recorded during exercise or the average of the final 18 seconds (3 data points), whichever was greater. Peak VO₂ was normalized by body weight (BW) and expressed in mL/kg/min (peak VO₂/BW), in accordance with standard clinical practice for CPX. The anaerobic threshold (AT) was determined by an experienced cardiologist based on multiple indices: the inflection point in VE/VO₂ without a concurrent rise in VE/VCO₂, increasing partial pressure of end-tidal oxygen without a change in end-tidal carbon dioxide, and other standard criteria.
During CPX, the Fitbit Inspire 3 device was attached to the arm contralateral to the side used for blood pressure measurement. The device was worn approximately 2 finger-widths above the wrist crease, with the optical sensor positioned flush against the skin to minimize motion artifacts. HR was recorded simultaneously using ECG and the wearable device, which logged data at 1-minute intervals []. Test start times were extracted from the electronic medical records. HR data from the wearable device were retrieved through the Fitbit Web Application Programming Interface using the httr and jsonlite packages in R (version 4.4.0; R Foundation for Statistical Computing).
Demographic and Clinical Characteristics
All variables were obtained from patients’ medical records. Baseline demographics data included age, sex, diagnosis, American College of Cardiology (ACC) or American Heart Association (AHA) HF stages [], relevant medical history, current medications, and smoking status. Physical examination findings included height, weight, BMI, and the New York Heart Association functional classification. Laboratory parameters included hemoglobin, creatinine, estimated glomerular filtration rate, and B-type natriuretic peptide. Echocardiographic data included left ventricular ejection fraction (LVEF), measured using the modified Simpson method. HF phenotypes were categorized as HF with reduced ejection fraction (LVEF≤40%), HF with mildly reduced ejection fraction (LVEF 41%‐49%), and HF with preserved ejection fraction (LVEF≥50%) [].
Statistical Analysis
Continuous variables are presented as mean (SD) or median (IQR), as appropriate. Categorical variables are expressed as counts and percentages.
A whole-test analysis was performed using the complete set of 1-minute HR data. In addition, a phase-specific analysis was conducted by assigning each 1-minute sample to 1 of 4 predefined exercise phases: rest, low-intensity (including warm-up; below AT), high-intensity (above AT), and recovery. A stratified whole-test analysis was also carried out based on the ACC or AHA HF stage classification.
Furthermore, stratified analyses by ACC or AHA stage were conducted by repeating both the whole-test and phase-specific analyses within each stage category. Pearson correlation coefficients were calculated to assess the relationship between ECG-based and Fitbit-derived HR. Correlation coefficients were interpreted as follows: 0‐0.30 as negligible, 0.30‐0.50 as low, 0.50‐0.70 as moderate, 0.70‐0.90 as high, and 0.90‐1.00 as very high. Bland-Altman plots were used to evaluate agreement, systematic bias, and limits of agreement. Mean absolute error (MAE) and mean absolute percentage error (MAPE) were calculated to quantify measurement error. On the basis of previous studies, a MAPE of ≤10% was considered acceptable [,]. The proportions of underestimation and overestimation were also calculated. All numerical values and statistical metrics were derived using CPX-measured HR as the reference standard. All analyses were conducted using R (version 4.4.0; R Foundation for Statistical Computing) and RStudio (version 2024.12.1+563; RStudio, Inc).
Ethical Considerations
The study protocol complied with the principles of the Declaration of Helsinki (1975), as revised in 2000, and was approved by the institutional review boards of the Itami City Hospital (2537) and the Shijonawate Gakuen University (24‐4). Written informed consent was obtained from all participants after detailed explanations of the study’s objectives, procedures, potential benefits, and risks were provided by the researchers. To protect participant privacy, study records were de-identified. Each participant was assigned a unique study ID, and the linkage file connecting IDs to personal identifiers was stored exclusively on the hospital’s electronic medical record network; no identifiable information was transferred outside this network. Only variables necessary for the study were collected, and the datasets were processed and managed to prevent the immediate identification of any individual. All analyses were conducted using de-identified datasets on the local computers. Participants did not receive any financial or material compensation for participating in the study.
Results
Demographic and Clinical Characteristics
The demographic and clinical characteristics are summarized in and . Of the 30 patients with CVD included in the study, 13 (43%) were classified as having stage B HF and 17 (57%) as having stage C HF. The median (IQR) age was 65 (54-73) years, and 17 patients (57%) were male. CAD was the most common underlying condition, observed in 16 patients (53%), followed by dilated cardiomyopathy in 6 (20%) patients. Regarding pharmacotherapy, β blockers were prescribed to 23 (77%) patients. According to the New York Heart Association classification, 8 (27%) patients were in class I; 20 (67%) in class II; and 2 (7%) in class III. The median (IQR) LVEF was 52% (45%‐61%), and 20% (6/30) of the patients were categorized as having HF with reduced ejection fraction. The median (IQR) B-type natriuretic peptide concentration was 56 (13‐93) pg/mL. During CPX, the total median exercise time was 8.33 (IQR 7.00‐10.45) minutes, the median peak respiratory exchange ratio was 1.16 (IQR 1.10‐1.23), and the median peak VO₂/BW was 17.2 (IQR 14.5‐21.2) mL/kg/min. When comparing the stage B (n=13, 43%) and stage C (n=17, 57%) groups, patients in the stage B group were older (median age 73 vs 55 y) and had a higher prevalence of CAD (11/13, 85% vs 5/17, 29%). In contrast, nonischemic etiologies, including dilated cardiomyopathy, were more common in the stage C group (6/17, 35% vs 0/13, 0%). β-blocker use was more frequent in the stage C group (17/17, 100% vs 6/13, 46%).
| Characteristics | Overall (n=30) | Stage B (n=13) | Stage C (n=17) | |
| Age (y), median (IQR) | 65 (54-73) | 73 (64-78) | 55 (48-65) | |
| Male, n (%) | 17 (57) | 8 (62) | 9 (53) | |
| BMI (kg/m2), median (IQR) | 23.2 (21.6-26.2) | 22.0 (20.8-24.8) | 23.7 (22.0-27.9) | |
| Etiology, n (%) | ||||
| 16 (53) | 11 (85) | 5 (29) | ||
| 6 (20) | 0 (0) | 6 (35) | ||
| 2 (7) | 0 (0) | 2 (12) | ||
| 2 (7) | 0 (0) | 2 (12) | ||
| 4 (13) | 2 (15) | 2 (12%) | ||
| Comorbidities, n (%) | ||||
| 19 (63) | 10 (77) | 9 (53) | ||
| 12 (40) | 6 (46) | 6 (35) | ||
| Smoking, n (%) | ||||
| 18 (60) | 8 (62) | 10 (59) | ||
| 8 (27) | 4 (31) | 4 (24) | ||
| 4 (13) | 1 (8) | 3 (18) | ||
| FH, n (%) | 1 (3) | 1 (8) | 0 (0) | |
| DM, n (%) | 9 (30) | 4 (31) | 5 (29) | |
| Respiratory disease, n (%) | 4 (13) | 2 (15) | 2 (12) | |
| LVEF (%), median (IQR) | 52 (45-61) | 54 (52-61) | 46 (39-58) | |
| LVEF classification, n (%) | ||||
| 16 (53) | 10 (77) | 6 (35) | ||
| 8 (27) | 3 (23) | 5 (29) | ||
| 6 (20) | 0 (0) | 6 (35) | ||
| NYHA classification, n (%) | ||||
| 8 (27) | 4 (31) | 4 (24) | ||
| 20 (67) | 9 (69) | 11 (65) | ||
| 2 (7) | 0 (0) | 2 (12) | ||
aCAD: coronary artery disease.
bDCM: dilated cardiomyopathy.
cHHD: hypertensive heart disease.
dFH: family history.
eDM: diabetes mellitus.
fLVEF: left ventricular ejection fraction.
gHFpEF: heart failure with preserved ejection fraction.
hHFmrEF: heart failure with mildly reduced ejection fraction.
iHFrEF: heart failure with reduced ejection fraction.
jNYHA: New York Heart Association.
| Characteristics | Overall (n=30) | Stage B (n=13) | Stage C (n=17) | |
| ACE-i, ARB, or ARNI, n (%) | 28 (93) | 13 (100) | 15 (88) | |
| β blocker, n (%) | 23 (77) | 6 (46) | 17 (100) | |
| SGLT2i, n (%) | 13 (43) | 2 (15) | 11 (65) | |
| MRA, n (%) | 14 (47) | 1 (8) | 13 (76) | |
| Diuretics, n (%) | 7 (23) | 0 (0) | 7 (41) | |
| Nitrates, n (%) | 11 (37) | 8 (62) | 3 (18) | |
| Calcium antagonists, n (%) | 3 (10) | 1 (8) | 2 (12) | |
| Antiplatelet agents, n (%) | 16 (53) | 12 (92) | 4 (24) | |
| Anticoagulants, n (%) | 8 (27) | 3 (23) | 5 (29) | |
| Statins, n (%) | 16 (53) | 11 (85) | 5 (29) | |
| Ezetimibe, n (%) | 1 (3) | 1 (8) | 0 (0) | |
| Amiodarone, n (%) | 1 (3) | 0 (0) | 1 (6) | |
| Hemoglobin (g/dL), median (IQR) | 13.75 (11.83-14.98) | 12.90 (11.80-14.80) | 13.80 (12.30-15.00) | |
| Creatinine (mg/dL), median (IQR) | 0.92 (0.80-1.04) | 0.88 (0.80-1.11) | 0.92 (0.84-1.01) | |
| eGFR (mL/min/1.73 m²), median (IQR) | 62 (50-70) | 61 (50-69) | 64 (51-70) | |
| BNP (pg/mL), median (IQR) | 56 (13-93) | 61 (25-114) | 27 (12-83) | |
| Missing BNP values, n (%) | 1 (3) | 1 (8) | 0 (0) | |
| Ramp protocol | ||||
| 26 (87) | 12 (92) | 14 (82) | ||
| 4 (13) | 1 (8) | 3 (18) | ||
| Total exercise time (min), median (IQR) | 8.33 (7.00-10.45) | 8.90 (7.00-10.60) | 8.20 (7.15-9.65) | |
| Peak RER, median (IQR) | 1.16 (1.10-1.23) | 1.16 (1.11-1.23) | 1.15 (1.04-1.22) | |
| Peak WR (W), median (IQR) | 93 (75-112) | 93 (81-110) | 98 (74-112) | |
| Peak VO2 (mL/min), median (IQR) | 1158 (883-1324) | 1097 (984-1,277) | 1194 (820-1348) | |
| Peak VO2/BW (mL/kg/min), median (IQR) | 17.2 (14.5-21.2) | 17.1 (15.6-22.1) | 17.3 (14.4-20.2) | |
| ATVO2 (ml/min), median (IQR) | 724 (610-823) | 682 (624-772) | 745 (584-840) | |
| ATVO2/BW (mL/kg/min), median (IQR) | 11.80 (9.86-12.82) | 11.69 (10.64-12.54) | 11.99 (9.52-12.83) | |
aCPX: cardiopulmonary exercise testing.
bACE-i: angiotensin-converting enzyme inhibitor.
cARB: angiotensin II receptor blocker.
dARNI: angiotensin receptor–neprilysin inhibitor.
eSGLT2i: sodium-glucose cotransporter 2 inhibitor.
fMRA: mineralocorticoid receptor antagonist.
geGFR: estimated glomerular filtration rate.
hBNP: B-type natriuretic peptide.
iRER: respiratory exchange ratio.
jWR: work rate.
kVO₂: oxygen uptake.
lVO₂/BW: oxygen uptake per body weight.
mATVO₂: oxygen consumption at anaerobic threshold.
Validity of HR Measurement Using the Fitbit Inspire 3
A comparison between HR measurements obtained via CPX and those estimated by the Fitbit Inspire 3 is presented. In total, 630 data points were analyzed and categorized into 4 exercise phases: rest, below and above the AT, and recovery (). These data were further stratified by HF stage (stages B and C; ).
| Condition | Correlation coefficient, r (95% CI) | MAE (bpm), mean (SD) | MAPE (%), mean (SD) | Error rate, n (%) | Overestimation, n (%) | Underestimation, n (%) |
| All (N=630) | 0.90 (0.88‐0.91) | 5.20 (8.75) | 5.40 (8.33) | 94 (15) | 37 (6) | 57 (9) |
| Rest (n=30) | 0.92 (0.84‐0.96) | 3.40 (4.15) | 4.67 (5.53) | 3 (10) | 2 (7) | 1 (3) |
| Below AT (n=270) | 0.85 (0.82‐0.88) | 4.41 (7.28) | 5.23 (8.29) | 39 (14) | 10 (4) | 29 (11) |
| Above AT (n=119) | 0.76 (0.67‐0.82) | 8.03 (14.08) | 6.14 (9.62) | 20 (17) | 1 (1) | 19 (16) |
| Recovery (n=211) | 0.92 (0.90‐0.94) | 4.85 (6.46) | 5.32 (7.96) | 27 (13) | 22 (10) | 5 (2) |
aHR: heart rate.
bCPX: cardiopulmonary exercise testing.
cMAPE: mean absolute percentage error.
dMAE: mean absolute error.
ebpm: beats per minute.
fNumber of data points.
gAT: anaerobic threshold.
| Condition | Correlation coefficient, r (95% CI) | MAE (bpm), mean (SD) | MAPE (%), mean (SD) | Error rate, n (%) | Overestimation, n (%) | Underestimation, n (%) |
| All (N=630) | 0.90 (0.88‐0.91) | 5.20 (8.75) | 5.40 (8.33) | 94 (15) | 37 (6) | 57 (9) |
| Stage B (n=275) | 0.92 (0.90‐0.94) | 4.51 (7.66) | 4.67 (7.94) | 31 (11) | 17 (6) | 14 (5) |
| Stage C (n=355) | 0.88 (0.85‐0.90) | 5.73 (9.49) | 5.98 (8.59) | 58 (16) | 18 (5) | 40 (11) |
aHR: heart rate.
bCPX: cardiopulmonary exercise testing.
cMAPE: mean absolute percentage error.
dMAE: mean absolute error.
ebpm: beats per minute.
fNumber of data points.
In the overall sample (N=630), the correlation coefficient between CPX-measured HR and Fitbit Inspire 3–derived HR was 0.90 (95% CI 0.88‐0.91; ). The MAE was 5.20 (SD 8.75) beats per minute (bpm), and the MAPE was 5.40% (SD 8.33%). The total error was 94 (N=630, 15%), with overestimation and underestimation of 37 (6%) and 57 (9%), respectively. The Bland-Altman plot () displays CPX-measured HR on the x-axis and the difference between CPX and Fitbit HR measurements on the y-axis. The time-series trend of HR error, with time on the x-axis, is shown in . The average difference in HR was −1.25 bpm, with upper and lower limits of agreement of 18.56 bpm and −21.05 bpm, respectively.



By condition, the rest phase (n=30) showed a correlation coefficient of 0.85 (0.82‐0.88), MAE of 4.41 (SD 7.28) bpm, and MAPE of 5.23% (SD 8.29%). During the below AT condition (n=270), the correlation coefficient was 0.92 (0.90‐0.94), MAE was 4.85 (SD 6.46) bpm, and MAPE was 5.32% (SD 7.96%).
In contrast, the above AT condition (n=119) demonstrated a lower correlation coefficient of 0.76 (0.67‐0.82), with higher MAE and MAPE values of 8.03 (SD 14.08) bpm and 6.14% (SD 9.62%), respectively. In the recovery phase (n=211), the correlation coefficient was 0.92 (0.90‐0.94), with an MAE of 4.85 (SD 6.46) bpm and a MAPE of 5.32% (SD 7.96%). Regarding estimation errors, the above AT phase exhibited a higher underestimation (19/119 points, 15.9%) than overestimation (1/119 points; 0.8%), whereas in the recovery phase, overestimation (22/211 points, 10.4%) exceeded underestimation (5/211 points, 2.3%).
By HF stage, stage B (n=275) demonstrated a correlation coefficient of 0.92 (0.90‐0.94) and an MAE of 4.51 (SD 7.66) bpm. In comparison, stage C (n=355) showed a slightly lower correlation coefficient of 0.88 (0.85‐0.90) and a higher MAE of 5.73 (SD 9.49) bpm. The over- and underestimations were 17 (N=275, 6.1%) and 14 (5%) for stage B and 18 (N=355, 5%) and 40 (11.2%) for stage C, respectively.
Discussion
Principal Findings
In this study, we analyzed the accuracy of HR measurements obtained from the Fitbit Inspire 3 relative to ECG-based HR during CPX in patients with CVD, including those with HF. The Fitbit Inspire 3 device yielded relatively accurate HR estimates at intensities below the AT. However, estimation errors increased above the AT: the device underestimated HR during high-intensity exercise and overestimated it during recovery. In addition, overall accuracy was lower in patients with HF.
Comparison With Previous Work
The overall correlation coefficient was 0.90 (), whereas previous studies involving healthy individuals using Fitbit devices reported correlation coefficients ranging from 0.84 to 0.93 [,]. MAPE was 4.67% (SD 5.53%) at rest and 6.14% (SD 9.62%) above the AT. A study using the Fitbit Charge 4 across various activities, such as stair climbing and squats, found MAPE values ranging from 6.36% to 11.98%, depending on the activity []. Another cycling-based study using the Fitbit Charge 3 reported a MAPE of 6.1% []. Despite using the comparatively less-expensive Fitbit Inspire 3 in this study, we observed similar outcomes, suggesting that, if the exercise modality is equivalent, similar accuracy may be achieved.
As the workload increased during CPX, MAPE rose from 4.67% (SD 5.53%) at rest to 5.23% (SD 8.29%) below the AT and 6.14% (SD 9.62%) above the AT (). Correspondingly, measurement error increased with HR (). A previous study that varied walking speeds on a treadmill reported MAPEs of 9.99% at 3.0 km/h and 10.06% at 6.4 km/h []. Another study that varied cycling workloads, similar to this study, found a MAPE of −7.0% at a light load (50 W) and −15% at workloads of 60%‐85% of HR reserve [], indicating larger measurement errors at higher exercise intensities. The decreased accuracy at elevated workloads may be attributable to several factors: the low test-retest reliability of wrist-based PPG sensors (intraclass correlation coefficients <0.5) [], increased forearm muscle contraction to stabilize the handlebars (which can reduce blood flow and introduce signal artifacts) [,], reduced contact between the PPG sensor and the skin [], measurement latency [], and motion artifacts themselves []. In our study, underestimation became more pronounced at higher exercise intensities, whereas overestimation predominated during the recovery phase (). The acute phase of HR recovery is influenced by parasympathetic reactivation, and the late phase is associated with sympathetic withdrawal and reduced catecholamine levels, generally leading to a 12 to 30 bpm decrease within 1 minute []. The fact that 23 (77%) participants were taking β blockers suggests that pharmacological effects, autonomic dysfunction, and the Fitbit device’s delayed response to sudden changes in HR [] may have collectively contributed to these discrepancies.
Notably, patients with stage C HF showed a higher rate of underestimation than those with stage B (11% vs 5%), likely owing to the larger measurement error at high workloads. No validation study has specifically focused on wearable devices in patients with HF. We speculate that because patients with HF have a limited ability to increase cardiac output beyond the AT [] and experience reduced peripheral perfusion due to heightened sympathetic tone, PPG-based HR may be underestimated. Further research is warranted to clarify the accuracy of PPG-based HR measurements above the AT in this population.
Limitations
This study has some limitations worth noting. First, it was a pilot study with a relatively small sample size. In addition, HR values were averaged over 1-minute intervals, leading to fewer data points. Consequently, rapid fluctuations in HR may not have been fully captured and could have contributed to measurement error. Second, we excluded patients with persistent AF; however, patients with both HF and AF may experience further reductions in HR measurement accuracy []. Third, we used cycling as the exercise modality, which restricted upper-limb movement. Home-based exercise therapies commonly involve walking, which requires arm movement. Therefore, we were unable to assess potential artifacts caused by arm motion. Further studies are needed to confirm the accuracy of these devices under more typical home-based exercise conditions. Fourth, the patients with HF in our cohort were relatively young and clinically stable. The results may differ in patients with more advanced HF, and caution should be exercised when generalizing these findings.
Conclusions
This study demonstrated that the accuracy of HR estimation by the Fitbit Inspire 3 varied depending on exercise intensity and patient characteristics. These findings suggest that when using the Fitbit Inspire 3 to support interventions such as home-based exercise therapy in patients with CVD, including those with HF, careful consideration should be given to the patient’s HF stage and exercise intensity, as well as to the device’s potential limitations in different usage scenarios.
Acknowledgments
The authors express their sincere gratitude to physical therapist Yuka Nakashima and the physical and occupational therapists in the Rehabilitation Department of Itami City Hospital for their invaluable support in facilitating this study. This work was supported by JSPS KAKENHI (grant JP24K20549).
Data Availability
The data generated and analyzed during this research are available from the corresponding author upon reasonable request.
Authors' Contributions
Conceptualization: KK
Data curation: KK (lead), YH (supporting)
Formal analysis: KK
Funding acquisition: KK
Investigation: YH, RF, TH
Methodology: KK
Project administration: KK (lead), YH (supporting)
Resources: YH, HM
Supervision: HS
Validation: KK
Visualization: KK
Writing–original draft: KK
Writing–review & editing: HS (lead), YH (supporting), RF (supporting)
Conflicts of Interest
None declared.
References
- Makita S, Yasu T, Akashi YJ, et al. JCS/JACR 2021 Guideline on rehabilitation in patients with cardiovascular disease. Circ J. Dec 23, 2022;87(1):155-235. [CrossRef] [Medline]
- Fujimoto W, Toh R, Takegami M, et al. Estimating incidence of acute heart failure syndromes in Japan - an analysis from the KUNIUMI Registry. Circ J. Sep 24, 2021;85(10):1860-1868. [CrossRef] [Medline]
- Kamiya K, Yamamoto T, Tsuchihashi-Makaya M, et al. Nationwide survey of multidisciplinary care and cardiac rehabilitation for patients with heart failure in Japan - an analysis of the AMED-CHF Study. Circ J. Jun 25, 2019;83(7):1546-1552. [CrossRef] [Medline]
- Sugiharto F, Nuraeni A, Trisyani Y, Melati Putri A, Aghnia Armansyah N. Barriers to participation in cardiac rehabilitation among patients with coronary heart disease after reperfusion therapy: a scoping review. Vasc Health Risk Manag. 2023;19:557-570. [CrossRef] [Medline]
- Molloy CD, Long L, Mordi IR, et al. Exercise-based cardiac rehabilitation for adults with heart failure - 2023 Cochrane systematic review and meta-analysis. Eur J Heart Fail. Dec 2023;25(12):2263-2273. [CrossRef] [Medline]
- Sartor F, Vernillo G, de Morree HM, et al. Estimation of maximal oxygen uptake via submaximal exercise testing in sports, clinical, and home settings. Sports Med. Sep 2013;43(9):865-873. [CrossRef] [Medline]
- Kim C, Kim SH, Suh MR. Accuracy and validity of commercial smart bands for heart rate measurements during cardiopulmonary exercise test. Ann Rehabil Med. Aug 2022;46(4):209-218. [CrossRef] [Medline]
- Friend SH, Ginsburg GS, Picard RW. Wearable digital health technology. N Engl J Med. Nov 30, 2023;389(22):2100-2101. [CrossRef] [Medline]
- Fabien S, Waechter S, Kayode GA, Müller B, Roth M, Koehler F. Smartwatches in the assessment of heart failure patients in epidemiology and pathophysiology studies: a scoping review. ESC Heart Fail. Jun 2025;12(3):1727-1738. [CrossRef] [Medline]
- Fuller D, Colwell E, Low J, et al. Reliability and validity of commercially available wearable devices for measuring steps, energy expenditure, and heart rate: systematic review. JMIR Mhealth Uhealth. Sep 8, 2020;8(9):e18694. [CrossRef] [Medline]
- Jo E, Lewis K, Directo D, Kim MJ, Dolezal BA. Validation of biofeedback wearables for photoplethysmographic heart rate tracking. J Sports Sci Med. Aug 5, 2016;15(3):540-547. [Medline]
- Kim C, Song JH, Kim SH. Validation of wearable digital devices for heart rate measurement during exercise test in patients with coronary artery disease. Ann Rehabil Med. Aug 2023;47(4):261-271. [CrossRef] [Medline]
- Helmer P, Hottenrott S, Rodemers P, et al. Accuracy and systematic biases of heart rate measurements by consumer-grade fitness trackers in postoperative patients: prospective clinical trial. J Med Internet Res. Dec 30, 2022;24(12):e42359. [CrossRef] [Medline]
- Pierick AR, Burke KJ, Prusi M, et al. Validity of wrist-worn activity tracker heart rate detection in Fontan patients during exercise. Med Sci Sports Exerc. Feb 1, 2025;57(2):280-284. [CrossRef] [Medline]
- Gorny AW, Liew SJ, Tan CS, Müller-Riemenschneider F. Fitbit Charge HR wireless heart rate monitor: validation study conducted under free-living conditions. JMIR Mhealth Uhealth. Oct 20, 2017;5(10):e157. [CrossRef] [Medline]
- Heidenreich PA, Bozkurt B, Aguilar D, et al. 2022 AHA/ACC/HFSA Guideline for the Management of Heart Failure: a report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation. May 3, 2022;145(18):e895-e1032. [CrossRef] [Medline]
- Bozkurt B, Coats AJS, Tsutsui H, et al. Universal definition and classification of heart failure: a report of the Heart Failure Society of America, Heart Failure Association of the European Society of Cardiology, Japanese Heart Failure Society and Writing Committee of the Universal Definition of Heart Failure: endorsed by the Canadian Heart Failure Society, Heart Failure Association of India, Cardiac Society of Australia and New Zealand, and Chinese Heart Failure Association, Heart Failure Association. Eur J Heart Fail. Mar 2021;23(3):352-380. [CrossRef] [Medline]
- Boudreaux BD, Hebert EP, Hollander DB, et al. Validity of wearable activity monitors during cycling and resistance exercise. Med Sci Sports Exerc. Mar 2018;50(3):624-633. [CrossRef] [Medline]
- Nelson BW, Allen NB. Accuracy of consumer wearable heart rate measurement during an ecologically valid 24-hour period: intraindividual validation study. JMIR Mhealth Uhealth. Mar 11, 2019;7(3):e10828. [CrossRef] [Medline]
- Stahl SE, An HS, Dinkel DM, Noble JM, Lee JM. How accurate are the wrist-based heart rate monitors during walking and running activities? Are they accurate enough? BMJ Open Sport Exerc Med. 2016;2(1):e000106. [CrossRef] [Medline]
- Wang R, Blackburn G, Desai M, et al. Accuracy of wrist-worn heart rate monitors. JAMA Cardiol. Jan 1, 2017;2(1):104-106. [CrossRef] [Medline]
- Nissen M, Slim S, Jäger K, et al. Heart rate measurement accuracy of Fitbit Charge 4 and Samsung Galaxy Watch Active 2: device evaluation study. JMIR Form Res. Mar 1, 2022;6(3):e33635. [CrossRef] [Medline]
- Haveman ME, van Rossum MC, Vaseur RME, et al. Continuous monitoring of vital signs with wearable sensors during daily life activities: validation study. JMIR Form Res. Jan 7, 2022;6(1):e30863. [CrossRef] [Medline]
- Muggeridge DJ, Hickson K, Davies AV, et al. Measurement of heart rate using the Polar OH1 and Fitbit Charge 3 wearable devices in healthy adults during light, moderate, vigorous, and sprint-based exercise: validation study. JMIR Mhealth Uhealth. Mar 25, 2021;9(3):e25313. [CrossRef] [Medline]
- Mol A, Meskers CGM, Niehof SP, Maier AB, van Wezel RJA. Pulse transit time as a proxy for vasoconstriction in younger and older adults. Exp Gerontol. Jul 1, 2020;135(110938):110938. [CrossRef] [Medline]
- Johnson PC. Overview of the microcirculation. In: Tuma RF, Duran WN, Ley K, editors. Microcirculation. Academic Press; 2008:xi-xxiv. [CrossRef] ISBN: 9780123745309
- Johnson JM. Physical training and the control of skin blood flow. Med Sci Sports Exerc. Mar 1998;30(3):382-386. [CrossRef] [Medline]
- Allen J. Photoplethysmography and its application in clinical physiological measurement. Physiol Meas. Mar 2007;28(3):R1-R39. [CrossRef] [Medline]
- Støve MP, Holm RS, Kjaersgaard AS, Duncker K, Jensen MR, Larsen BT. Measurement latency significantly contributes to reduced heart rate measurement accuracy in wearable devices. J Med Eng Technol. Apr 2020;44(3):125-132. [CrossRef] [Medline]
- Bent B, Goldstein BA, Kibbe WA, Dunn JP. Investigating sources of inaccuracy in wearable optical heart rate sensors. NPJ Digit Med. 2020;3(1):18. [CrossRef] [Medline]
- Petek BJ, Al-Alusi MA, Moulson N, et al. Consumer wearable health and fitness technology in cardiovascular medicine: JACC state-of-the-art review. J Am Coll Cardiol. Jul 18, 2023;82(3):245-264. [CrossRef] [Medline]
- Nappi A, Cuocolo A, Imbriaco M, et al. Ambulatory monitoring of left ventricular function: walk and bicycle exercise in congestive heart failure. J Nucl Med. Jun 1997;38(6):948-953. [Medline]
- Al-Kaisey AM, Koshy AN, Ha FJ, et al. Accuracy of wrist-worn heart rate monitors for rate control assessment in atrial fibrillation. Int J Cardiol. Feb 1, 2020;300:161-164. [CrossRef] [Medline]
Abbreviations
| ACC: American College of Cardiology |
| AF: atrial fibrillation |
| AHA: American Heart Association |
| AT: anaerobic threshold |
| bpm: beats per minute |
| BW: body weight |
| CAD: coronary artery disease |
| CPX: cardiopulmonary exercise testing |
| CR: cardiac rehabilitation |
| CVD: cardiovascular disease |
| ECG: electrocardiography |
| HF: heart failure |
| HR: heart rate |
| LVEF: left ventricular ejection fraction |
| MAE: mean absolute error |
| MAPE: mean absolute percentage error |
| PPG: photoplethysmography |
| VCO2: carbon dioxide output |
| VE: minute ventilation |
| VO2: oxygen uptake |
Edited by Goran Krstačić; submitted 22.May.2025; peer-reviewed by Morten Støve, Vinicius Damasceno; final revised version received 04.Aug.2025; accepted 03.Sep.2025; published 06.Oct.2025.
Copyright© Kazufumi Kitagaki, Yuji Hongo, Rie Futai, Takeshi Hasegawa, Hiroshi Morikawa, Hisashi Shimoyama. Originally published in JMIR Cardio (https://cardio.jmir.org), 6.Oct.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Cardio, is properly cited. The complete bibliographic information, a link to the original publication on https://cardio.jmir.org, as well as this copyright and license information must be included.

