Dynamic risk stratification of worsening heart failure using a deep learning-enabled implanted ambulatory single-lead electrocardiogram

Abstract

Aims

Implantable loop recorders (ILRs) provide continuous single-lead ambulatory electrocardiogram (aECG) monitoring. Whether these aECGs could be used to identify worsening heart failure (HF) is unknown.

Methods and results

We linked ILR aECGs from a Medtronic device database to left ventricular ejection fraction (LVEF) measurements in the Optum^® de-identified electronic health record dataset. We trained an artificial intelligence (AI) algorithm [aECG-convolutional neural network (CNN)] on a dataset of 35 741 aECGs from 2249 patients to identify LVEF ≤ 40% and assessed its performance using the area under the receiver operating characteristic curve. Ambulatory electrocardiogram-CNN was then used to identify patients with increasing risk of HF hospitalization in a real-world cohort of 909 patients with prior HF diagnosis. This dataset provided 12 467 follow-up monthly evaluations, with 201 HF hospitalizations. For every month, time-series features from these predictions were used to categorize patients into high- and low-risk groups and predict HF hospitalization in the next month. The risk of HF hospitalization in the next 30 days was significantly higher in the cohort that aECG-CNN identified as high risk [hazard ratio (HR) 1.89; 95% confidence interval (CI) 1.28–2.79; P = 0.001] compared with low risk, even after adjusting for patient demographics (HR 1.88; 95% CI 1.27–2.79; P = 0.002).

Conclusion

An AI algorithm trained to detect LVEF ≤40% using ILR aECGs can also readily identify patients at increased risk of HF hospitalizations by monitoring changes in the probability of HF over 30 days.

Ejection fraction, Heart failure, Neural network, Deep learning, Electrocardiogram

See the editorial comment for this article ‘Implantable cardiac monitors: the digital future of risk prediction?’, by A. Bauer and C. Dlaska, https://doi.org/10.1093/ehjdh/ztae036.

Introduction

Heart failure (HF) is a chronic disease with high prevalence, morbidity, and mortality. The impact of HF is expected to increase substantially with the aging of the population.¹ The overall economic cost of HF in 2012 was estimated at $108 billion per annum based on a study of patient population across the globe.² With an aging, rapidly expanding and industrializing global population this value will continue to rise. Considering the increasing economic burden of HF, early identification of patients at increased risk of clinical deterioration and hospitalization even prior to the development of severe symptoms may provide an opportunity for early intervention that can prevent worsening of symptoms, slow down disease progression, and improve patient outcomes.^3–10

The electrocardiogram (ECG) is a commonly adopted non-invasive method for screening and diagnosing cardiovascular disease, and the use of the standard 12-lead ECG for systolic HF diagnosis has been ongoing since 1989, advancing from identification of simple abnormalities on ECG to more advanced artificial intelligence (AI) algorithms.^11–16 These algorithms can predict not only the presence but also the development of HF with more accuracy than traditional ECG-derived metrics. However, no system exists which provides continuous 12-lead ECG monitoring in outpatients. Conversely, over a million implantable loop recorders (ILRs) have been implanted and provide continuous single-lead ambulatory electrocardiograms (aECGs).

Although ILR helps to diagnose arrhythmias, whether this single-lead ECG can be used to predict systolic HF events is unknown.

In this study, we show that AI can analyse aECGs from ILRs to identify patients with reduced left ventricular ejection fraction (LVEF) (≤40%). Furthermore, we show by continuously monitoring daily aECGs for changes in adverse features, we can stratify patients into low- and high-risk cohorts.

Methods

Cohort and study design

We conducted a retrospective study of patients in the Optum^® de-identified electronic health record (EHR) dataset between 2007 and 2021 from multiple hospital systems in the USA. Through a methodology compliant with the de-identification standard of the Health Insurance Portability and Accountability Act (HIPAA), a third party determined which of those patients with LVEF measurements from the Optum^® EHR were concomitantly enrolled in the Medtronic DiscoveryLink data warehouse. This is a manufacturer’s de-identified device data warehouse containing continuous ECG data from ILRs. For patients whose data appeared in both datasets, a combined dataset was created that met HIPAA’s de-identification standard. This process is discussed in detail in the Supplementary Appendix. The combined dataset was used to develop the aECG-convolutional neural network (CNN) algorithm. This retrospective analysis using de-identified data falls into the category of non-human research, and no institutional review board approval was indicated.

Development of an ambulatory electrocardiogram-convolutional neural network model to detect left ventricular ejection fraction ≤ 40%

To create the aECG-CNN model development dataset, LVEF measurements from Optum^® EHR were matched to single-lead aECGs measured within 7 days (before/after) of the LVEF measurement date. We excluded patients diagnosed with hypertrophic cardiomyopathy (ICD10: I42.1, I42.2; ICD9: 42511, 42518). To ensure robust and reliable LVEF information and minimize errors introduced through automated data extraction from Optum^® EHR (Figure 1), we only included patients with prior HF diagnosis to create the low ejection fraction (EF) data (LVEF ≤ 40%). Patients with no HF diagnosis and an EF >40% were used to create the normal EF data (LVEF > 40%). More detailed information on data cleanup is provided in the supplementary material (see Supplementary material online, Figure S1). Electrocardiograms were randomized at the patient level to the training, validation, and testing datasets using a 60%:20%:20% split.
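The patient-level randomization described above can be sketched as follows. This is a minimal illustration only, not the authors' code: the function name `split_by_patient`, the seed, and the toy patient identifiers are all assumptions introduced for the example. The point it shows is that patients, not individual ECGs, are shuffled and partitioned, so no patient contributes ECGs to more than one dataset.

```python
import random


def split_by_patient(ecg_patient_ids, fractions=(0.6, 0.2, 0.2), seed=0):
    """Assign ECG indices to train/validation/test at the patient level.

    ecg_patient_ids[i] is the (hypothetical) patient identifier of ECG i.
    All ECGs of one patient end up in the same partition.
    """
    patients = sorted(set(ecg_patient_ids))
    rng = random.Random(seed)
    rng.shuffle(patients)

    n_train = round(fractions[0] * len(patients))
    n_val = round(fractions[1] * len(patients))

    partition_of = {}
    for i, pid in enumerate(patients):
        if i < n_train:
            partition_of[pid] = "train"
        elif i < n_train + n_val:
            partition_of[pid] = "validation"
        else:
            partition_of[pid] = "test"

    split = {"train": [], "validation": [], "test": []}
    for idx, pid in enumerate(ecg_patient_ids):
        split[partition_of[pid]].append(idx)
    return split, partition_of


# Toy example: 10 hypothetical patients with 3 ECGs each.
ids = [f"patient_{i // 3}" for i in range(30)]
split, partition_of = split_by_patient(ids)
```

Splitting on ECGs rather than patients would let near-identical recordings from one person appear in both training and testing data, which would inflate the apparent test performance.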

Figure 1

Dataset creation for the ambulatory electrocardiogram-convolutional neural network model development. Schematic indicates the strategy to obtain robust and reliable dataset for model development. To avoid cross-contamination, no patient data are repeated among training, validation, and testing datasets. *Patient count (n = 5829) selected based on several inclusion, exclusion criteria to ensure the quality of LVEF data as they are obtained from Optum^® EHR and the dataset was captured in Optum^® EHR via natural language processing of procedure/diagnostic notes and prone to natural language processing errors.

A custom deep learning convolutional neural network model (aECG-CNN) was trained on an independent training database of LVEF-aECG pairs to detect LVEF ≤ 40% (full architecture listed in the Supplementary Appendix). The model has 24 layers with 1.7 M total learnable parameters. The algorithm predicts the probability (0–1) of LVEF ≤40% from 10 s of aECG data; one indicates the highest probability of LVEF ≤40%. Cross-entropy loss was calculated and minimized using the adaptive moment estimation optimizer.^17,18 During training, we performed data augmentation through over-sampling of low EF ECGs, Gaussian noise, and cyclical shifting of the signal. There was no test-time augmentation.
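The three training-time augmentations named above (over-sampling of low EF ECGs, Gaussian noise, and cyclical shifting) might look roughly like the sketch below. The noise level, over-sampling factor, and function names are assumptions chosen for illustration; the source does not report them, nor whether augmentation was applied to the raw signal or to the scalogram. Here it is applied to a raw signal stored as a plain list.

```python
import random


def cyclic_shift(signal, shift):
    """Rotate the signal to the right by `shift` samples, wrapping around."""
    shift %= len(signal)
    if shift == 0:
        return list(signal)
    return signal[-shift:] + signal[:-shift]


def add_gaussian_noise(signal, sigma, rng):
    """Add zero-mean Gaussian noise with standard deviation `sigma`."""
    return [x + rng.gauss(0.0, sigma) for x in signal]


def oversample_low_ef(examples, factor):
    """Repeat low EF examples (label 1) `factor` times; keep others once."""
    out = []
    for signal, label in examples:
        copies = factor if label == 1 else 1
        out.extend([(signal, label)] * copies)
    return out


def augment(signal, rng, sigma=0.01):
    """Apply a random cyclic shift followed by Gaussian noise (assumed order)."""
    shifted = cyclic_shift(signal, rng.randrange(len(signal)))
    return add_gaussian_noise(shifted, sigma, rng)


rng = random.Random(0)
toy_signal = [0.0, 0.1, 0.9, 0.2, 0.0, -0.1]          # hypothetical samples
toy_set = [(toy_signal, 1), (toy_signal, 0)]           # one low EF, one normal
balanced = oversample_low_ef(toy_set, factor=3)
augmented = augment(toy_signal, rng)
```

Because no test-time augmentation was used, functions such as `augment` would only be called while building training batches.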

Longitudinal analysis of ambulatory electrocardiogram and assessment of increased risk of future heart failure hospitalization events

There is evidence that fluctuations in LVEF can predict HF hospitalizations.^19,20 We hypothesized that the aECG-CNN algorithm has the potential to track the changes in probabilities of LVEF ≤40% using longitudinal aECG data and the degree of change/relative change can be used to assess the dynamic risk of future HF hospitalization events (Figure 2).

Figure 2

Ambulatory electrocardiogram-convolutional neural network model architecture to detect low ejection fraction. Input to the model is scalograms (image) of single-lead ambulatory electrocardiogram signal.

The proposed algorithm uses the fact that ILRs record a daily ECG and that changes in LVEF may occur over both shorter and longer periods. Therefore, we calculate three running averages of the low EF score (derived from the ECG): two over short (7-day) and medium (15-day) intervals, which are compared with a third over a 30-day baseline for each patient. These periods were chosen empirically to reduce the risk of overfitting to the dataset; it was hypothesized that periods shorter than 7 days would comprise too few measurements and periods longer than 15 days risked ‘missing’ the chance to act. The specific rules used to assign patients to the high- and low-risk cohorts are shown in Table 1.

Table 1

Dynamic risk group stratification based on aECG-CNN probability changes within a month

Risk assessment	Feature set
High risk	Defined as large fluctuations and sustained fluctuations in predicted LVEF.
	Large fluctuations = (Pmax7-Pmax30 ≥ 0.1) AND (Pmax15-Pmean15 ≥ 0.05)
	Sustained fluctuations = (Pmax7-Pmax15 ≥ 0.08) for ≥6 days OR (Pmax7-Pmax30) for ≥5 days
Low risk	Above conditions not met.
Utilized features	Description
Pmax7	The peak probability of LVEF ≤ 40% within the 7-day running average.
Pmax15	The peak probability of LVEF ≤ 40% within the 15-day running average.
Pmean15	The mean probability of LVEF ≤ 40% within the 15-day running average.
Pmax30	The peak probability of LVEF ≤ 40% within the 30-day running average.
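As a rough illustration of how the features in Table 1 could be computed, the sketch below derives Pmax7, Pmax15, Pmean15, and Pmax30 from a series of daily aECG-CNN probabilities and evaluates the 'large fluctuations' rule. Several things here are assumptions rather than details from the source: the trailing-window definition of the running average, the requirement of at least 30 daily values, and the function names. The 'sustained fluctuations' rule is deliberately left out because its second threshold is not stated in the table.

```python
def running_mean(values, window):
    """Trailing running mean over every full window of the given length."""
    return [
        sum(values[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(values))
    ]


def table1_features(daily_prob):
    """Compute the Table 1 features from daily probabilities of LVEF <= 40%."""
    if len(daily_prob) < 30:
        raise ValueError("need at least 30 daily probabilities")
    avg7 = running_mean(daily_prob, 7)
    avg15 = running_mean(daily_prob, 15)
    avg30 = running_mean(daily_prob, 30)
    return {
        "Pmax7": max(avg7),
        "Pmax15": max(avg15),
        "Pmean15": sum(avg15) / len(avg15),
        "Pmax30": max(avg30),
    }


def large_fluctuation(daily_prob):
    """'Large fluctuations' rule of Table 1 (thresholds 0.1 and 0.05)."""
    f = table1_features(daily_prob)
    return (f["Pmax7"] - f["Pmax30"] >= 0.1) and (
        f["Pmax15"] - f["Pmean15"] >= 0.05
    )


# Hypothetical probability series: one stable, one rising in the last week.
stable = [0.2] * 60
rising = [0.2] * 53 + [0.9] * 7
```

With these toy series, `large_fluctuation(stable)` evaluates to `False` and `large_fluctuation(rising)` to `True`, since only the second shows a short-term average well above its 30-day baseline.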

To test this algorithm, we used a real-world cohort of ILR patients with HF hospitalization prior to implant and follow-up ECGs for at least 6 months post-implant (Figure 3). Heart failure hospitalization was used as the endpoint and was defined as an inpatient, emergency department, or observation unit stay in a hospital with a primary discharge diagnosis of HF. Primary diagnosis of HF was ascertained based on ICD9/ICD10 codes: 428.X, 402.01, 402.11, 402.91, 404.01, 404.03, 404.11, 404.13, 404.91, 404.93, I50.X, I11.0, I13.0, and I13.2 as has been described previously.²¹

Figure 3

Schematic indicates the dataset selection for longitudinal analysis of ambulatory electrocardiogram-convolutional neural network algorithm to assess the increased risk of future heart failure events.

The ability of aECG-CNN to predict HF hospitalization was assessed by simulating monthly follow-ups which consisted of looking at the aECG-CNN risk states across 30 days and evaluating the occurrence of clinical events in the following 30 days (Figure 4) as has been described previously.⁵
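The simulated monthly follow-up can be sketched as a sliding pair of 30-day windows: one in which the risk state is assessed and the next in which HF hospitalization is looked for. The sketch below is an assumption-laden simplification, not the study code: days are counted from implant (day 0), the function name is hypothetical, and evaluations whose outcome window is incomplete are simply dropped, whereas the study right-censored them.

```python
def monthly_evaluations(follow_up_days, event_days, window=30):
    """List (assess_start, assess_end, event_in_next_window) per evaluation.

    follow_up_days: total days of data available after implant.
    event_days: days (from implant) on which an HF hospitalization occurred.
    Windows are half-open intervals [start, end).
    """
    evaluations = []
    start = 0
    while start + 2 * window <= follow_up_days:
        assess_end = start + window        # end of risk-assessment window
        outcome_end = assess_end + window  # end of outcome window
        event = any(assess_end <= d < outcome_end for d in event_days)
        evaluations.append((start, assess_end, event))
        start += window
    return evaluations


# Hypothetical patient: 120 days of follow-up, one hospitalization on day 70.
example = monthly_evaluations(follow_up_days=120, event_days=[70])
```

For this hypothetical patient, the hospitalization on day 70 is attributed to the evaluation that assessed days 30 to 59, because day 70 falls in the following 30-day outcome window.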

Figure 4

Every 30 days the previous 30 days are evaluated for risk group based on ambulatory electrocardiogram-convolutional neural network, and then, the subsequent 30 days are evaluated for heart failure event. Start indicates implant date.

Statistical analysis

The performance of the algorithm to identify LVEF ≤40% was assessed using the area under the receiver operating characteristic (AUROC) curve. To assess the algorithm’s ability to predict HF hospitalization, a marginal Cox proportional hazards model was used to calculate the hazard ratios (HRs) and 95% confidence intervals (CIs) for patients identified by the algorithm as high vs. low risk. The marginal model adjusts for multiple observations within individual subjects. Kaplan–Meier analysis was performed. Patients were right censored at the last day of EHR data availability if this fell <30 days after an evaluation. Cox regression models were used to adjust for several clinical variables, including age, gender, race, and comorbidities such as diabetes, hypertension, and history of myocardial infarction (MI). Baseline variables that may be manipulated because of worsening HF (e.g. HF medications) were not adjusted for. All statistical analyses were performed in Minitab (v20.1.3) and SAS (v9.4).

Results

Ambulatory electrocardiogram-convolutional neural network algorithm to predict left ventricular ejection fraction ≤ 40%

The aECG-CNN model was trained on a dataset of 35 741 LVEF-ECG pairs from 2249 patients to predict the probability of LVEF ≤40%. The model was validated on an independent dataset of 6721 LVEF-ECG pairs from 750 patients and tested on another independent dataset of 6611 LVEF-ECG pairs from 750 patients. The baseline characteristics of the total dataset used for the development, validation, and testing of the aECG-CNN model are provided in Table 2. The model yielded an accuracy, a sensitivity, a specificity, and an AUROC of 75%, 70%, 76%, and 0.8, respectively, in the independent test dataset. Full event counts as a 2 × 2 table, precision, and F1 scores are provided in the Supplementary Appendix, Supplementary material online, Tables S1 and S2. The receiver operating characteristic curve in Figure 5 shows the performance of the model on the test dataset.
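For readers less familiar with how accuracy, sensitivity, and specificity derive from a 2 × 2 table, a small sketch follows. The counts used are hypothetical and were chosen only so that the resulting percentages match the rounded values quoted above; they are not the study's counts, which are given in the supplementary tables.

```python
def classification_metrics(tp, fn, tn, fp):
    """Standard metrics from a 2 x 2 confusion table.

    tp/fn: low EF ECGs classified correctly/incorrectly.
    tn/fp: normal EF ECGs classified correctly/incorrectly.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "precision": tp / (tp + fp),
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Hypothetical counts, NOT taken from the study.
metrics = classification_metrics(tp=70, fn=30, tn=380, fp=120)
```

With these illustrative counts the low EF class is the minority, so accuracy sits closer to specificity than to sensitivity, which is the same pattern as in the reported figures.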

Figure 5

The diagram shows the receiver operating characteristic of ambulatory electrocardiogram-convolutional neural network classification model on test dataset.

Table 2

Baseline patient characteristics of aECG-CNN algorithm development/validation/testing set

Clinical history	All (n = 3749)	Training set (n = 2249)	Validation set (n = 750)	Test set (n = 750)	P-value
Mean age (years)	68 (14)	68 (14)	68 (14)	68 (14)	0.69
Male gender	2009 (53.6)	1202 (53.4)	401 (53.4)	406 (54.1)	0.94
Mean LVEF	57.9 (8.8)	57.9 (8.8)	58.1 (8.9)	57.97 (8.9)	0.17
Race					0.43
African American	487 (13.0)	294 (13.1)	110 (14.7)	83 (11.1)
Caucasian	3006 (80.2)	1794 (79.7)	595 (79.3)	617 (82.2)
Asian	42 (1.1)	25 (1.1)	9 (1.2)	8 (1.1)
Others	214 (5.7)	136 (6.1)	36 (4.8)	42 (5.6)
Ethnicity
Hispanic	169 (4.7)	105 (4.7)	31 (4.1)	33 (4.4)	0.92
Comorbidities
Hypertension	2678 (71.4)	1602 (71.2)	535 (71.3)	541 (72.1)	0.89
History of MI	684 (18.2)	405 (18.0)	136 (18.1)	143 (19.1)	0.81
Diabetes mellitus	1133 (30.2)	659 (29.3)	242 (32.3)	232 (30.9)	0.27
Cardiomyopathy	566 (15.1)	336 (14.9)	122 (16.2)	108 (14.4)	0.57
Atrial fibrillation	1154 (30.8)	702 (31.2)	230 (30.7)	222 (29.6)	0.71
Chronic obstructive pulmonary disease	49 (1.3)	33 (1.5)	7 (0.93)	9 (1.2)	0.52
Heart failure	296 (7.9)	177 (7.8)	63 (8.4)	56 (7.5)	0.79

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Ambulatory electrocardiogram-convolutional neural network algorithm to assess dynamic risk of heart failure hospitalization

A total of 909 patients with 12 467 follow-up monthly evaluations were analysed to assess the risk of future HF hospitalization based on the relative change in probability of aECG-CNN model inference output. The mean follow-up was 13.7 months from implant date. A total of 201 monthly evaluations (1.6%) had HF hospitalization in the next 30 days. The baseline characteristics of the patient cohort are provided in Table 3, including the reason for ILR implantation. The average age of patients was 68 ± 13 years, and 51% were male. The prevalence of comorbidities diagnosed prior to ILR implant was as follows: diabetes (39%), hypertension (95%), dilated cardiomyopathy (25%), congestive HF (99%), valvular disease (3%), coronary artery disease (75%), history of MI (56%), renal dysfunction (52%), ventricular arrhythmia (30%), and atrial fibrillation (57%). Based on the change in probability of LVEF ≤40% during monthly evaluations, 1830 evaluations (14.7%) were categorized as high risk and 10 637 evaluations (85.3%) as low risk. Figure 6 shows the Kaplan–Meier curve of cumulative incidence of monthly evaluations with subsequent HF hospitalization events in the next 30 days among the two risk groups identified by the aECG-CNN model. Patients whom aECG-CNN identified as high risk were almost twice as likely to have HF hospitalization (HR 1.89; 95% CI 1.28–2.79; P = 0.001) as those it identified as low risk (Table 4). This performance was unchanged after adjusting for patient demographics (HR 1.88; 95% CI 1.27–2.79; P = 0.002).

Figure 6

The diagram shows the Kaplan–Meier analysis of cumulative incidence of monthly evaluations with subsequent heart failure hospitalization among the high-risk and low-risk categories. 494 patients contributed to the high-risk group, 901 to the low-risk group, and 486 to both groups.

Table 3

Baseline demographics of patients in the study

Clinical history	Total (n = 909)
Mean age	68 (13)
Male gender	464 (51.1)
Race
African American	163 (17.9)
Asian	7 (0.8)
Caucasian	695 (76.5)
Other	44 (4.8)
Ethnicity, Hispanic	44 (4.8)
Baseline LVEF	53.5 (13.5)
NT-proBNP	1512 (3983)
Comorbidities
Diabetes mellitus	354 (38.9)
Hypertension	861 (94.7)
Renal dysfunction	475 (52.3)
History of MI	512 (56.3)
Atrial fibrillation	522 (57.4)
Ventricular arrhythmias	277 (30.5)
Coronary artery disease	684 (75.2)
Congestive heart failure	899 (98.9)
Dilated cardiomyopathy	225 (24.7)
Hypertrophic cardiomyopathy	16 (1.7)
Valvular heart disease	28 (3.1)
Medications history
Angiotensin-converting enzyme inhibitors/Angiotensin receptor blockers	478 (52.6)
Anticoagulants	494 (54.3)
Diuretics	811 (89.2)
Entresto	21 (2.3)
Spironolactone	219 (24.1)
Beta-blockers	710 (78.1)
Vasodilator–nitrate	468 (51.5)
Antiarrhythmic Drugs
Class I/III/IV	289 (31.8)
Class I	48 (5.2)
Class III/IV	272 (29.9)
Reason for ICM monitoring
AF ablation monitoring	33 (3.6)
AF management	126 (13.8)
Cryptogenic stroke	162 (17.8)
Palpitations	42 (4.6)
Suspected AF	62 (6.8)
Syncope	341 (37.5)
Ventricular tachycardia	21 (2.3)
Other/unknown	122 (13.4)

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Table 4

HF event rate comparison between different risk groups based on aECG-CNN in ILR patients with history of HF events any time prior to implant

Diagnostic parameter	Number of evaluations (%)	Number of HF events (% of evaluations)	Hazard ratio (95% CI)	P-value
Risk group based on aECG-CNN				0.001
Low	10 637 (85.3)	152 (1.4)	Reference
High	1830 (14.7)	49 (2.7)	1.89 (1.28–2.79)
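The event percentages in Table 4 can be checked with simple arithmetic, as sketched below using only the counts shown in the table. The crude ratio of event proportions computed here is an illustration and not the study's analysis: the reported hazard ratio comes from a marginal Cox model that accounts for repeated evaluations within patients, so the two figures are close (about 1.87 vs. 1.89) but not expected to be identical.

```python
# Counts taken from Table 4.
evaluations = {"low": 10637, "high": 1830}
hf_events = {"low": 152, "high": 49}

# Proportion of monthly evaluations followed by an HF hospitalization.
rates = {group: hf_events[group] / evaluations[group] for group in evaluations}

# Crude (unadjusted) ratio of the two proportions; not a hazard ratio.
crude_ratio = rates["high"] / rates["low"]

for group in ("low", "high"):
    print(f"{group}: {100 * rates[group]:.1f}% of evaluations")
print(f"crude ratio high/low: {crude_ratio:.2f}")
```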

Discussion

In this study, we show that an AI algorithm trained to identify patients with impaired LVEF from ILR aECGs can be used to dynamically predict the risk of HF hospitalization over the next 30 days. Early identification of these patients would allow timely cardiovascular care including appropriate diagnostic testing, medication optimization, or even device-based therapies if deemed necessary which may improve patient outcomes and ease acute healthcare provision resource utilization.^3,6,8

This model has the potential to reside in a remote monitoring platform and continuously assess the changes in aECG to monitor the probability of LVEF ≤ 40% and any subsequent increase in the risk of hospitalization for worsening HF. Previous studies have shown the utility of standard 12-lead ECG markers to assess future HF events, and indeed, some of these report higher areas under the curve (AUCs) than this study.^11–15,22–25 However, these previous studies typically rely on 12-lead ECGs, a requirement that limits the clinical utility of the approach. No technology exists that readily permits the regular automated gathering of continuous 12-lead aECGs that could serve as a telemonitoring solution. Ambulatory electrocardiogram-CNN, however, uses daily single-lead aECGs from ILRs to provide the required continuous dynamic daily risk score for HF hospitalization, at no extra burden for patients with these devices. Such a system may be analogous to other implantable monitoring techniques that exist for ambulatory monitoring of increased risk of worsening HF.^3–10 However, by using the ECG to directly predict cardiac function, this approach may provide more orthogonal inputs to existing biosensors of pressure, impedance, and physical activity.

Besides enabling the diagnosis and monitoring of cardiac arrhythmia,^26–29 ILR devices also monitor diagnostic parameters and store aggregated daily measurements longitudinally over a long period of time. These diagnostic parameters include nighttime and daytime heart rate, atrial fibrillation (AF) burden, ventricular rate during AF, heart rate variability, and activity duration. Additionally, other sensors such as subcutaneous impedance are being investigated for measurement of fluid and respiration rate. Recently, an approach that combines multiple diagnostic parameters in an ILR device using a Bayesian belief network machine learning model to identify patients at risk of worsening HF was reported in patients with HF with reduced and preserved EF.³⁰ Ambulatory electrocardiogram-CNN could readily be added into these ambulatory HF event risk prediction platforms, potentially delivering further improvement in the performance of the currently available multi-parameter algorithms. Whether such a risk scoring methodology for ambulatory management of patients with HF will improve patient outcomes requires prospective evaluation. A randomized controlled study is currently being conducted to investigate whether ILR-based ambulatory management can improve outcomes in Class II and III patients with HF with reduced LVEF or preserved LVEF (ALLEVIATE-HF study: NCT04452149).

Finally, aECG-CNN-based monitoring can also identify patients with a potential asymptomatic or mildly symptomatic reduction in LVEF in an ambulatory setting; indeed, this is what the model was trained to identify. If the model were to suspect new left ventricular impairment in a previously healthy patient, this could trigger confirmatory diagnostic testing (e.g. echocardiography), which could lead to the earlier identification of patients who could benefit from prognostic medication and device therapy, again with potential benefits for patients and healthcare services alike.

Study limitations

We have relied on EHRs for identifying LVEF measurements, clinical endpoints, and the presence of comorbidities. Such approaches are susceptible to data entry errors and faults in natural language processing. We aimed to minimize these by requiring concordant ICD codes.

Single-lead aECG is derived from a short dipole, in a non-standard orientation, which varies from patient to patient. This almost certainly limits the ability of aECG-CNN to identify pathology and predict events when compared with standard 12-lead ECGs. However, because ILRs provide continuous daily ECG monitoring, our system can rely on intra-patient changes in HF probabilities, an approach that is not currently available with 12-lead ECGs.
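The idea of relying on intra-patient change rather than an absolute value can be sketched as follows. This is a minimal illustration, not the time-series features used in the study: the window lengths, the threshold, and the function names are all assumptions introduced here.

```python
from statistics import mean


def intra_patient_change(daily_probs, recent_days=7, baseline_days=30):
    """Recent mean minus the mean of the preceding baseline window.

    daily_probs: one model output probability per day, oldest first.
    Window lengths are illustrative assumptions, not study values.
    """
    needed = recent_days + baseline_days
    if len(daily_probs) < needed:
        raise ValueError(f"need at least {needed} daily values")
    recent = daily_probs[-recent_days:]
    baseline = daily_probs[-needed:-recent_days]
    return mean(recent) - mean(baseline)


def is_high_risk(daily_probs, threshold=0.10):
    # Hypothetical rule: flag a rise above the patient's own baseline
    # that exceeds `threshold`.
    return intra_patient_change(daily_probs) > threshold


stable = [0.2] * 37                  # no change from baseline
rising = [0.2] * 30 + [0.5] * 7      # one week of elevated probability
```

With the flat series the change is zero and no flag is raised; the rising series gives a change of about 0.3 and is flagged by the hypothetical rule. Because each patient is compared with their own baseline, a patient-specific offset in the raw probability (for example, from lead orientation) cancels out of the difference.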

Although this study included over 2000 patients, a prospective study would be useful in demonstrating the system’s clinical utility. First, this would demonstrate the generalizability of the algorithm to an external prospective dataset, rather than an anonymized electronic health record. Second, this would allow the additive predictive power of the system to be ascertained, beyond that of existing telemonitoring biosignals.

Conclusions

This study found that an aECG-CNN algorithm trained to detect reduced LVEF using single-lead aECGs acquired by an ILR can identify patients at increased risk of future HF events. Further investigation is needed to evaluate whether HF risk prediction based on this ECG-based deep learning model can be used to predict and prevent HF events in a prospective study. If this is found to be the case, the benefits for patients and healthcare systems alike would likely be large.

Supplementary material

Supplementary material is available at European Heart Journal – Digital Health.

Acknowledgements

J.H. was funded by the British Heart Foundation (FS/ICRF/22/26039). D.K. was supported by the NIHR Imperial BRC. N.V., S.L., S.S., and J.K. were funded by Medtronic, Minneapolis, MN, USA.

Funding

This study was funded by the British Heart Foundation (FS/ICRF/22/26039), Medtronic, and NIHR Imperial BRC. The funders had no role in the study design, data collection, data analysis, data interpretation, or writing of the report.

Data availability

All patients provided consent to use their de-identified device data for research purposes when they signed up for the Medtronic CareLink Network. As per the contractual data access terms, the de-identified data cannot be shared.

References

1. Heidenreich, Fonarow, Opsha, Sandhu, Sweitzer, Warraich, et al. Economic issues in heart failure in the United States. J Card Fail 2022;453–466.

2. Cook, Cole, Asaria, Jabbour, Francis. The annual global economic burden of heart failure. Int J Cardiol 2014;171:368–376.

3. Abraham, Adamson, Bourge, Aaron, Costanzo, Stevenson, et al. Wireless pulmonary artery haemodynamic monitoring in chronic heart failure: a randomised controlled trial [published correction appears in Lancet. 2012 Feb 4; 379(9814):412]. Lancet 2011;377:658–666.

4. Brachmann, Böhm, Rybak, Klein, Butter, Klemm, et al. Fluid status monitoring with a wireless network to reduce cardiovascular-related hospitalizations and mortality in heart failure: rationale and design of the OptiLink HF Study (Optimization of Heart Failure Management using OptiVol Fluid Status Monitoring and CareLink). Eur J Heart Fail 2011;796–804.

5. Cowie, Sarkar, Koehler, Whellan, Crossley, Tang WHW, et al. Development and validation of an integrated diagnostic algorithm derived from parameters monitored in implantable devices for identifying patients at risk for heart failure hospitalization in an ambulatory setting. Eur Heart J 2013;2472–2480.

6. Hindricks, Taborsky, Glikson, Heinrich, Schumacher, Katz, et al. Implant-based multiparameter telemonitoring of patients with heart failure (IN-TIME): a randomised controlled trial. Lancet 2014;384:583–590.

7. Boehmer, Hariharan, Devecchi, Smith, Molon, Capucci, et al. A multisensor algorithm predicts heart failure events in patients with implanted devices: results from the MultiSENSE study. JACC Heart Fail 2017;216–225.

8. Koehler, Deckwart, Prescher, Wegscheider, Winkler, et al. Telemedical Interventional Management in Heart Failure II (TIM-HF2), a randomised, controlled trial investigating the impact of telemedicine on unplanned cardiovascular hospitalisations and mortality in heart failure patients: study design and description of the intervention. Eur J Heart Fail 2018;1485–1493.

9. Ahmed, Taylor, Green, Moore, Goode, Black, et al. Triage-HF plus: a novel device-based remote monitoring pathway to identify worsening heart failure. ESC Heart Fail 2020;107–116.

10. Zile, Koehler, Sarkar, Butler. Prediction of worsening heart failure events and all-cause mortality using an individualized risk stratification strategy. ESC Heart Fail 2020;4277–4289.

11. Adedinsewo, Carter, Attia, Johnson, Kashou, Dugan, et al. Artificial intelligence-enabled ECG algorithm to identify patients with left ventricular systolic dysfunction presenting to the emergency department with dyspnea. Circ Arrhythm Electrophysiol 2020;e008437.

12. Attia, Kapa, Yao, Lopez-Jimenez, Mohan, Pellikka, et al. Prospective validation of a deep learning electrocardiogram algorithm for the detection of left ventricular systolic dysfunction. J Cardiovasc Electrophysiol 2019;668–674.

13. Bachtiger, Petri, Scott, Park, Kelshiker, Sahemey, et al. Point-of-care screening for heart failure with reduced ejection fraction using artificial intelligence during ECG-enabled stethoscope examination in London, UK: a prospective, observational, multicentre study. Lancet Digit Health 2022;e117–e125.

14. Acharya, Fujita, Hagiwara, Tan, Adam, et al. Deep convolutional neural network for the automated diagnosis of congestive heart failure using ECG signals. Appl Intell 2019.

15. Attia, Kapa, Lopez-Jimenez, McKie, Ladewig, Satam, et al. Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med 2019.

16. Lih, Jahmunah, San, Ciaccio, Yamakawa, Tanabe, et al. Comprehensive electrocardiographic diagnosis based on deep learning. Artif Intell Med 2020;103:101789.

17. Wookey. The real-world-weight cross-entropy loss function: modeling the costs of mislabeling. IEEE 2019;4806–4813.

18. Zhang. Improved Adam optimizer for deep neural networks. 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), Banff, AB, Canada 2018, pp. 1–2. doi:10.1109/IWQoS.2018.8624183

19. Kakihana, Ito, Nakahara, Yamaguchi, Yasuda. Sepsis-induced myocardial dysfunction: pathophysiology and management. J Intensive Care 2016.

20. Parikh, Bhatt, Tan, Allen, Feng, et al. Developing clinical risk prediction models for worsening heart failure events and death by left ventricular ejection fraction. J Am Heart Assoc 2023;e029736.

21. Zile, Kahwash, Sarkar, Koehler, Butler. Temporal characteristics of device-based individual and integrated risk metrics in patients with chronic heart failure. JACC Heart Fail 2023;143–156.

22. O’Neal, Mazur, Bertoni, Bluemke, Al-Mallah, Lima JAC, et al. Electrocardiographic predictors of heart failure with reduced versus preserved ejection fraction: the multi-ethnic study of atherosclerosis. J Am Heart Assoc 2017;e006023.

23. Rautaharju, Prineas, Wood, Zhang, Crow, Heiss. Electrocardiographic predictors of new-onset heart failure in men and in women free of coronary heart disease (from the Atherosclerosis in Communities [ARIC] Study). Am J Cardiol 2007;100:1437–1441.

24. Ahmad, Mujtaba, Floyd, Chen, Soliman. Electrocardiographic markers of atrial cardiomyopathy and risk of heart failure in the multi-ethnic study of atherosclerosis (MESA) cohort. Front Cardiovasc Med 2023;1143338.

25. Reinier, Narayanan, Uy-Evanado, Teodorescu, Chugh, Mack, et al. Electrocardiographic markers and the left ventricular ejection fraction have cumulative effects on risk of sudden cardiac death. JACC Clin Electrophysiol 2015;542–550.

26. Krahn, Klein, Yee, Takle-Newhouse, Norris. Use of an extended monitoring strategy in patients with problematic syncope. Reveal investigators. Circulation 1999;406–410.

27. Sanders, Pürerfellner, Pokushalov, Sarkar, Di Bacco, Maus, et al. Performance of a new atrial fibrillation detection algorithm in a miniaturized insertable cardiac monitor: results from the Reveal LINQ Usability Study. Heart Rhythm 2016;1425–1430.

28. Verma, Champagne, Sapp, Essebag, Novak, Skanes, et al. Discerning the incidence of symptomatic and asymptomatic episodes of atrial fibrillation before and after catheter ablation (DISCERN AF): a prospective, multicenter study. JAMA Intern Med 2013;173:149–156.

29. Sanna, Diener, Passman, Di Lazzaro, Bernstein, Morillo, et al. Cryptogenic stroke and underlying atrial fibrillation. N Engl J Med 2014;370:2478–2486.

30. Zile, Kahwash, Sarkar, Koehler, Zielinski, Mehra, et al. A novel heart failure diagnostic risk score using a minimally invasive subcutaneous insertable cardiac monitor. JACC Heart Fail 2024;182–196.

Author notes

Conflict of interest: D.K. is a consultant for Medtronic Inc., and N.V., S.L., S.S., and J.K. are employees and shareholders of Medtronic Inc.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
