Abstract

Aims

Implantable loop recorders (ILRs) provide continuous single-lead ambulatory electrocardiogram (aECG) monitoring. Whether these aECGs could be used to identify worsening heart failure (HF) is unknown.

Methods and results

We linked ILR aECG from Medtronic device database to the left ventricular ejection fraction (LVEF) measurements in Optum® de-identified electronic health record dataset. We trained an artificial intelligence (AI) algorithm [aECG-convolutional neural network (CNN)] on a dataset of 35 741 aECGs from 2247 patients to identify LVEF ≤ 40% and assessed its performance using the area under the receiver operating characteristic curve. Ambulatory electrocardiogram-CNN was then used to identify patients with increasing risk of HF hospitalization in a real-world cohort of 909 patients with prior HF diagnosis. This dataset provided 12 467 follow-up monthly evaluations, with 201 HF hospitalizations. For every month, time-series features from these predictions were used to categorize patients into high- and low-risk groups and predict HF hospitalization in the next month. The risk of HF hospitalization in the next 30 days was significantly higher in the cohort that aECG-CNN identified as high risk [hazard ratio (HR) 1.89; 95% confidence interval (CI) 1.28–2.79; P = 0.001] compared with low risk, even after adjusting patient demographics (HR 1.88; 95% CI 1.27–2.79 P = 0.002).

Conclusion

An AI algorithm trained to detect LVEF ≤40% using ILR aECGs can also readily identify patients at increased risk of HF hospitalizations by monitoring changes in the probability of HF over 30 days.

See the editorial comment for this article ‘Implantable cardiac monitors: the digital future of risk prediction?’, by A. Bauer and C. Dlaska, https://doi.org/10.1093/ehjdh/ztae036.

Introduction

Heart failure (HF) is a chronic disease with high prevalence, morbidity, and mortality. The impact of HF is expected to increase substantially with the aging of the population.1 The overall economic cost of HF in 2012 was estimated at $108 billion per annum based on a study of patient population across the globe.2 With an aging, rapidly expanding and industrializing global population this value will continue to rise. Considering the increasing economic burden of HF, early identification of patients at increased risk of clinical deterioration and hospitalization even prior to the development of severe symptoms may provide an opportunity for early intervention that can prevent worsening of symptoms, slow down disease progression, and improve patient outcomes.3–10

Electrocardiogram (ECG) is a commonly adopted non-invasive method for screening and diagnosing cardiovascular disease, and the use of standard 12-lead ECG for systolic HF diagnosis has been ongoing since 1989, advancing from identification of simple abnormalities on ECG to more advanced artificial intelligence (AI) algorithms.11–16 These algorithms can predict the presence but also development of HF with more accuracy than traditional ECG-derived metrics. However, no system exists which provides continuous 12-lead ECG monitoring in outpatients. Conversely, over a million implantable loop recorders (ILRs) have been implanted and provide continuous single-lead ambulatory electrocardiograms (aECGs).

Although ILR helps to diagnose arrhythmias, whether this single-lead ECG can be used to predict systolic HF events is unknown.

In this study, we show that AI can analyse aECGs from ILRs to identify patients with reduced left ventricular ejection fraction (LVEF) (≤40%). Furthermore, we show by continuously monitoring daily aECGs for changes in adverse features, we can stratify patients into low- and high-risk cohorts.

Methods

Cohort and study design

We conducted a retrospective study of patients in the Optum® de-identified electronic health record (EHR) dataset between 2007 and 2021 from multiple hospital systems in the USA. Through a methodology compliant with health insurance portability and accountability act (HIPAA)’s de-identification standard, a third party determined which of those patients with LVEF measurements from the Optum® EHR were concomitantly enrolled in the Medtronic DiscoveryLink data warehouse. This is a manufacturer’s de-identified device data warehouse containing continuous ECG data from fully ILRs. For patients whose data appeared in both datasets, a combined dataset was created that met HIPAA’s de-identification standard. This process is discussed in detail in the Supplementary Appendix. The combined dataset was used to develop the aECG-convolutional neural network (CNN) algorithm. This retrospective analysis using de-identified data falls into the category of non-human research, and no institutional review board approval was indicated.

Development of an ambulatory electrocardiogram-convolutional neural network model to detect left ventricular ejection fraction ≤ 40%

To create the aECG-CNN model development dataset, LVEF measurements from Optum® EHR were matched to single-lead aECGs measured within 7 days (before/after) of the LVEF measurement date. We excluded patients diagnosed with hypertrophic cardiomyopathy (ICD10: I42.1, I42.2; ICD9: 42511, 42518). To ensure robust and reliable LVEF information and minimize errors introduced through automated data extraction from Optum® EHR (Figure 1), we only included patients with prior HF diagnosis to create the low ejection fraction (EF) data (LVEF ≤ 40%). Patients with no HF diagnosis and an EF >40% were used to create the normal EF data (LVEF > 40%). More detailed information on data cleanup is provided in the supplementary material (see Supplementary material online, Figure S1). Electrocardiograms were randomized at the patient level to the training, validation, and testing datasets using a 60%:20%:20% split.

Dataset creation for the ambulatory electrocardiogram-convolutional neural network model development. Schematic indicates the strategy to obtain robust and reliable dataset for model development. To avoid cross-contamination, no patient data are repeated among training, validation, and testing datasets. *Patient count (n = 5829) selected based on several inclusion, exclusion criteria to ensure the quality of LVEF data as they are obtained from Optum® EHR and the dataset was captured in Optum® EHR via natural language processing of procedure/diagnostic notes and prone to natural language processing errors.
Figure 1

Dataset creation for the ambulatory electrocardiogram-convolutional neural network model development. Schematic indicates the strategy to obtain robust and reliable dataset for model development. To avoid cross-contamination, no patient data are repeated among training, validation, and testing datasets. *Patient count (n = 5829) selected based on several inclusion, exclusion criteria to ensure the quality of LVEF data as they are obtained from Optum® EHR and the dataset was captured in Optum® EHR via natural language processing of procedure/diagnostic notes and prone to natural language processing errors.

A custom deep learning convolutional neural network model (aECG-CNN) was trained on an independent training database of LVEF-aECG pairs to detect LVEF ≤ 40% (full architecture listed in the Supplementary Appendix). The model has 24 layers with 1.7 M total learnable parameters. The algorithm predicts the probability (0–1) of LVEF ≤40% from 10 s of aECG data; one indicates the highest probability of LVEF ≤40%. Cross-entropy loss was calculated and minimized using the adaptive moment estimation optimizer.17,18 During training, we performed data augmentation through over-sampling of low EF ECGs, Gaussian noise, and cyclical shifting of the signal. There was no test-time augmentation.

Longitudinal analysis of ambulatory electrocardiogram and assessment of increased risk of future heart failure hospitalization events

There is evidence that fluctuations in LVEF can predict HF hospitalizations.19,20 We hypothesized that the aECG-CNN algorithm has the potential to track the changes in probabilities of LVEF ≤40% using longitudinal aECG data and the degree of change/relative change can be used to assess the dynamic risk of future HF hospitalization events (Figure 2).

Ambulatory electrocardiogram-convolutional neural network model architecture to detect low ejection fraction. Input to the model is scalograms (image) of single-lead ambulatory electrocardiogram signal.
Figure 2

Ambulatory electrocardiogram-convolutional neural network model architecture to detect low ejection fraction. Input to the model is scalograms (image) of single-lead ambulatory electrocardiogram signal.

The proposed algorithm uses the fact that ILRs record a daily ECG and that changes in LVEF may occur over both shorter and longer periods. Therefore, we calculate three running averages of the low EF score (using the ECG) over short (7-day) and medium (15-day) intervals and compare these to a 30-day baseline for each patient. These periods were chosen empirically to reduce the risk of overfitting to the dataset; it was hypothesized that shorter periods than 7 days would comprise too few measurements and periods longer than 15 days risked ‘missing’ the change to act. The specific rules used to assign patients to the high- and low-risk cohort are shown in Table 1.

Table 1

Dynamic risk group stratification based on aECG-CNN probability changes within a month

Risk assessmentFeature set
High riskDefined as large fluctuations  and  sustained fluctuations in predicted LVEF.
Large fluctuations = (Pmax7-Pmax30 ≥ 0.1) AND (Pmax15-Pmean15 ≥ 0.05)
Sustained fluctuations = (Pmax7-Pmax15) ≥ 0.08) for ≥6 Days OR (Pmax7-Pmax30) for ≥5 Days
Low riskAbove conditions not met.
Utilized featuresDescription
Pmax7The peak probability of LVEF ≤ 40% within the 7-day running average.
Pmax15The peak probability of LVEF ≤ 40% within the 15-day running average.
Pmean15The mean probability of LVEF ≤ 40% within the 15-day running average.
Pmax30The peak probability of LVEF ≤ 40% within the 30-day running average.
Risk assessmentFeature set
High riskDefined as large fluctuations  and  sustained fluctuations in predicted LVEF.
Large fluctuations = (Pmax7-Pmax30 ≥ 0.1) AND (Pmax15-Pmean15 ≥ 0.05)
Sustained fluctuations = (Pmax7-Pmax15) ≥ 0.08) for ≥6 Days OR (Pmax7-Pmax30) for ≥5 Days
Low riskAbove conditions not met.
Utilized featuresDescription
Pmax7The peak probability of LVEF ≤ 40% within the 7-day running average.
Pmax15The peak probability of LVEF ≤ 40% within the 15-day running average.
Pmean15The mean probability of LVEF ≤ 40% within the 15-day running average.
Pmax30The peak probability of LVEF ≤ 40% within the 30-day running average.
Table 1

Dynamic risk group stratification based on aECG-CNN probability changes within a month

Risk assessmentFeature set
High riskDefined as large fluctuations  and  sustained fluctuations in predicted LVEF.
Large fluctuations = (Pmax7-Pmax30 ≥ 0.1) AND (Pmax15-Pmean15 ≥ 0.05)
Sustained fluctuations = (Pmax7-Pmax15) ≥ 0.08) for ≥6 Days OR (Pmax7-Pmax30) for ≥5 Days
Low riskAbove conditions not met.
Utilized featuresDescription
Pmax7The peak probability of LVEF ≤ 40% within the 7-day running average.
Pmax15The peak probability of LVEF ≤ 40% within the 15-day running average.
Pmean15The mean probability of LVEF ≤ 40% within the 15-day running average.
Pmax30The peak probability of LVEF ≤ 40% within the 30-day running average.
Risk assessmentFeature set
High riskDefined as large fluctuations  and  sustained fluctuations in predicted LVEF.
Large fluctuations = (Pmax7-Pmax30 ≥ 0.1) AND (Pmax15-Pmean15 ≥ 0.05)
Sustained fluctuations = (Pmax7-Pmax15) ≥ 0.08) for ≥6 Days OR (Pmax7-Pmax30) for ≥5 Days
Low riskAbove conditions not met.
Utilized featuresDescription
Pmax7The peak probability of LVEF ≤ 40% within the 7-day running average.
Pmax15The peak probability of LVEF ≤ 40% within the 15-day running average.
Pmean15The mean probability of LVEF ≤ 40% within the 15-day running average.
Pmax30The peak probability of LVEF ≤ 40% within the 30-day running average.

To test this algorithm, we used a real-world cohort of ILR patients with HF hospitalization prior to implant and follow-up ECGs for at least 6 months post-implant (Figure 3). Heart failure hospitalization was used as the endpoint and was defined as an inpatient, emergency department, or observation unit stay in a hospital with a primary discharge diagnosis of HF. Primary diagnosis of HF was ascertained based on ICD9/ICD10 codes: 428.X, 402.01, 402.11, 402.91, 404.01, 404.03, 404.11, 404.13, 404.91, 404.93, I50.X, I11.0, I13.0, and I13.2 as has been described previously.21

Schematic indicates the dataset selection for longitudinal analysis of ambulatory electrocardiogram-convolutional neural network algorithm to assess the increased risk of future heart failure events.
Figure 3

Schematic indicates the dataset selection for longitudinal analysis of ambulatory electrocardiogram-convolutional neural network algorithm to assess the increased risk of future heart failure events.

The ability of aECG-CNN to predict HF hospitalization was assessed by simulating monthly follow-ups which consisted of looking at the aECG-CNN risk states across 30 days and evaluating the occurrence of clinical events in the following 30 days (Figure 4) as has been described previously.5

Every 30 days the previous 30 days are evaluated for risk group based on ambulatory electrocardiogram-convolutional neural network, and then, the subsequent 30 days are evaluated for heart failure event. Start indicates implant date.
Figure 4

Every 30 days the previous 30 days are evaluated for risk group based on ambulatory electrocardiogram-convolutional neural network, and then, the subsequent 30 days are evaluated for heart failure event. Start indicates implant date.

Statistical analysis

The performance of the algorithm to identify LVEF ≤40% was assessed using the area under the receiver operating characteristic (AUROC) curve. To assess the algorithm’s ability to predict HF hospitalization, a marginal Cox proportional hazards model was used to calculate the hazard ratios (HRs) and 95% confidence intervals (CIs) for patients identified by the algorithm as high vs. low risk. The marginal model adjusts for multiple observations within individual subjects. Kaplan–Meier analysis was performed. Patients were right censored at the last day of EHR data availability if <30 days. Cox regression models were used for adjusting several clinical variables, including age, gender, race, and comorbidities such as diabetes, hypertension, and history of myocardial infarction (MI). Baseline variables that may be manipulated because of worsening HF (e.g. HF medications) were not adjusted for. All statistical analyses were performed in Minitab (v20.1.3) and SAS (v9.4).

Results

Ambulatory electrocardiogram-convolutional neural network algorithm to predict left ventricular ejection fraction ≤ 40%

The aECG-CNN model was trained on a dataset of 35 741 LVEF-ECG pairs from 2249 patients to predict the probability of LVEF ≤40%. The model was validated on an independent dataset of 6721 LVEF-ECG pairs from 750 patients and tested on another independent dataset of 6611 LVEF-ECG pairs from 750 patients. The baseline characteristics of total dataset used for the development, validation, and testing of the aECG-CNN model are provided in Table 2. The model yielded an accuracy, a sensitivity, a specificity, and an AUROC of 75%, 70%, 76%, and 0.8, respectively, in the independent test dataset. Full event counts as a 2 × 2 table, precision, and F1 scores are provided in the Supplementary Appendix, Supplementary material online, Tables S1 and S2. The receiver operating characteristics curve in Figure 5 provides the performance of model on test dataset.

The diagram shows the receiver operating characteristic of ambulatory electrocardiogram-convolutional neural network classification model on test dataset.
Figure 5

The diagram shows the receiver operating characteristic of ambulatory electrocardiogram-convolutional neural network classification model on test dataset.

Table 2

Baseline patient characteristics of aECG-CNN algorithm development/validation/testing set

Clinical historyAll (n = 3749)Training set
(n = 2249)
Validation set
(n = 750)
Test set
(n = 750)
P-value
Mean age (years)68 (14)68 (14)68 (14)68 (14)0.69
Male gender2009 (53.6)1202 (53.4)401 (53.4)406 (54.1)0.94
Mean LVEF57.9 (8.8)57.9 (8.8)58.1 (8.9)57.97 (8.9)0.17
Race0.43
African American487 (13.0)294 (13.1)110 (14.7)83 (11.1)
Caucasian3006 (80.2)1794 (79.7)595 (79.3)617 (82.2)
Asian42 (1.1)25 (1.1)9 (1.2)8 (1.1)
Others214 (5.7)136 (6.1)36 (4.8)42 (5.6)
Ethnicity
Hispanic169 (4.7)105 (4.7)31 (4.1)33 (4.4)0.92
Comorbidities
Hypertension2678 (71.4)1602 (71.2)535 (71.3)541 (72.1)0.89
History of MI684 (18.2)405 (18.0)136 (18.1)143 (19.1)0.81
Diabetes mellitus1133 (30.2)659 (29.3)242 (32.3)232 (30.9)0.27
Cardiomyopathy566 (15.1)336 (14.9)122 (16.2)108 (14.4)0.57
Atrial fibrillation1154 (30.8)702 (31.2)230 (30.7)222 (29.6)0.71
Chronic obstructive pulmonary disease49 (1.3)33 (1.5)7 (0.93)9 (1.2)0.52
Heart failure296 (7.9)177 (7.8)63 (8.4)56 (7.5)0.79
Clinical historyAll (n = 3749)Training set
(n = 2249)
Validation set
(n = 750)
Test set
(n = 750)
P-value
Mean age (years)68 (14)68 (14)68 (14)68 (14)0.69
Male gender2009 (53.6)1202 (53.4)401 (53.4)406 (54.1)0.94
Mean LVEF57.9 (8.8)57.9 (8.8)58.1 (8.9)57.97 (8.9)0.17
Race0.43
African American487 (13.0)294 (13.1)110 (14.7)83 (11.1)
Caucasian3006 (80.2)1794 (79.7)595 (79.3)617 (82.2)
Asian42 (1.1)25 (1.1)9 (1.2)8 (1.1)
Others214 (5.7)136 (6.1)36 (4.8)42 (5.6)
Ethnicity
Hispanic169 (4.7)105 (4.7)31 (4.1)33 (4.4)0.92
Comorbidities
Hypertension2678 (71.4)1602 (71.2)535 (71.3)541 (72.1)0.89
History of MI684 (18.2)405 (18.0)136 (18.1)143 (19.1)0.81
Diabetes mellitus1133 (30.2)659 (29.3)242 (32.3)232 (30.9)0.27
Cardiomyopathy566 (15.1)336 (14.9)122 (16.2)108 (14.4)0.57
Atrial fibrillation1154 (30.8)702 (31.2)230 (30.7)222 (29.6)0.71
Chronic obstructive pulmonary disease49 (1.3)33 (1.5)7 (0.93)9 (1.2)0.52
Heart failure296 (7.9)177 (7.8)63 (8.4)56 (7.5)0.79

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Table 2

Baseline patient characteristics of aECG-CNN algorithm development/validation/testing set

Clinical historyAll (n = 3749)Training set
(n = 2249)
Validation set
(n = 750)
Test set
(n = 750)
P-value
Mean age (years)68 (14)68 (14)68 (14)68 (14)0.69
Male gender2009 (53.6)1202 (53.4)401 (53.4)406 (54.1)0.94
Mean LVEF57.9 (8.8)57.9 (8.8)58.1 (8.9)57.97 (8.9)0.17
Race0.43
African American487 (13.0)294 (13.1)110 (14.7)83 (11.1)
Caucasian3006 (80.2)1794 (79.7)595 (79.3)617 (82.2)
Asian42 (1.1)25 (1.1)9 (1.2)8 (1.1)
Others214 (5.7)136 (6.1)36 (4.8)42 (5.6)
Ethnicity
Hispanic169 (4.7)105 (4.7)31 (4.1)33 (4.4)0.92
Comorbidities
Hypertension2678 (71.4)1602 (71.2)535 (71.3)541 (72.1)0.89
History of MI684 (18.2)405 (18.0)136 (18.1)143 (19.1)0.81
Diabetes mellitus1133 (30.2)659 (29.3)242 (32.3)232 (30.9)0.27
Cardiomyopathy566 (15.1)336 (14.9)122 (16.2)108 (14.4)0.57
Atrial fibrillation1154 (30.8)702 (31.2)230 (30.7)222 (29.6)0.71
Chronic obstructive pulmonary disease49 (1.3)33 (1.5)7 (0.93)9 (1.2)0.52
Heart failure296 (7.9)177 (7.8)63 (8.4)56 (7.5)0.79
Clinical historyAll (n = 3749)Training set
(n = 2249)
Validation set
(n = 750)
Test set
(n = 750)
P-value
Mean age (years)68 (14)68 (14)68 (14)68 (14)0.69
Male gender2009 (53.6)1202 (53.4)401 (53.4)406 (54.1)0.94
Mean LVEF57.9 (8.8)57.9 (8.8)58.1 (8.9)57.97 (8.9)0.17
Race0.43
African American487 (13.0)294 (13.1)110 (14.7)83 (11.1)
Caucasian3006 (80.2)1794 (79.7)595 (79.3)617 (82.2)
Asian42 (1.1)25 (1.1)9 (1.2)8 (1.1)
Others214 (5.7)136 (6.1)36 (4.8)42 (5.6)
Ethnicity
Hispanic169 (4.7)105 (4.7)31 (4.1)33 (4.4)0.92
Comorbidities
Hypertension2678 (71.4)1602 (71.2)535 (71.3)541 (72.1)0.89
History of MI684 (18.2)405 (18.0)136 (18.1)143 (19.1)0.81
Diabetes mellitus1133 (30.2)659 (29.3)242 (32.3)232 (30.9)0.27
Cardiomyopathy566 (15.1)336 (14.9)122 (16.2)108 (14.4)0.57
Atrial fibrillation1154 (30.8)702 (31.2)230 (30.7)222 (29.6)0.71
Chronic obstructive pulmonary disease49 (1.3)33 (1.5)7 (0.93)9 (1.2)0.52
Heart failure296 (7.9)177 (7.8)63 (8.4)56 (7.5)0.79

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Ambulatory electrocardiogram-convolutional neural network algorithm to assess dynamic risk of heart failure hospitalization

A total of 909 patients with 12 467 follow-up monthly evaluations were analysed to assess the risk of future HF hospitalization based on the relative change in probability of aECG-CNN model inference output. The mean follow-up was 13.7 months from implant date. A total of 201 monthly evaluations (1.6%) had HF hospitalization in the next 30 days. The baseline characteristics of the patient cohort are provided in Table 3, including the reason for ILR implantation. The average age of patients was 68 ± 13 years, with 51% males. The prevalence of comorbidities diagnosed prior to ILR implant was as follows: diabetes (39%), hypertension (95%), dilated cardiomyopathy (25%), congestive HF (99%), valvular disease (3%), coronary artery disease (75%), history of MI (56%), renal dysfunction (52%), ventricular arrhythmia (30%), and atrial fibrillation (57%). Based on the change in probability of LVEF ≤40% during monthly evaluations, 1830 evaluations (14.7%) were categorized as high risk and 10 637 evaluations (85.3%) as low risk. Figure 6 shows the Kaplan–Meier curve of cumulative incidence of monthly evaluations with subsequent HF hospitalization events in the next 30 days among two risk groups identified by aECG-CNN model. Patients aECG-CNN identified as high risk were almost twice as likely to have HF hospitalization (HR 1.89; 95% CI 1.28–2.79; P = 0.001) than those it identified as low risk (Table 4). This performance was unchanged after adjusting for patient demographics (HR 1.88, 95% CI 1.27–2.79; P = 0.002).

The diagram shows the Kaplan–Meier analysis of cumulative incidence of monthly evaluations with subsequent heart failure hospitalization among high-risk and low-risk category. 494 patients contributed solely to the high-risk group, 901 solely to the low-risk group, and 486 to both groups.
Figure 6

The diagram shows the Kaplan–Meier analysis of cumulative incidence of monthly evaluations with subsequent heart failure hospitalization among high-risk and low-risk category. 494 patients contributed solely to the high-risk group, 901 solely to the low-risk group, and 486 to both groups.

Table 3

Baseline demographics of patients in the study

Clinical historyTotal (n = 909)
Mean age68 (13)
Male gender464 (51.1)
Race
 African American163 (17.9)
 Asian7 (0.8)
 Caucasian695 (76.5)
 Other44 (4.8)
Ethnicity, Hispanic44 (4.8)
Baseline LVEF53.5 (13.5)
NT-proBNP1512 (3983)
Comorbidities
 Diabetes mellitus354 (38.9)
 Hypertension861 (94.7)
 Renal dysfunction475 (52.3)
 History of MI512 (56.3)
 Atrial fibrillation522 (57.4)
 Ventricular arrhythmias277 (30.5)
 Coronary artery disease684 (75.2)
 Congestive heart failure899 (98.9)
 Dilated cardiomyopathy225 (24.7)
 Hypertrophic cardiomyopathy16 (1.7)
 Valvular heart disease28 (3.1)
Medications history
Angiotensin-converting enzyme inhibitors/Angiotensin receptor blockers478 (52.6)
 Anticoagulants494 (54.3)
 Diuretics811 (89.2)
 Entresto21 (2.3)
 Spironolactone219 (24.1)
 Beta-blockers710 (78.1)
 Vasodilator–nitrate468 (51.5)
Antiarrhythmic Drugs
 Class I/III/IV289 (31.8)
 Class I48 (5.2)
 Class III/IV272 (29.9)
Reason for ICM monitoring
 AF ablation monitoring33 (3.6)
 AF management126 (13.8)
 Cryptogenic stroke162 (17.8)
 Palpitations42 (4.6)
 Suspected AF62 (6.8)
 Syncope341 (37.5)
 Ventricular tachycardia21 (2.3)
 Other/unknown122 (13.4)
Clinical historyTotal (n = 909)
Mean age68 (13)
Male gender464 (51.1)
Race
 African American163 (17.9)
 Asian7 (0.8)
 Caucasian695 (76.5)
 Other44 (4.8)
Ethnicity, Hispanic44 (4.8)
Baseline LVEF53.5 (13.5)
NT-proBNP1512 (3983)
Comorbidities
 Diabetes mellitus354 (38.9)
 Hypertension861 (94.7)
 Renal dysfunction475 (52.3)
 History of MI512 (56.3)
 Atrial fibrillation522 (57.4)
 Ventricular arrhythmias277 (30.5)
 Coronary artery disease684 (75.2)
 Congestive heart failure899 (98.9)
 Dilated cardiomyopathy225 (24.7)
 Hypertrophic cardiomyopathy16 (1.7)
 Valvular heart disease28 (3.1)
Medications history
Angiotensin-converting enzyme inhibitors/Angiotensin receptor blockers478 (52.6)
 Anticoagulants494 (54.3)
 Diuretics811 (89.2)
 Entresto21 (2.3)
 Spironolactone219 (24.1)
 Beta-blockers710 (78.1)
 Vasodilator–nitrate468 (51.5)
Antiarrhythmic Drugs
 Class I/III/IV289 (31.8)
 Class I48 (5.2)
 Class III/IV272 (29.9)
Reason for ICM monitoring
 AF ablation monitoring33 (3.6)
 AF management126 (13.8)
 Cryptogenic stroke162 (17.8)
 Palpitations42 (4.6)
 Suspected AF62 (6.8)
 Syncope341 (37.5)
 Ventricular tachycardia21 (2.3)
 Other/unknown122 (13.4)

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Table 3

Baseline demographics of patients in the study

Clinical historyTotal (n = 909)
Mean age68 (13)
Male gender464 (51.1)
Race
 African American163 (17.9)
 Asian7 (0.8)
 Caucasian695 (76.5)
 Other44 (4.8)
Ethnicity, Hispanic44 (4.8)
Baseline LVEF53.5 (13.5)
NT-proBNP1512 (3983)
Comorbidities
 Diabetes mellitus354 (38.9)
 Hypertension861 (94.7)
 Renal dysfunction475 (52.3)
 History of MI512 (56.3)
 Atrial fibrillation522 (57.4)
 Ventricular arrhythmias277 (30.5)
 Coronary artery disease684 (75.2)
 Congestive heart failure899 (98.9)
 Dilated cardiomyopathy225 (24.7)
 Hypertrophic cardiomyopathy16 (1.7)
 Valvular heart disease28 (3.1)
Medications history
Angiotensin-converting enzyme inhibitors/Angiotensin receptor blockers478 (52.6)
 Anticoagulants494 (54.3)
 Diuretics811 (89.2)
 Entresto21 (2.3)
 Spironolactone219 (24.1)
 Beta-blockers710 (78.1)
 Vasodilator–nitrate468 (51.5)
Antiarrhythmic Drugs
 Class I/III/IV289 (31.8)
 Class I48 (5.2)
 Class III/IV272 (29.9)
Reason for ICM monitoring
 AF ablation monitoring33 (3.6)
 AF management126 (13.8)
 Cryptogenic stroke162 (17.8)
 Palpitations42 (4.6)
 Suspected AF62 (6.8)
 Syncope341 (37.5)
 Ventricular tachycardia21 (2.3)
 Other/unknown122 (13.4)
Clinical historyTotal (n = 909)
Mean age68 (13)
Male gender464 (51.1)
Race
 African American163 (17.9)
 Asian7 (0.8)
 Caucasian695 (76.5)
 Other44 (4.8)
Ethnicity, Hispanic44 (4.8)
Baseline LVEF53.5 (13.5)
NT-proBNP1512 (3983)
Comorbidities
 Diabetes mellitus354 (38.9)
 Hypertension861 (94.7)
 Renal dysfunction475 (52.3)
 History of MI512 (56.3)
 Atrial fibrillation522 (57.4)
 Ventricular arrhythmias277 (30.5)
 Coronary artery disease684 (75.2)
 Congestive heart failure899 (98.9)
 Dilated cardiomyopathy225 (24.7)
 Hypertrophic cardiomyopathy16 (1.7)
 Valvular heart disease28 (3.1)
Medications history
Angiotensin-converting enzyme inhibitors/Angiotensin receptor blockers478 (52.6)
 Anticoagulants494 (54.3)
 Diuretics811 (89.2)
 Entresto21 (2.3)
 Spironolactone219 (24.1)
 Beta-blockers710 (78.1)
 Vasodilator–nitrate468 (51.5)
Antiarrhythmic Drugs
 Class I/III/IV289 (31.8)
 Class I48 (5.2)
 Class III/IV272 (29.9)
Reason for ICM monitoring
 AF ablation monitoring33 (3.6)
 AF management126 (13.8)
 Cryptogenic stroke162 (17.8)
 Palpitations42 (4.6)
 Suspected AF62 (6.8)
 Syncope341 (37.5)
 Ventricular tachycardia21 (2.3)
 Other/unknown122 (13.4)

Discrete data are presented as counts (percentage) and continuous measurements as mean (standard deviation).

Table 4

HF event rate comparison between different risk groups based on aECG-CNN in ILR patients with history of HF events any time prior to implant

Diagnostic parameterNumber of evaluations (%)Number of HF events (% of evaluations)Hazard ratio (95% CI)P-value
Risk group based on aECG-CNN0.001
Low10 637 (85.3)152 (1.4)Reference
High1830 (14.7)49 (2.7)1.89 (1.28–2.79)
Diagnostic parameterNumber of evaluations (%)Number of HF events (% of evaluations)Hazard ratio (95% CI)P-value
Risk group based on aECG-CNN0.001
Low10 637 (85.3)152 (1.4)Reference
High1830 (14.7)49 (2.7)1.89 (1.28–2.79)
Table 4

HF event rate comparison between different risk groups based on aECG-CNN in ILR patients with history of HF events any time prior to implant

Diagnostic parameterNumber of evaluations (%)Number of HF events (% of evaluations)Hazard ratio (95% CI)P-value
Risk group based on aECG-CNN0.001
Low10 637 (85.3)152 (1.4)Reference
High1830 (14.7)49 (2.7)1.89 (1.28–2.79)
Diagnostic parameterNumber of evaluations (%)Number of HF events (% of evaluations)Hazard ratio (95% CI)P-value
Risk group based on aECG-CNN0.001
Low10 637 (85.3)152 (1.4)Reference
High1830 (14.7)49 (2.7)1.89 (1.28–2.79)

Discussion

In this study, we show that an AI algorithm trained to identify patients with impaired LVEF from ILR aECGs can be used to dynamically predict the risk of HF hospitalization over the next 30 days. Early identification of these patients would allow timely cardiovascular care including appropriate diagnostic testing, medication optimization, or even device-based therapies if deemed necessary which may improve patient outcomes and ease acute healthcare provision resource utilization.3,6,8

This model has the potential to reside in a remote monitoring platform and continuously access the changes in aECG to monitor the probability of an instance of LVEF ≤ 40% and subsequent increase of worsening HF hospitalization risk. Previous studies have shown the utility of standard 12-lead ECG markers to assess future HF events, and indeed, some of these report higher area under the curves (AUCs) than this study.11–15,22–25 However, these previous studies typically rely on 12-lead ECGs. This requirement limits the clinical utility of this approach. No technology exists that readily permits the regular automated gathering of continuous 12-lead aECGs that could serve as a telemonitoring solution. Ambulatory electrocardiogram-CNN, however, provides a daily single-lead aECGs from ILRs to provide the required continuous dynamic daily risk score for HF hospitalization, at no extra burden for patients with these devices. Such a system may be analogous to other implantable monitoring techniques that exist for ambulatory monitoring of increased risk of worsening HF.3–10 However, by using the ECG to directly predict cardiac function, this approach may provide more orthogonal inputs to existing biosensors of pressure, impedance, and physical activity.

Besides providing ability for diagnosis and monitoring of cardiac arrhythmia,26–29 the ILR devices also monitor diagnostic parameter and store aggregated daily measurements longitudinally over a long period of time. These diagnostic parameters include nighttime heart rate and daytime heart rate, atrial fibrillation burden, ventricular rate during atrial fibrillation (AF), heart rate variability, and activity duration. Additionally, other sensors such as subcutaneous impedance are being investigated for measurement of fluid and respiration rate. Recently, an approach of combining multiple diagnostic parameters in an ILR device using a Bayesian belief network machine learning model for identifying patients at risk of worsening HF was reported for ILR devices in patients with HF with reduced and preserved EF.30 Ambulatory electrocardiogram-CNN could readily be added into these ambulatory HF event risk prediction platforms potentially delivering further improvement in the performance of the currently available multi-parameter algorithms. Whether such a risk scoring methodology for ambulatory management of patients with HF will improve patient outcomes requires prospective evaluation. A randomized control study is currently being conducted to investigate whether ILR-based ambulatory management can improve outcomes in Class II and III patients with HF with reduced LVEF or preserved LVEF (ALLEVIATE-HF study: NCT04452149).

Finally, the aECG-CNN-based monitoring can also identify patients with potential asymptomatic or mildly symptomatic reduction in LVEF in an ambulatory setting—indeed, this is what the model was trained to identify. If the model were to suspect new left ventricular impairment in a previously healthy patient, this could trigger confirmatory diagnostic testing (e.g. echocardiography) which could lead to the earlier identification of patients who could benefit from prognostic medication and device therapy again with potential benefits for patients and healthcare services alike.

Study limitations

We have relied on EHRs for identifying LVEF measurements, clinical endpoints, and the presence of comorbidities. Such approaches are susceptible to data entry errors and faults in natural language processing. We aimed to minimize these by requiring concordant ICD codes.

Single-lead aECG is derived from a short dipole, in a non-standard orientation, which varies from patient to patient. This almost certainly limits the ability of aECG-CNN to identify pathology and predict events when compared with standard 12-lead ECGs. However, because ILRs provide continuous daily ECG monitoring, our system can rely on intra-patient changes in HF probabilities, which is not currently available with 12-lead ECGs.

Although this study included over 2000 patients, a prospective study could be useful in demonstrating the system’s clinical utility. First, this would demonstrate the generalizability of the algorithm to an external prospective dataset, rather than an anonymized electronic health record. Second, this would allow the additive predictive power of the system to be ascertained, beyond those of existing telemonitoring biosignals.

Conclusions

This study found that an aECG-CNN algorithm trained to detect reduced LVEF using single-lead aECGs acquired by an ILR can identify patients at increased risk of future HF events. Further investigation is needed to evaluate whether HF risk prediction based on this ECG-based deep learning model can be used to predict and prevent HF events in a prospective study. If this is found to be the case, the benefits for patients and healthcare systems alike would likely be large.

Supplementary material

Supplementary material is available at European Heart Journal – Digital Health.

Acknowledgements

J.H. was funded by the British Heart Foundation (FS/ICRF/22/26039). D.K. was supported by the NIHR Imperial BRC. N.V., S.L., S.S., and J.K. were funded by Medtronic, Minneapolis, MN, USA.

Funding

This study was funded by the British Heart Foundation (FS/ICRF/22/26039), Medtronic, and NIHR Imperial BRC. The funders had no role in the study design, data collection, data analysis, data interpretation, or writing of the report.

Data availability

All patients provided consent to use their de-identified device data for research purposes when they sign up for Medtronic CareLink Network. As per the contractual data access, the de-identified data cannot be shared.

References

1

Heidenreich
 
PA
,
Fonarow
 
GC
,
Opsha
 
Y
,
Sandhu
 
AT
,
Sweitzer
 
NK
,
Warraich
 
HJ
, et al.  
Economic issues in heart failure in the United States
.
J Card Fail
 
2022
;
28
:
453
466
.

2

Cook
 
C
,
Cole
 
G
,
Asaria
 
P
,
Jabbour
 
R
,
Francis
 
DP
.
The annual global economic burden of heart failure
.
Int J Cardiol
 
2014
;
171
:
368
376
.

3

Abraham
 
WT
,
Adamson
 
PB
,
Bourge
 
RC
,
Aaron
 
MF
,
Costanzo
 
MR
,
Stevenson
 
LW
, et al.  
Wireless pulmonary artery haemodynamic monitoring in chronic heart failure: a randomised controlled trial [published correction appears in Lancet. 2012 Feb 4; 379(9814):412]
.
Lancet
 
2011
;
377
:
658
666
.

4

Brachmann
 
J
,
Böhm
 
M
,
Rybak
 
K
,
Klein
 
G
,
Butter
 
C
,
Klemm
 
H
, et al.  
Fluid status monitoring with a wireless network to reduce cardiovascular-related hospitalizations and mortality in heart failure: rationale and design of the OptiLink HF Study (Optimization of Heart Failure Management using OptiVol Fluid Status Monitoring and CareLink)
.
Eur J Heart Fail
 
2011
;
13
:
796
804
.

5

Cowie
 
MR
,
Sarkar
 
S
,
Koehler
 
J
,
Whellan
 
DJ
,
Crossley
 
GH
,
Tang
 
WHW
, et al.  
Development and validation of an integrated diagnostic algorithm derived from parameters monitored in implantable devices for identifying patients at risk for heart failure hospitalization in an ambulatory setting
.
Eur Heart J
 
2013
;
34
:
2472
2480
.

6

Hindricks
 
G
,
Taborsky
 
M
,
Glikson
 
M
,
Heinrich
 
U
,
Schumacher
 
B
,
Katz
 
A
, et al.  
Implant-based multiparameter telemonitoring of patients with heart failure (IN-TIME): a randomised controlled trial
.
Lancet
 
2014
;
384
:
583
590
.

7

Boehmer
 
JP
,
Hariharan
 
R
,
Devecchi
 
FG
,
Smith
 
AL
,
Molon
 
G
,
Capucci
 
A
, et al.  
A multisensor algorithm predicts heart failure events in patients with implanted devices: results from the MultiSENSE study
.
JACC Heart Fail
 
2017
;
5
:
216
225
.

8

Koehler
 
F
,
Koehler
 
K
,
Deckwart
 
O
,
Prescher
 
S
,
Wegscheider
 
K
,
Winkler
 
S
, et al.  
Telemedical Interventional Management in Heart Failure II (TIM-HF2), a randomised, controlled trial investigating the impact of telemedicine on unplanned cardiovascular hospitalisations and mortality in heart failure patients: study design and description of the intervention
.
Eur J Heart Fail
 
2018
;
20
:
1485
1493
.

9

Ahmed
 
FZ
,
Taylor
 
JK
,
Green
 
C
,
Moore
 
L
,
Goode
 
A
,
Black
 
P
, et al.  
Triage-HF plus: a novel device-based remote monitoring pathway to identify worsening heart failure
.
ESC Heart Fail
 
2020
;
7
:
107
116
.

10

Zile
 
MR
,
Koehler
 
J
,
Sarkar
 
S
,
Butler
 
J
.
Prediction of worsening heart failure events and all-cause mortality using an individualized risk stratification strategy
.
ESC Heart Fail
 
2020
;
7
:
4277
4289
.

11

Adedinsewo
 
D
,
Carter
 
RE
,
Attia
 
Z
,
Johnson
 
P
,
Kashou
 
AH
,
Dugan
 
JL
, et al.  
Artificial intelligence-enabled ECG algorithm to identify patients with left ventricular systolic dysfunction presenting to the emergency department with dyspnea
.
Circ Arrhythm Electrophysiol
 
2020
;
13
:
e008437
.

12

Attia
 
ZI
,
Kapa
 
S
,
Yao
 
X
,
Lopez-Jimenez
 
F
,
Mohan
 
TL
,
Pellikka
 
PA
, et al.  
Prospective validation of a deep learning electrocardiogram algorithm for the detection of left ventricular systolic dysfunction
.
J Cardiovasc Electrophysiol
 
2019
;
30
:
668
674
.

13

Bachtiger
 
P
,
Petri
 
CF
,
Scott
 
FE
,
Park
 
SR
,
Kelshiker
 
MA
,
Sahemey
 
HK
, et al.  
Point-of-care screening for heart failure with reduced ejection fraction using artificial intelligence during ECG-enabled stethoscope examination in London, UK: a prospective, observational, multicentre study
.
Lancet Digit Health
 
2022
;
4
:
e117
e125
.

14

Acharya
 
UR
,
Fujita
 
H
,
Oh
 
SL
,
Hagiwara
 
Y
,
Tan
 
JH
,
Adam
 
M
, et al.  
Deep convolutional neural network for the automated diagnosis of congestive heart failure using ECG signals
.
Appl Intell
 
2019
;
49
:
16
27
.

15

Attia
 
ZI
,
Kapa
 
S
,
Lopez-Jimenez
 
F
,
McKie
 
PM
,
Ladewig
 
DJ
,
Satam
 
G
, et al.  
Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram
.
Nat Med
 
2019
;
25
:
70
74
.

16

Lih
 
OS
,
Jahmunah
 
V
,
San
 
TR
,
Ciaccio
 
EJ
,
Yamakawa
 
T
,
Tanabe
 
M
, et al.  
Comprehensive electrocardiographic diagnosis based on deep learning
.
Artif Intell Med
 
2020
;
103
:
101789
.

17

Ho
 
Y
,
Wookey
 
S
.
The real-world-weight cross-entropy loss function: modeling the costs of mislabeling
.
IEEE
 
2019
;
8
:
4806
4813
.

18

Zhang
 
Z.
 
Improved Adam optimizer for deep neural networks. 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), Banff, AB, Canada 2018, pp. 1–2
. doi:

19

Kakihana
 
Y
,
Ito
 
T
,
Nakahara
 
M
,
Yamaguchi
 
K
,
Yasuda
 
T
.
Sepsis-induced myocardial dysfunction: pathophysiology and management
.
J Intensive Care
 
2016
;
4
:
22
.

20

Parikh
 
RV
,
Go
 
AS
,
Bhatt
 
AS
,
Tan
 
TC
,
Allen
 
AR
,
Feng
 
KY
, et al.  
Developing clinical risk prediction models for worsening heart failure events and death by left ventricular ejection fraction
.
J Am Heart Assoc
 
2023
;
12
:
e029736
.

21

Zile
 
MR
,
Kahwash
 
R
,
Sarkar
 
S
,
Koehler
 
J
,
Butler
 
J
.
Temporal characteristics of device-based individual and integrated risk metrics in patients with chronic heart failure
.
JACC Heart Fail
 
2023
;
11
:
143
156
.

22

O’Neal
 
WT
,
Mazur
 
M
,
Bertoni
 
AG
,
Bluemke
 
DA
,
Al-Mallah
 
MH
,
Lima
 
JAC
, et al.  
Electrocardiographic predictors of heart failure with reduced versus preserved ejection fraction: the multi-ethnic study of atherosclerosis
.
J Am Heart Assoc
 
2017
;
6
:
e006023
.

23

Rautaharju
 
PM
,
Prineas
 
RJ
,
Wood
 
J
,
Zhang
 
ZM
,
Crow
 
R
,
Heiss
 
G
.
Electrocardiographic predictors of new-onset heart failure in men and in women free of coronary heart disease (from the Atherosclerosis in Communities [ARIC] Study)
.
Am J Cardiol
 
2007
;
100
:
1437
1441
.

24

Ahmad
 
MI
,
Mujtaba
 
M
,
Floyd
 
JS
,
Chen
 
LY
,
Soliman
 
EZ
.
Electrocardiographic markers of atrial cardiomyopathy and risk of heart failure in the multi-ethnic study of atherosclerosis (MESA) cohort
.
Front Cardiovasc Med
 
2023
;
10
:
1143338
.

25

Reinier
 
K
,
Narayanan
 
K
,
Uy-Evanado
 
A
,
Teodorescu
 
C
,
Chugh
 
H
,
Mack
 
WJ
, et al.  
Electrocardiographic markers and the left ventricular ejection fraction have cumulative effects on risk of sudden cardiac death
.
JACC Clin Electrophysiol
 
2015
;
1
:
542
550
.

26

Krahn
 
AD
,
Klein
 
GJ
,
Yee
 
R
,
Takle-Newhouse
 
T
,
Norris
 
C
.
Use of an extended monitoring strategy in patients with problematic syncope. Reveal investigators
.
Circulation
 
1999
;
99
:
406
410
.

27

Sanders
 
P
,
Pürerfellner
 
H
,
Pokushalov
 
E
,
Sarkar
 
S
,
Di Bacco
 
M
,
Maus
 
B
, et al.  
Performance of a new atrial fibrillation detection algorithm in a miniaturized insertable cardiac monitor: results from the Reveal LINQ Usability Study
.
Heart Rhythm
 
2016
;
13
:
1425
1430
.

28

Verma
 
A
,
Champagne
 
J
,
Sapp
 
J
,
Essebag
 
V
,
Novak
 
P
,
Skanes
 
A
, et al.  
Discerning the incidence of symptomatic and asymptomatic episodes of atrial fibrillation before and after catheter ablation (DISCERN AF): a prospective, multicenter study
.
JAMA Intern Med
 
2013
;
173
:
149
156
.

29

Sanna
 
T
,
Diener
 
HC
,
Passman
 
RS
,
Di Lazzaro
 
V
,
Bernstein
 
RA
,
Morillo
 
CA
, et al.  
Cryptogenic stroke and underlying atrial fibrillation
.
N Engl J Med
 
2014
;
370
:
2478
2486
.

30

Zile
 
MR
,
Kahwash
 
R
,
Sarkar
 
S
,
Koehler
 
J
,
Zielinski
 
T
,
Mehra
 
MR
, et al.  
A novel heart failure diagnostic risk score using a minimally invasive subcutaneous insertable cardiac monitor
.
JACC Heart Fail
 
2024
;
12
:
182
196
.

Author notes

Conflict of interest: D.K. is a consultant for Medtronic Inc., and N.V., S.L., S.S., and J.K. are employees and share holders of Medtronic Inc.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data