Toward alert triage: scalable qualitative coding framework for analyzing alert notes from the Telehealth Intervention Program for Seniors (TIPS)

Keyword match results of non-annotated alert notes (N = 24 579)

Code	No. of keywords (N = 108 total)	Alert notes retrieved from keyword search (N = 22 717)
Code	No. of keywords (N = 108 total)	No. of alert notes	Percent (%)
Review history	13	12 556	51.08
Technical error	6	467	1.90
Within normal limit	6	8208	33.39
False alarm	6	1051	4.28
Reading success	11	12 919	52.56
No further action needed	4	1962	7.98
Change improved	8	659	2.68
Change worsen	5	4658	18.95
Change unknown	10	11 418	46.45
Continue monitoring	7	202	0.82
Instruction	7	1507	6.13
Follow-up	7	4429	18.02
Nurse call patient	9	5988	24.36
Discuss symptoms	9	3590	14.61
Leave message	7	2559	10.41
Wrong number	2	135	0.55
Patient return call	4	111	0.45

Code	No. of keywords (N = 108 total)	Alert notes retrieved from keyword search (N = 22 717)
Code	No. of keywords (N = 108 total)	No. of alert notes	Percent (%)
Review history	13	12 556	51.08
Technical error	6	467	1.90
Within normal limit	6	8208	33.39
False alarm	6	1051	4.28
Reading success	11	12 919	52.56
No further action needed	4	1962	7.98
Change improved	8	659	2.68
Change worsen	5	4658	18.95
Change unknown	10	11 418	46.45
Continue monitoring	7	202	0.82
Instruction	7	1507	6.13
Follow-up	7	4429	18.02
Nurse call patient	9	5988	24.36
Discuss symptoms	9	3590	14.61
Leave message	7	2559	10.41
Wrong number	2	135	0.55
Patient return call	4	111	0.45

Table 1.

Keyword match results of non-annotated alert notes (N = 24 579)

Code	No. of keywords (N = 108 total)	Alert notes retrieved from keyword search (N = 22 717)
Code	No. of keywords (N = 108 total)	No. of alert notes	Percent (%)
Review history	13	12 556	51.08
Technical error	6	467	1.90
Within normal limit	6	8208	33.39
False alarm	6	1051	4.28
Reading success	11	12 919	52.56
No further action needed	4	1962	7.98
Change improved	8	659	2.68
Change worsen	5	4658	18.95
Change unknown	10	11 418	46.45
Continue monitoring	7	202	0.82
Instruction	7	1507	6.13
Follow-up	7	4429	18.02
Nurse call patient	9	5988	24.36
Discuss symptoms	9	3590	14.61
Leave message	7	2559	10.41
Wrong number	2	135	0.55
Patient return call	4	111	0.45

Code	No. of keywords (N = 108 total)	Alert notes retrieved from keyword search (N = 22 717)
Code	No. of keywords (N = 108 total)	No. of alert notes	Percent (%)
Review history	13	12 556	51.08
Technical error	6	467	1.90
Within normal limit	6	8208	33.39
False alarm	6	1051	4.28
Reading success	11	12 919	52.56
No further action needed	4	1962	7.98
Change improved	8	659	2.68
Change worsen	5	4658	18.95
Change unknown	10	11 418	46.45
Continue monitoring	7	202	0.82
Instruction	7	1507	6.13
Follow-up	7	4429	18.02
Nurse call patient	9	5988	24.36
Discuss symptoms	9	3590	14.61
Leave message	7	2559	10.41
Wrong number	2	135	0.55
Patient return call	4	111	0.45

Evaluation of the regular expression-based keywords

There was a total of 680 alert notes in the precision and recall analysis. Four out of 17 codes had a precision score of 100% (No Further Action Needed, Change Improved, Wrong Number, Patient Return Call). Nine out of 17 codes had precision scores between 80% and 95% (Review History, Technical Error, Within Normal Limit, False Alarm, Reading Success, Change Unknown, Continue Monitoring, Nurse Call Patient, Leave Message). Four out of 17 had precision scores between 60% and 80% (Change Worsen, Instruction, Follow Up, Discuss Symptoms). The code Follow Up had the lowest precision score of 60%. Nine of 17 codes had a recall score of 100% (Technical Error, False Alarm, Change Improved, Change Worsen, Instruction, Nurse Call Patient, Discuss Symptoms, Wrong Number, Patient Return Call). Five of 17 codes had recall scores between 80% and 95% (Within Normal Limit, No Further Action Needed, Continue Monitoring, Follow Up, Leave Message). The last 3 codes had a recall score of less than 80% (Review History, Reading Success, Change Unknown). The code Reading Success had the lowest recall score of 62%. The final precision score was 86% and the final recall score was 90%. Details of results are shown in Table 2.

Table 2.

Confusion table for testing keyword search results (N = 20 classified as True, N = 20 classified as False according to keyword search results)

	True	False	Precision	Recall
Review history
Positive	19 (95%)	1 (5%)	0.95	0.73
Negative	7 (35%)	13 (65%)	0.95	0.73
Technical error
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Within normal limit
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
False alarm
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Reading success
Positive	18 (90%)	2 (10%)	0.9	0.62
Negative	11 (55%)	9 (45%)	0.9	0.62
No further action needed
Positive	20 (100%)	0 (0%)	1	0.87
Negative	3 (15%)	17 (85%)	1	0.87
Change improved
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Change worsen
Positive	15 (75%)	5 (25%)	0.75	1
Negative	0 (0%)	20 (100%)	0.75	1
Change unknown
Positive	18 (90%)	2 (10%)	0.9	0.64
Negative	10 (50%)	10 (50%)	0.9	0.64
Continue monitoring
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
Instruction
Positive	14 (70%)	6 (30%)	0.7	1
Negative	0 (0%)	20 (100%)	0.7	1
Follow-up
Positive	12 (60%)	8 (40%)	0.6	0.8
Negative	3 (15%)	17 (85%)	0.6	0.8
Nurse call patient
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Discuss symptoms
Positive	13 (65%)	7 (35%)	0.65	1
Negative	0 (0%)	20 (100%)	0.65	1
Leave message
Positive	16 (80%)	4 (20%)	0.8	0.89
Negative	2 (10%)	18 (90%)	0.8	0.89
Wrong number
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Patient return call
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1

	True	False	Precision	Recall
Review history
Positive	19 (95%)	1 (5%)	0.95	0.73
Negative	7 (35%)	13 (65%)	0.95	0.73
Technical error
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Within normal limit
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
False alarm
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Reading success
Positive	18 (90%)	2 (10%)	0.9	0.62
Negative	11 (55%)	9 (45%)	0.9	0.62
No further action needed
Positive	20 (100%)	0 (0%)	1	0.87
Negative	3 (15%)	17 (85%)	1	0.87
Change improved
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Change worsen
Positive	15 (75%)	5 (25%)	0.75	1
Negative	0 (0%)	20 (100%)	0.75	1
Change unknown
Positive	18 (90%)	2 (10%)	0.9	0.64
Negative	10 (50%)	10 (50%)	0.9	0.64
Continue monitoring
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
Instruction
Positive	14 (70%)	6 (30%)	0.7	1
Negative	0 (0%)	20 (100%)	0.7	1
Follow-up
Positive	12 (60%)	8 (40%)	0.6	0.8
Negative	3 (15%)	17 (85%)	0.6	0.8
Nurse call patient
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Discuss symptoms
Positive	13 (65%)	7 (35%)	0.65	1
Negative	0 (0%)	20 (100%)	0.65	1
Leave message
Positive	16 (80%)	4 (20%)	0.8	0.89
Negative	2 (10%)	18 (90%)	0.8	0.89
Wrong number
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Patient return call
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1

Table 2.

Confusion table for testing keyword search results (N = 20 classified as True, N = 20 classified as False according to keyword search results)

	True	False	Precision	Recall
Review history
Positive	19 (95%)	1 (5%)	0.95	0.73
Negative	7 (35%)	13 (65%)	0.95	0.73
Technical error
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Within normal limit
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
False alarm
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Reading success
Positive	18 (90%)	2 (10%)	0.9	0.62
Negative	11 (55%)	9 (45%)	0.9	0.62
No further action needed
Positive	20 (100%)	0 (0%)	1	0.87
Negative	3 (15%)	17 (85%)	1	0.87
Change improved
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Change worsen
Positive	15 (75%)	5 (25%)	0.75	1
Negative	0 (0%)	20 (100%)	0.75	1
Change unknown
Positive	18 (90%)	2 (10%)	0.9	0.64
Negative	10 (50%)	10 (50%)	0.9	0.64
Continue monitoring
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
Instruction
Positive	14 (70%)	6 (30%)	0.7	1
Negative	0 (0%)	20 (100%)	0.7	1
Follow-up
Positive	12 (60%)	8 (40%)	0.6	0.8
Negative	3 (15%)	17 (85%)	0.6	0.8
Nurse call patient
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Discuss symptoms
Positive	13 (65%)	7 (35%)	0.65	1
Negative	0 (0%)	20 (100%)	0.65	1
Leave message
Positive	16 (80%)	4 (20%)	0.8	0.89
Negative	2 (10%)	18 (90%)	0.8	0.89
Wrong number
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Patient return call
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1

	True	False	Precision	Recall
Review history
Positive	19 (95%)	1 (5%)	0.95	0.73
Negative	7 (35%)	13 (65%)	0.95	0.73
Technical error
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Within normal limit
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
False alarm
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Reading success
Positive	18 (90%)	2 (10%)	0.9	0.62
Negative	11 (55%)	9 (45%)	0.9	0.62
No further action needed
Positive	20 (100%)	0 (0%)	1	0.87
Negative	3 (15%)	17 (85%)	1	0.87
Change improved
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Change worsen
Positive	15 (75%)	5 (25%)	0.75	1
Negative	0 (0%)	20 (100%)	0.75	1
Change unknown
Positive	18 (90%)	2 (10%)	0.9	0.64
Negative	10 (50%)	10 (50%)	0.9	0.64
Continue monitoring
Positive	19 (95%)	1 (5%)	0.95	0.91
Negative	2 (10%)	18 (90%)	0.95	0.91
Instruction
Positive	14 (70%)	6 (30%)	0.7	1
Negative	0 (0%)	20 (100%)	0.7	1
Follow-up
Positive	12 (60%)	8 (40%)	0.6	0.8
Negative	3 (15%)	17 (85%)	0.6	0.8
Nurse call patient
Positive	17 (85%)	3 (15%)	0.85	1
Negative	0 (0%)	20 (100%)	0.85	1
Discuss symptoms
Positive	13 (65%)	7 (35%)	0.65	1
Negative	0 (0%)	20 (100%)	0.65	1
Leave message
Positive	16 (80%)	4 (20%)	0.8	0.89
Negative	2 (10%)	18 (90%)	0.8	0.89
Wrong number
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1
Patient return call
Positive	20 (100%)	0 (0%)	1	1
Negative	0 (0%)	20 (100%)	1	1

DISCUSSION

Our findings demonstrate how community-based telehealth alert notes, which include a plethora of information, can be used to build an automated system that helps to triage alerts to prevent alert fatigue among healthcare workers and improve the quality of care. More specifically, our findings are unique in showing the feasibility of successful automations in community-based organizations with a large low-income population where we tested this approach. While the findings may be limited to a single telehealth project, community-driven telehealth projects are ubiquitous with many congregate housing facilities seeking technology innovations to serve their populations. Driving understanding and guidance to community-based organizations who are trying to leverage technology to help their older adult populations are innovating as they care for patients.

For instance, existing studies used various text mining techniques that performed phrase or word matches from various kinds of narrative documents within electronic health records to identify adverse events, medical errors, or screening for the risk of falls.^14–16 Results from these studies varied. One study identified a broad range of medical errors by searching 5 keywords “mistake,” “error,” “incorrect,” and “iatrogenic” in discharge summaries, sign-out notes, and outpatient visit notes and obtained a positive predictive range of only 3.4%–24.4%.¹⁷ Meanwhile, another study performed text searching on discharge summaries to identify a broad range of adverse events. The system turned 59% of discharge summaries with a predictive value of 52%.¹⁸ The most recent study identified 7 adverse events in narrative documents using a keyword search approach. The study achieved positive predictive values as low as 5.37% and up to 83.83%, depending on the kind of adverse event.¹⁹

Our findings show that our approach of using a simple keyword search based on the codebook developed from manual annotation performs well in terms of precision (M = 86%, SD = 13%) and recall (M = 90%, SD = 13%). Out of 17 codes, 4 codes showed a precision of 100% and 3 codes had both precision and recall scores of 100%. These are promising results in attempting to build a triage system of telehealth alerts, even with community-based organizations that have a large proportion of low-income populations. The codes directly related to triaging that are about requiring nurses’ intervention or not requiring nurses’ reaction—Technical Error, False Alarm, No Further Action Needed, Change Worsen, Instruction, Nurse Call Patient, Discuss Symptoms—have all resulted in either high precision or recall.

Codes requiring high recall over precision would mean true positives cannot be missed. However, the only low recall scores of our findings occur in non-triage related codes, such as Review History, Reading Success, or Change Unknown. Codes requiring high precision over recall would mean we cannot tolerate false negatives. Relatively lower precision scores happened in codes about follow-up and discussing symptoms. Nurses can still follow up or discuss symptoms without any harms or increased risks of the patient. As a next step, building a user-friendly interface for nurses to simply revise notes based on predicted notes and nurses helping the predictive model by giving feedback will continue to improve and add more uniquely relevant keywords. This change will not only help the nurses to more efficiently handle adding alert notes and following up on alert notes but also eventually improve the triaging process for reduced false alerts. The resulting dataset will work again as a training dataset for a triage system, which can continue to be improved based on nurses’ feedback and the reinforcement learning mechanisms.²⁰

Although our results are specific to the TIPS program, our approach of developing the workflow and testing keyword-based identification can be applied to identifying false alerts in other systems using alerts as well as implications for addressing alert fatigue. Making healthcare workers to become desensitized to safety alerts from overload of alerts results in failed or inappropriate responses to such warnings. Alert fatigue can cause a burden for healthcare workers, leading to bypassing true alerts that need clinicians’ attention and resulting in severe patient safety consequences.²¹^,²² Although alert fatigue has been extensively discussed in the context of electronic health records, little is known about its relevance in community-based telehealth.

Since we worked with a sample of the alert notes for the coding framework development and keyword generation, we excluded other possible codes or keywords that can detect the alerts related to other activities of interest. Future studies with various domains of alert notes will help add evidence to this triage approach.

To the best of our knowledge, this study is the first approach to using narrative telehealth data to extract information and build possibilities for future prediction models for alerts in the context of community-driven intervention programs for low-income older adults. Our results provide a benchmark reference for the analysis of clinical notes of telehealth that is scalable to develop an automated alert triaging system that detects false alerts and streamlines the workflow from alert review to respond to the patients. Further research can explore the generalizability of the findings to other contexts and investigate the integration of the system with electronic health records and other clinical decision support systems to streamline the alert review process. Community-driven telehealth is a growing area of commercialization, but with little support for not-for-profits and CBOs who make these investments with little evidence, this work makes a critical contribution to this space in which automated approaches lack evidence. Our findings pave the way for evidence-based toolkits for CBOs to make informed investments that can ensure patient safety, high-quality care, and save them money.

CONCLUSION

Our study introduces the potential of an automated approach to triage alerts within the telehealth system with mobile health monitoring data using a simple keyword search technique. We showed a feasibility of a simple, automating method for a community-driven telehealth program primarily serving low-income older adult population. Building on our work as a training dataset, an advanced machine learning technique can be examined to further evaluate the feasibility of automatically detecting false alerts and improve the quality and efficacy of telehealth.

FUNDING

This work was in part supported by National Science Foundation (2144880 and 2237097) and National Institutes of Health (K01AG068592 and R21HS028104). Helene and Grant Wilson Center for Social Entrepreneurship at Pace University funded a portion of this study.

AUTHOR CONTRIBUTIONS

PN: contributed to the conception or design of the work, analysis, and interpretation of data for the work; drafted the work and reviewed it critically for important intellectual content; gave final approval of the version to be published; and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. MKS: contributed to the conception and design of the work, analysis, and interpretation of data for the work; drafted the work and reviewed it critically for important intellectual content; gave final approval of the version to be published; agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. ZZ: contributed to the conception and design of the work, the acquisition, analysis, and interpretation of data for the work; drafted the work and reviewed it critically for important intellectual content; gave a final approval of the version to be published; agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. HWC: contributed to the analysis and interpretation of data for the work; drafted the work and reviewed it critically for important intellectual content; gave a final approval of the version to be published; agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. JH-Y: contributed to the conception and design of the work, analysis, and interpretation of data for the work; drafted the work and reviewed it critically for important intellectual content; gave a final approval of the version to be published; agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

SUPPLEMENTARY MATERIAL

Supplementary material is available at JAMIA Open online.

ACKNOWLEDGMENTS

The authors would like to thank Tiffany Chin for her clinical perspective in preparation for this study.

CONFLICT OF INTEREST STATEMENT

None declared.

DATA AVAILABILITY

The data underlying this article cannot be shared publicly due to the privacy of individuals that participated in the program. The data will be shared upon reasonable request to the corresponding author.

REFERENCES

Jaglal

Haroun

Salbach

, et al.

Increasing access to chronic disease self-management programs in rural and remote communities using telehealth

Telemed J E Health

2013

;

(

467

–

Koonin

Hoots

Tsang

, et al.

Trends in the use of telehealth during the emergence of the COVID-19 pandemic—United States, January–March 2020

MMWR Morb Mortal Wkly Rep

2020

;

(

1595

–

Snoswell

Taylor

Comans

, et al.

Determining if telehealth can reduce health system costs: scoping review

J Med Internet Res

2020

;

(

e17298

Hamilton

Johnson

Quinn

, et al.

Telehealth intervention programs for seniors: an observational study of a community-embedded health monitoring initiative

Telemed J E Health

2020

;

(

438

–

Pekmezaris

Mitzner

Pecinka

, et al.

The impact of remote patient monitoring (telehealth) upon Medicare beneficiaries with heart failure

Telemed J E Health

2012

;

(

101

–

Tham

Nandra

Whang

, et al.

Postoperative telehealth visits reduce emergency department visits and 30-day readmissions in elective thoracic surgery patients

J Healthc Qual

2021

;

(

204

–

Zhang

Henley

Schiaffino

, et al.

Older adults’ perceptions of community-based telehealth wellness programs: a qualitative study

Inform Health Soc Care

2022

;

(

361

–

Unruh

Jung

Kaushal

, et al.

Hospitalization event notifications and reductions in readmissions of Medicare fee-for-service beneficiaries in the Bronx, New York

J Am Med Inform Assoc

2017

;

(

–

156

Radhakrishna

Bowles

Zettek-Sumner

Contributors to frequent telehealth alerts including false alerts for patients with heart failure: a mixed methods exploration

Appl Clin Inform

2013

;

(

465

–

PubMed

Rayo

Moffatt-Bruce

SD.

Alarm system management: evidence-based guidance encouraging direct measurement of informativeness to improve alarm response

BMJ Qual Saf

2015

;

(

282

–

Schiaffino

Zhang

Sachs

, et al.

Predictors of retention for community-based telehealth programs: a study of the Telehealth Intervention Program for Seniors (TIPS)

AMIA Annual Symposium Proceedings

2022

;

2021

1089

–

PubMed

Schiaffino

Al-Amin

Schumacher

JR.

Predictors of language service availability in U.S. hospitals

Int J Health Policy Manag

2014

;

(

259

–

Aldiabat

Le Navenec

C-L.

Data saturation: the mysterious step in grounded theory methodology

TQR

2018

;

23, 1

(

2018

245

–

Holmes

Freilich

Taylor

, et al.

Electronic alerts for triage protocol compliance among emergency department triage nurses: a randomized controlled trial

Nurs Res

2015

;

(

226

–

Melton

Hripcsak

Automated detection of adverse events using natural language processing of discharge summaries

J Am Med Inform Assoc

2005

;

(

448

–

Zhu

Walker

Warren

, et al.

Identifying falls risk screenings not documented with administrative codes using natural language processing

AMIA Annu Symp Proc

2017

;

2017

1923

–

PubMed