This commentary refers to 'Feasibility of using deep learning to detect coronary artery disease based on facial photo' by S. Lin et al., 2020;41:4400–4411.

Lin et al.1 did a weighty contribution to improve our ability to predict the presence of coronary artery disease (CAD) in those undergoing coronary angiography (CAG) or coronary computed tomography angiography (CCTA). They developed a deep learning algorithm based on facial features and demonstrated that its prediction power is greater than the Diamond–Forrester model and the CAD consortium clinical score based on C-statistic.1 We believe that it is a worthy attempt to aid us in enrolling our patients for CAG or CCTA. A major diagnostic challenge is finding a valid and reliable means of distinguishing patients with normal or near-normal coronary arteries from those with obstructive CAD based on clinical characteristics and non-invasive evaluations. If a reliable tool was available, cardiologists could reduce the number of unnecessary CAG procedures, which are associated with a small but definite risk to the patient, healthcare costs, and waste of medical resources.2 In this commentary, we are going to challenge the use of C-statistic and suggest some modern indices for comparing prediction models.

Conventionally, the clinicians employ C-statistic for making comparisons between clinical prediction models for the sake of simplicity and understandability. C-Statistic integrates improvements of the sensitivity and specificity of a model by an equal weight; however, depending on the clinical scenario, they are not the same.3  ,  4 For instance, in the setting of implantable cardioverter-defibrillator (ICD) implantation for the primary prevention of sudden cardiac death (SCD) in patients with hypertrophic cardiomyopathy, a false-positive prediction results in ICD complications compared to SCD as the result of a false-negative prediction.4 Accordingly, in the setting of the diagnosis of CAD, false-positive and false-negative predictions are not the same in terms of clinical outcomes. In this context, a false-positive prediction lead to unnecessary radiation exposure, excessive costs, and complications of CAG; nonetheless, a false-negative prediction may cause acute coronary syndrome and its adverse outcomes. Therefore, clinicians need a statistical index that considers the different weights of false-positive and false-negative predictions. Taking this into account, biostatisticians suggest to choose novel statistical indices like net-benefit (NB) index or weighted net reclassification improvement rather than C-statistic for the comparison of clinical prediction models; because they correspond to clinical outcomes more plausibly than C-statistic. The NB index takes different weights of false-positive and false-negative predictions into account. Thus, it provides more desirable comparisons between the prediction models in terms of clinical outcomes.3–5  

The harm-to-benefit ratio equals to the odds of the clinically acceptable cut-off for decision-making. Lin et al.1 provided C-statistic for the Diamond–Forrester model and the CAD consortium clinical score, while they did not mention any cut-off, i.e. 5%, 30%, 40%, or 70%. Taking a cut-off into account and calculation of the NB index confers a more clinical weight to the comparison.4 At last, we should emphasize that we do not question the valid attempt of Lin et al.  1; nevertheless, we strongly recommend to use a clinically acceptable cut-off for making decisions and employing the NB index in future studies.4

Conflict of interest: none declared.

References

1

Lin
 
S
,
Li
 
Z
,
Fu
 
B
,
Chen
 
S
,
Li
 
X
,
Wang
 
Y
,
Wang
 
X
,
Lv
 
B
,
Xu
 
B
,
Song
 
X
,
Zhang
 
Y-J
,
Cheng
 
X
,
Huang
 
W
,
Pu
 
J
,
Zhang
 
Q
,
Xia
 
Y
,
Du
 
B
,
Ji
 
X
,
Zheng
 
Z.
 
Feasibility of using deep learning to detect coronary artery disease based on facial photo
.
Eur Heart J
 
2020
;
41
:4400–4411.

2

Masoudkabir
 
F
,
Vasheghani-Farahani
 
A
,
Hakki
 
E
,
Poorhosseini
 
H
,
Sadeghian
 
S
,
Abbasi
 
SH
,
Bahmanyar
 
S
,
Kassaian
 
SE.
 
Novel scoring system for prediction of cardiac syndrome X in women with typical angina and a positive exercise tolerance test
.
Tex Heart Inst J
 
2018
;
45
:
5
10
.

3

Steyerberg
 
EW
,
Vickers
 
AJ
,
Cook
 
NR
,
Gerds
 
T
,
Gonen
 
M
,
Obuchowski
 
N
,
Pencina
 
MJ
,
Kattan
 
MW.
 
Assessing the performance of prediction models: a framework for traditional and novel measures
.
Epidemiology
 
2010
;
21
:
128
138
.

4

Aminorroaya
 
A
,
Vasheghani-Farahani
 
A
,
Masoudkabir
 
F
,
Roayaei
 
P.
 
Recommendations of the current guidelines for implantable cardioverter-defibrillator implantation in patients with hypertrophic cardiomyopathy: debate still exists
.
J Interv Card Electrophysiol
 
2020
;doi: 10.1007/s10840-020-00837-4.

5

Steyerberg
 
EW
,
Calster
 
BV
,
Pencina
 
MJ.
 
Performance measures for prediction models and markers: evaluation of predictions and classifications
.
Rev Esp Cardiol (Engl Ed)
 
2011
;
64
:
788
794
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)