Abstract

OBJECTIVES

The EuroSCORE II is widely used to predict 30-day mortality in patients undergoing open and transcatheter cardiac surgery. The aim of this study is to evaluate the discriminatory ability of the EuroSCORE II in predicting 30-day mortality in a large cohort of patients undergoing surgical mitral valve repair in a high-volume centre.

METHODS

A retrospective review of our institutional database was carried on to find all patients who underwent mitral valve repair in our department from January 2012 to December 2019. Discrimination of the EuroSCORE II was assessed using receiver operating characteristic curves. The maximum Youden’s Index was employed to define the optimal cut-point. Calibration was assessed by generating calibration plot that visually compares the predicted mortality with the observed mortality. Calibration was also tested with the Hosmer–Lemeshow goodness-of-fit test. Finally, the accuracy of the models was tested calculating the Brier score.

RESULTS

A total of 2645 patients were identified, and the median EuroSCORE II was 1.3% (0.6–2.0%). In patients with degenerative mitral regurgitation (MR), the EuroSCORE II showed low discrimination (area under the curve 0.68), low accuracy (Brier score 0.27) and low calibration with overestimation of the 30-day mortality. In patients with secondary MR, the EuroSCORE II showed a good overall performance estimating the 30-day mortality with good discrimination (area under the curve 0.88), good accuracy (Brier score 0.003) and good calibration.

CONCLUSIONS

In patients with degenerative MR operated on in a high-volume centre with a high level of expertise in mitral valve repair, the EuroSCORE II significantly overestimates the 30-day mortality.

INTRODUCTION

Since its publication in 2012, the EuroSCORE II has been widely used to predict 30-day mortality in patients undergoing open cardiac surgery and transcatheter procedures. This score was developed by analysing a cohort of 23 000 patients who underwent open cardiac surgery in >150 hospitals in 43 countries over a 12-week period (May–July 2010) [1]. Several studies confirmed the good performance of the EuroSCORE II in predicting 30-day mortality in patients who had undergone major cardiac surgery [2–7], isolated coronary bypass graft [8], aortic valve replacement [9–11] and miscellaneous valve operations [12]. However, although 2 studies validated the EuorSCORE II in patients undergoing mitral valve surgery [13, 14], to date, no study has evaluated the predictive value of the EuroSCORE II in estimating 30-day mortality in patients undergoing exclusively surgical mitral valve repair. The aim of this study is to evaluate the performance of the EuroSCORE II in predicting 30-day mortality in patients who underwent mitral valve repair for degenerative and secondary mitral regurgitation (MR) in our institution since its introduction in January 2012.

MATERIALS AND METHODS

Patients

A retrospective review of our institutional database was carried on to find all patients who underwent mitral valve repair in our department from January 2012 to December 2019. Patients were then divided in 2 groups: degenerative and secondary MR. Indeed, to minimize the confounding effect of other less common aetiologies of MR, we excluded from the study population the relatively small number of patients with rheumatic disease and post-endocarditis MR. Charts were then analysed to find preoperative characteristics, EuroSCORE II value, intraoperative variables and postoperative results. Survival after hospital discharge was assessed by outpatients visit, usually done 2 months after the index procedure. Short-term outcomes are reported according to the AATS/STS/EACTS guidelines [15]. The Ethical Committee approved the study and waived the individual informed consent for this retrospective analysis.

Statistical analysis

Patient data were summarized as mean ± standard deviation, or median and interquartile range for asymmetrically distributed continuous variables. Categorical variables were expressed as percentages or prevalence, as appropriate. The Kolmogorov–Smirnov test was used to check for the normality of data. Univariate analysis was done with Pearson’s chi-square test or Fisher’s exact test for categorical variables. For continuous variables, Student’s t-test or Wilcoxon signed-rank tests were used as appropriate.

Discrimination of a test indicates the extent to which the model distinguishes between patients who will die or survive in the perioperative period. Discrimination was assessed with receiver operating characteristic (ROC) curves. ROC area under the curve (AUC) values vary between 0.5 and 1, where 0.5 denotes a bad diagnostic test and 1 denotes an excellent diagnostic test [16]. Another index used to evaluate the discrimination was the Somers’ Dxy rank correlation between predicted probabilities and observed responses. When Dxy = 0, the model is making random prediction, and when Dxy = 1, the prediction is perfect [17]. The maximum Youden’s Index (J = sensitivity + specificity − 1) was employed to define the optimal cut-point, which is the point on the curve at which the predictive value of the score is maximized.

Calibration refers to the agreement between observed outcomes and predictions. For example, 5 in-hospital deaths should be observed in a 100 patients’ group with a predicted mortality of 5%. Calibration was assessed by generating calibration plot that visually compares the predicted mortality with the observed mortality [18]. A curve below or above the diagonal reflects, respectively, overestimation and underestimation. The closest the line to the predicted mortality, the better the calibration of the score. Moreover, calibration was tested with the Hosmer–Lemeshow goodness-of-fit test, which compares observed to predicted values by decile of predicted probability. A significant P-value of this test indicates a bad performance of the score.

Finally, the accuracy of the models was tested calculating the Brier score (quadratic difference between predicted probability and observed outcome for each patient); when the prediction of the model is perfect, the Brier score is 0.

All the analysis was carried out separately for patients with degenerative MR and secondary MR. All the analyses were performed using Stata software (StataCorp LLC, College Station, TX, USA; Version 15).

RESULTS

A total of 2645 patients were identified. Preoperative characteristics of the 2 groups are listed in Table 1. As expected, patients with secondary MR were significantly older and with higher median EuroSCORE II (2.4% vs 1%, P <0.001). The surgical techniques and other intraoperative variables are summarized in Table 2. The 30-day mortality was significantly higher in patients with secondary MR (2.6% vs 0.39%, P <0.001). Similarly, low cardiac output syndrome was significantly more common in patients with secondary MR (8.6% vs 1.3%, P < 0.001). Postoperative complications are listed in Table 3. Death was due to low cardiac output syndrome in 4 patients (2 in the secondary MR group and 2 in the degenerative MR); septic shock in 4 patients (all in the degenerative MR group) and other causes in the remaining 5 cases. All hospital survivors had their first outpatient visit about 2 months after surgery.

Table 1:

Preoperative characteristics of the patients in the 2 groups

Degenerative (N = 2530)Secondary (N = 115)P-value
Age, median (IQR)57 (47–66)68 (63–72)<0.001
Female sex, n (%)837 (33)45 (39)0.178
NYHA class I, n (%)594 (23)0 (0)0.999
NYHA class II, n (%)1475 (59)25 (21)<0.001
NYHA class III, n (%)390 (15)55 (48)<0.001
NYHA class IV, n (%)71 (3)35 (31)<0.001
REDO, n (%)69 (2.7)2 (1.7)0.521
Creatinine clearance 85–50 ml/min, n (%)91 (3.6)7 (6)0.166
Creatinine clearance <50 ml/min, n (%)35 (1.4)5 (4.3)0.010
Dialysis regardless of the creatinine clearance, n (%)19 (0.75)2 (1.7)0.242
Insulin dependent diabetes mellitus, n (%)110 (4.3)37 (32)<0.001
Extracardiac arteriopathy, n (%)27 (1)7 (6)<0.001
COPD, n (%)40 (1.6)4 (3.5)0.119
Ejection fraction, median (IQR)61 (58–66)35 (30–42)<0.001
Moderate pulmonary hypertension (SPAP 30–55 mmHg), n (%)832 (32.8)66 (57)<0.001
Severe pulmonary hypertension (SPAP >55 mmHg), n (%)220 (8)28 (24)<0.001
Isolated mitral valve repair, n (%)1779 (71)52 (45)<0.001
Mitral valve repair + 1 procedure, n (%)571 (22)60 (53)<0.001
Mitral valve repair + 2 or more procedure, n (%)180 (7)3 (2)0.062
EuroSCORE II, median (IQR)1 (0.6–1.5)2.4 (1.5–4.1)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
Age, median (IQR)57 (47–66)68 (63–72)<0.001
Female sex, n (%)837 (33)45 (39)0.178
NYHA class I, n (%)594 (23)0 (0)0.999
NYHA class II, n (%)1475 (59)25 (21)<0.001
NYHA class III, n (%)390 (15)55 (48)<0.001
NYHA class IV, n (%)71 (3)35 (31)<0.001
REDO, n (%)69 (2.7)2 (1.7)0.521
Creatinine clearance 85–50 ml/min, n (%)91 (3.6)7 (6)0.166
Creatinine clearance <50 ml/min, n (%)35 (1.4)5 (4.3)0.010
Dialysis regardless of the creatinine clearance, n (%)19 (0.75)2 (1.7)0.242
Insulin dependent diabetes mellitus, n (%)110 (4.3)37 (32)<0.001
Extracardiac arteriopathy, n (%)27 (1)7 (6)<0.001
COPD, n (%)40 (1.6)4 (3.5)0.119
Ejection fraction, median (IQR)61 (58–66)35 (30–42)<0.001
Moderate pulmonary hypertension (SPAP 30–55 mmHg), n (%)832 (32.8)66 (57)<0.001
Severe pulmonary hypertension (SPAP >55 mmHg), n (%)220 (8)28 (24)<0.001
Isolated mitral valve repair, n (%)1779 (71)52 (45)<0.001
Mitral valve repair + 1 procedure, n (%)571 (22)60 (53)<0.001
Mitral valve repair + 2 or more procedure, n (%)180 (7)3 (2)0.062
EuroSCORE II, median (IQR)1 (0.6–1.5)2.4 (1.5–4.1)<0.001

IQR: interquartile range.

Table 1:

Preoperative characteristics of the patients in the 2 groups

Degenerative (N = 2530)Secondary (N = 115)P-value
Age, median (IQR)57 (47–66)68 (63–72)<0.001
Female sex, n (%)837 (33)45 (39)0.178
NYHA class I, n (%)594 (23)0 (0)0.999
NYHA class II, n (%)1475 (59)25 (21)<0.001
NYHA class III, n (%)390 (15)55 (48)<0.001
NYHA class IV, n (%)71 (3)35 (31)<0.001
REDO, n (%)69 (2.7)2 (1.7)0.521
Creatinine clearance 85–50 ml/min, n (%)91 (3.6)7 (6)0.166
Creatinine clearance <50 ml/min, n (%)35 (1.4)5 (4.3)0.010
Dialysis regardless of the creatinine clearance, n (%)19 (0.75)2 (1.7)0.242
Insulin dependent diabetes mellitus, n (%)110 (4.3)37 (32)<0.001
Extracardiac arteriopathy, n (%)27 (1)7 (6)<0.001
COPD, n (%)40 (1.6)4 (3.5)0.119
Ejection fraction, median (IQR)61 (58–66)35 (30–42)<0.001
Moderate pulmonary hypertension (SPAP 30–55 mmHg), n (%)832 (32.8)66 (57)<0.001
Severe pulmonary hypertension (SPAP >55 mmHg), n (%)220 (8)28 (24)<0.001
Isolated mitral valve repair, n (%)1779 (71)52 (45)<0.001
Mitral valve repair + 1 procedure, n (%)571 (22)60 (53)<0.001
Mitral valve repair + 2 or more procedure, n (%)180 (7)3 (2)0.062
EuroSCORE II, median (IQR)1 (0.6–1.5)2.4 (1.5–4.1)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
Age, median (IQR)57 (47–66)68 (63–72)<0.001
Female sex, n (%)837 (33)45 (39)0.178
NYHA class I, n (%)594 (23)0 (0)0.999
NYHA class II, n (%)1475 (59)25 (21)<0.001
NYHA class III, n (%)390 (15)55 (48)<0.001
NYHA class IV, n (%)71 (3)35 (31)<0.001
REDO, n (%)69 (2.7)2 (1.7)0.521
Creatinine clearance 85–50 ml/min, n (%)91 (3.6)7 (6)0.166
Creatinine clearance <50 ml/min, n (%)35 (1.4)5 (4.3)0.010
Dialysis regardless of the creatinine clearance, n (%)19 (0.75)2 (1.7)0.242
Insulin dependent diabetes mellitus, n (%)110 (4.3)37 (32)<0.001
Extracardiac arteriopathy, n (%)27 (1)7 (6)<0.001
COPD, n (%)40 (1.6)4 (3.5)0.119
Ejection fraction, median (IQR)61 (58–66)35 (30–42)<0.001
Moderate pulmonary hypertension (SPAP 30–55 mmHg), n (%)832 (32.8)66 (57)<0.001
Severe pulmonary hypertension (SPAP >55 mmHg), n (%)220 (8)28 (24)<0.001
Isolated mitral valve repair, n (%)1779 (71)52 (45)<0.001
Mitral valve repair + 1 procedure, n (%)571 (22)60 (53)<0.001
Mitral valve repair + 2 or more procedure, n (%)180 (7)3 (2)0.062
EuroSCORE II, median (IQR)1 (0.6–1.5)2.4 (1.5–4.1)<0.001

IQR: interquartile range.

Table 2:

Operative variables

Degenerative (N = 2530)Secondary (N = 115)P-value
CPB time, median (IQR)88 (70–112)94 (70–130)0.654
Cross clamp time, median (IQR)63 (49–82)69 (50–90)0.521
Type of repair, n (%)
 Isolated ring88 (4)70 (61)<0.001
 Ring + resection of the posterior leaflet1038 (41)00.999
 Ring + edge-to-edge901 (35)45 (39)0.441
 Ring + chordae300 (12)00.999
 Ring + resection+ chordae100 (4)00.999
 Othersa103 (4)00.999
Ring characteristics
 Complete ring, n (%)1220 (48)115 (100)0.999
 Band, n (%)1310 (52)00.999
 Ring diameter, median (IQR)36 (34–38)28 (26–30)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
CPB time, median (IQR)88 (70–112)94 (70–130)0.654
Cross clamp time, median (IQR)63 (49–82)69 (50–90)0.521
Type of repair, n (%)
 Isolated ring88 (4)70 (61)<0.001
 Ring + resection of the posterior leaflet1038 (41)00.999
 Ring + edge-to-edge901 (35)45 (39)0.441
 Ring + chordae300 (12)00.999
 Ring + resection+ chordae100 (4)00.999
 Othersa103 (4)00.999
Ring characteristics
 Complete ring, n (%)1220 (48)115 (100)0.999
 Band, n (%)1310 (52)00.999
 Ring diameter, median (IQR)36 (34–38)28 (26–30)<0.001
a

Including combination not listed in the table, other technique such as cleft closure or chordal transposition and patients who did not receive annuloplasty.

IQR: interquartile range.

Table 2:

Operative variables

Degenerative (N = 2530)Secondary (N = 115)P-value
CPB time, median (IQR)88 (70–112)94 (70–130)0.654
Cross clamp time, median (IQR)63 (49–82)69 (50–90)0.521
Type of repair, n (%)
 Isolated ring88 (4)70 (61)<0.001
 Ring + resection of the posterior leaflet1038 (41)00.999
 Ring + edge-to-edge901 (35)45 (39)0.441
 Ring + chordae300 (12)00.999
 Ring + resection+ chordae100 (4)00.999
 Othersa103 (4)00.999
Ring characteristics
 Complete ring, n (%)1220 (48)115 (100)0.999
 Band, n (%)1310 (52)00.999
 Ring diameter, median (IQR)36 (34–38)28 (26–30)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
CPB time, median (IQR)88 (70–112)94 (70–130)0.654
Cross clamp time, median (IQR)63 (49–82)69 (50–90)0.521
Type of repair, n (%)
 Isolated ring88 (4)70 (61)<0.001
 Ring + resection of the posterior leaflet1038 (41)00.999
 Ring + edge-to-edge901 (35)45 (39)0.441
 Ring + chordae300 (12)00.999
 Ring + resection+ chordae100 (4)00.999
 Othersa103 (4)00.999
Ring characteristics
 Complete ring, n (%)1220 (48)115 (100)0.999
 Band, n (%)1310 (52)00.999
 Ring diameter, median (IQR)36 (34–38)28 (26–30)<0.001
a

Including combination not listed in the table, other technique such as cleft closure or chordal transposition and patients who did not receive annuloplasty.

IQR: interquartile range.

Table 3:

Postoperative complications

Degenerative (N = 2530)Secondary (N = 115)P-value
Low cardiac output syndrome, n (%)34 (1.3)10 (8.6)<0.001
Prolonged ventilatory support (>48 h, n (%)48 (1.9)12 (10.4)<0.001
Re-exploration for bleeding, n (%)38 (1.5)1 (0.8)0.321
Acute kidney injury requiring dialysis, n (%)44 (1.7)4 (3.5)0.171
Stroke, n (%)23 (0.9)1 (0.8)0.653
Transitory ischaemic attack, n (%)31 (1.2)1 (0.8)0.765
30-Day mortality, n (%)10 (0.39)3 (2.6)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
Low cardiac output syndrome, n (%)34 (1.3)10 (8.6)<0.001
Prolonged ventilatory support (>48 h, n (%)48 (1.9)12 (10.4)<0.001
Re-exploration for bleeding, n (%)38 (1.5)1 (0.8)0.321
Acute kidney injury requiring dialysis, n (%)44 (1.7)4 (3.5)0.171
Stroke, n (%)23 (0.9)1 (0.8)0.653
Transitory ischaemic attack, n (%)31 (1.2)1 (0.8)0.765
30-Day mortality, n (%)10 (0.39)3 (2.6)<0.001
Table 3:

Postoperative complications

Degenerative (N = 2530)Secondary (N = 115)P-value
Low cardiac output syndrome, n (%)34 (1.3)10 (8.6)<0.001
Prolonged ventilatory support (>48 h, n (%)48 (1.9)12 (10.4)<0.001
Re-exploration for bleeding, n (%)38 (1.5)1 (0.8)0.321
Acute kidney injury requiring dialysis, n (%)44 (1.7)4 (3.5)0.171
Stroke, n (%)23 (0.9)1 (0.8)0.653
Transitory ischaemic attack, n (%)31 (1.2)1 (0.8)0.765
30-Day mortality, n (%)10 (0.39)3 (2.6)<0.001
Degenerative (N = 2530)Secondary (N = 115)P-value
Low cardiac output syndrome, n (%)34 (1.3)10 (8.6)<0.001
Prolonged ventilatory support (>48 h, n (%)48 (1.9)12 (10.4)<0.001
Re-exploration for bleeding, n (%)38 (1.5)1 (0.8)0.321
Acute kidney injury requiring dialysis, n (%)44 (1.7)4 (3.5)0.171
Stroke, n (%)23 (0.9)1 (0.8)0.653
Transitory ischaemic attack, n (%)31 (1.2)1 (0.8)0.765
30-Day mortality, n (%)10 (0.39)3 (2.6)<0.001

For patients with degenerative MR, the area under the ROC curve was 0.68 (Fig. 1) with an optimal cut-point of 1.61. The Somers’ Dxy index was 0.032. These 2 values indicate a relatively low discrimination. The calibration plot, shown in Fig. 2, indicates an overprediction of the 30-day mortality also in patients with low surgical risk. The P-value for the Hosmer–Lemeshow test was <0.001 indicating low calibration. Finally, the Brier score was 0.27 indicating a medium–low overall performance of the score.

Receiver operating characteristic curves for patients with degenerative mitral regurgitation. The value of the area under the curve was 0.68 indicating a relatively low discrimination of the EuroSCORE II in this cohort of patients at low surgical risk (median EuroSCORE II 1%). AUC: area under the curve.
Figure 1:

Receiver operating characteristic curves for patients with degenerative mitral regurgitation. The value of the area under the curve was 0.68 indicating a relatively low discrimination of the EuroSCORE II in this cohort of patients at low surgical risk (median EuroSCORE II 1%). AUC: area under the curve.

Spline for the EuroSCORE II in patients with degenerative mitral regurgitation. This figure highlights in a non-linear manner the overprediction of the 30-day mortality in patients at low surgical risk.
Figure 2:

Spline for the EuroSCORE II in patients with degenerative mitral regurgitation. This figure highlights in a non-linear manner the overprediction of the 30-day mortality in patients at low surgical risk.

For patients with secondary MR, the area under the ROC curve was 0.87 with an optimal cut-point of 4.53 (Fig. 3). The Somers’ Dxy index was 0.773. These 2 values indicate good discrimination. The calibration plot, shown in Fig. 4, indicates a very good calibration for low-risk patients and a tendency to overestimate the 30-day mortality in medium- to high-risk patients. The P-value for the Hosmer–Lemeshow test was 0.996 indicating high calibration of the score. Finally, the Brier score was 0.003 indicating a very good overall performance of the score.

Receiver operating characteristic curves for patients with secondary mitral regurgitation. The value of the area under the curve was 0.86 indicating a good discrimination of the EuroSCORE II in this cohort of patients at increased surgical risk (median EuroSCORE II 2.4%). AUC: area under the curve.
Figure 3:

Receiver operating characteristic curves for patients with secondary mitral regurgitation. The value of the area under the curve was 0.86 indicating a good discrimination of the EuroSCORE II in this cohort of patients at increased surgical risk (median EuroSCORE II 2.4%). AUC: area under the curve.

Spline for the EuroSCORE II in patients with secondary mitral regurgitation. This figure highlights in a non-linear manner the good calibration of the score in patients at low surgical risk, while a tendency to overestimate the mortality is evident with EuroSCORE II >5%.
Figure 4:

Spline for the EuroSCORE II in patients with secondary mitral regurgitation. This figure highlights in a non-linear manner the good calibration of the score in patients at low surgical risk, while a tendency to overestimate the mortality is evident with EuroSCORE II >5%.

The details of the performance of the EuroSCORE II in the 2 groups of patients are listed in Table 4.

Table 4:

Performance of the EuroSCORE II in the 2 groups

Degenerative MRSecondary MR
Overall performance
 Brier score0.270.003
Discrimination
 Area under the curve0.680.86
 Somers’ Dxy0.0320.773
 Youden’s Index1.614.53
Calibration
 Slope0.120.42
 Intercept0.020
 Hosmer–Lemeshow P-value<0.0010.996
Degenerative MRSecondary MR
Overall performance
 Brier score0.270.003
Discrimination
 Area under the curve0.680.86
 Somers’ Dxy0.0320.773
 Youden’s Index1.614.53
Calibration
 Slope0.120.42
 Intercept0.020
 Hosmer–Lemeshow P-value<0.0010.996

Best performance for the different tests: Brier score = 0; the closest to 1, the worse the global accuracy. Performance for the AUC: 1 excellent diagnostic test and 0.5 bad diagnostic test. Best performance for Somers’ Dxy = 1; the closest to 1, the better the discrimination. Youden’s Index defines the optimal cut-point, which is the point at which the score performances are maximized. Slope: the closet to 1, the better the calibration. Intercept: the closest to 0, the better the calibration. Hosmer–Lemeshow goodness of fit: a non-significant P-value indicates a good test.

AUC: area under the curve; MR: mitral regurgitation.

Table 4:

Performance of the EuroSCORE II in the 2 groups

Degenerative MRSecondary MR
Overall performance
 Brier score0.270.003
Discrimination
 Area under the curve0.680.86
 Somers’ Dxy0.0320.773
 Youden’s Index1.614.53
Calibration
 Slope0.120.42
 Intercept0.020
 Hosmer–Lemeshow P-value<0.0010.996
Degenerative MRSecondary MR
Overall performance
 Brier score0.270.003
Discrimination
 Area under the curve0.680.86
 Somers’ Dxy0.0320.773
 Youden’s Index1.614.53
Calibration
 Slope0.120.42
 Intercept0.020
 Hosmer–Lemeshow P-value<0.0010.996

Best performance for the different tests: Brier score = 0; the closest to 1, the worse the global accuracy. Performance for the AUC: 1 excellent diagnostic test and 0.5 bad diagnostic test. Best performance for Somers’ Dxy = 1; the closest to 1, the better the discrimination. Youden’s Index defines the optimal cut-point, which is the point at which the score performances are maximized. Slope: the closet to 1, the better the calibration. Intercept: the closest to 0, the better the calibration. Hosmer–Lemeshow goodness of fit: a non-significant P-value indicates a good test.

AUC: area under the curve; MR: mitral regurgitation.

DISCUSSION

Prediction models play an important role in current cardiac surgical practice, and the EuroSCORE II is 1 of the most used tool to assess the risk of perioperative mortality.

Although the study population used for its original development included >1900 patients who underwent mitral valve repair [1], the EuroSCORE II has never been validated in a large contemporary cohort of mitral repair patients, treated in a high-volume referral centre. In addition, its performance has never been tested separately for patients with degenerative and secondary MR, which do have very different clinical characteristics, pathology and outcomes.

The main findings of our study are the following:

  • In patients with degenerative MR, the EuroSCORE II shows low discrimination (AUC 0.68), low accuracy (Brier score 0.27) and low calibration due to overprediction of the 30-day mortality.

  • In patients with secondary MR, the EuroSCORE II shows a good overall performance estimating the 30-day mortality with good discrimination (AUC 0.88), good accuracy (Brier score 0.003) and good calibration.

  • In patients with degenerative MR and low surgical risk, operated in a high-volume centre with high level of expertise for mitral valve repair, the EuroSCORE II significantly overestimates the 30-day mortality.

The EuroSCORE II was originally developed to improve calibration of the older model, the EuroSCORE. Studies on performance of the EuroSCORE II have demonstrated a good discriminatory ability of EuroSCORE II, better than its original version, with an area under the ROC curve greater than 0.80 in many series [6, 7, 19–21]. However, its performance varies depending on the type of operation and decreases in high-risk patients [6]. So far, our finding of a significant overprediction of the mortality in patients undergoing mitral valve repair for degenerative MR has never been reported in the literature. This overestimation of the 30-day mortality is probably linked to the very low mortality observed in our series (0.39%; 10/2530), which reflects the high level of expertise of our institution in mitral repair surgery. Indeed, the strict correlation between volume load and outcomes after mitral repair has been clearly demonstrated [22, 23]. Our results are somehow in contrast with those reported by Osnabrugge et al. [7]. In their sub-group analysis including about 1000 patients who underwent mitral valve repair in several centres across the USA (not divided by MR aetiology), the EuroSCORE II showed a better performance compared to our study, with an AUC of 0.82 and a better calibration. This discrepancy in the score performance can probably be explained by the significant difference in the 30-day mortality between their series (1.32%) and ours (0.39%), despite similar median EuroSCORE II values at baseline. This typically occurs when a comparison is made between the results of a single centre with high expertise in a specific field and those coming from a large number of centres with different levels of skill in that topic.

While the performance of the EuroSCORE II was not good in our cohort of degenerative MR, on the other hand, in patients with secondary MR, the EuroSCORE II showed good discrimination, good calibration and good accuracy. It is well known that degenerative and secondary MR are 2 completely different diseases, with different patients’ profile, natural history and outcomes after interventions [24]. Not surprisingly, in our study population, there was a significant difference between secondary and degenerative MR patients in terms of risk profile (median EuroSCORE II 2.4% vs 1%) and 30-day mortality (2.6% vs 0.39%). This led to a better discrimination, calibration and accuracy of the EuroSCORE II in patients with secondary MR as compared to those with degenerative MR.

Limitations

The main limitation of our study is the missing long-term follow-up. However, our aim was to validate the EuroSCORE II, which simply predicts the 30-day mortality and not the long-term survival. Another relative limitation is the small sample size of the patients with secondary MR compared to the cohort of patients with degenerative MR. Finally, the findings of this study should not be applied to patients undergoing mitral repair for rheumatic or endocarditis aetiologies. Indeed, both of them were excluded on purpose due to a limited sample size, which, therefore, would have increased the risk of bias.

CONCLUSION

In patients with degenerative MR undergoing mitral repair, the EuroSCORE II demonstrated an overall poor performance with low discrimination, accuracy and calibration due to an overestimation of the 30-day mortality. On the other hand, in patients with secondary MR, the EuroSCORE II showed better overall performance. In high-volume centres with a high level of expertise for mitral valve repair, the EuroSCORE II significantly overpredicts the 30-day mortality in patients with degenerative MR.

ACKNOWLEDGEMENTS

The Alfieri Heart Foundation provided support for data collection and analysis of this research. We thank Davide Schiavi, clinical trial coordinator of the Alfieri Heart Foundation for the assistance with the Ethical Committee.

Conflict of interest: none declared.

Author contributions

Davide Carino: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Project administration; Resources; Validation; Visualization; Writing—original draft; Writing—review and editing. Paolo Denti: Conceptualization; Methodology; Supervision; Validation. Guido Ascione: Data curation; Formal analysis; Investigation; Software. Benedetto Del Forno: Formal analysis; Investigation; Methodology; Validation. Elisabetta Lapenna: Conceptualization; Investigation; Methodology; Supervision; Validation. Stefania Ruggeri: Formal analysis; Methodology; Software; Visualization. Eustachio Agricola: Investigation; Methodology; Resources; Validation. Nicola Buzzatti: Investigation; Methodology; Validation. Alessandro Verzini: Resources; Supervision; Validation. Roberta Meneghin: Data curation. Anna Mara Scandroglio: Investigation; Methodology; Resources; Validation. Fabrizio Monaco: Investigation; Methodology; Resources; Validation. Alessandro Castiglioni: Funding acquisition; Resources; Supervision; Validation. Ottavio Alfieri: Funding acquisition; Project administration; Supervision; Validation; Visualization. Michele De Bonis: Conceptualization; Funding acquisition; Investigation; Methodology; Project administration; Resources; Supervision; Validation; Writing—review and editing.

Reviewer information

European Journal of Cardio-Thoracic Surgery thanks the anonymous reviewer(s) for their contribution to the peer review process of this article.

Presented at the 34th Annual Meeting of the European Association for Cardio-Thoracic Surgery, Barcelona, Spain, 8-10 October 2020.

REFERENCES

1

Nashef
SAM
,
Roques
F
,
Sharples
LD
,
Nilsson
J
,
Smith
C
,
Goldstone
AR
et al.
Euroscore II
.
Eur J Cardiothorac Surg
2012
;
41
:
734
45
.

2

Dedda
UD
,
Pelissero
G
,
Agnelli
B
,
Vincentiis
CD
,
Castelvecchio
S
,
Ranucci
M.
Accuracy, calibration and clinical performance of the new EuroSCORE II risk stratification system
.
Eur J Cardiothorac Surg
2013
;
43
:
27
32
.

3

Chalmers
J
,
Pullan
M
,
Fabri
B
,
McShane
J
,
Shaw
M
,
Mediratta
N
et al.
Validation of EuroSCORE II in a modern cohort of patients undergoing cardiac surgery
.
Eur J Cardiothorac Surg
2013
;
43
:
688
94
.

4

Grant
SW
,
Hickey
GL
,
Dimarakis
I
,
Trivedi
U
,
Bryan
A
,
Treasure
T
et al.
How does EuroSCORE II perform in UK cardiac surgery; an analysis of 23 740 patients from the Society for Cardiothoracic Surgery in Great Britain and Ireland National Database
.
Heart
2012
;
98
:
1568
72
.

5

Barili
F
,
Pacini
D
,
Capo
A
,
Rasovic
O
,
Grossi
C
,
Alamanni
F
et al.
Does EuroSCORE II perform better than its original versions? A multicentre validation study
.
Eur Heart J
2013
;
34
:
22
9
.

6

Guida
P
,
Mastro
F
,
Scrascia
G
,
Whitlock
R
,
Paparella
D.
Performance of the European System for Cardiac Operative Risk Evaluation II: a meta-analysis of 22 studies involving 145,592 cardiac surgery procedures
.
J Thorac Cardiovasc Surg
2014
;
148
:
3049
57.e1
.

7

Osnabrugge
RL
,
Speir
AM
,
Head
SJ
,
Fonner
CE
,
Fonner
E
,
Kappetein
AP
et al.
Performance of EuroSCORE II in a large US database: implications for transcatheter aortic valve implantation
.
Eur J Cardiothorac Surg
2014
;
46
:
400
8
.

8

Biancari
F
,
Vasques
F
,
Mikkola
R
,
Martin
M
,
Lahtinen
J
,
Heikkinen
J.
Validation of EuroSCORE II in patients undergoing coronary artery bypass surgery
.
Ann Thorac Surg
2012
;
93
:
1930
5
.

9

Wang
TKM
,
Choi
DHM
,
Stewart
R
,
Gamble
G
,
Haydock
D
,
Ruygrok
P.
Comparison of four contemporary risk models at predicting mortality after aortic valve replacement
.
J Thorac Cardiovasc Surg
2015
;
149
:
443
8
.

10

Duchnowski
P
,
Hryniewiecki
T
,
Kuśmierczyk
M
,
Szymanski
P.
Performance of the EuroSCORE II and the society of thoracic surgeons score in patients undergoing aortic valve replacement for aortic stenosis
.
J Thorac Dis
2019
;
11
:
2076
81
.

11

Barili
F
,
Pacini
D
,
Capo
A
,
Ardemagni
E
,
Pellicciari
G
,
Zanobini
M
et al.
Reliability of new scores in predicting perioperative mortality after isolated aortic valve surgery: a comparison with the society of thoracic surgeons score and logistic EuroSCORE
.
Ann Thorac Surg
2013
;
95
:
1539
44
.

12

Wang
L
,
Han
QQ
,
Qiao
F
,
Wang
C
,
Zhang
XW
,
Han
L
et al.
Performance of EuroSCORE II in patients who have undergone heart valve surgery: a multicentre study in a Chinese population
.
Eur J Cardiothorac Surg
2014
;
45
:
359
64
.

13

Barili
F
,
Pacini
D
,
Grossi
C
,
Bartolomeo
RD
,
Alamanni
F
,
Parolari
A.
Reliability of new scores in predicting perioperative mortality after mitral valve surgery
.
J Thorac Cardiovasc Surg
2014
;
147
:
1008
12
.

14

Moscarelli
M
,
Bianchi
G
,
Margaryan
R
,
Cerillo
A
,
Farneti
P
,
Murzi
M
et al.
Accuracy of EuroSCORE II in patients undergoing minimally invasive mitral valve surgery
.
Interact Cardiovasc Thorac Surg
2015
;
21
:
748
53
.

15

Akins
CW
,
Miller
DC
,
Turina
MI
,
Kouchoukos
NT
,
Blackstone
EH
,
Grunkemeier
GL
et al.
Guidelines for reporting mortality and morbidity after cardiac valve interventions
.
Eur J Cardiothorac Surg
2008
;
33
:
523
8
.

16

Cook
NR.
Use and misuse of the receiver operating characteristic curve in risk prediction
.
Circulation
2007
;
115
:
928
35
.

17

Helmreich
JE
, Regression modeling strategies with applications to linear models, logistic and ordinal regression and survival analysis (2nd edition).
J Stat Softw
2016
;
70
:
3
5
.

18

Steyerberg
EW
,
Vickers
AJ
,
Cook
NR
,
Gerds
T
,
Gonen
M
,
Obuchowski
N
et al.
Assessing the performance of prediction models: a framework for traditional and novel measures
.
Epidemiology
2010
;
21
:
128
38
.

19

Paparella
D
,
Guida
P
,
Eusanio
GD
,
Caparrotti
S
,
Gregorini
R
,
Cassese
M
et al.
Risk stratification for in-hospital mortality after cardiac surgery: external validation of EuroSCORE II in a prospective regional registry
.
Eur J Cardiothorac Surg
2014
;
46
:
840
8
.

20

Carnero-Alcázar
M
,
Guisasola
JAS
,
Lacruz
FJR
,
Castellanos
LCM
,
Carnicer
JC
,
Medinilla
EV
et al.
Validation of EuroSCORE II on a single-centre 3800 patient cohort
.
Interact Cardiovasc Thorac Surg
2013
;
16
:
293
300
.

21

Kirmani
BH
,
Mazhar
K
,
Fabri
BM
,
Pullan
DM.
Comparison of the EuroSCORE II and Society of Thoracic Surgeons 2008 risk tools
.
Eur J Cardiothorac Surg
2013
;
44
:
999
1005
.

22

El-Eshmawi
A
,
Castillo
JG
,
Tang
GHL
,
Adams
DH.
Developing a mitral valve center of excellence
.
Curr Opin Cardiol
2018
;
33
:
155
61
.

23

Bonow
RO
,
Adams
DH.
The time has come to define centers of excellence in mitral valve repair
.
J Am Coll Cardiol
2016
;
67
:
499
501
.

24

Buzzatti
N
,
Denti
P
,
Scarfò
IS
,
Giambuzzi
I
,
Schiavi
D
,
Ruggeri
S
et al.
Mid-term outcomes (up to 5 years) of percutaneous edge-to-edge mitral repair in the real-world according to regurgitation mechanism: a single-center experience
.
Catheter Cardiovasc Interv
2019
;
94
:
427
35
.

ABBREVIATIONS

     
  • AUC

    Area under the curve

  •  
  • MR

    Mitral regurgitation

  •  
  • ROC

    Receiver operating characteristic

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)