Table 6.

The results of multiple-feature selection based on RFE. The values on the left/right of each table cell are the results of three-feature/four-feature selection. The performance metrics of our models were calculated with four types of sampling, including ordinary training set (noted as ‘NonSampling’), SMOTE training set (’SMOTE’), under-sampling training set (’UnderSample’), and oversampling training set (’OverSample’). Selected features depend on the used classifiers as well as the sampling types. Detailed analyses are made in Section 4.2.2.

MethodSamplingFeature IDRecall (per cent)Precision (per cent)F1 Score (per cent)FPR (per cent)
DTNonSampling3,5,10/2,3,5,1095/9597/9896/960.033/0.031
SMOTE2,5,10/2,5,10,1198/9681/8589/900.307/0.222
UnderSample2,5,10/2,5,10,1198/9851/5167/671.260/1.260
OverSample1,10,14/1,4,10,1497/9675/8485/890.420/0.289
AdaBoostNonSampling3,5,9/3,5,9,1695/9599/9997/970.015/0.011
SMOTE3,5,9/3,5,9,1099/9978/8487/910.360/0.258
UnderSample3,5,6/3,5,6,1099/9947/6064/751.491/0.876
OverSample3,5,9/3,5,9,1699/9973/7484/850.493/0.460
XGBoostNonSampling3,5,10/3,5,81097/9797/9797/970.033/0.033
SMOTE3,5,10/2,3,5,1096/9947/8863/931.449/0.169
UnderSample3,5,10/2,3,5,1099/9960/6175/750.878/0.849
OverSample3,5,10/3,5,7,1098/9895/9496/960.073/0.078
GBoostNonSampling3,8,10/3,8,10,1196/9698/9797/970.031/0.040
SMOTE3,5,10/3,5,10,1199/9989/9394/950.162/0.102
UnderSample5,8,10/1,5,8,1098/9853/5369/691.158/1.158
OverSample3,5,10/3,5,10,1498/9795/9696/970.071 /0.031
RFNonSampling1,9,10,159498960.031
SMOTE1,5,8,109876850.420
UnderSample1,5,8,109859730.913
OverSample1,5,8,109680870.318
MethodSamplingFeature IDRecall (per cent)Precision (per cent)F1 Score (per cent)FPR (per cent)
DTNonSampling3,5,10/2,3,5,1095/9597/9896/960.033/0.031
SMOTE2,5,10/2,5,10,1198/9681/8589/900.307/0.222
UnderSample2,5,10/2,5,10,1198/9851/5167/671.260/1.260
OverSample1,10,14/1,4,10,1497/9675/8485/890.420/0.289
AdaBoostNonSampling3,5,9/3,5,9,1695/9599/9997/970.015/0.011
SMOTE3,5,9/3,5,9,1099/9978/8487/910.360/0.258
UnderSample3,5,6/3,5,6,1099/9947/6064/751.491/0.876
OverSample3,5,9/3,5,9,1699/9973/7484/850.493/0.460
XGBoostNonSampling3,5,10/3,5,81097/9797/9797/970.033/0.033
SMOTE3,5,10/2,3,5,1096/9947/8863/931.449/0.169
UnderSample3,5,10/2,3,5,1099/9960/6175/750.878/0.849
OverSample3,5,10/3,5,7,1098/9895/9496/960.073/0.078
GBoostNonSampling3,8,10/3,8,10,1196/9698/9797/970.031/0.040
SMOTE3,5,10/3,5,10,1199/9989/9394/950.162/0.102
UnderSample5,8,10/1,5,8,1098/9853/5369/691.158/1.158
OverSample3,5,10/3,5,10,1498/9795/9696/970.071 /0.031
RFNonSampling1,9,10,159498960.031
SMOTE1,5,8,109876850.420
UnderSample1,5,8,109859730.913
OverSample1,5,8,109680870.318
Table 6.

The results of multiple-feature selection based on RFE. The values on the left/right of each table cell are the results of three-feature/four-feature selection. The performance metrics of our models were calculated with four types of sampling, including ordinary training set (noted as ‘NonSampling’), SMOTE training set (’SMOTE’), under-sampling training set (’UnderSample’), and oversampling training set (’OverSample’). Selected features depend on the used classifiers as well as the sampling types. Detailed analyses are made in Section 4.2.2.

MethodSamplingFeature IDRecall (per cent)Precision (per cent)F1 Score (per cent)FPR (per cent)
DTNonSampling3,5,10/2,3,5,1095/9597/9896/960.033/0.031
SMOTE2,5,10/2,5,10,1198/9681/8589/900.307/0.222
UnderSample2,5,10/2,5,10,1198/9851/5167/671.260/1.260
OverSample1,10,14/1,4,10,1497/9675/8485/890.420/0.289
AdaBoostNonSampling3,5,9/3,5,9,1695/9599/9997/970.015/0.011
SMOTE3,5,9/3,5,9,1099/9978/8487/910.360/0.258
UnderSample3,5,6/3,5,6,1099/9947/6064/751.491/0.876
OverSample3,5,9/3,5,9,1699/9973/7484/850.493/0.460
XGBoostNonSampling3,5,10/3,5,81097/9797/9797/970.033/0.033
SMOTE3,5,10/2,3,5,1096/9947/8863/931.449/0.169
UnderSample3,5,10/2,3,5,1099/9960/6175/750.878/0.849
OverSample3,5,10/3,5,7,1098/9895/9496/960.073/0.078
GBoostNonSampling3,8,10/3,8,10,1196/9698/9797/970.031/0.040
SMOTE3,5,10/3,5,10,1199/9989/9394/950.162/0.102
UnderSample5,8,10/1,5,8,1098/9853/5369/691.158/1.158
OverSample3,5,10/3,5,10,1498/9795/9696/970.071 /0.031
RFNonSampling1,9,10,159498960.031
SMOTE1,5,8,109876850.420
UnderSample1,5,8,109859730.913
OverSample1,5,8,109680870.318
MethodSamplingFeature IDRecall (per cent)Precision (per cent)F1 Score (per cent)FPR (per cent)
DTNonSampling3,5,10/2,3,5,1095/9597/9896/960.033/0.031
SMOTE2,5,10/2,5,10,1198/9681/8589/900.307/0.222
UnderSample2,5,10/2,5,10,1198/9851/5167/671.260/1.260
OverSample1,10,14/1,4,10,1497/9675/8485/890.420/0.289
AdaBoostNonSampling3,5,9/3,5,9,1695/9599/9997/970.015/0.011
SMOTE3,5,9/3,5,9,1099/9978/8487/910.360/0.258
UnderSample3,5,6/3,5,6,1099/9947/6064/751.491/0.876
OverSample3,5,9/3,5,9,1699/9973/7484/850.493/0.460
XGBoostNonSampling3,5,10/3,5,81097/9797/9797/970.033/0.033
SMOTE3,5,10/2,3,5,1096/9947/8863/931.449/0.169
UnderSample3,5,10/2,3,5,1099/9960/6175/750.878/0.849
OverSample3,5,10/3,5,7,1098/9895/9496/960.073/0.078
GBoostNonSampling3,8,10/3,8,10,1196/9698/9797/970.031/0.040
SMOTE3,5,10/3,5,10,1199/9989/9394/950.162/0.102
UnderSample5,8,10/1,5,8,1098/9853/5369/691.158/1.158
OverSample3,5,10/3,5,10,1498/9795/9696/970.071 /0.031
RFNonSampling1,9,10,159498960.031
SMOTE1,5,8,109876850.420
UnderSample1,5,8,109859730.913
OverSample1,5,8,109680870.318
Close
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close

This PDF is available to Subscribers Only

View Article Abstract & Purchase Options

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Close