Table 5.

Results of univariate-feature selection (left of each table cell) and double-feature selection (right of each table cell). Based on the GS, the performance metrics of our models were calculated with four types of sampling: the ordinary training set (denoted ‘NonSampling’), the SMOTE training set (‘SMOTE’), the undersampled training set (‘UnderSample’), and the oversampled training set (‘OverSample’). The selected features depend on the classifier used as well as on the sampling type. Further analysis is presented in Section 4.1.2.

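| Method | Sampling70 | Feature ID | Recall (per cent) | Precision (per cent) | F1 Score (per cent) | FPR (per cent) |
|---|---|---|---|---|---|---|
| DT | NonSampling | 10/3,10 | 80/91 | 94/93 | 86/92 | 0.062/0.093 |
| DT | SMOTE | 5/1,10 | 98/98 | 32/68 | 47/80 | 2.718/0.062 |
| DT | UnderSample | 2/1,9 | 93/96 | 31/46 | 45/70 | 2.796/1.040 |
| DT | OverSample | 2/2,10 | 92/95 | 41/80 | 56/87 | 2.784/0.307 |
| AdaBoost | NonSampling | 10/5,10 | 78/90 | 97/93 | 86/91 | 0.003/0.096 |
| AdaBoost | SMOTE | 10/1,10 | 93/99 | 34/67 | 49/80 | 2.433/0.651 |
| AdaBoost | UnderSample | 2/1,10 | 93/99 | 31/58 | 47/73 | 2.738/0.962 |
| AdaBoost | OverSample | 2/2,10 | 93/97 | 34/64 | 50/77 | 2.405/0.733 |
| XGBoost | NonSampling | 10/2,10 | 78/91 | 94/93 | 86/92 | 0.064/0.056 |
| XGBoost | SMOTE | 10/1,10 | 93/98 | 34/68 | 49/80 | 2.433/0.622 |
| XGBoost | UnderSample | 5/1,10 | 98/99 | 30/59 | 46/74 | 3.007/0.925 |
| XGBoost | OverSample | 10/6,10 | 91/95 | 45/82 | 60/88 | 1.469/0.278 |
| GBoost | NonSampling | 10/2,10 | 79/91 | 13/93 | 22/92 | 6.958/0.096 |
| GBoost | SMOTE | 5/1,10 | 98/98 | 32/69 | 48/81 | 2.762/0.593 |
| GBoost | UnderSample | 5/1,10 | 95/99 | 27/58 | 42/73 | 3.429/0.964 |
| GBoost | OverSample | 10/2,10 | 90/94 | 25/89 | 48/91 | 3.762/0.156 |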
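
For readers who want to relate the four reported columns to one another, the following is a minimal sketch of the standard definitions of recall, precision, F1 score, and false positive rate (FPR), expressed in per cent. It assumes the usual confusion-matrix formulation; the function name `metrics_percent` and the example counts are hypothetical and are not taken from the experiments summarised in Table 5.

```python
def metrics_percent(tp, fp, tn, fn):
    """Return recall, precision, F1 and FPR in per cent from confusion counts.

    tp/fp/tn/fn are true positives, false positives, true negatives and
    false negatives. Ratios with an empty denominator are reported as 0.
    """
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return {
        "recall": 100.0 * recall,
        "precision": 100.0 * precision,
        "f1": 100.0 * f1,
        "fpr": 100.0 * fpr,
    }


# Hypothetical counts for a heavily imbalanced test set (illustration only).
example = metrics_percent(tp=80, fp=5, tn=9895, fn=20)
for name, value in example.items():
    print(f"{name}: {value:.3f}")
```

Because FPR is normalised by the number of negatives, it can remain a small fraction of a per cent on an imbalanced test set even when precision is low, so the two columns are best read together.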