Performance of the diagnostic system on the external test datasets and its comparison with radiologists. (A) ROC curve for the CNN model detecting liver masses from liver images on the external test dataset (Foshan). (B) ROC curve for the LMC-Net model classifying benign versus malignant masses on the external test dataset (Foshan). The results include the mean diagnostic accuracies of junior, mid-level, and senior radiologists, as well as the consensus decision reached by the radiologists and the AI model. (C) ROC curve for the LM-Net classifying benign versus malignant masses using images from the Yichang cohort.