MSDAFL: molecular substructure-based dual attention feature learning framework for predicting drug–drug interactions

Comparison of MSDAFL with other DDI prediction methods on training, validation, and testing sets in a ratio of 6:2:2.^a

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9618 ± 0.0025	0.8442 ± 0.0121	0.9083 ± 0.0066	0.9535 ± 0.0020	0.9644 ± 0.0035	0.9314 ± 0.0029	0.9171 ± 0.0009	0.9320 ± 0.0023	0.9113 ± 0.0015	0.9858 ± 0.0021	0.9874 ± 0.0011
	AP	0.9263 ± 0.0030	0.8020 ± 0.0157	0.8896 ± 0.0088	0.9233 ± 0.0023	0.9309 ± 0.0053	0.9209 ± 0.0039	0.8902 ± 0.0073	0.9208 ± 0.0031	0.8642 ± 0.0030	0.9706 ± 0.0038	0.9707 ± 0.0018
	F1	0.8293 ± 0.0081	0.7186 ± 0.0271	0.8007 ± 0.0086	0.8289 ± 0.0027	0.8516 ± 0.0027	0.8196 ± 0.0124	0.8360 ± 0.0073	0.8279 ± 0.0042	0.8768 ± 0.0040	0.9219 ± 0.0056	0.9005 ± 0.0014
	ACC	0.9190 ± 0.0050	0.7578 ± 0.0107	0.8240 ± 0.0104	0.8567 ± 0.0033	0.9316 ± 0.0016	0.8535 ± 0.0050	0.8414 ± 0.0045	0.8563 ± 0.0028	0.8665 ± 0.0046	0.9659 ± 0.0024	0.9533 ± 0.0026
ChCh-Mainer	AUROC	0.9311 ± 0.0036	0.7865 ± 0.0056	0.9423 ± 0.0071	0.9838 ± 0.0010	0.9620 ± 0.0079	0.9809 ± 0.0014	0.9768 ± 0.0010	0.9710 ± 0.0018	0.9669 ± 0.0020	0.9906 ± 0.0015	0.9934 ± 0.0031
	AP	0.9595 ± 0.0019	0.8631 ± 0.0054	0.9680 ± 0.0040	0.9916 ± 0.0005	0.9950 ± 0.0011	0.9897 ± 0.0006	0.9756 ± 0.0016	0.9851 ± 0.0008	0.9634 ± 0.0027	0.9987 ± 0.0002	0.9991 ± 0.0007
	F1	0.8813 ± 0.0072	0.8087 ± 0.0092	0.8941 ± 0.0066	0.9467 ± 0.0026	0.9455 ± 0.0066	0.9398 ± 0.0034	0.9247 ± 0.0022	0.9221 ± 0.0063	0.8812 ± 0.0064	0.9748 ± 0.0019	0.9932 ± 0.0044
	ACC	0.8503 ± 0.0062	0.7307 ± 0.0080	0.8664 ± 0.0098	0.9318 ± 0.0035	0.9077 ± 0.0011	0.9219 ± 0.0048	0.9254 ± 0.0017	0.9038 ± 0.0064	0.8889 ± 0.0042	0.9561 ± 0.0032	0.9830 ± 0.0012
DeepDDI	AUROC	0.9335 ± 0.0017	0.7719 ± 0.0063	0.8593 ± 0.0024	0.9174 ± 0.0014	0.9276 ± 0.0038	0.9179 ± 0.0048	0.9401 ± 0.0025	0.9438 ± 0.0063	0.9322 ± 0.0010	0.9449 ± 0.0020	0.9974 ± 0.0011
	AP	0.9456 ± 0.0009	0.8170 ± 0.0060	0.8872 ± 0.0012	0.9299 ± 0.0018	0.9677 ± 0.0018	0.9347 ± 0.0044	0.9417 ± 0.0030	0.9568 ± 0.0056	0.9287 ± 0.0015	0.9741 ± 0.0010	0.9987 ± 0.0012
	F1	0.9007 ± 0.0049	0.8010 ± 0.0026	0.8486 ± 0.0038	0.8939 ± 0.0009	0.9354 ± 0.0070	0.8823 ± 0.0049	0.8601 ± 0.0063	0.9127 ± 0.0054	0.8560 ± 0.0015	0.9478 ± 0.0027	0.9911 ± 0.0043
	ACC	0.8754 ± 0.0043	0.7294 ± 0.0049	0.8022 ± 0.0039	0.8628 ± 0.0012	0.9033 ± 0.0098	0.8538 ± 0.0059	0.8633 ± 0.0036	0.8887 ± 0.0068	0.8541 ± 0.0009	0.9208 ± 0.0037	0.9866 ± 0.0024

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9618 ± 0.0025	0.8442 ± 0.0121	0.9083 ± 0.0066	0.9535 ± 0.0020	0.9644 ± 0.0035	0.9314 ± 0.0029	0.9171 ± 0.0009	0.9320 ± 0.0023	0.9113 ± 0.0015	0.9858 ± 0.0021	0.9874 ± 0.0011
	AP	0.9263 ± 0.0030	0.8020 ± 0.0157	0.8896 ± 0.0088	0.9233 ± 0.0023	0.9309 ± 0.0053	0.9209 ± 0.0039	0.8902 ± 0.0073	0.9208 ± 0.0031	0.8642 ± 0.0030	0.9706 ± 0.0038	0.9707 ± 0.0018
	F1	0.8293 ± 0.0081	0.7186 ± 0.0271	0.8007 ± 0.0086	0.8289 ± 0.0027	0.8516 ± 0.0027	0.8196 ± 0.0124	0.8360 ± 0.0073	0.8279 ± 0.0042	0.8768 ± 0.0040	0.9219 ± 0.0056	0.9005 ± 0.0014
	ACC	0.9190 ± 0.0050	0.7578 ± 0.0107	0.8240 ± 0.0104	0.8567 ± 0.0033	0.9316 ± 0.0016	0.8535 ± 0.0050	0.8414 ± 0.0045	0.8563 ± 0.0028	0.8665 ± 0.0046	0.9659 ± 0.0024	0.9533 ± 0.0026
ChCh-Mainer	AUROC	0.9311 ± 0.0036	0.7865 ± 0.0056	0.9423 ± 0.0071	0.9838 ± 0.0010	0.9620 ± 0.0079	0.9809 ± 0.0014	0.9768 ± 0.0010	0.9710 ± 0.0018	0.9669 ± 0.0020	0.9906 ± 0.0015	0.9934 ± 0.0031
	AP	0.9595 ± 0.0019	0.8631 ± 0.0054	0.9680 ± 0.0040	0.9916 ± 0.0005	0.9950 ± 0.0011	0.9897 ± 0.0006	0.9756 ± 0.0016	0.9851 ± 0.0008	0.9634 ± 0.0027	0.9987 ± 0.0002	0.9991 ± 0.0007
	F1	0.8813 ± 0.0072	0.8087 ± 0.0092	0.8941 ± 0.0066	0.9467 ± 0.0026	0.9455 ± 0.0066	0.9398 ± 0.0034	0.9247 ± 0.0022	0.9221 ± 0.0063	0.8812 ± 0.0064	0.9748 ± 0.0019	0.9932 ± 0.0044
	ACC	0.8503 ± 0.0062	0.7307 ± 0.0080	0.8664 ± 0.0098	0.9318 ± 0.0035	0.9077 ± 0.0011	0.9219 ± 0.0048	0.9254 ± 0.0017	0.9038 ± 0.0064	0.8889 ± 0.0042	0.9561 ± 0.0032	0.9830 ± 0.0012
DeepDDI	AUROC	0.9335 ± 0.0017	0.7719 ± 0.0063	0.8593 ± 0.0024	0.9174 ± 0.0014	0.9276 ± 0.0038	0.9179 ± 0.0048	0.9401 ± 0.0025	0.9438 ± 0.0063	0.9322 ± 0.0010	0.9449 ± 0.0020	0.9974 ± 0.0011
	AP	0.9456 ± 0.0009	0.8170 ± 0.0060	0.8872 ± 0.0012	0.9299 ± 0.0018	0.9677 ± 0.0018	0.9347 ± 0.0044	0.9417 ± 0.0030	0.9568 ± 0.0056	0.9287 ± 0.0015	0.9741 ± 0.0010	0.9987 ± 0.0012
	F1	0.9007 ± 0.0049	0.8010 ± 0.0026	0.8486 ± 0.0038	0.8939 ± 0.0009	0.9354 ± 0.0070	0.8823 ± 0.0049	0.8601 ± 0.0063	0.9127 ± 0.0054	0.8560 ± 0.0015	0.9478 ± 0.0027	0.9911 ± 0.0043
	ACC	0.8754 ± 0.0043	0.7294 ± 0.0049	0.8022 ± 0.0039	0.8628 ± 0.0012	0.9033 ± 0.0098	0.8538 ± 0.0059	0.8633 ± 0.0036	0.8887 ± 0.0068	0.8541 ± 0.0009	0.9208 ± 0.0037	0.9866 ± 0.0024

The superior results are emphasized in bold, while the second-best results are underlined.

Table 1.

Comparison of MSDAFL with other DDI prediction methods on training, validation, and testing sets in a ratio of 6:2:2.^a

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9618 ± 0.0025	0.8442 ± 0.0121	0.9083 ± 0.0066	0.9535 ± 0.0020	0.9644 ± 0.0035	0.9314 ± 0.0029	0.9171 ± 0.0009	0.9320 ± 0.0023	0.9113 ± 0.0015	0.9858 ± 0.0021	0.9874 ± 0.0011
	AP	0.9263 ± 0.0030	0.8020 ± 0.0157	0.8896 ± 0.0088	0.9233 ± 0.0023	0.9309 ± 0.0053	0.9209 ± 0.0039	0.8902 ± 0.0073	0.9208 ± 0.0031	0.8642 ± 0.0030	0.9706 ± 0.0038	0.9707 ± 0.0018
	F1	0.8293 ± 0.0081	0.7186 ± 0.0271	0.8007 ± 0.0086	0.8289 ± 0.0027	0.8516 ± 0.0027	0.8196 ± 0.0124	0.8360 ± 0.0073	0.8279 ± 0.0042	0.8768 ± 0.0040	0.9219 ± 0.0056	0.9005 ± 0.0014
	ACC	0.9190 ± 0.0050	0.7578 ± 0.0107	0.8240 ± 0.0104	0.8567 ± 0.0033	0.9316 ± 0.0016	0.8535 ± 0.0050	0.8414 ± 0.0045	0.8563 ± 0.0028	0.8665 ± 0.0046	0.9659 ± 0.0024	0.9533 ± 0.0026
ChCh-Mainer	AUROC	0.9311 ± 0.0036	0.7865 ± 0.0056	0.9423 ± 0.0071	0.9838 ± 0.0010	0.9620 ± 0.0079	0.9809 ± 0.0014	0.9768 ± 0.0010	0.9710 ± 0.0018	0.9669 ± 0.0020	0.9906 ± 0.0015	0.9934 ± 0.0031
	AP	0.9595 ± 0.0019	0.8631 ± 0.0054	0.9680 ± 0.0040	0.9916 ± 0.0005	0.9950 ± 0.0011	0.9897 ± 0.0006	0.9756 ± 0.0016	0.9851 ± 0.0008	0.9634 ± 0.0027	0.9987 ± 0.0002	0.9991 ± 0.0007
	F1	0.8813 ± 0.0072	0.8087 ± 0.0092	0.8941 ± 0.0066	0.9467 ± 0.0026	0.9455 ± 0.0066	0.9398 ± 0.0034	0.9247 ± 0.0022	0.9221 ± 0.0063	0.8812 ± 0.0064	0.9748 ± 0.0019	0.9932 ± 0.0044
	ACC	0.8503 ± 0.0062	0.7307 ± 0.0080	0.8664 ± 0.0098	0.9318 ± 0.0035	0.9077 ± 0.0011	0.9219 ± 0.0048	0.9254 ± 0.0017	0.9038 ± 0.0064	0.8889 ± 0.0042	0.9561 ± 0.0032	0.9830 ± 0.0012
DeepDDI	AUROC	0.9335 ± 0.0017	0.7719 ± 0.0063	0.8593 ± 0.0024	0.9174 ± 0.0014	0.9276 ± 0.0038	0.9179 ± 0.0048	0.9401 ± 0.0025	0.9438 ± 0.0063	0.9322 ± 0.0010	0.9449 ± 0.0020	0.9974 ± 0.0011
	AP	0.9456 ± 0.0009	0.8170 ± 0.0060	0.8872 ± 0.0012	0.9299 ± 0.0018	0.9677 ± 0.0018	0.9347 ± 0.0044	0.9417 ± 0.0030	0.9568 ± 0.0056	0.9287 ± 0.0015	0.9741 ± 0.0010	0.9987 ± 0.0012
	F1	0.9007 ± 0.0049	0.8010 ± 0.0026	0.8486 ± 0.0038	0.8939 ± 0.0009	0.9354 ± 0.0070	0.8823 ± 0.0049	0.8601 ± 0.0063	0.9127 ± 0.0054	0.8560 ± 0.0015	0.9478 ± 0.0027	0.9911 ± 0.0043
	ACC	0.8754 ± 0.0043	0.7294 ± 0.0049	0.8022 ± 0.0039	0.8628 ± 0.0012	0.9033 ± 0.0098	0.8538 ± 0.0059	0.8633 ± 0.0036	0.8887 ± 0.0068	0.8541 ± 0.0009	0.9208 ± 0.0037	0.9866 ± 0.0024

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9618 ± 0.0025	0.8442 ± 0.0121	0.9083 ± 0.0066	0.9535 ± 0.0020	0.9644 ± 0.0035	0.9314 ± 0.0029	0.9171 ± 0.0009	0.9320 ± 0.0023	0.9113 ± 0.0015	0.9858 ± 0.0021	0.9874 ± 0.0011
	AP	0.9263 ± 0.0030	0.8020 ± 0.0157	0.8896 ± 0.0088	0.9233 ± 0.0023	0.9309 ± 0.0053	0.9209 ± 0.0039	0.8902 ± 0.0073	0.9208 ± 0.0031	0.8642 ± 0.0030	0.9706 ± 0.0038	0.9707 ± 0.0018
	F1	0.8293 ± 0.0081	0.7186 ± 0.0271	0.8007 ± 0.0086	0.8289 ± 0.0027	0.8516 ± 0.0027	0.8196 ± 0.0124	0.8360 ± 0.0073	0.8279 ± 0.0042	0.8768 ± 0.0040	0.9219 ± 0.0056	0.9005 ± 0.0014
	ACC	0.9190 ± 0.0050	0.7578 ± 0.0107	0.8240 ± 0.0104	0.8567 ± 0.0033	0.9316 ± 0.0016	0.8535 ± 0.0050	0.8414 ± 0.0045	0.8563 ± 0.0028	0.8665 ± 0.0046	0.9659 ± 0.0024	0.9533 ± 0.0026
ChCh-Mainer	AUROC	0.9311 ± 0.0036	0.7865 ± 0.0056	0.9423 ± 0.0071	0.9838 ± 0.0010	0.9620 ± 0.0079	0.9809 ± 0.0014	0.9768 ± 0.0010	0.9710 ± 0.0018	0.9669 ± 0.0020	0.9906 ± 0.0015	0.9934 ± 0.0031
	AP	0.9595 ± 0.0019	0.8631 ± 0.0054	0.9680 ± 0.0040	0.9916 ± 0.0005	0.9950 ± 0.0011	0.9897 ± 0.0006	0.9756 ± 0.0016	0.9851 ± 0.0008	0.9634 ± 0.0027	0.9987 ± 0.0002	0.9991 ± 0.0007
	F1	0.8813 ± 0.0072	0.8087 ± 0.0092	0.8941 ± 0.0066	0.9467 ± 0.0026	0.9455 ± 0.0066	0.9398 ± 0.0034	0.9247 ± 0.0022	0.9221 ± 0.0063	0.8812 ± 0.0064	0.9748 ± 0.0019	0.9932 ± 0.0044
	ACC	0.8503 ± 0.0062	0.7307 ± 0.0080	0.8664 ± 0.0098	0.9318 ± 0.0035	0.9077 ± 0.0011	0.9219 ± 0.0048	0.9254 ± 0.0017	0.9038 ± 0.0064	0.8889 ± 0.0042	0.9561 ± 0.0032	0.9830 ± 0.0012
DeepDDI	AUROC	0.9335 ± 0.0017	0.7719 ± 0.0063	0.8593 ± 0.0024	0.9174 ± 0.0014	0.9276 ± 0.0038	0.9179 ± 0.0048	0.9401 ± 0.0025	0.9438 ± 0.0063	0.9322 ± 0.0010	0.9449 ± 0.0020	0.9974 ± 0.0011
	AP	0.9456 ± 0.0009	0.8170 ± 0.0060	0.8872 ± 0.0012	0.9299 ± 0.0018	0.9677 ± 0.0018	0.9347 ± 0.0044	0.9417 ± 0.0030	0.9568 ± 0.0056	0.9287 ± 0.0015	0.9741 ± 0.0010	0.9987 ± 0.0012
	F1	0.9007 ± 0.0049	0.8010 ± 0.0026	0.8486 ± 0.0038	0.8939 ± 0.0009	0.9354 ± 0.0070	0.8823 ± 0.0049	0.8601 ± 0.0063	0.9127 ± 0.0054	0.8560 ± 0.0015	0.9478 ± 0.0027	0.9911 ± 0.0043
	ACC	0.8754 ± 0.0043	0.7294 ± 0.0049	0.8022 ± 0.0039	0.8628 ± 0.0012	0.9033 ± 0.0098	0.8538 ± 0.0059	0.8633 ± 0.0036	0.8887 ± 0.0068	0.8541 ± 0.0009	0.9208 ± 0.0037	0.9866 ± 0.0024

The superior results are emphasized in bold, while the second-best results are underlined.

In addition, we also compare MSDAFL and other baseline methods across these datasets on training, validation and testing sets in a ratio of 8:1:1. The results are shown in the Table 2. The MSADFL model demonstrates improved performance on the ZhangDDI dataset. For the ChCh-Miner and DeepDDI datasets, the model’s performance remains stable across all four metrics. This stability highlights the strong capabilities of our model’s self-attention mechanism and interaction attention. In contrast, the HTCL-DDI model shows a decline in performance across all datasets, particularly on the DeepDDI dataset. This decline underscores the limitations of the approach that leverages existing drug interaction networks, where drugs are nodes and their interactions are edges, which can negatively impact drug interaction prediction. Our model outperforms others across all three datasets, indicating that it surpasses other models in predictive performance. Furthermore, to assess the generalization performance of the MSADFL model, we further conduct cross-datum study experiments on three datasets: ZhangDDI, DeepDDI, and ChCh-Miner. The experimental results of MSADFL and HTCL-DDI are presented in Supplementary Tables S2 and S3, respectively. Compared to previous experiments across three datasets with varying scales, the prediction performances of MSDAFL and HTCL-DDI are declined. Notably, when DeepDDI is used as the training set and ChCh-Miner as the test set, the AUC reaches 0.8227, highlighting the model’s robust generalization ability.

Table 2.

Comparison of MSDAFL with other DDI prediction methods on training, validation and testing sets in a ratio of 8:1:1.^a

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9434 ± 0.0015	0.8512 ± 0.0159	0.9043 ± 0.0018	0.9477 ± 0.0009	0.8914 ± 0.0021	0.9279 ± 0.0025	0.9871 ± 0.0012	0.9212 ± 0.0034	0.7113 ± 0.0035	0.9882 ± 0.0031	0.9912 ± 0.0012
	AP	0.9133 ± 0.0052	0.8220 ± 0.0201	0.8996 ± 0.0065	0.9351 ± 0.0042	0.9312 ± 0.0043	0.8913 ± 0.0074	0.9712 ± 0.0087	0.9031 ± 0.0012	0.6756 ± 0.0045	0.9514 ± 0.0048	0.9907 ± 0.0033
	F1	0.8463 ± 0.0074	0.7086 ± 0.0154	0.7958 ± 0.0046	0.8473 ± 0.0012	0.9282 ± 0.0017	0.8412 ± 0.0034	0.8731 ± 0.0034	0.8412 ± 0.0075	0.6712 ± 0.0031	0.9219 ± 0.0056	0.9405 ± 0.0014
	ACC	0.8884 ± 0.0099	0.7675 ± 0.0098	0.8212 ± 0.0064	0.8753 ± 0.0055	0.9391 ± 0.0014	0.8132 ± 0.0062	0.8992 ± 0.0014	0.8812 ± 0.0034	0.6513 ± 0.0034	0.9568 ± 0.0031	0.9553 ± 0.0012
ChCh-Mainer	AUROC	0.9451 ± 0.0009	0.7762 ± 0.0081	0.9015 ± 0.0064	0.9902 ± 0.0020	0.9540 ± 0.0012	0.9809 ± 0.0014	0.9912 ± 0.0035	0.9717 ± 0.0073	0.9218 ± 0.0032	0.9836 ± 0.0021	0.9964 ± 0.0021
	AP	0.9605 ± 0.0069	0.8351 ± 0.0071	0.9590 ± 0.0013	0.9874 ± 0.0015	0.9810 ± 0.0023	0.9897 ± 0.0006	0.9831 ± 0.0034	0.9881 ± 0.0012	0.9112 ± 0.0032	0.9931 ± 0.0012	0.9988 ± 0.0009
	F1	0.9023 ± 0.0122	0.7187 ± 0.0054	0.8741 ± 0.0046	0.9323 ± 0.0051	0.9712 ± 0.0062	0.9398 ± 0.0034	0.9271 ± 0.0022	0.9331 ± 0.0051	0.8432 ± 0.0014	0.9701 ± 0.0021	0.9911 ± 0.0021
	ACC	0.8653 ± 0.0058	0.7543 ± 0.0019	0.8061 ± 0.0074	0.9216 ± 0.0071	0.9534 ± 0.0032	0.9219 ± 0.0048	0.8912 ± 0.0064	0.9151 ± 0.0043	0.8465 ± 0.0012	0.9513 ± 0.0013	0.9850 ± 0.0022
DeepDDI	AUROC	0.9402 ± 0.0041	0.7412 ± 0.0085	0.8393 ± 0.0054	0.9062 ± 0.0043	0.8965 ± 0.0023	0.9429 ± 0.0025	0.9531 ± 0.0035	0.9056 ± 0.0019	0.7412 ± 0.0031	0.9152 ± 0.0022	0.9954 ± 0.0014
	AP	0.9514 ± 0.0065	0.8023 ± 0.0054	0.8566 ± 0.0066	0.9444 ± 0.0045	0.9471 ± 0.0044	0.9213 ± 0.0074	0.9411 ± 0.0030	0.9217 ± 0.0085	0.7213 ± 0.0041	0.8921 ± 0.0014	0.9937 ± 0.0022
	F1	0.9052 ± 0.0053	0.7745 ± 0.0056	0.8214 ± 0.0061	0.8639 ± 0.0036	0.9352 ± 0.0065	0.8712 ± 0.0034	0.8355 ± 0.0019	0.8951 ± 0.0064	0.6060 ± 0.0015	0.8828 ± 0.0008	0.9821 ± 0.0023
	ACC	0.8873 ± 0.0074	0.6444 ± 0.0056	0.7023 ± 0.0059	0.8021 ± 0.0058	0.9042 ± 0.0086	0.8632 ± 0.0062	0.8413 ± 0.0023	0.8552 ± 0.0012	0.6634 ± 0.0017	0.8694 ± 0.0043	0.9812 ± 0.0024

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9434 ± 0.0015	0.8512 ± 0.0159	0.9043 ± 0.0018	0.9477 ± 0.0009	0.8914 ± 0.0021	0.9279 ± 0.0025	0.9871 ± 0.0012	0.9212 ± 0.0034	0.7113 ± 0.0035	0.9882 ± 0.0031	0.9912 ± 0.0012
	AP	0.9133 ± 0.0052	0.8220 ± 0.0201	0.8996 ± 0.0065	0.9351 ± 0.0042	0.9312 ± 0.0043	0.8913 ± 0.0074	0.9712 ± 0.0087	0.9031 ± 0.0012	0.6756 ± 0.0045	0.9514 ± 0.0048	0.9907 ± 0.0033
	F1	0.8463 ± 0.0074	0.7086 ± 0.0154	0.7958 ± 0.0046	0.8473 ± 0.0012	0.9282 ± 0.0017	0.8412 ± 0.0034	0.8731 ± 0.0034	0.8412 ± 0.0075	0.6712 ± 0.0031	0.9219 ± 0.0056	0.9405 ± 0.0014
	ACC	0.8884 ± 0.0099	0.7675 ± 0.0098	0.8212 ± 0.0064	0.8753 ± 0.0055	0.9391 ± 0.0014	0.8132 ± 0.0062	0.8992 ± 0.0014	0.8812 ± 0.0034	0.6513 ± 0.0034	0.9568 ± 0.0031	0.9553 ± 0.0012
ChCh-Mainer	AUROC	0.9451 ± 0.0009	0.7762 ± 0.0081	0.9015 ± 0.0064	0.9902 ± 0.0020	0.9540 ± 0.0012	0.9809 ± 0.0014	0.9912 ± 0.0035	0.9717 ± 0.0073	0.9218 ± 0.0032	0.9836 ± 0.0021	0.9964 ± 0.0021
	AP	0.9605 ± 0.0069	0.8351 ± 0.0071	0.9590 ± 0.0013	0.9874 ± 0.0015	0.9810 ± 0.0023	0.9897 ± 0.0006	0.9831 ± 0.0034	0.9881 ± 0.0012	0.9112 ± 0.0032	0.9931 ± 0.0012	0.9988 ± 0.0009
	F1	0.9023 ± 0.0122	0.7187 ± 0.0054	0.8741 ± 0.0046	0.9323 ± 0.0051	0.9712 ± 0.0062	0.9398 ± 0.0034	0.9271 ± 0.0022	0.9331 ± 0.0051	0.8432 ± 0.0014	0.9701 ± 0.0021	0.9911 ± 0.0021
	ACC	0.8653 ± 0.0058	0.7543 ± 0.0019	0.8061 ± 0.0074	0.9216 ± 0.0071	0.9534 ± 0.0032	0.9219 ± 0.0048	0.8912 ± 0.0064	0.9151 ± 0.0043	0.8465 ± 0.0012	0.9513 ± 0.0013	0.9850 ± 0.0022
DeepDDI	AUROC	0.9402 ± 0.0041	0.7412 ± 0.0085	0.8393 ± 0.0054	0.9062 ± 0.0043	0.8965 ± 0.0023	0.9429 ± 0.0025	0.9531 ± 0.0035	0.9056 ± 0.0019	0.7412 ± 0.0031	0.9152 ± 0.0022	0.9954 ± 0.0014
	AP	0.9514 ± 0.0065	0.8023 ± 0.0054	0.8566 ± 0.0066	0.9444 ± 0.0045	0.9471 ± 0.0044	0.9213 ± 0.0074	0.9411 ± 0.0030	0.9217 ± 0.0085	0.7213 ± 0.0041	0.8921 ± 0.0014	0.9937 ± 0.0022
	F1	0.9052 ± 0.0053	0.7745 ± 0.0056	0.8214 ± 0.0061	0.8639 ± 0.0036	0.9352 ± 0.0065	0.8712 ± 0.0034	0.8355 ± 0.0019	0.8951 ± 0.0064	0.6060 ± 0.0015	0.8828 ± 0.0008	0.9821 ± 0.0023
	ACC	0.8873 ± 0.0074	0.6444 ± 0.0056	0.7023 ± 0.0059	0.8021 ± 0.0058	0.9042 ± 0.0086	0.8632 ± 0.0062	0.8413 ± 0.0023	0.8552 ± 0.0012	0.6634 ± 0.0017	0.8694 ± 0.0043	0.9812 ± 0.0024

The superior results are emphasized in bold, while the second-best results are underlined.

Table 2.

10.1101/2020.11.09.375626

Comparison of MSDAFL with other DDI prediction methods on training, validation and testing sets in a ratio of 8:1:1.^a

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9434 ± 0.0015	0.8512 ± 0.0159	0.9043 ± 0.0018	0.9477 ± 0.0009	0.8914 ± 0.0021	0.9279 ± 0.0025	0.9871 ± 0.0012	0.9212 ± 0.0034	0.7113 ± 0.0035	0.9882 ± 0.0031	0.9912 ± 0.0012
	AP	0.9133 ± 0.0052	0.8220 ± 0.0201	0.8996 ± 0.0065	0.9351 ± 0.0042	0.9312 ± 0.0043	0.8913 ± 0.0074	0.9712 ± 0.0087	0.9031 ± 0.0012	0.6756 ± 0.0045	0.9514 ± 0.0048	0.9907 ± 0.0033
	F1	0.8463 ± 0.0074	0.7086 ± 0.0154	0.7958 ± 0.0046	0.8473 ± 0.0012	0.9282 ± 0.0017	0.8412 ± 0.0034	0.8731 ± 0.0034	0.8412 ± 0.0075	0.6712 ± 0.0031	0.9219 ± 0.0056	0.9405 ± 0.0014
	ACC	0.8884 ± 0.0099	0.7675 ± 0.0098	0.8212 ± 0.0064	0.8753 ± 0.0055	0.9391 ± 0.0014	0.8132 ± 0.0062	0.8992 ± 0.0014	0.8812 ± 0.0034	0.6513 ± 0.0034	0.9568 ± 0.0031	0.9553 ± 0.0012
ChCh-Mainer	AUROC	0.9451 ± 0.0009	0.7762 ± 0.0081	0.9015 ± 0.0064	0.9902 ± 0.0020	0.9540 ± 0.0012	0.9809 ± 0.0014	0.9912 ± 0.0035	0.9717 ± 0.0073	0.9218 ± 0.0032	0.9836 ± 0.0021	0.9964 ± 0.0021
	AP	0.9605 ± 0.0069	0.8351 ± 0.0071	0.9590 ± 0.0013	0.9874 ± 0.0015	0.9810 ± 0.0023	0.9897 ± 0.0006	0.9831 ± 0.0034	0.9881 ± 0.0012	0.9112 ± 0.0032	0.9931 ± 0.0012	0.9988 ± 0.0009
	F1	0.9023 ± 0.0122	0.7187 ± 0.0054	0.8741 ± 0.0046	0.9323 ± 0.0051	0.9712 ± 0.0062	0.9398 ± 0.0034	0.9271 ± 0.0022	0.9331 ± 0.0051	0.8432 ± 0.0014	0.9701 ± 0.0021	0.9911 ± 0.0021
	ACC	0.8653 ± 0.0058	0.7543 ± 0.0019	0.8061 ± 0.0074	0.9216 ± 0.0071	0.9534 ± 0.0032	0.9219 ± 0.0048	0.8912 ± 0.0064	0.9151 ± 0.0043	0.8465 ± 0.0012	0.9513 ± 0.0013	0.9850 ± 0.0022
DeepDDI	AUROC	0.9402 ± 0.0041	0.7412 ± 0.0085	0.8393 ± 0.0054	0.9062 ± 0.0043	0.8965 ± 0.0023	0.9429 ± 0.0025	0.9531 ± 0.0035	0.9056 ± 0.0019	0.7412 ± 0.0031	0.9152 ± 0.0022	0.9954 ± 0.0014
	AP	0.9514 ± 0.0065	0.8023 ± 0.0054	0.8566 ± 0.0066	0.9444 ± 0.0045	0.9471 ± 0.0044	0.9213 ± 0.0074	0.9411 ± 0.0030	0.9217 ± 0.0085	0.7213 ± 0.0041	0.8921 ± 0.0014	0.9937 ± 0.0022
	F1	0.9052 ± 0.0053	0.7745 ± 0.0056	0.8214 ± 0.0061	0.8639 ± 0.0036	0.9352 ± 0.0065	0.8712 ± 0.0034	0.8355 ± 0.0019	0.8951 ± 0.0064	0.6060 ± 0.0015	0.8828 ± 0.0008	0.9821 ± 0.0023
	ACC	0.8873 ± 0.0074	0.6444 ± 0.0056	0.7023 ± 0.0059	0.8021 ± 0.0058	0.9042 ± 0.0086	0.8632 ± 0.0062	0.8413 ± 0.0023	0.8552 ± 0.0012	0.6634 ± 0.0017	0.8694 ± 0.0043	0.9812 ± 0.0024

Dataset	Metric	MR-GNN	GCN-BMP	EPGCN-DS	DeepDrug	MIRACLE	SSI-DDI	CSGNN	DeepDDS	DSN-DDI	HTCL-DDI	MSDAFL
ZhangDDI	AUROC	0.9434 ± 0.0015	0.8512 ± 0.0159	0.9043 ± 0.0018	0.9477 ± 0.0009	0.8914 ± 0.0021	0.9279 ± 0.0025	0.9871 ± 0.0012	0.9212 ± 0.0034	0.7113 ± 0.0035	0.9882 ± 0.0031	0.9912 ± 0.0012
	AP	0.9133 ± 0.0052	0.8220 ± 0.0201	0.8996 ± 0.0065	0.9351 ± 0.0042	0.9312 ± 0.0043	0.8913 ± 0.0074	0.9712 ± 0.0087	0.9031 ± 0.0012	0.6756 ± 0.0045	0.9514 ± 0.0048	0.9907 ± 0.0033
	F1	0.8463 ± 0.0074	0.7086 ± 0.0154	0.7958 ± 0.0046	0.8473 ± 0.0012	0.9282 ± 0.0017	0.8412 ± 0.0034	0.8731 ± 0.0034	0.8412 ± 0.0075	0.6712 ± 0.0031	0.9219 ± 0.0056	0.9405 ± 0.0014
	ACC	0.8884 ± 0.0099	0.7675 ± 0.0098	0.8212 ± 0.0064	0.8753 ± 0.0055	0.9391 ± 0.0014	0.8132 ± 0.0062	0.8992 ± 0.0014	0.8812 ± 0.0034	0.6513 ± 0.0034	0.9568 ± 0.0031	0.9553 ± 0.0012
ChCh-Mainer	AUROC	0.9451 ± 0.0009	0.7762 ± 0.0081	0.9015 ± 0.0064	0.9902 ± 0.0020	0.9540 ± 0.0012	0.9809 ± 0.0014	0.9912 ± 0.0035	0.9717 ± 0.0073	0.9218 ± 0.0032	0.9836 ± 0.0021	0.9964 ± 0.0021
	AP	0.9605 ± 0.0069	0.8351 ± 0.0071	0.9590 ± 0.0013	0.9874 ± 0.0015	0.9810 ± 0.0023	0.9897 ± 0.0006	0.9831 ± 0.0034	0.9881 ± 0.0012	0.9112 ± 0.0032	0.9931 ± 0.0012	0.9988 ± 0.0009
	F1	0.9023 ± 0.0122	0.7187 ± 0.0054	0.8741 ± 0.0046	0.9323 ± 0.0051	0.9712 ± 0.0062	0.9398 ± 0.0034	0.9271 ± 0.0022	0.9331 ± 0.0051	0.8432 ± 0.0014	0.9701 ± 0.0021	0.9911 ± 0.0021
	ACC	0.8653 ± 0.0058	0.7543 ± 0.0019	0.8061 ± 0.0074	0.9216 ± 0.0071	0.9534 ± 0.0032	0.9219 ± 0.0048	0.8912 ± 0.0064	0.9151 ± 0.0043	0.8465 ± 0.0012	0.9513 ± 0.0013	0.9850 ± 0.0022
DeepDDI	AUROC	0.9402 ± 0.0041	0.7412 ± 0.0085	0.8393 ± 0.0054	0.9062 ± 0.0043	0.8965 ± 0.0023	0.9429 ± 0.0025	0.9531 ± 0.0035	0.9056 ± 0.0019	0.7412 ± 0.0031	0.9152 ± 0.0022	0.9954 ± 0.0014
	AP	0.9514 ± 0.0065	0.8023 ± 0.0054	0.8566 ± 0.0066	0.9444 ± 0.0045	0.9471 ± 0.0044	0.9213 ± 0.0074	0.9411 ± 0.0030	0.9217 ± 0.0085	0.7213 ± 0.0041	0.8921 ± 0.0014	0.9937 ± 0.0022
	F1	0.9052 ± 0.0053	0.7745 ± 0.0056	0.8214 ± 0.0061	0.8639 ± 0.0036	0.9352 ± 0.0065	0.8712 ± 0.0034	0.8355 ± 0.0019	0.8951 ± 0.0064	0.6060 ± 0.0015	0.8828 ± 0.0008	0.9821 ± 0.0023
	ACC	0.8873 ± 0.0074	0.6444 ± 0.0056	0.7023 ± 0.0059	0.8021 ± 0.0058	0.9042 ± 0.0086	0.8632 ± 0.0062	0.8413 ± 0.0023	0.8552 ± 0.0012	0.6634 ± 0.0017	0.8694 ± 0.0043	0.9812 ± 0.0024

The superior results are emphasized in bold, while the second-best results are underlined.

3.4 Ablation experiment

The outstanding performance of MSDAFL stems from three carefully designed strategies: the cross-attention mechanism strategy between drug pairs, normalization of the interaction matrix, and self-attention mechanism strategy with cosine similarity. To ascertain the efficacy of each drug feature type, we conducted ablation experiments on the ZhangDDI dataset across these three configurations on training, validation and testing sets in a ratio of 6:2:2.The results are shown in the Supplementary Fig. S1, demonstrating the effectiveness of the module we proposed.

3.5 Parameter sensitivity

To investigate the influence of crucial parameters on prediction performance, we systematically vary these parameters and assessed their impact on the MSDAFL model’s efficacy using the ZhangDDI dataset on training, validation and testing sets in a ratio of 6:2:2. We analyze the batch size for model training, the parameter $λ$ ⁠, and the number of GIN layers. By holding other parameters constant, we explore how varying key parameter settings impacted the performance of MSDAFL. As illustrated in the Supplementary Fig. S2, we investigated how these parameters affect model performance. Specifically, we found that the model performs optimally when the batch size is set to 512, $λ$ to 0.75, and the number of GIN layers to 5.

3.6 Case study

To assess the practical utility of MSDAFL in real-world scenarios, we performed an analysis of clinical studies evaluating the prediction outcomes for four drug pairs using MSDAFL on the ZhangDDI testing set, as depicted in Supplementary Fig. S3. For the analysis of these four drugs, we can confirm the powerful performance of the MSDAFL model in predicting DDIs.

4 Discussion and conclusion

In this work, we introduce a molecular substructure-based dual attention feature learning framework for predicting DDIs. This framework integrates multiple attention mechanisms, including a self-attention encoder that extracts substructures from individual drugs and computes a cosine similarity matrix between the feature matrices of drug pairs. In the interactive attention encoding segment, we employ an interactive attention mechanism to investigate the strength of interactions between substructures of drug pairs, culminating in the regularization of the interactive feature matrix. Extensive experiments are conducted across three public datasets to evaluate the efficacy of our MSDAFL model and assess the contributions of its various modules. The findings decisively establish MSDAFL as a robust and promising tool for predicting DDIs, significantly contributing to medication safety and drug side effect research. Our study can be further advanced in three key domains: (i) by integrating heterogeneous biomedical information to augment representation learning, (ii) by expanding MSDAFL to more complex and practical application scenarios, and (iii) by supplementing with wet-lab experiments to further validate certain DDI prediction outcomes.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest

None declared.

Funding

This work was supported in part by the National Natural Science Foundation of China (62473149, 61962050, and 62072473), Natural Science Foundation of Hunan Province of China (2022JJ30428) and Excellent youth funding of Hunan Provincial Education Department (22B0372).

Data availability

Our code and data are available at: https://github.com/27167199/MSDAFL.

References

Cao

Fan

Zeng

Deepdrug: a general graph-based deep learning framework for drug relation prediction

bioRxiv

2020

, preprint: not peer reviewed.

10.48550/arXiv.1609.02907

Crossref

Chen

Liu

GCN-BMP: investigating graph representation learning for DDI prediction task

Methods

2020

;

179

–

Gilmer

Schoenholz

Riley

et al. Neural message passing for quantum chemistry. In: International Conference on Machine Learning. Sydney, NSW, Australia: PMLR.

2017

1263

–

1272

Glorot

Bengio

Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna Resort, Sardinia, Italy, JMLR Workshop and Conference Proceedings,

2010

249

–

256

Kingma

Adam: a method for stochastic optimization. In: Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, arXiv, arXiv:1412.6980,

2014

, preprint: not peer reviewed.

Kipf

Welling

Semi-supervised classification with graph convolutional networks. arXiv, arXiv:1609.02907,

2016

, preprint: not peer reviewed.

Miao

Wang

et al. ;

Hiplot Consortium

Hiplot: a comprehensive and easy-to-use web service for boosting publication-ready biomedical data visualization

Brief Bioinform

2022

;

bbac261

Huang

et al.

Large-scale exploration and analysis of drug combinations

Bioinformatics

2015

;

2007

–

Zhu

Shao

et al.

DSN-DDI: an accurate and generalized framework for drug–drug interaction prediction by dual-view representation learning

Brief Bioinform

2023

;

bbac597

Lin

Quan

Wang

Z-J

et al.

2020

KGNN: knowledge graph neural network for drug-drug interaction prediction

IJCAI

;

380

2739

–

2745

10.48550/arXiv.1804.10850

Xiao

Zhou

et al. Drug similarity integration through attentive multi-view graph auto-encoders. arXiv, arXiv:1804.10850,

2018

, preprint: not peer reviewed.

Mei

Zhang

A machine learning framework for predicting drug–drug interactions

Sci Rep

2021

;

17619

Nyamabo

Shi

J-Y.

SSI–DDI: substructure–substructure interactions for drug–drug interaction prediction

Brief Bioinform

2021

;

bbab133

Shao

Zhang

Traditional chinese medicine network pharmacology: theory, methodology and application

Chin J Nat Med

2013

;

110

–

PubMed

Sun

Wang

Elemento

et al.

Structure-based drug-drug interaction detection via expressive graph convolutional networks and deep sets (student abstract)

AAAI

2020

;

13927

–

. volume

Crossref

Sun

Sanderson

Zheng

Drug combination therapy increases successful drug repositioning

Drug Discov Today

2016

;

1189

–

Takeda

Hao

Cheng

et al.

Predicting drug–drug interactions through drug structural similarities and interaction networks incorporating pharmacokinetics and pharmacodynamics knowledge

J Cheminform

2017

;

Velickovic

Cucurull

Casanova

et al.

Graph attention networks

Stat

2017

;

1050

–

48550

10.48550/arXiv.1810.00826

Vilar

Uriarte

Santana

et al.

Detection of drug-drug interactions by modeling interaction profile fingerprints

PLoS One

2013

;

e58321

Vilar

Uriarte

Santana

et al.

Similarity-based modeling in large-scale prediction of drug-drug interactions

Nat Protoc

2014

;

2147

–

Wang

Liu

Shen

et al.

DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations

Brief Bioinform

2022

;

bbab390

Wang

Min

Chen

et al. Multi-view graph contrastive representation learning for drug-drug interaction prediction. In: Proceedings of the Web Conference 2021, Ljubljana, Slovenia.

2021

2921

–

2933

Leskovec

et al. How powerful are graph neural networks? arXiv, arXiv:1810.00826,

2018

, preprint: not peer reviewed.

Wang

Chen

et al. MR-GNN: Multi-resolution and dual graph neural network for predicting structured entity interactions. arXiv, arXiv:1905.09558,

2019

, preprint: not peer reviewed.

10.24963/ijcai.2019/551

Yan

Duan

Zhang

et al.

Predicting drug-drug interactions based on integrated similarity and semi-supervised learning

IEEE/ACM Trans Comput Biol Bioinform

2020

;

168

–

Crossref

Zhang

Wang

et al.

HTCL-DDI: a hierarchical triple-view contrastive learning framework for drug–drug interaction prediction

Brief Bioinform

2023

;

bbad324

Zhang

Chen

Liu

et al.

Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data

BMC Bioinformatics

2017

;

Zhao

Liu

Huang

et al.

CSGNN: contrastive self-supervised graph neural network for molecular interaction prediction

. In:

Proceedings of the 30th International Joint Conference on Artificial Intelligence

, IJCAI, Montreal, Canada.

2021

3756

–