Attention-based cross domain graph neural network for prediction of drug–drug interactions

Table 1

Multi-typed DDI prediction (1)

Methods	ACC	AUC	AUPR	F1	Precision	Recall	KAPPA
ACDGNN	96.71	98.81	98.35	94.11	95.64	93.74	92.23
SSI-DDI	93.42	97.79	97.41	93.42	94.35	91.78	86.85
MHCADDI	79.54	87.28	84.79	79.39	76.71	81.29	59.09
SumGNN	87.81	94.17	93.67	87.67	88.24	86.61	75.36
KGNN	85.16	90.86	89.57	77.62	83.58	77.34	72.12
DDIMDL	83.07	87.53	85.68	79.95	84.69	80.34	56.12
DeepDDI	78.06	84.72	82.07	77.71	81.26	78.41	56.12
LaGAT	91.85	96.64	95.36	91.87	89.68	89.38	81.45
GoGNN	86.78	92.38	91.16	86.58	85.42	80.69	73.56
SFLLN	82.79	86.48	83.69	79.86	83.47	79.66	55.27

So far, we have presented results for the transductive scenario, in which the drugs in the test set also appear in the training set (partition policy 1). To evaluate our method in the inductive setting, in which the test drugs are not included in the training set (also known as the cold-start problem), we split the dataset on the basis of drugs instead of DDIs. This setting is more practical than the transductive one. Here, we define an isolated drug as a drug that has no links in the DDI network but has known links to other entities, such as genes and diseases. We divide the dataset according to the following two strategies: (1) split all drugs into training/validation/test sets and ensure that, in each validation/test triplet, one drug is from the training set and the other is from the validation/test set (recorded as partition policy 2); (2) similarly, split the drugs into training/validation/test sets and ensure that neither drug in any validation/test triplet appears in the training set (recorded as partition policy 3). The comparison results are shown in Tables 2 and 3, respectively. Without prior knowledge about the isolated drugs, all models perform worse under policies 2 and 3 than under policy 1, and the drop is largest under policy 3. ACDGNN nevertheless obtains the highest score on every reported metric in both tables (for example, an AUC of 91.88 versus 81.57 for the second-best model, SSI-DDI, under policy 2), which again illustrates the effectiveness of our model.
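The two drug-based partition policies can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`split_drugs`, `train_triples`, `held_out_triples`), the split fractions and the random seed are assumptions introduced here for illustration only, and triples are assumed to be (drug i, drug j, DDI type) tuples.

```python
import random


def split_drugs(drugs, val_frac=0.1, test_frac=0.1, seed=0):
    """Assign every drug to exactly one of train/val/test (fractions are assumptions)."""
    drugs = sorted(drugs)
    random.Random(seed).shuffle(drugs)
    n_val = int(len(drugs) * val_frac)
    n_test = int(len(drugs) * test_frac)
    val = set(drugs[:n_val])
    test = set(drugs[n_val:n_val + n_test])
    train = set(drugs[n_val + n_test:])
    return train, val, test


def train_triples(triples, train):
    """Training triples: both drugs belong to the training drug set."""
    return [(i, j, r) for i, j, r in triples if i in train and j in train]


def held_out_triples(triples, train, held_out, policy):
    """Validation/test triples for partition policy 2 or 3.

    Policy 2: one drug is a training drug, the other a held-out drug.
    Policy 3: both drugs are held-out drugs (neither was seen in training).
    """
    selected = []
    for i, j, r in triples:
        n_new = (i in held_out) + (j in held_out)
        n_seen = (i in train) + (j in train)
        if policy == 2 and n_new == 1 and n_seen == 1:
            selected.append((i, j, r))
        elif policy == 3 and n_new == 2:
            selected.append((i, j, r))
    return selected
```

In this sketch, triples that mix validation and test drugs are simply dropped, which is one possible convention; the source does not state how such triples are handled.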

Table 2

Multi-typed DDI prediction (2)

Methods	ACC	AUC	AUPR	F1	KAPPA
ACDGNN	81.44	91.88	93.28	80.86	62.89
SSI-DDI	73.81	81.57	81.95	73.50	47.61
MHCADDI	71.80	78.89	77.25	71.73	43.61
DeepDDI	66.48	72.49	71.79	66.44	32.96
DDIMDL	67.16	72.87	72.36	67.58	34.82
SumGNN	67.70	81.51	81.81	65.75	35.40
LaGAT	71.89	80.98	81.86	69.56	40.82
GoGNN	61.27	67.04	65.19	62.35	29.28
SFLLN	63.49	69.83	68.74	65.85	31.38

Table 3

Multi-typed DDI prediction (3)

Methods	ACC	AUC	AUPR	F1	KAPPA
ACDGNN	67.29	70.94	69.65	67.00	34.57
SSI-DDI	65.30	69.08	68.26	63.85	30.61
MHCADDI	66.16	68.14	67.11	64.12	32.32
DeepDDI	59.26	63.20	63.21	58.50	18.54
DDIMDL	61.24	64.49	64.16	60.33	23.69
SumGNN	58.00	64.90	63.65	55.50	15.99
LaGAT	63.22	66.93	66.38	60.75	25.47
GoGNN	55.46	60.56	61.65	53.64	14.76
SFLLN	56.35	61.37	62.48	53.87	15.21

Parameter analysis

In this section, we analyze the impact of the key parameters of ACDGNN: the entity embedding dimension |$f$|, the number of information propagation layers |$l$| in the heterogeneous neighbor-domain information aggregation module, and the number of heads |$K$| in the multi-head attention mechanism.

First, we analyze the impact of |$f$| on the prediction performance of ACDGNN under the three data partition policies. In this experiment, we empirically set the hyper-parameters |$l$| and |$K$| both to 2 and treat |$f$| as the independent variable, with the performance metrics as the dependent variables. The results are shown in Figure 3, panels 1(a), 2(a) and 3(a). Under the three data partition policies, the model achieves its best performance when |$f$| is 64, 64 and 16, respectively. Beyond the optimal dimension, performance tends to decline as |$f$| increases. A possible reason is that introducing too many parameters leads to overfitting, which reduces the model’s generalization ability.

Figure 3

Parameter analysis of ACDGNN. Subplots in row (A) present the impact of the embedding dimension on model performance under the three data split policies. Subplots in rows (B) and (C) illustrate the effect of the number of information propagation layers and the number of attention heads on model performance, respectively.

Next, we analyze the impact of |$l$| on the prediction performance under the three data partition policies. Here, we fix |$f$| to its optimal value under each policy, that is, 64, 64 and 16, respectively. The results are shown in Figure 3, panels 1(b), 2(b) and 3(b). The optimal |$l$| is 2, 1 and 2, respectively, under the three policies. This indicates that, in heterogeneous networks, directly connected neighbors and skip-connection neighbors are helpful for DDI prediction [34], whereas information from higher-order |$(>2)$| neighbors may introduce additional noise and thus reduce the prediction performance of the model.

Finally, we analyze the effect of |$K$| under the three partition policies. Here, |$f$| and |$l$| are fixed to their optimal values: 64 and 2 under policy 1, 64 and 1 under policy 2, and 16 and 2 under policy 3. The experimental results are shown in Figure 3, panels 1(c), 2(c) and 3(c). The optimal |$K$| is 1, 2 and 2, respectively, under the three policies. Under policies 2 and 3, the drugs in the test set are unseen during training, so representation learning is less effective than under policy 1. Therefore, introducing too many attention heads |$(>2)$| may also lead to overfitting. This behavior is similar to that observed for the hyper-parameters |$f$| and |$l$|.
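The one-parameter-at-a-time procedure described above (tune |$f$| with |$l$| and |$K$| fixed to 2, then |$l$| with the best |$f$|, then |$K$| with the best |$f$| and |$l$|) can be sketched in Python as follows. This is an illustrative sketch only: `coordinate_search` and `evaluate` are hypothetical names, `evaluate` stands in for training ACDGNN and returning a validation score, and the candidate grids in the usage note are assumptions, since the source does not list the values that were tried.

```python
def coordinate_search(evaluate, grids, defaults, order=("f", "l", "K")):
    """Tune one hyper-parameter at a time, keeping the others fixed.

    evaluate: callable taking keyword arguments f, l, K and returning a score
              (higher is better); a stand-in for training and validating a model.
    grids:    dict mapping each parameter name to its candidate values.
    defaults: dict with the initial value of every parameter.
    """
    best = dict(defaults)
    for name in order:
        scores = {}
        for value in grids[name]:
            trial = dict(best)
            trial[name] = value
            scores[value] = evaluate(**trial)
        # Keep the best value found for this parameter before moving on.
        best[name] = max(scores, key=scores.get)
    return best
```

A call might look like `coordinate_search(evaluate, {"f": [16, 32, 64, 128], "l": [1, 2, 3], "K": [1, 2, 4]}, {"f": 32, "l": 2, "K": 2})`, where the grids and the starting value of `f` are hypothetical. This search is cheaper than a full grid search but can miss interactions between parameters.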

Ablation study

To study how the components of ACDGNN contribute to the final performance, we conduct the following ablation studies. First, we verify the effectiveness of the transformation module. We remove it and directly take the embedding of the entity itself as the input of the heterogeneous neighbor-domain information aggregation module at each layer; this variant is denoted ACDGNN w/o CDT (cross domain transformation). Second, we check the effectiveness of the feature-structure information aggregation module of Eq. 6. We remove it, so that the embedding representation used by this variant is composed of the feature information and structure information of drugs. Because the constraint loss (Eq. 12) depends on this module, it is not added to the final loss; that is, the final training loss of this variant is |$L_{base}$|. This variant is denoted ACDGNN w/o FSIA (feature structure information aggregation). In addition, to evaluate the contributions of drug-related biomedical entities to model performance, we remove gene nodes and target nodes from the network |$\mathcal{G}$|; the corresponding variants are denoted ACDGNN w/o Gene and ACDGNN w/o Target.

The comparison results are shown in Table 4. Under partition policies 1 and 2, using the transformation module and the feature-structure information aggregation module together effectively improves prediction performance: the full model is about 3 percentage points higher in ACC and F1 than the second-best variant (ACDGNN w/o FSIA), although its AUPR under policy 2 is marginally lower (93.28 versus 93.46). However, under partition policy 3, including the transformation module does not significantly improve generalization performance, and the full model is slightly worse than ACDGNN w/o CDT on some metrics (ACC, F1 and KAPPA). A possible reason is that the transformation module introduces more parameters when aggregating neighborhood information, resulting in overfitting. Moreover, removing gene nodes or target nodes leads to a significant performance drop, because without these entities the model cannot extract comprehensive drug interaction information and thus produces sub-optimal node representations.

Table 4

Ablation study results

Partition policy	Methods	ACC	AUC	AUPR	F1	KAPPA
1	ACDGNN	96.71	98.81	98.35	94.41	92.23
	ACDGNN w/o FSIA	93.79	94.14	90.99	91.37	82.58
	ACDGNN w/o CDT	88.74	92.37	95.41	88.61	79.49
	ACDGNN w/o Gene	92.58	93.73	93.81	89.57	81.63
	ACDGNN w/o Target	92.36	92.96	93.15	88.86	80.86
2	ACDGNN	81.44	91.88	93.28	80.86	62.89
	ACDGNN w/o FSIA	78.02	85.32	93.46	77.89	56.18
	ACDGNN w/o CDT	74.82	84.21	92.13	74.78	49.64
	ACDGNN w/o Gene	77.68	84.29	92.35	75.76	54.79
	ACDGNN w/o Target	77.24	83.97	91.86	75.13	54.28
3	ACDGNN	67.29	70.94	69.65	67.00	34.57
	ACDGNN w/o FSIA	65.92	64.59	59.17	65.77	31.84
	ACDGNN w/o CDT	69.00	68.60	60.45	68.18	38.00
	ACDGNN w/o Gene	64.93	63.75	58.64	64.61	30.49
	ACDGNN w/o Target	64.25	63.18	57.96	63.81	29.67

To summarize, the cross domain transformation and the feature-structure information aggregation module improve DDI prediction performance. On the one hand, ACDGNN captures the information of neighbors in different domains through appropriate domain transformation; on the other hand, by aggregating feature information and structure information with weights, it can distinguish their relative importance. In addition, the constraint loss forces the embedding learned by ACDGNN to be consistent with the drug interaction behavior, so a more representative embedding can be learned, which improves the final prediction performance. Finally, comprehensive use of the information in drug-related entities is of great benefit to DDI prediction.

Case study

We conduct case studies to investigate the usefulness of ACDGNN in practice. We use all the known DDI triples in our dataset to train the prediction model and then make predictions for the remaining drug pairs. We construct a list of (drug |$i$|, drug |$j$|, DDI type |$r$|) triples ranked by predicted probability score. A higher prediction score for a pair of drugs suggests a higher probability that they interact. We investigate the 20 highest-ranked predictions in the list. For these 20 drug pairs, we use DrugBank (https://go.drugbank.com/interax/multi_search) and the Drug Interactions Checker tool provided by Drugs.com (https://www.drugs.com/) to find supporting evidence and collect descriptions of their interactions.
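The ranking step of the case study can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the authors' code: `top_k_novel_triples` is a hypothetical name, `score` stands in for the trained model's predicted probability for a (drug i, drug j, DDI type r) triple, and drug pairs are treated as unordered.

```python
import heapq
from itertools import combinations


def top_k_novel_triples(drugs, ddi_types, known, score, k=20):
    """Rank candidate triples for drug pairs with no known DDI.

    known: iterable of known (drug_i, drug_j, ddi_type) triples used for training.
    score: callable (drug_i, drug_j, ddi_type) -> predicted probability
           (a stand-in for the trained prediction model).
    """
    # Pairs that already have any known interaction are excluded.
    known_pairs = {frozenset((i, j)) for i, j, _ in known}
    candidates = (
        (i, j, r)
        for i, j in combinations(sorted(drugs), 2)
        if frozenset((i, j)) not in known_pairs
        for r in ddi_types
    )
    # Return the k triples with the highest predicted score, best first.
    return heapq.nlargest(k, candidates, key=lambda t: score(*t))
```

Using `heapq.nlargest` avoids sorting the full candidate list, which matters because the number of candidate triples grows quadratically with the number of drugs.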

Fifteen of these 20 DDI events can be confirmed. Only the top five are shown in Table 5 because of page limitations; the complete results are listed in the Supplementary Material. As shown in Table 5, the interaction between Diazepam and Chromium is predicted to cause event #72, which means that Diazepam may decrease the excretion rate of Chromium, which could result in a higher serum level. Studies have shown that chromium functions as an active component of glucose tolerance factor (GTF). This factor facilitates binding of insulin to the cell and promotes the uptake of glucose [35]. Meanwhile, diazepam alone was found to inhibit insulin secretion [36], which supports the prediction of our model. The interaction between Imidafenacin and Butylscopolamine is predicted to cause event #49, which means that the risk or severity of adverse effects can be increased when Imidafenacin is combined with Butylscopolamine. It has been reported that Butylscopolamine binds to muscarinic M3 receptors in the gastrointestinal tract [37]. Similarly, Imidafenacin binds to and antagonizes muscarinic M1 and M3 receptors with high affinity [38]. These results indicate that ACDGNN is effective in predicting novel DDIs. The other five predicted DDIs remain to be confirmed by further experiments. In addition, we found that a particular drug may be closely related to a particular DDI event. For example, 4 of the top 20 predictions related to event #47 (decreased metabolism) involve Barnidipine, so this drug deserves more attention.

Table 5

The top 20 predicted DDIs (only the top five are shown; the complete results are in the Supplementary Material)

Drug A	Drug B	Evidence source	Description
Diazepam	Selenium	Drugbank tool	Diazepam may decrease the excretion rate of Selenium which could result in a higher serum level.
Diazepam	Chromium	Drugbank tool	Diazepam may decrease the excretion rate of Chromium which could result in a higher serum level.
Imidafenacin	Butylscopolamine	Drugbank tool	The risk or severity of adverse effects can be increased when Imidafenacin is combined with Butylscopolamine.
Buprenorphine	Palonosetron	Drugbank tool	Palonosetron may increase the central nervous system depressant (CNS depressant) activities of Buprenorphine.
Methscopolamine	Toloxatone	N.A.	N.A.

N.A.: No evidence for the given DDI is available to date.

CONCLUSION

In this paper, we propose a new method, ACDGNN (attention-based cross domain graph neural network). ACDGNN operates on heterogeneous networks and learns the embedding representation of drug entities by aggregating neighborhood information for multi-typed DDI prediction. ACDGNN consists of five modules: the input module takes a heterogeneous network as input, which contains many types of nodes and edges; the transformation module maps the information from neighbors to a homogeneous low-dimensional embedding space; the heterogeneous neighbor-domain information aggregation module exploits the multi-head attention mechanism to aggregate the neighborhood information; the feature-structure information aggregation module combines the entity’s attributes and the network structure information by weighted aggregation to obtain the final embedding representation of the entity; and the final decomposition-based predictor uses the embeddings of drug pairs and interaction types to make predictions. The proposed approach is compared with several state-of-the-art baselines on real-life datasets. The experimental results show that the proposed model achieves competitive prediction performance. In addition, we performed an ablation analysis and a case study to verify the effectiveness of the method.

Key Points

An attention-based cross domain graph neural network model for DDI prediction is proposed in this paper.
ACDGNN considers other types of drug-related entities and propagates information through cross domain operations to learn informative representations of drugs.
ACDGNN can eliminate the heterogeneity between different types of entities and effectively predict DDIs in transductive and inductive scenarios.

FUNDING

This work was supported by the National Natural Science Foundation of China (Grant No. 61872297), the Shaanxi Provincial Key Research & Development Program, China (Grant No. 2023-YBSF-114), the CAAI-Huawei MindSpore Open Fund (Grant No. CAAIXSJLJJ-2022-035A) and the Fundamental Research Funds for the Central Universities (Grant No. SY20210003). We thank the Center for High Performance Computation, Northwestern Polytechnical University, for providing computation resources.

Author Biographies

Hui Yu received his master’s and PhD degrees from Northwestern Polytechnical University, Xi’an, China, where he works currently as an associate professor. He has published >50 papers in peer reviewed journals and conferences. His research interests include bioinformatics, machine learning and data mining.

KangKang Li is currently pursuing his Master’s degree in the School of Computer Science at Northwestern Polytechnical University, Xi’an, China. He received his bachelor’s degree in software engineering from Chongqing University, Chongqing, China. He is interested in graph representation learning and applications.

WenMin Dong received his master’s degree from Northwestern Polytechnical University, Xi’an, China. He received his bachelor’s degree in Computer Science and Technology from Anhui Jianzhu University, Hefei, China. He is interested in machine learning and data mining.

Shuanghong Song received her PhD degree from Northwestern Polytechnical University, Xi’an, China. She currently works as an associate professor at Shaanxi Normal University. She has published about 30 papers in peer-reviewed journals and conferences. Her research interests include the pharmacology of traditional Chinese medicine, phytochemistry and osteoporosis.

Chen Gao received his master’s degree from Northwestern Polytechnical University, Xi’an, China, in 2014. He currently works as a senior engineer at Xi’an High-tech Research Institute. He has published >30 papers in peer-reviewed journals and conferences. His research interests include system simulation, artificial intelligence and data mining.

Jian-Yu Shi received his master’s and PhD degrees from Northwestern Polytechnical University, Xi’an, China, where he is currently working as a professor. He was selected as a Postdoctoral Fellow in the first round of the Hong Kong Scholars Program in 2011 and worked at the University of Hong Kong during 2012–2014. He has published 40+ peer-reviewed papers and has >10 years of research experience in AI in drug discovery. His research interests include matrix factorization, graph neural networks, drug–drug interaction, drug combination and precision medicine.

REFERENCES

1. Takeda T, Ming H, Cheng T, et al. Predicting drug–drug interactions through drug structural similarities and interaction networks incorporating pharmacokinetics and pharmacodynamics knowledge. J Chem 2017;9(1):16.

2. Huang D, Jiang Z, Zou L, et al. Drug-drug interaction extraction from biomedical literature using support vector machine and long short term memory networks. Inform Sci 2017;415:100–9.

3. Qiu Y, Zhang Y, Deng Y, et al. A comprehensive review of computational methods for drug-drug interaction detection. IEEE/ACM Trans Comput Biol Bioinform 2022;19(4):1968–85.

4. Zhao C, Liu S, Huang F, et al. CSGNN: Contrastive self-supervised graph neural network for molecular interaction prediction. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI, Virtual Event / Montreal, Canada, 19–27 August. 2021, p. 3756–63.

5. Ryu JY, Kim HU, Sang YL. Deep learning improves prediction of drug–drug and drug–food interactions. Proc Natl Acad Sci U S A 2018;115(18):E4304–11.

6. Fokoue A, Sadoghi M, Hassanzadeh O, et al. Predicting drug-drug interactions through large-scale similarity-based link prediction. In: The Semantic Web. Latest Advances and New Domains - 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 – June 2, 2016, Proceedings, volume 9678 of Lecture Notes in Computer Science. Springer, 2016, p. 774–89.

7. Rohani N, Eslahchi C. Drug-drug interaction predicting by neural network using integrated similarity. Sci Rep 2019;9(1):1–11.

8. Ying S, Kaiqi Y, Min Y, et al. KMR: knowledge-oriented medicine representation learning for drug-drug interaction and similarity computation. J Chem 2020;11(1):22:1–22:16.

9. Yu H, Mao KT, Shi JY, et al. Predicting and understanding comprehensive drug-drug interactions via semi-nonnegative matrix factorization. BMC Syst Biol 2018;12(Suppl 1):14.

10. Shi JY, Mao KT, Yu H, et al. Detecting drug communities and predicting comprehensive drug-drug interactions via balance regularized semi-nonnegative matrix factorization. J Chem 2019;11(1):1–16.

11. Ding C, Li T, Jordan MI. Convex and semi-nonnegative matrix factorizations. IEEE Trans Pattern Anal Mach Intell 2010;32(1):45–55.

12. Wang H, Lian D, Zhang Y, et al. Gognn: Graph of graphs neural network for predicting structured entity interactions. In: C Bessiere, editor, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI. 2020, p. 1317–23.

13. Zhang W, Jing K, Huang F, et al. Sflln: a sparse feature learning ensemble method with linear neighborhood regularization for predicting drug–drug interactions. Inform Sci 2019;497:189–201.

14. Chen Y, Ma T, Yang X, et al. MUFFIN: multi-scale feature fusion for drug–drug interaction prediction. Bioinformatics 2021;37(17):2651–8.

15. He C, Liu Y, Li H, et al. Multi-type feature fusion based on graph neural network for drug-drug interaction prediction. BMC Bioinformatics 2022;23(1):224.

16. Zitnik M, Agrawal M, Leskovec J. Modeling polypharmacy side effects with graph convolutional networks. Bioinformatics 2018;34(13):457–66.

17. Yu Y, Huang K, Zhang C, et al. Sumgnn: multi-typed drug interaction prediction via efficient knowledge graph summarization. Bioinformatics 2021;37(18):2988–95.

18. Fu H, Huang F, Liu X, et al. MVGCN: data integration through multi-view graph convolutional network for predicting links in biomedical bipartite networks. Bioinformatics 2021;38(2):426–34.

19. Ren ZH, You ZH, Yu CQ, et al. A biomedical knowledge graph-based method for drug–drug interactions prediction through combining local and global features with deep neural networks. Brief Bioinform 2022;23(5):Bbac363.

20. Su R, Yang H, Wei L, et al. A multi-label learning model for predicting drug-induced pathology in multi-organ based on toxicogenomics data. PLoS Comput Biol 2022;18(9):1–28.

21. Zhou SF. Drugs behave as substrates, inhibitors and inducers of human cytochrome p450 3a4. Curr Drug Metab 2008;9(4).

22. Hong H, Guo H, Lin Y, et al. An attention-based graph neural network for heterogeneous structural learning. In: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI. 2020, p. 4132–9.

23. Busbridge D, Sherburn D, Cavallo P, et al. Relational graph attention networks. CoRR 2019;abs/1904.05811.

24. Velickovic P, Cucurull G, Casanova A, et al. Graph attention networks. ICLR 2018;1050:4.

25. Zhou J, Cui G, Hu S, et al. Graph neural networks: a review of methods and applications. AI Open 2020;1:57–81.

26. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. In: Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems, December 4–9, 2017, Long Beach, CA, USA. 2017, p. 5998–6008.

27. Ma T, Xiao C, Zhou J, et al. Drug similarity integration through attentive multi-view graph auto-encoders. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI, July 13–19, 2018, Stockholm, Sweden. 2018, p. 3477–83.

28. Scott HD, Antoine L, Christine H, et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. Elife 2017;6:e26726.

29. Nyamabo AK, Yu H, Shi JY. SSI-DDI: substructure-substructure interactions for drug-drug interaction prediction. Brief Bioinform 2021;22(6):Bbab133.

30. Deac A, Huang Y, Velickovic P, et al. Drug-drug adverse effect prediction with graph co-attention. CoRR 2019;abs/1905.00534.

31. Lin X, Quan Z, Wang ZJ, et al. Kgnn: Knowledge graph neural network for drug-drug interaction prediction. In: C Bessiere, editor, Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI. International Joint Conferences on Artificial Intelligence, 2020, p. 2739–45.

32. Deng Y, Xu X, Qiu Y, et al. A multimodal deep learning framework for predicting drug–drug interaction events. Bioinformatics 2020;36(15):4316–22.

33. Hong Y, Luo P, Jin S, et al. LaGAT: link-aware graph attention network for drug–drug interaction prediction. Bioinformatics 2022;38(24):5406–12.

34. Huang K, Xiao C, Glass LM, et al. Skipgnn: predicting molecular interactions with skip-graph networks. Sci Rep 2020;10(1):1–16.

35. Williams SR. Basic nutrition and diet therapy (17 ed.) St Louis, Toronto, Santaclara: Times Mirror/Mosby, College, 1988, pp. 78.