MNMDCDA: prediction of circRNA–disease associations by learning mixed neighborhood information from multiple distances

Li, Yang; Hu, Xue-Gang; Wang, Lei; Li, Pei-Pei; You, Zhu-Hong

doi:10.1093/bib/bbac479

Abstract

Emerging evidence suggests that circular RNA (circRNA) is an important regulator of a variety of pathological processes and serves as a promising biomarker for many complex human diseases. Nevertheless, there are relatively few known circRNA–disease associations, and uncovering new circRNA–disease associations by wet-lab methods is time consuming and costly. Considering the limitations of existing computational methods, we propose a novel approach named MNMDCDA, which combines high-order graph convolutional networks (high-order GCNs) and deep neural networks to infer associations between circRNAs and diseases. Firstly, we computed different biological attribute information of circRNA and disease separately and used them to construct multiple multi-source similarity networks. Then, we used the high-order GCN algorithm to learn feature embedding representations with high-order mixed neighborhood information of circRNA and disease from the constructed multi-source similarity networks, respectively. Finally, the deep neural network classifier was implemented to predict associations of circRNAs with diseases. The MNMDCDA model obtained AUC scores of 95.16%, 94.53%, 89.80% and 91.83% on four benchmark datasets, i.e., CircR2Disease, CircAtlas v2.0, Circ2Disease and CircRNADisease, respectively, using the 5-fold cross-validation approach. Furthermore, 25 of the top 30 circRNA–disease pairs with the best scores of MNMDCDA in the case study were validated by recent literature. Numerous experimental results indicate that MNMDCDA can be used as an effective computational tool to predict circRNA–disease associations and can provide the most promising candidates for biological experiments.

circRNA, circRNA–disease association, multi-source similarity networks, high-order mixed neighborhood information, deep neural network

Issue Section:

Problem solving protocol

Introduction

Circular RNA (circRNA), a novel member of the noncoding cancer genome, is a single-stranded endogenous non-coding RNA (ncRNA) molecule with a continuous loop structure [1, 2], which is generated by a back splicing event between the downstream 5′ end splice site and the upstream 3′ end splice site [3, 4]. CircRNA was first discovered by Sanger and colleagues in the 1970s, and they observed the presence of circRNA in Sendai viruses and plant infection viruses by using electron microscopy [5, 6]. Subsequently, in the 1990s, endogenous circRNAs were first identified in human cells and large and abundant circRNAs were also found in the Sry gene of mouse [7, 8]. Despite the early discovery of circRNAs, they have long been ignored as a by-product of ‘shear noise’ or abnormal shear and have not attracted much attention from scholars. It was not until 2013 when two research papers on circRNAs were published in Nature that the mystery of circRNAs was unveiled and people began to really understand them [3, 9]. These studies demonstrate that circRNA is not a ‘splicing by-product’ of messenger RNA, but a class of RNA molecules that play important roles in cells, and it has significant biological functions. Therefore, the development of computational biology methods and deep sequencing technologies has made circRNA a frontier of research in the RNA field, which is crucial to reveal new functions and potential roles of circRNAs [10, 11].

Recently, computational approaches have emerged as an effective strategy to infer circRNA–disease associations to overcome the inherent shortcomings of wet-lab approaches [12]. Moreover, prioritizing the most promising circRNAs for candidate diseases by computational methods will also help to discover their molecular behavior in the identification of viral circRNAs as well as in carcinogenesis [13]. For instance, Xiao et al. [14] proposed a computational model called iCDA-CMG (identifying circRNA-disease associations-collective matrix completion with graph learning) based on collective matrix completion and graph learning algorithm to identify circRNA–disease associations. Zheng et al. [15] developed an approach with chaos game representation and support vector machine (SVM) to infer unobserved associations between circRNAs and diseases. Yang et al. [16] employed accelerated attribute network embedding and stacked auto-encoder algorithms to obtain feature representations of circRNA and disease and then used XGBoost classifier to obtain prediction results. Lu et al. [17] designed a CDASOR model, which adopted a convolutional neural network coupled with a bidirectional long short-term memory network to discover the underlying circRNAs with target diseases.

Although the above models have their own advantages and have achieved encouraging results. However, it is worth noting that they still have some problems: (1) Existing models are based on incompletely correlated biological information, with redundancy and noise between data, and they do not sufficiently fuse circRNA similarity and disease similarity. (2) The current circRNA–disease association data validated by wet-lab experiments are limited. The constructed circRNA–disease association networks are relatively sparse, with many false positive and false negative associations among their descriptors, and thus, the prediction models cannot be fully trained. (3) Most of the existing models were proposed based on one dataset only, and the generalization performance of the prediction models was not verified on other circRNA–disease association datasets.

To overcome these problems, we present a novel computational model called MNMDCDA that fuses mixed neighborhood information of circRNAs and diseases from multi-source similarity networks, which combines a high-order graph convolutional network (high-order GCN) and a deep neural network (DNN) to predict circRNA–disease associations. This model can use high-order GCN to overcome the problem that the original GCN cannot obtain high-order neighborhood information of each node from its neighbors at different distances in the network. It can learn the high-order mixed neighborhood embedding representations of circRNAs and diseases in a specific way.

Specifically, in the first step, we calculated the similarities for circRNA pairs and disease pairs and constructed 12 multi-source similarity networks by integrating various biological data information from four gold standard datasets. In the second step, based on each multi-source similarity network, we extracted the low-dimensional feature representations of circRNAs and diseases using the high-order GCN algorithm to learn the higher-order mixed neighborhood information of each node. In the third step, we introduced a DNN as a binary classifier to accurately identify the potential associations between circRNAs and diseases. The framework of MNMDCDA is shown in Figure 1.

Figure 1

The framework of MNMDCDA model for predicting the potential circRNA–disease associations.

Open in new tab Download slide

In brief, the main contributions of the MNMDCDA model are as follows:

(i) We comprehensively utilize more attribute information of circRNAs and diseases and construct 12 multi-source similarity networks. It fuses these attributes from different perspectives to better describe the biological information of circRNAs and diseases.
(ii) We obtained higher-order neighborhood embedding representations of these attribute information from the network using the higher-order GCN algorithm and extracted advanced features. Thus, the hidden information contained in these networks is mined as much as possible to fully train the MNMDCDA model.
(iii) The MNMDCDA model was tested on three other benchmark datasets with the same experiments to further verify the generalization performance of the model. Furthermore, 25 of the top 30 disease-related circRNAs predicted by the MNMDCDA model in case studies have been validated by the latest published literature.

Materials

Gold standard dataset

To construct a high-reliability data source to evaluate the effectiveness of the MNMDCDA model, we employed currently available experimentally validated CircR2Disease dataset [18] as the gold standard dataset to assess the performance of the proposed model, which can be represented as follows:

D = D^{+} \cup D^{-}

(1)

where D⁺ and D⁻ are the positive and negative sample sets, respectively.

\cup

represents the union set between the elements. Take the CircAtlas v2.0 dataset for an example, there are totally 846 experimentally validated circRNA–disease association pairs belonging to 776 circRNAs and 117 diseases in the original bipartite graph, which were used as positive samples. However, there are

776 \times 117 = 90 792

connections in the corresponding bipartite graph, of which there are

776 \times 117 - 846 = 89 946

un-experimentally validated circRNA–disease associations, and all these possible unknown associated circRNA–disease pairs can be used to construct negative samples. To avoid unbalanced datasets that could lead to biased experimental results, we randomly selected the same number of data from other unknown circRNA–disease associations as negative samples to construct a balanced dataset. Although this approach may include the unconfirmed correlated circRNA–disease pairs as negative samples, the selected negative samples only account for

846 \div (776 \times 117) \approx 0.93 %

of all circRNA–disease associations, such a small bias is negligible from the machine learning and probability perspectives. Thus, we obtained 1692 samples from the CircAtlas v2.0 dataset, half of which were negative and positive. In this experiment, we first obtained 725 circRNA–disease associations consisting of 676 circRNAs and 100 diseases from the CircR2Disease dataset. Second, we constructed the adjacency matrix R2DM of the gold standard dataset. When circRNA c(i) is not associated with disease d(j), the element R2DM(i,j) of R2DM is assigned to 0; otherwise, it is assigned to 1. Ultimately, we can construct the circRNA–disease association network CD_N by the obtained adjacency matrix R2DM.

Disease feature construction

Gaussian interaction profile kernel-based disease similarity

Since similar circRNAs and diseases usually exhibit similar interaction patterns, the hypothesis for this similarity is that similar diseases are more likely to be associated with functionally similar circRNAs [19]. Therefore, we used Gaussian interaction profile kernel (GIPK) similarity to construct the similarity model of diseases based on the known circRNA–disease association adjacency matrix. In this experiment, we defined the binary vector V(d(i)) to represent the interaction profiles of disease d(i). When the disease d(i) is not associated with a particular circRNA, the corresponding position of the circRNA in the binary vector V(d(i)) is set to 0; otherwise, it is set to 1. Thus, the GIPK similarity

D_{GIP} (d (i), d (j))

of disease d(i) and disease d(j) can be measured based on the following equations:

D_{GIP} (d (i), d (j)) = \exp (- θ_{d} {‖ V (d (i)) - V (d (j)) ‖}^{2})

(2)

θ_{d} = \frac{1}{m} \sum_{i = 1}^{m} {‖ V (d (i)) ‖}^{2}

(3)

where

θ_{d}

is the regularization parameter controlling the kernel bandwidth, and m is the number of diseases in the circRNA–disease association adjacency matrix R2DM.

Medical subject heading-based disease semantic similarity

In this study, we used the Medical Subject Headings (MeSH) database [20, 21] to construct semantic similarity of diseases. MeSH is an authoritative, extensible biomedical subject heading that provides a rigorous classification of all diseases, which helps to calculate the semantic similarity of diseases. MeSH is available at https://www.nlm.nih.gov/. According to previous studies, we can use the semantic information of the MeSH database to construct a directed acyclic graph (DAG) to reflect the relationship among various diseases well. In the directed DAG, the nodes represent diseases and the directed edges indicate the relationships between diseases.

For a disease d, its DAG can be denoted as

{DAG}_{d} = (d, N_{d}, E_{d}),

where N_d indicates the set of diseases associated with d, including the node set of the disease d itself and its ancestor nodes, and E_d is the set of links in the subgraph, indicating the relationship between these diseases. Assuming that there is a disease s in the DAG_d, the semantic contribution value D_d(s) of disease s to disease d in this DAG can be calculated by the following equation:

{\begin{cases} D_{d} (s) = 1 & i f s = d \\ D_{d} (s) = max {μ \cdot D_{d} (s^{'}) | s^{'} \in c h i l d r e n o f s} & i f s \neq d \end{cases}

(4)

where

μ = 0.5

denotes the semantic contribution decay parameter of edges linking disease s and its child disease

s^{'}

in E_d [5]. Eventually, we can obtain the semantic value DV(d) of disease d as follows, by accumulating the semantic contribution values of all child nodes related to disease d in the disease set N_d.

D V (d) = \sum_{s \in N_{d}} D_{d} (s)

(5)

Given two diseases d(i) and d(j), we can calculate their semantic similarity DS(d(i), d(j)) by combining disease terms and MeSH-based hierarchical structure information using the following formula.

D S (d (i), d (j)) = \frac{\sum_{s \in N_{d (i)} \cap N_{d (j)}} (D_{d (i)} (s) + D_{d (j)} (s))}{D V (d (i)) + D V (d (j))}

(6)

Disease Ontology-based disease semantic similarity

The Disease Ontology (DO) [22] can be organized as a DAG so that the semantic similarities among diseases can be computed based on their corresponding DO terms. The DO term for each disease is retrieved from http://disease-ontology.org/. Then, we measure the semantic similarities between two diseases following Wang’s method described, and the detailed calculation steps are described in the literature [23]. To distinguish, we use DS_DO(d(i),d(j)) to represent the DO-based semantic similarity between two diseases d(i) and d(j).

Cosine similarity of disease

The cosine similarity is usually employed to express the distinction or similarity among finite sample sets [22]. Thus, we use cosine similarity to measure the similarity between diseases. Specifically, for diseases, we construct the cosine similarity model

D_{Cos} (d (i), d (j))

to represent the similarity between disease d(i) and disease d(j), based on their associated circRNA information, which is calculated as follows:

D_{Cos} (d (i), d (j)) = \frac{V (d (i)) \cdot V (d (j))}{‖ V (d (i)) ‖ \times ‖ V (d (j)) ‖}

(7)

Here, V(d(i)) and V(d(j)) denote the i-th and j-th columns of the adjacency matrix R2DM, respectively. ||V(d(i))|| and ||V(d(j))|| indicate the Euclidean norm of the vectors V(d(i)) and V(d(j)), respectively.

CircRNA feature construction

GIPK-based circRNA similarity

Similar to the disease GIPK similarity, we can give the binary vector V(c(i)) to represent the interaction profile of circRNA c(i) from the association between circRNA c(i) and diseases in the adjacency matrix R2DM. The GIPK similarity

C_{GIP} (c (i), c (j))

of circRNA c(i) and circRNA c(j) can be measured based on the following equations:

C_{GIP} (c (i), c (j)) = \exp (- θ_{c} {‖ V (c (i)) - V (c (j)) ‖}^{2})

(8)

θ_{c} = \frac{1}{n} \sum_{i = 1}^{n} {‖ V (c (i)) ‖}^{2}

(9)

where

θ_{c}

is the regularization parameter controlling the kernel bandwidth, and n is the number of circRNAs in the matrix R2DM.

CircRNA functional similarity

According to the hypothesis that circRNAs sharing semantically similar disease groups are more likely to be functionally similar as well [24], we can measure the functional similarity between two circRNAs. In particular, given two different diseases d(i) and d(j), and meanwhile, given two disease groups D(i) and D(j), which denote the disease groups associated with circRNA c(i) and circRNA c(j), respectively. Assuming that

C_{FS} (c (i), c (j))

is the functional similarity matrix of two circRNAs, the functional similarity between circRNA c(i) and circRNA c(j) can be calculated by the following equation:

C_{FS} (c (i), c (j)) = \frac{\sum_{1 \leq i \leq | D (i) |} S (d (i), D (j)) + \sum_{1 \leq j \leq | D (j) |} S (d (j), D (i))}{| D (i) | + | D (j) |}

(10)

S (d (i), D (j)) = {max}_{1 \leq k \leq | D (j) |} (D S (d (i), d (k)))

(11)

Cosine similarity of circRNA

Likewise, for circRNAs, we construct the cosine similarity model

C_{Cos} (c (i), c (j))

to represent the similarity between circRNA c(i) and circRNA c(j), based on their associated disease information, which is calculated as follows:

C_{Cos} (c (i), c (j)) = \frac{V (c (i)) \cdot V (c (j))}{‖ V (c (i)) ‖ \times ‖ V (c (j)) ‖}

(12)

Here, V(c(i)) and V(c(j)) denote the i-th row and j-th row of the adjacency matrix R2DM, respectively.

Multi-similarity matrix fusion

To fully utilize the information from different sources, we adopted a multi-similarity matrix fusion method to fuse circRNA similarity information and disease similarity information to realize feature complementation. The advantage of the fused information is that it not only reduces the potential shortcomings caused by single features but also absorbs the characteristics of different data sources.

For circRNA, we can construct the fused circRNA similarity information C_Fus by the following strategy. Specifically, if there is the functional similarity between circRNA c(i) and circRNA c(j), then we use the circRNA functional similarity matrix to construct the fusion similarity descriptor C_Fus(c(i),c(j)); otherwise, we use circRNA GIPK similarity to represent the similarity between circRNA c(i) and c(j). This construction strategy of fusion similarity for circRNAs can be expressed as follows:

\begin{aligned} C_{Fus} (c (i), c (j)) \\ = {\begin{cases} C_{FS} (c (i), c (j)) & if c (i) and c (j) has functional similarity \\ C_{GIP} (c (i), c (j)) & otherwise \end{cases} \end{aligned}

(13)

Similarly, for diseases, we can construct the fused disease similarity information D_Fus by the following strategy. If there is the semantic similarity between disease d(i) and disease d(j), the fused disease similarity descriptor D_Fus(d(i),d(j)) is constructed by employing the disease semantic similarity matrix; otherwise, the disease GIPK similarity is constructed to represent the similarity between diseases d(i) and d(j). This construction strategy of fused disease similarity can be described by the following equation:

\begin{aligned} D_{Fus} (d (i), d (j)) \\ = {\begin{cases} D S (d (i), d (j)) & if d (i) and d (j) has semantic similarity \\ D_{GIP} (d (i), d (j)) & otherwise \end{cases} \end{aligned}

(14)

Finally, the circRNA fusional similarity network corresponding to the matrix C_Fus(c(i),c(j)) is C_N, while the disease fusional similarity network corresponding to the matrix D_Fus(d(i),d(j)) is D_N.

Feature embedding of high-order GCNs

After obtaining the fusion feature descriptors of circRNA and disease, we use the high-order GCN [25] to extract the low-dimensional feature embedding representations of circRNA and disease from the multi-source similarity network, respectively. To clearly understand the high-order GCN, we first introduce the original GCN proposed by Kipf and Welling [26], which can be elegantly summarized by the following expression:

H^{(l + 1)} = σ (\hat{A} H^{(l)} W^{(l)})

(15)

where

\hat{A} = {\tilde{D}}^{- \frac{1}{2}} (A + I) {\tilde{D}}^{- \frac{1}{2}}

is the symmetrically normalized graph adjacency matrix of A with self-connections. Here, I is the identity matrix with the same size as A, and

\tilde{D}

is a diagonal matrix, the degree matrix of (A + I). H^(l) and H^(l + 1) are the input and output activation matrices, which represent the row-wise embedding of the graph vertices in the l th and l + 1 th layers, respectively. W^(l) is a trainable weight matrix of the l th layer, and

σ

is a nonlinear activation function. Thus, a GCN model with L layers can be expressed as follows:

H^{(l)} = {\begin{cases} X & if l = 0 \\ σ (\hat{A} H^{(l - 1)} W^{(l - 1)}) & if l \in [1, \dots, L] \end{cases}

(16)

However, the original GCN is susceptible to the over-smoothing problem because it focuses only on the first-order neighborhood information of each node, which limits its ability to capture remote dependencies among nodes from distant but informative nodes. Currently, it has been shown that better node feature representations can be learned by fusing mixed neighborhood information, which usually helps to improve prediction abilities for downstream tasks including link prediction and node classification [27]. High-order GCN mainly considers the neighborhood information of circRNAs or diseases at different distances, thus capturing the high-order features of biological networks and learning the linear mixture of features in multi-distance neighborhoods. The high-order GCN-based algorithm can be defined as follows:

H^{(l + 1)} = ∥_{j \in P} σ ({({\tilde{D}}^{- \frac{1}{2}} (A + I) {\tilde{D}}^{- \frac{1}{2}})}^{j}) H^{(l)} W_{j}^{(l)}

(17)

where the hyperparameter P is a set of integers, the adjacency power of

{\tilde{D}}^{- \frac{1}{2}} (A + I) {\tilde{D}}^{- \frac{1}{2}}

⁠,

P = {0, 1, \dots, p},

where p is the maximum order of the neighborhood considered by each high-order GCN layer for information propagation. Here, σ is the Rectified Linear Unit (ReLU) activation function.

‖

denotes the column-wise concatenation of neighborhood information of different orders from circRNAs or diseases embedding representations.

During the high-order GCN training, we use binary cross-entropy loss to optimize the model parameters:

L (c_{i}, d_{j}) = - A_{i j} \ln p_{i j} - (1 - A_{i j}) \ln (1 - p_{i j})

(18)

where (c_i,d_j) denotes the training pair of circRNA c_i and disease d_j, A_ij denotes the ground truth association label between these nodes of circRNA and disease, and p_ij denotes the predicted association probability between circRNA c_i and disease d_j. Thus, the final loss function considered for all associations between circRNA and disease is as follows:

L = \sum_{(c_{i}, d_{j}) \in T r^{+} \cup T r^{-}} L (c_{i}, d_{j})

(19)

where Tr⁺ and Tr⁻ represent the positive and negative sample data in the training process, respectively, and

\cup

denotes the union set between the elements in the mathematical formula.

Deep neural network

After obtaining representative features of circRNA–disease pairs using high-order GCN, we utilized the DNN supervised learning model to identify potential associations between circRNAs and diseases. We employed three fully connected layers in the neural network in this study. In the hidden layer of the DNN, each neuron in layer i + 1 is connected to all neurons in layer i. Each hidden layer can be computed by the following equation:

x_{i + 1} = σ (\sum_{i = 1}^{n} (w_{i} x_{i} + b_{i}))

(20)

In the input and hidden layers, we employed the ReLU [28] function (f(x) = max(0,x)) as the activation function of the model. In the output layer, we employed Sigmoid [29] function (f(x) = 1/1 + e^-x) as the activation function to activate the DNN to obtain the probability score of circRNA–disease pairs, which was used to estimate the probability of association between circRNA and disease. The higher the score, the higher the association between circRNA and disease.

We used the binary cross-entropy as the loss function to judge whether the model is good or bad for the prediction results. In addition, to accelerate the training process and avoid overfitting, the Adam algorithm [30] is used to optimize the binary cross-entropy loss, and the Dropout technique [31] is also used in the input and hidden layers to further avoid overfitting of the proposed model.

Experimental results

Evaluation indicators

In the experiment, we introduced five evaluation metrics, namely, accuracy (Acc.), sensitivity (Sen.), precision (Pre.), F1-score (F1) and Matthews correlation coefficient (MCC), as evaluation criteria to measure the prediction performance of the proposed MNMDCDA model [32], which are defined as follows:

Acc . = \frac{TP + TN}{TP + TN + FP + FN}

(21)

Sen . = \frac{TP}{FN + TP}

(22)

Pre . = \frac{TP}{TP + FP}

(23)

F 1 - score = \frac{2 \times Sen . \times Pre .}{Sen . + Pre .}

(24)

MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP) \times (TP + FN) \times (TN + FP) \times (TN + FN)}}

(25)

where TP and FP are true positive and false positive, indicating the number of correctly predicted positive samples and the number of incorrectly predicted positive samples, respectively. TN and FN are true negative and false negative, denoting the number of correctly predicted negative samples and the number of incorrectly predicted negative samples, respectively. Additionally, we plotted the receiver operating characteristic (ROC) curve [33] and calculated the area under the ROC curve (AUC) [34] to clearly visualize the prediction performance of MNMDCDA.

Evaluate model performance

In the training of high-order GCN, weight decay = 0.001, learning rate = 0.001, activation function = ReLU, number of neighbors = 20 and maximum order P of high-order GCN = 4. In the prediction using DNN, we used three layers of DNN. In the first and second layers we use 256 neurons, activation function = ReLU, dropout rate = 0.5. In the third layer we use 1 neuron, activation function = sigmoid. Meanwhile, Adam algorithm is utilized to optimize the binary cross-entropy loss function. Since the maximum order p of the high-order GCN determines the farthest distance that the nodes can obtain mixed information from their neighbors in the network learning, which greatly affects the performance of the prediction model. Therefore, to achieve the best prediction performance, we need to optimize the maximum order p of the high-order GCN to choose the appropriate order. The prediction results of the proposed model at different orders are given in Table 1. To visualize the prediction results, a line graph of the prediction performance of the proposed model at different orders is given in Figure 2. From these results, we can find that the proposed model obtains the highest AUC score of 95.16% at p = 4. Finally, we select the maximum order p = 4 for the high-order GCN to conduct the experiment in this study.

Table 1

Open in new tab

The prediction results of the model at different orders

p	0	1	2	3	4	5	6	7	8
AUC (%)	91.90	92.69	93.19	94.18	95.16	94.40	94.36	93.79	93.81

p	0	1	2	3	4	5	6	7	8
AUC (%)	91.90	92.69	93.19	94.18	95.16	94.40	94.36	93.79	93.81

Table 1

Open in new tab

The prediction results of the model at different orders

p	0	1	2	3	4	5	6	7	8
AUC (%)	91.90	92.69	93.19	94.18	95.16	94.40	94.36	93.79	93.81

p	0	1	2	3	4	5	6	7	8
AUC (%)	91.90	92.69	93.19	94.18	95.16	94.40	94.36	93.79	93.81

Figure 2

Line graph of the prediction performance of the model at different orders.

Open in new tab Download slide

In the experiment, we utilized the 5-fold cross-validation approach to evaluate the prediction performance of the proposed MNMDCDA model on CircR2Disease dataset. The detailed experimental results of the 5-fold cross-validation are summarized in Table 2. From Table 2, we can see that the MNMDCDA model obtained an average accuracy of 88.69%. The average experimental results of MNMDCDA on Sen., Pre., F1, MCC and AUC were 94.07%, 85.00%, 89.28%, 77.87% and 95.16%, respectively, with their corresponding standard deviations of 1.93%, 3.00%, 2.11%, 4.57% and 1.84%, respectively. In addition, we also plotted the ROC curves generated by the MNMDCDA method using 5-fold cross-validation on the CircR2Disease dataset, as shown in Figure 3.

Table 2

Open in new tab

Experimental results of the MNMDCDA model on CircR2Disease dataset

Model	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
Our model	1	91.03	95.86	87.42	91.45	82.45	97.79
	2	88.62	94.48	84.57	89.25	77.78	94.89
	3	88.97	95.86	84.24	89.68	78.68	95.65
	4	84.83	91.72	80.61	85.81	70.33	92.67
	5	90.00	92.41	88.16	90.24	80.09	94.79
	Average	88.69	94.07	85.00	89.28	77.87	95.16
	Standard deviation	2.36	1.93	3.00	2.11	4.57	1.84
Cosine similarity model	1	81.38	86.21	78.62	82.24	63.05	91.52
	2	85.52	88.28	83.66	85.91	71.14	94.11
	3	84.48	86.21	83.33	84.75	69.01	93.28
	4	82.76	82.07	83.22	82.64	65.52	89.65
	5	86.55	86.21	86.81	86.51	73.11	92.22
	Average	84.14	85.79	83.13	84.41	68.37	92.16
	Standard deviation	2.08	2.27	2.92	1.91	4.09	1.72
DO-based disease semantic similarity model	1	87.59	95.86	82.25	88.54	76.22	93.72
	2	87.93	95.86	82.74	88.82	76.83	94.09
	3	86.55	97.93	79.78	87.93	75.07	95.01
	4	83.79	87.59	81.41	84.39	67.78	91.60
	5	83.79	88.28	81.01	84.49	67.86	91.45
	Average	85.93	93.10	81.44	86.83	72.75	93.17
	Standard deviation	2.02	4.80	1.15	2.21	4.55	1.58
DA model	Average	67.66	73.10	65.93	69.32	35.53	70.61
DA model	Standard deviation	1.85	2.34	1.71	1.80	3.71	3.30
LR model	Average	69.38	74.34	67.65	70.82	38.97	71.43
LR model	Standard deviation	1.49	2.09	1.54	1.44	2.99	3.44
NB model	Average	66.07	54.34	70.73	61.39	33.01	73.37
NB model	Standard deviation	5.01	7.85	5.12	6.87	9.88	4.85
KNN model	Average	82.21	92.69	76.66	83.91	65.88	92.08
KNN model	Standard deviation	2.78	2.16	2.73	2.39	5.43	1.63
SVM model	Average	84.97	86.76	83.86	85.25	70.04	94.37
SVM model	Standard deviation	1.73	1.65	3.05	1.44	3.37	1.27
DT model	Average	85.24	86.90	84.26	85.46	70.71	91.01
DT model	Standard deviation	0.66	4.20	2.52	0.95	1.41	1.25
Adboost model	Average	86.41	94.62	81.27	87.41	73.93	92.26
Adboost model	Standard deviation	2.13	4.35	1.11	2.25	4.77	1.76
RF model	Average	87.24	91.45	84.37	87.74	74.80	94.30
RF model	Standard deviation	1.40	2.87	1.28	1.46	2.95	0.84

Model	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
Our model	1	91.03	95.86	87.42	91.45	82.45	97.79
	2	88.62	94.48	84.57	89.25	77.78	94.89
	3	88.97	95.86	84.24	89.68	78.68	95.65
	4	84.83	91.72	80.61	85.81	70.33	92.67
	5	90.00	92.41	88.16	90.24	80.09	94.79
	Average	88.69	94.07	85.00	89.28	77.87	95.16
	Standard deviation	2.36	1.93	3.00	2.11	4.57	1.84
Cosine similarity model	1	81.38	86.21	78.62	82.24	63.05	91.52
	2	85.52	88.28	83.66	85.91	71.14	94.11
	3	84.48	86.21	83.33	84.75	69.01	93.28
	4	82.76	82.07	83.22	82.64	65.52	89.65
	5	86.55	86.21	86.81	86.51	73.11	92.22
	Average	84.14	85.79	83.13	84.41	68.37	92.16
	Standard deviation	2.08	2.27	2.92	1.91	4.09	1.72
DO-based disease semantic similarity model	1	87.59	95.86	82.25	88.54	76.22	93.72
	2	87.93	95.86	82.74	88.82	76.83	94.09
	3	86.55	97.93	79.78	87.93	75.07	95.01
	4	83.79	87.59	81.41	84.39	67.78	91.60
	5	83.79	88.28	81.01	84.49	67.86	91.45
	Average	85.93	93.10	81.44	86.83	72.75	93.17
	Standard deviation	2.02	4.80	1.15	2.21	4.55	1.58
DA model	Average	67.66	73.10	65.93	69.32	35.53	70.61
DA model	Standard deviation	1.85	2.34	1.71	1.80	3.71	3.30
LR model	Average	69.38	74.34	67.65	70.82	38.97	71.43
LR model	Standard deviation	1.49	2.09	1.54	1.44	2.99	3.44
NB model	Average	66.07	54.34	70.73	61.39	33.01	73.37
NB model	Standard deviation	5.01	7.85	5.12	6.87	9.88	4.85
KNN model	Average	82.21	92.69	76.66	83.91	65.88	92.08
KNN model	Standard deviation	2.78	2.16	2.73	2.39	5.43	1.63
SVM model	Average	84.97	86.76	83.86	85.25	70.04	94.37
SVM model	Standard deviation	1.73	1.65	3.05	1.44	3.37	1.27
DT model	Average	85.24	86.90	84.26	85.46	70.71	91.01
DT model	Standard deviation	0.66	4.20	2.52	0.95	1.41	1.25
Adboost model	Average	86.41	94.62	81.27	87.41	73.93	92.26
Adboost model	Standard deviation	2.13	4.35	1.11	2.25	4.77	1.76
RF model	Average	87.24	91.45	84.37	87.74	74.80	94.30
RF model	Standard deviation	1.40	2.87	1.28	1.46	2.95	0.84

Table 2

Open in new tab

Experimental results of the MNMDCDA model on CircR2Disease dataset

Model	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
Our model	1	91.03	95.86	87.42	91.45	82.45	97.79
	2	88.62	94.48	84.57	89.25	77.78	94.89
	3	88.97	95.86	84.24	89.68	78.68	95.65
	4	84.83	91.72	80.61	85.81	70.33	92.67
	5	90.00	92.41	88.16	90.24	80.09	94.79
	Average	88.69	94.07	85.00	89.28	77.87	95.16
	Standard deviation	2.36	1.93	3.00	2.11	4.57	1.84
Cosine similarity model	1	81.38	86.21	78.62	82.24	63.05	91.52
	2	85.52	88.28	83.66	85.91	71.14	94.11
	3	84.48	86.21	83.33	84.75	69.01	93.28
	4	82.76	82.07	83.22	82.64	65.52	89.65
	5	86.55	86.21	86.81	86.51	73.11	92.22
	Average	84.14	85.79	83.13	84.41	68.37	92.16
	Standard deviation	2.08	2.27	2.92	1.91	4.09	1.72
DO-based disease semantic similarity model	1	87.59	95.86	82.25	88.54	76.22	93.72
	2	87.93	95.86	82.74	88.82	76.83	94.09
	3	86.55	97.93	79.78	87.93	75.07	95.01
	4	83.79	87.59	81.41	84.39	67.78	91.60
	5	83.79	88.28	81.01	84.49	67.86	91.45
	Average	85.93	93.10	81.44	86.83	72.75	93.17
	Standard deviation	2.02	4.80	1.15	2.21	4.55	1.58
DA model	Average	67.66	73.10	65.93	69.32	35.53	70.61
DA model	Standard deviation	1.85	2.34	1.71	1.80	3.71	3.30
LR model	Average	69.38	74.34	67.65	70.82	38.97	71.43
LR model	Standard deviation	1.49	2.09	1.54	1.44	2.99	3.44
NB model	Average	66.07	54.34	70.73	61.39	33.01	73.37
NB model	Standard deviation	5.01	7.85	5.12	6.87	9.88	4.85
KNN model	Average	82.21	92.69	76.66	83.91	65.88	92.08
KNN model	Standard deviation	2.78	2.16	2.73	2.39	5.43	1.63
SVM model	Average	84.97	86.76	83.86	85.25	70.04	94.37
SVM model	Standard deviation	1.73	1.65	3.05	1.44	3.37	1.27
DT model	Average	85.24	86.90	84.26	85.46	70.71	91.01
DT model	Standard deviation	0.66	4.20	2.52	0.95	1.41	1.25
Adboost model	Average	86.41	94.62	81.27	87.41	73.93	92.26
Adboost model	Standard deviation	2.13	4.35	1.11	2.25	4.77	1.76
RF model	Average	87.24	91.45	84.37	87.74	74.80	94.30
RF model	Standard deviation	1.40	2.87	1.28	1.46	2.95	0.84

Model	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
Our model	1	91.03	95.86	87.42	91.45	82.45	97.79
	2	88.62	94.48	84.57	89.25	77.78	94.89
	3	88.97	95.86	84.24	89.68	78.68	95.65
	4	84.83	91.72	80.61	85.81	70.33	92.67
	5	90.00	92.41	88.16	90.24	80.09	94.79
	Average	88.69	94.07	85.00	89.28	77.87	95.16
	Standard deviation	2.36	1.93	3.00	2.11	4.57	1.84
Cosine similarity model	1	81.38	86.21	78.62	82.24	63.05	91.52
	2	85.52	88.28	83.66	85.91	71.14	94.11
	3	84.48	86.21	83.33	84.75	69.01	93.28
	4	82.76	82.07	83.22	82.64	65.52	89.65
	5	86.55	86.21	86.81	86.51	73.11	92.22
	Average	84.14	85.79	83.13	84.41	68.37	92.16
	Standard deviation	2.08	2.27	2.92	1.91	4.09	1.72
DO-based disease semantic similarity model	1	87.59	95.86	82.25	88.54	76.22	93.72
	2	87.93	95.86	82.74	88.82	76.83	94.09
	3	86.55	97.93	79.78	87.93	75.07	95.01
	4	83.79	87.59	81.41	84.39	67.78	91.60
	5	83.79	88.28	81.01	84.49	67.86	91.45
	Average	85.93	93.10	81.44	86.83	72.75	93.17
	Standard deviation	2.02	4.80	1.15	2.21	4.55	1.58
DA model	Average	67.66	73.10	65.93	69.32	35.53	70.61
DA model	Standard deviation	1.85	2.34	1.71	1.80	3.71	3.30
LR model	Average	69.38	74.34	67.65	70.82	38.97	71.43
LR model	Standard deviation	1.49	2.09	1.54	1.44	2.99	3.44
NB model	Average	66.07	54.34	70.73	61.39	33.01	73.37
NB model	Standard deviation	5.01	7.85	5.12	6.87	9.88	4.85
KNN model	Average	82.21	92.69	76.66	83.91	65.88	92.08
KNN model	Standard deviation	2.78	2.16	2.73	2.39	5.43	1.63
SVM model	Average	84.97	86.76	83.86	85.25	70.04	94.37
SVM model	Standard deviation	1.73	1.65	3.05	1.44	3.37	1.27
DT model	Average	85.24	86.90	84.26	85.46	70.71	91.01
DT model	Standard deviation	0.66	4.20	2.52	0.95	1.41	1.25
Adboost model	Average	86.41	94.62	81.27	87.41	73.93	92.26
Adboost model	Standard deviation	2.13	4.35	1.11	2.25	4.77	1.76
RF model	Average	87.24	91.45	84.37	87.74	74.80	94.30
RF model	Standard deviation	1.40	2.87	1.28	1.46	2.95	0.84

Figure 3

ROC curves of 5-fold cross-validation achieved by MNMDCDA on CircR2Disease dataset.

Open in new tab Download slide

Comparison with cosine similarity model

In the MNMDCDA model, we used GIPK similarity to denote the correlation between circRNA and disease. Therefore, to verify whether GIPK similarity is beneficial to the prediction performance of the proposed model, we compared it with cosine similarity. To be fair, we only used cosine similarity instead of GIPK similarity, and the other parts of the model remain unchanged. The results are presented in Table 2. As shown in Table 2, the average values of Acc., Sen., Pre., F1, MCC and AUC obtained based on the cosine similarity model were 4.55%, 8.28%, 1.87%, 4.87%, 9.50% and 3.00% less than the MNMDCDA model, respectively. Figure 4 shows the ROC curves generated by the cosine similarity model on the CircR2Disease dataset. Figure 5 visualizes the experimental results of the cosine similarity model and the proposed model on the CircR2Disease dataset. From these results, it can be seen that the prediction performance of the MNMDCDA model is superior to that of the cosine similarity-based model on the same dataset.

Figure 4

ROC curves of 5-fold cross-validation achieved by cosine similarity model on CircR2Disease dataset.

Open in new tab Download slide

Figure 5

Comparison of the proposed different combinatorial models on the CircR2Disease dataset.

Open in new tab Download slide

Comparison with DO-based disease semantic similarity model

In the experiment, we used MeSH-based disease semantic similarity to represent the correlation between two diseases. Therefore, to verify whether the MeSH-based disease semantic similarity is beneficial to the prediction performance of the proposed model, we compared it with the DO-based disease semantic similarity. Similarly, we perform the same 5-fold cross-validation experiment on the CircR2Disease dataset, and the results are shown in Table 2. As shown in Table 2, the average values of Acc., Sen., Pre., F1, MCC and AUC obtained from the DO-based disease semantic similarity model were 2.76%, 0.97%, 3.56%, 2.45%, 5.12% and 1.99% less than the MNMDCDA model, respectively. Figure 5 visualizes the experimental results of the DO-based disease semantic similarity model and the proposed model on the CircR2Disease dataset. Figure 6 shows the ROC curves generated by the DO-based disease semantic similarity model on the CircR2Disease dataset.

Figure 6

ROC curves of 5-fold cross-validation achieved by DO-based disease semantic similarity model on CircR2Disease dataset.

Open in new tab Download slide

Comparison of various classifier models

To evaluate the impact of the DNN classifier on the overall performance of the MNMDCDA model, we compared eight different computational models, including discriminant analysis (DA), logistic regression (LR), naive Bayes (NB), K-nearest neighbor (KNN), SVM, Decision tree (DT), Adboost and Random Forest (RF). Table 2 shows the average results of the 5-fold cross-validation obtained by these models on the CircR2Disease dataset. As can be seen from Table 2, the highest average accuracy of the eight models is 87.24%, which is significantly lower than the proposed MNMDCDA model with an average accuracy of 88.69%. Figure 7 visualizes the experimental results of different classifier models on the CircR2Disease dataset. The results of this experiment further suggest that the use of DNN classifier in the MNMDCDA model can not only accurately determine whether circRNAs are associated with diseases but also contributes to the improvement of model prediction performance.

Figure 7

Comparison of various classifier models on the CircR2Disease dataset.

Open in new tab Download slide

Performance on independent dataset

Although the MNMDCDA model achieved good prediction performance on the CircR2Disease dataset, we also need to test its predictive ability on other independent datasets. In this paper, CircAtlas v2.0 [35], Circ2Disease [36] and CircRNADisease [37] are treated as independent datasets to examine the generalization performance of the model. The results are summarized in Table 3.

Table 3

Open in new tab

Results of 5-fold cross-validation achieved by the proposed model on three other independent datasets

Independent datasets	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
CircAtlas v2.0	1	84.37	95.88	77.99	86.02	70.61	92.26
	2	87.61	94.67	82.90	88.40	76.00	93.74
	3	90.83	92.90	89.20	91.01	81.73	96.92
	4	86.98	98.22	80.19	88.30	75.91	96.41
	5	87.57	95.86	82.23	88.52	76.20	93.34
	Average	87.47 ± 2.30	95.51 ± 1.95	82.50 ± 4.21	88.45 ± 1.77	76.09 ± 3.93	94.53 ± 2.03
Circ2Disease	1	83.33	88.89	80.00	84.21	67.08	90.12
	2	77.78	85.19	74.19	79.31	56.18	88.99
	3	86.11	83.33	88.24	85.71	72.33	93.52
	4	85.19	94.44	79.69	86.44	71.61	93.86
	5	74.07	74.07	74.07	74.07	48.15	82.51
	Average	81.30 ± 5.17	85.19 ± 7.52	79.24 ± 5.78	81.95 ± 5.21	63.07 ± 10.55	89.80 ± 4.59
CircRNADisease	1	80.71	87.14	77.22	81.88	61.94	92.20
	2	87.14	97.14	80.95	88.31	75.82	90.41
	3	86.43	97.14	80.00	87.74	74.59	92.47
	4	80.71	84.29	78.67	81.38	61.59	92.53
	5	85.71	98.57	78.41	87.34	73.91	91.53
	Average	84.14 ± 3.17	92.86 ± 6.62	79.05 ± 1.45	85.33 ± 3.40	69.57 ± 7.16	91.83 ± 0.89

Independent datasets	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
CircAtlas v2.0	1	84.37	95.88	77.99	86.02	70.61	92.26
	2	87.61	94.67	82.90	88.40	76.00	93.74
	3	90.83	92.90	89.20	91.01	81.73	96.92
	4	86.98	98.22	80.19	88.30	75.91	96.41
	5	87.57	95.86	82.23	88.52	76.20	93.34
	Average	87.47 ± 2.30	95.51 ± 1.95	82.50 ± 4.21	88.45 ± 1.77	76.09 ± 3.93	94.53 ± 2.03
Circ2Disease	1	83.33	88.89	80.00	84.21	67.08	90.12
	2	77.78	85.19	74.19	79.31	56.18	88.99
	3	86.11	83.33	88.24	85.71	72.33	93.52
	4	85.19	94.44	79.69	86.44	71.61	93.86
	5	74.07	74.07	74.07	74.07	48.15	82.51
	Average	81.30 ± 5.17	85.19 ± 7.52	79.24 ± 5.78	81.95 ± 5.21	63.07 ± 10.55	89.80 ± 4.59
CircRNADisease	1	80.71	87.14	77.22	81.88	61.94	92.20
	2	87.14	97.14	80.95	88.31	75.82	90.41
	3	86.43	97.14	80.00	87.74	74.59	92.47
	4	80.71	84.29	78.67	81.38	61.59	92.53
	5	85.71	98.57	78.41	87.34	73.91	91.53
	Average	84.14 ± 3.17	92.86 ± 6.62	79.05 ± 1.45	85.33 ± 3.40	69.57 ± 7.16	91.83 ± 0.89

Table 3

Open in new tab

Results of 5-fold cross-validation achieved by the proposed model on three other independent datasets

Independent datasets	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
CircAtlas v2.0	1	84.37	95.88	77.99	86.02	70.61	92.26
	2	87.61	94.67	82.90	88.40	76.00	93.74
	3	90.83	92.90	89.20	91.01	81.73	96.92
	4	86.98	98.22	80.19	88.30	75.91	96.41
	5	87.57	95.86	82.23	88.52	76.20	93.34
	Average	87.47 ± 2.30	95.51 ± 1.95	82.50 ± 4.21	88.45 ± 1.77	76.09 ± 3.93	94.53 ± 2.03
Circ2Disease	1	83.33	88.89	80.00	84.21	67.08	90.12
	2	77.78	85.19	74.19	79.31	56.18	88.99
	3	86.11	83.33	88.24	85.71	72.33	93.52
	4	85.19	94.44	79.69	86.44	71.61	93.86
	5	74.07	74.07	74.07	74.07	48.15	82.51
	Average	81.30 ± 5.17	85.19 ± 7.52	79.24 ± 5.78	81.95 ± 5.21	63.07 ± 10.55	89.80 ± 4.59
CircRNADisease	1	80.71	87.14	77.22	81.88	61.94	92.20
	2	87.14	97.14	80.95	88.31	75.82	90.41
	3	86.43	97.14	80.00	87.74	74.59	92.47
	4	80.71	84.29	78.67	81.38	61.59	92.53
	5	85.71	98.57	78.41	87.34	73.91	91.53
	Average	84.14 ± 3.17	92.86 ± 6.62	79.05 ± 1.45	85.33 ± 3.40	69.57 ± 7.16	91.83 ± 0.89

Independent datasets	Testing set	Acc. (%)	Sen. (%)	Pre. (%)	F1 (%)	MCC (%)	AUC (%)
CircAtlas v2.0	1	84.37	95.88	77.99	86.02	70.61	92.26
	2	87.61	94.67	82.90	88.40	76.00	93.74
	3	90.83	92.90	89.20	91.01	81.73	96.92
	4	86.98	98.22	80.19	88.30	75.91	96.41
	5	87.57	95.86	82.23	88.52	76.20	93.34
	Average	87.47 ± 2.30	95.51 ± 1.95	82.50 ± 4.21	88.45 ± 1.77	76.09 ± 3.93	94.53 ± 2.03
Circ2Disease	1	83.33	88.89	80.00	84.21	67.08	90.12
	2	77.78	85.19	74.19	79.31	56.18	88.99
	3	86.11	83.33	88.24	85.71	72.33	93.52
	4	85.19	94.44	79.69	86.44	71.61	93.86
	5	74.07	74.07	74.07	74.07	48.15	82.51
	Average	81.30 ± 5.17	85.19 ± 7.52	79.24 ± 5.78	81.95 ± 5.21	63.07 ± 10.55	89.80 ± 4.59
CircRNADisease	1	80.71	87.14	77.22	81.88	61.94	92.20
	2	87.14	97.14	80.95	88.31	75.82	90.41
	3	86.43	97.14	80.00	87.74	74.59	92.47
	4	80.71	84.29	78.67	81.38	61.59	92.53
	5	85.71	98.57	78.41	87.34	73.91	91.53
	Average	84.14 ± 3.17	92.86 ± 6.62	79.05 ± 1.45	85.33 ± 3.40	69.57 ± 7.16	91.83 ± 0.89

From Table 3, the average AUC values of the proposed model on three independent datasets were 94.53%, 89.80% and 91.83%, respectively. Therefore, this model can be used to explore organisms for which circRNA–disease association data are not yet available and to provide appropriate experience for further discovering new candidate diseases associated with circRNAs. Figure 8 gives the histogram of the experimental results of the proposed model on the independent dataset.

Figure 8

Comparison of experimental results on the benchmark dataset.

Open in new tab Download slide

Comparison with other existing methods

To further evaluate the prediction performance of the MNMDCDA model, we compare it with these six popular methods using the same dataset, including MGRCDA [34], NMFCDA [38], SGANRDA [39], iCircDA-MF [40], GCNCDA [41] and PWCDA [42]. To be fair, we use the AUC value that can fully reflect the stability of the model as a comparison index between different methods. Table 4 summarizes the AUC values obtained by these models on CircR2Disease. Figure 9 shows the line graph of the AUC scores obtained on the CircR2Disease dataset by different computational methods. These comparative results demonstrate that the MNMDCDA model using the high-order GCN framework combined with multi-source similarity networks has the best performance and is a promising approach.

Table 4

Open in new tab

The 5-fold cross-validation AUC values achieved by the various models

Methods	MNMDCDA	MGRCDA	NMFCDA	SGANRDA	iCircDA-MF	GCNCDA	PWCDA
AUC	0.9516	0.9298	0.9278	0.9215	0.9178	0.9090	0.8900

Methods	MNMDCDA	MGRCDA	NMFCDA	SGANRDA	iCircDA-MF	GCNCDA	PWCDA
AUC	0.9516	0.9298	0.9278	0.9215	0.9178	0.9090	0.8900

Table 4

Open in new tab

The 5-fold cross-validation AUC values achieved by the various models

Methods	MNMDCDA	MGRCDA	NMFCDA	SGANRDA	iCircDA-MF	GCNCDA	PWCDA
AUC	0.9516	0.9298	0.9278	0.9215	0.9178	0.9090	0.8900

Methods	MNMDCDA	MGRCDA	NMFCDA	SGANRDA	iCircDA-MF	GCNCDA	PWCDA
AUC	0.9516	0.9298	0.9278	0.9215	0.9178	0.9090	0.8900

Figure 9

Comparison of the AUC values of existing computational methods on the CircR2Disease dataset.

Open in new tab Download slide

Case studies

To further investigate the effectiveness of MNMDCDA in screening unknown disease candidate circRNAs, we conducted the case studies experiment on the CircR2Disease dataset. After model prediction, the experimental results are shown in Table 5, from which we can see that 25 of the top 30 circRNA–disease pairs have been confirmed in the recently published literature. In general, MNMDCDA has an excellent ability to predict potential disease-associated circRNAs, and these top candidates will likely be selected for further biological studies to reduce the range of wet-lab experimental searches.

Table 5

Open in new tab

Top 30 circRNA–disease associations predicted by MNMDCDA

Rank	circRNA	Disease	Evidence (PMID/ORCID)	Year
1	hsa_circ_001569	Breast cancer	31104012	2019
2	circFAT1	Breast cancer	34288822	2021
3	hsa_circ_0000190	Breast cancer	10.1093/annonc/mdy428.010	2018
4	hsa_circ_001763	Breast cancer	30509108	2019
5	ciRS-7	Breast cancer	33390857	2021
6	hsa_circ_0083964	Osteoarthritis	Unconfirmed	N/A
7	hsa_circ_001988	Gastric cancer	32592202	2021
8	hsa_circ_0001724	Gastric cancer	10.1016/j.genrep.2021.101226	2021
9	circFAT1	Gastric cancer	30419346	2019
10	circRHOBTB3	Gastric cancer	31928527	2020
11	hsa_circ_0023404	Rheumatoid arthritis	Unconfirmed	N/A
12	hsa_circ_0000520	Breast cancer	10.21203/rs.3.rs-1023577/v1	2021
13	circBRAF	Glioma	33650075	2021
14	hsa_circ_005239	Breast cancer	29037220	2017
15	Circ_HIPK3	Glioblastoma	34198978	2021
16	Circ_SMARCA5	Glioblastoma	30736462	2019
17	hsa_circ_0001566	Glioma	Unconfirmed	N/A
18	circHIPK3	Pancreatic cancer	32104074	2020
19	circRTN4	Pancreatic cancer	34983537	2022
20	circRHOBTB3	Pancreatic cancer	34416910	2021
21	hsa_circ_0089974	Gastric cancer	Unconfirmed	N/A
22	circHIPK3	Lung cancer	31232177	2020
23	hsa_circ_0001649	Pancreatic cancer	31138014	2019
24	hsa_circ_0005015	Diabetic retinopathy	29288268	2017
25	hsa_circRNA_100750	Diabetes retinopathy	28817829	2017
26	hsa_circ_0005927	Colorectal cancer	33312376	2020
27	hsa_circ_0081108	Diabetic retinopathy	32497630	2020
28	hsa_circ_0045510	Osteosarcoma	Unconfirmed	N/A
29	circHIAT1	Hepatocellular carcinoma	31108351	2019
30	circFAT1	Hepatocellular carcinoma	33179443	2020

Rank	circRNA	Disease	Evidence (PMID/ORCID)	Year
1	hsa_circ_001569	Breast cancer	31104012	2019
2	circFAT1	Breast cancer	34288822	2021
3	hsa_circ_0000190	Breast cancer	10.1093/annonc/mdy428.010	2018
4	hsa_circ_001763	Breast cancer	30509108	2019
5	ciRS-7	Breast cancer	33390857	2021
6	hsa_circ_0083964	Osteoarthritis	Unconfirmed	N/A
7	hsa_circ_001988	Gastric cancer	32592202	2021
8	hsa_circ_0001724	Gastric cancer	10.1016/j.genrep.2021.101226	2021
9	circFAT1	Gastric cancer	30419346	2019
10	circRHOBTB3	Gastric cancer	31928527	2020
11	hsa_circ_0023404	Rheumatoid arthritis	Unconfirmed	N/A
12	hsa_circ_0000520	Breast cancer	10.21203/rs.3.rs-1023577/v1	2021
13	circBRAF	Glioma	33650075	2021
14	hsa_circ_005239	Breast cancer	29037220	2017
15	Circ_HIPK3	Glioblastoma	34198978	2021
16	Circ_SMARCA5	Glioblastoma	30736462	2019
17	hsa_circ_0001566	Glioma	Unconfirmed	N/A
18	circHIPK3	Pancreatic cancer	32104074	2020
19	circRTN4	Pancreatic cancer	34983537	2022
20	circRHOBTB3	Pancreatic cancer	34416910	2021
21	hsa_circ_0089974	Gastric cancer	Unconfirmed	N/A
22	circHIPK3	Lung cancer	31232177	2020
23	hsa_circ_0001649	Pancreatic cancer	31138014	2019
24	hsa_circ_0005015	Diabetic retinopathy	29288268	2017
25	hsa_circRNA_100750	Diabetes retinopathy	28817829	2017
26	hsa_circ_0005927	Colorectal cancer	33312376	2020
27	hsa_circ_0081108	Diabetic retinopathy	32497630	2020
28	hsa_circ_0045510	Osteosarcoma	Unconfirmed	N/A
29	circHIAT1	Hepatocellular carcinoma	31108351	2019
30	circFAT1	Hepatocellular carcinoma	33179443	2020

Table 5

Open in new tab

Top 30 circRNA–disease associations predicted by MNMDCDA

Rank	circRNA	Disease	Evidence (PMID/ORCID)	Year
1	hsa_circ_001569	Breast cancer	31104012	2019
2	circFAT1	Breast cancer	34288822	2021
3	hsa_circ_0000190	Breast cancer	10.1093/annonc/mdy428.010	2018
4	hsa_circ_001763	Breast cancer	30509108	2019
5	ciRS-7	Breast cancer	33390857	2021
6	hsa_circ_0083964	Osteoarthritis	Unconfirmed	N/A
7	hsa_circ_001988	Gastric cancer	32592202	2021
8	hsa_circ_0001724	Gastric cancer	10.1016/j.genrep.2021.101226	2021
9	circFAT1	Gastric cancer	30419346	2019
10	circRHOBTB3	Gastric cancer	31928527	2020
11	hsa_circ_0023404	Rheumatoid arthritis	Unconfirmed	N/A
12	hsa_circ_0000520	Breast cancer	10.21203/rs.3.rs-1023577/v1	2021
13	circBRAF	Glioma	33650075	2021
14	hsa_circ_005239	Breast cancer	29037220	2017
15	Circ_HIPK3	Glioblastoma	34198978	2021
16	Circ_SMARCA5	Glioblastoma	30736462	2019
17	hsa_circ_0001566	Glioma	Unconfirmed	N/A
18	circHIPK3	Pancreatic cancer	32104074	2020
19	circRTN4	Pancreatic cancer	34983537	2022
20	circRHOBTB3	Pancreatic cancer	34416910	2021
21	hsa_circ_0089974	Gastric cancer	Unconfirmed	N/A
22	circHIPK3	Lung cancer	31232177	2020
23	hsa_circ_0001649	Pancreatic cancer	31138014	2019
24	hsa_circ_0005015	Diabetic retinopathy	29288268	2017
25	hsa_circRNA_100750	Diabetes retinopathy	28817829	2017
26	hsa_circ_0005927	Colorectal cancer	33312376	2020
27	hsa_circ_0081108	Diabetic retinopathy	32497630	2020
28	hsa_circ_0045510	Osteosarcoma	Unconfirmed	N/A
29	circHIAT1	Hepatocellular carcinoma	31108351	2019
30	circFAT1	Hepatocellular carcinoma	33179443	2020

Rank	circRNA	Disease	Evidence (PMID/ORCID)	Year
1	hsa_circ_001569	Breast cancer	31104012	2019
2	circFAT1	Breast cancer	34288822	2021
3	hsa_circ_0000190	Breast cancer	10.1093/annonc/mdy428.010	2018
4	hsa_circ_001763	Breast cancer	30509108	2019
5	ciRS-7	Breast cancer	33390857	2021
6	hsa_circ_0083964	Osteoarthritis	Unconfirmed	N/A
7	hsa_circ_001988	Gastric cancer	32592202	2021
8	hsa_circ_0001724	Gastric cancer	10.1016/j.genrep.2021.101226	2021
9	circFAT1	Gastric cancer	30419346	2019
10	circRHOBTB3	Gastric cancer	31928527	2020
11	hsa_circ_0023404	Rheumatoid arthritis	Unconfirmed	N/A
12	hsa_circ_0000520	Breast cancer	10.21203/rs.3.rs-1023577/v1	2021
13	circBRAF	Glioma	33650075	2021
14	hsa_circ_005239	Breast cancer	29037220	2017
15	Circ_HIPK3	Glioblastoma	34198978	2021
16	Circ_SMARCA5	Glioblastoma	30736462	2019
17	hsa_circ_0001566	Glioma	Unconfirmed	N/A
18	circHIPK3	Pancreatic cancer	32104074	2020
19	circRTN4	Pancreatic cancer	34983537	2022
20	circRHOBTB3	Pancreatic cancer	34416910	2021
21	hsa_circ_0089974	Gastric cancer	Unconfirmed	N/A
22	circHIPK3	Lung cancer	31232177	2020
23	hsa_circ_0001649	Pancreatic cancer	31138014	2019
24	hsa_circ_0005015	Diabetic retinopathy	29288268	2017
25	hsa_circRNA_100750	Diabetes retinopathy	28817829	2017
26	hsa_circ_0005927	Colorectal cancer	33312376	2020
27	hsa_circ_0081108	Diabetic retinopathy	32497630	2020
28	hsa_circ_0045510	Osteosarcoma	Unconfirmed	N/A
29	circHIAT1	Hepatocellular carcinoma	31108351	2019
30	circFAT1	Hepatocellular carcinoma	33179443	2020

Conclusion

Identifying the association between circRNAs and diseases can not only provide insight into the pathogenesis of complex diseases but also provide effective ideas and solutions for early prevention, diagnosis and treatment of diseases. In this paper, we propose a novel computational model MNMDCDA combining high-order GCN and DNN, aiming to investigate the potential relationship between circRNAs and diseases. To evaluate the model performance, we performed several ablation experiments on four datasets, including comparison of cosine similarity model, DO-based disease semantic similarity model, different classifier models, comparison of model generalization performance with other existing models. Numerous experimental results suggest that MNMDCDA outperforms other existing computational models and can effectively discriminate new disease-associated circRNAs.

There are three main reasons for the excellent performance of MNMDCDA: (1) MNMDCDA integrates multiple biological attribute information between circRNAs and diseases to form fusion descriptors and to construct multiple multi-source similarity networks. (2) Using the GCN algorithm of deep learning to fully learn the high-order mixed neighborhood embedding representation of circRNAs and diseases. (3) MNMDCDA can effectively predict the potential disease-related circRNAs from the fused features, and it has good generalization performance on three independent datasets.

Key Points

Integrating the multiple biological attribute information of circRNAs and diseases can comprehensively describe the complex association between circRNAs and diseases from multiple perspectives.
The high-order GCN algorithm of deep learning is used to learn the embedding representations with high-order mixed neighborhood information of circRNAs and diseases from multiple multi-source similarity networks, respectively.
Experimental results on three other benchmark datasets ensure the generalization performance of the MNMDCDA model and provide corresponding theoretical guidance for further wet-lab approaches.
Extensive experimental results demonstrate the superior performance of the MNMDCDA model in predicting potential circRNA–disease associations.

Data Availability

The data sets and source code can be freely downloaded from: https://github.com/ly2021010123/MNMDCDA/.

Acknowledgements

The authors would like to thank all anonymous reviewers for their constructive advice.

Funding

National Natural Science Foundation of China (61976077, 62076085, 62172355, 62120106008), in part by the Major special projects of the Ministry of Science and Technology (2021ZD0200403), in part by the Qingtan scholar talent project of Zaozhuang University.

Author Biographies

Yang Li is a PhD student in the Key Laboratory of Knowledge Engineering with Big Data in Anhui Province, School of Computer Science and Information Engineering at Hefei University of Technology, Hefei, China. His current research interests include machine learning, data mining and its applications in bioinformatics.

Xue-Gang Hu is a professor of Hefei University of Technology. His research interests include data mining and knowledge engineering.

Lei Wang is a professor of Guangxi Academy of Sciences. His research interests include data mining, machine learning, deep learning, computational biology and bioinformatics.

Pei-Pei Li is an associate professor of Hefei University of Technology. Her research interests include data mining and intelligent computing.

Zhu-Hong You is a professor of Northwestern Polytechnical University. His research interests include neural networks, intelligent information processing, sparse representation and its applications in bioinformatics.

References

1.

Kristensen

L

,

Hansen

T

,

Venø

M

, et al.

Circular RNAs in cancer: opportunities and challenges in the field

.

Oncogene

2018

;

37

(

5

):

555

–

65

.

2.

Wang

L

,

Wong

L

,

You

Z-H

, et al.

NSECDA: natural semantic enhancement for circRNA-disease association prediction

.

IEEE J Biomed Health Inform

2022

;

26

:

5075

–

84

.

3.

Memczak

S

,

Jens

M

,

Elefsinioti

A

, et al.

Circular RNAs are a large class of animal RNAs with regulatory potency

.

Nature

2013

;

495

(

7441

):

333

–

8

.

4.

Zhang

X-O

,

Wang

H-B

,

Zhang

Y

, et al.

Complementary sequence-mediated exon circularization

.

Cell

2014

;

159

(

1

):

134

–

47

.

5.

Sanger

HL

,

Klotz

G

,

Riesner

D

, et al.

Viroids are single-stranded covalently closed circular RNA molecules existing as highly base-paired rod-like structures

.

Proc Natl Acad Sci

1976

;

73

(

11

):

3852

–

6

.

6.

Kolakofsky

D

.

Isolation and characterization of Sendai virus DI-RNAs

.

Cell

1976

;

8

(

4

):

547

–

55

.

7.

Nigro

JM

,

Cho

KR

,

Fearon

ER

, et al.

Scrambled exons

.

Cell

1991

;

64

(

3

):

607

–

13

.

8.

Capel

B

,

Swain

A

,

Nicolis

S

, et al.

Circular transcripts of the testis-determining gene Sry in adult mouse testis

.

Cell

1993

;

73

(

5

):

1019

–

30

.

9.

Hansen

TB

,

Jensen

TI

,

Clausen

BH

, et al.

Natural RNA circles function as efficient microRNA sponges

.

Nature

2013

;

495

(

7441

):

384

–

8

.

10.

Zeng

X

,

Lin

W

,

Guo

M

, et al.

A comprehensive overview and evaluation of circular RNA detection tools

.

PLoS Comput Biol

2017

;

13

(

6

):e1005420.

Google Scholar

OpenURL Placeholder Text

WorldCat

11.

Wang

L

,

Wong

L

,

Li

Z

, et al.

A machine learning framework based on multi-source feature fusion for circRNA-disease association prediction

.

Brief Bioinform

2022

;

23

(5):bbac388.

Google Scholar

OpenURL Placeholder Text

WorldCat

12.

Niu

M

,

Zou

Q

,

Wang

C

.

GMNN2CD: identification of circRNA–disease associations based on variational inference and graph Markov neural networks

.

Bioinformatics

2022

;

38

(

8

):

2246

–

53

.

Google Scholar

Crossref

WorldCat

13.

Niu

M

,

Ju

Y

,

Lin

C

, et al.

Characterizing viral circRNAs and their application in identifying circRNAs in viruses

.

Brief Bioinform

2022

;

23

(

1

):bbab404.

Google Scholar

OpenURL Placeholder Text

WorldCat

14.

Xiao

Q

,

Zhong

J

,

Tang

X

, et al.

iCDA-CMG: identifying circRNA-disease associations by federating multi-similarity fusion and collective matrix completion

.

Mol Gen Genomics

2021

;

296

(

1

):

223

–

33

.

Google Scholar

Crossref

WorldCat

15.

Zheng

K

,

You

Z-H

,

Li

J-Q

, et al.

iCDA-CGR: identification of circRNA-disease associations based on Chaos game representation

.

PLoS Comput Biol

2020

;

16

(

5

):e1007872.

Google Scholar

OpenURL Placeholder Text

WorldCat

16.

Yang

J

,

Lei

X

.

Predicting circRNA-disease associations based on autoencoder and graph embedding

.

Inf Sci

2021

;

571

:

323

–

36

.

Google Scholar

Crossref

WorldCat

17.

Lu

C

,

Zeng

M

,

Wu

F-X

, et al.

Improving circRNA–disease association prediction by sequence and ontology representations with convolutional and recurrent neural networks

.

Bioinformatics

2021

;

36

(

24

):

5656

–

64

.

Google Scholar

Crossref

WorldCat

18.

Fan

C

,

Lei

X

,

Fang

Z

, et al.

CircR2Disease: a manually curated database for experimentally supported circular RNAs associated with various diseases

.

Database

2018

;

2018

:

1

–

6

.

Google Scholar

Crossref

WorldCat

19.

Wang

D

,

Wang

J

,

Lu

M

, et al.

Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases

.

Bioinformatics

2010

;

26

(

13

):

1644

–

50

.

20.

Xiang

Z

,

Qin

T

,

Qin

ZS

, et al.

A genome-wide MeSH-based literature mining system predicts implicit gene-to-gene relationships and networks

.

BMC Syst Biol

2013

;

7

(

3

):

1

–

15

.

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

21.

Wang

L

,

You

Z-H

,

Chen

X

, et al.

LMTRDA: using logistic model tree to predict MiRNA-disease associations by fusing multi-source information of sequences and similarities

.

PLoS Comput Biol

2019

;

15

(

3

):e1006865.

Google Scholar

OpenURL Placeholder Text

WorldCat

22.

Jeon

M

,

Park

D

,

Lee

J

, et al.

ReSimNet: drug response similarity prediction using Siamese neural networks

.

Bioinformatics

2019

;

35

(

24

):

5249

–

56

.

23.

Yu

G

,

Wang

L-G

,

Yan

G-R

, et al.

DOSE: an R/Bioconductor package for disease ontology semantic and enrichment analysis

.

Bioinformatics

2015

;

31

(

4

):

608

–

9

.

24.

Chen

X

,

Yan

CC

,

Zhang

X

, et al.

WBSMDA: within and between score for MiRNA-disease association prediction

.

Sci Rep

2016

;

6

(

1

):

1

–

9

.

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

25.

Abu-El-Haija

S

,

Perozzi

B

,

Kapoor

A

, et al. Mixhop: higher-order graph convolutional architectures via sparsified neighborhood mixing. In:

International Conference on Machine Learning

. Long Beach, CA,

PMLR

,

2019

,

21

–

9

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

26.

Kipf

TN

,

Welling

M

.

Semi-supervised classification with graph convolutional networks

.

arXiv Preprint

2016

;arXiv:1609.02907.

Google Scholar

OpenURL Placeholder Text

WorldCat

27.

Wang

J

,

Liang

J

,

Cui

J

, et al.

Semi-supervised learning with mixed-order graph convolutional networks

.

Inf Sci

2021

;

573

:

171

–

81

.

Google Scholar

Crossref

WorldCat

28.

Hara

K

,

Saito

D

,

Shouno

H

. Analysis of function of rectified linear unit used in deep learning. In:

2015 International Joint Conference on Neural Networks (IJCNN)

.

Killarney, New York

:

IEEE

,

2015

,

1

–

8

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

29.

Wanto

A

,

Windarto

AP

,

Hartama

D

, et al.

Use of binary sigmoid function and linear identity in artificial neural networks for forecasting population density

.

Int J Inf Syst Technol

2017

;

1

(

1

):

43

–

54

.

Google Scholar

OpenURL Placeholder Text

WorldCat

30.

Kingma

DP

,

Ba

J

.

Adam: a method for stochastic optimization

.

arXiv Preprint

2014

;arXiv:1412.6980.

Google Scholar

OpenURL Placeholder Text

WorldCat

31.

Srivastava

N

,

Hinton

G

,

Krizhevsky

A

, et al.

Dropout: a simple way to prevent neural networks from overfitting

.

J Mach Learn Res

2014

;

15

(

1

):

1929

–

58

.

Google Scholar

OpenURL Placeholder Text

WorldCat

32.

Su

X

,

You

Z-H

,

Huang

D-s

, et al.

Biomedical knowledge graph embedding with capsule network for multi-label drug-drug interaction prediction

.

IEEE Trans Knowl Data Eng

2022

;

1-1

.

Google Scholar

OpenURL Placeholder Text

WorldCat

33.

Bradley

AP

.

The use of the area under the ROC curve in the evaluation of machine learning algorithms

.

Pattern Recogn

1997

;

30

(

7

):

1145

–

59

.

Google Scholar

Crossref

WorldCat

34.

Wang

L

,

You

Z-H

,

Huang

D-S

, et al.

MGRCDA: Metagraph Recommendation Method for Predicting CircRNA-Disease Association

.

IEEE Trans Cybern

2021

;1–9. https://doi.org/10.1109/TCYB.2021.3090756.

Google Scholar

OpenURL Placeholder Text

WorldCat

35.

Wu

W

,

Ji

P

,

Zhao

F

.

CircAtlas: an integrated resource of one million highly accurate circular RNAs from 1070 vertebrate transcriptomes

.

Genome Biol

2020

;

21

(

1

):

1

–

14

.

Google Scholar

OpenURL Placeholder Text

WorldCat

36.

Yao

D

,

Zhang

L

,

Zheng

M

, et al.

Circ2Disease: a manually curated database of experimentally validated circRNAs in human disease

.

Sci Rep

2018

;

8

(

1

):

1

–

6

.

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

37.

Zhao

Z

,

Wang

K

,

Wu

F

, et al.

circRNA disease: a manually curated database of experimentally supported circRNA-disease associations

.

Cell Death Dis

2018

;

9

(

5

):

1

–

2

.

38.

Wang

L

,

You

Z-H

,

Zhou

X

, et al.

NMFCDA: Combining randomization-based neural network with non-negative matrix factorization for predicting CircRNA-disease association

.

Appl Soft Comput

2021

;

110

:

107629

.

Google Scholar

Crossref

WorldCat

39.

Wang

L

,

Yan

X

,

You

Z-H

, et al.

SGANRDA: semi-supervised generative adversarial networks for predicting circRNA–disease associations

.

Brief Bioinform

2021

;

22

(

5

):bbab028.

Google Scholar

OpenURL Placeholder Text

WorldCat

40.

Wei

H

,

Liu

B

.

iCircDA-MF: identification of circRNA-disease associations based on matrix factorization

.

Brief Bioinform

2020

;

21

(

4

):

1356

–

67

.

41.

Wang

L

,

You

Z-H

,

Li

Y-M

, et al.

GCNCDA: a new method for predicting circRNA-disease associations based on graph convolutional network algorithm

.

PLoS Comput Biol

2020

;

16

(

5

):e1007568.

Google Scholar

OpenURL Placeholder Text

WorldCat

42.

Lei

X

,

Fang

Z

,

Chen

L

, et al.

PWCDA: path weighted method for predicting circRNA-disease associations

.

Int J Mol Sci

2018

;

19

(

11

):

3410

.

Google Scholar

Crossref

WorldCat

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Download all slides

Month:	Total Views:
November 2022	115
December 2022	45
January 2023	41
February 2023	27
March 2023	38
April 2023	27
May 2023	10
June 2023	21
July 2023	13
August 2023	10
September 2023	9
October 2023	19
November 2023	20
December 2023	13
January 2024	63
February 2024	32
March 2024	41
April 2024	25
May 2024	27
June 2024	36
July 2024	51
August 2024	21
September 2024	34
October 2024	46
November 2024	61
December 2024	34
January 2025	34
February 2025	41
March 2025	46
April 2025	51

Article Contents

MNMDCDA: prediction of circRNA–disease associations by learning mixed neighborhood information from multiple distances

Abstract

Introduction

Materials

Gold standard dataset

Disease feature construction

Gaussian interaction profile kernel-based disease similarity

Medical subject heading-based disease semantic similarity

Disease Ontology-based disease semantic similarity

Cosine similarity of disease

CircRNA feature construction

GIPK-based circRNA similarity

CircRNA functional similarity

Cosine similarity of circRNA

Multi-similarity matrix fusion

Feature embedding of high-order GCNs

Deep neural network

Experimental results

Evaluation indicators

Evaluate model performance

Comparison with cosine similarity model

Comparison with DO-based disease semantic similarity model

Comparison of various classifier models

Performance on independent dataset

Comparison with other existing methods

Case studies

Conclusion

Data Availability

Acknowledgements

Funding

Author Biographies

References

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only