-
PDF
- Split View
-
Views
-
Cite
Cite
Lihong Peng, Liangliang Huang, Qiongli Su, Geng Tian, Min Chen, Guosheng Han, LDA-VGHB: identifying potential lncRNA–disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine, Briefings in Bioinformatics, Volume 25, Issue 1, January 2024, bbad466, https://doi.org/10.1093/bib/bbad466
- Share Icon Share
Abstract
Long noncoding RNAs (lncRNAs) participate in various biological processes and have close linkages with diseases. In vivo and in vitro experiments have validated many associations between lncRNAs and diseases. However, biological experiments are time-consuming and expensive. Here, we introduce LDA-VGHB, an lncRNA–disease association (LDA) identification framework, by incorporating feature extraction based on singular value decomposition and variational graph autoencoder and LDA classification based on heterogeneous Newton boosting machine. LDA-VGHB was compared with four classical LDA prediction methods (i.e. SDLDA, LDNFSGB, IPCARF and LDASR) and four popular boosting models (XGBoost, AdaBoost, CatBoost and LightGBM) under 5-fold cross-validations on lncRNAs, diseases, lncRNA–disease pairs and independent lncRNAs and independent diseases, respectively. It greatly outperformed the other methods with its prominent performance under four different cross-validations on the lncRNADisease and MNDR databases. We further investigated potential lncRNAs for lung cancer, breast cancer, colorectal cancer and kidney neoplasms and inferred the top 20 lncRNAs associated with them among all their unobserved lncRNAs. The results showed that most of the predicted top 20 lncRNAs have been verified by biomedical experiments provided by the Lnc2Cancer 3.0, lncRNADisease v2.0 and RNADisease databases as well as publications. We found that HAR1A, KCNQ1DN, ZFAT-AS1 and HAR1B could associate with lung cancer, breast cancer, colorectal cancer and kidney neoplasms, respectively. The results need further biological experimental validation. We foresee that LDA-VGHB was capable of identifying possible lncRNAs for complex diseases. LDA-VGHB is publicly available at https://github.com/plhhnu/LDA-VGHB.
INTRODUCTION
Long noncoding RNAs (lncRNAs) with more than 200 nucleotides are a key class of genes involved in various biological functions [1]. lncRNAs participate in multiple biological processes including gene transcription and expression, chromatin remodeling, transcriptional and post-transcriptional regulation [2]. Diseases (i.e. immune responses and cancers) may produce when lncRNAs fail to regulate the biological processes. That is, lncRNAs have close linkages with tumorigenesis, progression and drug resistance [3, 4]. Thus, they are a class of potential diagnostic and prognostic biomarkers of complex disease [5–10]. For example, lncRNA MALAT1 can sponge miR-106b-5p to induce the progression of colorectal cancers [11]. The oncogenic affect of lncRNA H19 can be inhibited through the under-regulation of renal carcinoma cells [12]. The lack of FARSA-AS1 can hinder tumor growth and metastasis [13]. The overexpressions of MNX1-AS1 and MALAT1 demonstrate high sensitivity and specificity in multiple tumor tissues [14–16]. MEG3 rs3087918 has been used to reduce the risk of breast cancer [17]. CRNDE promoted the proliferation and metastasis of hepatocellular carcinoma [18]. WWC2-AS1 was highly expressed in radiation-induced intestinal fibrosis [19]. In summary, there are complex associations between lncRNAs and diseases.
With the rapid advance of RNA sequencing technologies, many platforms provide massive RNA-relevant data resources, which significantly improved various association prediction for human cancers [20–24]. However, experimental techniques are high-cost, time-consuming and laborious [25–28]. Notably, LncRNADisease2.0 [29], Lnc2Cancer [30], NRED [31] and MNDR v2.0 [32] provide numerous lncRNA–disease association (LDA) information. Based on these databases, substantial computational methods have been developed. These methods include network-based methods and machine-learning-based methods [25, 33].
Network-based LDA prediction methods first construct a heterogeneous network, and then infer potential LDAs through random walk, label propagation or matrix decomposition. Chen et al. conducted a series of works for LDA prediction based on various biological information [34–37], for example, lncRNA expression profile-based method [34], lncRNA similarity and disease similarity-based method [35], KATZ [36] and micro RNA information-based method [37]. Xie et al. [38–41] presented several LDA prediction methods, HAUBRW [38], LDA-LNSUBRW [39], RWSF-BLP [40] and SSMF-BLNP [41]. HAUBRW [38] incorporated heat spread, probability diffusion and unbalanced bi-random walk. LDA-LNSUBRW [39] combined linear neighborhood similarity and unbalanced bi-random walk. RWSF-BLP [40] used random walk-based multi-similarity fusion with bidirectional label propagation. SSMF-BLNP [41] integrated selective similarity matrix fusion and bidirectional linear neighborhood label propagation. In addition, several network-based methods have been developed to identify potential LDAs. These methods include multi-layer network model (MHRWR) [42], Laplace normalized random walk with restart (LRWRHLDA) [43], weighted graph regularized collaborative matrix factorization (WGRCMF) [44], collaborative matrix factorization with the maximized correntropy (LDCMFC) [45], dual sparse collaborative matrix factorization (WGRCMF) [44] and graph regularized nonnegative matrix factorization (LDGRNMF) [46]. Based on existing studies, Chen et al. [25, 26] summarized LDA identification algorithms and lncRNA function prediction models. Heterogeneous network-based methods can fuse diverse multi-relational data and encode various inter- and intra-relations between lncRNAs and diseases, and thus have obtained an increasing attention in LDA prediction [47–49]. However, network-based methods rely heavily on heterogeneous LDA network and fail to find potential associations for an orphan lncRNA or disease.
Machine learning techniques especially deep learning have obtained wide applications in bioinformatics due to their better classification performance [50–55]. For LDA prediction, the type of methods first extract the features of lncRNAs and diseases, and design machine learning models to find possible LDAs [56–58]. These models include random forest regression [59], bidirectional generative adversarial network (BiGAN) [60], graph convolutional matrix completion (GCRFLDA) [2], graph autoencoder and random forest (GAERF) [61], graph attention network (GANLDA) [62], graph convolution network with conditional random field [63], combination of deep learning and positive-unlabeled learning [64] and heterogeneous graph attention network with meta-paths [65]. Machine learning-based methods efficiently improve LDA prediction; however, they are susceptible to noisy and irrelevant data. In addition, they need to extract the optimal features from biological information and topological structures of lncRNAs and diseases.
To improve the LDA prediction accuracy and identify potential associations for an orphan lncRNA or disease, in this manuscript, we developed LDA-VGHB, a novel method for identifying possible LDA by incorporating LDA feature extraction based on singular value decomposition (SVD) and variational graph auto-encoder (VGAE) and LDA classification based on heterogeneous Newton boosting machine. The LDA-VGHB performance has been validated under 5-fold cross-validations (CVs) on lncRNAs, diseases, lncRNA–disease pairs and independent lncRNAs and independent diseases. LDA-VGHB was able to accurately predict potential linkages between lncRNAs and diseases on the lncRNADisease and MNDR databases.
MATERIAL AND METHODS
Data preparation
Two human LDA datasets were collected [64]. The two datasets are from the lncRNADisease database [66] and the MNDR database [32], respectively. After excluding diseases with irregular names or without MESH information or lncRNAs without sequence information in each dataset, we obtained the two preprocessed LDA datasets. The detailed information about the datasets is shown in Table 1.
Dataset . | lncRNAs . | Diseases . | LDAs . |
---|---|---|---|
lncRNADisease | 82 | 157 | 605 |
MNDR | 89 | 190 | 1529 |
Dataset . | lncRNAs . | Diseases . | LDAs . |
---|---|---|---|
lncRNADisease | 82 | 157 | 605 |
MNDR | 89 | 190 | 1529 |
Dataset . | lncRNAs . | Diseases . | LDAs . |
---|---|---|---|
lncRNADisease | 82 | 157 | 605 |
MNDR | 89 | 190 | 1529 |
Dataset . | lncRNAs . | Diseases . | LDAs . |
---|---|---|---|
lncRNADisease | 82 | 157 | 605 |
MNDR | 89 | 190 | 1529 |
Consequently, an LDA network with |$n$| lncRNAs and |$m$| diseases is represented as |$\boldsymbol{Y} \in{\Re ^{n \times m}}$|, where each element |$y_{i j}$| is defined by Eq. (1):
Based on the two datasets, we proposed a novel computational framework LDA-VGHB for predicting possible LDAs. As shown in Figure 1, first, lncRNA features and disease features are extracted by integrating lncRNA and disease similarity computation, linear feature extraction based on SVD and nonlinear feature extraction based on VGAE. Subsequently, unknown lncRNA–disease pairs are classified through a heterogeneous Newton boosting machine.

The pipeline for LDA prediction with SVD, VGAE and heterogeneous Newton boosting machine (LDA-VGHB). (i) Feature extraction. Features of lncRNAs and diseases are extracted by incorporating similarity computation, linear feature extraction based on SVD and nonlinear feature extraction based on VGAE. (ii) LDA classification. A heterogeneous Newton boosting machine is designed to classify unobserved LDAs.
Similarity computation
To measure disease similarity, we first calculate their semantic similarity matrix |$\boldsymbol{S}_d^{sem}$| based on their MeSH descriptors using the IDSSIM model [67]. Since several diseases are lack of directed acyclic graph in the MeSH database, we are unable to compute their semantic similarity. Thus, we utilize Gaussian association profile (GAP) kernel similarity [68] as a complement to disease semantic similarity and further measure their similarity. For two diseases |$d_i$| and |$d_j$|, let |${\boldsymbol{Y}}_{.i}$| and |${\boldsymbol{Y}}_{.j}$| be their GAPs, their GAP kernel similarity is defined by Eq. (2):
where |${\boldsymbol{Y}}_{.i}$| and |${\boldsymbol{Y}}_{.j}$| denote the |$i$|-th and |$j$|-th columns of |$\boldsymbol{Y}$|, respectively.
Semantic similarity and GAP kernel similarity measure disease similarity from biological significance and topological structures, respectively. Subsequently, disease similarity matrix |$\boldsymbol{S}_d$| is constructed by combining the two types of similarities by (3):
where |$\alpha $| is a weight parameter.
Similarly, lncRNA functional similarity |$\boldsymbol{S}_l^{fun}$| is computed based on disease semantic similarities according to the IDSSIM model [67]. lncRNA GAP kernel similarity matrix |$\boldsymbol{G}_l$| is computed by Eq. (4):
where |${\boldsymbol{Y}}_{i.}$| and |${\boldsymbol{Y}}_{j.}$| denote the |$i$|-th and |$j$|-th rows of |$\boldsymbol{Y}$|, respectively.
Consequently, lncRNA similarity matrix |$\boldsymbol{S}_l$| is constructed by Eq. (5):
Feature Extraction
Linear feature extraction
The SVD technique is a generalization of the eigen decomposition [69] and has been widely applied to feature extraction. By eigen decomposition, SVD decomposes a rectangular matrix into two orthogonal matrices and one diagonal matrix. In this study, we use SVD to extract linear features for diseases and lncRNAs. First, the LDA matrix |$\boldsymbol{Y} \in{\Re ^{n \times m}}$| is factorized into three matrices by Eq. (6):
where |$\boldsymbol{U}\in{R}^{{n\times n}}$| and |$\boldsymbol{V}\in{R}^{{m\times m}}$| are two real matrices, |$\boldsymbol{V}^{T}$| denotes the transpose of |$\boldsymbol{V}$| and |$\Sigma{\in }\boldsymbol{Y}^{{n\times m}}$| is a diagonal matrix where the |$i$|-th element |$\sigma _{i}$| denotes the |$i$|-th singular value of |$\boldsymbol{Y}$| and |$\sigma _{1}\geq \sigma _{2}\geq \cdots \geq \sigma _{n}\geq 0$|.
Next, the |$k$| largest singular values are used to construct an approximation representation by Eq. (7);
Consequently, |$\boldsymbol{U}_{i}$| and |$\boldsymbol{V}_{j}^{T}$| are applied to characterize linear features of |$l_i$| and |$d_j$|, respectively.
Nonlinear feature extraction
Variational graph autoencoder [70] efficiently combines graph convolutional network (GCN) and autoencoder. It fully utilizes latent variables of variable autoencoder and interpretable latent representation ability of GCN. Thus, it is widely applied to graph-structured data by incorporating graph structure and data distribution [2]. In this section, we use VGAE to extract nonlinear features for diseases and lncRNAs.
GCN [71] implements convolutional operations based on graph structures with non-Euclidean data. It can better extract node features by incorporating neighboring nodes’ characteristics and graph structures. It mainly comprises two categories based on different localized convolutional filter ways: spatial-based methods and spectral-based methods. In comparison with spatial-based methods, spectral-based methods obtain better performance via the spectrum of graph Laplacian [71, 72]. Thus, we use spectral-based methods [73] to extract features for lncRNAs and diseases from their similarity networks.
Let similarity matrix |${\boldsymbol{S}}_l$| denote the adjacency matrix of |$n$| lncRNAs. The initial scalar features of each lncRNA are represented through one corresponding row of the LDA matrix |$\boldsymbol{Y}$|. Consequently, we obtain the initial scalar feature matrix |$\boldsymbol{{X}}_{l}^{(0)}$| of |$n$| lncRNAs. Taken the lncRNA similarity matrix |${\boldsymbol{S}}_l$| and initial scalar feature matrix |$\boldsymbol{{X}}_{l}^{(0)}$| as inputs, at the |${t}$|-th layer, GCN transforms the graph signal |$\boldsymbol{X}_{l}^{(t)}$| into a new signal |$\boldsymbol{X}_{l}^{(t+1)}$| for all lncRNAs by Eq. (8):
Here, |$ReLU(\cdot )=max(0,\cdot )$| is a nonlinear activation function, |$\tilde{\boldsymbol{S}}_l=\boldsymbol{S}_l+\boldsymbol{I}_N$| is an adjacency matrix corresponding to |$\boldsymbol{S}_l$| with all diagonal element value of 1, i.e. an undirected graph corresponding to |$\boldsymbol{S}_l$| with added self-loop and |$\boldsymbol{I}_N$| is an identity matrix. |$\left [\tilde{\boldsymbol{A}}_{l}\right ]_{i i}=\sum _{j}{[{\tilde{\boldsymbol{S}}_l}]_{ij}}$|, and |$\beta _{l}^{(t)}$| denotes the parameters in the |$t$|-th layer of GCN. |$\boldsymbol{{X}}_{l}^{(t)}\in \Re ^{n\times d}$| denotes the matrix of activations at the |$t$|-th layer with |$\boldsymbol{{X}}_{l}^{(0)}=\boldsymbol{Y}$|.
In encoder, VGAE takes |$\boldsymbol{S}_l$| and |$\boldsymbol{X}_l$| as input and outputs a latent variable by two-layer GCN. The first layer is used to generate a low-dimensional feature matrix |$\tilde{\boldsymbol{X}}_l$| by Eq. (9):
where |$\boldsymbol{Q}=\tilde{\boldsymbol{A}}_{l}^{-\frac{1}{2}} \tilde{\boldsymbol{S}}_l \tilde{\boldsymbol{A}}_{l}^{-\frac{1}{2}}$|, and |$\boldsymbol{W}_{0}$| denotes the parameters in the first GCN layer.
The second layer is used to generate the data distribution by Eq. (10):
where |$\mu $| and |$\sigma $| denote the mean and variance of the node vector representation, and |$\boldsymbol{W}_{\mu }$| and |$\boldsymbol{W}_{\sigma }$| denote the corresponding parameters.
Consequently, suppose that |$\varepsilon $| follows the standard normal distribution |$N$|(0, 1), the latent variable |$\boldsymbol{Z}_l$| is obtained by Eq. (11):
In decoder, VGAE reconstructs adjacency matrix |$\boldsymbol{\hat{S}}_{l}$| by the sigmoid function based on latent variable |$\boldsymbol{Z}_l$| by Eq. (12):
During the learning, we define the following loss function by Eq. (13):
where the first term denotes the binary cross-entropy between |${\boldsymbol{S}}_{l}$| and |$\hat{\boldsymbol{S}}_{l}$|, the second term denotes the Kullback–Leibler divergence between posterior probability distribution |$q(\boldsymbol{Z}_l|{\boldsymbol{X}}_l, {\boldsymbol{S}}_l)$| and standard Gaussian distribution |$p(\boldsymbol{Z}_l)$| and |$p(\boldsymbol{S}_l|\boldsymbol{Z}_l)]$| denotes the probability between two nodes computed by the embedded vectors in the graph. Finally, the obtained lncRNA latent variable matrix |$\boldsymbol{Z}_l$| is used to represent their nonlinear features.
Similarly, we characterize the initial scalar features of each disease as one corresponding column of the LDA matrix |$Y$| and obtain the initial scalar feature matrix |$\boldsymbol{X}_d^{(0)}$| of |$m$| diseases. Taken the disease similarity |${\boldsymbol{S}}_d$| and initial scalar feature matrix |$\boldsymbol{{X}}_{d}^{(0)}$| as inputs, the disease nonlinear feature matrix |$\boldsymbol{Z}_d$| is computed through VGAE.
Feature Integration
The linear and nonlinear features of each lncRNA are concatenated as a |$a$|-dimensional vector, and the linear and nonlinear features of each disease are concatenated as a |$b$|-dimensional vector. Finally, an lncRNA–disease pair is represented as a |$k(k=a+b)$|-dimensional vector.
LDA prediction
For a given LDA dataset |$D = (\hat{\boldsymbol{X}}, {\hat{\boldsymbol{Y}}})$| with |$p\,\, (p=n \times m)$| samples (i.e. lncRNA–disease pairs), let |$\hat{\boldsymbol{x}}_i\in \hat{\boldsymbol{X}}$| denote the |$i$|-th training sample with |$k$|-dimensional features, and |$\hat{\boldsymbol{y}}_i\in \hat{\boldsymbol{Y}}$| denote its label. |$\hat{\boldsymbol{y}}_i=1$| if the |$i$|-th lncRNA–disease pair is associated, otherwise |$\hat{\boldsymbol{y}}_i=0$|. Inspired by heterogeneous Newton boosting machine [20, 74], we developed a heterogeneous Newton boosting machine-based LDA prediction model. Subsequently, we build an objective function by Eq. (14):
where |${\hat{\boldsymbol{y}}}_i$| and |$f(\hat{\boldsymbol{x}}_i)$| indicate the true label and the predicted label of |$\boldsymbol{x}_i$|, respectively. And loss function |$l({\hat{\boldsymbol{y}}}_i,f(\hat{ \boldsymbol{x}}_i))$| is twice differentiable related to |$f(\hat{ \boldsymbol{x}}_i)$|, |$l^{^{\prime}}({\hat{\boldsymbol{y}}}_i,f(\hat{ \boldsymbol{x}}_i))$| and |$l^{^{\prime\prime}}({\hat{\boldsymbol{y}}}_i,f(\hat{ \boldsymbol{x}}_i))$| represent its first and second derivatives, respectively.
At each boosting iteration, let |$\mathcal{H}^{(c)}$| represent the |$c$|-th subclass from |$C$| distinct subclasses defined by Eq. (15):
where |$\overline{\mathcal{H}}^{(c)}$| indicates a finite class with respect to |$b(\hat{ \boldsymbol{x}}_i)$|: |$\mathbb{R}^{d} \to \mathbb{R}$| satisfying |${\sum \nolimits _{i = 1}^n {b({\hat{ \boldsymbol{x}}_i})} ^2} = 1$|.
For the domain |$\mathcal{F}$| defined by Eq. (16):
one subclass is randomly selected to construct multiple binary decision trees. Let |$d_{min}$| and |$d_{max}$| denote the minimum and maximum depths among these decision trees, we randomly and uniformly set the maximum depth of each tree to a value between |$d_{min}$| and |$d_{max}$|. Consequently, we obtain |$C=R_d+1$| (|${R_d} = d_{max} - d_{min} + 1$|) unique choices for the subclass. And the probability mass function |$\Phi $| is represented by Eq. (17):
At the |$k$|-th iteration, assume that |$O_{k}$| (|$O_k=1,2,...,C$|) denote one index with respect to the sampled subclass, the base assumption is built by Eq. (18):
where |$g_i=l^{^{\prime}}({\hat{\boldsymbol{y}}}_i,f_{k-1}(\hat{ \boldsymbol{x}}_i))$| and |$h_i=l^{^{\prime\prime}}({\hat{\boldsymbol{y}}}_i,f_{k-1}(\hat{ \boldsymbol{x}}_i)) $|.
Lastly, for the |$i$|-th lncRNA–disease pair |$\hat{ \boldsymbol{x}}_i$|, its interaction probability |$f_k(\hat{ \boldsymbol{x}}_i)$| is computed by iterating updating model (19) with a learning rate |$\beta> 0$|:
RESULTS
Evaluation metrics and experimental setup
Precision, recall, accuracy, F1-score, area under the ROC curve (AUC) and area under the precision-recall curve (AUPR) [52] were used to evaluate the performance of LDA-VGHB with the other four classical LDA prediction models (i.e. SDLDA, LDNFSGB, IPCARF and LDASR) and the other four boosting algorithms (i.e. XGBoost, AdaBoost, CatBoost and LightGBM). Four different 5-fold CVs [75] were repeatedly conducted for 20 times:
5-fold CV on lncRNAs (|$CV_l$|): random rows in an LDA matrix |$\boldsymbol{Y}$| were masked for testing, i.e. 80% of lncRNAs were randomly selected as train set and the remaining was used as test set in each round.
5-fold CV on diseases (|$CV_d$|): random columns in an LDA matrix |$\boldsymbol{Y}$| were masked for testing, i.e. 80% of diseases were randomly selected for train set and the remaining was used as test set in each round.
5-fold CV on lncRNA–disease pairs (|$CV_{ld}$|): random lncRNA–disease pairs in an LDA matrix |$\boldsymbol{Y}$| were masked for testing, i.e. 80% of lncRNA–disease pairs were randomly selected as train set and the remaining was used as test set in each round.
5-fold CV on independent lncRNAs and independent diseases (|$CV_{ind}$|): First, 20% of lncRNAs and 20% of diseases were randomly selected to construct a ‘node test set’. Next, the remaining lncRNAs and diseases were taken as a ‘node train set’. Third, all edges linking a node in the ‘node train set’ with a node in the ‘node test set’ were removed. Finally, one learner was trained only on the ‘node train set’ to find potential LDAs within the ‘node test set’.
The above four CVs refer to association identification for (1) new lncRNAs without any associated disease, (2) new diseases without any associated lncRNA, (3) new lncRNA–disease pairs and (4) new independent lncRNAs and independent diseases, respectively. The average result on the 20 times is used as the final performance.
Baseline methods
LDA prediction models: SDLDA [76] extracts linear and nonlinear features for lncRNAs and diseases by combining SVD and deep learning and then uses a full connection layer with the sigmoid function to classify unknown lncRNA–disease pairs. LDNFSGB [77] first extracts the global and local features for lncRNAs and diseases, and uses autoencoder to reduce the feature dimensions, and finally implements LDA prediction through the gradient boosting algorithm. IPCARF [78] presents an incremental principal component analysis method to select LDA features and uses a random forest to predict potential LDAs. LDASR [79] employs autoencoder to obtain the optimal lncRNA and disease features and uses rotating forest to predict new LDAs. SDLDA [76], LDNFSGB [77], IPCARF [78] and LDASR [79] are state-of-the-art LDA prediction methods.
Boosting algorithms: XGBoost [20, 80] is an Extreme Gradient Boosting model. AdaBoost [81] manifests good generalization ability and low computational complexity. CatBoost [82] is known as categorical boosting algorithm. LightGBM [23, 83] integrates one-side sampling as well as exclusive feature bundling over gradient boosting decision trees. XGBoost [80], AdaBoost [81], CatBoost [82] and LightGBM [83] are powerful boosting models and achieve good predictions in diverse practical tasks.
Performance comparison
To evaluate the LDA-VGHB performance, we compared it with the other four classical LDA prediction methods (i.e. SDLDA [76], LDNFSGB [77], IPCARF [78] and LDASR [79]) and four popular boosting models (i.e. XGBoost [80], AdaBoost [81], CatBoost [82] and LightGBM [83]). We randomly selected negative LDAs with the same number as one of known positive LDAs from unlabeled lncRNA–disease pairs. Tables 2–5 show the performance of LDA-VGHB, SDLDA, LDNFSGB, LDASR and IPCAF on the lncRNADisease and MNDR databases under four different 5-fold CVs. Figure 2 depicts their receiver operating characteristic (ROC) and precision-recall (PR) curves under the four 5-fold CVs. In addition, Tables S1–S4 in Supplementary Materials give the results under the four different 10-fold CVs.
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8514|$\pm $|0.0509 | 0.7004|$\pm $|0.0639 | 0.4878|$\pm $|0.1309 | 0.6726|$\pm $|0.1200 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9399|$\pm $|0.0154 | 0.8552|$\pm $|0.0393 | 0.6615|$\pm $|0.0966 | 0.8405|$\pm $|0.0300 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.6521|$\pm $|0.0732 | 0.6092|$\pm $|0.0790 | 0.5721|$\pm $|0.1580 | 0.5129|$\pm $|0.0946 | 0.7180|$\pm $|0.0713 |
MNDR | 0.8239|$\pm $|0.0437 | 0.8021|$\pm $|0.0498 | 0.6434|$\pm $|0.1545 | 0.7358|$\pm $|0.0562 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.7799|$\pm $|0.0341 | 0.6769|$\pm $|0.0423 | 0.4906|$\pm $|0.0951 | 0.6417|$\pm $|0.0597 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8857|$\pm $|0.0283 | 0.8323|$\pm $|0.0230 | 0.6526|$\pm $|0.0775 | 0.7972|$\pm $|0.0268 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7365|$\pm $|0.0563 | 0.6462|$\pm $|0.0451 | 0.5125|$\pm $|0.1100 | 0.5668|$\pm $|0.0536 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8775|$\pm $|0.0278 | 0.8260|$\pm $|0.0230 | 0.6401|$\pm $|0.1017 | 0.7827|$\pm $|0.0260 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8023|$\pm $|0.0477 | 0.7346|$\pm $|0.0465 | 0.5096|$\pm $|0.1432 | 0.7057|$\pm $|0.0420 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9366|$\pm $|0.0195 | 0.8839|$\pm $|0.0270 | 0.7104|$\pm $|0.0997 | 0.8641|$\pm $|0.0256 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8461|$\pm $|0.0553 | 0.7239|$\pm $|0.0626 | 0.5336|$\pm $|0.1423 | 0.6775|$\pm $|0.0971 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9533|$\pm $|0.0129 | 0.8832|$\pm $|0.0307 | 0.7128|$\pm $|0.1012 | 0.8671|$\pm $|0.0252 | 0.9617|$\pm $|0.0131 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8514|$\pm $|0.0509 | 0.7004|$\pm $|0.0639 | 0.4878|$\pm $|0.1309 | 0.6726|$\pm $|0.1200 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9399|$\pm $|0.0154 | 0.8552|$\pm $|0.0393 | 0.6615|$\pm $|0.0966 | 0.8405|$\pm $|0.0300 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.6521|$\pm $|0.0732 | 0.6092|$\pm $|0.0790 | 0.5721|$\pm $|0.1580 | 0.5129|$\pm $|0.0946 | 0.7180|$\pm $|0.0713 |
MNDR | 0.8239|$\pm $|0.0437 | 0.8021|$\pm $|0.0498 | 0.6434|$\pm $|0.1545 | 0.7358|$\pm $|0.0562 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.7799|$\pm $|0.0341 | 0.6769|$\pm $|0.0423 | 0.4906|$\pm $|0.0951 | 0.6417|$\pm $|0.0597 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8857|$\pm $|0.0283 | 0.8323|$\pm $|0.0230 | 0.6526|$\pm $|0.0775 | 0.7972|$\pm $|0.0268 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7365|$\pm $|0.0563 | 0.6462|$\pm $|0.0451 | 0.5125|$\pm $|0.1100 | 0.5668|$\pm $|0.0536 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8775|$\pm $|0.0278 | 0.8260|$\pm $|0.0230 | 0.6401|$\pm $|0.1017 | 0.7827|$\pm $|0.0260 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8023|$\pm $|0.0477 | 0.7346|$\pm $|0.0465 | 0.5096|$\pm $|0.1432 | 0.7057|$\pm $|0.0420 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9366|$\pm $|0.0195 | 0.8839|$\pm $|0.0270 | 0.7104|$\pm $|0.0997 | 0.8641|$\pm $|0.0256 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8461|$\pm $|0.0553 | 0.7239|$\pm $|0.0626 | 0.5336|$\pm $|0.1423 | 0.6775|$\pm $|0.0971 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9533|$\pm $|0.0129 | 0.8832|$\pm $|0.0307 | 0.7128|$\pm $|0.1012 | 0.8671|$\pm $|0.0252 | 0.9617|$\pm $|0.0131 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8514|$\pm $|0.0509 | 0.7004|$\pm $|0.0639 | 0.4878|$\pm $|0.1309 | 0.6726|$\pm $|0.1200 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9399|$\pm $|0.0154 | 0.8552|$\pm $|0.0393 | 0.6615|$\pm $|0.0966 | 0.8405|$\pm $|0.0300 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.6521|$\pm $|0.0732 | 0.6092|$\pm $|0.0790 | 0.5721|$\pm $|0.1580 | 0.5129|$\pm $|0.0946 | 0.7180|$\pm $|0.0713 |
MNDR | 0.8239|$\pm $|0.0437 | 0.8021|$\pm $|0.0498 | 0.6434|$\pm $|0.1545 | 0.7358|$\pm $|0.0562 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.7799|$\pm $|0.0341 | 0.6769|$\pm $|0.0423 | 0.4906|$\pm $|0.0951 | 0.6417|$\pm $|0.0597 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8857|$\pm $|0.0283 | 0.8323|$\pm $|0.0230 | 0.6526|$\pm $|0.0775 | 0.7972|$\pm $|0.0268 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7365|$\pm $|0.0563 | 0.6462|$\pm $|0.0451 | 0.5125|$\pm $|0.1100 | 0.5668|$\pm $|0.0536 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8775|$\pm $|0.0278 | 0.8260|$\pm $|0.0230 | 0.6401|$\pm $|0.1017 | 0.7827|$\pm $|0.0260 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8023|$\pm $|0.0477 | 0.7346|$\pm $|0.0465 | 0.5096|$\pm $|0.1432 | 0.7057|$\pm $|0.0420 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9366|$\pm $|0.0195 | 0.8839|$\pm $|0.0270 | 0.7104|$\pm $|0.0997 | 0.8641|$\pm $|0.0256 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8461|$\pm $|0.0553 | 0.7239|$\pm $|0.0626 | 0.5336|$\pm $|0.1423 | 0.6775|$\pm $|0.0971 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9533|$\pm $|0.0129 | 0.8832|$\pm $|0.0307 | 0.7128|$\pm $|0.1012 | 0.8671|$\pm $|0.0252 | 0.9617|$\pm $|0.0131 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8514|$\pm $|0.0509 | 0.7004|$\pm $|0.0639 | 0.4878|$\pm $|0.1309 | 0.6726|$\pm $|0.1200 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9399|$\pm $|0.0154 | 0.8552|$\pm $|0.0393 | 0.6615|$\pm $|0.0966 | 0.8405|$\pm $|0.0300 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.6521|$\pm $|0.0732 | 0.6092|$\pm $|0.0790 | 0.5721|$\pm $|0.1580 | 0.5129|$\pm $|0.0946 | 0.7180|$\pm $|0.0713 |
MNDR | 0.8239|$\pm $|0.0437 | 0.8021|$\pm $|0.0498 | 0.6434|$\pm $|0.1545 | 0.7358|$\pm $|0.0562 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.7799|$\pm $|0.0341 | 0.6769|$\pm $|0.0423 | 0.4906|$\pm $|0.0951 | 0.6417|$\pm $|0.0597 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8857|$\pm $|0.0283 | 0.8323|$\pm $|0.0230 | 0.6526|$\pm $|0.0775 | 0.7972|$\pm $|0.0268 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7365|$\pm $|0.0563 | 0.6462|$\pm $|0.0451 | 0.5125|$\pm $|0.1100 | 0.5668|$\pm $|0.0536 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8775|$\pm $|0.0278 | 0.8260|$\pm $|0.0230 | 0.6401|$\pm $|0.1017 | 0.7827|$\pm $|0.0260 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8023|$\pm $|0.0477 | 0.7346|$\pm $|0.0465 | 0.5096|$\pm $|0.1432 | 0.7057|$\pm $|0.0420 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9366|$\pm $|0.0195 | 0.8839|$\pm $|0.0270 | 0.7104|$\pm $|0.0997 | 0.8641|$\pm $|0.0256 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8461|$\pm $|0.0553 | 0.7239|$\pm $|0.0626 | 0.5336|$\pm $|0.1423 | 0.6775|$\pm $|0.0971 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9533|$\pm $|0.0129 | 0.8832|$\pm $|0.0307 | 0.7128|$\pm $|0.1012 | 0.8671|$\pm $|0.0252 | 0.9617|$\pm $|0.0131 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_d$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8854|$\pm $|0.0377 | 0.7548|$\pm $|0.0639 | 0.5583|$\pm $|0.0910 | 0.7462|$\pm $|0.0613 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9232|$\pm $|0.0331 | 0.8005|$\pm $|0.0625 | 0.5557|$\pm $|0.1473 | 0.7625|$\pm $|0.0749 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.7182|$\pm $|0.0694 | 0.7309|$\pm $|0.0646 | 0.7538|$\pm $|0.1067 | 0.6431|$\pm $|0.0757 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8579|$\pm $|0.0655 | 0.6936|$\pm $|0.0794 | 0.5279|$\pm $|0.1969 | 0.5758|$\pm $|0.0894 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8187|$\pm $|0.0282 | 0.7552|$\pm $|0.0291 | 0.5766|$\pm $|0.0740 | 0.7165|$\pm $|0.0339 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9043|$\pm $|0.0174 | 0.7670|$\pm $|0.0432 | 0.5593|$\pm $|0.1159 | 0.7010|$\pm $|0.0463 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.7917|$\pm $|0.0519 | 0.7407|$\pm $|0.0526 | 0.6339|$\pm $|0.0715 | 0.6873|$\pm $|0.0512 | 0.8651|$\pm $|0.0304 |
MNDR | 0.8886|$\pm $|0.0475 | 0.7402|$\pm $|0.0577 | 0.5190|$\pm $|0.1434 | 0.6485|$\pm $|0.0555 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.8788|$\pm $|0.0274 | 0.8329|$\pm $|0.0273 | 0.6402|$\pm $|0.1004 | 0.7951|$\pm $|0.0317 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9559|$\pm $|0.0160 | 0.8603|$\pm $|0.0363 | 0.5992|$\pm $|0.1601 | 0.8045|$\pm $|0.0362 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.8934|$\pm $|0.0387 | 0.8163|$\pm $|0.0537 | 0.6355|$\pm $|0.1217 | 0.7914|$\pm $|0.0542 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9561|$\pm $|0.0354 | 0.8292|$\pm $|0.0680 | 0.6040|$\pm $|0.1476 | 0.7630|$\pm $|0.0717 | 0.9728|$\pm $|0.0204 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8854|$\pm $|0.0377 | 0.7548|$\pm $|0.0639 | 0.5583|$\pm $|0.0910 | 0.7462|$\pm $|0.0613 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9232|$\pm $|0.0331 | 0.8005|$\pm $|0.0625 | 0.5557|$\pm $|0.1473 | 0.7625|$\pm $|0.0749 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.7182|$\pm $|0.0694 | 0.7309|$\pm $|0.0646 | 0.7538|$\pm $|0.1067 | 0.6431|$\pm $|0.0757 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8579|$\pm $|0.0655 | 0.6936|$\pm $|0.0794 | 0.5279|$\pm $|0.1969 | 0.5758|$\pm $|0.0894 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8187|$\pm $|0.0282 | 0.7552|$\pm $|0.0291 | 0.5766|$\pm $|0.0740 | 0.7165|$\pm $|0.0339 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9043|$\pm $|0.0174 | 0.7670|$\pm $|0.0432 | 0.5593|$\pm $|0.1159 | 0.7010|$\pm $|0.0463 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.7917|$\pm $|0.0519 | 0.7407|$\pm $|0.0526 | 0.6339|$\pm $|0.0715 | 0.6873|$\pm $|0.0512 | 0.8651|$\pm $|0.0304 |
MNDR | 0.8886|$\pm $|0.0475 | 0.7402|$\pm $|0.0577 | 0.5190|$\pm $|0.1434 | 0.6485|$\pm $|0.0555 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.8788|$\pm $|0.0274 | 0.8329|$\pm $|0.0273 | 0.6402|$\pm $|0.1004 | 0.7951|$\pm $|0.0317 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9559|$\pm $|0.0160 | 0.8603|$\pm $|0.0363 | 0.5992|$\pm $|0.1601 | 0.8045|$\pm $|0.0362 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.8934|$\pm $|0.0387 | 0.8163|$\pm $|0.0537 | 0.6355|$\pm $|0.1217 | 0.7914|$\pm $|0.0542 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9561|$\pm $|0.0354 | 0.8292|$\pm $|0.0680 | 0.6040|$\pm $|0.1476 | 0.7630|$\pm $|0.0717 | 0.9728|$\pm $|0.0204 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_d$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8854|$\pm $|0.0377 | 0.7548|$\pm $|0.0639 | 0.5583|$\pm $|0.0910 | 0.7462|$\pm $|0.0613 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9232|$\pm $|0.0331 | 0.8005|$\pm $|0.0625 | 0.5557|$\pm $|0.1473 | 0.7625|$\pm $|0.0749 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.7182|$\pm $|0.0694 | 0.7309|$\pm $|0.0646 | 0.7538|$\pm $|0.1067 | 0.6431|$\pm $|0.0757 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8579|$\pm $|0.0655 | 0.6936|$\pm $|0.0794 | 0.5279|$\pm $|0.1969 | 0.5758|$\pm $|0.0894 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8187|$\pm $|0.0282 | 0.7552|$\pm $|0.0291 | 0.5766|$\pm $|0.0740 | 0.7165|$\pm $|0.0339 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9043|$\pm $|0.0174 | 0.7670|$\pm $|0.0432 | 0.5593|$\pm $|0.1159 | 0.7010|$\pm $|0.0463 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.7917|$\pm $|0.0519 | 0.7407|$\pm $|0.0526 | 0.6339|$\pm $|0.0715 | 0.6873|$\pm $|0.0512 | 0.8651|$\pm $|0.0304 |
MNDR | 0.8886|$\pm $|0.0475 | 0.7402|$\pm $|0.0577 | 0.5190|$\pm $|0.1434 | 0.6485|$\pm $|0.0555 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.8788|$\pm $|0.0274 | 0.8329|$\pm $|0.0273 | 0.6402|$\pm $|0.1004 | 0.7951|$\pm $|0.0317 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9559|$\pm $|0.0160 | 0.8603|$\pm $|0.0363 | 0.5992|$\pm $|0.1601 | 0.8045|$\pm $|0.0362 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.8934|$\pm $|0.0387 | 0.8163|$\pm $|0.0537 | 0.6355|$\pm $|0.1217 | 0.7914|$\pm $|0.0542 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9561|$\pm $|0.0354 | 0.8292|$\pm $|0.0680 | 0.6040|$\pm $|0.1476 | 0.7630|$\pm $|0.0717 | 0.9728|$\pm $|0.0204 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8854|$\pm $|0.0377 | 0.7548|$\pm $|0.0639 | 0.5583|$\pm $|0.0910 | 0.7462|$\pm $|0.0613 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9232|$\pm $|0.0331 | 0.8005|$\pm $|0.0625 | 0.5557|$\pm $|0.1473 | 0.7625|$\pm $|0.0749 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.7182|$\pm $|0.0694 | 0.7309|$\pm $|0.0646 | 0.7538|$\pm $|0.1067 | 0.6431|$\pm $|0.0757 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8579|$\pm $|0.0655 | 0.6936|$\pm $|0.0794 | 0.5279|$\pm $|0.1969 | 0.5758|$\pm $|0.0894 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8187|$\pm $|0.0282 | 0.7552|$\pm $|0.0291 | 0.5766|$\pm $|0.0740 | 0.7165|$\pm $|0.0339 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9043|$\pm $|0.0174 | 0.7670|$\pm $|0.0432 | 0.5593|$\pm $|0.1159 | 0.7010|$\pm $|0.0463 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.7917|$\pm $|0.0519 | 0.7407|$\pm $|0.0526 | 0.6339|$\pm $|0.0715 | 0.6873|$\pm $|0.0512 | 0.8651|$\pm $|0.0304 |
MNDR | 0.8886|$\pm $|0.0475 | 0.7402|$\pm $|0.0577 | 0.5190|$\pm $|0.1434 | 0.6485|$\pm $|0.0555 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.8788|$\pm $|0.0274 | 0.8329|$\pm $|0.0273 | 0.6402|$\pm $|0.1004 | 0.7951|$\pm $|0.0317 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9559|$\pm $|0.0160 | 0.8603|$\pm $|0.0363 | 0.5992|$\pm $|0.1601 | 0.8045|$\pm $|0.0362 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.8934|$\pm $|0.0387 | 0.8163|$\pm $|0.0537 | 0.6355|$\pm $|0.1217 | 0.7914|$\pm $|0.0542 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9561|$\pm $|0.0354 | 0.8292|$\pm $|0.0680 | 0.6040|$\pm $|0.1476 | 0.7630|$\pm $|0.0717 | 0.9728|$\pm $|0.0204 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_{ld}$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8782|$\pm $|0.0306 | 0.7782|$\pm $|0.0270 | 0.7069|$\pm $|0.0478 | 0.7695|$\pm $|0.0393 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9178|$\pm $|0.0154 | 0.8548|$\pm $|0.0156 | 0.7693|$\pm $|0.0850 | 0.8553|$\pm $|0.0189 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.7256|$\pm $|0.0376 | 0.8169|$\pm $|0.0408 | 0.6155|$\pm $|0.0652 | 0.6836|$\pm $|0.0342 | 0.8388|$\pm $|0.0332 |
MNDR | 0.8824|$\pm $|0.0198 | 0.8818|$\pm $|0.0204 | 0.5034|$\pm $|0.1469 | 0.8204|$\pm $|0.0238 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8120|$\pm $|0.0216 | 0.7916|$\pm $|0.0256 | 0.6793|$\pm $|0.0403 | 0.7385|$\pm $|0.0283 | 0.8504|$\pm $|0.0189 |
MNDR | 0.9015|$\pm $|0.0114 | 0.8658|$\pm $|0.0127 | 0.6793|$\pm $|0.0753 | 0.8405|$\pm $|0.0129 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.7939|$\pm $|0.0260 | 0.7965|$\pm $|0.0262 | 0.6563|$\pm $|0.0492 | 0.7233|$\pm $|0.0289 | 0.8485|$\pm $|0.0198 |
MNDR | 0.8996|$\pm $|0.0119 | 0.8679|$\pm $|0.0129 | 0.5995|$\pm $|0.1312 | 0.8371|$\pm $|0.0137 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.8774|$\pm $|0.0200 | 0.8578|$\pm $|0.0234 | 0.7384|$\pm $|0.0466 | 0.8133|$\pm $|0.0218 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9560|$\pm $|0.0081 | 0.9346|$\pm $|0.0074 | 0.7680|$\pm $|0.0882 | 0.9143|$\pm $|0.0112 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.8952|$\pm $|0.0177 | 0.8489|$\pm $|0.0289 | 0.7409|$\pm $|0.0515 | 0.8131|$\pm $|0.0277 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9639|$\pm $|0.0063 | 0.9273|$\pm $|0.0098 | 0.7689|$\pm $|0.0924 | 0.9100|$\pm $|0.0136 | 0.9761|$\pm $|0.0051 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8782|$\pm $|0.0306 | 0.7782|$\pm $|0.0270 | 0.7069|$\pm $|0.0478 | 0.7695|$\pm $|0.0393 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9178|$\pm $|0.0154 | 0.8548|$\pm $|0.0156 | 0.7693|$\pm $|0.0850 | 0.8553|$\pm $|0.0189 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.7256|$\pm $|0.0376 | 0.8169|$\pm $|0.0408 | 0.6155|$\pm $|0.0652 | 0.6836|$\pm $|0.0342 | 0.8388|$\pm $|0.0332 |
MNDR | 0.8824|$\pm $|0.0198 | 0.8818|$\pm $|0.0204 | 0.5034|$\pm $|0.1469 | 0.8204|$\pm $|0.0238 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8120|$\pm $|0.0216 | 0.7916|$\pm $|0.0256 | 0.6793|$\pm $|0.0403 | 0.7385|$\pm $|0.0283 | 0.8504|$\pm $|0.0189 |
MNDR | 0.9015|$\pm $|0.0114 | 0.8658|$\pm $|0.0127 | 0.6793|$\pm $|0.0753 | 0.8405|$\pm $|0.0129 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.7939|$\pm $|0.0260 | 0.7965|$\pm $|0.0262 | 0.6563|$\pm $|0.0492 | 0.7233|$\pm $|0.0289 | 0.8485|$\pm $|0.0198 |
MNDR | 0.8996|$\pm $|0.0119 | 0.8679|$\pm $|0.0129 | 0.5995|$\pm $|0.1312 | 0.8371|$\pm $|0.0137 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.8774|$\pm $|0.0200 | 0.8578|$\pm $|0.0234 | 0.7384|$\pm $|0.0466 | 0.8133|$\pm $|0.0218 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9560|$\pm $|0.0081 | 0.9346|$\pm $|0.0074 | 0.7680|$\pm $|0.0882 | 0.9143|$\pm $|0.0112 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.8952|$\pm $|0.0177 | 0.8489|$\pm $|0.0289 | 0.7409|$\pm $|0.0515 | 0.8131|$\pm $|0.0277 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9639|$\pm $|0.0063 | 0.9273|$\pm $|0.0098 | 0.7689|$\pm $|0.0924 | 0.9100|$\pm $|0.0136 | 0.9761|$\pm $|0.0051 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_{ld}$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8782|$\pm $|0.0306 | 0.7782|$\pm $|0.0270 | 0.7069|$\pm $|0.0478 | 0.7695|$\pm $|0.0393 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9178|$\pm $|0.0154 | 0.8548|$\pm $|0.0156 | 0.7693|$\pm $|0.0850 | 0.8553|$\pm $|0.0189 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.7256|$\pm $|0.0376 | 0.8169|$\pm $|0.0408 | 0.6155|$\pm $|0.0652 | 0.6836|$\pm $|0.0342 | 0.8388|$\pm $|0.0332 |
MNDR | 0.8824|$\pm $|0.0198 | 0.8818|$\pm $|0.0204 | 0.5034|$\pm $|0.1469 | 0.8204|$\pm $|0.0238 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8120|$\pm $|0.0216 | 0.7916|$\pm $|0.0256 | 0.6793|$\pm $|0.0403 | 0.7385|$\pm $|0.0283 | 0.8504|$\pm $|0.0189 |
MNDR | 0.9015|$\pm $|0.0114 | 0.8658|$\pm $|0.0127 | 0.6793|$\pm $|0.0753 | 0.8405|$\pm $|0.0129 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.7939|$\pm $|0.0260 | 0.7965|$\pm $|0.0262 | 0.6563|$\pm $|0.0492 | 0.7233|$\pm $|0.0289 | 0.8485|$\pm $|0.0198 |
MNDR | 0.8996|$\pm $|0.0119 | 0.8679|$\pm $|0.0129 | 0.5995|$\pm $|0.1312 | 0.8371|$\pm $|0.0137 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.8774|$\pm $|0.0200 | 0.8578|$\pm $|0.0234 | 0.7384|$\pm $|0.0466 | 0.8133|$\pm $|0.0218 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9560|$\pm $|0.0081 | 0.9346|$\pm $|0.0074 | 0.7680|$\pm $|0.0882 | 0.9143|$\pm $|0.0112 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.8952|$\pm $|0.0177 | 0.8489|$\pm $|0.0289 | 0.7409|$\pm $|0.0515 | 0.8131|$\pm $|0.0277 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9639|$\pm $|0.0063 | 0.9273|$\pm $|0.0098 | 0.7689|$\pm $|0.0924 | 0.9100|$\pm $|0.0136 | 0.9761|$\pm $|0.0051 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8782|$\pm $|0.0306 | 0.7782|$\pm $|0.0270 | 0.7069|$\pm $|0.0478 | 0.7695|$\pm $|0.0393 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9178|$\pm $|0.0154 | 0.8548|$\pm $|0.0156 | 0.7693|$\pm $|0.0850 | 0.8553|$\pm $|0.0189 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.7256|$\pm $|0.0376 | 0.8169|$\pm $|0.0408 | 0.6155|$\pm $|0.0652 | 0.6836|$\pm $|0.0342 | 0.8388|$\pm $|0.0332 |
MNDR | 0.8824|$\pm $|0.0198 | 0.8818|$\pm $|0.0204 | 0.5034|$\pm $|0.1469 | 0.8204|$\pm $|0.0238 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8120|$\pm $|0.0216 | 0.7916|$\pm $|0.0256 | 0.6793|$\pm $|0.0403 | 0.7385|$\pm $|0.0283 | 0.8504|$\pm $|0.0189 |
MNDR | 0.9015|$\pm $|0.0114 | 0.8658|$\pm $|0.0127 | 0.6793|$\pm $|0.0753 | 0.8405|$\pm $|0.0129 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.7939|$\pm $|0.0260 | 0.7965|$\pm $|0.0262 | 0.6563|$\pm $|0.0492 | 0.7233|$\pm $|0.0289 | 0.8485|$\pm $|0.0198 |
MNDR | 0.8996|$\pm $|0.0119 | 0.8679|$\pm $|0.0129 | 0.5995|$\pm $|0.1312 | 0.8371|$\pm $|0.0137 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.8774|$\pm $|0.0200 | 0.8578|$\pm $|0.0234 | 0.7384|$\pm $|0.0466 | 0.8133|$\pm $|0.0218 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9560|$\pm $|0.0081 | 0.9346|$\pm $|0.0074 | 0.7680|$\pm $|0.0882 | 0.9143|$\pm $|0.0112 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.8952|$\pm $|0.0177 | 0.8489|$\pm $|0.0289 | 0.7409|$\pm $|0.0515 | 0.8131|$\pm $|0.0277 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9639|$\pm $|0.0063 | 0.9273|$\pm $|0.0098 | 0.7689|$\pm $|0.0924 | 0.9100|$\pm $|0.0136 | 0.9761|$\pm $|0.0051 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_{ind}$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8185|$\pm $|0.0923 | 0.6743|$\pm $|0.117 | 0.4995|$\pm $|0.0998 | 0.6747|$\pm $|0.1091 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9314|$\pm $|0.0441 | 0.7690|$\pm $|0.1019 | 0.5101|$\pm $|0.1187 | 0.7529|$\pm $|0.0753 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6348|$\pm $|0.1593 | 0.4921|$\pm $|0.1329 | 0.6623|$\pm $|0.1743 | 0.4112|$\pm $|0.1314 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8073|$\pm $|0.1106 | 0.5685|$\pm $|0.1274 | 0.5610|$\pm $|0.1941 | 0.5030|$\pm $|0.1222 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7422|$\pm $|0.0746 | 0.6242|$\pm $|0.0812 | 0.5029|$\pm $|0.1254 | 0.6077|$\pm $|0.0748 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8731|$\pm $|0.0553 | 0.7007|$\pm $|0.0764 | 0.5197|$\pm $|0.1214 | 0.6683|$\pm $|0.0615 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7001|$\pm $|0.1217 | 0.5599|$\pm $|0.1171 | 0.5664|$\pm $|0.1221 | 0.5034|$\pm $|0.1177 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8600|$\pm $|0.0744 | 0.6482|$\pm $|0.1192 | 0.5286|$\pm $|0.1452 | 0.5958|$\pm $|0.0946 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.7749|$\pm $|0.1115 | 0.6836|$\pm $|0.0899 | 0.5159|$\pm $|0.1679 | 0.6642|$\pm $|0.0862 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9247|$\pm $|0.0419 | 0.7851|$\pm $|0.0756 | 0.5289|$\pm $|0.1616 | 0.7638|$\pm $|0.0745 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8285|$\pm $|0.0834 | 0.6928|$\pm $|0.0901 | 0.5490|$\pm $|0.1283 | 0.6603|$\pm $|0.0955 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9431|$\pm $|0.0333 | 0.7593|$\pm $|0.0898 | 0.5365|$\pm $|0.1225 | 0.7430|$\pm $|0.0711 | 0.9621|$\pm $|0.0194 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8185|$\pm $|0.0923 | 0.6743|$\pm $|0.117 | 0.4995|$\pm $|0.0998 | 0.6747|$\pm $|0.1091 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9314|$\pm $|0.0441 | 0.7690|$\pm $|0.1019 | 0.5101|$\pm $|0.1187 | 0.7529|$\pm $|0.0753 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6348|$\pm $|0.1593 | 0.4921|$\pm $|0.1329 | 0.6623|$\pm $|0.1743 | 0.4112|$\pm $|0.1314 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8073|$\pm $|0.1106 | 0.5685|$\pm $|0.1274 | 0.5610|$\pm $|0.1941 | 0.5030|$\pm $|0.1222 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7422|$\pm $|0.0746 | 0.6242|$\pm $|0.0812 | 0.5029|$\pm $|0.1254 | 0.6077|$\pm $|0.0748 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8731|$\pm $|0.0553 | 0.7007|$\pm $|0.0764 | 0.5197|$\pm $|0.1214 | 0.6683|$\pm $|0.0615 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7001|$\pm $|0.1217 | 0.5599|$\pm $|0.1171 | 0.5664|$\pm $|0.1221 | 0.5034|$\pm $|0.1177 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8600|$\pm $|0.0744 | 0.6482|$\pm $|0.1192 | 0.5286|$\pm $|0.1452 | 0.5958|$\pm $|0.0946 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.7749|$\pm $|0.1115 | 0.6836|$\pm $|0.0899 | 0.5159|$\pm $|0.1679 | 0.6642|$\pm $|0.0862 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9247|$\pm $|0.0419 | 0.7851|$\pm $|0.0756 | 0.5289|$\pm $|0.1616 | 0.7638|$\pm $|0.0745 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8285|$\pm $|0.0834 | 0.6928|$\pm $|0.0901 | 0.5490|$\pm $|0.1283 | 0.6603|$\pm $|0.0955 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9431|$\pm $|0.0333 | 0.7593|$\pm $|0.0898 | 0.5365|$\pm $|0.1225 | 0.7430|$\pm $|0.0711 | 0.9621|$\pm $|0.0194 |
The performance comparison of five LDA prediction methods under 5-fold |$CV_{ind}$|
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8185|$\pm $|0.0923 | 0.6743|$\pm $|0.117 | 0.4995|$\pm $|0.0998 | 0.6747|$\pm $|0.1091 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9314|$\pm $|0.0441 | 0.7690|$\pm $|0.1019 | 0.5101|$\pm $|0.1187 | 0.7529|$\pm $|0.0753 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6348|$\pm $|0.1593 | 0.4921|$\pm $|0.1329 | 0.6623|$\pm $|0.1743 | 0.4112|$\pm $|0.1314 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8073|$\pm $|0.1106 | 0.5685|$\pm $|0.1274 | 0.5610|$\pm $|0.1941 | 0.5030|$\pm $|0.1222 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7422|$\pm $|0.0746 | 0.6242|$\pm $|0.0812 | 0.5029|$\pm $|0.1254 | 0.6077|$\pm $|0.0748 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8731|$\pm $|0.0553 | 0.7007|$\pm $|0.0764 | 0.5197|$\pm $|0.1214 | 0.6683|$\pm $|0.0615 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7001|$\pm $|0.1217 | 0.5599|$\pm $|0.1171 | 0.5664|$\pm $|0.1221 | 0.5034|$\pm $|0.1177 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8600|$\pm $|0.0744 | 0.6482|$\pm $|0.1192 | 0.5286|$\pm $|0.1452 | 0.5958|$\pm $|0.0946 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.7749|$\pm $|0.1115 | 0.6836|$\pm $|0.0899 | 0.5159|$\pm $|0.1679 | 0.6642|$\pm $|0.0862 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9247|$\pm $|0.0419 | 0.7851|$\pm $|0.0756 | 0.5289|$\pm $|0.1616 | 0.7638|$\pm $|0.0745 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8285|$\pm $|0.0834 | 0.6928|$\pm $|0.0901 | 0.5490|$\pm $|0.1283 | 0.6603|$\pm $|0.0955 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9431|$\pm $|0.0333 | 0.7593|$\pm $|0.0898 | 0.5365|$\pm $|0.1225 | 0.7430|$\pm $|0.0711 | 0.9621|$\pm $|0.0194 |
. | Dataset . | SDLDA . | LDNFSGB . | IPCARF . | LDASR . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8185|$\pm $|0.0923 | 0.6743|$\pm $|0.117 | 0.4995|$\pm $|0.0998 | 0.6747|$\pm $|0.1091 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9314|$\pm $|0.0441 | 0.7690|$\pm $|0.1019 | 0.5101|$\pm $|0.1187 | 0.7529|$\pm $|0.0753 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6348|$\pm $|0.1593 | 0.4921|$\pm $|0.1329 | 0.6623|$\pm $|0.1743 | 0.4112|$\pm $|0.1314 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8073|$\pm $|0.1106 | 0.5685|$\pm $|0.1274 | 0.5610|$\pm $|0.1941 | 0.5030|$\pm $|0.1222 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7422|$\pm $|0.0746 | 0.6242|$\pm $|0.0812 | 0.5029|$\pm $|0.1254 | 0.6077|$\pm $|0.0748 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8731|$\pm $|0.0553 | 0.7007|$\pm $|0.0764 | 0.5197|$\pm $|0.1214 | 0.6683|$\pm $|0.0615 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7001|$\pm $|0.1217 | 0.5599|$\pm $|0.1171 | 0.5664|$\pm $|0.1221 | 0.5034|$\pm $|0.1177 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8600|$\pm $|0.0744 | 0.6482|$\pm $|0.1192 | 0.5286|$\pm $|0.1452 | 0.5958|$\pm $|0.0946 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.7749|$\pm $|0.1115 | 0.6836|$\pm $|0.0899 | 0.5159|$\pm $|0.1679 | 0.6642|$\pm $|0.0862 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9247|$\pm $|0.0419 | 0.7851|$\pm $|0.0756 | 0.5289|$\pm $|0.1616 | 0.7638|$\pm $|0.0745 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8285|$\pm $|0.0834 | 0.6928|$\pm $|0.0901 | 0.5490|$\pm $|0.1283 | 0.6603|$\pm $|0.0955 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9431|$\pm $|0.0333 | 0.7593|$\pm $|0.0898 | 0.5365|$\pm $|0.1225 | 0.7430|$\pm $|0.0711 | 0.9621|$\pm $|0.0194 |

The ROC and PR curves of LDA-VGHB and the other four LDA prediction methods. A-B and C-D, E-F and G-H, I-J and K-L and M-N and O-P denote the ROC and PR curves of five methods on the lncRNADisease and MNDR databases under 5-fold |$CV_{l}$|, |$CV_{d}$|, |$CV_{ld}$|, |$CV_{ind}$|, respectively.
To assess the performance of LDA-VGHB and the other four LDA prediction methods in predicting potential diseases for a new lncRNA, we considered 5-fold CV on lncRNAs. Under 5-fold CV on lncRNAs (|$CV_l$|), all five LDA prediction methods randomly selected 80% of lncRNAs as train set and used the remaining as test set. As shown in Table 2 and Figure 2, LDA-VGHB obtained the best performance, followed by SDLDA, LDNFSGB, LDASR and IPCAF on the lncRNADisease and MNDR databases. Particularly, LDA-VGHB computed the best AUCs of 0.8814 and 0.9541, outperforming 8.97% and 1.83% than SDLDA on the two datasets, respectively. It also obtained the best AUPRs of 0.8949 and 0.9617, 5.45% and 0.87% better than SDLDA, respectively. In general, LDA-VGHB efficiently found potential associated diseases for a new lncRNA.
To evaluate the performance of LDA-VGHB and the other four LDA prediction methods in predicting potential lncRNAs for a new disease, we considered 5-fold CV on diseases. Under 5-fold CV on diseases (|$CV_d$|), all five LDA prediction methods randomly selected 80% of diseases as train set and used the remaining as test set. As shown in Table 3 and Figure 2, LDA-VGHB significantly outperformed SDLDA, LDNFSGB, LDASR and IPCAF on the two LDA datasets. For example, LDA-VGHB computed the highest AUCs of 0.9406 and 0.9741, was better 6.57% and 1.87% than SDLDA on the two datasets, respectively. It also computed the best AUPRs of 0.9429 and 0.9728, outperforming 5.25% and 1.72% compared with SDLDA, respectively. We found that LDA-VGHB could accurately predict possible lncRNAs for a new disease.
To assess the performance of LDA-VGHB with the other four LDA prediction methods in predicting potential LDAs for lncRNA–disease pairs, we considered 5-fold CV on lncRNA–disease pairs. Under 5-fold CV on lncRNA–disease pairs (|$CV_{ld}$|), all five methods randomly selected 80% of lncRNA–disease pairs as train set and used the remaining as test set. As shown in Table 4 and Figure 2, LDA-VGHB obviously improved LDA identification compared with SDLDA, LDNFSGB, LDASR and IPCAF under majority of conditions. It calculated the AUC values of 0.9271 and 0.9722, 5.36% and 1.67% better than the second-best method on the two datasets, respectively. It also calculated the AUPR values of 0.9364 and 0.9761, 4.40% and 1.25% better than the second-best method, respectively. Consequently, LDA-VGHB more accurately predicted possible LDAs based on known LDAs.
Lastly, to evaluate the performance of LDA-VGHB with the other four LDA prediction methods in predicting potential LDAs for independent lncRNAs and independent diseases, we considered 5-fold CV on independent lncRNAs and independent diseases. Under 5-fold CV on independent lncRNAs and independent diseases |$CV_{ind}$|, first, all five LDA prediction methods randomly selected 20% of lncRNAs and 20% of diseases to construct a ‘node test set’. Next, the five LDA prediction methods took the remaining lncRNAs and diseases as a ‘node train set’, and removed all edges linking a node in the ‘node train set’ with a node in the ‘node test set’. Finally, the five methods were trained only on the ‘node train set’ and were assessed the performance within the ‘node test set’. As shown in Table 5 and Figure 2, LDA-VGHB computed the best recall, accuracy, F1-score, AUC and AUPR on the two LDA datasets. It computed the highest AUCs of 0.8924 and 0.9576, outperforming 13.17% and 3.44% than SDLDA, respectively. It also computed the best AUPRs of 0.9056 and 0.9621, better 8.51% and 1.97% than SDLDA, respectively. The results manifest that LDA-VGHB computed the optimal LDA prediction performance under independent datasets.
Furthermore, boosting is one of the most popular ensemble learning tools and significantly improves classification performance [84, 85]. To evaluate the LDA classification performance of various boosting models, we compared LDA-VGHB with the other four popular boosting algorithms, i.e. XGBoost [86], AdaBoost [81], CatBoost [82] and LightGBM [87] under four different CVs. The four boosting algorithms used the same similarity computation and feature extraction procedures as LDA-VGHB. Their difference is to use different boosting models for classifying unknown lncRNA–disease pairs. The experiments were repeatedly conducted for 20 times. Tables 6–9 show their LDA prediction performance under 5-fold CVs on lncRNAs, diseases, lncRNA–disease pairs, independent lncRNAs and independent diseases, respectively. The results demonstrate that LDA-VGHB computed the best LDA identification accuracy on the two LDA databases under the four CVs under majority of conditions, thereby elucidating the powerful LDA classification performance of heterogeneous Newton boosting machine.
The LDA prediction performance comparison of five boosting models under |$CV_{l}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8410|$\pm $|0.0576 | 0.7497|$\pm $|0.0604 | 0.8592|$\pm $|0.0534 | 0.8245|$\pm $|0.0553 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9248 |$\pm $|0.0243 | 0.8835|$\pm $|0.0312 | 0.9161|$\pm $|0.0288 | 0.9255|$\pm $|0.0205 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.7300|$\pm $|0.0778 | 0.7898|$\pm $|0.1048 | 0.6876|$\pm $|0.0835 | 0.7013|$\pm $|0.0814 | 0.7180|$\pm $|0.7180 |
MNDR | 0.8451|$\pm $|0.0417 | 0.8244|$\pm $|0.0725 | 0.8440|$\pm $|0.0548 | 0.8479|$\pm $|0.0377 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.8034|$\pm $|0.0413 | 0.7747|$\pm $|0.0357 | 0.7969|$\pm $|0.0420 | 0.7839|$\pm $|0.0409 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8876|$\pm $|0.0305 | 0.8567|$\pm $|0.0378 | 0.8832|$\pm $|0.0320 | 0.8899|$\pm $|0.0267 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7788|$\pm $|0.0540 | 0.7664|$\pm $|0.0757 | 0.7609|$\pm $|0.0582 | 0.7540|$\pm $|0.0506 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8826|$\pm $|0.0282 | 0.8505|$\pm $|0.0371 | 0.8774|$\pm $|0.0324 | 0.8845|$\pm $|0.0237 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8785|$\pm $|0.0337 | 0.8373|$\pm $|0.0426 | 0.8831|$\pm $|0.0275 | 0.8466|$\pm $|0.0397 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9527|$\pm $|0.0207 | 0.9095|$\pm $|0.0369 | 0.9601|$\pm $|0.0123 | 0.9542|$\pm $|0.0191 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8720|$\pm $|0.0441 | 0.8595|$\pm $|0.0667 | 0.8890|$\pm $|0.0516 | 0.8322|$\pm $|0.0520 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9604|$\pm $|0.0144 | 0.9310|$\pm $|0.0231 | 0.9657|$\pm $|0.0114 | 0.9562|$\pm $|0.0322 | 0.9617|$\pm $|0.0131 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8410|$\pm $|0.0576 | 0.7497|$\pm $|0.0604 | 0.8592|$\pm $|0.0534 | 0.8245|$\pm $|0.0553 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9248 |$\pm $|0.0243 | 0.8835|$\pm $|0.0312 | 0.9161|$\pm $|0.0288 | 0.9255|$\pm $|0.0205 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.7300|$\pm $|0.0778 | 0.7898|$\pm $|0.1048 | 0.6876|$\pm $|0.0835 | 0.7013|$\pm $|0.0814 | 0.7180|$\pm $|0.7180 |
MNDR | 0.8451|$\pm $|0.0417 | 0.8244|$\pm $|0.0725 | 0.8440|$\pm $|0.0548 | 0.8479|$\pm $|0.0377 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.8034|$\pm $|0.0413 | 0.7747|$\pm $|0.0357 | 0.7969|$\pm $|0.0420 | 0.7839|$\pm $|0.0409 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8876|$\pm $|0.0305 | 0.8567|$\pm $|0.0378 | 0.8832|$\pm $|0.0320 | 0.8899|$\pm $|0.0267 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7788|$\pm $|0.0540 | 0.7664|$\pm $|0.0757 | 0.7609|$\pm $|0.0582 | 0.7540|$\pm $|0.0506 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8826|$\pm $|0.0282 | 0.8505|$\pm $|0.0371 | 0.8774|$\pm $|0.0324 | 0.8845|$\pm $|0.0237 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8785|$\pm $|0.0337 | 0.8373|$\pm $|0.0426 | 0.8831|$\pm $|0.0275 | 0.8466|$\pm $|0.0397 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9527|$\pm $|0.0207 | 0.9095|$\pm $|0.0369 | 0.9601|$\pm $|0.0123 | 0.9542|$\pm $|0.0191 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8720|$\pm $|0.0441 | 0.8595|$\pm $|0.0667 | 0.8890|$\pm $|0.0516 | 0.8322|$\pm $|0.0520 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9604|$\pm $|0.0144 | 0.9310|$\pm $|0.0231 | 0.9657|$\pm $|0.0114 | 0.9562|$\pm $|0.0322 | 0.9617|$\pm $|0.0131 |
The LDA prediction performance comparison of five boosting models under |$CV_{l}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8410|$\pm $|0.0576 | 0.7497|$\pm $|0.0604 | 0.8592|$\pm $|0.0534 | 0.8245|$\pm $|0.0553 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9248 |$\pm $|0.0243 | 0.8835|$\pm $|0.0312 | 0.9161|$\pm $|0.0288 | 0.9255|$\pm $|0.0205 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.7300|$\pm $|0.0778 | 0.7898|$\pm $|0.1048 | 0.6876|$\pm $|0.0835 | 0.7013|$\pm $|0.0814 | 0.7180|$\pm $|0.7180 |
MNDR | 0.8451|$\pm $|0.0417 | 0.8244|$\pm $|0.0725 | 0.8440|$\pm $|0.0548 | 0.8479|$\pm $|0.0377 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.8034|$\pm $|0.0413 | 0.7747|$\pm $|0.0357 | 0.7969|$\pm $|0.0420 | 0.7839|$\pm $|0.0409 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8876|$\pm $|0.0305 | 0.8567|$\pm $|0.0378 | 0.8832|$\pm $|0.0320 | 0.8899|$\pm $|0.0267 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7788|$\pm $|0.0540 | 0.7664|$\pm $|0.0757 | 0.7609|$\pm $|0.0582 | 0.7540|$\pm $|0.0506 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8826|$\pm $|0.0282 | 0.8505|$\pm $|0.0371 | 0.8774|$\pm $|0.0324 | 0.8845|$\pm $|0.0237 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8785|$\pm $|0.0337 | 0.8373|$\pm $|0.0426 | 0.8831|$\pm $|0.0275 | 0.8466|$\pm $|0.0397 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9527|$\pm $|0.0207 | 0.9095|$\pm $|0.0369 | 0.9601|$\pm $|0.0123 | 0.9542|$\pm $|0.0191 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8720|$\pm $|0.0441 | 0.8595|$\pm $|0.0667 | 0.8890|$\pm $|0.0516 | 0.8322|$\pm $|0.0520 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9604|$\pm $|0.0144 | 0.9310|$\pm $|0.0231 | 0.9657|$\pm $|0.0114 | 0.9562|$\pm $|0.0322 | 0.9617|$\pm $|0.0131 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8410|$\pm $|0.0576 | 0.7497|$\pm $|0.0604 | 0.8592|$\pm $|0.0534 | 0.8245|$\pm $|0.0553 | 0.8741|$\pm $|0.0484 |
MNDR | 0.9248 |$\pm $|0.0243 | 0.8835|$\pm $|0.0312 | 0.9161|$\pm $|0.0288 | 0.9255|$\pm $|0.0205 | 0.9250|$\pm $|0.0201 | |
Recall | lncRNADisease | 0.7300|$\pm $|0.0778 | 0.7898|$\pm $|0.1048 | 0.6876|$\pm $|0.0835 | 0.7013|$\pm $|0.0814 | 0.7180|$\pm $|0.7180 |
MNDR | 0.8451|$\pm $|0.0417 | 0.8244|$\pm $|0.0725 | 0.8440|$\pm $|0.0548 | 0.8479|$\pm $|0.0377 | 0.8602|$\pm $|0.0395 | |
Accuracy | lncRNADisease | 0.8034|$\pm $|0.0413 | 0.7747|$\pm $|0.0357 | 0.7969|$\pm $|0.0420 | 0.7839|$\pm $|0.0409 | 0.8123|$\pm $|0.0384 |
MNDR | 0.8876|$\pm $|0.0305 | 0.8567|$\pm $|0.0378 | 0.8832|$\pm $|0.0320 | 0.8899|$\pm $|0.0267 | 0.8947|$\pm $|0.0258 | |
F1-score | lncRNADisease | 0.7788|$\pm $|0.0540 | 0.7664|$\pm $|0.0757 | 0.7609|$\pm $|0.0582 | 0.7540|$\pm $|0.0506 | 0.7852|$\pm $|0.0412 |
MNDR | 0.8826|$\pm $|0.0282 | 0.8505|$\pm $|0.0371 | 0.8774|$\pm $|0.0324 | 0.8845|$\pm $|0.0237 | 0.8908|$\pm $|0.0227 | |
AUC | lncRNADisease | 0.8785|$\pm $|0.0337 | 0.8373|$\pm $|0.0426 | 0.8831|$\pm $|0.0275 | 0.8466|$\pm $|0.0397 | 0.8814|$\pm $|0.0425 |
MNDR | 0.9527|$\pm $|0.0207 | 0.9095|$\pm $|0.0369 | 0.9601|$\pm $|0.0123 | 0.9542|$\pm $|0.0191 | 0.9541|$\pm $|0.0200 | |
AUPR | lncRNADisease | 0.8720|$\pm $|0.0441 | 0.8595|$\pm $|0.0667 | 0.8890|$\pm $|0.0516 | 0.8322|$\pm $|0.0520 | 0.8949|$\pm $|0.0322 |
MNDR | 0.9604|$\pm $|0.0144 | 0.9310|$\pm $|0.0231 | 0.9657|$\pm $|0.0114 | 0.9562|$\pm $|0.0322 | 0.9617|$\pm $|0.0131 |
The LDA prediction performance comparison of five boosting models under |$CV_{d}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8687|$\pm $|0.0383 | 0.7471|$\pm $|0.0485 | 0.8813|$\pm $|0.0366 | 0.8786|$\pm $|0.0455 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9220|$\pm $|0.0318 | 0.8734|$\pm $|0.0523 | 0.9153|$\pm $|0.0280 | 0.9157|$\pm $|0.0385 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.8027|$\pm $|0.0515 | 0.8292|$\pm $|0.0625 | 0.7700|$\pm $|0.0762 | 0.8071|$\pm $|0.0478 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8930|$\pm $|0.0400 | 0.8001|$\pm $|0.0855 | 0.9052|$\pm $|0.0361 | 0.8890|$\pm $|0.0476 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8446|$\pm $|0.0232 | 0.7791|$\pm $|0.0324 | 0.8393|$\pm $|0.0251 | 0.8518|$\pm $|0.0231 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9154|$\pm $|0.0162 | 0.8552|$\pm $|0.0233 | 0.9155|$\pm $|0.0157 | 0.9124|$\pm $|0.0181 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.8334|$\pm $|0.0370 | 0.7848|$\pm $|0.0473 | 0.8198|$\pm $|0.0510 | 0.8403|$\pm $|0.0370 | 0.8651|$\pm $|0.0304 |
MNDR | 0.9070|$\pm $|0.0329 | 0.8332|$\pm $|0.0654 | 0.9099|$\pm $|0.0280 | 0.9019|$\pm $|0.0406 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.9075|$\pm $|0.0214 | 0.8440|$\pm $|0.0379 | 0.9148|$\pm $|0.0221 | 0.9118|$\pm $|0.0234 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9663|$\pm $|0.0108 | 0.8915|$\pm $|0.0329 | 0.9671|$\pm $|0.0108 | 0.9651|$\pm $|0.0132 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.9063|$\pm $|0.0316 | 0.8704|$\pm $|0.0471 | 0.9243|$\pm $|0.0319 | 0.9125|$\pm $|0.0383 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9639|$\pm $|0.0254 | 0.9051|$\pm $|0.0562 | 0.9684|$\pm $|0.0186 | 0.9615|$\pm $|0.0347 | 0.9728|$\pm $|0.0204 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8687|$\pm $|0.0383 | 0.7471|$\pm $|0.0485 | 0.8813|$\pm $|0.0366 | 0.8786|$\pm $|0.0455 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9220|$\pm $|0.0318 | 0.8734|$\pm $|0.0523 | 0.9153|$\pm $|0.0280 | 0.9157|$\pm $|0.0385 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.8027|$\pm $|0.0515 | 0.8292|$\pm $|0.0625 | 0.7700|$\pm $|0.0762 | 0.8071|$\pm $|0.0478 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8930|$\pm $|0.0400 | 0.8001|$\pm $|0.0855 | 0.9052|$\pm $|0.0361 | 0.8890|$\pm $|0.0476 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8446|$\pm $|0.0232 | 0.7791|$\pm $|0.0324 | 0.8393|$\pm $|0.0251 | 0.8518|$\pm $|0.0231 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9154|$\pm $|0.0162 | 0.8552|$\pm $|0.0233 | 0.9155|$\pm $|0.0157 | 0.9124|$\pm $|0.0181 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.8334|$\pm $|0.0370 | 0.7848|$\pm $|0.0473 | 0.8198|$\pm $|0.0510 | 0.8403|$\pm $|0.0370 | 0.8651|$\pm $|0.0304 |
MNDR | 0.9070|$\pm $|0.0329 | 0.8332|$\pm $|0.0654 | 0.9099|$\pm $|0.0280 | 0.9019|$\pm $|0.0406 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.9075|$\pm $|0.0214 | 0.8440|$\pm $|0.0379 | 0.9148|$\pm $|0.0221 | 0.9118|$\pm $|0.0234 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9663|$\pm $|0.0108 | 0.8915|$\pm $|0.0329 | 0.9671|$\pm $|0.0108 | 0.9651|$\pm $|0.0132 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.9063|$\pm $|0.0316 | 0.8704|$\pm $|0.0471 | 0.9243|$\pm $|0.0319 | 0.9125|$\pm $|0.0383 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9639|$\pm $|0.0254 | 0.9051|$\pm $|0.0562 | 0.9684|$\pm $|0.0186 | 0.9615|$\pm $|0.0347 | 0.9728|$\pm $|0.0204 |
The LDA prediction performance comparison of five boosting models under |$CV_{d}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8687|$\pm $|0.0383 | 0.7471|$\pm $|0.0485 | 0.8813|$\pm $|0.0366 | 0.8786|$\pm $|0.0455 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9220|$\pm $|0.0318 | 0.8734|$\pm $|0.0523 | 0.9153|$\pm $|0.0280 | 0.9157|$\pm $|0.0385 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.8027|$\pm $|0.0515 | 0.8292|$\pm $|0.0625 | 0.7700|$\pm $|0.0762 | 0.8071|$\pm $|0.0478 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8930|$\pm $|0.0400 | 0.8001|$\pm $|0.0855 | 0.9052|$\pm $|0.0361 | 0.8890|$\pm $|0.0476 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8446|$\pm $|0.0232 | 0.7791|$\pm $|0.0324 | 0.8393|$\pm $|0.0251 | 0.8518|$\pm $|0.0231 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9154|$\pm $|0.0162 | 0.8552|$\pm $|0.0233 | 0.9155|$\pm $|0.0157 | 0.9124|$\pm $|0.0181 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.8334|$\pm $|0.0370 | 0.7848|$\pm $|0.0473 | 0.8198|$\pm $|0.0510 | 0.8403|$\pm $|0.0370 | 0.8651|$\pm $|0.0304 |
MNDR | 0.9070|$\pm $|0.0329 | 0.8332|$\pm $|0.0654 | 0.9099|$\pm $|0.0280 | 0.9019|$\pm $|0.0406 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.9075|$\pm $|0.0214 | 0.8440|$\pm $|0.0379 | 0.9148|$\pm $|0.0221 | 0.9118|$\pm $|0.0234 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9663|$\pm $|0.0108 | 0.8915|$\pm $|0.0329 | 0.9671|$\pm $|0.0108 | 0.9651|$\pm $|0.0132 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.9063|$\pm $|0.0316 | 0.8704|$\pm $|0.0471 | 0.9243|$\pm $|0.0319 | 0.9125|$\pm $|0.0383 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9639|$\pm $|0.0254 | 0.9051|$\pm $|0.0562 | 0.9684|$\pm $|0.0186 | 0.9615|$\pm $|0.0347 | 0.9728|$\pm $|0.0204 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8687|$\pm $|0.0383 | 0.7471|$\pm $|0.0485 | 0.8813|$\pm $|0.0366 | 0.8786|$\pm $|0.0455 | 0.8917|$\pm $|0.0316 |
MNDR | 0.9220|$\pm $|0.0318 | 0.8734|$\pm $|0.0523 | 0.9153|$\pm $|0.0280 | 0.9157|$\pm $|0.0385 | 0.9300|$\pm $|0.0251 | |
Recall | lncRNADisease | 0.8027|$\pm $|0.0515 | 0.8292|$\pm $|0.0625 | 0.7700|$\pm $|0.0762 | 0.8071|$\pm $|0.0478 | 0.8415|$\pm $|0.0449 |
MNDR | 0.8930|$\pm $|0.0400 | 0.8001|$\pm $|0.0855 | 0.9052|$\pm $|0.0361 | 0.8890|$\pm $|0.0476 | 0.9190|$\pm $|0.0397 | |
Accuracy | lncRNADisease | 0.8446|$\pm $|0.0232 | 0.7791|$\pm $|0.0324 | 0.8393|$\pm $|0.0251 | 0.8518|$\pm $|0.0231 | 0.8737|$\pm $|0.0177 |
MNDR | 0.9154|$\pm $|0.0162 | 0.8552|$\pm $|0.0233 | 0.9155|$\pm $|0.0157 | 0.9124|$\pm $|0.0181 | 0.9305|$\pm $|0.0153 | |
F1-score | lncRNADisease | 0.8334|$\pm $|0.0370 | 0.7848|$\pm $|0.0473 | 0.8198|$\pm $|0.0510 | 0.8403|$\pm $|0.0370 | 0.8651|$\pm $|0.0304 |
MNDR | 0.9070|$\pm $|0.0329 | 0.8332|$\pm $|0.0654 | 0.9099|$\pm $|0.0280 | 0.9019|$\pm $|0.0406 | 0.9242|$\pm $|0.0298 | |
AUC | lncRNADisease | 0.9075|$\pm $|0.0214 | 0.8440|$\pm $|0.0379 | 0.9148|$\pm $|0.0221 | 0.9118|$\pm $|0.0234 | 0.9406|$\pm $|0.0154 |
MNDR | 0.9663|$\pm $|0.0108 | 0.8915|$\pm $|0.0329 | 0.9671|$\pm $|0.0108 | 0.9651|$\pm $|0.0132 | 0.9741|$\pm $|0.0106 | |
AUPR | lncRNADisease | 0.9063|$\pm $|0.0316 | 0.8704|$\pm $|0.0471 | 0.9243|$\pm $|0.0319 | 0.9125|$\pm $|0.0383 | 0.9429|$\pm $|0.0233 |
MNDR | 0.9639|$\pm $|0.0254 | 0.9051|$\pm $|0.0562 | 0.9684|$\pm $|0.0186 | 0.9615|$\pm $|0.0347 | 0.9728|$\pm $|0.0204 |
The LDA prediction performance comparison of five boosting models under |$CV_{ld}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8852|$\pm $|0.0241 | 0.7751|$\pm $|0.0261 | 0.8758|$\pm $|0.0323 | 0.8655|$\pm $|0.0288 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9192 |$\pm $|0.0127 | 0.8955|$\pm $|0.0178 | 0.9079|$\pm $|0.0145 | 0.9258|$\pm $|0.0152 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.8407|$\pm $|0.0340 | 0.8297|$\pm $|0.0286 | 0.8070|$\pm $|0.0374 | 0.8303|$\pm $|0.8303 | 0.8388|$\pm $|0.0332 |
MNDR | 0.9060|$\pm $|0.0167 | 0.8113|$\pm $|0.0242 | 0.9137|$\pm $|0.0165 | 0.8993|$\pm $|0.0200 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8655|$\pm $|0.0199 | 0.7940|$\pm $|0.0222 | 0.8458|$\pm $|0.0258 | 0.8501|$\pm $|0.0208 | 0.8123|$\pm $|0.0384 |
MNDR | 0.9131|$\pm $|0.0102 | 0.8581|$\pm $|0.0119 | 0.9104|$\pm $|0.0115 | 0.9134|$\pm $|0.0111 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.8619|$\pm $|0.0215 | 0.8011|$\pm $|0.0211 | 0.8394|$\pm $|0.0274 | 0.8470|$\pm $|0.0214 | 0.8485|$\pm $|0.0198 |
MNDR | 0.9124|$\pm $|0.0105 | 0.8510|$\pm $|0.0135 | 0.9106|$\pm $|0.0115 | 0.9121|$\pm $|0.0116 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.9182|$\pm $|0.0175 | 0.8542|$\pm $|0.0178 | 0.9195|$\pm $|0.0175 | 0.9154|$\pm $|0.0147 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9661|$\pm $|0.0070 | 0.9038|$\pm $|0.0130 | 0.9665|$\pm $|0.0068 | 0.9716|$\pm $|0.0058 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.9186|$\pm $|0.0192 | 0.8824|$\pm $|0.0155 | 0.9301|$\pm $|0.0152 | 0.9146|$\pm $|0.0225 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9690|$\pm $|0.0067 | 0.9255|$\pm $|0.0094 | 0.9701|$\pm $|0.0061 | 0.9742|$\pm $|0.0138 | 0.9761|$\pm $|0.0051 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8852|$\pm $|0.0241 | 0.7751|$\pm $|0.0261 | 0.8758|$\pm $|0.0323 | 0.8655|$\pm $|0.0288 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9192 |$\pm $|0.0127 | 0.8955|$\pm $|0.0178 | 0.9079|$\pm $|0.0145 | 0.9258|$\pm $|0.0152 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.8407|$\pm $|0.0340 | 0.8297|$\pm $|0.0286 | 0.8070|$\pm $|0.0374 | 0.8303|$\pm $|0.8303 | 0.8388|$\pm $|0.0332 |
MNDR | 0.9060|$\pm $|0.0167 | 0.8113|$\pm $|0.0242 | 0.9137|$\pm $|0.0165 | 0.8993|$\pm $|0.0200 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8655|$\pm $|0.0199 | 0.7940|$\pm $|0.0222 | 0.8458|$\pm $|0.0258 | 0.8501|$\pm $|0.0208 | 0.8123|$\pm $|0.0384 |
MNDR | 0.9131|$\pm $|0.0102 | 0.8581|$\pm $|0.0119 | 0.9104|$\pm $|0.0115 | 0.9134|$\pm $|0.0111 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.8619|$\pm $|0.0215 | 0.8011|$\pm $|0.0211 | 0.8394|$\pm $|0.0274 | 0.8470|$\pm $|0.0214 | 0.8485|$\pm $|0.0198 |
MNDR | 0.9124|$\pm $|0.0105 | 0.8510|$\pm $|0.0135 | 0.9106|$\pm $|0.0115 | 0.9121|$\pm $|0.0116 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.9182|$\pm $|0.0175 | 0.8542|$\pm $|0.0178 | 0.9195|$\pm $|0.0175 | 0.9154|$\pm $|0.0147 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9661|$\pm $|0.0070 | 0.9038|$\pm $|0.0130 | 0.9665|$\pm $|0.0068 | 0.9716|$\pm $|0.0058 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.9186|$\pm $|0.0192 | 0.8824|$\pm $|0.0155 | 0.9301|$\pm $|0.0152 | 0.9146|$\pm $|0.0225 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9690|$\pm $|0.0067 | 0.9255|$\pm $|0.0094 | 0.9701|$\pm $|0.0061 | 0.9742|$\pm $|0.0138 | 0.9761|$\pm $|0.0051 |
The LDA prediction performance comparison of five boosting models under |$CV_{ld}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8852|$\pm $|0.0241 | 0.7751|$\pm $|0.0261 | 0.8758|$\pm $|0.0323 | 0.8655|$\pm $|0.0288 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9192 |$\pm $|0.0127 | 0.8955|$\pm $|0.0178 | 0.9079|$\pm $|0.0145 | 0.9258|$\pm $|0.0152 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.8407|$\pm $|0.0340 | 0.8297|$\pm $|0.0286 | 0.8070|$\pm $|0.0374 | 0.8303|$\pm $|0.8303 | 0.8388|$\pm $|0.0332 |
MNDR | 0.9060|$\pm $|0.0167 | 0.8113|$\pm $|0.0242 | 0.9137|$\pm $|0.0165 | 0.8993|$\pm $|0.0200 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8655|$\pm $|0.0199 | 0.7940|$\pm $|0.0222 | 0.8458|$\pm $|0.0258 | 0.8501|$\pm $|0.0208 | 0.8123|$\pm $|0.0384 |
MNDR | 0.9131|$\pm $|0.0102 | 0.8581|$\pm $|0.0119 | 0.9104|$\pm $|0.0115 | 0.9134|$\pm $|0.0111 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.8619|$\pm $|0.0215 | 0.8011|$\pm $|0.0211 | 0.8394|$\pm $|0.0274 | 0.8470|$\pm $|0.0214 | 0.8485|$\pm $|0.0198 |
MNDR | 0.9124|$\pm $|0.0105 | 0.8510|$\pm $|0.0135 | 0.9106|$\pm $|0.0115 | 0.9121|$\pm $|0.0116 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.9182|$\pm $|0.0175 | 0.8542|$\pm $|0.0178 | 0.9195|$\pm $|0.0175 | 0.9154|$\pm $|0.0147 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9661|$\pm $|0.0070 | 0.9038|$\pm $|0.0130 | 0.9665|$\pm $|0.0068 | 0.9716|$\pm $|0.0058 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.9186|$\pm $|0.0192 | 0.8824|$\pm $|0.0155 | 0.9301|$\pm $|0.0152 | 0.9146|$\pm $|0.0225 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9690|$\pm $|0.0067 | 0.9255|$\pm $|0.0094 | 0.9701|$\pm $|0.0061 | 0.9742|$\pm $|0.0138 | 0.9761|$\pm $|0.0051 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8852|$\pm $|0.0241 | 0.7751|$\pm $|0.0261 | 0.8758|$\pm $|0.0323 | 0.8655|$\pm $|0.0288 | 0.8597|$\pm $|0.0269 |
MNDR | 0.9192 |$\pm $|0.0127 | 0.8955|$\pm $|0.0178 | 0.9079|$\pm $|0.0145 | 0.9258|$\pm $|0.0152 | 0.9270|$\pm $|0.0143 | |
Recall | lncRNADisease | 0.8407|$\pm $|0.0340 | 0.8297|$\pm $|0.0286 | 0.8070|$\pm $|0.0374 | 0.8303|$\pm $|0.8303 | 0.8388|$\pm $|0.0332 |
MNDR | 0.9060|$\pm $|0.0167 | 0.8113|$\pm $|0.0242 | 0.9137|$\pm $|0.0165 | 0.8993|$\pm $|0.0200 | 0.9088|$\pm $|0.0169 | |
Accuracy | lncRNADisease | 0.8655|$\pm $|0.0199 | 0.7940|$\pm $|0.0222 | 0.8458|$\pm $|0.0258 | 0.8501|$\pm $|0.0208 | 0.8123|$\pm $|0.0384 |
MNDR | 0.9131|$\pm $|0.0102 | 0.8581|$\pm $|0.0119 | 0.9104|$\pm $|0.0115 | 0.9134|$\pm $|0.0111 | 0.9185|$\pm $|0.0110 | |
F1-score | lncRNADisease | 0.8619|$\pm $|0.0215 | 0.8011|$\pm $|0.0211 | 0.8394|$\pm $|0.0274 | 0.8470|$\pm $|0.0214 | 0.8485|$\pm $|0.0198 |
MNDR | 0.9124|$\pm $|0.0105 | 0.8510|$\pm $|0.0135 | 0.9106|$\pm $|0.0115 | 0.9121|$\pm $|0.0116 | 0.9177|$\pm $|0.0112 | |
AUC | lncRNADisease | 0.9182|$\pm $|0.0175 | 0.8542|$\pm $|0.0178 | 0.9195|$\pm $|0.0175 | 0.9154|$\pm $|0.0147 | 0.9271|$\pm $|0.0144 |
MNDR | 0.9661|$\pm $|0.0070 | 0.9038|$\pm $|0.0130 | 0.9665|$\pm $|0.0068 | 0.9716|$\pm $|0.0058 | 0.9722|$\pm $|0.0056 | |
AUPR | lncRNADisease | 0.9186|$\pm $|0.0192 | 0.8824|$\pm $|0.0155 | 0.9301|$\pm $|0.0152 | 0.9146|$\pm $|0.0225 | 0.9364|$\pm $|0.0157 |
MNDR | 0.9690|$\pm $|0.0067 | 0.9255|$\pm $|0.0094 | 0.9701|$\pm $|0.0061 | 0.9742|$\pm $|0.0138 | 0.9761|$\pm $|0.0051 |
The LDA prediction performance comparison of five boosting models under |$CV_{ind}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8531|$\pm $|0.0849 | 0.7961|$\pm $|0.0945 | 0.8797|$\pm $|0.0862 | 0.8555|$\pm $|0.0812 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9187|$\pm $|0.0445 | 0.9081|$\pm $|0.0560 | 0.9108|$\pm $|0.0504 | 0.9151|$\pm $|0.0495 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6894|$\pm $|0.1520 | 0.7625|$\pm $|0.1592 | 0.6482|$\pm $|0.1872 | 0.7040|$\pm $|0.1518 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8234|$\pm $|0.0920 | 0.7812|$\pm $|0.1339 | 0.8259|$\pm $|0.0849 | 0.8141|$\pm $|0.1288 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7812|$\pm $|0.0773 | 0.7749|$\pm $|0.0707 | 0.7762|$\pm $|0.0880 | 0.7872|$\pm $|0.0662 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8745|$\pm $|0.0498 | 0.8483|$\pm $|0.0605 | 0.8707|$\pm $|0.0439 | 0.8678|$\pm $|0.0639 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7503|$\pm $|0.1118 | 0.7645|$\pm $|0.0974 | 0.7260|$\pm $|0.1566 | 0.7596|$\pm $|0.0972 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8654|$\pm $|0.0594 | 0.8313|$\pm $|0.0834 | 0.8627|$\pm $|0.0546 | 0.8540|$\pm $|0.0945 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.8597|$\pm $|0.0646 | 0.8257|$\pm $|0.0844 | 0.8754|$\pm $|0.0687 | 0.8736|$\pm $|0.0630 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9454|$\pm $|0.0327 | 0.9025|$\pm $|0.0492 | 0.9436|$\pm $|0.0335 | 0.9448|$\pm $|0.04125 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8669|$\pm $|0.0690 | 0.8611|$\pm $|0.0677 | 0.8893|$\pm $|0.0665 | 0.8804|$\pm $|0.0578 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9501|$\pm $|0.0307 | 0.9248|$\pm $|0.0365 | 0.9488|$\pm $|0.0292 | 0.9490|$\pm $|0.0370 | 0.9621|$\pm $|0.0194 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8531|$\pm $|0.0849 | 0.7961|$\pm $|0.0945 | 0.8797|$\pm $|0.0862 | 0.8555|$\pm $|0.0812 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9187|$\pm $|0.0445 | 0.9081|$\pm $|0.0560 | 0.9108|$\pm $|0.0504 | 0.9151|$\pm $|0.0495 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6894|$\pm $|0.1520 | 0.7625|$\pm $|0.1592 | 0.6482|$\pm $|0.1872 | 0.7040|$\pm $|0.1518 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8234|$\pm $|0.0920 | 0.7812|$\pm $|0.1339 | 0.8259|$\pm $|0.0849 | 0.8141|$\pm $|0.1288 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7812|$\pm $|0.0773 | 0.7749|$\pm $|0.0707 | 0.7762|$\pm $|0.0880 | 0.7872|$\pm $|0.0662 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8745|$\pm $|0.0498 | 0.8483|$\pm $|0.0605 | 0.8707|$\pm $|0.0439 | 0.8678|$\pm $|0.0639 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7503|$\pm $|0.1118 | 0.7645|$\pm $|0.0974 | 0.7260|$\pm $|0.1566 | 0.7596|$\pm $|0.0972 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8654|$\pm $|0.0594 | 0.8313|$\pm $|0.0834 | 0.8627|$\pm $|0.0546 | 0.8540|$\pm $|0.0945 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.8597|$\pm $|0.0646 | 0.8257|$\pm $|0.0844 | 0.8754|$\pm $|0.0687 | 0.8736|$\pm $|0.0630 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9454|$\pm $|0.0327 | 0.9025|$\pm $|0.0492 | 0.9436|$\pm $|0.0335 | 0.9448|$\pm $|0.04125 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8669|$\pm $|0.0690 | 0.8611|$\pm $|0.0677 | 0.8893|$\pm $|0.0665 | 0.8804|$\pm $|0.0578 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9501|$\pm $|0.0307 | 0.9248|$\pm $|0.0365 | 0.9488|$\pm $|0.0292 | 0.9490|$\pm $|0.0370 | 0.9621|$\pm $|0.0194 |
The LDA prediction performance comparison of five boosting models under |$CV_{ind}$|
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8531|$\pm $|0.0849 | 0.7961|$\pm $|0.0945 | 0.8797|$\pm $|0.0862 | 0.8555|$\pm $|0.0812 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9187|$\pm $|0.0445 | 0.9081|$\pm $|0.0560 | 0.9108|$\pm $|0.0504 | 0.9151|$\pm $|0.0495 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6894|$\pm $|0.1520 | 0.7625|$\pm $|0.1592 | 0.6482|$\pm $|0.1872 | 0.7040|$\pm $|0.1518 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8234|$\pm $|0.0920 | 0.7812|$\pm $|0.1339 | 0.8259|$\pm $|0.0849 | 0.8141|$\pm $|0.1288 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7812|$\pm $|0.0773 | 0.7749|$\pm $|0.0707 | 0.7762|$\pm $|0.0880 | 0.7872|$\pm $|0.0662 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8745|$\pm $|0.0498 | 0.8483|$\pm $|0.0605 | 0.8707|$\pm $|0.0439 | 0.8678|$\pm $|0.0639 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7503|$\pm $|0.1118 | 0.7645|$\pm $|0.0974 | 0.7260|$\pm $|0.1566 | 0.7596|$\pm $|0.0972 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8654|$\pm $|0.0594 | 0.8313|$\pm $|0.0834 | 0.8627|$\pm $|0.0546 | 0.8540|$\pm $|0.0945 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.8597|$\pm $|0.0646 | 0.8257|$\pm $|0.0844 | 0.8754|$\pm $|0.0687 | 0.8736|$\pm $|0.0630 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9454|$\pm $|0.0327 | 0.9025|$\pm $|0.0492 | 0.9436|$\pm $|0.0335 | 0.9448|$\pm $|0.04125 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8669|$\pm $|0.0690 | 0.8611|$\pm $|0.0677 | 0.8893|$\pm $|0.0665 | 0.8804|$\pm $|0.0578 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9501|$\pm $|0.0307 | 0.9248|$\pm $|0.0365 | 0.9488|$\pm $|0.0292 | 0.9490|$\pm $|0.0370 | 0.9621|$\pm $|0.0194 |
. | Dataset . | XGBoost . | AdaBoost . | CatBoost . | LightGBM . | LDA-VGHB . |
---|---|---|---|---|---|---|
Precision | lncRNADisease | 0.8531|$\pm $|0.0849 | 0.7961|$\pm $|0.0945 | 0.8797|$\pm $|0.0862 | 0.8555|$\pm $|0.0812 | 0.8958|$\pm $|0.0744 |
MNDR | 0.9187|$\pm $|0.0445 | 0.9081|$\pm $|0.0560 | 0.9108|$\pm $|0.0504 | 0.9151|$\pm $|0.0495 | 0.9216|$\pm $|0.0453 | |
Recall | lncRNADisease | 0.6894|$\pm $|0.1520 | 0.7625|$\pm $|0.1592 | 0.6482|$\pm $|0.1872 | 0.7040|$\pm $|0.1518 | 0.7214|$\pm $|0.1518 |
MNDR | 0.8234|$\pm $|0.0920 | 0.7812|$\pm $|0.1339 | 0.8259|$\pm $|0.0849 | 0.8141|$\pm $|0.1288 | 0.8346|$\pm $|0.0834 | |
Accuracy | lncRNADisease | 0.7812|$\pm $|0.0773 | 0.7749|$\pm $|0.0707 | 0.7762|$\pm $|0.0880 | 0.7872|$\pm $|0.0662 | 0.8150|$\pm $|0.0744 |
MNDR | 0.8745|$\pm $|0.0498 | 0.8483|$\pm $|0.0605 | 0.8707|$\pm $|0.0439 | 0.8678|$\pm $|0.0639 | 0.8799|$\pm $|0.0374 | |
F1-score | lncRNADisease | 0.7503|$\pm $|0.1118 | 0.7645|$\pm $|0.0974 | 0.7260|$\pm $|0.1566 | 0.7596|$\pm $|0.0972 | 0.7869|$\pm $|0.1092 |
MNDR | 0.8654|$\pm $|0.0594 | 0.8313|$\pm $|0.0834 | 0.8627|$\pm $|0.0546 | 0.8540|$\pm $|0.0945 | 0.8723|$\pm $|0.0463 | |
AUC | lncRNADisease | 0.8597|$\pm $|0.0646 | 0.8257|$\pm $|0.0844 | 0.8754|$\pm $|0.0687 | 0.8736|$\pm $|0.0630 | 0.8924|$\pm $|0.0666 |
MNDR | 0.9454|$\pm $|0.0327 | 0.9025|$\pm $|0.0492 | 0.9436|$\pm $|0.0335 | 0.9448|$\pm $|0.04125 | 0.9576|$\pm $|0.0218 | |
AUPR | lncRNADisease | 0.8669|$\pm $|0.0690 | 0.8611|$\pm $|0.0677 | 0.8893|$\pm $|0.0665 | 0.8804|$\pm $|0.0578 | 0.9056|$\pm $|0.0579 |
MNDR | 0.9501|$\pm $|0.0307 | 0.9248|$\pm $|0.0365 | 0.9488|$\pm $|0.0292 | 0.9490|$\pm $|0.0370 | 0.9621|$\pm $|0.0194 |
The other performance comparison
In the proposed LDA-VGHB model, linear and nonlinear features were extracted to represent each lncRNA–disease pair based on SVD and VGAE, respectively. We analyzed their affects on the LDA identification performance. Figure 3 demonstrates the LDA-VGHB performance on two LDA datasets under four different 5-fold CVs when using linear features, nonlinear features or their combination. In most cases, the combination of linear features and nonlinear features improved the LDA prediction performance.

Affects of linear features, nonlinear features and their combination on performance. A–D denote the performance of LDA-VGHB when using the three types of features on the lncRNADisease database under |$CV_l$|, |$CV_d$|, |$CV_{ld}$| and |$CV_{ind}$|, respectively. E–H denote the performance of LDA-VGHB when using the three types of features on the MNDR database under |$CV_l$|, |$CV_d$|, |$CV_{ld}$| and |$CV_{ind}$|, respectively.
The parameter |$\alpha $| was used to measure the importance on the LDA identification performance. Thus, we analyzed the affect of the parameter |$\alpha $| at the range of [0,1] with the stepsize of 0 on the LDA prediction performance. As shown in Figure 4, when |$\alpha $| was set to 0.5, LDA-VGHB computed the best AUC and AUPR on the lncRNADisease and MNDR datasets under the four different CVs. Consequently, we set |$\alpha $| to 0.5. Tables S5-S8 in Supplementary Materials show the LDA-VGHB performance based on different |$\alpha $| under the four different 5-fold CVs.

The affect of the parameter |$\alpha $| on the LDA prediction performance. A-B, C-D, E-F and G-H denote AUC and AUPR of LDA-VGHB based on different |$\alpha $| values on the lncRNADisease and MNDR databases under |$CV_l$|, |$CV_d$|, |$CV_{ld}$| and |$CV_{ind}$|, respectively.
The obtained feature dimensions of lncRNAs and diseases are unknown. However, we failed to implement dimension reduction because they were not high-dimensional. Consequently, we analyzed affects of different dimensions (i.e. 5, 10, 16, 32, 50, and 64) on the LDA identification performance. By comprehensively considering six evaluation index values, as shown in Table 10, we selected different dimensions on different datasets.
CV . | Dataset . | Linear . | Nonlinear . |
---|---|---|---|
|$CV_l$| | lncRNADisease | 32 | 32 |
MNDR | 32 | 32 | |
|$CV_d$| | lncRNADisease | 10 | 10 |
MNDR | 16 | 16 | |
|$CV_{ld}$| | lncRNADisease | 16 | 16 |
MNDR | 10 | 10 | |
|$CV_{ind}$| | lncRNADisease | 10 | 10 |
MNDR | 50 | 50 |
CV . | Dataset . | Linear . | Nonlinear . |
---|---|---|---|
|$CV_l$| | lncRNADisease | 32 | 32 |
MNDR | 32 | 32 | |
|$CV_d$| | lncRNADisease | 10 | 10 |
MNDR | 16 | 16 | |
|$CV_{ld}$| | lncRNADisease | 16 | 16 |
MNDR | 10 | 10 | |
|$CV_{ind}$| | lncRNADisease | 10 | 10 |
MNDR | 50 | 50 |
CV . | Dataset . | Linear . | Nonlinear . |
---|---|---|---|
|$CV_l$| | lncRNADisease | 32 | 32 |
MNDR | 32 | 32 | |
|$CV_d$| | lncRNADisease | 10 | 10 |
MNDR | 16 | 16 | |
|$CV_{ld}$| | lncRNADisease | 16 | 16 |
MNDR | 10 | 10 | |
|$CV_{ind}$| | lncRNADisease | 10 | 10 |
MNDR | 50 | 50 |
CV . | Dataset . | Linear . | Nonlinear . |
---|---|---|---|
|$CV_l$| | lncRNADisease | 32 | 32 |
MNDR | 32 | 32 | |
|$CV_d$| | lncRNADisease | 10 | 10 |
MNDR | 16 | 16 | |
|$CV_{ld}$| | lncRNADisease | 16 | 16 |
MNDR | 10 | 10 | |
|$CV_{ind}$| | lncRNADisease | 10 | 10 |
MNDR | 50 | 50 |
Case Study
Lung cancer, breast cancer, colorectal cancer and kidney cancer are four of the most frequent cancers worldwide. They demonstrate high morbidity and mortality. In the above section, we have verified the LDA-VGHB performance. Subsequently, we selected LDA-VGHB to identify potential lncRNAs for the four cancers. Figure 5 illustrates the top 20 lncRNAs associated with the four cancers on the lncRNADisease and MNDR databases. Tables 11–14 list the rankings of the top 20 lncRNAs according to the association scores between them and a query cancer, respectively.

The predicted top 20 lncRNAs associated with lung cancer (A and B), breast cancer (C and D), colorectal cancer (E and F) and kidney neoplasms (G and H) on the lncRNADisease and MNDR databases. The solid line and dashed line denote a predicted LDA that can be validated and can not be validated.
The predicted top 20 lncRNAs associated with lung cancer on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | PSORS1C3 | RNADisease | 1 | HAR1A | Unknown |
2 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 2 | BOK-AS1 | Unknown |
3 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease | 3 | SNHG3 | Lnc2Cancer 3.0, RNADisease |
4 | WT1-AS | RNADisease | 4 | KCNQ1DN | Unknown |
5 | DAOA-AS1 | Unknown | 5 | IGF2-AS | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
6 | SNHG16 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | KCNQ1OT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
7 | NAMA | Unknown | 7 | DNM3OS | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | HULC | Lnc2Cancer 3.0, RNADisease |
9 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 9 | LINC00271 | Unknown |
10 | KCNQ1DN | Unknown | 10 | LINC00162 | Unknown |
11 | EPB41L4A-AS1 | Unknown | 11 | EPB41L4A-AS1 | Unknown |
12 | WRAP53 | Unknown | 12 | ESRG | Unknown |
13 | MIR31HG | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 13 | LINC00032 | Unknown |
14 | DANCR | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 14 | IFNG-AS1 | Unknown |
15 | IFNG-AS1 | Unknown | 15 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
16 | HAR1A | Unknown | 16 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | BC040587 | Unknown | 17 | ATXN8OS | Unknown |
18 | BACE1-AS | Unknown | 18 | WRAP53 | Unknown |
19 | BCAR4 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | ZFAT-AS1 | Unknown |
20 | DISC2 | Unknown | 20 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | PSORS1C3 | RNADisease | 1 | HAR1A | Unknown |
2 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 2 | BOK-AS1 | Unknown |
3 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease | 3 | SNHG3 | Lnc2Cancer 3.0, RNADisease |
4 | WT1-AS | RNADisease | 4 | KCNQ1DN | Unknown |
5 | DAOA-AS1 | Unknown | 5 | IGF2-AS | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
6 | SNHG16 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | KCNQ1OT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
7 | NAMA | Unknown | 7 | DNM3OS | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | HULC | Lnc2Cancer 3.0, RNADisease |
9 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 9 | LINC00271 | Unknown |
10 | KCNQ1DN | Unknown | 10 | LINC00162 | Unknown |
11 | EPB41L4A-AS1 | Unknown | 11 | EPB41L4A-AS1 | Unknown |
12 | WRAP53 | Unknown | 12 | ESRG | Unknown |
13 | MIR31HG | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 13 | LINC00032 | Unknown |
14 | DANCR | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 14 | IFNG-AS1 | Unknown |
15 | IFNG-AS1 | Unknown | 15 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
16 | HAR1A | Unknown | 16 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | BC040587 | Unknown | 17 | ATXN8OS | Unknown |
18 | BACE1-AS | Unknown | 18 | WRAP53 | Unknown |
19 | BCAR4 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | ZFAT-AS1 | Unknown |
20 | DISC2 | Unknown | 20 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
The predicted top 20 lncRNAs associated with lung cancer on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | PSORS1C3 | RNADisease | 1 | HAR1A | Unknown |
2 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 2 | BOK-AS1 | Unknown |
3 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease | 3 | SNHG3 | Lnc2Cancer 3.0, RNADisease |
4 | WT1-AS | RNADisease | 4 | KCNQ1DN | Unknown |
5 | DAOA-AS1 | Unknown | 5 | IGF2-AS | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
6 | SNHG16 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | KCNQ1OT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
7 | NAMA | Unknown | 7 | DNM3OS | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | HULC | Lnc2Cancer 3.0, RNADisease |
9 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 9 | LINC00271 | Unknown |
10 | KCNQ1DN | Unknown | 10 | LINC00162 | Unknown |
11 | EPB41L4A-AS1 | Unknown | 11 | EPB41L4A-AS1 | Unknown |
12 | WRAP53 | Unknown | 12 | ESRG | Unknown |
13 | MIR31HG | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 13 | LINC00032 | Unknown |
14 | DANCR | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 14 | IFNG-AS1 | Unknown |
15 | IFNG-AS1 | Unknown | 15 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
16 | HAR1A | Unknown | 16 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | BC040587 | Unknown | 17 | ATXN8OS | Unknown |
18 | BACE1-AS | Unknown | 18 | WRAP53 | Unknown |
19 | BCAR4 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | ZFAT-AS1 | Unknown |
20 | DISC2 | Unknown | 20 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | PSORS1C3 | RNADisease | 1 | HAR1A | Unknown |
2 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 2 | BOK-AS1 | Unknown |
3 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease | 3 | SNHG3 | Lnc2Cancer 3.0, RNADisease |
4 | WT1-AS | RNADisease | 4 | KCNQ1DN | Unknown |
5 | DAOA-AS1 | Unknown | 5 | IGF2-AS | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
6 | SNHG16 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | KCNQ1OT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
7 | NAMA | Unknown | 7 | DNM3OS | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | HULC | Lnc2Cancer 3.0, RNADisease |
9 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 9 | LINC00271 | Unknown |
10 | KCNQ1DN | Unknown | 10 | LINC00162 | Unknown |
11 | EPB41L4A-AS1 | Unknown | 11 | EPB41L4A-AS1 | Unknown |
12 | WRAP53 | Unknown | 12 | ESRG | Unknown |
13 | MIR31HG | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 13 | LINC00032 | Unknown |
14 | DANCR | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 14 | IFNG-AS1 | Unknown |
15 | IFNG-AS1 | Unknown | 15 | GHET1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
16 | HAR1A | Unknown | 16 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | BC040587 | Unknown | 17 | ATXN8OS | Unknown |
18 | BACE1-AS | Unknown | 18 | WRAP53 | Unknown |
19 | BCAR4 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | ZFAT-AS1 | Unknown |
20 | DISC2 | Unknown | 20 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
The predicted top 20 lncRNAs associated with breast cancerdraftrulesdr on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | WRAP53 | Unknown | 1 | ZFAT-AS1 | PMID: 21460236 |
2 | ATXN8OS | Lnc2Cancer 3.0, RNADisease | 2 | HAR1A | PMID: 26942882 |
3 | DNM3OS | RNADisease | 3 | BOK-AS1 | Unknown |
4 | ATP6V1G2-DDX39B | Unknown | 4 | RRP1B | Unknown |
5 | CBR3-AS1 | Lnc2Cancer 3.0, RNADisease | 5 | SCAANT1 | Unknown |
6 | DAOA-AS1 | Unknown | 6 | KCNQ1DN | Unknown |
7 | 7SK | Unknown | 7 | IGF2-AS | RNADisease |
8 | DLEU1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | DGCR5 | RNADisease | 9 | PTENP1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG3 | Lnc2Cancer 3.0, RNADisease | 10 | DNM3OS | Unknown |
11 | SNHG4 | Unknown | 11 | HULC | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | HCP5 | Lnc2Cancer 3.0 | 12 | LINC00162 | Unknown |
13 | TCL6 | Lnc2Cancer 3.0 | 13 | EPB41L4A-AS1 | Lnc2Cancer 3.0, RNADisease |
14 | KCNQ1DN | Unknown | 14 | ESRG | Unknown |
15 | HAR1B | Unknown | 15 | LINC00032 | Unknown |
16 | HNF1A-AS1 | RNADisease | 16 | CASC2 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | PINK1-AS | lncRNADisease v2.0, RNADisease | 17 | GHET1 | Lnc2Cancer 3.0, RNADisease |
18 | IGF2-AS | RNADisease | 18 | HIF1A-AS1 | Unknown |
19 | HIF1A-AS1 | Unknown | 19 | ATXN8OS | Unknown |
20 | PSORS1C3 | Unknown | 20 | MIR31HG | Lnc2Cancer 3.0, RNADisease |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | WRAP53 | Unknown | 1 | ZFAT-AS1 | PMID: 21460236 |
2 | ATXN8OS | Lnc2Cancer 3.0, RNADisease | 2 | HAR1A | PMID: 26942882 |
3 | DNM3OS | RNADisease | 3 | BOK-AS1 | Unknown |
4 | ATP6V1G2-DDX39B | Unknown | 4 | RRP1B | Unknown |
5 | CBR3-AS1 | Lnc2Cancer 3.0, RNADisease | 5 | SCAANT1 | Unknown |
6 | DAOA-AS1 | Unknown | 6 | KCNQ1DN | Unknown |
7 | 7SK | Unknown | 7 | IGF2-AS | RNADisease |
8 | DLEU1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | DGCR5 | RNADisease | 9 | PTENP1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG3 | Lnc2Cancer 3.0, RNADisease | 10 | DNM3OS | Unknown |
11 | SNHG4 | Unknown | 11 | HULC | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | HCP5 | Lnc2Cancer 3.0 | 12 | LINC00162 | Unknown |
13 | TCL6 | Lnc2Cancer 3.0 | 13 | EPB41L4A-AS1 | Lnc2Cancer 3.0, RNADisease |
14 | KCNQ1DN | Unknown | 14 | ESRG | Unknown |
15 | HAR1B | Unknown | 15 | LINC00032 | Unknown |
16 | HNF1A-AS1 | RNADisease | 16 | CASC2 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | PINK1-AS | lncRNADisease v2.0, RNADisease | 17 | GHET1 | Lnc2Cancer 3.0, RNADisease |
18 | IGF2-AS | RNADisease | 18 | HIF1A-AS1 | Unknown |
19 | HIF1A-AS1 | Unknown | 19 | ATXN8OS | Unknown |
20 | PSORS1C3 | Unknown | 20 | MIR31HG | Lnc2Cancer 3.0, RNADisease |
The predicted top 20 lncRNAs associated with breast cancerdraftrulesdr on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | WRAP53 | Unknown | 1 | ZFAT-AS1 | PMID: 21460236 |
2 | ATXN8OS | Lnc2Cancer 3.0, RNADisease | 2 | HAR1A | PMID: 26942882 |
3 | DNM3OS | RNADisease | 3 | BOK-AS1 | Unknown |
4 | ATP6V1G2-DDX39B | Unknown | 4 | RRP1B | Unknown |
5 | CBR3-AS1 | Lnc2Cancer 3.0, RNADisease | 5 | SCAANT1 | Unknown |
6 | DAOA-AS1 | Unknown | 6 | KCNQ1DN | Unknown |
7 | 7SK | Unknown | 7 | IGF2-AS | RNADisease |
8 | DLEU1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | DGCR5 | RNADisease | 9 | PTENP1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG3 | Lnc2Cancer 3.0, RNADisease | 10 | DNM3OS | Unknown |
11 | SNHG4 | Unknown | 11 | HULC | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | HCP5 | Lnc2Cancer 3.0 | 12 | LINC00162 | Unknown |
13 | TCL6 | Lnc2Cancer 3.0 | 13 | EPB41L4A-AS1 | Lnc2Cancer 3.0, RNADisease |
14 | KCNQ1DN | Unknown | 14 | ESRG | Unknown |
15 | HAR1B | Unknown | 15 | LINC00032 | Unknown |
16 | HNF1A-AS1 | RNADisease | 16 | CASC2 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | PINK1-AS | lncRNADisease v2.0, RNADisease | 17 | GHET1 | Lnc2Cancer 3.0, RNADisease |
18 | IGF2-AS | RNADisease | 18 | HIF1A-AS1 | Unknown |
19 | HIF1A-AS1 | Unknown | 19 | ATXN8OS | Unknown |
20 | PSORS1C3 | Unknown | 20 | MIR31HG | Lnc2Cancer 3.0, RNADisease |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | WRAP53 | Unknown | 1 | ZFAT-AS1 | PMID: 21460236 |
2 | ATXN8OS | Lnc2Cancer 3.0, RNADisease | 2 | HAR1A | PMID: 26942882 |
3 | DNM3OS | RNADisease | 3 | BOK-AS1 | Unknown |
4 | ATP6V1G2-DDX39B | Unknown | 4 | RRP1B | Unknown |
5 | CBR3-AS1 | Lnc2Cancer 3.0, RNADisease | 5 | SCAANT1 | Unknown |
6 | DAOA-AS1 | Unknown | 6 | KCNQ1DN | Unknown |
7 | 7SK | Unknown | 7 | IGF2-AS | RNADisease |
8 | DLEU1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | DGCR5 | RNADisease | 9 | PTENP1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG3 | Lnc2Cancer 3.0, RNADisease | 10 | DNM3OS | Unknown |
11 | SNHG4 | Unknown | 11 | HULC | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | HCP5 | Lnc2Cancer 3.0 | 12 | LINC00162 | Unknown |
13 | TCL6 | Lnc2Cancer 3.0 | 13 | EPB41L4A-AS1 | Lnc2Cancer 3.0, RNADisease |
14 | KCNQ1DN | Unknown | 14 | ESRG | Unknown |
15 | HAR1B | Unknown | 15 | LINC00032 | Unknown |
16 | HNF1A-AS1 | RNADisease | 16 | CASC2 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
17 | PINK1-AS | lncRNADisease v2.0, RNADisease | 17 | GHET1 | Lnc2Cancer 3.0, RNADisease |
18 | IGF2-AS | RNADisease | 18 | HIF1A-AS1 | Unknown |
19 | HIF1A-AS1 | Unknown | 19 | ATXN8OS | Unknown |
20 | PSORS1C3 | Unknown | 20 | MIR31HG | Lnc2Cancer 3.0, RNADisease |
The predicted top 20 lncRNAs associated with colorectal cancer on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | ZFAT-AS1 | Unknown | 1 | ESRG | PMID: 34896077, 31905146 |
2 | WRAP53 | Unknown | 2 | DGCR5 | Lnc2Cancer 3.0, RNADisease |
3 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 3 | KCNQ1DN | Unknown |
4 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 4 | BOK-AS1 | Unknown |
5 | DAOA-AS1 | Unknown | 5 | WRAP53 | Unknown |
6 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | DISC2 | Unknown |
7 | SNHG4 | RNADisease | 7 | ATP6V1G2-DDX39B | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | DNM3OS | Unknown |
9 | KCNQ1DN | Unknown | 9 | HAR1A | unknown |
10 | EPB41L4A-AS1 | RNADisease | 10 | IGF2-AS | Unknown |
11 | WT1-AS | Unknown | 11 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | TCL6 | Unknown | 12 | LINC00162 | Unknown |
13 | IFNG-AS1 | Unknown | 13 | LINC00032 | Unknown |
14 | HAR1A | unknown | 14 | SRA1 | Unknown |
15 | SNHG11 | Lnc2Cancer 3.0, RNADisease | 15 | EPB41L4A-AS1 | RNADisease |
16 | BC040587 | Unknown | 16 | PTENP1 | Unknown |
17 | BACE1-AS | Unknown | 17 | NRON | Unknown |
18 | DISC2 | Unknown | 18 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
19 | HNF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | 7SK | Unknown |
20 | DNM3OS | Unknown | 20 | ZFAT-AS1 | Unknown |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | ZFAT-AS1 | Unknown | 1 | ESRG | PMID: 34896077, 31905146 |
2 | WRAP53 | Unknown | 2 | DGCR5 | Lnc2Cancer 3.0, RNADisease |
3 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 3 | KCNQ1DN | Unknown |
4 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 4 | BOK-AS1 | Unknown |
5 | DAOA-AS1 | Unknown | 5 | WRAP53 | Unknown |
6 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | DISC2 | Unknown |
7 | SNHG4 | RNADisease | 7 | ATP6V1G2-DDX39B | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | DNM3OS | Unknown |
9 | KCNQ1DN | Unknown | 9 | HAR1A | unknown |
10 | EPB41L4A-AS1 | RNADisease | 10 | IGF2-AS | Unknown |
11 | WT1-AS | Unknown | 11 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | TCL6 | Unknown | 12 | LINC00162 | Unknown |
13 | IFNG-AS1 | Unknown | 13 | LINC00032 | Unknown |
14 | HAR1A | unknown | 14 | SRA1 | Unknown |
15 | SNHG11 | Lnc2Cancer 3.0, RNADisease | 15 | EPB41L4A-AS1 | RNADisease |
16 | BC040587 | Unknown | 16 | PTENP1 | Unknown |
17 | BACE1-AS | Unknown | 17 | NRON | Unknown |
18 | DISC2 | Unknown | 18 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
19 | HNF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | 7SK | Unknown |
20 | DNM3OS | Unknown | 20 | ZFAT-AS1 | Unknown |
The predicted top 20 lncRNAs associated with colorectal cancer on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | ZFAT-AS1 | Unknown | 1 | ESRG | PMID: 34896077, 31905146 |
2 | WRAP53 | Unknown | 2 | DGCR5 | Lnc2Cancer 3.0, RNADisease |
3 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 3 | KCNQ1DN | Unknown |
4 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 4 | BOK-AS1 | Unknown |
5 | DAOA-AS1 | Unknown | 5 | WRAP53 | Unknown |
6 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | DISC2 | Unknown |
7 | SNHG4 | RNADisease | 7 | ATP6V1G2-DDX39B | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | DNM3OS | Unknown |
9 | KCNQ1DN | Unknown | 9 | HAR1A | unknown |
10 | EPB41L4A-AS1 | RNADisease | 10 | IGF2-AS | Unknown |
11 | WT1-AS | Unknown | 11 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | TCL6 | Unknown | 12 | LINC00162 | Unknown |
13 | IFNG-AS1 | Unknown | 13 | LINC00032 | Unknown |
14 | HAR1A | unknown | 14 | SRA1 | Unknown |
15 | SNHG11 | Lnc2Cancer 3.0, RNADisease | 15 | EPB41L4A-AS1 | RNADisease |
16 | BC040587 | Unknown | 16 | PTENP1 | Unknown |
17 | BACE1-AS | Unknown | 17 | NRON | Unknown |
18 | DISC2 | Unknown | 18 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
19 | HNF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | 7SK | Unknown |
20 | DNM3OS | Unknown | 20 | ZFAT-AS1 | Unknown |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | ZFAT-AS1 | Unknown | 1 | ESRG | PMID: 34896077, 31905146 |
2 | WRAP53 | Unknown | 2 | DGCR5 | Lnc2Cancer 3.0, RNADisease |
3 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 3 | KCNQ1DN | Unknown |
4 | DSCAM-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 4 | BOK-AS1 | Unknown |
5 | DAOA-AS1 | Unknown | 5 | WRAP53 | Unknown |
6 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 6 | DISC2 | Unknown |
7 | SNHG4 | RNADisease | 7 | ATP6V1G2-DDX39B | Unknown |
8 | HCP5 | Lnc2Cancer 3.0, RNADisease | 8 | DNM3OS | Unknown |
9 | KCNQ1DN | Unknown | 9 | HAR1A | unknown |
10 | EPB41L4A-AS1 | RNADisease | 10 | IGF2-AS | Unknown |
11 | WT1-AS | Unknown | 11 | HIF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
12 | TCL6 | Unknown | 12 | LINC00162 | Unknown |
13 | IFNG-AS1 | Unknown | 13 | LINC00032 | Unknown |
14 | HAR1A | unknown | 14 | SRA1 | Unknown |
15 | SNHG11 | Lnc2Cancer 3.0, RNADisease | 15 | EPB41L4A-AS1 | RNADisease |
16 | BC040587 | Unknown | 16 | PTENP1 | Unknown |
17 | BACE1-AS | Unknown | 17 | NRON | Unknown |
18 | DISC2 | Unknown | 18 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
19 | HNF1A-AS1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 | 19 | 7SK | Unknown |
20 | DNM3OS | Unknown | 20 | ZFAT-AS1 | Unknown |
The predicted top 20 lncRNAs associated with kidney neoplasms on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | CBR3-AS1 | Unknown | 1 | SRA1 | Unknown |
2 | BOK-AS1 | Unknown | 2 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
3 | IFNG-AS1 | Unknown | 3 | CCAT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
4 | BC040587 | Unknown | 4 | HCP5 | Unknown |
5 | GHET1 | Unknown | 5 | HAR1B | Unknown |
6 | HULC | Unknown | 6 | DISC2 | Unknown |
7 | HAR1B | Unknown | 7 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
8 | DSCAM-AS1 | Unknown | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | HCP5 | Unknown | 9 | UCA1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG16 | Lnc2Cancer 3.0, RNADisease | 10 | LINC00271 | Unknown |
11 | WRAP53 | Unknown | 11 | ESRG | Unknown |
12 | RMST | Unknown | 12 | IFNG-AS1 | unknow |
13 | SNHG11 | RNADisease | 13 | SNHG11 | RNADisease |
14 | BCYRN1 | Unknown | 14 | SNHG16 | Lnc2Cancer 3.0, RNADisease |
15 | PDZRN3-AS1 | Unknown | 15 | WRAP53 | Unknown |
16 | TERC | Unknown | 16 | SCAANT1 | Unknown |
17 | TRAF3IP2-AS1 | RNADisease, lncRNADiseasev2.0 | 17 | SPRY4-IT1 | Lnc2Cancer 3.0, RNADisease |
18 | WT1-AS | Unknown | 18 | TRAF3IP2-AS1 | RNADisease, lncRNADisease v2.0 |
19 | XIST | Unknown | 19 | GHET1 | Unknown |
20 | HNF1A-AS1 | Unknown | 20 | LINC00162 | Unknown |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | CBR3-AS1 | Unknown | 1 | SRA1 | Unknown |
2 | BOK-AS1 | Unknown | 2 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
3 | IFNG-AS1 | Unknown | 3 | CCAT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
4 | BC040587 | Unknown | 4 | HCP5 | Unknown |
5 | GHET1 | Unknown | 5 | HAR1B | Unknown |
6 | HULC | Unknown | 6 | DISC2 | Unknown |
7 | HAR1B | Unknown | 7 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
8 | DSCAM-AS1 | Unknown | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | HCP5 | Unknown | 9 | UCA1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG16 | Lnc2Cancer 3.0, RNADisease | 10 | LINC00271 | Unknown |
11 | WRAP53 | Unknown | 11 | ESRG | Unknown |
12 | RMST | Unknown | 12 | IFNG-AS1 | unknow |
13 | SNHG11 | RNADisease | 13 | SNHG11 | RNADisease |
14 | BCYRN1 | Unknown | 14 | SNHG16 | Lnc2Cancer 3.0, RNADisease |
15 | PDZRN3-AS1 | Unknown | 15 | WRAP53 | Unknown |
16 | TERC | Unknown | 16 | SCAANT1 | Unknown |
17 | TRAF3IP2-AS1 | RNADisease, lncRNADiseasev2.0 | 17 | SPRY4-IT1 | Lnc2Cancer 3.0, RNADisease |
18 | WT1-AS | Unknown | 18 | TRAF3IP2-AS1 | RNADisease, lncRNADisease v2.0 |
19 | XIST | Unknown | 19 | GHET1 | Unknown |
20 | HNF1A-AS1 | Unknown | 20 | LINC00162 | Unknown |
The predicted top 20 lncRNAs associated with kidney neoplasms on lncRNADisease and MNDR
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | CBR3-AS1 | Unknown | 1 | SRA1 | Unknown |
2 | BOK-AS1 | Unknown | 2 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
3 | IFNG-AS1 | Unknown | 3 | CCAT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
4 | BC040587 | Unknown | 4 | HCP5 | Unknown |
5 | GHET1 | Unknown | 5 | HAR1B | Unknown |
6 | HULC | Unknown | 6 | DISC2 | Unknown |
7 | HAR1B | Unknown | 7 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
8 | DSCAM-AS1 | Unknown | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | HCP5 | Unknown | 9 | UCA1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG16 | Lnc2Cancer 3.0, RNADisease | 10 | LINC00271 | Unknown |
11 | WRAP53 | Unknown | 11 | ESRG | Unknown |
12 | RMST | Unknown | 12 | IFNG-AS1 | unknow |
13 | SNHG11 | RNADisease | 13 | SNHG11 | RNADisease |
14 | BCYRN1 | Unknown | 14 | SNHG16 | Lnc2Cancer 3.0, RNADisease |
15 | PDZRN3-AS1 | Unknown | 15 | WRAP53 | Unknown |
16 | TERC | Unknown | 16 | SCAANT1 | Unknown |
17 | TRAF3IP2-AS1 | RNADisease, lncRNADiseasev2.0 | 17 | SPRY4-IT1 | Lnc2Cancer 3.0, RNADisease |
18 | WT1-AS | Unknown | 18 | TRAF3IP2-AS1 | RNADisease, lncRNADisease v2.0 |
19 | XIST | Unknown | 19 | GHET1 | Unknown |
20 | HNF1A-AS1 | Unknown | 20 | LINC00162 | Unknown |
lncRNADisease . | MNDR . | ||||
---|---|---|---|---|---|
Rank . | lncRNA . | Evidence . | Rank . | lncRNA . | Evidence . |
1 | CBR3-AS1 | Unknown | 1 | SRA1 | Unknown |
2 | BOK-AS1 | Unknown | 2 | DLEU1 | Lnc2Cancer 3.0, RNADisease |
3 | IFNG-AS1 | Unknown | 3 | CCAT1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
4 | BC040587 | Unknown | 4 | HCP5 | Unknown |
5 | GHET1 | Unknown | 5 | HAR1B | Unknown |
6 | HULC | Unknown | 6 | DISC2 | Unknown |
7 | HAR1B | Unknown | 7 | SNHG3 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
8 | DSCAM-AS1 | Unknown | 8 | TUG1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
9 | HCP5 | Unknown | 9 | UCA1 | Lnc2Cancer 3.0, RNADisease, lncRNADisease v2.0 |
10 | SNHG16 | Lnc2Cancer 3.0, RNADisease | 10 | LINC00271 | Unknown |
11 | WRAP53 | Unknown | 11 | ESRG | Unknown |
12 | RMST | Unknown | 12 | IFNG-AS1 | unknow |
13 | SNHG11 | RNADisease | 13 | SNHG11 | RNADisease |
14 | BCYRN1 | Unknown | 14 | SNHG16 | Lnc2Cancer 3.0, RNADisease |
15 | PDZRN3-AS1 | Unknown | 15 | WRAP53 | Unknown |
16 | TERC | Unknown | 16 | SCAANT1 | Unknown |
17 | TRAF3IP2-AS1 | RNADisease, lncRNADiseasev2.0 | 17 | SPRY4-IT1 | Lnc2Cancer 3.0, RNADisease |
18 | WT1-AS | Unknown | 18 | TRAF3IP2-AS1 | RNADisease, lncRNADisease v2.0 |
19 | XIST | Unknown | 19 | GHET1 | Unknown |
20 | HNF1A-AS1 | Unknown | 20 | LINC00162 | Unknown |
In the lncRNADisease and MNDR databases, 10 and 7 lncRNAs have been verified by existing databases (Lnc2Cancer 3.0 [88], lncRNADisease v2.0 [29], RNADisease [89]) among the predicted top 20 lncRNAs associated with lung cancer, respectively. We predicted that HAR1A may associate with lung cancer with the rankings of 16 and 1 on the two databases, respectively. lncRNA HAR1A may be a tumor suppressor in many cancer including oral cancer, hepatocellular carcinoma, brease cancer and glioma [90–92]. Its knockdown boosted ALPK1 expression and downregulated BRD7, and further induce the progression of oral cancer [93]. Its expression has been examined in glioma, and was obviously lower in hepatocellular cancer than chronic hepatitis B [90, 91]. Its decreased expression could involve in poor prognosis of hepatocellular cancer [90]. Thus, we predicted that HAR1A could associate with lung cancer and need further experimental validation.
In the lncRNADisease and MNDR databases, 11 and 8 lncRNAs have been verified by existing three databases (Lnc2Cancer 3.0, lncRNADisease v2.0, RNADisease) among the predicted top 20 lncRNAs associated with breast cancer, respectively. We inferred that KCNQ1DN may associate with breast cancer with the rankings of 14 and 6, respectively. KCNQ1DN is an lncRNA with 1109 nucleotides. It is downregulated in the renalcell carcinoma tissues and could inhibit growth and progression of renalcell carcinoma cells [94]. Xin et al. [95] detected its expression in Wihns’ tumors and found that it may link with Wilms’ tumorigenesis along with IGF2.
In the lncRNADisease and MNDR databases, eight and four lncRNAs have been verified by three publicly available databases (Lnc2Cancer 3.0, lncRNADisease v2.0, RNADisease) among the predicted top 20 lncRNAs associated with colorectal cancer, respectively. In the two databases, we found that ZFAT-AS1 could associate with colorectal cancer. lncRNA ZFAT-AS1 is an antisense transcript of gene ZFAT, which encodes a protein that functions as a transcriptional regulator with respect to apoptosis and cell survival [96]. ZFAT-AS1 is prominently downregulated in glioma. Its over-expression could inhibit proliferation, migration and invasion, and accelerate apoptosis in glioma [97–99]. Its expression is downregulated in breast cancer [100], upregulated in hepatocellular, gastric, bladder and ovarian cancers and dysregulated in multiple malignant tumors [101]. In general, ZFAT-AS1 acts as a tumor-suppressive gene in many cancers including colorectal cancer and need further in vivo or in vitro experimental validation.
In the lncRNADisease and MNDR databases, two and 9nine lncRNAs have been reported by three publicly available databases (Lnc2Cancer 3.0, lncRNADisease v2.0, RNADisease) among the predicted top 20 lncRNAs associated with kidney neoplasms, respectively. In the two databases, we inferred that HAR1B could associate with kidney neoplasms. HAR1B helps the formation of stable RNA structures in human body [102]. Its expression is obviously lower in the hepatocellular carcinoma patients [90]. It can serve as a potential biomarker in bone and soft-tissue sarcomas [103]. Deregulated HAR1B has a greatly higher expression profile in aggressive colorectal cancers [104]. The association between kidney neoplasms and HAR1B needs further experimental confirmation.
CONCLUSION
In this study, we developed a computational model LDA-VGHB to investigate underlying LDAs. LDA-VGHB first extracted features of each lncRNA–disease pair by incorporating similarity computation, linear feature extraction based on SVD and nonlinear feature extraction based on VGAE. Subsequently, it used a heterogeneous Newton boosting machine to classify unobserved lncRNA–disease pairs. LDA-VGHB was compared with the other four classical LDA prediction methods (i.e. SDLDA [76], LDNFSGB [77], IPCARF [78] and LDASR [79]) and four popular boosting models (i.e. XGBoost [80], AdaBoost [81], CatBoost [82] and LightGBM [83]) under four 5-fold CVs on lncRNAs, diseases, lncRNA–disease pairs and independent lncRNAs and independent diseases, respectively. It significantly outperformed the eight methods with its best performance on the lncRNADisease and MNDR databases under the four different CVs. We further conducted case studies for lung cancer, breast cancer, colorectal cancer and kidney neoplasms and predicted the top 20 lncRNAs associated with them among all their unobserved lncRNAs. The results showed that most of the predicted top 20 lncRNAs have been verified by biomedical experiments provided by the Lnc2Cancer 3.0, lncRNADisease v2.0 and RNADisease databases. We inferred that HAR1A, KCNQ1DN, ZFAT-AS1 and HAR1B could associate with lung cancer, breast cancer, colorectal cancer and kidney neoplasms, respectively. The results need further biological experiment validation.
LDA-VGHB is developed to identify potential LDAs by incorporating feature extraction based on SVD and variational graph autoencoder and LDA classification based on heterogeneous Newton boosting machine.
Differing from traditional CV on lncRNA–disease pairs, the LDA-VGHB performance was assessed by comparing with four classical LDA prediction methods and four popular boosting models under 5-fold CVs on lncRNAs, diseases, lncRNA–disease pairs and independent lncRNAs and independent diseases.
Most of the predicted top 20 lncRNAs for lung cancer, breast cancer, colorectal cancer and kidney neoplasms have been verified by biomedical experiments provided by the Lnc2Cancer 3.0, lncRNADisease v2.0 and RNADisease databases. HAR1A, KCNQ1DN, ZFAT-AS1 and HAR1B could associate with the four cancers, respectively.
ACKNOWLEDGEMENTS
We would like to thank three anonymous reviewers and all authors of the cited references.
FUNDING
L.H.P. was supported by National Natural Science Foundation of China under Grant No. 61803151 and Natural Science Foundation of Hunan Province of China under Grant 2023JJ50201. M.C. was supported by National Natural Science Foundation of China under Grant No. 62172158. G.S.H. was supported by Natural Science Foundation of Hunan Province of China Grant 2021JJ30684 and Hunan Provincial Key Research Program (Grant No. 2022WK2009).
AUTHOR CONTRIBUTION STATEMENT
L.H.P. and L.L.H.: conceptualization; L.H.P., Q.L.S., G.T., M.C. and G.S.H.: funding acquisition; L.H.P., L.L.H., Q.L.S., G.T., M.C. and G.S.H.: project administration; L.L.H.: writing-original draft; L.H.P., L.L.H. and G.S.H.: writing-review and editing; L.H.P., L.L.H., Q.L.S., G.T., M.C. and G.S.H.: investigation; L.H.P. and L.L.H.: methodology; L.L.H.: software; L.L.H., Q.L.S., G.T., M.C., G.S.H.: validation. All authors contributed to the article and approved the submitted version.
DATA AVAILABILITY STATEMENT
Datasets and codes can be downloaded at https://github.com/plhhnu/LDA-VGHB.
Author Biographies
Lihong Peng is working in Hunan University of Technology as an associate professor. She received a PhD in College of Information Science and Engineering, Hunan University, China. Her research interests include Machine Learning, Data Mining and Bioinformatics.
Liangliang Huang is a postgraduate student in the School of Computer Science, Hunan University of Technology, China. His research interests include Machine Learning and Bioinformatics.
Qiongli Su is working in the Department of Pharmacy, the Affiliated Zhuzhou Hospital Xiangya Medical College CSU. Her research interests include Cardiovascular, tumor and thrombotic diseases.
Geng Tian is the chief executive officer in Geneis (Beijing) Co. Ltd, China. His research interests include Tumor Precise Medicine and Bioinformatics.
Min Chen is working in Hunan Institute of Technology as a professor. He received a PhD in the College of Information Science and Engineering, Hunan University, China. Her research interests include Machine Learning, Data Mining and Bioinformatics.
Guosheng Han is working in Xiangtan University as an associate professor. He received a PhD in School of Mathematics and Computational Science, Xiangtan University, China. His research interests include Machine Learning, Data Mining and Bioinformatics.
References
Author notes
Lihong Peng and Liangliang Huang contributed equally to this work and share first authorship.