Abstract

Long non-coding RNAs (lncRNAs) have been the subject of intensive recent study owing to their association with various human diseases. It is desirable to build artificial intelligence-based models that predict diseases or tissues from lncRNA data, which would be useful in disease diagnosis and therapy. The accuracy and robustness of existing machine learning models leave room for improvement. In this study, we propose a deep learning model, Multi-Label Classification with Deep Forest (MLCDForest), to address multi-label tissue prediction for a given lncRNA; it can be regarded as an extension of the deep forest model to multi-label classification. MLCDForest uses a sequential multi-label-grained scanning method, which distinguishes it from the standard deep forest model: the labels are trained sequentially with label correlation taken into account. A systematic comparison on lncRNA-disease association datasets demonstrates that our method consistently outperforms state-of-the-art methods in disease prediction. By considering label correlation in the sequential multi-label-grained scanning, our model provides a powerful tool for multi-label classification and tissue prediction from given lncRNAs.

Introduction

Long non-coding RNAs (lncRNAs) play critical roles in many biological processes [1] and are associated with a wide range of human diseases, such as diabetes [2], cardiovascular diseases [3], HIV [4], neurological disorders [5] and cancers, including lung cancer [6], breast cancer [7] and prostate cancer [8]. Understanding disease- or tissue-associated lncRNAs provides a new perspective on deciphering disease mechanisms, developing novel drugs and personalizing medication [9].

Known associations between lncRNAs and diseases are rare. Compared with experimental identification of lncRNA-disease associations, computational approaches are much more efficient [10, 11]. Methods for predicting lncRNA-disease associations fall into two groups: network models [12, 13], which identify novel associations through a network representation of lncRNAs and diseases, and machine learning models [9–11, 14–18], such as dual-network integrated logistic matrix factorization [11], LRLSLDA, a semi-supervised learning method [16], an SVM based on lncRNA-lncRNA and disease-disease similarities [17], and NCPHLDA, based on network consistency projection [18]. In general, all these approaches combine machine learning classification algorithms with prior knowledge of diseases. However, more than 227 human diseases are associated with 266 lncRNAs [19], which makes lncRNA-disease association a multi-label classification problem. Although many modified models have been proposed in recent years to ease these challenges, more accurate and robust methods for multi-label classification are still needed.

As a branch of supervised learning, multi-label classification addresses problems in which an instance is associated with one or more labels [20]. Current multi-label classification algorithms fall into two main groups: problem transformation and algorithm adaptation. Problem transformation methods convert the problem into a series of single-label binary or single-label multi-class classification tasks [21]. The most representative problem transformation models are the binary relevance (BR) method and the label powerset (LP) method [21]. The random k-labelsets (RAkEL) method divides a large label set into a number of small random subsets, and LP is then used to train an easier single-label multi-class classifier on each subset [22]. Among algorithm adaptation methods, multi-label k-nearest neighbor (ML-kNN) and back-propagation multi-label learning (BPMLL) [20] are widely applied.

Deep neural networks (DNNs) have achieved great success in natural language processing and visual recognition. However, their heavy demands on training-data size and hyperparameter tuning skill limit their application to multi-label classification. gcForest was proposed as an alternative to DNNs: a multi-layer cascade framework with multiple random forests in each layer [23]. Two ensemble components, multi-grained scanning and the cascade forest, are employed in the framework. gcForest has limitations on biological data: manually defining different types of forests may increase the risk of overfitting, and feature importance is ignored. To overcome these limitations, the boosting cascade deep forest (BCDForest) [24] was proposed for multi-class classification of cancer subtypes. In BCDForest, a multi-class-grained scanning strategy extends multi-grained scanning to improve ensemble diversity by considering the training data of different classes, and in each layer a boosting strategy accounts for feature importance during forest learning.

In this study, we propose MLCDForest, a deep forest extended to multi-label classification, for predicting lncRNA-tissue associations with label correlations used as prior information [25, 26]. In each layer, the estimated class distribution is employed in training each forest. Finally, the votes of multiple weak classifiers determine the class of a test sample. Experimental results show that the proposed method outperforms the other machine learning methods on the dataset.

The rest of this paper is organized as follows: the Methods section describes MLCDForest and details how it uses label correlations as prior information; the Experiments and results section presents the experimental process and results; finally, summaries and conclusions are given.

Methods

In this section, the proposed method MLCDForest is presented in detail. The multi-label classification problem is defined first, then the correlation between labels is introduced, and finally the proposed model is detailed as an extension of gcForest with label correlation considered.

Multi-label classification

For a multi-label classification dataset $(X, Y)$, $n$ is the number of instances, $X$ stands for the attributes and $Y$ for the labels. Given the label space $Y = \{Y_1, Y_2, \cdots, Y_m\}$, each instance $x_i$ with $k$ lncRNA features is assigned a subset of labels from $Y$.
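To make the notation concrete, here is a minimal sketch (ours, not from the paper) of this data layout: $n$ instances, $k$ features and a binary indicator matrix over $m$ labels, where each row of the label matrix encodes the label subset of one instance.

```python
import numpy as np

n, k, m = 5, 3, 4                  # toy sizes: instances, lncRNA features, labels
X = np.random.rand(n, k)           # attribute matrix X, one row per instance
Y = np.array([[1, 0, 1, 0],        # Y[i, j] = 1 iff label Y_j is assigned to x_i;
              [0, 1, 0, 0],        # each row is the label subset of that instance
              [1, 1, 0, 1],
              [0, 0, 1, 0],
              [1, 0, 0, 0]])
print(X.shape, Y.shape)            # (5, 3) (5, 4)
```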

Label correlations and concurrence

Label correlations

For a multinomial sample on an $r \times c$ contingency table, Cramér's V statistic [27, 28] is a popular association measure for nominal random variables:

$$V=\sqrt{\frac{\hat{\phi}^2}{\min (r-1,\,c-1)}} \qquad (1)$$

in which the mean square contingency ${\phi}^2$ is estimated as

$$\hat{\phi}^2=\sum_{i=1}^{r}\sum_{j=1}^{c}\frac{\left(p_{ij}-p_{i+}p_{+j}\right)^2}{p_{i+}p_{+j}} \qquad (2)$$

where $p_{ij}$ is the estimated proportion in cell $(i,j)$, and $p_{i+}$ and $p_{+j}$ are the summations of these proportions over the corresponding subscript.
To correct the bias of Cramér's V statistic, the corrected Cramér's V statistic [29] was proposed as

$$\tilde{V}=\sqrt{\frac{\tilde{\phi}_{+}^2}{\min (\tilde{r}-1,\,\tilde{c}-1)}} \qquad (3)$$

in which $\tilde{\phi}_{+}^2=\max\left(0,\hat{\phi}^2-\frac{(r-1)(c-1)}{n-1}\right)$, $\tilde{r}=r-\frac{(r-1)(c-1)}{n-1}$ and $\tilde{c}=c-\frac{(r-1)(c-1)}{n-1}$.

The corrected Cramér's V statistic is employed in this study to evaluate the association between each pair of labels.
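As a concrete illustration, the following is a sketch (with our own function name, not the paper's code) of the bias-corrected Cramér's V of Eqs (1)-(3), computed from an $r \times c$ contingency table such as the 2 × 2 co-occurrence table of a pair of binary tissue labels.

```python
import numpy as np

def corrected_cramers_v(table):
    """Bias-corrected Cramér's V (Bergsma [29]) for an r x c contingency table."""
    table = np.asarray(table, dtype=float)
    n = table.sum()
    r, c = table.shape
    p = table / n                                  # cell proportions p_ij
    pi, pj = p.sum(axis=1), p.sum(axis=0)          # marginals p_i+ and p_+j
    expected = np.outer(pi, pj)
    phi2 = np.sum((p - expected) ** 2 / expected)  # mean square contingency, Eq (2)
    # bias correction of Eq (3)
    phi2_plus = max(0.0, phi2 - (r - 1) * (c - 1) / (n - 1))
    r_t = r - (r - 1) * (c - 1) / (n - 1)
    c_t = c - (r - 1) * (c - 1) / (n - 1)
    return np.sqrt(phi2_plus / min(r_t - 1, c_t - 1))

# co-occurrence counts of two binary tissue labels over 100 lncRNAs (toy numbers)
print(corrected_cramers_v([[50, 10], [8, 32]]))    # ~0.62
```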

Label concurrence

The level of imbalance between different labels can be measured by the imbalance ratio [30]. $\mathrm{SCUMBLE}$ [30, 31] is another measure, which considers both the imbalance ratio and the sparsity of the labels:

$$\mathrm{SCUMBLE}=\frac{1}{n}\sum_{i=1}^{n}\left[1-\frac{1}{\overline{\mathrm{IRLbl}}_i}\left(\prod_{l\in \mathrm{labelset}_i}\mathrm{IRLbl}(l)\right)^{1/|\mathrm{labelset}_i|}\right] \qquad (4)$$

in which $\mathrm{labelset}_i \subseteq Y$ is the set of labels active in instance $i$, $\mathrm{IRLbl}(l)$ is the imbalance ratio of label $l$ and $\overline{\mathrm{IRLbl}}_i$ is its mean over $\mathrm{labelset}_i$. The significance of $\mathrm{SCUMBLE}$ can be measured with the standard coefficient of variation (CV) of $\mathrm{SCUMBLE}$; a larger standard CV means higher differences in concurrence among instances.
In multi-label classification data, most labels are imbalanced. In common practice, the imbalance ratio $\mathrm{IRLbl}(y)$, the ratio between the frequency of the most frequent label and that of label $y$, measures the imbalance of a single label. The overall imbalance of the labels is measured by $\mathrm{MeanIR}$ [32], the mean of the imbalance ratios over all labels of $Y$, and the standard coefficient of variation (CV) measures the significance of $\mathrm{MeanIR}$:

$$\mathrm{MeanIR}=\frac{1}{|Y|}\sum_{y\in Y}\mathrm{IRLbl}(y) \qquad (5)$$

and

$$\mathrm{CV}=\frac{1}{\mathrm{MeanIR}}\sqrt{\frac{\sum_{y\in Y}\left(\mathrm{IRLbl}(y)-\mathrm{MeanIR}\right)^2}{|Y|-1}} \qquad (6)$$
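The following sketch (our own code, following Charte et al. [30–32] and the assumptions behind Eqs (4)-(6)) computes IRLbl per label, MeanIR and its CV, and a per-instance SCUMBLE as the gap between the geometric and arithmetic means of the IRLbl values of the labels active in each instance.

```python
import numpy as np

def imbalance_measures(Y):
    """IRLbl, MeanIR (Eq 5), CV (Eq 6) and mean SCUMBLE (Eq 4) of a 0/1 label matrix.
    Assumes every label appears in at least one instance."""
    counts = Y.sum(axis=0).astype(float)       # instances per label
    irlbl = counts.max() / counts              # IRLbl(y) for each label
    mean_ir = irlbl.mean()                     # MeanIR, Eq (5)
    cv_ir = irlbl.std(ddof=1) / mean_ir        # standard CV of MeanIR, Eq (6)
    scumble = []
    for row in Y:                              # per-instance SCUMBLE, Eq (4)
        active = irlbl[row == 1]
        if active.size == 0:
            scumble.append(0.0)
        else:                                  # 1 - geometric mean / arithmetic mean
            scumble.append(1.0 - np.prod(active) ** (1.0 / active.size) / active.mean())
    return irlbl, mean_ir, cv_ir, float(np.mean(scumble))

Y = np.array([[1, 0, 1, 0], [0, 1, 0, 0], [1, 1, 0, 1], [1, 0, 1, 0]])
print(imbalance_measures(Y))
```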

Framework of the proposed methodology

As an alternative to deep neural networks (DNNs), the deep forest exploits class distribution features through multi-grained scanning and a cascade forest.

Multi-grained scanning

In the first step, multi-grained scanning obtains class distributions from low-dimensional feature vectors generated by a sliding window [33]; this has proven effective for recognizing local features. As shown in Figure 1A and B, suppose the training data contain $n$ instances with 100 raw features and 4 binary labels (for multi-level labels, we follow the approach of [24]). Multi-grained scanning is performed for each label with a 50-dimensional window slid by one feature at a time. To account for the correlation between labels, scanning for the first label takes the input features together with the other three labels, producing 54 feature vectors; 53 feature vectors are generated for each of the remaining three labels. The extracted instances are trained with a completely random tree forest and a random forest to generate class vectors, leading to an 852-dimensional ($(54+53+53+53)\times 2\times 2$) transformed feature vector.
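A simplified sketch (ours) of this scanning step for the first label follows: the 100 raw features are concatenated with the other three labels (103 inputs), a 50-dim window slid by one position gives 103 − 50 + 1 = 54 windows, and each window is passed to two forests whose class-probability outputs are concatenated. Unlike the real model, which uses out-of-fold class vectors, this sketch fits and predicts on the same data purely to illustrate the shapes.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier

def scan_windows(features, window):
    """Slide a window of `window` columns by one position at a time."""
    d = features.shape[1]
    return [features[:, s:s + window] for s in range(d - window + 1)]

rng = np.random.default_rng(0)
X = rng.random((200, 100))                 # 200 instances, 100 raw features
Y = rng.integers(0, 2, size=(200, 4))      # 4 binary labels
target = 0
others = np.delete(Y, target, axis=1)      # the other 3 labels join the input
augmented = np.hstack([X, others])         # 100 + 3 = 103-dim input
windows = scan_windows(augmented, 50)      # 54 windows for the first label

transformed = []
for w in windows:
    # a completely random forest (approximated here by extra-trees) and a random forest
    for forest in (ExtraTreesClassifier(n_estimators=10, random_state=0),
                   RandomForestClassifier(n_estimators=10, random_state=0)):
        transformed.append(forest.fit(w, Y[:, target]).predict_proba(w))
transformed = np.hstack(transformed)
print(transformed.shape)                   # (200, 54 windows * 2 forests * 2 classes) = (200, 216)
```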

Figure 1

(A) Illustration of multi-grained scanning in training. Suppose there are four labels, raw features are 100-dim, and the sliding window is 50-dim in the training of the first label. (B) Illustration of multi-grained scanning in prediction, under the same setting.

Considering the joint probabilities of the labels given an input $x$, the inference model for binary classification is

(7)

in which $y_i\in \{0,1\}$, $p(y_i\mid x)$ is the probability of the independent binary classification of label $i$, $k,j\in \{i\mid y_i=1\}$, and $p(y_k,y_j)$ is the pairwise probability of each pair of labels, which is based on the correlation coefficient. The joint probability is discounted when a pair of labels is co-dependent. The multi-grained scanning outputs of the labels are combined in the classification and prediction of each label. To enhance classification performance, the loss from each classifier may be transmitted when its predicted probability is incorporated in the classification of another label. The correlated loss (CL) when label $j$'s prediction is incorporated into the classification of label $i$ can be computed as
$$\mathrm{CL}_{i\leftarrow j}=H\cdot p(y_i,y_j)\cdot \mathrm{CE}_j \qquad (8)$$

where $p(y_i,y_j)$ is the correlation coefficient between labels $i$ and $j$, calculated with the corrected Cramér's V statistic, and $\mathrm{CE}_j$ is the cross-entropy loss [34] of label $j$:

$$\mathrm{CE}_j=-\sum_{x}p(x)\log q(x) \qquad (9)$$

in which $p$ is the expected output and $q$ is the actual output:

(10)

$H$ is a transform gate that controls the rate of correlated loss transmitted from label $j$.
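The sketch below (our reading of Eqs (8)-(9); the gate value is an assumption) computes the correlated loss passed from label $j$ to label $i$: the cross-entropy of label $j$, scaled by the pairwise correlation $p(y_i, y_j)$ and by the transform gate $H$.

```python
import numpy as np

def cross_entropy(p, q, eps=1e-12):
    """CE between expected distribution p and actual output q, Eq (9)."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return float(-np.sum(p * np.log(q + eps)))

def correlated_loss(corr_ij, p_expected_j, q_actual_j, gate=0.5):
    """CL transmitted from label j to label i, Eq (8); `gate` plays the role of H."""
    return gate * corr_ij * cross_entropy(p_expected_j, q_actual_j)

# label j's one-hot target vs its predicted class distribution, correlation 0.42
print(correlated_loss(0.42, [1, 0], [0.7, 0.3]))
```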

As shown in Figure 1B, in the prediction phase, the probability of each label is first predicted with a traditional random forest and concatenated to the raw features. The probabilities of the labels in the test data are predicted in order of concurrence: labels with low concurrence rates are processed first.

Cascade forest

In the layer-wise cascade forest, random forests, as powerful classifiers, are ensembled in each layer. In the classification of each label, feature importance is considered under the assumption that discriminative features should take higher weights; for strongly correlated labels, such discriminative features may also contribute to the classification of the other labels. Boosted class distribution vectors are generated by two kinds of random forests (completely random forests and partial random forests) in both the multi-grained scanning and cascade forest stages. The performance of each layer is evaluated with k-fold cross-validation [35, 36] to reduce the risk of overfitting, and cascade growth terminates when there is no significant improvement in the performance of the whole cascade on the validation set.
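A sketch (ours) of this growth rule: each layer's class vectors are produced out-of-fold, the layer is scored with k-fold cross-validation, the class vectors are concatenated to the raw features for the next layer, and growth stops once the cross-validated score no longer improves significantly.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_predict, cross_val_score

def grow_cascade(X, y, max_layers=10, tol=1e-3, k=5):
    features, best, layers = X, -np.inf, []
    for _ in range(max_layers):
        forests = [ExtraTreesClassifier(n_estimators=100, random_state=0),
                   RandomForestClassifier(n_estimators=100, random_state=0)]
        # out-of-fold class vectors avoid leaking training labels into the next layer
        class_vecs = [cross_val_predict(f, features, y, cv=k, method="predict_proba")
                      for f in forests]
        score = np.mean([cross_val_score(f, features, y, cv=k).mean() for f in forests])
        if score <= best + tol:        # no significant improvement: stop growing
            break
        layers.append([f.fit(features, y) for f in forests])   # refit for prediction
        best = score
        features = np.hstack([X] + class_vecs)                 # augment for next layer
    return layers, best

rng = np.random.default_rng(0)
layers, acc = grow_cascade(rng.random((150, 20)), rng.integers(0, 2, 150))
print(len(layers), acc)
```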

Overall procedure of MLCDForest

As in gcForest, there are two main components in the MLCDForest framework. In the multi-grained scanning part, the transformed feature representations are produced by different forests, and in the cascade forest, layer-wise random forests learn more discriminative representations. Figure 2 illustrates MLCDForest for the first label. Two window sizes (50 and 80) are used in multi-grained scanning on the 100-dimensional data, yielding $(54+53+53+53)\times 2\times 2 = 852$- and $(24+23+23+23)\times 2\times 2 = 372$-dimensional feature vectors, respectively. Combining these feature vectors over the different labels together with the correlation statistics gives a 1224-dimensional transformed feature vector when there are only four labels. In the cascade forest, the cascade-wise random forests are learnt on this 1224-dimensional feature vector, and the process terminates when the performance on the validation set no longer improves significantly.
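The dimensions quoted above can be verified with a one-line check (ours): for a window of size $w$, the first label sees $104 - w$ windows (103 inputs) and each of the other three labels sees $103 - w$ windows (102 inputs), each window contributing 2 forests × 2 classes.

```python
# window sizes 50 and 80 over 100 raw features plus the other labels
dims = [((104 - w) + 3 * (103 - w)) * 2 * 2 for w in (50, 80)]
print(dims, sum(dims))   # [852, 372] 1224
```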

Figure 2

Overall procedure of MLCDForest. Suppose there are two classes, raw features are 100-dim, and the sliding windows are 50-dim and 80-dim.

For any test instance, a 1224-dimensional representation vector generated by multi-grained scanning is the input to the cascade forest, and the final prediction for each label is obtained by taking the class with the maximum aggregated value.

Performance measures

The performance of the different multi-label classification methods is evaluated with example-based, label-based and ranking-based measures [31]. To evaluate the generalization ability of MLCDForest, we use cross-validation in this study.

Example-based performance evaluation

Following the performance evaluation practice of Madjarov et al. [37], accuracy, precision, recall, $F$-measure, Hamming loss and subset accuracy are adopted in this study to compare the example-based performance of the different methods for multi-label lncRNA-disease association classification:

$$\mathrm{Accuracy}=\frac{1}{n}\sum_{i=1}^{n}\frac{|Y_i\cap Z_i|}{|Y_i\cup Z_i|} \qquad (11)$$

$$\mathrm{Precision}=\frac{1}{n}\sum_{i=1}^{n}\frac{|Y_i\cap Z_i|}{|Y_i|} \qquad (12)$$

$$\mathrm{Recall}=\frac{1}{n}\sum_{i=1}^{n}\frac{|Y_i\cap Z_i|}{|Z_i|} \qquad (13)$$

$$F\textrm{-}\mathrm{measure}=\frac{1}{n}\sum_{i=1}^{n}\frac{2\,|Y_i\cap Z_i|}{|Y_i|+|Z_i|} \qquad (14)$$

$$\mathrm{Hamming\ loss}=\frac{1}{n}\sum_{i=1}^{n}\frac{|Y_i\,\Delta\, Z_i|}{m} \qquad (15)$$

$$\mathrm{Subset\ accuracy}=\frac{1}{n}\sum_{i=1}^{n}[\![Y_i=Z_i]\!] \qquad (16)$$

in which $\cap$ denotes the intersection of two sets, $\cup$ their union and $\Delta$ the symmetric difference between the predicted and true label sets; $Y_i$ is the set of predicted labels for the $i$-th instance, $Z_i$ is the set of true labels, $m$ is the number of labels, and $[\![Y_i=Z_i]\!]=1$ if $Y_i=Z_i$ is TRUE and 0 otherwise.
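A compact implementation sketch (ours) of these example-based measures on 0/1 matrices, with `Y` the predicted label matrix and `Z` the true one as in the notation above:

```python
import numpy as np

def example_based(Y, Z):
    """Eqs (11)-(16) for predicted labels Y and true labels Z (0/1 matrices)."""
    Y, Z = np.asarray(Y, bool), np.asarray(Z, bool)
    inter = (Y & Z).sum(axis=1)                    # |Y_i ∩ Z_i|
    union = np.maximum((Y | Z).sum(axis=1), 1)     # |Y_i ∪ Z_i|, guarded against 0
    return {
        "accuracy": np.mean(inter / union),
        "precision": np.mean(inter / np.maximum(Y.sum(axis=1), 1)),
        "recall": np.mean(inter / np.maximum(Z.sum(axis=1), 1)),
        "f_measure": np.mean(2 * inter / np.maximum(Y.sum(axis=1) + Z.sum(axis=1), 1)),
        "hamming_loss": np.mean((Y ^ Z).sum(axis=1) / Y.shape[1]),
        "subset_accuracy": np.mean((Y == Z).all(axis=1)),
    }

Y_pred = np.array([[1, 0, 1, 0], [0, 1, 1, 0]])
Z_true = np.array([[1, 0, 0, 0], [0, 1, 1, 0]])
print(example_based(Y_pred, Z_true))
```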

Label-based performance evaluation

There are two approaches to aggregating the values over the labels: macro-averaging and micro-averaging. Macro-averaging computes the evaluation for each label independently and then averages over labels. In the micro-average approach, the evaluation is based on the counts of true positives (TP), false positives (FP), true negatives (TN) and false negatives (FN) computed over all labels. The two aggregations are calculated as

$$\mathrm{Macro}=\frac{1}{m}\sum_{j=1}^{m}\mathrm{evaluateMetric}\left(\mathrm{TP}_j,\mathrm{FP}_j,\mathrm{TN}_j,\mathrm{FN}_j\right) \qquad (17)$$

$$\mathrm{Micro}=\mathrm{evaluateMetric}\left(\sum_{j=1}^{m}\mathrm{TP}_j,\sum_{j=1}^{m}\mathrm{FP}_j,\sum_{j=1}^{m}\mathrm{TN}_j,\sum_{j=1}^{m}\mathrm{FN}_j\right) \qquad (18)$$

in which precision, recall and $F$-measure are the evaluation metrics considered for $\mathrm{evaluateMetric}(\ast)$.

Ranking-based performance evaluation

For the ranking-based performance evaluation, One error, Coverage and Average Precision are adopted and computed as follows:

$$\mathrm{One\ error}=\frac{1}{n}\sum_{i=1}^{n}[\![\,\arg\min_{y\in Y}\mathit{rank}(x_i,y)\notin Z_i\,]\!] \qquad (19)$$

$$\mathrm{Coverage}=\frac{1}{n}\sum_{i=1}^{n}\max_{y\in Z_i}\mathit{rank}(x_i,y)-1 \qquad (20)$$

$$\mathrm{Average\ Precision}=\frac{1}{n}\sum_{i=1}^{n}\frac{1}{|Z_i|}\sum_{y\in Z_i}\frac{|\{y^{\prime}\in Z_i\mid \mathit{rank}(x_i,y^{\prime})\le \mathit{rank}(x_i,y)\}|}{\mathit{rank}(x_i,y)} \qquad (21)$$

in which $\mathit{rank}(x_i,y)$ refers to the rank position of label $y$ for instance $x_i$ (the top-scored label has rank 1) and $Z_i$ is the set of true labels of the $i$-th instance.
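A sketch (ours) of these ranking measures, computed from a matrix `S` of per-label scores (higher means more relevant, rank 1 is the top label) and the true label matrix `Z`; every instance is assumed to have at least one relevant label.

```python
import numpy as np

def ranking_based(S, Z):
    """One error (Eq 19), Coverage (Eq 20) and Average Precision (Eq 21)."""
    S, Z = np.asarray(S, float), np.asarray(Z, bool)
    n = S.shape[0]
    ranks = (-S).argsort(axis=1).argsort(axis=1) + 1   # rank(x_i, y), top label = 1
    one_error = np.mean([not Z[i, S[i].argmax()] for i in range(n)])
    coverage = np.mean([ranks[i, Z[i]].max() - 1 for i in range(n)])
    avg_precision = np.mean([
        np.mean([(ranks[i, Z[i]] <= r).sum() / r for r in ranks[i, Z[i]]])
        for i in range(n)
    ])
    return one_error, coverage, avg_precision

S = np.array([[0.9, 0.2, 0.6], [0.1, 0.8, 0.3]])       # per-label scores
Z = np.array([[1, 0, 1], [0, 1, 1]])                   # true labels
print(ranking_based(S, Z))                             # (0.0, 1.0, 1.0)
```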

Experiments and results

To evaluate the effectiveness of our proposed method, MLCDForest was compared with other multi-label classification methods, namely deep back-propagation neural network (DBPNN) [38, 39], RAkEL [22], MLkNN, BR and BPMLL [20], on the lncRNA dataset of [40].

Datasets and hyperparameters

Data on associations between specific diseases and lncRNAs [40] were used to construct the proposed method. From the data downloaded from http://biomecis.uta.edu/~ashis/res/csps2014/suppl/, 7566 lncRNA transcripts with 22 tissue labels were selected from the Human Body Map Project [41], which provides annotation and expression information for 21,626 distinct lncRNAs. Eighty-nine composition-based features and 21 secondary structure-based features were identified with a tissue-specificity threshold and used as the input features for tissue classification. Details can be found in [40].

In the experiments, the data were divided into training and test sets in a ratio of 80% to 20% following a stratified approach [42]. Multi-grained scanning used 500 trees in each forest, and the cascade forest used 1000 trees by default. In both multi-grained scanning and the cascade forest, two completely random forests and two partial random forests were used for training and prediction. In the partial random forests, $\sqrt{d}$ of the features were selected as candidates and split by Gini value. Fivefold cross-validation was used to evaluate the overall accuracy and counter over-fitting. For comparison with MLCDForest, DBPNN was run in MEKA [39] with random forest as the base classifier and the other hyperparameters set as recommended in MEKA. In [40], BR and RAkEL are based on sequential minimal optimization with support vector machines (SVMs) as the base classifier, the number of nearest neighbors in MLkNN is 10, and 10-fold cross-validation was conducted.
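For reference, a sketch of the stratified 80/20 split using the scikit-multilearn package [42]; the feature and label matrices here are random placeholders for the real 110 features and 22 tissue labels.

```python
import numpy as np
from skmultilearn.model_selection import iterative_train_test_split

rng = np.random.default_rng(0)
X = rng.random((7566, 110))                # placeholder for the 110 input features
Y = rng.integers(0, 2, size=(7566, 22))    # placeholder for the 22 tissue labels
X_train, Y_train, X_test, Y_test = iterative_train_test_split(X, Y, test_size=0.2)
print(X_train.shape, X_test.shape)         # 80% / 20% iteratively stratified split
```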

One major advantage of the multi-label learning framework is the ability to exploit label correlations. The bias-corrected Cramér's V statistic was calculated for all label pairs and depicted in a heat map (Figure 3A). Label concurrence is depicted in Figure 3B and Supplementary Table S1. Twenty out of the 22 tissues have a standard CV of $\mathrm{SCUMBLE}$ below 0.8, 14 tissues below 0.75 and 8 tissues below 0.7; the tissues Foreskin_R and LF_r2 have standard CVs of 0.366 and 0.157, respectively. Details on the standard CV of $\mathrm{SCUMBLE}$ can be found in Supplementary Table S1, and the pairwise corrected Cramér's V statistics and pairwise intersection plots between all labels are provided in Supplementary Table S2 and Supplementary Figure S1.

Figure 3

(A) Heat map of the correlation between tissues. (B) Concurrence of tissues based on SCUMBLE.

Performance comparison

Performance comparisons of MLCDForest and the other multi-label classifiers are presented in Tables 1–3. MLCDForest achieved the best performance. In the example-based evaluation (Table 1), it improved accuracy by about 13% over MLkNN [40] and precision by about 16% over BR [40]. It gave about a 10% improvement in the label-based evaluation (Table 2) and similar performance in the ranking-based evaluation (Table 3). Consistent with the performance of the standard gcForest reported in [23] for single-label classification, the neural network (DBPNN) achieved better precision than the other baselines in the example-based and label-based evaluations, but did not match MLCDForest on any evaluation aspect. This is because the dataset [40] is small-scale biological data, and deep neural networks are highly dependent on dataset scale.

Label-wise evaluation

Using the multi-label learning models, a label-wise analysis was also performed to check the performance for each label. The performance of MLCDForest under 5-fold cross-validation is reported in Table 4.

Table 1

Example-based evaluation of the predictive performance of different multi-label classifiers

Method       Hamming loss  Accuracy  Precision  Recall  F1-measure  Subset accuracy
MLCDForest   0.1145        0.6978    0.8402     0.7400  0.6978      0.3347
DBPNN        0.2118        0.4811    0.8258     0.4999  0.5997      0.1636
RAkEL [40]   0.2032        0.5471    0.7781     0.6133  0.6409      0.1980
MLkNN [40]   0.1970        0.5610    0.7599     0.6486  0.6627      0.1807
BR [40]      0.2048        0.5441    0.7804     0.6050  0.6405      0.1965
BPMLL [40]   0.2241        0.5191    0.6900     0.6660  0.6412      0.1006
Table 2

Label-based evaluation of the predictive performance of different multi-label classifiers

             Micro-avg                      Macro-avg
Method       Precision  Recall  F1          Precision  Recall  F1
MLCDForest   0.8603     0.7947  0.8262      0.8496     0.7170  0.7682
DBPNN        0.8556     0.4586  0.5971      0.7610     0.3160  0.3726
RAkEL [40]   0.7680     0.6219  0.6872      0.6625     0.4948  0.5494
MLkNN [40]   0.7698     0.6439  0.7011      0.6998     0.5237  0.5804
BR [40]      0.7766     0.6029  0.6788      0.5787     0.4588  0.5058
BPMLL [40]   0.7043     0.6497  0.6754      0.5753     0.4827  0.4700
Table 3

Ranking-based evaluation of the predictive performance of different multi-label classifiers

Method       One error  Coverage  Average Precision
MLCDForest   0.1503     12.5520   0.8024
DBPNN        0.2052     15.4786   0.7249
RAkEL [40]   0.2865     14.2869   0.7382
MLkNN [40]   0.1075     11.5804   0.8155
BR [40]      0.2959     14.5713   0.7274
BPMLL [40]   0.1034     12.5604   0.7867
Table 4

Label-wise analysis of MLCDForest

Tissue             Accuracy  Precision  Recall  F1 score  ROC AUC
Adipose            0.8902    0.8788     0.7477  0.8080    0.8375
Adrenal            0.8936    0.9066     0.8573  0.8813    0.8845
Brain              0.8601    0.8555     0.8054  0.8297    0.7770
Brain_R            0.9393    0.9016     0.6445  0.7517    0.5561
Breast             0.8751    0.8303     0.8594  0.8446    0.8501
Colon              0.8705    0.8302     0.7483  0.7871    0.8236
Foreskin_R         0.9503    0.8205     0.4507  0.5818    0.7215
Heart              0.8902    0.8875     0.7058  0.7863    0.6596
LF_r1              0.8994    0.8868     0.5465  0.6763    0.6556
LF_r2              0.9740    0.7727     0.3208  0.4533    0.7186
Kidney             0.8480    0.8585     0.7665  0.8099    0.7995
Liver              0.8728    0.8062     0.5347  0.6430    0.5528
Lung               0.8231    0.7484     0.8223  0.7836    0.8439
Lymph node         0.8751    0.8318     0.8393  0.8356    0.9018
Ovary              0.8480    0.9048     0.7432  0.8160    0.7531
Placenta_R         0.9104    0.8095     0.5113  0.6267    0.7615
Prostate           0.8549    0.8327     0.8557  0.8441    0.8825
Skeletal muscle    0.8954    0.8320     0.6103  0.7041    0.6988
Testes             0.8936    0.8922     0.9944  0.9405    0.8336
Testes_R           0.9029    0.9288     0.9297  0.9292    0.9601
Thyroid            0.8699    0.8577     0.8420  0.8498    0.8661
White blood cell   0.8647    0.8187     0.6381  0.7172    0.7342

Discussion

As an alternative to deep learning, the deep forest has proven very powerful in single-label classification in practice. However, most practical biological classification problems are multi-label. As a novel extension and application of the standard deep forest model (gcForest), our method emphasizes the correlation and concurrence of labels in the data transformation, and it is shown to be effective for multi-label classification of lncRNA-disease associations. MLCDForest provides an effective option for investigating multi-label classification with deep learning on small-scale biological datasets.

Based on the data on associations between specific diseases and lncRNAs [40], MLCDForest was compared with other multi-label classification approaches on these performance metrics, and the proposed model achieved the best performance on the dataset with the original features. In the present study, we made an initial, rough attempt to incorporate label correlation in a sequential manner into the deep forest framework. In further work, we will test the proposed approach under stricter experimental settings and apply it to similar bioinformatics problems. Given the various types of associations among ncRNAs, between ncRNAs and diseases, between ncRNAs and drug targets, and between small molecules and ncRNAs, as well as genome analysis applications [43–48], further evaluation should be performed on other independent datasets (e.g., miRNA-circRNA associations).

Key Points
  • Predicting lncRNA-tissue associations using computational methods is very important for disease diagnosis and therapy.

  • Label correlation is considered in multi-label classification within the MLCDForest framework.

Funding

Dong-Qing Wei is supported by grants from the Key Research Area Grant 2016YFA0501703 of the Ministry of Science and Technology of China, the National Natural Science Foundation of China (contract nos. 61832019 and 61503244), the Science and Technology Commission of Shanghai Municipality (grant 19430750600), the Natural Science Foundation of Henan Province (162300410060) and the Joint Research Funds for Medical and Engineering and Scientific Research at Shanghai Jiao Tong University (YG2017ZD14). The computations were partially performed at the Peng Cheng Laboratory and the Center for High-Performance Computing, Shanghai Jiao Tong University.

Wei Wang is a PhD student at the School of Mathematical Sciences, Shanghai Jiao Tong University. He works on statistical learning algorithms for drug discovery.

Qiuying Dai is a PhD student at the School of Life Sciences and Biotechnology, Shanghai Jiao Tong University. She works on predicting circRNA-disease associations through machine learning methods.

Fang Li is a lecturer at the School of Life Sciences and Biotechnology, Shanghai Jiao Tong University. She works on drug discovery through machine learning methods and molecular simulation.

Yi Xiong is an associate professor at the School of Life Sciences and Biotechnology, Shanghai Jiao Tong University. His main research interests focus on machine learning algorithms and their applications in the protein sequence–structure–function relationship and biomedicine.

Dong-Qing Wei is a full professor at the School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, State Key Laboratory of Microbial Metabolism and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University and Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong. His main research areas include structural bioinformatics and biomedicine.

References

1. Guttman M, Amit I, Garber M, et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 2009;458:223–7.
2. Pasmant E, Sabbagh A, Vidaud M, et al. ANRIL, a long, noncoding RNA, is an unexpected major hotspot in GWAS. FASEB J 2011;25:444–8.
3. Congrains A, Kamide K, Oguro R, et al. Genetic variants at the 9p21 locus contribute to atherosclerosis through modulation of ANRIL and CDKN2A/B. Atherosclerosis 2012;220:449–55.
4. Zhang Q, Chen C-Y, Yedavalli VSRK, et al. NEAT1 long noncoding RNA and paraspeckle bodies modulate HIV-1 posttranscriptional expression. MBio 2013;4:e00596-12.
5. Johnson R. Long non-coding RNAs in Huntington's disease neurodegeneration. Neurobiol Dis 2012;46:245–54.
6. Ji P, Diederichs S, Wang W, et al. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene 2003;22:8031–41.
7. Gupta RA, Shah N, Wang KC, et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature 2010;464:1071–6.
8. Széll M, Bata-Csörgo Z, Kemény L. The enigmatic world of mRNA-like ncRNAs: their role in human evolution and in human diseases. Semin Cancer Biol 2008;18:141–8.
9. Chen X, Yan CC, Zhang X, et al. Long non-coding RNAs and complex diseases: from experimental results to computational models. Brief Bioinform 2017;18:558–76.
10. Fan XN, Zhang SW, Zhang SY, et al. Prediction of lncRNA-disease associations by integrating diverse heterogeneous information sources with RWR algorithm and positive pointwise mutual information. BMC Bioinform 2019;20:87.
11. Li Y, Li J, Bian N. DNILMF-LDA: prediction of lncRNA-disease associations by dual-network integrated logistic matrix factorization and Bayesian optimization. Genes (Basel) 2019;10:608.
12. Zhang J, Zhang Z, Chen Z, et al. Integrating multiple heterogeneous networks for novel lncRNA-disease association inference. IEEE/ACM Trans Comput Biol Bioinform 2019;16:396–406.
13. Yang X, Gao L, Guo X, et al. A network based method for analysis of lncRNA-disease associations and prediction of lncRNAs implicated in diseases. PLoS One 2014;9:1–10.
14. Sun J, Shi H, Wang Z, et al. Inferring novel lncRNA-disease associations based on a random walk model of a lncRNA functional similarity network. Mol Biosyst 2014;10:2074–81.
15. Ou-Yang L, Huang J, Zhang XF, et al. LncRNA-disease association prediction using two-side sparse self-representation. Front Genet 2019;10:476.
16. Chen X, Yan G-Y. Novel human lncRNA–disease association inference based on lncRNA expression profiles. Bioinformatics 2013;29:2617–24.
17. Fu G, Wang J, Domeniconi C, et al. Matrix factorization-based data fusion for the prediction of lncRNA-disease associations. Bioinformatics 2018;34:1529–37.
18. Xie G, Huang Z, Liu Z, et al. NCPHLDA: a novel method for human lncRNA-disease association prediction based on network consistency projection. Mol Omi 2019;15:442–50.
19. Chen G, Wang Z, Wang D, et al. LncRNADisease: a database for long-non-coding RNA-associated diseases. Nucleic Acids Res 2013;41:983–6.
20. Tsoumakas G, Katakis I. Multi-label classification: an overview. Int J Data Warehous Min 2007;3:1–13.
21. Zhang ML, Zhou ZH. A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng 2014;26:1819–37.
22. Tsoumakas G, Vlahavas I. Random k-labelsets: an ensemble method for multilabel classification. In: Proceedings of the 18th European Conference on Machine Learning (ECML'07), 2007.
23. Zhou ZH, Feng J. Deep forest: towards an alternative to deep neural networks. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), 2017.
24. Guo Y, Liu S, Li Z, et al. BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data. BMC Bioinform 2018;19:118.
25. Yu Y, Pedrycz W, Miao D. Multi-label classification by exploiting label correlations. Expert Syst Appl 2014;41:2989–3004.
26. Huang S-J, Zhou Z-H. Multi-label learning by exploiting label correlations locally. In: Proceedings of the 26th AAAI Conference on Artificial Intelligence, 2012, 949–55.
27. Cramér H. Mathematical Methods of Statistics. Princeton: Princeton University Press, 1946.
28. Sheskin D. Handbook of Parametric and Nonparametric Statistical Procedures. Boca Raton: Chapman & Hall/CRC, 2011.
29. Bergsma W. A bias-correction for Cramér's V and Tschuprow's T. J Korean Stat Soc 2013;42:323–8.
30. Charte F, Rivera A, del Jesus MJ, et al. Concurrence among imbalanced labels and its influence on multilabel resampling algorithms. In: Lecture Notes in Computer Science (LNAI), Vol. 8480. Springer, 2014, 110–21.
31. Charte F, Charte D. Working with multilabel datasets in R: the mldr package. R J 2015;7:149–62.
32. Charte F, Rivera AJ, del Jesus MJ, et al. Addressing imbalance in multilabel classification: measures and random resampling algorithms. Neurocomputing 2015;163:3–16.
33. Zhou Z-H, Feng J. Deep forest: towards an alternative to deep neural networks. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Melbourne, Australia, 2017, 3553–9.
34. De Boer P-T, Kroese DP, Rubinstein RY. A tutorial on the cross-entropy method. Ann Oper Res 2005;134:19–67.
35. Rao RB, Fung G, Rosales R. On the dangers of cross-validation: an experimental evaluation. In: Proceedings of the 8th SIAM International Conference on Data Mining, 2008, 588–96.
36. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning, 2nd edn. New York: Springer, 2009.
37. Madjarov G, Kocev D, Gjorgjevikj D, et al. An extensive experimental comparison of methods for multi-label learning. Pattern Recognit 2012;45:3084–104.
38. Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science 2006;313:504–7.
39. Read J, Reutemann P, Pfahringer B, et al. MEKA: a multi-label/multi-target extension to WEKA. J Mach Learn Res 2016;17:1–5.
40. Biswas AK, Zhang B, Wu X, et al. A multi-label classification framework to predict disease associations of long non-coding RNAs (lncRNAs). In: Lecture Notes in Electrical Engineering, Vol. 322. Cham: Springer, 2015, 821–30.
41. Cabili M, Trapnell C, Goff L, et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev 2011;25:1915–27.
42. Szymański P, Kajdanowicz T. Scikit-multilearn: a scikit-based Python environment for performing multi-label classification. J Mach Learn Res 2019;20:209–30.
43. Fang Z, Lei X. Prediction of miRNA-circRNA associations based on k-NN multi-label with random walk restart on a heterogeneous network. Big Data Min Anal 2019;2:261–72.
44. Yu N, Li Z, Yu Z. Survey on encoding schemes for genomic data representation and feature learning—from signal processing to machine learning. Big Data Min Anal 2018;1:191–210.
45. Chen X, Guan N-N, Sun Y-Z, et al. MicroRNA-small molecule association identification: from experimental results to computational models. Brief Bioinform 2018;21:47–61.
46. Lin YC, Lee YC, Chang KL, et al. Analysis of common targets for circular RNAs. BMC Bioinformatics 2019;20:372.
47. Wang WT, Han C, Sun YM, et al. Noncoding RNAs in cancer therapy resistance and targeted drug development. J Hematol Oncol 2019;12:1–15.
48. Ling H, Fabbri M, Calin GA. MicroRNAs and other non-coding RNAs as targets for anticancer drug development. Nat Rev Drug Discov 2013;12:847–65.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Supplementary data