Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method

Abstract

Motivation: For establishing prognostic predictors of various diseases using DNA microarray analysis technology, it is desired to find selectively significant genes for constructing the prognostic model and it is also necessary to eliminate non-specific genes or genes with error before constructing the model.

Results: We applied projective adaptive resonance theory (PART) to gene screening for DNA microarray data. Genes selected by PART were subjected to our FNN-SWEEP modeling method for the construction of a cancer class prediction model. The model performance was evaluated through comparison with a conventional screening signal-to-noise (S2N) method or nearest shrunken centroids (NSC) method. The FNN-SWEEP predictor with PART screening could discriminate classes of acute leukemia in blinded data with 97.1% accuracy and classes of lung cancer with 90.0% accuracy, while the predictor with S2N was only 85.3 and 70.0% or the predictor with NSC was 88.2 and 90.0%, respectively. The results have proven that PART was superior for gene screening.

Availability: The software is available upon request from the authors.

Contact: [email protected]

INTRODUCTION

Recent advances in DNA microarray technologies have made it possible to measure the expression levels of thousands of genes simultaneously. These gene expression data are useful in the diagnosis and prognosis of diseases. Most approaches to computational analysis of gene expression data are functionally significant classifications of genes in unsupervised learning methods, e.g. k-means (Somogyi, 1999), hierarchical clustering (Eisen et al., 1998), self-organized maps (SOMs) (Tamayo et al., 1999) and Fuzzy adaptive resonance theory (ART) (Tomida et al., 2002b). On the basis of the expression pattern of classified genes, disease diagnosis or prognosis is carried out. In contrast, supervised learning methods use training sets to specify the genes that should be clustered together (Brown et al., 2000).

An artificial neural network (ANN) is one of the supervised learning methods, and it has been utilized in medical research as a powerful tool for accurately detecting the causal relationship between variables (Tomida et al., 2002a; Gruvberger et al., 2001; Xu et al., 2002). Fuzzy neural network (FNN) is one of the advanced ANN models, and its most attractive feature is that causality between input and output variables can be described extremely accurately as explicit IF-THEN rules obtained from the constructed model (Noguchi et al., 2001). However, it takes a very long time to analyze thousands of gene expression data by FNN. In our previous work, for the purpose of dealing with thousands of genes, we developed the FNN combined with the SWEEP operator method (FNN-SWEEP method) (Ando et al., 2002). The FNN-SWEEP method has been used for microarray analysis and has proved to be a precise, simple tool for predicting patients' survival (Ando et al., 2002, 2003a,b). However, the expression data comprise a huge number of genes including experimental error as well as non-specific genes. In cases where artificial non-specific gene expression data with random noise were added to real gene expression data, the FNN-SWEEP method sometimes selected such artificial non-specific genes, and the FNN model constructed using such selected genes showed high prediction accuracy for estimation data (data not shown). This result shows that the FNN-SWEEP method is sensitive for non-specific genes and genes with error. Therefore, it is necessary to identify selectively significant genes and to eliminate non-specific genes and genes with error before modeling. Many researchers have tried to extract significant genes from microarray data without a priori knowledge, for example through clustering. However, successful results have not yet been obtained by the method to eliminate genes, proposed until now. In the present paper, we apply the projective adaptive resonance theory (PART) (Cao and Wu, 2002) to gene expression profiles in order to eliminate non-specific genes or genes with error. Genes selected by PART were subjected to the FNN-SWEEP modeling method to construct a cancer class prediction model (predictor). The results of the modeling were evaluated through comparison with those models that did not apply screening, using the conventional screening method, S2N or NSC.

MATERIALS AND METHODS

Data processing

In the present study, we used two kinds of gene expression profiles. The first is the gene expression profiles reported by Golub et al. (1999). These gene expression data consist of 72 bone marrow samples [47 acute lymphoblastic leukemia (ALL) and 25 acute myeloid leukemia (AML)], which were obtained from acute leukemia patients at the first time of diagnosis. RNAs prepared from bone marrow mononuclear cells were hybridized to high-density oligonucleotide microarrays (Affymetrix) containing probes for 7129 human genes. From this dataset, we selected 5401 genes with wide expression variation, which are of more than 50 SD. The selection was carried to eliminate genes that had little difference in gene expression among all patients. Furthermore, these data were separated into two groups; 38 samples (27 ALL and 11 AML) and 34 samples (20 ALL and 14 AML). The former group was used as a modeling dataset for constructing the class prediction model (predictor) and the latter was used as a blinded dataset for evaluating the constructed predictor.

The second is the gene expression profile reported by Bhattacharjee et al. (2001). The gene expression data consist of 203 lung tumor samples [139 adenocarcinoma, 21 squamous cell lung carcinomas, 20 pulmonary carcinoids, 6 small cell lung carcinomas (SCLC) and 17 normal lung samples]. RNAs were hybridized to high-density oligonucleotide microarrays (Affymetrix) containing probes for 12600 human genes. From these gene expression data, we selected 3312 genes by the same criterion mentioned above. These samples were separated into a modeling dataset consisting of 198 samples and a blinded dataset consisting of five samples (one sample from each class). An FNN model was constructed from 198 modeling data, and 5 blinded data were predicted. This procedure was repeated five times under the condition that the same sample was not to be selected as blinded data. Since the SCLC sample was only 6, more than 6 blinded data sets cannot be prepared. The accuracy of blinded data was calculated as the average of six times predictions (Table 2).

To apply these acute leukemia and lung cancer data to PART, the expression intensity of each gene was normalized so that the mean value was 0 and the SD 1.

Evaluation of extracted genes by PART

To evaluate genes extracted by PART, we constructed four kinds of FNN class predictors. First, we applied PART to the modeling dataset and extracted 253 genes from the 5401 genes of the acute leukemia data. Then, class predictor genes were selected by FNN-SWEEP from genes extracted by PART (predictor 1). Second, the predictor genes were selected directly from 5401 genes without screening (predictor 2). Third, the genes were selected from the 5401 genes by S2N instead of using PART, and class predictor genes were selected form these genes (predictors 3 and 4). Fourth, the genes were selected by nearest shrunken centroids (NSC) method instead of using PART, and class predictor genes were selected form these genes (predictor 5). These four kinds of predictors were compared with one another with respect to their prediction accuracy of blinded data. Similarly, in the case of lung cancer data, predictor 6 was constructed by FNN-SWEEP from 387 genes (in the case of one set among six datasets) selected by PART, predictor 7 from 3312 genes without screening, predictor 8 from the same number of genes with PART selected by MaxS2N and predictor 9 from the genes by NSC. The outline of the predictor construction is shown in Figure 1.

Projective adaptive resonance theory model

PART was proposed to find projected clusters for datasets in high-dimensional spaces. The architecture is based on the well-known ART developed by Carpenter and Grossberg (1987), and a major modification is provided in order to deal with the inherent sparsity in the full space of the data points (Fig. 2).

In the present paper, PART (Cao and Wu, 2002) was used as a screening method, which enables the elimination of non-specific dimensions for clustering from high-dimensional data, while conventional clustering methods cannot eliminate correctly. The learning procedure of PART is briefly described below.

PART includes comparison layers F₁ and clustering layers F₂, which are connected by the bottom-up weight z_ij and top-down weight z_ji (i = 1, …, m, dimension number of input; j = 1, …, n, category number). PART is controlled by a vigilance parameter ρ and a distance parameter σ (Cao and Wu, 2002). Input vector I has dimensions corresponding to the sampling genes. First of all, an input I_i itself is provided as a weight z_ji, and z_ij is set according to the following equations.

\[{z}_{ij}=L/\left(L-1+m\right),\left(1\right)\]

where m denotes dimension number of input, L is the constant parameter, which is higher than one.

As a first step, a winner category for each input vector I is determined as follows. The function T_j of category j is defined as Equation (2), which indicates the similarity between input vector I and weight vector z_j.

\[{T}_{j}={\displaystyle \sum _{i=1}^{m}}{z}_{ij}{h}_{ij}\left({I}_{i},{z}_{ji}\right),\left(2\right)\]

where

\[{h}_{ij}\left({I}_{i},{z}_{ji}\right)={h}_{\sigma }\left({I}_{i},{z}_{ji}\right)l\left({z}_{ij}\right),\left(3\right)\]

where

\[{h}_{\sigma }\left({I}_{i},{z}_{ji}\right)=\{\begin{array}{cc}1& \hbox{ if|<| |>| }\left|{I}_{i}-{z}_{ji}\right|\le \sigma \\ 0& \hbox{ if|<| |>| }\left|{I}_{i}-{z}_{ji}\right| > \sigma .\end{array}\left(4\right)\]

\[l\left({z}_{ij}\right)=\{\begin{array}{cc}1& \hbox{ if|<| |>| }{z}_{ij} > \theta \\ 0& \hbox{ if|<| |>| }{z}_{ji}\le \theta .\end{array}\left(5\right)\]

The category j that has the maximal T_j is defined as the ‘winner’ category for input vector I.

As a next step, the category selected above is judged to follow ‘resonance’ procedure or ‘mismatch reset’ procedure by the function r_j defined as the following equation.

\[{r}_{j}={\displaystyle \sum _{i=1}^{m}}{h}_{ij}.\left(6\right)\]

‘Resonance’ procedure is carried out if the function r_j of the winner category for input vector I is bigger than ρ; that is expressed as

\[{r}_{j}\ge \rho .\left(7\right)\]

The function r_j indicates the size of dimensions of projected subspace and the vigilance parameter indicates its threshold. When the ‘resonance’ procedure should be done, learning of the weight vector of winner category is performed. Learning of the bottom-up weight z_ij and top-down weight z_ji is updated according to the following equations.

\[{z}_{ji}^{\hbox{ new }}=\{\begin{array}{cc}L/\left(L-1+\left|X\right|\right)& \hbox{ if|<| |>| }{h}_{ij}=1\\ 0& \hbox{ if|<| |>| }{h}_{ij}=0,\end{array}\left(8\right)\]

\[{z}_{ji}^{\hbox{ new }}=\left(1-\alpha \right){z}_{ji}^{\hbox{ old }}+\alpha {I}_{i},\left(9\right)\]

where α is the learning rate within the range from 0 to 1, was set to 0.1, which is the same value as reported by Cao and Wu (2002), because the change of this value did not affect the selected genes in this paper (data not shown). |X| denotes the number of elements in the set X = {i, h_ij = 1}.

Otherwise, if the function r_j of the winner category for input vector I is lower than ρ, ‘resonance’ procedure is not done and ‘mismatch reset’ procedure is carried out. A new category that has the next maximal T_j is chosen by Equation (2) again. When any category cannot satisfy Equation (7), a new category is generated according to the following equations.

\[{z}_{ij}^{\hbox{ new }}=L/\left(L-1+\left|m\right|\right).\left(10\right)\]

\[{z}_{ji}^{\hbox{ new }}={I}_{i}.\left(11\right)\]

These steps mentioned above are continued until every input vector I is assigned to any category.

We modified the above algorithm as described below. All patterns and all categories have a teacher signal. When a new category is generated, its teacher signal is decided according to the input pattern's teacher signal at that time. The ‘resonance’ procedure is carried out only for the pattern having the same teacher signal as category.

Furthermore, we defined correctness ratio of clustering as the following equation.

\[\hbox{ correctness ratio }=\frac{A-E}{A},\left(12\right)\]

where A is all pattern number and E is mismatch pattern number with respect to cluster signal and pattern signal.

Signal-to-noise statistic (S2N)

The signal-to-noise statistic has been proposed to calculate weight of genes for weighed voting algorithm as a binary class predictor by Golub et al. (1999). This statistic is defined as the following equation.

\[\hbox{ S }2\hbox{ N }=\left|\frac{{\mu }_{\hbox{ class1 }}-{\mu }_{\hbox{ class2 }}}{{\sigma }_{\hbox{ class1 }}+{\sigma }_{\hbox{ class2 }}}\right|,\left(13\right)\]

where μ is the average of log gene expressions for each class, and σ is the SD of log gene expressions for each class.

We calculate the S2N value of all class pairs for each gene in a multiclass prediction, and then the biggest S2N value of all pairs is defined as MaxS2N value for the gene.

\[\hbox{ MaxS2N }=max\left\{\left|\frac{{\mu }_{{\hbox{ class }}_{\hbox{ i }}}-{\mu }_{{\hbox{ class }}_{\hbox{ j }}}}{{\sigma }_{{\hbox{ class }}_{\hbox{ i }}}+{\sigma }_{{\hbox{ class }}_{\hbox{ j }}}}\right|:i\in C,j\in C,i\ne j\right\},\left(14\right)\]

where C is the set of classes.

Nearest shrunken centroids

NSC method has been proposed to identify minimal subsets of the genes by Tibshirani et al. (2002). In this method, the mean expression of each gene within each class is calculated. Then it shrinks these centroids toward the overall mean for that gene by a fixed quantity, threshold Δ. The NSC value is defined as the following equation.

\[\hbox{ NSC }=\hbox{ sign }\left({d}_{ik}\right){\left(\left|{d}_{ik}\right|-\Delta \right)}_{+},\left(15\right)\]

where i is each gene, k is each class, + is positive part (if t > 0 then t₊ = t else t₊ = 0), and d_ik is defined as the following equation.

\[{d}_{ik}=\frac{{\overline{x}}_{ik}-\overline{{x}_{i}}}{{m}_{k}\cdot \left({s}_{i}+{s}_{0}\right)},\left(16\right)\]

where

\({\overline{x}}_{ik}\)

where is centroids of i-th gene in the class k,

\({\overline{x}}_{i}\)

is overall centroids of i-th gene, s_i is the pooled within-class SD for gene i, s₀ is a positive constant value (we set it equal to the median value of the s_i over the set of the genes), and m_k is defined as the following equation.

\[{m}_{k}=\sqrt{\frac{1}{{n}_{k}}+\frac{1}{n}},\left(17\right)\]

where n_k is sample number in the class k and n is all sample number.

In this paper, the FNN-SWEEP method is then applied to the genes that survive the thresholding. The threshold Δ was optimized by cross-validation in the modeling dataset.

RESULTS AND DISCUSSIONS

Identification of optimal gene number of PART screening

PART is the architecture based on the well-known ART developed by Carpenter and Grossberg (1987). In our previous paper (Tomida et al., 2002b), Fuzzy ART based on the ART was constructed by us and applied to clustering of time course data of gene expression profiles. Cluster construction obtained by Fuzzy ART showed high reproducibility and the highest clustering robustness was obtained even when adding random noise corresponding to the 2-fold change. This means that ART can select genes with a similar expression pattern with high robustness. For this reason, we applied Fuzzy ART to time course data of gene expression profiles for constructing genetic networks in our previous paper (Takahashi et al., 2003). Although Fuzzy ART is very useful for time course data analysis of several dimensions, this method cannot be applied for high-dimensional data. PART was derived from ART to extract specific dimensions for correctness clustering of high-dimensional data. Therefore, all ARTs except PART can extract dimensions. Cao and Wu (2002) reported comparison of PART with Fuzzy ART. In their experiment, PART and Fuzzy ART were applied to artificial datasets that have 20-dimension or 100-dimension. For both datasets, the patterns clustered correctly in the case of PART, while the patterns were very disorderly in the case of Fuzzy ART and too many clusters were generated by more rigid parameters. In the present paper, the function to extract dimensions in the PART algorithm was applied to the selection of specific genes from gene expression data.

The number of genes selected by PART is controlled by a vigilance parameter ρ and a distance parameter σ. For the vigilance parameter, we selected the smallest ρ for which the highest correctness ratio was obtained for clustering, with a fixed distance parameter. For example, when the distance parameter was 2.7 in the analysis of acute leukemia, the vigilance parameter became 1 in the range of 1–5401 of integers so as to achieve the clustering correctness ratio of 100.0%, and then the number of selected genes was 253. In order to identify the optimal gene number in PART screening, the distance parameter should also be surveyed. For this purpose, an FNN-SWEEP method was carried out for gene selection and model construction, and 10 independent FNN class predictors were constructed. As shown in our previous papers (Ando et al., 2002, 2003a,b), FNN modeling was carried out by the parameter increasing method. Therefore, the number of input units was optimized during this procedure. In the case of acute leukemia, 10 FNN models with one input were constructed. In the case of lung cancer, 10 FNN models with four inputs were constructed. We calculated the average of the model accuracy.

The results are shown in Table 1 and Figure 3 for acute leukemia, and Figure 4 for lung cancer. Excess elimination decreased the accuracy of the model, which may be caused by the elimination of important genes. In the case of acute leukemia, the model constructed by selecting 244 genes showed 92.1% modeling data accuracy, and selecting 253 genes increased its accuracy to 93.1%. In the case of lung cancer, the model constructed by selecting 352 genes showed 92.4% modeling data accuracy, and selecting 387 genes increased its accuracy to 94.7%. Based on these results, we selected 253 genes and 387 genes as the lowest gene number, respectively.

Comparison of the performance of FNN-SWEEP class predictor with PART and other screening method

Performance of the class predictor constructed with PART screening was investigated. For comparison, class predictors were constructed using the S2N ranking method, which has been frequently used by many researchers, or the NSC method. We also constructed a predictor without screening. Class predictors, which can correctly classify not only modeling data but also new data, should be constructed. Therefore, the performance of the predictors was compared for accuracy using blinded data that were never used for modeling. The average accuracy for blinded data using 10 independent FNN models was calculated. In the case of acute leukemia, the top 50 genes were selected by means of S2N, because 50 genes had been used for analysis by weighted voting method in the original paper, and those were used for FNN-SWEEP analysis. In addition, the top 253 genes were also selected by S2N so as to select the same number of genes as those resulting from PART screening, and those were used for FNN-SWEEP analysis. In the case of multiple class data of lung cancer, the top 387 genes were selected by S2N to evaluate the discrimination accuracy of the predictor constructed by the FNN-SWEEP method. The top 387 genes were defined by MaxS2N. Furthermore, the case using NSC was also compared.

As shown in Table 2, the predictor with PART screening showed significantly high performance. All predictors for acute leukemia showed high-modeling ability for modeling data (∼93% accuracy). However, for blinded data, predictor 1 with PART screening showed 97.1% accuracy, while predictor 2 without screening was only 76.5% accurate. In addition, predictors resulting from the S2N method (predictors 3 and 4) and NSC method (predictor 5) showed only 85.3 and 88.2% of accuracy for blinded data, respectively. In the case of acute leukemia, we prepared 34 samples of blinded data. Therefore, 5 samples out of 34 were predicted incorrectly using the S2N method, while only 1 was incorrect for PART. It should be noted that the accuracy never increased with predictor 7 using 253 candidate genes from S2N. These data mean that PART screening can select genes with the similar expression pattern with high robustness.

In the case of acute leukemia, it should be noted that only 10 significant genes were used for class prediction, since FNN models with one input were constructed, while 50 genes were necessary for the same analysis in the conventional method using weighted voting method (Golub et al., 1999). This is a superior feature from the viewpoint of a biological experiment. If the number of genes of interest is small, the researcher could easily investigate the gene expression level by RT–PCR or histological staining of expressed protein on the specimen in order to know the relationship between gene expression and cancer classification.

For multiple class data of lung cancer, four kinds of FNN class prediction models such as the model with PART screening without screening and with S2N screening were constructed. As shown in Table 2, predictor 6 with PART screening showed 93.7% accuracy for modeling data and 90.0% for blinded data and predictor 9 with NSC screening showed 94.0% accuracy for modeling data and 90.0% for blinded data. On the other hand, predictor 7 without screening was 93.4 and 80.0% accurate, respectively. Predictor 8 with MaxS2N screening was also only 90.4% accurate for modeling data and 70.0% for blinded data, respectively. The reason why blinded data accuracy of MaxS2N was low is that the gene number selected by MaxS2N was matched to one by PART, and this shows that it is possible that PART can condense more significant genes than MaxS2N. In all cases, 10 FNN models were constructed for prediction and for predictor 6, only 17 independent significant genes were used for class prediction, since FNN models with four inputs were constructed.

Comparison of genes used in a predictor with PART, with other screening

The genes selected by the FNN-SWEEP predictor with PART screening were compared with those used in the predictor with NSC, S2N or without screening. The deviation of gene expression level of the genes selected by these four methods was investigated. In this examination, all patients are divided into two classes, such as AML or ALL. Average expression levels of genes used in the predictor with PART screening were calculated for each patient class. The SD of gene expression was calculated against 10 genes in each class. Averages of SDs are also shown in Figure 5. The average SD of the expression level of genes used in the predictor with PART screening in the lower class was significantly smaller than that of the higher class. The value of the SD for the predictor with PART screening was 1/2.3, 1/2.5 or 1/3.6 times less than that with NSC, S2N or without screening. Although in the case of higher class a higher SD was obtained in the predictor with PART, the difference is not so big and 1.2, 1.2 or 1.3 times higher SD were obtained compared with that with NSC, S2N or without screening. Low SD values mean that signals of those genes have less noise and the genes show similar expression level in a class. If genes with a high SD were used in a predictor, the model constructed will show low accuracy for blinded data, although high accuracy may be obtained for modeling data. Actually, as shown in Table 1, discrimination accuracy of all models for the modeling dataset was ∼93%. However, the accuracy of predictor 1 with PART screening was 97.1% for blinded data, while the accuracy of the other models was significantly lower than this. The high accuracy is the result of low SD values in the lower class.

Genes that were part of the FNN class predictor for acute leukemia are shown in Table 3. A total of 10 genes in the FNN model with PART screening included five genes from the top 50 genes selected by the S2N method as reported in the original paper (Golub et al., 1999). The zyxin (X95735) gene was also selected in other predictors derived from either the S2N method or without screening, and the gene was commonly used in the first FNN model in other methods. This result means that the zyxin gene was very important for discriminating acute leukemia and PART did not eliminate this important gene. Furthermore, CD36 antigen gene expression was selected only in the predictor with PART screening. CD36 antigen has been reported by Valet et al. (2003) as a high-risk AML marker gene. This may be due to the fact that the SD in the lower class of the predictor with PART was the smallest among the four predictors as shown in Figure 5.

For clustering of lung cancer, we investigated the presence of previously reported marker genes among the genes selected in the constructed FNN predictors. As shown in Table 4, 17 genes were selected in the predictor with PART screening. Among those, five genes were marker genes reported by Bhattacharjee et al. (2001): tumor protein 63 kDa with strong homology to p53 and keratin 5 for squamous cell lung carcinoma (SQ), adv. glycosylation end product-sp. receptor for normal lung (NL), and ISL1 transcription factor and insulinoma-associated 1 for neuroendocrine tumor (NE). In addition to these genes, two genes among the remaining 12 genes, chromogranin A and chromogranin C, were reported to be marker genes for NE by Lamberts et al. (2001). One gene, ceruloplasmin, was also reported to be a marker gene for AD and SQ by (Wang et al., 2002). These findings suggest that the genes selected in the predictor with PART screening may identify new marker genes and also suggest that the conventional method may miss the selection of important genes for class prediction.

CONCLUSION

In this paper, we applied PART to gene expression data to eliminate non-specific genes and genes with error. Furthermore, the genes selected by PART were subjected to the FNN-SWEEP method for the construction of robust cancer class prediction models. The results showed that the FNN-SWEEP class predictor with PART screening was superior to the predictor without screening, with S2N or NSC. The predictor constructed with PART screening showed 97.1 and 90.0% accuracy for blinded data of acute leukemia and lung cancer respectively, while 76.5 and 80.0% accuracy were obtained using the predictor without screening. This result suggests that PART has the potential to function as a new method of gene screening for class prediction.

Fig. 1

Outline of this paper.

Open in new tab Download slide

Fig. 2

Function of PART clustering.

Open in new tab Download slide

Fig. 3

Accuracy on various gene numbers for acute leukemia. Solid line with open circles indicates accuracy for blinded data. Solid line with closed circles indicates accuracy for modeling data.

Open in new tab Download slide

Fig. 4

Accuracy on various gene numbers for one set of six lung cancer modeling datasets. Solid line with open circles indicates accuracy for blinded data. Solid line with closed circles indicates accuracy for modeling data.

Open in new tab Download slide

Table 1

Open in new tab

Comparison of average accuracy and optimal gene numbers

For acute leukemia

Distance parameter (−)

2.5

2.6

2.7

2.8

2.9

100.0

Vigilance parameter (−)

Selected gene number (−)

233

244

253

262

271

5401

For lung cancera

Distance parameter (−)

0.9

1.0

1.1

1.2

1.3

100.0

Vigilance parameter (−)

Selected gene number (−)

320

352

387

419

449

3312

^aParameters for one set of six lung cancer datasets.

Table 2

Open in new tab

Accuracy of class prediction for acute leukemia and lung cancer

Model	Gene screening	Discrimination accuracya (%)
		Modeling datasetb	Blinded dataset
For acute leukemia
Predictor 1	PART	93.1	97.1
Predictor 2	Nothing	93.3	76.5
Predictor 3	Top 50 by S2N	92.9	85.3
Predictor 4	Top 253 by S2N	93.1	85.3
Predictor 5	NSC	93.7	88.2
For lung cancer
Predictor 6	PART	93.7 ± 0.8	90.0 ± 9.3
Predictor 7	Nothing	93.4 ± 0.4	80.0 ± 0.0
Predictor 8	MaxS2N	90.4 ± 1.9	70.0 ± 9.3
Predictor 9	NSC	94.0 ± 0.4	90.0 ± 9.3

Model	Gene screening	Discrimination accuracya (%)
		Modeling datasetb	Blinded dataset
For acute leukemia
Predictor 1	PART	93.1	97.1
Predictor 2	Nothing	93.3	76.5
Predictor 3	Top 50 by S2N	92.9	85.3
Predictor 4	Top 253 by S2N	93.1	85.3
Predictor 5	NSC	93.7	88.2
For lung cancer
Predictor 6	PART	93.7 ± 0.8	90.0 ± 9.3
Predictor 7	Nothing	93.4 ± 0.4	80.0 ± 0.0
Predictor 8	MaxS2N	90.4 ± 1.9	70.0 ± 9.3
Predictor 9	NSC	94.0 ± 0.4	90.0 ± 9.3

^aOne blinded dataset was prepared for acute leukemia and six blinded datasets were prepared for lung cancer. For lung cancer, the average accuracy from six models corresponding to six blinded datasets was listed.

^bThe value was obtained from 3-fold cross-validation for acute leukemia and 5-fold cross-validation for lung cancer.

Table 2

Open in new tab

Accuracy of class prediction for acute leukemia and lung cancer

Model	Gene screening	Discrimination accuracya (%)
		Modeling datasetb	Blinded dataset
For acute leukemia
Predictor 1	PART	93.1	97.1
Predictor 2	Nothing	93.3	76.5
Predictor 3	Top 50 by S2N	92.9	85.3
Predictor 4	Top 253 by S2N	93.1	85.3
Predictor 5	NSC	93.7	88.2
For lung cancer
Predictor 6	PART	93.7 ± 0.8	90.0 ± 9.3
Predictor 7	Nothing	93.4 ± 0.4	80.0 ± 0.0
Predictor 8	MaxS2N	90.4 ± 1.9	70.0 ± 9.3
Predictor 9	NSC	94.0 ± 0.4	90.0 ± 9.3

Model	Gene screening	Discrimination accuracya (%)
		Modeling datasetb	Blinded dataset
For acute leukemia
Predictor 1	PART	93.1	97.1
Predictor 2	Nothing	93.3	76.5
Predictor 3	Top 50 by S2N	92.9	85.3
Predictor 4	Top 253 by S2N	93.1	85.3
Predictor 5	NSC	93.7	88.2
For lung cancer
Predictor 6	PART	93.7 ± 0.8	90.0 ± 9.3
Predictor 7	Nothing	93.4 ± 0.4	80.0 ± 0.0
Predictor 8	MaxS2N	90.4 ± 1.9	70.0 ± 9.3
Predictor 9	NSC	94.0 ± 0.4	90.0 ± 9.3

^bThe value was obtained from 3-fold cross-validation for acute leukemia and 5-fold cross-validation for lung cancer.

Fig. 5

Comparison of average of each parameter for 10 genes of predictors. Average of the SD is indicated by σ. Average and SD of gene expression data of all patients were normalized to become 0 and 1, respectively. White bars and grey bars mean average σ in lower class and higher class, respectively.

Open in new tab Download slide

Table 3

Open in new tab

The genes used in FNN class predictor for acute leukemia

Description	Sequence	Marka
ACADM acyl-coenzyme A dehydrogenase, C-4 to C-12 straight chain	M91432	°
Azurocidin gene	M96326	°
CD36 antigen (collagen type I receptor, thrombospondin receptor)	M98399
CST3 Cystatin C (amyloid angiopathy and cerebral hemorrhage)	M27891	°
Cystatin A	D88422
DF D component of complement (adipsin)	M84526	°
ELA2 Elastatse 2, neutrophil	M27783
Nucleolysin TIA-1	M77142
PTX3 Pentaxin-related gene, rapidly induced by IL-1 beta	M31166
Zyxin	X95735	°

Description	Sequence	Marka
ACADM acyl-coenzyme A dehydrogenase, C-4 to C-12 straight chain	M91432	°
Azurocidin gene	M96326	°
CD36 antigen (collagen type I receptor, thrombospondin receptor)	M98399
CST3 Cystatin C (amyloid angiopathy and cerebral hemorrhage)	M27891	°
Cystatin A	D88422
DF D component of complement (adipsin)	M84526	°
ELA2 Elastatse 2, neutrophil	M27783
Nucleolysin TIA-1	M77142
PTX3 Pentaxin-related gene, rapidly induced by IL-1 beta	M31166
Zyxin	X95735	°

^aMarks mean genes included in top 50 by S2N.

Table 3

Open in new tab

The genes used in FNN class predictor for acute leukemia

Description	Sequence	Marka
ACADM acyl-coenzyme A dehydrogenase, C-4 to C-12 straight chain	M91432	°
Azurocidin gene	M96326	°
CD36 antigen (collagen type I receptor, thrombospondin receptor)	M98399
CST3 Cystatin C (amyloid angiopathy and cerebral hemorrhage)	M27891	°
Cystatin A	D88422
DF D component of complement (adipsin)	M84526	°
ELA2 Elastatse 2, neutrophil	M27783
Nucleolysin TIA-1	M77142
PTX3 Pentaxin-related gene, rapidly induced by IL-1 beta	M31166
Zyxin	X95735	°

Description	Sequence	Marka
ACADM acyl-coenzyme A dehydrogenase, C-4 to C-12 straight chain	M91432	°
Azurocidin gene	M96326	°
CD36 antigen (collagen type I receptor, thrombospondin receptor)	M98399
CST3 Cystatin C (amyloid angiopathy and cerebral hemorrhage)	M27891	°
Cystatin A	D88422
DF D component of complement (adipsin)	M84526	°
ELA2 Elastatse 2, neutrophil	M27783
Nucleolysin TIA-1	M77142
PTX3 Pentaxin-related gene, rapidly induced by IL-1 beta	M31166
Zyxin	X95735	°

^aMarks mean genes included in top 50 by S2N.

Table 4

Open in new tab

The genes used in FNN class predictor for one set of six lung cancer modeling datasets

Description	Sequence	Marka	Marker genes
Adv. glycosylation end product-sp. receptor	M91211	°
Cathepsin Z	AF032906
Ceruloplasmin	M13699		°b
Chromogranin A	U03749		°c
Chromogranin C	M25756		°c
Human amyloid precursor-like protein 1 mRNA	U48437
Hypothetical protein 384D8_6	U62317
Insulinoma-associated 1	M93119	°
ISL1 transcription factor	U07559	°
Keratin 5	M21389	°
KIAA0736 protein	AB018279
KIAA1087 protein	AB029010
Protein tyrosine phosphatase, receptor type, N	L18983
Secretagogin	Y16752
Secretory granule, neuroendocrine protein 1	Y00757
Synaptosomal-associated protein, 25 kDa	D21267
Tumor protein 63 kDa with strong homology to p53	Y16961

Description	Sequence	Marka	Marker genes
Adv. glycosylation end product-sp. receptor	M91211	°
Cathepsin Z	AF032906
Ceruloplasmin	M13699		°b
Chromogranin A	U03749		°c
Chromogranin C	M25756		°c
Human amyloid precursor-like protein 1 mRNA	U48437
Hypothetical protein 384D8_6	U62317
Insulinoma-associated 1	M93119	°
ISL1 transcription factor	U07559	°
Keratin 5	M21389	°
KIAA0736 protein	AB018279
KIAA1087 protein	AB029010
Protein tyrosine phosphatase, receptor type, N	L18983
Secretagogin	Y16752
Secretory granule, neuroendocrine protein 1	Y00757
Synaptosomal-associated protein, 25 kDa	D21267
Tumor protein 63 kDa with strong homology to p53	Y16961

^aMarks mean genes selected by Bhattacharjee et al. (2001).

^bThe marker gene reported by Wang et al. (2002).

^cThe marker genes reported by Lamberts et al. (2001).

Table 4

Open in new tab

The genes used in FNN class predictor for one set of six lung cancer modeling datasets

Description	Sequence	Marka	Marker genes
Adv. glycosylation end product-sp. receptor	M91211	°
Cathepsin Z	AF032906
Ceruloplasmin	M13699		°b
Chromogranin A	U03749		°c
Chromogranin C	M25756		°c
Human amyloid precursor-like protein 1 mRNA	U48437
Hypothetical protein 384D8_6	U62317
Insulinoma-associated 1	M93119	°
ISL1 transcription factor	U07559	°
Keratin 5	M21389	°
KIAA0736 protein	AB018279
KIAA1087 protein	AB029010
Protein tyrosine phosphatase, receptor type, N	L18983
Secretagogin	Y16752
Secretory granule, neuroendocrine protein 1	Y00757
Synaptosomal-associated protein, 25 kDa	D21267
Tumor protein 63 kDa with strong homology to p53	Y16961

Description	Sequence	Marka	Marker genes
Adv. glycosylation end product-sp. receptor	M91211	°
Cathepsin Z	AF032906
Ceruloplasmin	M13699		°b
Chromogranin A	U03749		°c
Chromogranin C	M25756		°c
Human amyloid precursor-like protein 1 mRNA	U48437
Hypothetical protein 384D8_6	U62317
Insulinoma-associated 1	M93119	°
ISL1 transcription factor	U07559	°
Keratin 5	M21389	°
KIAA0736 protein	AB018279
KIAA1087 protein	AB029010
Protein tyrosine phosphatase, receptor type, N	L18983
Secretagogin	Y16752
Secretory granule, neuroendocrine protein 1	Y00757
Synaptosomal-associated protein, 25 kDa	D21267
Tumor protein 63 kDa with strong homology to p53	Y16961

^aMarks mean genes selected by Bhattacharjee et al. (2001).

^bThe marker gene reported by Wang et al. (2002).

^cThe marker genes reported by Lamberts et al. (2001).

We wish to thank Dr Wu Jianhong for discussions on PART algorithm. This research was supported in part by Grant-in-Aid for the Project for Development of a Technological Infrastructure for Industrial Bioprocesses on R&D of New Industrial Science and Technology Frontiers by the Ministry of Economy, Trade and Industry (METI), which was entrusted by New Energy and Industrial Technology Development Organization (NEDO).

REFERENCES

Ando, T., Suguro, M., Hanai, T., Kobayashi, T., Honda, H., Seto, M.

2002

Fuzzy neural network applied to gene expression profiling for predicting the prognosis of diffuse large B-cell lymphoma.

Jpn J. Cancer Res.

1207

–1212

Ando, T., Suguro, M., Kobayashi, T., Seto, M., Honda, H.

2003

Selection of casual gene sets for lymphoma prognostication from expression profiling and construction of prognostic fuzzy neural network models.

J. Biosci. Bioeng.

161

–167

Ando, T., Suguro, M., Kobayashi, T., Seto, M., Honda, H.

2003

Multiple fuzzy neural network system for outcome prediction and classification of 220 lymphoma patients on the basis of molecular profiling.

Cancer Sci.

906

–913

Bhattacharjee, A., Richards, W.G., Staunton, J., Li, C., Monti, S., Vasa, P., Ladd, C., Beheshti, J., Bueno, R., Gillette, M., et al.

2001

Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Proc. Natl Acad. Sci. USA

13790

–13795

Brown, M.P., Grundy, W.N., Lin, D., Cristianini, N., Sugnet, C.W., Furey, T.S., Ares, M., Jr, Haussler, D.

2000

Knowledge-based analysis of microarray gene expression data by using support vector machines.

Proc. Natl Acad. Sci., USA

262

–267

Cao, Y. and Wu, J.

2002

Projective ART for clustering data sets in high dimensional spaces.

Neural Netw.

105

–120

Carpenter, G.A. and Grossberg, S.

1987

A massively parallel architecture for a self-organizing neural pattern recognition machine.

Comput. Vision Graphics Image Process

–115

Eisen, M.B., Spellman, P.T., Brown, P.O., Botstein, D.

1998

Cluster analysis and display of genome-wide expression patterns.

Proc. Natl Acad. Sci., USA

14863

–14868

Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.

1999

Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Science

286

531

–537

Gruvberger, S., Ringner, M., Chen, Y., Panavally, S., Saal, L.H., Borg, A., Ferno, M., Peterson, C., Meltzer, P.S.

2001

Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns.

Cancer Res.

5979

–5984

Lamberts, S.W., Hofland, L.J., Nobels, F.R.

2001

Neuroendocrine tumor markers.

Front. Neuroendocrinol.

309

–339

Noguchi, H., Hanai, T., Honda, H., Harrison, L.C., Kobayashi, T.

2001

Fuzzy neural network-based prediction of the motif for MHC class II binding peptides.

J. Biosci. Bioeng.

227

–231

Somogyi, R.

1999

Making sense of gene-expression data.

Pharmainformatics

–24

Takahashi, H., Tomida, S., Kobayashi, T., Honda, H.

2003

Inference of common genetic network.

J. Biosci. Bioeng.

161

–167

Tamayo, P., Slonim, D., Mesirov, J., Zhu, Q., Kitareewan, S., Dmitrovsky, E., Lander, E.S, Golub, T.R.

1999

Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.

Proc. Natl Acad. Sci., USA

2907

–2912

Tibshirani, R., Hastie, T., Narasimhan, B., Chu, G.

2002

Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Proc. Natl Acad. Sci., USA

6567

–6572

Tomida, S., Hanai, T., Koma, N., Suzuki, Y., Kobayashi, T., Honda, H.

2002

Artificial neural network predictive model for allergic disease using signal nucleotide polymorphisms data.

J. Biosci. Bioeng.

470

–478

Tomida, S., Hanai, T., Honda, H., Kobayashi, T.

2002

Analysis of expression profile using fuzzy adaptive resonance theory.

Bioinformatics

1073

–1083

Valet, G., Repp, R., Link, H., Ehninger, A., Gramatzki, M.M.

2003

Pretherapeutic identification of high-risk acute myeloid leukemia (AML) patients from immunophenotypic, cytogenetic, and clinical parameters.

Cytometry

53B

–10

Wang, K.K., Liu, N., Radulovich, N., Wigle, D.A., Johnston, M.R., Shepherd, F.A., Minden, M.D., Tsao, M.S.

2002

Novel candidate tumor marker genes for lung adenocarcinoma.

Oncogene

7598

–7604

Xu, Y., Selaru, F.M., Yin, J., Zou, T.T., Shustova, V., Mori, Y., Sato, F., Liu, T.C., Olaru, A., Wang, S., et al.

2002

Artificial neural networks and gene filtering distinguish between global gene expression profiles of Barrett's esophagus and esophageal cancer.

Cancer Res.

3493

–3497

Download all slides

Month:	Total Views:
January 2017	2
February 2017	8
May 2017	3
June 2017	2
July 2017	2
August 2017	3
November 2017	1
December 2017	10
January 2018	27
February 2018	2
March 2018	8
April 2018	8
May 2018	9
June 2018	8
July 2018	13
August 2018	17
September 2018	3
October 2018	16
November 2018	13
December 2018	11
January 2019	3
February 2019	5
March 2019	24
April 2019	23
May 2019	11
June 2019	6
July 2019	8
August 2019	6
September 2019	10
October 2019	3
November 2019	8
December 2019	3
January 2020	14
March 2020	2
April 2020	4
May 2020	5
June 2020	6
July 2020	2
August 2020	4
September 2020	10
October 2020	12
November 2020	7
February 2021	6
March 2021	4
April 2021	8
May 2021	3
June 2021	3
July 2021	4
August 2021	1
September 2021	3
October 2021	8
November 2021	5
December 2021	4
January 2022	3
February 2022	7
March 2022	6
April 2022	4
May 2022	3
June 2022	5
July 2022	9
August 2022	16
September 2022	13
October 2022	18
November 2022	2
December 2022	11
January 2023	2
February 2023	6
March 2023	10
April 2023	11
May 2023	5
June 2023	1
July 2023	3
August 2023	7
September 2023	2
October 2023	7
November 2023	10
December 2023	4
January 2024	9
February 2024	8
March 2024	8
April 2024	8
May 2024	13
June 2024	7
July 2024	8
August 2024	3
September 2024	3
October 2024	10
November 2024	3
December 2024	2
January 2025	6
February 2025	2
March 2025	14
April 2025	7
May 2025	3

Article Contents

Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method

Abstract

INTRODUCTION

MATERIALS AND METHODS

Data processing

Evaluation of extracted genes by PART

Projective adaptive resonance theory model

Signal-to-noise statistic (S2N)

Nearest shrunken centroids

RESULTS AND DISCUSSIONS

Identification of optimal gene number of PART screening

Comparison of the performance of FNN-SWEEP class predictor with PART and other screening method

Comparison of genes used in a predictor with PART, with other screening

CONCLUSION

REFERENCES

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

Article Contents

Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method Free

Abstract

INTRODUCTION

MATERIALS AND METHODS

Data processing

Evaluation of extracted genes by PART

Projective adaptive resonance theory model

Signal-to-noise statistic (S2N)

Nearest shrunken centroids

RESULTS AND DISCUSSIONS

Identification of optimal gene number of PART screening

Comparison of the performance of FNN-SWEEP class predictor with PART and other screening method

Comparison of genes used in a predictor with PART, with other screening

CONCLUSION

REFERENCES

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

This Feature Is Available To Subscribers Only

Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method