Fuyi Li, Shuangyu Dong, André Leier, Meiya Han, Xudong Guo, Jing Xu, Xiaoyu Wang, Shirui Pan, Cangzhi Jia, Yang Zhang, Geoffrey I Webb, Lachlan J M Coin, Chen Li, Jiangning Song, Positive-unlabeled learning in bioinformatics and computational biology: a brief review, Briefings in Bioinformatics, Volume 23, Issue 1, January 2022, bbab461, https://doi.org/10.1093/bib/bbab461
Abstract
Conventional supervised binary classification algorithms have been widely applied to address significant research questions using biological and biomedical data. This classification scheme requires two fully labeled classes of data (e.g. positive and negative samples) to train a classification model. However, in many bioinformatics applications, labeling data is laborious, and negative samples might be mislabeled due to the limited sensitivity of the experimental equipment. The positive-unlabeled (PU) learning scheme was therefore proposed to enable classifiers to learn directly from limited positive samples and a large number of unlabeled samples (i.e. a mixture of positive and negative samples). To date, several PU learning algorithms have been developed to address various biological questions, such as sequence identification, functional site characterization and interaction prediction. In this paper, we revisit a collection of 29 state-of-the-art PU learning bioinformatic applications. We extensively discuss their important aspects, including PU learning methodology, biological application, classifier design and evaluation strategy. We also comment on the existing issues of PU learning and offer our perspectives on the future development of PU learning applications. We anticipate that our work will serve as an instrumental guideline for a better understanding of the PU learning framework in bioinformatics and for developing next-generation PU learning frameworks for critical biological applications.
Introduction
With advances in high-throughput sequencing techniques and large-scale biomedical experiments, an unprecedented volume of biological data has accumulated, allowing for data-driven computational analysis. Compared to wet laboratory-based research, which is often laborious and time-consuming, computational methods can predict and analyze potential patterns from the massive volume of biological data and shortlist candidates for follow-up experimental validation. To date, computational prediction and analysis have been successfully applied to address a broad spectrum of fundamental biological questions, such as homology sequence detection [1, 2], identification of genomic signals and regions [3–6], protein–protein interaction (PPI) and complex prediction [7–10], gene function prediction [11–14], protein/RNA/DNA functional site prediction [15–32] and biomedical image classification [33–36]. Among these computational analysis tasks, classification, which aims to assign test samples to different classes (e.g. positive and negative samples in binary classification), is of particular importance. For example, in protein phosphorylation site prediction, both phosphorylated peptides (i.e. positive samples) and non-phosphorylated peptides (i.e. negative samples) need to be provided to train the classification models [37, 38]. The performance of a classifier is therefore highly dependent on the quality of the training samples, the correctness of sample labeling and the ratio of positive to negative samples. In the research areas of bioinformatics and computational biology, a variety of supervised-learning algorithms have been applied to construct classification models [39], such as support vector machines (SVM) [40], random forest (RF) [41], naïve Bayesian (NB) classifier [42] and logistic regression (LR) [43].
However, in numerous biological applications, negative samples are either limited or uncertain because it is much more straightforward to confirm a property than to ascertain that it does not hold. A potential binding site is confirmed if it binds a target, but failure to bind only means that the conditions for binding were not satisfied under a given experimental condition. Further, technological advances often lead to improved identification of specific properties, meaning that biological samples previously not known to have a property can now be confidently classified. For example, in our recent study [44], we demonstrated the changes in protein glycosylation site labeling across four time points spanning 10 years. Another example is PPI prediction [45, 46], where experimentally validated PPIs and non-interacting protein pairs are used as positive and negative training samples, respectively. Nevertheless, selecting non-interacting protein pairs is challenging for two reasons: (i) novel PPIs are constantly being discovered over time, indicating that some non-interacting protein pairs (i.e. negative samples) might be mislabeled; and (ii) there is a large number of protein pairs for which no interactions have been identified, significantly outnumbering the positive samples. Similar situations can also be found in gene function prediction [47–50], biological sequence classification [51], small non-coding RNA detection [52] and drug–drug interaction identification [53]. To address these issues, the positive-unlabeled (PU) learning scheme, a special category of semi-supervised learning, has recently emerged as a useful approach [54]. The semi-supervised learning scheme generally refers to building prediction models with partially labeled training data (e.g. the labeled data can be either positives only, or both positives and negatives) [55].
In contrast, PU learning specifically refers to the semi-supervised scheme that builds classification models directly from a small number of labeled positive samples and a large volume of unlabeled samples (i.e. a mixture of both positive and negative samples) [54]. Due to the presence of unlabeled data, conventional binary classifiers that require both positive and negative samples, such as SVM and RF, are no longer directly applicable. One-class learning [56] is an alternative approach that trains a model based only on positive samples; however, this scheme cannot benefit from the large amount of information that might be present in unlabeled samples. Two major research directions have been proposed to enable PU learning, as summarized in previous studies [57, 58]: (i) converting PU learning problems into conventional classification tasks by identifying reliable negatives from the unlabeled dataset, and (ii) adapting conventional classification frameworks to learn directly from positive and unlabeled samples.
To the best of our knowledge, only one review [45] has been published to date that summarized PU learning applications in protein-interaction networks, and it compared eight PU learning algorithms in that single context. However, PU learning has been widely applied in many different bioinformatic fields. It is thus highly desirable to comprehensively survey the PU learning scheme across a wide spectrum of bioinformatics tasks and to explore its applicability to biological questions. In this study, we systematically reviewed and discussed the design and implementation of 29 PU learning-based bioinformatic applications covering a wide range of biological and biomedical topics, including sequence classification, interaction prediction, gene/protein function prediction and functional site prediction. In addition, we discussed performance evaluation, existing issues and future perspectives of PU learning algorithm development to offer valuable insights into the applicability of the PU learning scheme to important biological and biomedical questions.
PU learning scheme
The PU learning scheme aims to build classifiers with competitive prediction performance using limited positive samples and high volumes of unlabeled data. To date, various PU learning algorithms have been developed and applied to address a variety of biological classification tasks (Table 1). We categorized these algorithms into two major strategies, ‘selecting reliable negatives’ and ‘adapting the base classifier,’ following Kilic et al. [45]. A schematic overview of these two strategies is illustrated in Figure 1. A key difference between the two strategies lies in how they handle unlabeled samples in the training data. The ‘selecting reliable negatives’ strategy seeks to identify a subset of unlabeled samples that can be treated as negative samples; the positive samples together with these putative negatives are then used as training data for a conventional learning algorithm. The ‘adapting the base classifier’ strategy modifies the base classifiers (e.g. SVM) to estimate and correct for the expected ratio of negative samples in the unlabeled dataset. The ‘selecting reliable negatives’ strategy includes two sub-strategies, namely ‘negative expansion’ and ‘label propagation.’ Both sub-strategies reduce the number of mislabeled negative samples by calculating the likelihood of each unlabeled sample being negative: ‘negative expansion’ performs this calculation iteratively (e.g. via bagging) through the classifier training procedure, whereas ‘label propagation’ iteratively updates the likelihoods on a pre-generated similarity matrix until convergence is reached. Please note that Figure 1 and Table 1 provide a generic classification of the PU learning tools reviewed in this article. As some tools fit multiple categories, for simplicity we classified each tool by its main PU learning characteristic according to its algorithmic description.
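To make the ‘selecting reliable negatives’ strategy concrete, here is a minimal, pure-Python toy sketch. It is not the code of any surveyed tool: the 1-D data, the `keep` fraction and the nearest-centroid scorer (standing in for a real base classifier such as an SVM) are all illustrative assumptions.

```python
# Toy sketch of the two-step 'selecting reliable negatives' strategy:
# (1) pick unlabeled samples farthest from the positives as reliable
# negatives (RNs); (2) train a conventional binary classifier on P vs RNs.
# The nearest-centroid "classifier" is a stand-in for e.g. an SVM.

def centroid(xs):
    return sum(xs) / len(xs)

def select_reliable_negatives(P, U, keep=0.5):
    """Treat the unlabeled points farthest from the positive centroid
    as reliable negatives."""
    cP = centroid(P)
    ranked = sorted(U, key=lambda x: abs(x - cP), reverse=True)
    n_rn = max(1, int(len(ranked) * keep))
    return ranked[:n_rn]

def train_and_predict(P, N, x):
    # Conventional binary step: classify by the nearer class centroid.
    return 1 if abs(x - centroid(P)) < abs(x - centroid(N)) else 0

P = [9.0, 10.0, 11.0]            # labeled positives
U = [0.5, 1.0, 1.5, 9.5, 10.5]   # unlabeled: negatives + hidden positives
RN = select_reliable_negatives(P, U)
print(train_and_predict(P, RN, 10.2))  # near the positive centroid -> 1
print(train_and_predict(P, RN, 0.8))   # near the negative centroid -> 0
```

The key design point this illustrates is that the hidden positives in U (9.5 and 10.5) are never forced into the negative class, unlike in a naive scheme that treats all of U as N.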
The following abbreviations are used throughout this review: P, U and N stand for positive, unlabeled and negative data, respectively; RN and LN denote reliable and likely negatives; FN stands for false negative; and RP denotes reliable positive.
A comprehensive list of surveyed bioinformatic tools based on the PU learning scheme^a

| Category | Sub-strategy | Tool | Year | PU strategy | Application | Base classifier | Performance evaluation | Performance evaluation metrics | Software/Code availability^b |
|---|---|---|---|---|---|---|---|---|---|
| Selecting reliable negatives | Negative expansion | PSoL [52] | 2006 | None | ncRNA identification | SVM | 5-fold CV and independent test | AUC | No |
| | | LP-IC [51] | 2008 | Centroid distance evaluation | Biological sequence classification | SVM | Independent test | F-measure | No |
| | | AGPS [47] | 2008 | Spy positives, Select RN | Gene function prediction | SVM | 10-fold CV | Precision, Recall, F-measure, AUC | No |
| | | SPE_RNE [48] | 2010 | Positive enlargement | Gene function prediction | SVM | Random test | Precision, Recall, F-measure | No |
| | | Bhardwaj et al. [50] | 2010 | Spy positives | Peripheral protein identification | C4.5 decision tree | 5-fold CV and independent test | Recall, ACC | No |
| | | NOIT [59] | 2010 | Bagging | TF-target interaction | SVM with Platt scaling | 10-fold CV and independent test | AUC, AUPR | No |
| | | ProDiGe [49] | 2011 | Bagging | Disease gene identification | Weighted-SVM | LOO CV | Precision, Recall, CDF | Webserver |
| | | Patel and Wang [60] | 2015 | Spy positives, Bagging | Gene regulatory network | SVM, RF | Random test | ACC | No |
| | | PUL-PUP [61] | 2016 | None | Pupylation site prediction | SVM | 10-fold CV | Recall, ACC, TNR, MCC, AUC | No |
| | | LDAP [62] | 2017 | Bagging | lncRNA-disease association | Weighted-SVM | LOO CV | AUC | Webserver |
| | | EPuL [63] | 2017 | Select RN | Pupylation site prediction | SVM | 10-fold CV and independent test | Recall, TNR, ACC, MCC, AUC | Webserver |
| | | DeepDCR [64] | 2020 | Select RN | Disease-associated circular RNAs prediction | Deep forests [65] | 5-fold CV and independent test | Recall, Precision, AUC | Source code |
| | | iPiDi-PUL [66] | 2021 | None | ncRNA-disease association | RF | 5-fold CV and independent test | AUC | Webserver |
| | | EmptyNN [67] | 2021 | None | Single-cell RNA sequencing quality control | Neural network | 10-fold CV and independent test | AUC, Recall, Specificity | Source code |
| | Label propagation | PUDI [68] | 2012 | None | Disease gene identification | Multi-level weighted SVM | 10-fold CV, 3-fold CV and benchmark | F-measure | Software |
| | | EPU [69] | 2014 | Ensemble classifiers | Disease gene identification | KNN, NB, SVM | 3-fold CV, LOO CV | Precision, Recall, F-measure | No |
| | | PULSE [70] | 2015 | None | Stably folded isoform discovery | RF | 5-fold CV | AUC | Software |
| | | PUPre [71] | 2015 | Spy positives, Select RN | Conformational B-cell epitopes prediction | Weighted-SVM | 10-fold CV | F-measure | No |
| | | PUDT [72] | 2016 | Ensemble label propagation | Drug-target interaction prediction | Weighted-SVM | 5-fold CV | AUC | No |
| Adapting the base classifier | – | PosOnly [57] | 2010 | None | Gene regulatory networks | SVM with Platt scaling | 10-fold CV and independent tests | F-measure, AUC | No |
| | | SIRENE [73] | 2013 | None | TF-CRE interaction | SVM | 3-fold CV | Precision, Recall | No |
| | | HOCCLUS2 [74] | 2014 | Bagging | Gene regulatory networks | SVM | Independent tests | AUC | No |
| | | PRIPU [75] | 2015 | None | Protein-RNA interaction | Biased-SVM | 5-fold CV | Precision, Recall, ACC, EPR | Source code |
| | | PUEL [76] | 2016 | Bagging | Kinase substrate prediction | SVM | LOO CV | Recall, TNR, F-measure, MCC, GM | Software and source code |
| | | MutPred2 [77] | 2017 | Bagging, Prior probability | Pathogenic amino acid variants prioritization | Feed-forward neural networks | 10-fold CV | AUC | Software and webserver |
| | | PAnDE [78] | 2017 | Bagging | Glycosylation sites prediction | AnDE | 10-fold CV and independent tests | F-measure | No |
| | | GlycoMine_PU [44] | 2019 | Bagging, Prior probability | Glycosylation sites prediction | AnDE | 10-fold CV and independent tests | ACC, F-measure, AUC | Webserver |
| | | Topaz [79] | 2019 | None | Particle detection | CNN | 10-fold CV and independent tests | Precision | Software and source code |
| | | PU-HIV [80] | 2021 | None | HIV-1 protease cleavage site prediction | Biased-SVM | 10-fold CV and independent tests | Precision, Recall, F-measure | Source code |
^a Abbreviations: ACC – accuracy; AnDE – averaged n-dimensional estimators; AUC – area under the ROC curve; AUPR – area under the precision-recall curve; LOO – leave one out; CDF – cumulative distribution function [49]; CNN – convolutional neural network; CRE – cis-regulatory elements; CV – cross-validation; EPR – explicit positive recall [75]; GM – geometric mean; KNN – K nearest neighbors; lncRNA – long non-coding RNA; LR – logistic regression; MCC – Matthews correlation coefficient; NB – naive Bayesian; ncRNA – non-coding RNA; RF – random forest; RN – reliable negatives; ROC – receiver operating characteristic; SVM – support vector machine; TF – transcription factor; TNR – true negative rate.
^b The URL addresses for the listed tools are as follows: ProDiGe: http://cbio.ensmp.fr/prodige/; LDAP: http://bioinformatics.csu.edu.cn/ldap/; EPuL: http://59.73.198.144:8080/EPuL/; DeepDCR: https://github.com/xzenglab/DeepDCR/; iPiDi-PUL: http://bliulab.net/iPiDi-PUL/server/; EmptyNN: https://github.com/lkmklsmn/empty_nn/; PUDI: http://www1.i2r.a-star.edu.sg/~xlli/PUDI/PUDI.html/; PULSE: http://www.kimlab.org/software/pulse/; PRIPU: http://admis.fudan.edu.cn/projects/pripu.htm/; PUEL: https://github.com/PengyiYang/KSP-PUEL/; MutPred: http://mutpred.mutdb.org/; GlycoMine_PU: https://glycomine.erc.monash.edu/Lab/GlycoMine_PU/; Topaz: http://topaz.csail.mit.edu/; PU-HIV: https://github.com/allenv5/PU-HIV/.

A schematic illustration of the different types of PU learning algorithms used in the compared bioinformatic tools, including ‘selecting reliable negatives’: (A) negative expansion, (B) label propagation and (C) ‘adapting the base classifier.’ Panel (D) illustrates the four bioinformatics application domains of PU learning algorithms covered in this review: DNA/RNA/protein sequence classification, functional site prediction, protein/gene function prediction and interaction prediction. RNs: reliable negatives; LN: dataset of likely negatives; P: dataset of positives; U: dataset of unlabeled samples.
Selecting reliable negatives
The workflow of PU approaches within the ‘selecting reliable negatives’ strategy is illustrated in Figure 1A and B. Two main approaches for selecting reliable negative samples have been proposed: ‘negative expansion’ and ‘label propagation.’
Negative expansion
As illustrated in Figure 1A, ‘negative expansion’ approaches iterate through a simple process: selected unlabeled samples are first labeled as putative negatives, a model is built to discriminate positives from these putative negatives, and the model is then applied to select additional unlabeled samples as putative negatives. The putative negatives selected at each iteration are the samples for which the current model is most confident.
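The iterative loop above can be sketched in a few lines of pure Python. This is an illustrative toy, not any surveyed tool's implementation: a moving negative-centroid margin stands in for the per-iteration retraining of a real classifier, and the 1-D data are assumptions.

```python
# Toy 'negative expansion' loop (Figure 1A): seed N with the sample
# farthest from the positives, then repeatedly re-rank the remaining
# unlabeled samples and absorb the most confidently negative one.

def negative_expansion(P, U, n_iter=3):
    cP = sum(P) / len(P)
    # Seed N with the unlabeled sample farthest from the positive centroid.
    unlabeled = sorted(U, key=lambda x: abs(x - cP), reverse=True)
    N = [unlabeled.pop(0)]
    for _ in range(n_iter):
        if not unlabeled:
            break
        cN = sum(N) / len(N)
        # "Retrained model": a margin favouring the negative class. cN
        # moves as N grows, mimicking retraining at each iteration.
        unlabeled.sort(key=lambda x: abs(x - cP) - abs(x - cN), reverse=True)
        if abs(unlabeled[0] - cP) - abs(unlabeled[0] - cN) <= 0:
            break  # nothing left looks confidently negative
        N.append(unlabeled.pop(0))
    return N, unlabeled

N, rest = negative_expansion([10.0, 11.0], [0.0, 1.0, 2.0, 10.5])
print(N)     # expanded putative negatives: [0.0, 1.0, 2.0]
print(rest)  # the hidden positive 10.5 is never absorbed into N
```

The stopping condition is the point of the sketch: expansion halts once no remaining unlabeled sample looks more negative than positive, which is how these methods avoid swallowing hidden positives.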
To reduce the number of FNs in N, the NOIT [59] approach divides U into three large groups. Samples in each group are scored by a classifier trained on P and unlabeled samples randomly selected from the other two groups, and only the top-ranked negative candidates are added to the training set for the next iteration. ‘Simple bagging’ approaches provide the most straightforward strategy for handling potentially mislabeled negatives in U and have been shown to predict positives from the unlabeled dataset U more accurately than conventional supervised binary classifiers that simply treat U as N, as demonstrated in [51]. However, the prediction accuracy of bagging-based methods might be lower than that of other types of approaches (i.e. ‘label propagation’ and ‘adapting the base classifier’), since positive samples in U are still likely to be mislabeled as N and thereby mislead the classifiers.
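A generic ‘simple bagging’ PU scheme can be illustrated as follows. This sketch is a hedged toy, not the NOIT code: each round treats a random subset of U as N, "trains" a stand-in centroid classifier on P versus that subset, and averages the per-sample positive votes across rounds.

```python
import random

# Toy PU bagging: average each unlabeled sample's positive votes over
# many rounds, each using a different random subset of U as negatives.
# The centroid scorer stands in for a real base classifier (e.g. SVM).

def pu_bagging_scores(P, U, rounds=50, frac=0.5, seed=0):
    rng = random.Random(seed)
    cP = sum(P) / len(P)
    scores = {x: 0.0 for x in U}
    for _ in range(rounds):
        N = rng.sample(U, max(1, int(len(U) * frac)))  # bootstrap negatives
        cN = sum(N) / len(N)
        for x in U:
            # +1 vote when x sits nearer the positive centroid
            scores[x] += 1.0 if abs(x - cP) < abs(x - cN) else 0.0
    return {x: s / rounds for x, s in scores.items()}

scores = pu_bagging_scores([10.0, 11.0], [0.0, 1.0, 2.0, 10.5])
print(scores[10.5], scores[0.0])  # hidden positive scores 1.0; negative 0.0
```

Averaging over random negative subsets is what dampens the effect of any single round in which hidden positives were sampled into N.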
To evaluate the performance of the trained classifiers, some labeled positives are mixed into U to act as ‘spies.’ AGPS [47] and the method by Patel and Wang [60] chose the SVM with the best performance in identifying the ‘spies’ as the final classifier, while Bhardwaj et al. [50] adjusted the negative-selection threshold at each iteration to minimize the number of positive spies identified as negatives. It has been shown in [49] that, compared with a one-class classifier that uses only positive examples to train a model, a PU learning algorithm with negative expansion identifies hidden positives in U more consistently and accurately. Compared with the bagging-based approaches, distance-based approaches might achieve higher prediction accuracy, as they select more reliable negatives and thereby reduce the number of FNs in N.
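The spy technique can be sketched as follows. The distance-to-positive-mean score, the 30% spy fraction and the low-quantile cutoff are illustrative assumptions; real tools use a trained classifier's scores instead:

```python
import random

def spy_threshold(P, U, spy_frac=0.3, quantile=0.05, seed=1):
    """Move a fraction of P into U as 'spies', score every sample with
    a toy distance-to-positive-mean score, and pick the negative
    selection cutoff so that almost all spies stay above it."""
    rng = random.Random(seed)
    n_spies = max(1, int(len(P) * spy_frac))
    spies = rng.sample(P, n_spies)
    train_P = [x for x in P if x not in spies]
    mp = sum(train_P) / len(train_P)

    def score(x):
        return -abs(x - mp)   # higher = more positive-like

    spy_scores = sorted(score(s) for s in spies)
    cut = spy_scores[int(len(spy_scores) * quantile)]
    # Unlabeled samples scoring below the weakest retained spy are
    # taken as reliable negatives.
    return [x for x in U if score(x) < cut]
```

Because spies are genuine positives, any cutoff that would discard them would also discard hidden positives in U; anchoring the cutoff to the spy scores guards against that.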
Label propagation
Adapting the base classifier
Using this strategy, PUEL [76] randomly bags U into several groups, repeats the base classifier for each group in an ensemble manner, and averages different outcomes from the groups to obtain the final prediction result, adjusted using the Bayes rule and Equation 4. Similarly, HOCCLUS2 [74], MutPred2 [77], PAnDE [78] and GlycoMine_PU [44] subsample U when obtaining |${P}_x(s=1)$| to overcome the data imbalance (i.e. non-glycosylation sites significantly outnumbering glycosylation sites). SVM is used as the base classifier in PUEL [76], HOCCLUS2 [74] and MutPred2 [77], while AnDE [85] is used as the base classifier in PAnDE [78] and GlycoMine_PU [44].
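Equation 4 is not reproduced here, but the standard Elkan–Noto-style correction behind this family of methods divides the ‘labeled vs unlabeled’ classifier's output by the label frequency c = P(s = 1 | y = 1), estimated as the mean score the classifier assigns to held-out labeled positives. A minimal sketch, with made-up scores:

```python
def elkan_noto_correct(scores_unlabeled, scores_holdout_positives):
    """Turn a 'labeled vs unlabeled' classifier's raw scores into
    estimates of P(y=1|x) by dividing by c = P(s=1|y=1), estimated as
    the mean score of held-out labeled positives."""
    c = sum(scores_holdout_positives) / len(scores_holdout_positives)
    # Clip at 1.0 since a probability cannot exceed one.
    return [min(s / c, 1.0) for s in scores_unlabeled]

# Held-out positives average a score of 0.7, so c = 0.7 and a raw
# score of 0.35 is corrected to roughly 0.5.
probs = elkan_noto_correct([0.1, 0.35, 0.49], [0.72, 0.68, 0.70])
```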
As noted above, this baseline method assumes that positive samples are labeled at random. However, this assumption might not hold in real-world bioinformatics applications. To further improve prediction accuracy, an algorithm called AlphaMax [86, 87] was adopted in MutPred2 [77] and GlycoMine_PU [44] to estimate the prior probability of each sample being labeled as positive, and the reported results showed that prediction accuracy could be significantly improved.
Applying PU scheme to address specific biological questions
Table 1 summarizes a total of 29 bioinformatic applications developed under the PU learning scheme, which can be further categorized into the four classes shown in Figure 1D: sequence classification, interaction prediction, gene/protein function prediction and functional site prediction. These applications and their performance are discussed in detail in the following subsections.
DNA/RNA/protein sequence classification
‘Selecting reliable negatives’ algorithms have been widely applied to the identification of disease-associated genes, the classification of biological sequences and the identification of ncRNAs (non-coding RNAs). Among these applications, PU algorithms are most prevalent in the identification of disease-associated genes [49, 62, 68, 69], which aims to unravel the causative relationships between genes and diseases, thereby providing a better understanding of gene variation-driven pathogenicity and suggesting possible solutions to a variety of healthcare problems [68]. Current experimental efforts can generally produce a long list of potential candidate genes, of which only a few are genuinely disease-causative [49]. The proposed approach should therefore be able to uncover the positive samples (i.e. disease-causative genes) from a vast number of unknown genes [68]. PU learning algorithms are applied in this scenario based on the assumption that genes with similar phenotypes are likely to have similar biological functions [69]. In addition to the PU algorithms themselves, different types of biological data have been used for predicting disease-associated genes in [69], including human protein–protein interaction (PPI), gene expression, gene ontology (GO) and phenotype–gene association data. The causative genes of various common diseases have been explored under the PU learning scheme, including cancers and cardiovascular, endocrine, metabolic, neurological, psychiatric and ophthalmological disorders [49, 69]. Compared with one-class learning algorithms, PU learning algorithms have demonstrated better performance in identifying disease-causative genes [49]. Similarly, PU algorithms have been used to identify associations between lncRNAs (long non-coding RNAs) and diseases [62]. The PU learning scheme has also been successfully applied to biological sequence classification via the LP-IC framework [51].
Two scenarios of biological sequence classification using LP-IC were investigated: HLA-A2 binding and human–mouse alternative splicing. It was demonstrated in [51] that PU algorithms outperformed conventional supervised learning algorithms in identifying hidden positive samples, thereby achieving higher precision. Another promising application is the detection of ncRNAs. Experimental identification of ncRNAs using routine genetic and biochemical approaches is challenging because most ncRNAs are short and not susceptible to frameshift and nonsense mutations [52, 88]. This challenge highlights the need for PU learning algorithms that computationally identify potential ncRNAs from known ncRNAs and a large number of unknown sequences. In this regard, Wang et al. [52] proposed a PU learning method, PSoL, to detect ncRNAs in the Escherichia coli (E. coli) genome. Empirical studies demonstrated that PSoL achieved superior prediction performance compared to the benchmark approaches.
Interaction prediction
Both ‘selecting reliable negatives’ and ‘adapting the base classifier’ algorithms have been applied to identify TF (transcription factor)–CRE (cis-regulatory element), TF–target and protein–RNA interactions, as well as gene regulatory networks. TF–CRE interactions, which are part of gene regulatory networks and crucially important for understanding the regulatory mechanisms of cells, were investigated in [73]. However, identifying TF–CRE interactions experimentally is difficult even for well-studied organisms [73]. It was shown in [73] that PU learning methods outperformed algorithms that use only positive samples, including the one-class approach. Uncovering the target genes of transcription factors is another important topic in mining and understanding gene regulatory networks. Eight transcription factors with the largest numbers of known target genes from E. coli and Saccharomyces cerevisiae (S. cerevisiae) were investigated in [60] using a PU learning approach from the ‘negative expansion’ group. Additionally, using experimental datasets from E. coli, Cerulo et al. [59] predicted the target genes of BCL6 in normal germinal center human B cells with a PU learning algorithm that heuristically selects reliable negative samples to train the machine-learning model. Protein–RNA interactions are important in regulating many cellular processes. Using a PU biased SVM, Cheng et al. accurately predicted protein–RNA and protein–non-coding-RNA interactions, achieving a satisfactory EPR (explicit positive recall) value [75]. Gene regulatory networks were analyzed and predicted in [57, 74] using PU algorithms with probability estimation (i.e. the ‘adapting the base classifier’ group). Although positive data (i.e. experimentally validated gene-pair interactions) are available in public databases, negative samples (i.e. non-interacting gene pairs) are not readily available due to the lack of experimental validation. PU learning therefore provides a potential solution for learning and analyzing gene regulatory networks from experimentally validated gene-pair interactions together with a large number of unlabeled gene pairs (i.e. pairs whose interaction status is unknown).
Protein/gene function and functional site prediction
PU algorithms based on ‘negative expansion’ and ‘label propagation’ have been widely applied to predict protein/gene functions and protein functional sites. In [50], Bhardwaj et al. applied PU algorithms to predict peripheral (membrane-binding) proteins, which play an important role in biological processes such as cell signaling and are associated with serious diseases including cancers and acquired immunodeficiency syndrome. Additionally, a two-step PU algorithm from the ‘label propagation’ group, using a weighted SVM to select reliable negative samples, was successfully applied to predict conformational B-cell epitopes in [71]. Several studies predicting S. cerevisiae (i.e. yeast) gene functions from genome sequence data have also deployed PU algorithms [47, 48]. Most databases, such as GO, only provide positive annotations for proteins/genes, while negative annotations (i.e. indicating that there is no link between a gene and a GO term) are usually unavailable, necessitating and promoting the implementation and deployment of PU algorithms.
Both ‘selecting reliable negatives’ and ‘adapting the base classifier’ algorithms have also been employed to predict protein functional sites, with PU algorithms from the probability-estimate correction group being the most widely used. A number of studies have used PU algorithms to predict different types of protein functional and post-translational modification (PTM) sites, such as glycosylation sites [44, 78], kinase substrates [76], pupylation sites [61] and protease cleavage sites [80]. Other PU applications for functional sites include the prioritization of pathogenic amino acid variants [77] and gene site prediction for the identification of functional isoforms (i.e. alternative splicing) [70].
Performance evaluation metrics and strategies
Among these measures, AUPR is often used to evaluate imbalanced binary classifiers [89]. CDF measures the percentage of hidden positives in U that can be identified [49]. In addition to the above metrics, EPR is a newly introduced metric that measures the proportion of known positives that are accurately identified (refer to [75] for more details). Compared with traditional performance metrics, EPR is particularly suitable for assessing PU learning problems in which negative samples are completely unavailable.
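A simplified reading of EPR, computing the fraction of known positives recovered within the top-k predictions, can be sketched as follows; see [75] for the exact definition, and note that the gene IDs below are made up for illustration:

```python
def explicit_positive_recall(ranked_ids, known_positives, top_k):
    """Fraction of the known positives recovered within the top-k
    predictions; computable without any negative labels."""
    top = set(ranked_ids[:top_k])
    hits = sum(1 for p in known_positives if p in top)
    return hits / len(known_positives)

# Two of the three known positives appear in the top four predictions.
epr = explicit_positive_recall(["g3", "g1", "g7", "g2", "g5"],
                               ["g1", "g2", "g9"], top_k=4)
```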
As shown in Table 1, four validation strategies have been used to evaluate the prediction performance of PU algorithms: k-fold cross-validation (CV), independent tests, leave-one-out (LOO) CV and random tests. Among these, k-fold CV is the most widely adopted. In k-fold CV, the samples are divided into k subgroups. At each validation step, one subgroup is used as the validation set, while the others are combined as the training set; the k subgroups take turns serving as the validation set until training and testing have been repeated k times. The value of k is typically set to 3, 5 or 10, with 10 being the most common choice. As used in [49, 62, 69, 76], LOOCV can be regarded as an extreme case of k-fold CV in which k equals the total number of training samples: each time, a single sample is used for testing, while the rest of the data are used to train the model. On larger training sets, the performance estimate from LOOCV is usually more stable than that from k-fold CV.
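The k-fold splitting scheme described above can be sketched without any library support; the round-robin fold assignment is one simple choice among several:

```python
def k_fold_indices(n, k):
    """Yield (train, validation) index lists for k-fold CV: sample i
    is assigned to fold i % k, and each fold serves once as the
    validation set while the others form the training set."""
    folds = [list(range(i, n, k)) for i in range(k)]
    for i in range(k):
        val = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, val
```

With k = n this degenerates into LOOCV, since each fold then contains exactly one sample.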
Discussions and future perspectives
Imbalanced data and classifier overfitting are common problems in PU learning, since many classifiers are sensitive to the size difference between U and P. To address this, the unlabeled data are usually subsampled to a smaller size before training the classifiers. For example, in [52], the size of the unlabeled training set was limited to three times that of the positive training set to reduce the effect of class imbalance. Some algorithms further reduce the unlabeled training set to the same size as the positive set. However, as noted in [48], overfitting occurs when the unlabeled training set is too small. It is therefore essential to balance the sizes of P and U to generate reliable models. In addition, cross-validation should be applied routinely to avoid overestimating performance.
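The capped-ratio subsampling described above (e.g. the 3:1 limit used in [52]) can be sketched as follows; the function name and fixed seed are illustrative choices:

```python
import random

def subsample_unlabeled(P, U, ratio=3, seed=0):
    """Cap the unlabeled training set at ratio * |P| to limit the
    class imbalance between U and P; return U unchanged if it is
    already within the cap."""
    cap = ratio * len(P)
    if len(U) <= cap:
        return list(U)
    return random.Random(seed).sample(U, cap)
```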
As shown in Table 1, only a small number of base classifiers have been widely applied to date, predominantly SVM and RF. More PU learning methods, ensemble strategies and deep-learning approaches are expected to be integrated to improve the performance and robustness of PU learning algorithms. Apart from widely used base classifiers such as SVM, RF and KNN, other PU learning algorithms merit consideration, such as POSC4.5 [90], PURF [91], positive hidden naive Bayes (PHNB) and the positive full Bayesian network classifier (PFBC) [92], which extend the decision tree, RF and naïve Bayes classifiers, respectively, to learn directly from positive and unlabeled samples. Beyond the base classifiers, ensemble methods have been widely applied to improve prediction performance [93]. In addition to traditional ensemble methods for base classifiers, such as AdaBoost [94, 95] and stacking [96], integrating the prediction outcomes of different PU learning algorithms has also shown considerable promise. For example, Yang et al. [69] integrated different biological data sources and the prediction outputs of various PU learning schemes to significantly improve the prediction performance of disease-associated gene identification.
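Integrating the outputs of several PU learners, as in [69], often reduces to (weighted) score averaging. A generic sketch of that combination step, not any specific tool's method:

```python
def ensemble_pu(score_lists, weights=None):
    """Combine per-sample scores from several PU learners by
    (weighted) averaging; each inner list holds one learner's
    scores for the same ordered set of samples."""
    weights = weights or [1.0] * len(score_lists)
    total = sum(weights)
    n = len(score_lists[0])
    return [sum(w * s[i] for w, s in zip(weights, score_lists)) / total
            for i in range(n)]
```

Weights can reflect, for example, each learner's cross-validated AUPR, so that more reliable PU models contribute more to the final ranking.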
In recent years, deep-learning techniques have demonstrated superior prediction performance in a wide range of biomedical applications, including protein function prediction [97–99], DNA/RNA modification identification [25, 100], biomedical image classification and analysis [33, 101], and multi-omics data analysis [102–107]. Compared to conventional machine learning algorithms such as SVM, deep neural networks can improve prediction accuracy by discovering the inherent relationships among various features [89]. To address the PU learning scenario, several adapted deep-learning algorithms have been published, such as NNPU [108], PUCNN [79] and GenPU [109]. Notably, deep learning has not yet been widely applied in PU learning scenarios using biological and biomedical data. In Table 1, three approaches, PUCNN, MutPred2 and EmptyNN, used deep-learning algorithms for mining biological data. Bepler et al. [79] successfully applied a PU neural network (PUCNN) to particle picking in cryo-electron micrographs; feed-forward neural networks were adopted in MutPred2 [77] to predict the molecular and phenotypic impact of amino acid mutations; and EmptyNN [67] was developed as a positive-unlabeled learning neural network to remove cell-free droplets and recover lost cells in scRNA-seq data. Apart from generic deep neural networks, graph neural networks have also been adapted for PU learning. Wu et al. [110] proposed a novel long-short distance aggregation network (LSDAN) for the PU graph learning framework and demonstrated that LSDAN achieved an outstanding F1 score. LSDAN has great potential to be applied to interaction identification, for example using PPI and gene regulation data, although its performance on PU tasks with biological data will need to be carefully assessed. Another application of deep-learning algorithms is feature extraction.
Compared to traditional features such as DNA/protein sequence-derived features [111–114] and structural features [21, 23], deep-learning algorithms can automatically learn suitable feature representations for input data such as DNA/RNA/protein sequences, without the need for manually designed biological or physicochemical properties or hand-crafted features [115–120]. In addition, the feature representations learned by deep-learning algorithms can better characterize the input dataset and improve predictive performance.
Conclusion
In many applications, a lack of well-labeled negative examples has been shown to hinder the development of traditional bioinformatics tools. This is of particular importance in biological and biomedical applications, where it may be much easier to confidently identify positive cases than negative cases, or where data can be mislabeled as negative due to the low sensitivity of experimental equipment. Compared to conventional supervised learning, PU learning provides a scheme in which a limited number of well-labeled positive samples and a large number of unlabeled samples can be used together to train a classifier, avoiding the detrimental impact on prediction performance caused by absent or unreliably labeled negative samples. To date, a variety of PU learning algorithms have been developed. This study comprehensively summarizes the current research progress of PU learning using biological and biomedical data. We have surveyed 29 state-of-the-art PU learning-based bioinformatic applications in terms of their underlying PU learning methodology, algorithm implementation, performance evaluation strategy and biological application. We have further discussed current issues and possible future directions for PU learning in bioinformatics. We anticipate that our review and analysis will provide helpful insights into the applications of PU learning algorithms and thereby underpin the development of novel PU learning frameworks to address critical biomedical questions.
Positive-unlabeled (PU) learning overcomes the issues of limited positive samples and potentially mislabeled negative samples in biological and biomedical data, and has therefore been widely applied in bioinformatics.
We conducted a comprehensive review of 29 state-of-the-art PU bioinformatic applications regarding their PU learning approaches, performance evaluation strategies and application domains.
Based on our investigation and analysis, we further commented on the current issues of PU learning in bioinformatics and provided several novel perspectives to underpin the future development of PU frameworks for addressing biological and biomedical questions.
Acknowledgements
This work was supported by grants from the National Health and Medical Research Council of Australia (NHMRC) (APP1127948, APP1144652); Australian Research Council (ARC) (LP110200333, DP120104460); National Institute of Allergy and Infectious Diseases of the National Institutes of Health (R01 AI111965); a Major Inter-Disciplinary Research (IDR) project awarded by Monash University. FL’s work is supported by the core funding of the Doherty Institute at the University of Melbourne. CL is currently a CJ Martin Early Career Research Fellow supported by the NHMRC (1143366). LC’s work is supported by NHMRC career development fellowship (APP1103384), as well as an NHMRC-EU project grant (GNT1195743).
Fuyi Li received his PhD degree in Bioinformatics from Monash University, Australia. He is currently a Research Fellow in the Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Australia. His research interests are bioinformatics, computational biology, machine learning and data mining.
Shuangyu Dong received her MEng degree in Electrical Engineering from the University of Melbourne, Australia. She is currently a PhD candidate in the Department of Electrical and Electronic Engineering, The University of Melbourne, Australia. Her research interests are communication systems, information theory and machine learning.
André Leier is currently an assistant professor in the Department of Genetics, UAB School of Medicine, USA. He is also an associate scientist in UAB’s O’Neal Comprehensive Cancer Center and the Gregory Fleming James Cystic Fibrosis Research Center. His research interests are computational biomedicine, bioengineering, bioinformatics and machine learning.
Meiya Han received her MSc degree in Data Science from Monash University, Australia. She is currently a research assistant in the Department of Biochemistry and Molecular Biology, Monash University, Australia. Her research interests are bioinformatics, computational biology, machine learning and data mining.
Xudong Guo received his MEng degree from Ningxia University, China. His research interests are bioinformatics and data mining.
Jing Xu received her MSc degree in Computer Science and Technology from Nankai University, China. She is currently a PhD candidate in the Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia. Her research interests include bioinformatics, computational biology, machine learning and deep learning.
Xiaoyu Wang received her MSc degree in Information Technology from The University of Melbourne, Australia. She is currently a research assistant in the Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia. Her research interests are bioinformatics, computational biology, machine learning and data mining.
Shirui Pan received his PhD in Computer Science from the University of Technology Sydney (UTS), Ultimo, NSW, Australia. He is currently a senior lecturer with the Faculty of Information Technology, Monash University, Australia. Prior to this, he was a lecturer in the School of Software, University of Technology Sydney. His research interests include data mining and machine learning.
Cangzhi Jia is an associate professor in the College of Science, Dalian Maritime University. She obtained her PhD degree in the School of Mathematical Sciences from the Dalian University of Technology in 2007. Her major research interests include mathematical modeling in bioinformatics and machine learning.
Yang Zhang received his PhD degree in Computer Software and Theory in 2005 from Northwestern Polytechnical University, China. He is a professor at College of Information Engineering, Northwest A&F University. His research interests include machine learning and data mining.
Geoffrey I. Webb received his PhD degree in Computer Science in 1987 from La Trobe University. He is research director of the Monash Data Futures Institute and professor in the Faculty of Information Technology at Monash University. His research interests include machine learning, data mining, computational biology and user modeling.
Lachlan J.M. Coin is a professor and group leader in the Department of Microbiology and Immunology at the University of Melbourne. He is also a member of the Department of Clinical Pathology, University of Melbourne. His research interests are bioinformatics, machine learning, transcriptomics and genomics.
Chen Li is a research fellow in the Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University. He is currently a CJ Martin Early Career Research Fellow, supported by the Australian National Health and Medical Research Council (NHMRC). His research interests include systems proteomics, immunopeptidomics, personalized medicine, experimental bioinformatics and data mining.
Jiangning Song is an associate professor and group leader in the Monash Biomedicine Discovery Institute, Monash University, Melbourne, Australia. He is also affiliated with the Monash Centre for Data Science, Faculty of Information Technology, Monash University. His research interests include bioinformatics, computational biology, machine learning, data mining and pattern recognition.