Preprocessing and downstream analysis of microarray DNA copy number profiles Free

15

Olshen

AB

Venkatraman

E

Lucito

R

et al. ,

Circular binary segmentation for the analysis of array-based DNA copy number data

,

Biostatistics

,

2004

, vol.

5

(pg.

557

-

72

)

16

Rancoita

PM

Hutter

M

Bertoni

F

et al. ,

Bayesian DNA copy number analysis

,

BMC Bioinformatics

,

2009

, vol.

10

(pg.

1

-

19

)

17

Huang

J

Gusnanto

A

O'S;ullivan

K

et al. ,

Robust smooth segmentation approach for array CGH data analysis

,

Bioinformatics

,

2007

, vol.

23

(pg.

2463

-

9

)

18

Hsu

L

Self

S

Grove

D

et al. ,

Denoising array-based comparative genomic hybridization data using wavelets

,

Biostatistics

,

2005

, vol.

6

(pg.

211

-

26

)

19

Lai

WR

Johnson

M

Kucherlapati

R

et al. ,

Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data

,

Bioinformatics

,

2005

, vol.

21

(pg.

3763

-

70

)

20

Willenbrock

H

Fridlyand

J

,

A comparison study: applying segmentation to array CGH data for downstream analyses

,

Bioinformatics

,

2005

, vol.

21

(pg.

4084

-

91

)

21

Van de Wiel

MA

Kim

K

Vosse

S

et al. ,

CGHcall: calling aberrations for array CGH tumor profiles

,

Bioinformatics

,

2007

, vol.

23

(pg.

892

-

4

)

22

Picard

F

Robin

S

Lebarbier

E

et al. ,

A segmentation/clustering model of the analysis of array CGH data

,

Biometrics

,

2007

, vol.

63

(pg.

758

-

66

)

23

Attiyeh

EF

Diskin

SJ

Attiyeh

MA

et al. ,

Genomic copy number determination in cancer cells from single nucleotide polymorphism microarrays based on quantitative genotyping corrected for aneuploidy

,

Genome Res

,

2009

, vol.

19

(pg.

276

-

83

)

24

Wang

K

Li

J

Li

S

et al. ,

Estimation of tumor heterogeneity using CGH array data

,

BMC Bioinformatics

,

2009

, vol.

10

pg.

12

25

Pique-Regi

R

Monso-Varona

J

Ortega

A

et al. ,

Sparse representation and bayesian detection of genome copy number alterations from microarray data

,

Bioinformatics

,

2008

, vol.

24

(pg.

309

-

18

)

26

Díaz-Uriarte

R

Rueda

OM

,

AdaCGH: A parallelized web-based application and R package for the analysis of aCGH data

,

PLoS One

,

2007

, vol.

2

pg.

e737

27

Bengtsson

H

Ray

A

Spellman

P

et al. ,

A single-sample method for normalizing and combining full-resolution copy numbers from multiple platforms, labs and analysis methods

,

Bioinformatics

,

2009

, vol.

25

(pg.

861

-

7

)

28

Zhang

NR

Senbabaoglu

Y

Li

JZ

,

Joint estimation of DNA copy number from multiple platforms

,

Bioinformatics

Advance Access

29

Leary

RJ

Lin

JC

Cummins

J

et al. ,

Integrated analysis of homozygous deletions, focal amplifications, and sequence alterations in breast and colorectal cancers

,

Proc Natl Acad Sci USA

,

2008

, vol.

105

(pg.

16224

-

9

)

30

Van de Wiel

MA

Van Wieringen

WN

,

CGHregions: dimension reduction for array CGH data with minimal information loss

,

Cancer Informatics

,

2007

, vol.

2

(pg.

55

-

63

)

31

Van Wieringen

WN

Van de Wiel

MA

Ylstra

B

,

Weighted clustering of called aCGH data

,

Biostatistics

,

2008

, vol.

9

(pg.

484

-

500

)

32

Van Wieringen

WN

Van de Wiel

MA

Van der Vaart

AW

,

A test for partial differential expression

,

J Amer Statist Assoc

,

2008

, vol.

103

(pg.

1039

-

49

)

33

Gonzalez

JR

Subirana

I

Escarams

G

et al. ,

A latent class model to assess association between copy number and disease

,

BMC Bioinformatics

,

2009

, vol.

10

pg.

172

34

Gilbert

PB

,

A modified false discovery rate multiple-comparisons procedure for discrete data, applied to human immunodeficiency virus genetics

,

Applied Statistics

,

2005

, vol.

54

(pg.

143

-

58

)

35

Chin

SF

Wang

Y

Thorne

N

et al. ,

Using array-comparative genomic hybridization to define molecular portraits of primary breast cancers

,

Oncogene

,

2007

, vol.

26

(pg.

1959

-

70

)

36

Wilhelm

M

Veltman

JA

Olshen

AB

et al. ,

Array-based comparative genomic hybridiza-tion for the differential diagnosis of renal cell cancer

,

Cancer Research

,

2002

, vol.

62

(pg.

957

-

60

)

37

Jong

K

Marchiori

E

Van der Vaart

AW

et al. ,

Cross-platform array comparative genomic hybridization meta-analysis separates hematopoietic and mesenchymal from epithelial tumors

,

Oncogene

,

2007

, vol.

26

(pg.

1499

-

506

)

38

Liu

J

Mohammed

J

Carter

J

et al. ,

Distance-based clustering of CGH data

,

Bioinformatics

,

2006

, vol.

22

(pg.

1971

-

8

)

39

Liu

J

Ranka

S

Kahveci

T

,

Markers improve clustering of CGH data

,

Bioinformatics

,

2007

, vol.

23

(pg.

450

-

7

)

40

Somiari

S

Shriver

C

He

J

et al. ,

Global search for chromosomal abnormalities in infil- trating ductal carcinoma of the breast using array-comparative genomic hybridization

,

Cancer Genetics and Cytogenetics

,

2004

, vol.

155

(pg.

108

-

18

)

41

Unger

K

Malisch

E

Thomas

G

et al. ,

Array CGH demonstrates characteristic aberration signatures in human papillary thyroid carcinomas governed by RET/PTC

,

Oncogene

,

2008

, vol.

27

(pg.

4592

-

602

)

42

Shah

SP

Cheung

KJ

Jr

Johnson

NA

et al. ,

Model-based clustering of array CGH data

,

Bioinformatics

,

2009

, vol.

25

(pg.

i30

-

i38

)

43

O'H;agan

RC

Brennan

CW

Strahs

A

et al. ,

Array comparative genome hybridization for tumor classification and gene discovery in mouse models of malignant melanoma

,

Cancer Research

,

2003

, vol.

63

(pg.

5352

-

6

)

44

Jönsson

G

Naylor

TL

Vallon-Christersson

J

et al. ,

Distinct genomic profiles in hereditary breast tumors identified by array-based comparative genomic hybridization

,

Cancer Research

,

2005

, vol.

65

(pg.

7612

-

21

)

45

Wang

S

Wang

Y

Girard

L

et al.

Arabnia

HR

Yang

MQ

Yang

JY

,

An interval tree based feature reduction method for cancer classification using high-throughput DNA copy number data

,

International Conference on Bioinformatics & Computational Biology, BIOCOMP 2007, Volume I, June 25–28, 2007. Las Vegas, Nevada, USA

Las Vegas, NV

CSREA Press

(pg.

248

-

55

)

Google Preview

46

Gambin

T

Walczak

K

,

A new classification method using array Comparative Genome Hybridization data, based on the concept of Limited Jumping Emerging Patterns

,

BMC Bioinformatics

,

2009

, vol.

10

(pg.

S1

-

S64

)

47

Wang

Y

Makedon

F

Pearlman

J

,

Tumor classification basedon DNA copy number aberrations determined using SNP arrays

,

Oncology Reports

,

2006

, vol.

15

(pg.

1057

-

9

)

48

Rapaport

F

Barillot

E

Vert

JP

,

Classification of array CGH data using fused SVM

,

Bioinformatics

,

2008

, vol.

24

(pg.

i375

-

i382

)

49

Barutcuoglu

Z

Airoldi

E

Dumeaux

V

et al. ,

Aneuploidy prediction and tumor clas- sification with heterogeneous hidden conditional random fields

,

Bioinformatics

,

2009

, vol.

25

(pg.

1307

-

13

)

50

Rueda

OM

Diaz-Uriarte

R

,

Finding recurrent regions of copy number variation: A review of methods

,

Current Bioinformatics

,

2009

in press

51

Rouverol

C

Stransky

N

Hupé

P

et al. ,

Computation of recurrent minimal genomic alterations from array-CGH data

,

Bioinformatics

, vol.

22

7

(pg.

849

-

56

)

52

Shah

SP

,

Computational methods for identification of recurrent copy number alteration patterns by array CGH

,

Cytogenet Genome Res

,

2008

, vol.

123

(pg.

343

-

51

)

53

Rueda

OM

Diaz-Uriarte

R

,

Detection of recurrent copy number alterations in the genome: taking among-subject heterogeneity seriously

,

BMC Bioinformatics

,

2009

, vol.

10

pg.

308

54

Shah

SP

Lam

WL

Ng

RT

Murphy

KP

,

Modeling recurrent DNA copy number alterations in array CGH data

,

Bioinformatics

,

2007

, vol.

23

13

(pg.

i450

-

8

)

55

Feuk

L

Carson

A

Scherer

S

,

Structural variation in the human genome

,

Nat Rev Genet

,

2006

, vol.

7

(pg.

85

-

97

)

56

Carter

N

,

Methods and strategies for analyzing copy number variation using DNA microarrays

,

Nat Genet

,

2007

, vol.

39

(pg.

S16

-

S21

)

57

Weir

BA

Woo

MS

Getz

G

et al. ,

Characterizing the cancer genome in lung adenocarcinoma

,

Nature

,

2007

, vol.

450

(pg.

893

-

8

)

58

Leary

RJ

Cummins

J

Wang

TL

et al. ,

Digital karyotyping

,

Nat Protoc

,

2007

, vol.

2

(pg.

1973

-

86

)

59

,

R Development Core Team

,

R: A Language and Environment for Statistical Computing

,

2006

(11 February 2010, date last accessed)

R Foundation for Statistical Computing

Vienna, Austria

ISBN 3-900051-07-0 http://www.R-project.org

60

Gentleman

RC

Carey

VJ

Bates

DM

et al. ,

Bioconductor: Open software development for computational biology and bioinformatics

,

Genome Biology

,

2004

, vol.

5

pg.

R80

61

Hofmann

WA

Weigmann

A

Tauscher

M

et al. ,

Analysis of array-CGH data using the R and Bioconductor software suite

,

Comp Funct Genomics

,

2009

Article 201325

Appendix: software

We present two tables of software resources for array CGH data analysis. We mostly cite R software from the repositories CRAN [59, http://cran.r-project.org/] & Bioconductor [60, http://www.bioconductor.org]. Additional R-software for aCGH analysis is discussed in [61]. For software references on recurrent regions refer to [50].

Table A1:

Software for preprocessing aCGH profiles

Reference	Source	Name	Platform
Removing wave-like artifacts
Diskin et al. [8]	http://www.openbioinformatics.org/penncnv/	PennCNV-gcmodel	Stand-alone
Van de Wiel et al. [9]	http://www.few.vu.nl/∼mavdwiel/nowaves.html	NoWaves	R
Normalization
Staaf et al. [10]	CRAN	popLowess	R
Chen et al. [11]	http://ntumaps.cgm.ntu.edu.tw/aCGH supplementary/		Matlab
Segmentation, smoothing and calling
Marioni et al. [12]	Bioconductor	snapCGH	R
Rueda et al. [14]	CRAN	RJaCGH	R
Olshen et al. [15]	Bioconductor	DNAcopy	R
Rancoita et al. [16]	http://www.idsia.ch/∼paola/mBPCR/	mBPCR	R
Huang et al. [17]	http://www.meb.ki.se/∼yudpaw/	smoothseg	R
Van de Wiel et al. [21]	Bioconductor	CGHcall	R
Picard et al. [22]	CRAN	segclust	R/C++
Pique-Regi et al. [25]	http://biron.usc.edu/∼piquereg/GADA/GADA.html	GADA	Stand-alone
Díaz-Uriarte et al. [26]	http://adacgh.bioinfo.cnio.es/ & CRAN	adaCGH	Stand-alone & R

Reference	Source	Name	Platform
Removing wave-like artifacts
Diskin et al. [8]	http://www.openbioinformatics.org/penncnv/	PennCNV-gcmodel	Stand-alone
Van de Wiel et al. [9]	http://www.few.vu.nl/∼mavdwiel/nowaves.html	NoWaves	R
Normalization
Staaf et al. [10]	CRAN	popLowess	R
Chen et al. [11]	http://ntumaps.cgm.ntu.edu.tw/aCGH supplementary/		Matlab
Segmentation, smoothing and calling
Marioni et al. [12]	Bioconductor	snapCGH	R
Rueda et al. [14]	CRAN	RJaCGH	R
Olshen et al. [15]	Bioconductor	DNAcopy	R
Rancoita et al. [16]	http://www.idsia.ch/∼paola/mBPCR/	mBPCR	R
Huang et al. [17]	http://www.meb.ki.se/∼yudpaw/	smoothseg	R
Van de Wiel et al. [21]	Bioconductor	CGHcall	R
Picard et al. [22]	CRAN	segclust	R/C++
Pique-Regi et al. [25]	http://biron.usc.edu/∼piquereg/GADA/GADA.html	GADA	Stand-alone
Díaz-Uriarte et al. [26]	http://adacgh.bioinfo.cnio.es/ & CRAN	adaCGH	Stand-alone & R

Table A1:

Software for preprocessing aCGH profiles

Reference	Source	Name	Platform
Removing wave-like artifacts
Diskin et al. [8]	http://www.openbioinformatics.org/penncnv/	PennCNV-gcmodel	Stand-alone
Van de Wiel et al. [9]	http://www.few.vu.nl/∼mavdwiel/nowaves.html	NoWaves	R
Normalization
Staaf et al. [10]	CRAN	popLowess	R
Chen et al. [11]	http://ntumaps.cgm.ntu.edu.tw/aCGH supplementary/		Matlab
Segmentation, smoothing and calling
Marioni et al. [12]	Bioconductor	snapCGH	R
Rueda et al. [14]	CRAN	RJaCGH	R
Olshen et al. [15]	Bioconductor	DNAcopy	R
Rancoita et al. [16]	http://www.idsia.ch/∼paola/mBPCR/	mBPCR	R
Huang et al. [17]	http://www.meb.ki.se/∼yudpaw/	smoothseg	R
Van de Wiel et al. [21]	Bioconductor	CGHcall	R
Picard et al. [22]	CRAN	segclust	R/C++
Pique-Regi et al. [25]	http://biron.usc.edu/∼piquereg/GADA/GADA.html	GADA	Stand-alone
Díaz-Uriarte et al. [26]	http://adacgh.bioinfo.cnio.es/ & CRAN	adaCGH	Stand-alone & R

Reference	Source	Name	Platform
Removing wave-like artifacts
Diskin et al. [8]	http://www.openbioinformatics.org/penncnv/	PennCNV-gcmodel	Stand-alone
Van de Wiel et al. [9]	http://www.few.vu.nl/∼mavdwiel/nowaves.html	NoWaves	R
Normalization
Staaf et al. [10]	CRAN	popLowess	R
Chen et al. [11]	http://ntumaps.cgm.ntu.edu.tw/aCGH supplementary/		Matlab
Segmentation, smoothing and calling
Marioni et al. [12]	Bioconductor	snapCGH	R
Rueda et al. [14]	CRAN	RJaCGH	R
Olshen et al. [15]	Bioconductor	DNAcopy	R
Rancoita et al. [16]	http://www.idsia.ch/∼paola/mBPCR/	mBPCR	R
Huang et al. [17]	http://www.meb.ki.se/∼yudpaw/	smoothseg	R
Van de Wiel et al. [21]	Bioconductor	CGHcall	R
Picard et al. [22]	CRAN	segclust	R/C++
Pique-Regi et al. [25]	http://biron.usc.edu/∼piquereg/GADA/GADA.html	GADA	Stand-alone
Díaz-Uriarte et al. [26]	http://adacgh.bioinfo.cnio.es/ & CRAN	adaCGH	Stand-alone & R

Table A2:

Software for aCGH data analysis

Reference	Source	Name	Platform
Data-driven genomic regions
Van de Wiel et al. [30]	Bioconductor	CGHregions	R
Testing
Gonzalez et al. [33]	http://www.creal.cat/jrgonzalez/software.htm	CNVassoc	R
Clustering of samples
Van Wieringen et al. [31]	http://www.few.vu.nl/∼wvanwie/software/software.html	WECCA	R
Liu et al. [38]	Upon request	Unspecified	Unspecified
Shah et al. [43]	http://www.cs.ubc.ca/∼sshah/acgh	CNA-HMMer	Matlab
Classification
Rapaport et al. [49]	Upon request	fused-SVM	Matlab

Reference	Source	Name	Platform
Data-driven genomic regions
Van de Wiel et al. [30]	Bioconductor	CGHregions	R
Testing
Gonzalez et al. [33]	http://www.creal.cat/jrgonzalez/software.htm	CNVassoc	R
Clustering of samples
Van Wieringen et al. [31]	http://www.few.vu.nl/∼wvanwie/software/software.html	WECCA	R
Liu et al. [38]	Upon request	Unspecified	Unspecified
Shah et al. [43]	http://www.cs.ubc.ca/∼sshah/acgh	CNA-HMMer	Matlab
Classification
Rapaport et al. [49]	Upon request	fused-SVM	Matlab

Table A2: