Abstract

Cancer stem cells (CSCs) contribute to the relapse and development of new neoplasm lesions. While most available clinical approaches, such as chemical and radiation therapies, will kill the majority of cancer cells, they do not kill them all. Some resisting cells, like CSCs, are able to survive due to their excellent self-maintaining capabilities, even in challenging environments. In the present study, we investigated the mRNA level of DNA repair genes of colon CSCs from the HT29 cell line in response to single-strand damage and double-strand breaks, as well as the evident upregulation of key genes in base excision repair, mismatch repair, non-homologous end-joining, and homologous recombination pathways in these cells. Digital gene expression analysis identified upregulated genes in CD44+ HT29 cells that may play important roles in DNA repair. Our results reveal that colon CSCs bear efficient DNA repair abilities, which might explain the survival of colon CSCs after repeated chemical and radiation therapy.

Introduction

Globally, colorectal cancer is one of the most lethally malignant, leading to 698,000 deaths in 2012 [1]. Surgery combined with chemotherapy and radiotherapy is recommended for most patients with high grade (stage III and IV) colon cancer. Although severe injury to normal rectal tissue may also result from these procedures, radiation and chemotherapeutics that trigger carcinoma cell DNA damage are still suggested as the most effective approaches for the treatment of colon cancer.

Cancer stem cells (CSCs) are a small subpopulation of tumor cells with strong self-renewal and metastasis properties which contribute to their radio- and chemotherapy resistance. For colon CSC collection, cellular surface antigens, such as epithelial cell adhesion molecule (EpCAM) and CD44, have been used previously as colon CSC markers [2]. Furthermore, the remarkable DNA polymerase activity of ovarian CSCs was associated with their strong resistance to DNA damage following respective cisplatin and DNA cross-linker treatment in vitro and in vivo [35]. In addition, the excellent capability of CSCs to resist DNA damage assists their survival and their progeny under tumor microenvironments of inflammation, hypoxia, low pH, and poor nutrient supply [6]. In human colon cancer, caffeine, an inhibitor of phosphatidylinositol 3-kinase (PI3K), has been shown to decrease CSC subpopulations and chemoresistance, suggesting a crosstalk between tumorigenicity and DNA damage response [7,8]. Even in healthy bodies, hematopoietic stem cells show radiation resistance by entering the quiescent state and minimizing DNA exposure to damage [9,10].

Following single-strand damage and double-strand breaks, cells employ DNA repair that consists of base excision repair, nucleotide excision repair, mismatch repair [1113], non-homologous end-joining, microhomology-mediated end-joining, and homologous recombination [1416]. Previous research has demonstrated that APE1/Ref-1, the mammalian apurinic/apyrimidinic endonuclease protein, functions as a scavenger to remove reactive oxidative species promoting DNA damage under CSC microenvironments (e.g. hypoxia, inflammatory cytokines). Attenuation of APE1/Ref-1 signals sensitizes prostate cancer, breast cancer and osteosarcoma cells to DNA damage agents [17]. Double-strand breaks are the most harmful type of DNA damage. The Rad50, MRE11, and Nbs1 (MRN) complex efficiently responds to double-strand breaks by binding to and stabilizing broken DNA ends [18,19]. Cancer cells also repair double-strand breaks by homology-directed recombination or non-homologous end-joining [20,21]. Non-homologous end-joining is an error-prone DNA repair mechanism that utilizes DNA end-binding proteins Ku70/Ku80 [22]. BRCA1, a tumor suppressor, also stabilizes the genome following DNA lesions, mutations, and deletions in cooperation with RAD51, which binds to double-strand break sites and actualizes cell cycle checking [2325].

In this study, we performed magnetic isolation of CD44+ colon CSCs for digital gene expression (DGE) profiling of the DNA repair pathway. Functional analysis of the genes related to DNA repair showed integral activation of base excision repair and mismatch repair for single-strand damage, and non-homologous end-joining and homologous recombination for double-strand breaks. Our study revealed that colon CSCs possessed excellent DNA repair capacity, which could explain, at least in part, their resistance to chemical and radiation therapy-induced DNA damage in colon cancer patients.

Materials and Methods

Cell culture

Colon cancer cell line HT29 was provided by Dr Qiangming Sun of the Institute of Medical Biology, Chinese Academy of Medical Sciences (Kunming, China) and cultured in Dulbecco's modified Eagle's medium (DMEM; Gibco, Shanghai, China) supplemented with 10% fetal bovine serum (FBS; Biological Industries, Wentworthville, Australia) in an incubator with 5% CO2 at 37 °C. The CD44+ colon CSCs from the HT29 cell line were maintained in DMEM/F12 medium (Gibco) containing 20 ng/ml epidermal growth factor (EGF; Miltenyi Biotec, Bergisch Gladbach, Germany), 10 ng/ml fibroblast growth factor 2 (FGF2; Miltenyi Biotec) and 2% B27 (Miltenyi Biotec). The CD44+ HT29 cell-derived tumorspheres were cultured in suspension cell culture plates (Beaver Bio, Suzhou, China). The CD44 HT29 cells were cultured in DMEM with 10% FBS, as described above.

Isolation of CD44+ colon CSCs

The CD44+ subpopulation of HT29 cells was isolated using magnetic beads according to standard procedures. In brief, HT29 cells were trypsinized to obtain an individual cell suspension and incubated with magnetic beads and anti-human CD44 antibodies (Miltenyi Biotec) for 10 min at 4°C. Cells were loaded onto LS columns and CD44+ HT29 cells were separated from other cells using a QuadroMACS™ Separator (Miltenyi Biotec).

RNA extraction

RNA samples of CD44+ and CD44 HT29 cells were prepared using an RNeasy Mini Kit (QIAGEN, Hilden, Germany). Briefly, RLT lysis buffer was added to cells, and the lysate was then well mixed with 70% ethanol by pipetting. The mixture was loaded into a spin column for RNA binding. The column was washed twice with 500 μl RW1, then washed twice with 500 μl PRE buffer. Finally, RNA was dissolved in RNase-free water for further use.

Quantitative real time fluorescence polymerase chain reaction (qPCR)

Reverse transcription (RT) reactions were conducted to obtain cDNAs using a Prime Script TM RT Reagent Kit (TaKaRa, Dalian, China). SYBR Primix Ex Taq TM (TaKaRa) was used for quantitative real time fluorescence polymerase chain reaction (qPCR). Experiments were carried out using a real-time PCR machine (Applied Biosystems, Alameda, USA). Sequence of primers is listed in Table 1.

Table 1.

Sequence of primers used in qPCR

GeneSequence of sense primerSequence of anti-sense primer
UNG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
MBD45′-CTGCTCAGTTTGGTGCTA-3′5′-ATCTTCCTGCTGTCTTCC-3′
SMUG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
POLσ5′-CGGAAATCAGCATCTTGTCA-3′5′-GGCGAAAGTCGGTATGGTAG-3′
MLH15′-TACCTTCTCAACACCACCAA-3′5′-GCATCTCAGCCTTCTTCTTC-3′
RFC15′-TGACGAAGCCTACAAGAT-3′5′-GTTCACTCCTTCCGCTAT-3′
MSH25′-ACGATGGATTTGGGTTAG-3′5′-CCAGGGCTTTCTGTTTAG-3′
LIG15′-CCCATCGGTCACATCCTT-3′5′-ATCCACCTCCTTGCGTTT-3′
KU705′-CTCTTGGCTGTGGTGTTC-3′5′-AGTAGTCAGATCCGTGGC-3′
RAD505′-TGTGGCGAAGTACCTAT-3′5′-TTTGTTGTTGGCTCATC-3′
POLλ5′-ATCTTCAGCCGCCTCCTT-3′5′-GGCACCACGATGATGTCC-3′
RAD515′-GCAGCACAGCAGAGGAAG-3′5′-GAGCAGAGTCCACCGAAG-3′
RAD525′-GGCTCAGTGTTATGCTTTGG-3′5′-TGCGGCTACTTATGTATTCT-3′
TOP35′-CCCGAACAGAAACAAACA-3′5′-GGAAATGGCGAACAATAA-3′
EME15′-CACCTGTCCCAGAAATAG-3′5′-TTGAGCCAGAGGAATAAA-3′
XPC5′-GAACCACCCTCTGTATGCC-3′5′-TTCTCCAAGCCTCACCACT-3′
XPD5′-TGCCGTTTCTGGGACTGG-3′5′-TACTGCGCCCGCACATAG-3′
DDB25′-GGACAAACCCACCTTCAT-3′5′-GACCACCATTCGGCTACT-3′
POLε5′-CAGTGGACGGATAAGATGGA-3′5′-CCTTTCTGGTCGCAATGTAG-3′
GeneSequence of sense primerSequence of anti-sense primer
UNG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
MBD45′-CTGCTCAGTTTGGTGCTA-3′5′-ATCTTCCTGCTGTCTTCC-3′
SMUG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
POLσ5′-CGGAAATCAGCATCTTGTCA-3′5′-GGCGAAAGTCGGTATGGTAG-3′
MLH15′-TACCTTCTCAACACCACCAA-3′5′-GCATCTCAGCCTTCTTCTTC-3′
RFC15′-TGACGAAGCCTACAAGAT-3′5′-GTTCACTCCTTCCGCTAT-3′
MSH25′-ACGATGGATTTGGGTTAG-3′5′-CCAGGGCTTTCTGTTTAG-3′
LIG15′-CCCATCGGTCACATCCTT-3′5′-ATCCACCTCCTTGCGTTT-3′
KU705′-CTCTTGGCTGTGGTGTTC-3′5′-AGTAGTCAGATCCGTGGC-3′
RAD505′-TGTGGCGAAGTACCTAT-3′5′-TTTGTTGTTGGCTCATC-3′
POLλ5′-ATCTTCAGCCGCCTCCTT-3′5′-GGCACCACGATGATGTCC-3′
RAD515′-GCAGCACAGCAGAGGAAG-3′5′-GAGCAGAGTCCACCGAAG-3′
RAD525′-GGCTCAGTGTTATGCTTTGG-3′5′-TGCGGCTACTTATGTATTCT-3′
TOP35′-CCCGAACAGAAACAAACA-3′5′-GGAAATGGCGAACAATAA-3′
EME15′-CACCTGTCCCAGAAATAG-3′5′-TTGAGCCAGAGGAATAAA-3′
XPC5′-GAACCACCCTCTGTATGCC-3′5′-TTCTCCAAGCCTCACCACT-3′
XPD5′-TGCCGTTTCTGGGACTGG-3′5′-TACTGCGCCCGCACATAG-3′
DDB25′-GGACAAACCCACCTTCAT-3′5′-GACCACCATTCGGCTACT-3′
POLε5′-CAGTGGACGGATAAGATGGA-3′5′-CCTTTCTGGTCGCAATGTAG-3′
Table 1.

Sequence of primers used in qPCR

GeneSequence of sense primerSequence of anti-sense primer
UNG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
MBD45′-CTGCTCAGTTTGGTGCTA-3′5′-ATCTTCCTGCTGTCTTCC-3′
SMUG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
POLσ5′-CGGAAATCAGCATCTTGTCA-3′5′-GGCGAAAGTCGGTATGGTAG-3′
MLH15′-TACCTTCTCAACACCACCAA-3′5′-GCATCTCAGCCTTCTTCTTC-3′
RFC15′-TGACGAAGCCTACAAGAT-3′5′-GTTCACTCCTTCCGCTAT-3′
MSH25′-ACGATGGATTTGGGTTAG-3′5′-CCAGGGCTTTCTGTTTAG-3′
LIG15′-CCCATCGGTCACATCCTT-3′5′-ATCCACCTCCTTGCGTTT-3′
KU705′-CTCTTGGCTGTGGTGTTC-3′5′-AGTAGTCAGATCCGTGGC-3′
RAD505′-TGTGGCGAAGTACCTAT-3′5′-TTTGTTGTTGGCTCATC-3′
POLλ5′-ATCTTCAGCCGCCTCCTT-3′5′-GGCACCACGATGATGTCC-3′
RAD515′-GCAGCACAGCAGAGGAAG-3′5′-GAGCAGAGTCCACCGAAG-3′
RAD525′-GGCTCAGTGTTATGCTTTGG-3′5′-TGCGGCTACTTATGTATTCT-3′
TOP35′-CCCGAACAGAAACAAACA-3′5′-GGAAATGGCGAACAATAA-3′
EME15′-CACCTGTCCCAGAAATAG-3′5′-TTGAGCCAGAGGAATAAA-3′
XPC5′-GAACCACCCTCTGTATGCC-3′5′-TTCTCCAAGCCTCACCACT-3′
XPD5′-TGCCGTTTCTGGGACTGG-3′5′-TACTGCGCCCGCACATAG-3′
DDB25′-GGACAAACCCACCTTCAT-3′5′-GACCACCATTCGGCTACT-3′
POLε5′-CAGTGGACGGATAAGATGGA-3′5′-CCTTTCTGGTCGCAATGTAG-3′
GeneSequence of sense primerSequence of anti-sense primer
UNG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
MBD45′-CTGCTCAGTTTGGTGCTA-3′5′-ATCTTCCTGCTGTCTTCC-3′
SMUG15′-GAGCCACATCGCAACTAC-3′5′-GGGCACCACTCACTTCTG-3′
POLσ5′-CGGAAATCAGCATCTTGTCA-3′5′-GGCGAAAGTCGGTATGGTAG-3′
MLH15′-TACCTTCTCAACACCACCAA-3′5′-GCATCTCAGCCTTCTTCTTC-3′
RFC15′-TGACGAAGCCTACAAGAT-3′5′-GTTCACTCCTTCCGCTAT-3′
MSH25′-ACGATGGATTTGGGTTAG-3′5′-CCAGGGCTTTCTGTTTAG-3′
LIG15′-CCCATCGGTCACATCCTT-3′5′-ATCCACCTCCTTGCGTTT-3′
KU705′-CTCTTGGCTGTGGTGTTC-3′5′-AGTAGTCAGATCCGTGGC-3′
RAD505′-TGTGGCGAAGTACCTAT-3′5′-TTTGTTGTTGGCTCATC-3′
POLλ5′-ATCTTCAGCCGCCTCCTT-3′5′-GGCACCACGATGATGTCC-3′
RAD515′-GCAGCACAGCAGAGGAAG-3′5′-GAGCAGAGTCCACCGAAG-3′
RAD525′-GGCTCAGTGTTATGCTTTGG-3′5′-TGCGGCTACTTATGTATTCT-3′
TOP35′-CCCGAACAGAAACAAACA-3′5′-GGAAATGGCGAACAATAA-3′
EME15′-CACCTGTCCCAGAAATAG-3′5′-TTGAGCCAGAGGAATAAA-3′
XPC5′-GAACCACCCTCTGTATGCC-3′5′-TTCTCCAAGCCTCACCACT-3′
XPD5′-TGCCGTTTCTGGGACTGG-3′5′-TACTGCGCCCGCACATAG-3′
DDB25′-GGACAAACCCACCTTCAT-3′5′-GACCACCATTCGGCTACT-3′
POLε5′-CAGTGGACGGATAAGATGGA-3′5′-CCTTTCTGGTCGCAATGTAG-3′

Digital gene expression profiling

DGE profiling was performed according to the manufacturer's protocols (Illumina, San Diego, USA). Total RNAs were pooled from two types of cells as described above, that is, the CD44+ CSCs (CD44+) and CD44 control cells (CD44), separately. Two sequencing libraries were generated according to the manufacturer's recommendations. The mRNAs were purified from total RNAs using poly-T oligo-attached magnetic beads. Fragmentation was carried out in NEB proprietary fragmentation buffer. First strand cDNA synthesis was performed using random hexamers. Second strand cDNA synthesis was performed using dNTPs, RNase H, and DNA polymerase I. The remaining overhangs were converted into blunt ends using a QiaQuick PCR kit (QIAGEN). After adenylation of the 3′ ends of the cDNA fragments, adaptor oligonucleotides were ligated to the cDNA fragments. The cDNA fragments with ligated adaptor molecules on both ends were then size-selected using agarose gel electrophoresis and amplified by PCR. Each library was sequenced on an Illumina HiSeqTM 2500 platform (Illumina).

Mapping and analysis of DGE reads

For each DGE library, clean reads were obtained by filtering out low-quality reads and rRNAs and removing adaptor sequences. The DGE reads were mapped onto the human reference genome using TopHat v 2.0.11 (http://ccb.jhu.edu/software/tophat/index.shtml) [26]. The gene expression level was analyzed using Cufflinks v 2.1.1 (http://cole-trapnell-lab.github.io/cufflinks/releases/v2.1.1/) [27] and measured by reads per kilobase per million mapped reads (RPKM) [28]. An RPKM threshold value of 0.1 was established to determine whether or not the given genes were expressed. EBSeq (Bioconductor 3.3; http://www.bioconductor.org) was used for differentially expressed gene analysis [29]. The P-values were corrected by the Benjamini-Hochberg method [30]. Genes with more than a 2-fold change (FDR < 0.01) were identified as differentially expressed genes.

Functional analysis of differentially expressed genes

The differentially expressed genes were annotated using the non-redundant (NR), Swiss-Prot [31], Gene Ontology (GO) [32], Cluster of Orthologous Groups (COG) [33], and Kyoto Encyclopedia of Genes and Genomes (KEGG) [34] databases with BLAST searching [35]. Enrichment analysis was performed on different types of gene annotations for various gene lists using the Fisher's exact test [36] adjusted by the Bonferroni correction [37].

Results and Discussion

Digital gene expression sequencing of CD44+ colon CSCs

To obtain comprehensive transcripts of the CD44+ colon CSCs and an overview of their gene expression profiles, total RNAs were isolated from the CD44+ colon CSCs (CD44+) and CD44 control cells (CD44). DGE profiling was performed using the Illumina platform. A total of 9,957,439 and 8,463,165 reads were obtained for CD44+ and CD44, respectively, corresponding to more than 1.7 billion base pairs of sequence data (Table 2). Sequence quality and GC content distribution were inspected (Supplementary Figs. S1 and S2), and results showed that DEG sequencing possessed high quality score and exactly matched GC content distribution.

Table 2.

DGE data statistics

SampleTotal readsTotal nucleotidesGC percentageQ30 percentageMappedUnique mappedMultiple mapped
CD44+9,957,439914,801,33950.66%91.19%7,602,5887,390,377212,211
CD448,463,165854,779,66551.66%89.72%7,045,2726,846,050199,222
SampleTotal readsTotal nucleotidesGC percentageQ30 percentageMappedUnique mappedMultiple mapped
CD44+9,957,439914,801,33950.66%91.19%7,602,5887,390,377212,211
CD448,463,165854,779,66551.66%89.72%7,045,2726,846,050199,222
Table 2.

DGE data statistics

SampleTotal readsTotal nucleotidesGC percentageQ30 percentageMappedUnique mappedMultiple mapped
CD44+9,957,439914,801,33950.66%91.19%7,602,5887,390,377212,211
CD448,463,165854,779,66551.66%89.72%7,045,2726,846,050199,222
SampleTotal readsTotal nucleotidesGC percentageQ30 percentageMappedUnique mappedMultiple mapped
CD44+9,957,439914,801,33950.66%91.19%7,602,5887,390,377212,211
CD448,463,165854,779,66551.66%89.72%7,045,2726,846,050199,222

Sequencing data saturation analysis was performed to confirm whether gene expression increased proportionally to the number of sequence reads. Results showed that the number of detected genes (19,640 and 18,674 for CD44+ and CD44, respectively) approached saturation (Fig. 1) for the CD44+ and CD44 samples. In addition, high transcript homogeneity of gene expression profiling confirmed that the library construction was excellent (Supplementary Fig. S3). Statistical analysis (TopHat-assembly module in Maq) demonstrated that ~70% of total reads were mapped to exons (Supplementary Fig. S4), suggesting that gene annotation could be further improved.
Saturation curve analysis of the digital gene expression profiling of CD44+ and CD44− samples
Figure 1.

Saturation curve analysis of the digital gene expression profiling of CD44+ and CD44 samples

Identification of differential gene expression

Gene expression levels of CD44+ and CD44 populations of HT29 cells were calculated and normalized using RPKM. The RPKM distribution and density distribution were compared and were shown in Supplementary Figs. S5 and S6, respectively. Correlation testing between CD44+ and CD44 samples (Supplementary Fig. S7) showed that few differentially expressed genes existed between the CD44+ and CD44 samples. Fold-changes and P-values were used to compare the gene expression levels between the two samples. Comparative analysis identified a total of 6249 differentially expressed genes. Among them, 3115 were upregulated and 3134 were downregulated in CD44+ compared with CD44 subpopulation of HT29 cells.

Functional analysis of differentially expressed genes

Functional annotations of gene expression profiling were assigned using the BLAST tool, with most differentially expressed genes able to be matched in the databases (Table 3). Among them, COG provided classified functions based on genome-scale orthology analysis. The COG functional analysis showed that most genes differentially expressed in CD44+ were related to replication, recombination and repair (function category L) (Fig. 2). The other three top function categories included transcription (K), signal transduction mechanisms (T), and translation, ribosomal structure and biogenesis (J). These results suggest an active cellular process in CD44+ CSCs with complicated regulations possibly related to cell survival.
COG function classification of differentially expressed genes The function categories are listed on the right.
Figure 2.

COG function classification of differentially expressed genes The function categories are listed on the right.

Table 3.

Functional annotation statistics of differentially expressed genes

TotalNRSwiss-ProtGOKEGGCOG
624962336201611133912379
TotalNRSwiss-ProtGOKEGGCOG
624962336201611133912379
Table 3.

Functional annotation statistics of differentially expressed genes

TotalNRSwiss-ProtGOKEGGCOG
624962336201611133912379
TotalNRSwiss-ProtGOKEGGCOG
624962336201611133912379
GO provides a unified and controlled representation of gene and gene product attributes. We performed GO functional enrichment analysis to interpret the functions of the differentially expressed genes (Fig. 3). The most enriched GO terms (biological processes) are listed in Table 4. Compared with the gene expression in CD44 cells, the significantly regulated (both up and down) genes in CD44+ cells were associated with biological processes, such as cell cycle, organ development, cell division and DNA replication.
GO functional enrichment of differentially expressed genes The distribution of all genes encoded in the reference genome is shown in red. The distribution of differentially expressed genes is shown in blue.
Figure 3.

GO functional enrichment of differentially expressed genes The distribution of all genes encoded in the reference genome is shown in red. The distribution of differentially expressed genes is shown in blue.

The KEGG pathway can be used for understanding high-level functions of genes and biological systems. We completed KEGG pathway analysis of the differentially expressed genes and the top 20 enriched KEGG pathways are shown in Fig. 4. Consistent with GO analysis, cell cycle and DNA replication were the most enriched KEGG pathways. Interestingly, cancer-related pathways, signaling pathways and adherens junction-related pathways were enriched in the differentially expressed genes of CD44+ cells.
KEGG pathway enrichment of differentially expressed genes The top right corner shows the most enriched KEGG pathways.
Figure 4.

KEGG pathway enrichment of differentially expressed genes The top right corner shows the most enriched KEGG pathways.

Table 4.

GO functional enrichment (biological process) of differentially expressed genes

GO termNumberCorrelated P-value
G1/S transition of mitotic cell cycle (GO:0000082)1041.49E-06
Organ development (GO:0048513)2261.63E-06
Regulation of DNA-dependent DNA replication initiation (GO:0030174)203.84E-06
DNA replication initiation (GO:0006270)391.51E-05
Single-organism developmental process (GO:0044767)3531.67E-05
Chromosome segregation (GO:0007059)882.25E-05
Cell division (GO:0051301)2720.001320845
DNA replication (GO:0006260)930.001664745
Gene silencing (GO:0016458)290.001893301
DNA strand elongation involved in DNA replication (GO:0006271)330.002818764
Activation of protein kinase activity (GO:0032147)420.003097855
Pyrimidine nucleoside salvage (GO:0043097)160.003181955
Cell cycle process (GO:0022402)800.006871153
Regulation of cell cycle (GO:0051726)1090.007973628
Cell cycle arrest (GO:0007050)1080.008742882
Negative regulation of transcription from RNA polymerase II promoter (GO:0000122)4330.018076542
Neurotrophin TRK receptor signaling pathway (GO:0048011)2260.020824833
Regulation of phosphorylation (GO:0042325)260.023171996
Immune system process (GO:0002376)830.024763176
G2/M transition of mitotic cell cycle (GO:0000086)1270.028641706
GO termNumberCorrelated P-value
G1/S transition of mitotic cell cycle (GO:0000082)1041.49E-06
Organ development (GO:0048513)2261.63E-06
Regulation of DNA-dependent DNA replication initiation (GO:0030174)203.84E-06
DNA replication initiation (GO:0006270)391.51E-05
Single-organism developmental process (GO:0044767)3531.67E-05
Chromosome segregation (GO:0007059)882.25E-05
Cell division (GO:0051301)2720.001320845
DNA replication (GO:0006260)930.001664745
Gene silencing (GO:0016458)290.001893301
DNA strand elongation involved in DNA replication (GO:0006271)330.002818764
Activation of protein kinase activity (GO:0032147)420.003097855
Pyrimidine nucleoside salvage (GO:0043097)160.003181955
Cell cycle process (GO:0022402)800.006871153
Regulation of cell cycle (GO:0051726)1090.007973628
Cell cycle arrest (GO:0007050)1080.008742882
Negative regulation of transcription from RNA polymerase II promoter (GO:0000122)4330.018076542
Neurotrophin TRK receptor signaling pathway (GO:0048011)2260.020824833
Regulation of phosphorylation (GO:0042325)260.023171996
Immune system process (GO:0002376)830.024763176
G2/M transition of mitotic cell cycle (GO:0000086)1270.028641706
Table 4.

GO functional enrichment (biological process) of differentially expressed genes

GO termNumberCorrelated P-value
G1/S transition of mitotic cell cycle (GO:0000082)1041.49E-06
Organ development (GO:0048513)2261.63E-06
Regulation of DNA-dependent DNA replication initiation (GO:0030174)203.84E-06
DNA replication initiation (GO:0006270)391.51E-05
Single-organism developmental process (GO:0044767)3531.67E-05
Chromosome segregation (GO:0007059)882.25E-05
Cell division (GO:0051301)2720.001320845
DNA replication (GO:0006260)930.001664745
Gene silencing (GO:0016458)290.001893301
DNA strand elongation involved in DNA replication (GO:0006271)330.002818764
Activation of protein kinase activity (GO:0032147)420.003097855
Pyrimidine nucleoside salvage (GO:0043097)160.003181955
Cell cycle process (GO:0022402)800.006871153
Regulation of cell cycle (GO:0051726)1090.007973628
Cell cycle arrest (GO:0007050)1080.008742882
Negative regulation of transcription from RNA polymerase II promoter (GO:0000122)4330.018076542
Neurotrophin TRK receptor signaling pathway (GO:0048011)2260.020824833
Regulation of phosphorylation (GO:0042325)260.023171996
Immune system process (GO:0002376)830.024763176
G2/M transition of mitotic cell cycle (GO:0000086)1270.028641706
GO termNumberCorrelated P-value
G1/S transition of mitotic cell cycle (GO:0000082)1041.49E-06
Organ development (GO:0048513)2261.63E-06
Regulation of DNA-dependent DNA replication initiation (GO:0030174)203.84E-06
DNA replication initiation (GO:0006270)391.51E-05
Single-organism developmental process (GO:0044767)3531.67E-05
Chromosome segregation (GO:0007059)882.25E-05
Cell division (GO:0051301)2720.001320845
DNA replication (GO:0006260)930.001664745
Gene silencing (GO:0016458)290.001893301
DNA strand elongation involved in DNA replication (GO:0006271)330.002818764
Activation of protein kinase activity (GO:0032147)420.003097855
Pyrimidine nucleoside salvage (GO:0043097)160.003181955
Cell cycle process (GO:0022402)800.006871153
Regulation of cell cycle (GO:0051726)1090.007973628
Cell cycle arrest (GO:0007050)1080.008742882
Negative regulation of transcription from RNA polymerase II promoter (GO:0000122)4330.018076542
Neurotrophin TRK receptor signaling pathway (GO:0048011)2260.020824833
Regulation of phosphorylation (GO:0042325)260.023171996
Immune system process (GO:0002376)830.024763176
G2/M transition of mitotic cell cycle (GO:0000086)1270.028641706

DNA single-strand damage repair provision in colon CSCs

In the cellular base excision repair process, a C-terminal mismatch-specific glycosylase of the 5-methylcytosine binding domain (MBD) binds specifically to methylated DNA for DNA repair [38]. The DNA glycosylases UNG and SMUG1 both cleave the N-glycosylic bond of genomic DNA for deleting deficient or mismatched uracil bases. Furthermore, adenine nucleobases with oxidative lesions can be excised by the glycosylase MUTY [39,40]. In the current study, DGE showed that the UNG, SMUG1 and MUTY genes (Fig. 5A, labeled in green) significantly increased (fold change by 8.28, 4.37, and 5.38, and false discovery rate by 5.55 × 10−16, 2.29 × 10−3, and 9.61 × 10−6, respectively) in colon CSCs (HT29/CD44+) compared with colon non-CSCs (HT29/CD44) (Fig. 5B). The preponderances of MBD4, UNG, SMUG1 and MUTY might endue colon CSCs with excellent DNA lesion recognition and removal capabilities.
Base excision repair pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of base excision repair genes. ENST number (UNG, ENST00000242576; MUTY, ENST00000354383; MBD4-1, ENST00000503197; MBD4-2, ENST00000515266; TDGMug, ENST00000266775 SMUG-1, ENST00000508394; SMUG-2, ENST00000337581; SMUG-3, ENST00000401977; polσ-1, ENST00000376930; polσ-2, ENST00000262266; polσ-3, ENST00000395803; polε-1, ENST00000376577; polε-2, ENST00000320574; polε-3, ENST00000216367; polε-4, ENST00000374171; Lig1, ENST00000263274).
Figure 5.

Base excision repair pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of base excision repair genes. ENST number (UNG, ENST00000242576; MUTY, ENST00000354383; MBD4-1, ENST00000503197; MBD4-2, ENST00000515266; TDGMug, ENST00000266775 SMUG-1, ENST00000508394; SMUG-2, ENST00000337581; SMUG-3, ENST00000401977; polσ-1, ENST00000376930; polσ-2, ENST00000262266; polσ-3, ENST00000395803; polε-1, ENST00000376577; polε-2, ENST00000320574; polε-3, ENST00000216367; polε-4, ENST00000374171; Lig1, ENST00000263274).

The heterodimers of MLH1 and MLH3 sustain genomic stability during DNA replication and after meiotic recombination [41]. Replication factor C (RFC), a DNA-dependent ATPase, activates DNA polymerase and boosts cooperative synthesis of both DNA strands [42]. MSH2 heterodimers with MSH6 take on a highly conserved Walker-A adenine nucleotide binding motif of 150 amino acids and distinguish mismatched DNA and ADP and ATP exchange [43,44]. MSH2 also interacts with EXO1 with 5′ to 3′ exonuclease activity for DNA mismatch repair and recombination [45]. In the colon CSCs (HT29/CD44+), the significantly upregulated MLH1 and MSH2 (Fig. 6A,B) suggest that the major components of the mismatch recognition complex remain alert to locate genomic abnormalities where bases are not paired correctly during DNA replication. Following the success of recognition, Exo1 (Fig. 6A,B, labeled in green, fold change by 6.98 and false discovery rate by 6.19 × 10−11) was expected to excise the discriminated bases.
Mismatch repair pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of mismatch repair genes. ENST number (MLH1, ENST00000458205; MLH3-1, ENST00000355774; MLH3-2, ENST00000238662; RFC-1, ENST00000449502; RFC-2, ENST00000463194; RFC-3, ENST00000454402; MSH2, ENST00000394794 MSH6, ENST00000445503; Exo1-1, ENST00000366547; Exo1-2, ENST00000423397; polσ-1, ENST00000376930; polσ-2, ENST00000262266; polσ-3, ENST00000395803; Lig1, ENST00000263274).
Figure 6.

Mismatch repair pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of mismatch repair genes. ENST number (MLH1, ENST00000458205; MLH3-1, ENST00000355774; MLH3-2, ENST00000238662; RFC-1, ENST00000449502; RFC-2, ENST00000463194; RFC-3, ENST00000454402; MSH2, ENST00000394794 MSH6, ENST00000445503; Exo1-1, ENST00000366547; Exo1-2, ENST00000423397; polσ-1, ENST00000376930; polσ-2, ENST00000262266; polσ-3, ENST00000395803; Lig1, ENST00000263274).

High capacities of colon CSCs to repair DNA double-strand breaks

Single-strand DNA-dependent helicase of the KU70/KU80 complex participates in the repair of non-homologous DNA ends [46]. The MRN complex possesses enzymatic activities for non-homologous end-joining, telomere maintenance, and meiotic recombination [47]. DNA polymerase polλ catalyzes the extension of the 3′-end of a DNA strand in non-homologous end-joining and other repair processes [48]. In the present study, CD44+ HT29 cells showed upregulation of KU70 (Fig. 7A,B, labeled in green, fold change by 2.24, and false discovery rate by 6.01 × 10−3), which is responsible for end binding and processing of double-strand breaks. Furthermore, the significant upregulation of polλ (Fig. 7A,B, labeled in blue, fold change by 4.37, and false discovery rate by 2.29 × 10−3) might play a critical role in the repair of breaks in DNA strands.
Non-homologous end-joining pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of non-homologous end-joining genes. ENST number (ku70, ENST00000402580; Rad50, ENST00000453394; polλ-1, ENST00000429502; polλ-2, ENST00000456836).
Figure 7.

Non-homologous end-joining pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of non-homologous end-joining genes. ENST number (ku70, ENST00000402580; Rad50, ENST00000453394; polλ-1, ENST00000429502; polλ-2, ENST00000456836).

In early homologous-recombining DNA repair, the MRN complex in double-strand break repair deals with checkpoint activation [49]. The RAD51D, RAD51L1, RAD51L2 and XRCC2 complex catalyzes the pairing of single-strand DNA with double-strand DNA [50]. BRCA2 interacts with RAD51 and DSS1 and mediates the localization and DNA-binding ability of RAD52 [51,52]. The binding of RAD54 to double-strand DNA induces DNA topological change, resulting in homologous DNA pairing and recombination [53]. DNA topoisomerase 3 (TOP3) prompts supercoil construction and DNA topology change by catalyzing the breaking and rejoining of single-strand DNA [54]. Finally, essential meiotic structure-specific endonuclease 1 (EME1) binds to specific DNA constructs, including aberrant replication fork structures, nicked Holliday junctions and 3′-flap structures, for homologous recombination [55]. Our results showed an upregulation in Nbs1 (Fig. 8A,B, labeled in green, fold change by 6.72 and false discovery rate by 5.40 × 10−10) of the MRN complex in CD44+ HT29 cells, thus providing colon CSCs with greater capacity to initiate resection after double-strand breakage of genomic DNA. Upregulated Rad51D (Fig. 8A,B, labeled in green, fold change by 5.49, and false discovery rate by 4.58 × 10−6) and paralogs tend to form Rad51 filaments, which could be provoked by specific DNA binding of Rad52 (Fig. 8A,B, labeled in green, fold change by 4.45, and false discovery rate by 1.59 × 10−3). Rad54 was upregulated (fold change by 2.89, and false discovery rate by 3.52 × 10−4) in the CD44+ HT29 cells, thus demonstrating a strong potential to induce double-strand DNA topological change and D-loop formation for homologous recombination. The upregulation of Eme1 (Fig. 8A,B, labeled in green, fold change by 4.37, and false discovery rate by 2.29 × 10−3) might support the localization of colon CSCs to Holliday junctions and the formation of replication fork structures.
Homologous recombination pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of homologous recombination genes. ENST number (Rad50, ENST00000453394; Rad51, ENST00000423169; Rad51D-1, ENST00000360276; Rad51D-2, ENST00000394589; Rad51D-3, ENST00000494406; Rad52, ENST00000481052; Rad54, ENST00000371975; Nbs1, ENST00000265433; DSS1, ENST00000482389; polσ−1, ENST00000376930; polσ−2, ENST00000262266; polσ−3, ENST00000395803; TOP3-1, ENST00000424393; TOP3-2, ENST00000457270; TOP3-3, ENST00000357179; TOP3-4, ENST00000403680; Eme1-1, ENST00000510246; Eme1-2, ENST00000511519).
Figure 8.

Homologous recombination pathway in colon CSCs (A) KEGG pathway. Green, upregulated; Red, downregulated; Blue, mix regulated. (B) Histogram of mRNA levels of homologous recombination genes. ENST number (Rad50, ENST00000453394; Rad51, ENST00000423169; Rad51D-1, ENST00000360276; Rad51D-2, ENST00000394589; Rad51D-3, ENST00000494406; Rad52, ENST00000481052; Rad54, ENST00000371975; Nbs1, ENST00000265433; DSS1, ENST00000482389; polσ−1, ENST00000376930; polσ−2, ENST00000262266; polσ−3, ENST00000395803; TOP3-1, ENST00000424393; TOP3-2, ENST00000457270; TOP3-3, ENST00000357179; TOP3-4, ENST00000403680; Eme1-1, ENST00000510246; Eme1-2, ENST00000511519).

Further qPCR confirmation and nucleotide excision repair pathway analysis

Previous studies have shown that activation of CD44 is associated with DNA repair activities [5658]. In this study, 186 DNA repair-related genes were found to be significantly upregulated in CD44+ cells, and part of them playing key role in DNA repair was shown in this study (Figs. 58 andSupplementary Fig. S9). They were distributed in various DNA repair pathways, including base excision repair, nucleotide excision repair, mismatch repair, homologous recombination and non-homologous end-joining. Further qPCR analysis also confirmed that CD44 antigen positive population in HT29 cells exhibit excellent DNA repair capacity (Supplementary Fig. S9).

Core basal transcription factor 2 (TFIIH), which is composed of TTDA, XPB, XPD and TFIIH proteins, is complexed to CAK. When UV damage occurs, the Xeroderma Pigmentosum Group C (XPC) complex initiates damage recognition and opens complex formation. The heterodimeric DNA damage-binding (DDB) complex binds to recognized DNA, followed by repair protein complex formation [59,60]. We found that DDB1, XPB, CDK7 and CCNH (Supplementary Fig. S8A,B, labeled in red) which provide vital function in initiation of nucleotide excision repair are downregulated. This result showed that colon CSCs do not burden preponderant capacity of nucleotide excision repair, suggesting that colon CSCs may depend on base excision repair and mismatch repair to fix single-strand breaks instead of nucleotide excision repair.

In summary, our study suggests that colon CSCs activate DNA repair-related signaling, including base excision repair, nucleotide excision repair, mismatch repair, non-homologous end-joining and homologous recombination for their genomic stability. Our results provide new insights into our understanding of radiation and chemotherapy resistance in colon CSCs.

Supplementary Data

Supplementary data is available at ABBS online.

Funding

This work was supported by the grants from the National Basic Research Program of China (973 Program) (No. 2013CB127500), the National Natural Science Foundation of China (Nos. 31270131, 31160021, 81271330, 31160237, and 31260276), the Yunnan Province Science and Technology Innovation Team (No. 2011CI123), the Talent Program of Yunnan Province (No. W8110305), the Foundation of School of Life Sciences of Yunnan University (No. 2012S301), the Natural Science Foundation of Yunnan Province (No. 2011BC003), the Science Foundation of Education Department of Yunnan Province (No. ZD2015002), and the Yunnan Province New Academic Talent Award 2014 (No. YN201418).

References

1

Colon Cancer Treatment (PDQ®)
. National Cancer Institute
2014
, Retrieved 29 June 2014.

2

Clevers
H
.
The cancer stem cell: premises, promises and challenges
.
Nat Med
2011
,
17
:
313
319
.

3

Cheung-Ong
K
,
Giaever
G
,
Nislow
C
.
DNA-damaging agents in cancer chemotherapy: serendipity and chemical biology
.
Chem Biol
2013
,
20
:
648
659
.

4

Jackson
SP
,
Bartek
J
.
The DNA-damage response in human biology and disease
.
Nature
2009
,
461
:
1071
1078
.

5

Srivastava
AK
,
Han
C
,
Zhao
R
,
Cui
T
,
Dai
Y
,
Mao
C
,
Zhao
W
, et al. .
Enhanced expression of DNA polymerase eta contributes to cisplatin resistance of ovarian cancerstem cells
.
Proc Natl Acad Sci USA
2015
,
14
:
4411
4416
.

6

Peitzsch
C
,
Kurth
I
,
Kunz-Schughart
L
,
Baumann
M
,
Dubrovska
A
.
Discovery ofthe cancer stem cell related determinants of radioresistance
.
Radiother Oncol
2013
,
108
:
378
387
.

7

Yin
H
,
Glass
J
.
The phenotypic radiation resistance of CD44+/CD24(-or low) breast cancer cells is mediated through the enhanced activation of ATM signaling
.
PLoS One
2011
,
6
:
e24080
.

8

Gallmeier
E
,
Hermann
PC
,
Mueller
MT
,
Machado
JG
,
Ziesch
A
,
De Toni
EN
,
Palagyi
A
, et al. .
Inhibition of ataxia telangiectasia- and Rad3-related function abrogatesthe in vitro and in vivo tumorigenicity of human colon cancer cells throughdepletion of the CD133(+) tumor-initiating cell fraction
.
Stem Cells
2011
,
29
:
418
429
.

9

Bonnet
D
,
Dick
JE
.
Human acute myeloid leukemia is organized as a hierarchy that originates from a primitive hematopoietic cell
.
Nat Med
1997
,
3
:
730
737
.

10

Al-Hajj
M
,
Wicha
MS
,
Benito-Hernandez
A
,
Morrison
SJ
,
Clarke
MF
.
Prospective identification of tumorigenic breast cancer cells
.
Proc Natl Acad Sci USA
2003
,
100
:
3983
3988
.

11

Willey
J
,
Sherwood
L
,
Woolverton
C
.
Prescott's Microbiology
.
New York, NY
:
McGraw Hill
,
2014
,
381
.

12

Reardon
J
,
Sancar
A
.
Purification and characterization of Escherichia coli and human nucleotide excision repair enzyme systems
.
Methods Enzymol
2006
,
408
:
189
213
.

13

Berg
M
,
Tymoczko
J
,
Stryer
L
.
Biochemistry
, 7th edn,
New York
:
W.H. Freeman and Company
,
2012
,
840
.

14

Wilson
TE
,
Grawunder
U
,
Lieber
MR
.
Yeast DNA ligase IV mediates non-homologous DNA end joining
.
Nature
1997
,
388
:
495
498
.

15

Truong
LN
,
Li
Y
,
Shi
LZ
,
Hwang
PY
,
He
J
,
Wang
H
,
Razavian
N
, et al. .
Microhomology-mediated end joining and homologous recombination share the initial end resection step to repair DNA double-strand breaks in mammalian cells
.
Proc Natl Acad Sci USA
2013
,
110
:
7720
7725
.

16

Liang
L
,
Deng
L
,
Chen
Y
,
Li
GC
,
Shao
C
,
Tischfield
JA
.
Modulation of DNA end joining by nuclear proteins
.
J Biol Chem
2005
,
280
:
31442
31449
.

17

Skvortsova
I
,
Skvortsov
S
,
Stasyk
T
,
Raju
U
,
Popper
BA
,
Schiestl
B
,
von Guggenberg
E
, et al. .
Intracellular signaling pathways regulating radioresistance of human prostate carcinomacells
.
Proteomics
2008
,
8
:
4521
4533
.

18

Williams
GJ
,
Lees-Miller
SP
,
Tainer
JA
.
Mre11-Rad50-Nbs1 conformations andthe control of sensing, signaling, and effector responses at DNA double-strand breaks
.
DNA Repair (Amst)
2010
,
9
:
1299
1306
.

19

Bruhn
C
,
Zhou
ZW
,
Ai
H
,
Wang
ZQ
.
The essential function of the MRN com-plex in the resolution of endogenous replication intermediates
.
Cell Rep
2014
,
6
:
182
195
.

20

Brandsma
I
,
Gent
DC
.
Pathway choice in DNA double strand break repair:observations of a balancing act
.
Genome Integr
2012
,
3
:
9
.

21

Thompson
LH
.
Recognition, signaling, and repair of DNA double-strand breaksproduced by ionizing radiation in mammalian cells: the molecular choreography
.
Mutat Res
2012
,
751
:
158
246
.

22

Jackson
SP
.
Sensing and repairing DNA double-strand breaks
.
Carcinogenesis
2002
,
23
:
687
696
.

23

Venkitaraman
AR
.
Cancer susceptibility and the functions of BRCA1 and BRCA2
.
Cell
2002
,
108
:
171
182
.

24

Bunting
SF
,
Callén
E
,
Wong
N
,
Chen
HT
,
Polato
F
,
Gunn
A
,
Bothmer
A
, et al. .
53BP1 inhibits homologous recombination in Brca1-deficient cells by blocking resection of DNA breaks
.
Cell
2010
,
141
:
243
254
.

25

Kakarougkas
A
,
Jeggo
P
.
DNA DSB repair pathway choice: an orchestrated handover mechanism
.
Br J Radiol
2014
,
87
:
20130685
.

26

Trapnell
C
,
Pachter
L
,
Salzberg
SL
.
Top Hat: discovering splice junctions with RNA-Seq
.
Bioinformatics
2009
,
25
:
1105
1111
.

27

Trapnell
C
,
Williams
BA
,
Pertea
G
,
Mortazavi
A
,
Kwan
G
,
van Baren
MJ
,
Salzberg
SL
, et al. .
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
.
Nat Biotechnol
2010
,
28
:
511
515
.

28

Mortazavi
A
,
Williams
BA
,
McCue
K
,
Schaeffer
L
,
Wold
B
.
Mapping and quantifying mammalian transcriptomes by RNA-Seq
.
Nat Methods
2008
,
5
:
621
628
.

29

Leng
N
,
Dawson
JA
,
Thomson
JA
,
Ruotti
V
,
Rissman
AI
,
Smits
BM
,
Haag
JD
, et al. .
EBSeq: an empirical Bayes hierarchical model for inference in RNA-seq experiments
.
Bioinformatics
2013
,
29
:
1035
1043
.

30

Klipper-Aurbach
Y
,
Wasserman
M
,
Braunspiegel-Weintrob
N
,
Borstein
D
,
Peleg
S
,
Assa
S
,
Karp
M
, et al. .
Mathematical formulae for the prediction of the residual beta cell function during the first two years of disease in children and adolescents with insulin-dependent diabetes mellitus
.
Med Hypotheses
1995
,
45
:
486
490
.

31

Apweiler
R
,
Bairoch
A
,
Wu
CH
,
Barker
WC
,
Boeckmann
B
,
Ferro
S
,
Gasteiger
E
, et al. .
UniProt: the universal protein knowledgebase
.
Nucleic Acids Res
2004
,
32
:
D115
D119
.

32

Ashburner
M
,
Ball
CA
,
Blake
JA
,
Botstein
D
,
Butler
H
,
Cherry
JM
,
Davis
AP
, et al. .
Gene ontology: tool for the unification of biology. The Gene Ontology Consortium
.
Nat Genetics
2000
,
25
:
25
29
.

33

Tatusov
RL
,
Fedorova
ND
,
Jackson
JD
,
Jacobs
AR
,
Kiryutin
B
,
Koonin
EV
,
Krylov
DM
, et al. .
The COG database: an updated version includes eukaryotes
.
BMC Bioinformatics
2003
,
4
:
41
.

34

Kanehisa
M
,
Goto
S
,
Sato
Y
,
Furumichi
M
,
Tanabe
M
.
KEGG for integration and interpretation of large-scale molecular data sets
.
Nucleic Acids Res
2012
,
40
:
D109
D114
.

35

Altschul
SF
,
Madden
TL
,
Schäffer
AA
,
Zhang
J
,
Zhang
Z
,
Miller
W
,
Lipman
DJ
.
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
.
Nucleic Acids Res
1997
,
25
:
3389
3402
.

36

Liao
YY
,
Lee
TS
,
Lin
YM
.
A Fisher exact test will be more proper
.
Radiology
2006
,
239
:
300
301
.

37

Dunn
OJ
.
Multiple comparisons among means
.
J Am Stat Assoc
1961
,
56
:
52
64
.

38

Zhang
W
,
Liu
Z
,
Crombet
L
,
Amaya
MF
,
Liu
Y
,
Zhang
X
,
Kuang
W
, et al. .
Crystal structure of the mismatch-specific thymine glycosylase domain of human methyl-CpG-binding protein MBD4
.
Biochem Biophys Res Commun
2011
,
412
:
425
428
.

39

Lühnsdorf
B
,
Epe
B
,
Khobta
A
.
Excision of uracil from transcribed DNA negatively affects gene expression
.
J Biol Chem
2014
,
289
:
22008
22018
.

40

Abdel-Fatah
TM
,
Albarakati
N
,
Bowell
L
,
Agarwal
D
,
Moseley
P
,
Hawkes
C
,
Ball
G
, et al. .
Single-strand selective monofunctional uracil-DNA glycosylase (SMUG1) deficiency is linked to aggressive breast cancer and predicts response to adjuvant therapy
.
Breast Cancer Res Treat
2013
,
142
:
515
527
.

41

Roesner
LM
,
Mielke
C
,
Fähnrich
S
,
Merkhoffer
Y
,
Dittmar
KE
,
Drexler
HG
,
Dirks
WG
.
Stable expression of MutLγ in human cells reveals no specific response to mismatched DNA, but distinct recruitment to damage sites
.
J Cell Biochem
2013
,
114
:
2405
2414
.

42

Tang
H
,
Hilton
B
,
Musich
PR
,
Fang
DZ
,
Zou
Y
.
Replication factor C1, the large subunit of replication factor C, is proteolytically truncated in Hutchinson-Gilford progeria syndrome
.
Aging Cell
2012
,
11
:
363
365
.

43

Hoell
JI
,
Gombert
M
,
Ginzel
S
,
Loth
S
,
Landgraf
P
,
Käfer
V
,
Streiter
M
.
Constitutional mismatch repair-deficiency and whole-exome sequencing as the means of the rapid detection of the causative MSH6 defect
.
Klin Padiatr
2014
,
226
:
357
361
.

44

Hoell
JI
,
Gombert
M
,
Ginzel
S
,
Loth
S
,
Landgraf
P
,
Käfer
V
,
Streiter
M
, et al. .
Minor changes in expression of the mismatch repair protein MSH2 exert a major impact on glioblastoma response to temozolomide
.
Cancer Res
2015
,
75
:
3127
3138
.

45

Tomimatsu
N
,
Mukherjee
B
,
Deland
K
,
Kurimasa
A
,
Bolderson
E
,
Khanna
KK
,
Burma
S
.
Exo1 plays a major role in DNA end resection in humans and influences double-strand break repair and damage signaling decisions
.
DNA Repair (Amst)
2012
,
11
:
441
448
.

46

Jia
J
,
Ren
J
,
Yan
D
,
Xiao
L
,
Sun
R
.
Association between the XRCC6 polymorphisms and cancer risks: a systematic review and meta-analysis
.
Medicine (Baltimore)
2015
,
94
:
e283
.

47

Kinoshita
E
,
van Rossum-Fikkert
S
,
Sanchez
H
,
Kertokalio
A
,
Wyman
C
.
Human RAD50 makes a functional DNA-binding complex
.
Biochimie
2015
,
113
:
47
53
.

48

Vyas
R
,
Zahurancik
WJ
,
Suo
Z
.
Structural basis for the binding and incorporation of nucleotide analogs with L-stereochemistry by human DNA polymerase λ
.
Proc Natl Acad Sci USA
2014
,
111
:
e3033
e3042
.

49

Li
JT
,
Zhong
BY
,
Xu
HH
,
Qiao
SY
,
Wang
G
,
Huang
J
,
Fan
HZ
, et al. .
Associations between NBS1 polymorphisms and colorectal cancer in Chinese Population
.
PLoS One
2015
,
10
:
e0132332
.

50

Pelttari
LM
,
Kiiski
J
,
Nurminen
R
,
Kallioniemi
A
,
Schleutker
J
,
Gylfe
A
,
Aaltonen
LA
, et al. .
A Finnish founder mutation in RAD51D: analysis in breast, ovarian, prostate, and colorectal cancer
.
J Med Genet
2012
,
49
:
429
432
.

51

Wang
AT
,
Kim
T
,
Wagner
JE
,
Conti
BA
,
Lach
FP
,
Huang
AL
,
Molina
H
, et al. .
A dominant mutation in human RAD51 reveals its function in DNA interstrand crosslink repair independent of homologous recombination
.
Mol Cell
2015
,
59
:
478
490
.

52

Kristensen
CN
,
Bystol
KM
,
Li
B
,
Serrano
L
,
Brenneman
MA
.
Depletion of DSS1 protein disables homologous recombinational repair in human cells
.
Mutat Res
2010
,
694
:
60
64
.

53

Mazina
OM
,
Rossi
MJ
,
Thomaä
NH
,
Mazin
AV
.
Interactions of human rad54 protein with branched DNA molecules
.
J Biol Chem
2007
,
282
:
21068
21080
.

54

Wang
Y
,
Lyu
YL
,
Wang
JC
.
Dual localization of human DNA topoisomerase IIIalpha to mitochondria and nucleus
.
Proc Natl Acad Sci USA
2002
,
99
:
12114
12119
.

55

Taylor
ER
,
McGowan
CH
.
Cleavage mechanism of human Mus81-Eme1 acting on Holliday-junction structures
.
Proc Natl Acad Sci USA
2008
,
105
:
3757
3762
.

56

Chen
C
,
Chang
MC
,
Hsieh
RK
,
Chang
YF
,
Lin
J
,
Tsan
KW
.
Activation of CD44 facilitates DNA repair in T-cell lymphoma but has differential effects on apoptosis induced by chemotherapeutic agents and ionizing radiation
.
Leuk Lymphoma
2005
,
46
:
1785
1795
.

57

Yin
H
,
Glass
J
.
The phenotypic radiation resistance of CD44+/CD24(-or low) breast cancer cells is mediated through the enhanced activation of ATM signaling
.
PloS One
2011
,
6
:
e24080
.

58

Phillips
TM
,
McBride
WH
,
Pajonk
F
.
The response of CD24(-/low)/CD44+ breast cancer-initiating cells to radiation
.
J Natl Cancer Inst
2006
,
98
:
1777
1785
.

59

Li
T
,
Chen
X
,
Garbutt
KC
,
Zhou
P
,
Zheng
N
.
Structure of DDB1 in complex with a paramyxovirus V protein: viral hijack of a propeller cluster in ubiquitin ligase
.
Cell
2006
,
124
:
105
117
.

60

Peng
Q
,
Lao
X
,
Tang
W
,
Chen
Z
,
Li
R
,
Qin
X
,
Li
S
.
XPC Lys939Gln polymorphism contributes to colorectal cancer susceptibility: evidence from a meta-analysis
.
Diagn Pathol
2014
,
9
:
120
.

Author notes

These authors contributed equally to this work.

Supplementary data