Abstract

Background

The genetic contribution to inflammatory bowel disease (IBD), encompassing both Crohn’s disease (CD) and ulcerative colitis (UC), accounts for around 20% of disease variance, highlighting the need to characterize environmental and epigenetic influences. Recently, considerable progress has been made in characterizing the adult methylome in epigenome-wide association studies.

Methods

We report detailed analysis of the circulating methylome in 86 patients with childhood-onset CD and UC and 30 controls using the Illumina Infinium Human MethylationEPIC platform.

Results

We derived and validated a 4-probe methylation biomarker (RPS6KA2, VMP1, CFI, and ARHGEF3), with specificity and high diagnostic accuracy for pediatric IBD in UK and North American cohorts (area under the curve: 0.90-0.94). Significant epigenetic age acceleration is present at diagnosis, with the greatest observed in CD patients. Cis-methylation quantitative trait loci (meQTL) analysis identifies genetic determinants underlying epigenetic alterations notably within the HLA 6p22.1-p21.33 region. Passive smoking exposure is associated with the development of UC rather than CD, contrary to previous findings.

Conclusions

These data provide new insights into epigenetic alterations in IBD and illustrate the reproducibility and translational potential of epigenome-wide association studies in complex diseases.

1. Introduction

The inflammatory bowel diseases (IBDs), including Crohn’s disease (CD) and ulcerative colitis (UC), are common causes of chronic illness in both children and adults worldwide. Disease prevalence in the Western world is projected to reach 1% by 2030.1–3 Childhood-onset disease is typically characterized by extensive intestinal involvement and long-standing effects on health, educational attainment, and well-being,4–6 along with increased mortality.7

Both genetic and environmental factors are unequivocally implicated in disease pathogenesis. Linkage studies followed by genome-wide association studies (GWAS) have been successfully applied in IBD and, to date, have allowed the characterization of over 300 susceptibility loci,8 accounting for up to 20% of disease variance. More recently, the application of -omics technologies has allowed the exploration of the epigenetic contributions to complex diseases, with parallel aims of understanding gene-environmental mechanisms in pathogenesis and informing biomarker discovery. Epigenome-wide association studies (EWAS) and GWAS are complementary in defining disease biology,9,10 with a consideration of cell specificity and reverse causation.11 Successful translation of biomarkers derived from the epigenome has been applied in immune-mediated, malignant, neurodegenerative, and metabolic diseases.12,13

In previous work, we reported the profile of genome-wide DNA methylation in peripheral blood leukocytes at diagnosis from a cohort of newly diagnosed children with CD.14 This characteristic pattern of genome-wide alterations was replicated in adult inception patient cohorts from the United Kingdom and Scandinavia,15,16 allowing the definition of the circulating IBD methylome (differential methylation changes highly associated with the diagnosis of IBD) in adult-onset disease. Recently, this profile was also replicated in adult patients with medically refractory CD who required resection surgery.17 In these studies, site-specific DNA methylation changes found in peripheral blood in IBD patients are underpinned by genotypes and cell-specific alterations in gene expression. Meta-analysis has provided additional validation of robust methylation changes associated with diagnosis of IBD from whole blood.18

On the basis of evidence currently available, it appears that methylation alterations associated with IBD are influenced by genetic variations (methylation quantitative trait loci [meQTL]) and modulated by environmental exposures as well as by the presence or absence of active inflammation. Lifestyle factors, particularly nutrition, tobacco smoking, and alterations of the microbiome, are strongly implicated in IBD and known to exert significant effects on key epigenetics processes.19–21 The interplay between the exposome (totality of environmental exposures an individual is exposed to from conception throughout life) and genome in shaping epigenetic alterations in complex disease remains inadequately understood and necessitates further investigation.

We hypothesize that the environmental and genetic influences on epigenetic mechanisms may be most tractable in childhood-onset disease. In the present study, we, therefore, extend the previous research on the circulating methylome in children with IBD. We utilize these previously published data to define and validate a simple accurate diagnostic biomarker panel for IBD, which outperforms other blood-based markers, including C-reactive protein (CRP), and shows disease specificity compared with other immune diseases. Furthermore, we report epigenetic evidence for accelerated biological aging in affected children and define the contributions of germline variation, parental imprinting, and passive smoke exposure to the disease methylome using parent-child trios.

2. Methods

2.1. UK inception cohorts (UK1 and UK2 cohorts)

Children with pediatric IBD in this study were defined as those under 18 years of age, diagnosed in line with the modified Porto criteria.22

The UK1 inception cohort consisted of 36 children with CD and 36 controls, as described previously in Adams et al’s study.14

In the present study, a new inception cohort (UK2) totaling 86 patients with a diagnosis of IBD (33 CD, 31 UC, 22 IBD-unclassified [IBDU]; median age of 12 years [range: 0.9-17.5] [Supplementary Table]) was recruited from the pediatric clinics at John Radcliffe Hospital, Oxford, and Addenbrooke’s Hospital, Cambridge, United Kingdom. All patients were sampled within a year of diagnosis. Early disease progression was defined as the need for treatment escalation within 18 months of diagnosis and sample collection—specifically need for surgery or for biological therapy and/or more than 2 courses of steroids while on thiopurine therapy. A total of 30 non-IBD controls (children who were investigated for gastrointestinal [GI] symptoms without IBD or healthy siblings from other disease cohorts) with a median age of 9 years (range: 2-17) were also recruited from the John Radcliffe Hospital, Oxford.

2.2. Parental trios cohort (UK3)

A total of 90 pediatric IBD patients (60 CD and 30 UC/IBDU) were recruited alongside their trio families (child with IBD, mother, and father), totaling 270 participants, at Southampton Children’s Hospital, United Kingdom. The median age of the affected children was 12.5 years (range: 5-17) for CD and 11.9 (range: 2-17) for UC, with corresponding mean disease duration of 1.3 years for CD and 1.7 years for UC, as detailed in Supplementary Table.

2.3. Ethical approval

Ethical approval for these studies was obtained from the Southampton & South West Hampshire Research Ethics Committee (09/H0504/125), the Yorkshire & The Humber Sheffield Research Ethics Committee for the Oxford Gastrointestinal (GI) Cohort (21/YH/0206), and local research committees (REC 12/EE/0482 and REC 17/EE/0265).

2.4. Sample size estimation

There were 2 primary outcomes of this study: (1) further characterization and validation of the methylome in pediatric IBD using epigenome-wide analysis and (2) the definition of a methylation biomarker panel, which has the potential for clinical translation. Group sizes ranging from 30 to 60 would yield a statistical power of 0.8 at an effect size of 10%23 to replicate previous EWAS findings. For our targeted biomarker analysis, we need group sizes of 15 to achieve a statistical power of 0.8 at an effect size of 20% for p-value significance. We additionally performed post hoc analyses, which are more exploratory, including EWAS for IBD subtypes.

2.5. DNA extraction and genome-wide methylation analysis

Genomic DNA was extracted from peripheral blood samples collected in EDTA by the salting-out method for the Southampton samples, Qiagen Puregene Blood Core Kit C for Oxford samples, and DNA Blood Mini Kit for Cambridge samples. For methylation arrays, 500 ng of DNA was randomized onto a 96-well plate for bisulfite conversion and analyzed at UCL Genomics (London, UK) using the HumanMethylationEPIC platform (Illumina, San Diego, CA). A further 750 ng of genomic DNA was analyzed using Global Screening Arrays v3.0 (Illumina) at UCL Genomics.

2.6. Data processing

Idat files were processed using pipelines from minfi24 in R (R Foundation for Statistical Computing, Vienna, Austria). Two samples were removed due to incomplete bisulfite conversions. Failed probes (detection p value >0.01 in at least 50% of the samples) or probes found on the sex chromosomes were removed from the analysis. The raw data were background-adjusted and corrected for dye color bias, and quantile-normalized. Batch effects were adjusted for both slide and position using ComBat.25 Estimated cell proportions (CD4+, CD8+, T cells, natural killer [NK] cells, B cells, monocytes, and granulocytes) were computed using the Houseman algorithm.26 EWAS analysis of differential DNA methylation analysis was performed using limma,27 and models were adjusted for the first principal component of the cell type proportions, sex, age, and center. Unless stated otherwise, all multiple correction testing was adjusted using the Benjamini-Hochberg false discovery rate (FDR).

2.7. Gene ontology enrichment analysis and identification of cell type of origin

The function gometh within the R package missMethyl28 was used to test gene ontology (GO) enrichment for CpG sites with an adjusted p value of <0.001. The function corrects for the bias between the differing number of probes per gene present on the Illumina EPIC array and CpGs associated with multiple genes. The top 1000 differentially methylated probes (DMPs) were then used to identify the cell type of origin using eFORGE v2.0 (https://eforge.altiusinstitute.org/).29

2.8. MeQTL analysis

A sex check was performed using PLINK (v1.07) to identify and remove any sex mismatches. MeQTLs were estimated using Matrix-eQTL (v2.3), with a cis distance threshold of 1 Mb. We used a genome-wide approach of all single nucleotide polymorphisms (SNPs; n = 654 027) and all methylation sites (n = 773 449) with covariates of age, sex, and cell count proportions included in the model.

2.9. Epigenetic clock analysis

DNA methylation age was determined using 328 of the 353 probes in the Horvath epigenetic clock,30 implemented in the R package watermelon.31 Age acceleration was calculated as the difference between DNA methylation age and chronological age. Statistical differences between IBD and controls were computed using a general linear model. We determined the median age acceleration between disease groups from residuals obtained after regressing DNA methylation age from controls.

2.10. Correlation of IBD-associated methylation findings

We analyzed the correlation of alterations in methylation seen in IBD between the UK1 and UK2 inception cohorts for the methylation sites most strongly associated with pediatric IBD. We carried out a correlation analysis of the top 100 overlapping methylation sites’ beta values reported in the UK1 with the new UK2 inception cohort.

2.11. Biomarker validation and 4-probe-based diagnostic model

Differentially methylated sites in the discovery cohort with the highest statistical significance were selected for analysis as potential biomarkers.14 The top 4 nonredundant sites were taken forward as our 4-probe diagnostic logistic regression model, which was initially trained in the normalized beta values in the UK1 cohort14 and then tested in the UK2 inception cohort. A 2-dimensional t-distributed stochastic neighbor embedding (t-SNE) plot was created for a visual analogue of the data using the UK2 inception cohort. Publicly available and previous datasets were utilized for model validation (Supplementary Methods). Performance of the models was defined using the area under the curve (AUC), receiver operating characteristic (ROC) curve, accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV).

2.12. Assessment of intergeneration inheritance of IBD-associated methylation changes and imprinted genes

Correlations in methylation between father and child, mother and child, and mother and father were assessed for a panel of IBD-associated methylation sites, by deriving Spearman correlation coefficients.14 The package DMRcate32 was used to compute differentially methylated regions (DMRs), with significance determined by an FDR <0.05. We then further selected out DMRs within imprinted region using 2 databases: the geneimprint (http://www.geneimprint.com/) and Otago imprinting (http://igc.otago.ac.nz).33 We firstly compared DMRs in the UK2 inception cohort. We then carried out formal EWAS and DMR analyses in the parental trios cohort, by comparing levels of methylation in child-father and child-mother pairings.

2.13. Passive smoking exposure

Fisher’s exact test was used to assess differences between IBD subtypes and passive smoking exposure in both the parental trios cohort and UK2 inception subcohort. Parental smoking data were collected by parental and patient interviews at the time of recruitment into the UK3 cohort. Information on parental past and current smoking, including pack-year history, was obtained (Supplementary method).

3. Results

3.1. The circulating methylome in childhood-onset UC and CD

Epigenome-wide analysis within the UK2 inception cohort identified 384 CpG sites with significant evidence for altered methylation levels when comparing IBD with controls (Holm-corrected p < 0.05) (Figure 1A, Supplementary Table). The site showing the greatest evidence for differential methylation implicates the gene ZBTB16 with 3 CpG sites with >5% hypomethylation in IBD patients. Other genes showing strong evidence for differential methylation include PP4R3B (12% hypermethylation), Interferon Alpha and Beta Receptor Subunit 1 (IFNAR1; 9% hypomethylation), and Suppressor of Cytokine Signaling 2 (SOCS2; 6% hypermethylation). Overall, of the 100 most differentially methylated sites, there were 51 CpG sites with hypermethylation and 49 with hypomethylation. GO analysis of the 384 DMPs revealed 3 significantly enriched GO terms involving regulation of alpha-beta T cells (Supplementary Figure 1).

Epigenome-wide analysis of methylation differences associated with pediatric inflammatory bowel disease (IBD) (Holm-corrected < 0.05). A, Manhattan plot with labeled differentially methylated sites. B, Estimated cell type proportion differences between IBD and control. C, Correlation of the 100 most differentially methylated between Cohort 1 and the index cohort reported by Adams et al.14
Figure 1

Epigenome-wide analysis of methylation differences associated with pediatric inflammatory bowel disease (IBD) (Holm-corrected < 0.05). A, Manhattan plot with labeled differentially methylated sites. B, Estimated cell type proportion differences between IBD and control. C, Correlation of the 100 most differentially methylated between Cohort 1 and the index cohort reported by Adams et al.14

We next performed an EWAS for each of the IBD subtypes (CD and UC) compared with controls. For CD vs controls, a total of 157 CpG sites were differentially methylated (FDR < 0.05) (Figure 2A and Supplementary Table). For UC vs controls, a total of 107 CpG sites were differentially methylated (FDR p < 0.05) (Figure 2B and Supplementary Table). One CpG site found within LOC100131347 was found to be differentially methylated (FDR = 0.034) when comparing CD and UC directly (Supplementary Table).

Manhattan plot showing the top epigenome-wide association study sites. A, Crohn’s disease only vs controls with a total of 157 CpG sites differentially methylated (false discovery rate [FDR] < 0.05). B, Ulcerative colitis only vs controls with 107 CpG sites differentially methylated (FDR < 0.05).
Figure 2

Manhattan plot showing the top epigenome-wide association study sites. A, Crohn’s disease only vs controls with a total of 157 CpG sites differentially methylated (false discovery rate [FDR] < 0.05). B, Ulcerative colitis only vs controls with 107 CpG sites differentially methylated (FDR < 0.05).

There were no statistically significant DMPs associated with a more severe disease course/progression (Supplementary Table and Supplementary Figure 2 and 4).

3.2. Monocytes and NK cells are involved in the observed methylation changes in pediatric IBD patients

Active inflammation is known to affect the proportion of immune cell types in IBD. We assessed the estimated cell proportions derived from methylation profiling of the UK2 inception cohort. An increase in estimated cell type proportions was found in both granulocytes (p = 0.00137) and monocytes (p = 0.000261) between IBD patients and controls (Figure 1B), whereas a decrease in the estimated cell type proportions was observed in B cells (p = 0.0411), CD4 T cells (p = 0.0411), CD8 T cells (p = 0.0411), and NK cells (p = 0.00406) between patients and controls. The top 1000 significantly differentiated methylation CpG sites between IBD and controls were assessed for identification of cell type origin using eFORGEv2.0. This analysis revealed differentially methylated sites that colocalize with regulatory elements of specific tissue enrichments from primary monocytes (binomial p = 1.82 × 10−6) and NK cells (binomial p = 4.60 × 10−8) (Supplementary Figure 3; Supplementary Table).

3.3. Derivation and validation of a pediatric IBD diagnostic model

We compared the evidence for altered methylation for the 100 sites most strongly implicated by EWAS analysis in the 2 UK inception cohorts: UK1 and UK2 (Supplementary Table). A strong correlation was seen across the cohorts (R = 0.93, p < 2.2e-16) (Figure 1C).

The top 4 methylation sites within the genes RPS6KA2, VMP1, CFI, and ARHGEF3 were selected from the EWAS analysis in the UK1 cohort (Table 1).14

Table 1

Functional relevance of genes in the 4-site model.

Illumina probeChromosomeFeatureGeneIBD relevanceFunctional role
cg175012106BodyRPS6KA2Proximity to IBD GWAS SNP (rs1819333)Key roles include autophagy-associated mTOR pathway and proliferation34
cg1205445317BodyVMP1Coincides with primary transcription site for microRNA-21Regulator of the ER and the formation of autophagosomes35
cg043890583BodyARHGEF3Differential methylation previously reported14Roles include myeloid differentiation and binding to mTORC236
cg003821384First exonCFIDifferential methylation previously reported14Encodes for a serine proteinase which regulates the complement cascade bridging innate and adaptive immunity37
Illumina probeChromosomeFeatureGeneIBD relevanceFunctional role
cg175012106BodyRPS6KA2Proximity to IBD GWAS SNP (rs1819333)Key roles include autophagy-associated mTOR pathway and proliferation34
cg1205445317BodyVMP1Coincides with primary transcription site for microRNA-21Regulator of the ER and the formation of autophagosomes35
cg043890583BodyARHGEF3Differential methylation previously reported14Roles include myeloid differentiation and binding to mTORC236
cg003821384First exonCFIDifferential methylation previously reported14Encodes for a serine proteinase which regulates the complement cascade bridging innate and adaptive immunity37

Abbreviations: ER, endoplasmic reticulum; GWAS, genome-wide association studies; IBD, inflammatory bowel disease; mTOR, mammalian target of rapamycin; SNP, single nucleotide polymorphism.

Table 1

Functional relevance of genes in the 4-site model.

Illumina probeChromosomeFeatureGeneIBD relevanceFunctional role
cg175012106BodyRPS6KA2Proximity to IBD GWAS SNP (rs1819333)Key roles include autophagy-associated mTOR pathway and proliferation34
cg1205445317BodyVMP1Coincides with primary transcription site for microRNA-21Regulator of the ER and the formation of autophagosomes35
cg043890583BodyARHGEF3Differential methylation previously reported14Roles include myeloid differentiation and binding to mTORC236
cg003821384First exonCFIDifferential methylation previously reported14Encodes for a serine proteinase which regulates the complement cascade bridging innate and adaptive immunity37
Illumina probeChromosomeFeatureGeneIBD relevanceFunctional role
cg175012106BodyRPS6KA2Proximity to IBD GWAS SNP (rs1819333)Key roles include autophagy-associated mTOR pathway and proliferation34
cg1205445317BodyVMP1Coincides with primary transcription site for microRNA-21Regulator of the ER and the formation of autophagosomes35
cg043890583BodyARHGEF3Differential methylation previously reported14Roles include myeloid differentiation and binding to mTORC236
cg003821384First exonCFIDifferential methylation previously reported14Encodes for a serine proteinase which regulates the complement cascade bridging innate and adaptive immunity37

Abbreviations: ER, endoplasmic reticulum; GWAS, genome-wide association studies; IBD, inflammatory bowel disease; mTOR, mammalian target of rapamycin; SNP, single nucleotide polymorphism.

Of these 4 sites, the methylation sites within the RPS6KA2 and VMP1 loci (adjacent to RPS6KB1) contributed the most weight in the model, and the individual marker with the highest diagnostic accuracy was found to be RPS6KA2, with an accuracy of 0.85. Incorporating 4 probes into the model improved its performance. A logistic regression model consisting of methylation of these 4 probes was then tested in the UK2 cohort for a first validation; following ROC/AUC analyses, the model demonstrated a diagnostic accuracy with an AUC of 0.912 (95% confidence interval [CI], 0.86-0.96) (accuracy: 0.74; PPV: 0.98) (Figure 3A and B and Table 2).

Table 2

Modeling outcomes from the 4-probe methylation panel of each of the scenarios test (N).

Study design (N)AUC95% CIThresholdSensitivitySpecificityPPVNPVAccuracy
Training model: UK1 cohort CD (36), non-IBD controls (36)0.9460.901-0.9920.50.8280.8610.8520.8370.845
0.90.5140.9720.9470.6730.746
Testing model: UK2 cohort IBD (86), non-IBD controls (30)0.9120.861-0.9630.50.6860.9660.9830.5170.758
0.90.7550.8660.9420.5530.784
UK2 IBD (86), celiac (36)0.8820.809-0.9550.50.9180.6380.8580.7660.836
0.90.5340.9440.9580.4590.657
Celiac (36), controls (35)0.5570.444-0.7120.50.5500.5500.5500.5500.555
0.90.0001.000NA0.5000.500
UK2 IBD CRP-positive (36), non-IBD controls (30)1.001.00-1.000.51.0000.7660.8051.0000.881
0.90.9650.9660.9650.9660.966
UK2 IBD CRP-negative (49), non-IBD controls (30)0.9030.838-0.9680.50.8540.7660.8540.7760.820
0.90.5200.9660.9610.5570.692
RISK: baseline CD (164), non-IBD control (74)0.9340.901-0.9680.50.7910.9110.8140.8990.872
0.90.3610.9860.9280.7590.780
RISK: follow-up CD (164), non-IBD (74)0.6660.590-0.7460.50.1940.9440.6360.7010.694
0.90.0001.000NA0.6660.666
BIOM UK IBD adults (240), non-IBD controls (190)0.7480.685-0.8120.50.8780.4040.7320.6410.712
0.90.1470.9880.9580.3840.441
Rheumatoid arthritis (354), controls (337)0.6460.605-0.6870.50.6380.6050.6310.6130.622
0.901NA0.4860.486
Study design (N)AUC95% CIThresholdSensitivitySpecificityPPVNPVAccuracy
Training model: UK1 cohort CD (36), non-IBD controls (36)0.9460.901-0.9920.50.8280.8610.8520.8370.845
0.90.5140.9720.9470.6730.746
Testing model: UK2 cohort IBD (86), non-IBD controls (30)0.9120.861-0.9630.50.6860.9660.9830.5170.758
0.90.7550.8660.9420.5530.784
UK2 IBD (86), celiac (36)0.8820.809-0.9550.50.9180.6380.8580.7660.836
0.90.5340.9440.9580.4590.657
Celiac (36), controls (35)0.5570.444-0.7120.50.5500.5500.5500.5500.555
0.90.0001.000NA0.5000.500
UK2 IBD CRP-positive (36), non-IBD controls (30)1.001.00-1.000.51.0000.7660.8051.0000.881
0.90.9650.9660.9650.9660.966
UK2 IBD CRP-negative (49), non-IBD controls (30)0.9030.838-0.9680.50.8540.7660.8540.7760.820
0.90.5200.9660.9610.5570.692
RISK: baseline CD (164), non-IBD control (74)0.9340.901-0.9680.50.7910.9110.8140.8990.872
0.90.3610.9860.9280.7590.780
RISK: follow-up CD (164), non-IBD (74)0.6660.590-0.7460.50.1940.9440.6360.7010.694
0.90.0001.000NA0.6660.666
BIOM UK IBD adults (240), non-IBD controls (190)0.7480.685-0.8120.50.8780.4040.7320.6410.712
0.90.1470.9880.9580.3840.441
Rheumatoid arthritis (354), controls (337)0.6460.605-0.6870.50.6380.6050.6310.6130.622
0.901NA0.4860.486

Abbreviations: AUC, area under the curve; CI, confidence interval; CD, Crohn’s disease; CRP, C-reactive protein; IBD, inflammatory bowel disease; NA, not applicable; NPV, negative predictive value; PPV, positive predictive value.

Table 2

Modeling outcomes from the 4-probe methylation panel of each of the scenarios test (N).

Study design (N)AUC95% CIThresholdSensitivitySpecificityPPVNPVAccuracy
Training model: UK1 cohort CD (36), non-IBD controls (36)0.9460.901-0.9920.50.8280.8610.8520.8370.845
0.90.5140.9720.9470.6730.746
Testing model: UK2 cohort IBD (86), non-IBD controls (30)0.9120.861-0.9630.50.6860.9660.9830.5170.758
0.90.7550.8660.9420.5530.784
UK2 IBD (86), celiac (36)0.8820.809-0.9550.50.9180.6380.8580.7660.836
0.90.5340.9440.9580.4590.657
Celiac (36), controls (35)0.5570.444-0.7120.50.5500.5500.5500.5500.555
0.90.0001.000NA0.5000.500
UK2 IBD CRP-positive (36), non-IBD controls (30)1.001.00-1.000.51.0000.7660.8051.0000.881
0.90.9650.9660.9650.9660.966
UK2 IBD CRP-negative (49), non-IBD controls (30)0.9030.838-0.9680.50.8540.7660.8540.7760.820
0.90.5200.9660.9610.5570.692
RISK: baseline CD (164), non-IBD control (74)0.9340.901-0.9680.50.7910.9110.8140.8990.872
0.90.3610.9860.9280.7590.780
RISK: follow-up CD (164), non-IBD (74)0.6660.590-0.7460.50.1940.9440.6360.7010.694
0.90.0001.000NA0.6660.666
BIOM UK IBD adults (240), non-IBD controls (190)0.7480.685-0.8120.50.8780.4040.7320.6410.712
0.90.1470.9880.9580.3840.441
Rheumatoid arthritis (354), controls (337)0.6460.605-0.6870.50.6380.6050.6310.6130.622
0.901NA0.4860.486
Study design (N)AUC95% CIThresholdSensitivitySpecificityPPVNPVAccuracy
Training model: UK1 cohort CD (36), non-IBD controls (36)0.9460.901-0.9920.50.8280.8610.8520.8370.845
0.90.5140.9720.9470.6730.746
Testing model: UK2 cohort IBD (86), non-IBD controls (30)0.9120.861-0.9630.50.6860.9660.9830.5170.758
0.90.7550.8660.9420.5530.784
UK2 IBD (86), celiac (36)0.8820.809-0.9550.50.9180.6380.8580.7660.836
0.90.5340.9440.9580.4590.657
Celiac (36), controls (35)0.5570.444-0.7120.50.5500.5500.5500.5500.555
0.90.0001.000NA0.5000.500
UK2 IBD CRP-positive (36), non-IBD controls (30)1.001.00-1.000.51.0000.7660.8051.0000.881
0.90.9650.9660.9650.9660.966
UK2 IBD CRP-negative (49), non-IBD controls (30)0.9030.838-0.9680.50.8540.7660.8540.7760.820
0.90.5200.9660.9610.5570.692
RISK: baseline CD (164), non-IBD control (74)0.9340.901-0.9680.50.7910.9110.8140.8990.872
0.90.3610.9860.9280.7590.780
RISK: follow-up CD (164), non-IBD (74)0.6660.590-0.7460.50.1940.9440.6360.7010.694
0.90.0001.000NA0.6660.666
BIOM UK IBD adults (240), non-IBD controls (190)0.7480.685-0.8120.50.8780.4040.7320.6410.712
0.90.1470.9880.9580.3840.441
Rheumatoid arthritis (354), controls (337)0.6460.605-0.6870.50.6380.6050.6310.6130.622
0.901NA0.4860.486

Abbreviations: AUC, area under the curve; CI, confidence interval; CD, Crohn’s disease; CRP, C-reactive protein; IBD, inflammatory bowel disease; NA, not applicable; NPV, negative predictive value; PPV, positive predictive value.

A Modeling of the 4-probe methylation panel for diagnostic accuracy of pediatric inflammatory bowel disease (IBD). A, t-distributed stochastic neighbor embedding (t-SNE) plot of the 4 probes used in the diagnostic model from the UK inception cohort colored by disease type. B, Receiver operating characteristic (ROC) curve and area under the curve (AUC) from the logistic regression of the 4 probes trained in Scottish children and tested in Oxford and Cambridge IBD samples. C, ROC curve and AUC of the 4-probe model fitted to an externally validated CD cohort from North America (RISK cohort).38 D, ROC curve and AUC of the 4-probe model fitted to a subset population of the UK inception cohort who are C-reactive protein (CRP) positive compared to non-IBD controls. E, ROC curve and AUC of the 4-probe model fitted to a subset population of the UK inception cohort who are CRP-negative compared to non-IBD controls. F, ROC curve and AUC of the 4-probe model fitted to cohort of pediatric celiac patient’s vs non-celiac controls. G, ROC curve and AUC of the 4-probe model fitted to UK adult IBD cohort at diagnosis (IBD BIOM).15 H, ROC curve and AUC fitted to the RA dataset.
Figure 3

A Modeling of the 4-probe methylation panel for diagnostic accuracy of pediatric inflammatory bowel disease (IBD). A, t-distributed stochastic neighbor embedding (t-SNE) plot of the 4 probes used in the diagnostic model from the UK inception cohort colored by disease type. B, Receiver operating characteristic (ROC) curve and area under the curve (AUC) from the logistic regression of the 4 probes trained in Scottish children and tested in Oxford and Cambridge IBD samples. C, ROC curve and AUC of the 4-probe model fitted to an externally validated CD cohort from North America (RISK cohort).38 D, ROC curve and AUC of the 4-probe model fitted to a subset population of the UK inception cohort who are C-reactive protein (CRP) positive compared to non-IBD controls. E, ROC curve and AUC of the 4-probe model fitted to a subset population of the UK inception cohort who are CRP-negative compared to non-IBD controls. F, ROC curve and AUC of the 4-probe model fitted to cohort of pediatric celiac patient’s vs non-celiac controls. G, ROC curve and AUC of the 4-probe model fitted to UK adult IBD cohort at diagnosis (IBD BIOM).15 H, ROC curve and AUC fitted to the RA dataset.

To further validate our model, we separately compared children in the UK2 cohort who were CRP-positive (>5 mg/L; n = 32) and children with no CRP rise at presentation (denoted CRP-negative [<5 mg/L; n = 49]) against non-IBD controls. In the CRP-positive group, we found an AUC of 0.99 (95% CI, 0.99-1) (accuracy: 0.88; PPV: 0.89) (Figure 3D and Table 2). Within the CRP-negative group, an AUC of 0.90 was observed against controls (95% CI, 0.83-0.96) (accuracy: 0.82; PPV: 0.85) (Figure 3E and Table 2).

We then sought to externally validate our prediction model using the available methylation data at the baseline time point for the North American inception RISK cohort (164 children with CD and 74 controls34). In this cohort, we demonstrated an AUC of 0.93 (95% CI, 0.90-0.96) (accuracy: 0.87; PPV: 0.81) (Figure 3C and Table 2). At the second time point (1-3 years after diagnosis), accuracy was markedly reduced, with an AUC of 0.66 (95% CI, 0.59-0.74) (accuracy: 0.69; PPV: 0.63), demonstrating a rapid reduction in utility over a relatively short time.

To investigate the utility in adults at presentation, we analyzed a large adult IBD inception cohort15 for accuracy of diagnosis with our model. In this cohort, a more modest accuracy for diagnosis was noted—AUC 0.74 (95% CI, 0.68-0.81) (accuracy: 0.71; PPV: 0.73) (Figure 3G).

To confirm diagnostic specificity for IBD against other GI and inflammatory diseases, our prediction model was tested in cohorts of pediatric patients with celiac disease and non-celiac controls. The observed AUC in celiac disease was 0.55 (95% CI, 0.44-0.71) (accuracy: 0.55; PPV: 0.55) (Figure 3F and Table 2). Additionally, the model exhibited no discrimination between adult rheumatoid arthritis patients and controls, as evidenced by an AUC of 0.64 (95% CI, 0.60-0.68) (accuracy: 0.622; PPV: 0.63) (Figure 3H).

3.4. Epigenetic age acceleration at diagnosis in pediatric IBD cohorts

We assessed the correlation between chronological and epigenetic age using the Horvath clock in IBD vs controls in the UK2 inception cohort (Figure 4A). A strong positive correlation of chronological and epigenetic age was observed in both groups, with a higher degree exhibited in the control group (R = 0.78) compared to IBD patients (R = 0.53) (Figure 4A). Increased age acceleration was found in children with IBD compared to non-IBD children (p = 0.0011). Following regression analysis, the median age acceleration was found to be 3.53 years in IBD patients (Figure 4B). Subtype analysis showed an increased median age acceleration of 4.52 years in the CD group. A total of 20 children with IBD were identified as extreme outliers, with >10 years age acceleration at diagnosis. One child was noted to have derived age acceleration of 39 years; this individual escalated to biologics within 1 month of sampling. Advanced age acceleration was not correlated with clinical parameters, including location and behavior for CD and extent and severity for UC. Escalation to the need of biologic or surgery was also not associated with increased age acceleration.

Epigenetic aging in pediatric inflammatory bowel disease (IBD). A, The correlation between epigenetic age (computed via the Horvath clock) and chronological age of both controls and pediatric IBD patients from UK inception cohort. B, Age acceleration (years) between both controls and pediatric IBD patients from UK inception cohort following regression from controls.
Figure 4

Epigenetic aging in pediatric inflammatory bowel disease (IBD). A, The correlation between epigenetic age (computed via the Horvath clock) and chronological age of both controls and pediatric IBD patients from UK inception cohort. B, Age acceleration (years) between both controls and pediatric IBD patients from UK inception cohort following regression from controls.

3.5. Genetic variant-driven methylation differences defined by meQTL analysis

We compared the contribution of all cis-meQTLs (SNPs n = 654 027, methylation n = 773 449) between IBD and controls in the UK2 cohort. A total of 800 515 cis-meQTLs were found to be significant following FDR analysis. Across the genome, the majority of meQTLs reside within intergenic regions. Chromosome 6 contains a disproportionately high number of meQTLs (n = 120 309, 15.2%) compared to any other chromosomes (Figure 5A). On further analysis by subchromosomal location, a nonrandom distribution of meQTLs was observed with a disproportionate number residing between 6p22.1 and 6p21.33, which encompasses HLA-associated genes. Implicated loci include RNF39, HLA-A/B/E/F/G, LTA, and HLA-DQB1-DQB2-DQA1 (Figure 5B).

Methylation quantitative trait loci (MeQTL) findings in pediatric inflammatory bowel disease (IBD). A, Count data of the genome-wide meQTL plotted by chromosome and colored for CpG island location. B, The distribution of meQTLs at base pair location on chromosome 6 (6p22.1-p21.33) surrounding the HLA-associated genes.
Figure 5

Methylation quantitative trait loci (MeQTL) findings in pediatric inflammatory bowel disease (IBD). A, Count data of the genome-wide meQTL plotted by chromosome and colored for CpG island location. B, The distribution of meQTLs at base pair location on chromosome 6 (6p22.1-p21.33) surrounding the HLA-associated genes.

3.6. Investigation into intergeneration inheritance of IBD-associated methylation changes and imprinted genes

To address the issue as to whether methylation signatures associated with childhood-onset IBD are inherited, we examined statistical evidence of correlation of methylation levels in children with IBD and their parents. The 100 methylation sites, which were most strongly associated with IBD in both the UK1 and UK2 inception cohorts, were assessed for evidence of correlation in the parental trios cohort (UK3). None of the CpG sites tested were correlated between parents and children after multiple correction testing (Supplementary Table).

We extended our analysis by carrying out a 2-step process for investigating DMRs within regions of the genome with known a priori scientific evidence for intergenerational imprinting. We firstly assessed DMRs using the UK2 cohort as the discovery cohort; 11 DMRs were found to reside within imprinted regions of the genome (Supplementary Table). Within Guanine Nucleotide-Binding Subunit (GNAS), which encodes highly isoform-dependent products, 3 different DMRs were observed. The same analysis was then performed in data generated from UK2; 6 DMRs within imprinted regions were implicated in both cohorts (Table 3). Of the overlapping DMRs found between both cohorts, 2 are from paternal alleles, 2 from maternal alleles, and 2 regions are from isoform-dependent regions.

Table 3

Differential methylation regions associated with imprinting regions found within the UK inception cohort and Scotland Crohn’s disease cohort.

GeneImprintedExpressed alleleFisher UK inception cohortFisher Scotland cohort
GNAS20q13.3Isoform dependent1.56E-084.55E-40
PEG10, SGCE7q21-q22Paternal2.82E-116.32E-35
MEG314q32Maternal1.03E-151.41E-26
H1911p15.5Maternal1.20E-151.89E-22
MEST7q32Paternal2.27E-182.83E-21
GNAS20q13.3Isoform dependent4.53E-223.86E-11
GeneImprintedExpressed alleleFisher UK inception cohortFisher Scotland cohort
GNAS20q13.3Isoform dependent1.56E-084.55E-40
PEG10, SGCE7q21-q22Paternal2.82E-116.32E-35
MEG314q32Maternal1.03E-151.41E-26
H1911p15.5Maternal1.20E-151.89E-22
MEST7q32Paternal2.27E-182.83E-21
GNAS20q13.3Isoform dependent4.53E-223.86E-11
Table 3

Differential methylation regions associated with imprinting regions found within the UK inception cohort and Scotland Crohn’s disease cohort.

GeneImprintedExpressed alleleFisher UK inception cohortFisher Scotland cohort
GNAS20q13.3Isoform dependent1.56E-084.55E-40
PEG10, SGCE7q21-q22Paternal2.82E-116.32E-35
MEG314q32Maternal1.03E-151.41E-26
H1911p15.5Maternal1.20E-151.89E-22
MEST7q32Paternal2.27E-182.83E-21
GNAS20q13.3Isoform dependent4.53E-223.86E-11
GeneImprintedExpressed alleleFisher UK inception cohortFisher Scotland cohort
GNAS20q13.3Isoform dependent1.56E-084.55E-40
PEG10, SGCE7q21-q22Paternal2.82E-116.32E-35
MEG314q32Maternal1.03E-151.41E-26
H1911p15.5Maternal1.20E-151.89E-22
MEST7q32Paternal2.27E-182.83E-21
GNAS20q13.3Isoform dependent4.53E-223.86E-11

We further performed DMR analysis by comparing either father and child or mother and child pairs in the parental trios cohort. In total, evidence for imprinting of 3 DMRs was found between child and father; these had been previously implicated in the UK2 cohort (Supplementary Table) but were not found in the UK1 cohort. These regions were within the gene KCNQ1 (11p15.5), close to CD81, PLAGL1 (6q24-q25), and HOXA2-A3 (7p15-p14). No DMRs were observed between mother and child.

3.7. Parental smoking habit associated with UC but not CD

We assessed the influence of passive smoking exposure on IBD subtypes in the UK3 cohort of parent-child trios. We observed a difference in the proportion of children with UC who had been exposed to passive smoking compared to children with CD in the parental trio cohort. In total there were, two nonexposed children with UC vs 11 definitely exposed; 30 nonexposed CD children vs 15 definitely exposed; Fisher’s exact test between CD and UC (p = 0.001; 95% CI, 0.009-0.51; OR = 0.09) (Supplementary Table).

We sought to replicate this finding. A total of 44 of the patients or parents of the child provided data on smoking habit in the UK2 cohort (involving 17 children with CD and 14 UC). In children developing CD, 15 had no exposure to passive smoking vs 2 who had definite exposure; whereas for UC, 7 children had no exposure vs 7 who had definite exposure (p = 0.04387; 95% CI, 0.01-1.01; OR = 0.14) (Supplementary Table).

Fisher’s exact test for the combined group of children studied in the UK2 and UK3 cohorts confirmed that children developing UC were more likely to have had passive smoke exposure in early life compared to children with CD (p = 0.0008; 95% CI, 0.06-0.55; OR = 0.19) (Supplementary Table).

4. Discussion

This study provides further insights into the reproducibility, biological importance, and translational potential of epigenetic analyses in complex diseases. We provide evidence confirming the characteristic pattern of DNA methylation present at diagnosis in children with CD and UC, complementing and supporting previous studies in children with CD, and in North European adults with CD and UC. Together, these studies demonstrate a remarkable degree of replication of the circulating methylome in both adult-onset and childhood-onset disease, providing a catalyst to explore other immune-mediated diseases. Using these data, we have been able to derive and validate a simple 4-probe biomarker panel with high accuracy and specificity for both CD and UC in children. Additionally, the findings provide insights into pathogenic mechanisms complementary to those discovered by GWAS studies. Thus, we define the genetic contribution to shaping the epigenome, highlighting the contribution of the major histocompatibility complex (MHC); provide first evidence for paternal imprinting in IBD; and demonstrate the influence of aging, inflammation, and passive smoke exposure on the epigenome in disease.

The consistency observed between the UK inception cohorts for the most strongly associated sites demonstrating altered methylation is striking.14,34 The present dataset demonstrates more regions of altered methylation in CD than in UC, although only 1 marker is implicated in direct comparison of CD vs UC; and the top markers are strongly correlated in both diseases. Our findings in children are also highly consistent with the results from reported EWAS in adult IBD populations in Northern Europe, and the associated meta-analysis.15–17 In the present EWAS, the differential methylation at 3 CpG sites within the gene zinc finger and BTB domain containing 16 (ZBTB16) is of particular note. ZBTB16 has been linked to autophagy through the degradation of Atg14L36; zinc-binding groups also play a crucial role in interacting with histone deacetylase.37 Additional sites resided in SOCS2 and IFNAR1, both playing a role in modulating JAK-STAT signaling pathways. We confirm the dysregulation of methylation at CpG sites associated with the VMP1 and ZBTB16 loci, the most consistent sites in previous studies.

The predictive accuracy (0.91-0.93) of the 4-site-based diagnostic model derived in this study for children offers promising implications for clinical practice, with similar predictive power in both CD and UC. Notably, the methylation sites within RPS6KA2 and VMP1 (mir-21) carry substantial weight in the model. These sites are situated in genomic regions previously associated with IBD pathogenesis.15,38 The diagnostic accuracy derived is greater than that for other blood-based biomarkers and is similar to that reported for fecal calprotectin, currently the most commonly used biomarker in this context. The model appears to be specific for IBD in contrast to  celiac disease, which also involves intestinal inflammation, and rheumatoid disease, a systemic inflammatory condition. The high accuracy in children with no CRP elevation at diagnosis is particularly noteworthy in clinical practice and suggests that described biomarker model is not secondary to inflammation alone. Interestingly, our model performed less effectively in adult IBD populations than in children. This is perhaps explained by the well-accepted stochastic methylation changes that occur as a result of lifetime environmental exposures and the aging process per se.39 Further work is needed to explore these methylation patterns in other ethnicities and geographical populations as an early diagnostic and prognostic markers, to explore the predictive ability in other immune-mediated conditions and to look at the impact of IBD disease course and therapy over time on the biomarker, particularly with regards to exposure to both immunosuppressant therapies and cumulative inflammation. The model performs with high accuracy as a diagnostic tool at presentation; however, performance drops following a period of treatment after diagnosis in the RISK cohort. However, of particular note is the fact that the model performs accurately as a diagnostic test in children with low CRP levels at presentation and that the data demonstrate specificity for IBD when compared to other inflammatory diseases.

DNA methylation panels have recently emerged as molecular biomarkers for biological aging; several epigenetic clocks, which utilize methylation across cell types, predict a spectrum of ages from early in life to centenarians.40 Positive age acceleration is associated with adverse outcomes, such as cancer41 and cardiovascular disease,42 and with all-cause mortality.43 Furthermore, acute age acceleration has been found during pregnancy and been associated with heightened circulating markers of inflammation. In our cohorts, we found significant age acceleration in IBD patients compared to controls, most notably in CD patients. In some instances, extreme age acceleration of greater than 30 years was observed. These data are consistent with findings in adult IBD populations, where CD patients appear to show the greatest age acceleration.17 Recent data suggest reversibility can occur in some disease states.44,45 Further studies are now needed to identify whether either acute or chronic inflammation is the biggest driver of increased age acceleration in patients; to assess the reversibility of the observed aging with therapy; and, most importantly, to characterize the consequences of observed premature epigenetic aging in children with IBD.

Our targeted approach of cis-meQTL analysis confirms the importance of germline variation in the shaping of the methylome.14–16,46 Thus, we identified a high proportion of methylation differences in IBD patients independent of sex, age, and cell count proportions. Of particular interest, our genome-wide analysis highlights the nonrandom clustering of meQTLs residing within chromosome 6 (6p22.1-p21.33), strongly implicating the functional relevance of HLA-related genes in shaping the methylome in immune diseases, as previously reported in rheumatoid arthritis and multiple sclerosis.47 These findings support the hypothesis that the observed methylation-associated differences in children with IBD largely reflect the combined effects of the exposome, and active inflammation and germline variants acting as meQTLs.

Regions within the gene GNAS (20q13.2-3) were repeatedly found to be shared between IBD and control. GNAS is a complex gene locus, which gives rise to multiple gene products including NESP55 (maternally derived isoform) and XLas (paternally derived isoform). Genetic mutations in GNAS have previously been associated with both colorectal cancer48 and colitis-associated cancers.49 Downstream from GNAS at position 20q13.3.2 is the zinc finger protein 831 (ZNF831), which is also a well-described IBD genetic susceptibility locus.50 We found several methylation sites implicated at rs259964 within ZNF831 following our targeted meQTL analysis. Further investigation is required into understanding this locus to understand functional implications of isoform expression in IBD.

Exploring passive smoking exposure in pediatric IBD, we observed a strong bias towards the development of UC rather than CD in children exposed to parental smoking (p = 0.0008). Although these observations are discordant with some historical reports in the literature,51,52 our study has the strength not only that we involved 2 independent cohorts but also that we were able to confirm smoking status in the parents of the Southampton cohort (in which the effect size was greatest).

While our study contributes valuable insights into the circulating methylome of pediatric IBD, it is essential to acknowledge areas for further work. We have studied North European populations and extended the research using available data from North America, confirming the accuracy of the diagnostic biomarker we derived; however, these findings need to be explored in other diverse populations, especially in ethnic groups where the incidence has increased rapidly over the last generation. We have limited our studies to blood samples, rather than gut samples14–17; and direct comparison of circulating and gut-associated methylomes is valuable.53–56 The impact of these methylation findings in blood on changes at the tissue level needs to be evaluated. A level of consistency has been found between meQTL findings in blood and gut,46 and our whole blood findings also replicate 21 of these loci found in ileum samples. The cohort recruited for our diagnostic biomarker study did not allow direct comparison of the accuracy of the 4-probe panel against fecal calprotectin. Notwithstanding this, the accuracy we report here compares favorably against published literature for fecal calprotectin57 and suggests comparability to other blood-based markers. Further prospective investigation is needed to assess the accuracy formally before clinical translation in the context of a blood-based screening or diagnostic tool. Our findings highlight the need for key data to be collected rigorously regarding environmental exposures in the development of IBD.

In conclusion, these data provide a strong impetus for studies of the epigenome in IBD, and more widely in other complex immune-mediated diseases. The studies provide confidence that the methylation alterations may be detected in cohorts of well-characterized patients, using available technologies. The findings have implications for early clinical translation in biomarker discovery as well as for understanding pathogenesis, specifically the role of the epigenome in mediating gene-environmental interactions. Further work is needed to explore these findings in diverse populations and to define the pathophysiological effects of these alterations in disease.

Supplementary Data

Supplementary data are available online at ECCO-JCC online.

Funding

This study was funded by the Action Medical Research (AMR) (GN2880). HUU was supported by the Biomedical Research Centre, Oxford, The Leona M. and Harry B. Helmsley Charitable Trust, Medical Research Council (MRC) grant, and National Health Service (NHS). JS was funded by the National Institute for Health and Care Research (NIHR), The Leona M. and Harry B. Helmsley Charitable Trust, Crohn’s Colitis United Kingdom (CCUK), European Crohn’s Colitis Organisation (ECCO), and European Council (EC). JJA was funded by an NIHR advanced fellowship. RK was funded by CCUK, National Research Scotland Chief Scientist Office (NRS CSO), and Medical Research Council Clinical Academic Research Partnerships (MRC CARP) fellowships. The views expressed are those of the authors and do not necessarily represent those of the NHS, the NIHR, or the Department of Health.

Conflict of Interest

HHU has received research support or consultancy fees from Janssen, UCB Pharma, Falk, Eli Lilly, GSK, Celgene/BMS, and AbbVie. A patent application (N42837GB) has been filed for the 4-probe diagnostic biomarker discussed.

Acknowledgments

We thank the GI Biobanking Team at TGU, Oxford, for their help in collecting patient samples and DNA extractions. We also thank Shana-Lee Bownes and Jennifer Hollis for collecting patient data. Lastly, we thank all patients and their families for their participation in this study.

Author Contributions

AA, SE, and JS conceived the study. AN, KN, MK, and GC conducted the experimental work. GC, KM, AQ, FG, CJ, RK, NTV, MZ, HHU, RH, GLH, EE-O, DW, JJA, RMB, NMC, and SE collected the clinical samples and medical data. AN, AA, JN, RK, HHU, and JS performed the formal analysis. AN, AA, JN, GC, KM, AQ, FG, CJ, RK, NTV, NMC, RH, GLH, EE-O, MK, JJA, RMB, DW, MZ, SE, HHU, and JS contributed to the writing of the manuscript. AA and JS acquired funding for the study.

Data Availability

Data are available upon reasonable request.

References

1.

Jones
G-R
,
Lyons
M
,
Plevris
N
, et al. .
IBD prevalence in Lothian, Scotland, derived by capture–recapture methodology
.
Gut
2019
;
68
:
1953
60
.

2.

Coward
S
,
Benchimol
EI
,
Kuenzig
ME
, et al. .
The 2023 impact of inflammatory bowel disease in Canada: epidemiology of IBD
.
J Can Assoc Gastroenterol
2023
;
6
:
S9
S15
.

3.

Ng
SC
,
Shi
HY
,
Hamidi
N
, et al. .
Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies
.
Lancet
2017
;
390
:
2769
78
.

4.

Uhlig
HH
,
Schwerd
T
,
Koletzko
S
, et al. ;
COLORS in IBD Study Group and NEOPICS
.
The diagnostic approach to monogenic very early onset inflammatory bowel disease
.
Gastroenterology
2014
;
147
:
990
1007.e3
.

5.

Van Limbergen
J
,
Russell
RK
,
Drummond
HE
, et al. .
Definition of phenotypic characteristics of childhood-onset inflammatory bowel disease
.
Gastroenterology
2008
;
135
:
1114
22
.

6.

Ashton
JJ
,
Ennis
S
,
Beattie
RM.
Early-onset paediatric inflammatory bowel disease
.
Lancet Child Adolesc Health
2017
;
1
:
147
58
.

7.

Olén
O
,
Askling
J
,
Sachs
MC
, et al. .
Increased mortality of patients with childhood-onset inflammatory bowel diseases, compared with the general population
.
Gastroenterology
2019
;
156
:
614
22
.

8.

de Lange
KM
,
Moutsianas
L
,
Lee
JC
, et al. .
Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease
.
Nat Genet
2017
;
49
:
256
61
.

9.

Rakyan
VK
,
Down
TA
,
Balding
DJ
,
Beck
S.
Epigenome-wide association studies for common human diseases
.
Nat Rev Genet
2011
;
12
:
529
41
.

10.

Battram
T
,
Gaunt
TR
,
Relton
CL
,
Timpson
NJ
,
Hemani
G.
A comparison of the genes and genesets identified by GWAS and EWAS of fifteen complex traits
.
Nat Commun
2022
;
13
:
7816
.

11.

Zhang
H
,
Kalla
R
,
Chen
J
, et al. .
Altered DNA methylation within DNMT3A, AHRR, LTA/TNF loci mediates the effect of smoking on inflammatory bowel disease
.
Nat Commun
2024
;
15
:
595
.

12.

Mo
S
,
Dai
W
,
Wang
H
, et al. .
Early detection and prognosis prediction for colorectal cancer by circulating tumour DNA methylation haplotypes: a multicentre cohort study
.
eClinicalMedicine
2023
;
55
:
101717
.

13.

Hansson
O
,
Blennow
K
,
Zetterberg
H
,
Dage
J.
Blood biomarkers for Alzheimer’s disease in clinical practice and trials
.
Nat Aging
2023
;
3
:
506
19
.

14.

Adams
AT
,
Kennedy
NA
,
Hansen
R
, et al. .
Two-stage genome-wide methylation profiling in childhood-onset Crohn’s disease implicates epigenetic alterations at the VMP1/MIR21 and HLA loci
.
Inflamm Bowel Dis
2014
;
20
:
1784
93
.

15.

Ventham
NT
,
Kennedy
NA
,
Adams
AT
, et al. ;
IBD BIOM Consortium
.
Integrative epigenome-wide analysis demonstrates that DNA methylation may mediate genetic risk in inflammatory bowel disease
.
Nat Commun
2016
;
7
:
13507
.

16.

Kalla
R
,
Adams
AT
,
Nowak
JK
, et al. .
Analysis of systemic epigenetic alterations in inflammatory bowel disease: defining geographical, genetic and immune-inflammatory influences on the circulating methylome
.
J Crohns Colitis
2022
;
17
:
170
84
.

17.

Ventham
NT
,
Kennedy
NA
,
Kalla
R
, et al. ;
TOPPIC Study Group
.
Genome-wide methylation profiling in 229 patients with Crohn’s disease requiring intestinal resection: epigenetic analysis of the Trial of Prevention of Post-operative Crohn’s disease (TOPPIC)
.
Cell Mol Gastroenterol Hepatol
2023
;
16
:
431
50
.

18.

Joustra
V
,
Hageman
IL
,
Satsangi
J
, et al. .
Systematic review and meta-analysis of peripheral blood DNA methylation studies in inflammatory bowel disease
.
J Crohns Colitis
2023
;
17
:
185
98
.

19.

Noble
AJ
,
Nowak
JK
,
Adams
AT
,
Uhlig
HH
,
Satsangi
J.
Defining interactions between the genome, epigenome, and the environment in inflammatory bowel disease: progress and prospects
.
Gastroenterology
2023
;
165
:
44
60.e2
.

20.

Vieujean
S
,
Caron
B
,
Haghnejad
V
, et al. .
Impact of the exposome on the epigenome in inflammatory bowel disease patients and animal models
.
Int J Mol Sci
2022
;
23
:
7611
.

21.

Sun
Y
,
Yuan
S
,
Chen
X
, et al. .
The contribution of genetic risk and lifestyle factors in the development of adult-onset inflammatory bowel disease: a prospective cohort study
.
Am J Gastroenterol
2023
;
118
:
511
.

22.

Levine
A
,
Koletzko
S
,
Turner
D
, et al. ;
European Society of Pediatric Gastroenterology, Hepatology, and Nutrition
.
ESPGHAN revised Porto criteria for the diagnosis of inflammatory bowel disease in children and adolescents
.
J Pediatr Gastroenterol Nutr
2014
;
58
:
795
806
.

23.

Tsai
PC
,
Bell
JT.
Power and sample size estimation for epigenome-wide association scans to detect differential DNA methylation
.
Int J Epidemiol
2015
;
44
:
1429
41
.

24.

Aryee
MJ
,
Jaffe
AE
,
Corrada-Bravo
H
, et al. .
Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays
.
Bioinformatics
2014
;
30
:
1363
9
.

25.

Tian
Y
,
Morris
TJ
,
Webster
AP
, et al. .
ChAMP: updated methylation analysis pipeline for Illumina BeadChips
.
Bioinformatics
2017
;
33
:
3982
4
.

26.

Houseman
EA
,
Accomando
WP
,
Koestler
DC
, et al. .
DNA methylation arrays as surrogate measures of cell mixture distribution
.
BMC Bioinf
2012
;
13
:
86
.

27.

Ritchie
ME
,
Phipson
B
,
Wu
D
, et al. .
limma powers differential expression analyses for RNA-sequencing and microarray studies
.
Nucleic Acids Res
2015
;
43
:
e47
e47
.

28.

Phipson
B
,
Maksimovic
J
,
Oshlack
A.
An R package for analyzing data from Illumina’s HumanMethylation450 platform
.
Bioinformatics
2016
;
32
:
286
8
.

29.

Breeze Charles
E
,
Paul Dirk
S
,
van Dongen
J
, et al. .
eFORGE: a tool for identifying cell type-specific signal in epigenomic data
.
Cell Rep
2016
;
17
:
2137
50
.

30.

Horvath
S.
DNA methylation age of human tissues and cell types
.
Genome Biol
2013
;
14
:
R115
.

31.

Pidsley
R
,
Wong
CCY
,
Volta
M
,
Lunnon
K
,
Mill
J
,
Schalkwyk
LC.
A data-driven approach to preprocessing Illumina 450K methylation array data
.
BMC Genomics
2013
;
14
:
293
.

32.

Peters
TJ
,
Buckley
MJ
,
Statham
AL
, et al. .
De novo identification of differentially methylated regions in the human genome
.
Epigenetics Chromatin
2015
;
8
:
6
.

33.

Glaser
RL
,
Ramsay
JP
,
Morison
IM.
The imprinted gene and parent-of-origin effect database now includes parental origin of de novo mutations
.
Nucleic Acids Res
2006
;
34
:
D29
31
.

34.

Somineni
HK
,
Venkateswaran
S
,
Kilaru
V
, et al. .
Blood-derived DNA methylation signatures of Crohn’s disease and severity of intestinal inflammation
.
Gastroenterology
2019
;
156
:
2254
65.e3
.

35.

Wang
P
,
Chen
X
,
Wang
Y
, et al. ,
Essential role for autophagy protein VMP1 in maintaining neuronal homeostasis and preventing axonal degeneration
.
Cell Death Dis
2021
;
12
:
116
.

36.

Zhang
T
,
Dong
K
,
Liang
W
, et al. .
G-protein-coupled receptors regulate autophagy by ZBTB16-mediated ubiquitination and proteasomal degradation of Atg14L
.
eLife
2015
;
4
:
e06734
.

37.

Zhang
L
,
Zhang
J
,
Jiang
Q
,
Zhang
L
,
Song
W.
Zinc binding groups for histone deacetylase inhibitors
.
J Enzyme Inhib Med Chem
2018
;
33
:
714
21
.

38.

Yan
H
,
Zhang
X
,
Xu
Y.
Aberrant expression of miR-21 in patients with inflammatory bowel disease: a protocol for systematic review and meta analysis
.
Medicine (Baltim)
2020
;
99
:
e19693
.

39.

Fraga
MF
,
Ballestar
E
,
Paz
MF
, et al. .
Epigenetic differences arise during the lifetime of monozygotic twins
.
Proc Natl Acad Sci USA
2005
;
102
:
10604
9
.

40.

Horvath
S
,
Raj
K.
DNA methylation-based biomarkers and the epigenetic clock theory of ageing
.
Nat Rev Genet
2018
;
19
:
371
84
.

41.

Morales Berstein
F
,
McCartney
DL
,
Lu
AT
, et al. .
Assessing the causal role of epigenetic clocks in the development of multiple cancers: a Mendelian randomization study
.
eLife
2022
;
11
:
e75374
.

42.

Perna
L
,
Zhang
Y
,
Mons
U
,
Holleczek
B
,
Saum
K-U
,
Brenner
H.
Epigenetic age acceleration predicts cancer, cardiovascular, and all-cause mortality in a German case cohort
.
Clin Epigenet
2016
;
8
:
64
.

43.

Chen
BH
,
Marioni
RE
,
Colicino
E
, et al. .
DNA methylation-based measures of biological age: meta-analysis predicting time to death
.
Aging (Milano)
2016
;
8
:
1844
65
.

44.

Poganik
JR
,
Zhang
B
,
Baht
GS
, et al. .
Biological age is increased by stress and restored upon recovery
.
Cell Metab
2023
;
35
:
807
20.e5
.

45.

Fahy
GM
,
Brooke
RT
,
Watson
JP
, et al. .
Reversal of epigenetic aging and immunosenescent trends in humans
.
Aging Cell
2019
;
18
:
e13028
.

46.

Venkateswaran
S
,
Somineni
HK
,
Kilaru
V
, et al. .
Methylation quantitative trait loci are largely consistent across disease states in Crohn’s disease
.
G3 (Bethesda)
2022
;
12
:
jkac041
.

47.

Liu
Y
,
Aryee
MJ
,
Padyukov
L
, et al. .
Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis
.
Nat Biotechnol
2013
;
31
:
142
7
.

48.

Lu
Y
,
Kweon
S-S
,
Tanikawa
C
, et al. .
Large-scale genome-wide association study of East Asians identifies loci associated with risk for colorectal cancer
.
Gastroenterology
2019
;
156
:
1455
66
.

49.

Yaeger
R
,
Shah
MA
,
Miller
VA
, et al. .
Genomic alterations observed in colitis-associated cancers are distinct from those found in sporadic colorectal cancers and vary by type of inflammatory bowel disease
.
Gastroenterology
2016
;
151
:
278
87.e6
.

50.

Jostins
L
,
Ripke
S
,
Weersma
RK
, et al. ;
International IBD Genetics Consortium (IIBDGC)
.
Host–microbe interactions have shaped the genetic architecture of inflammatory bowel disease
.
Nature
2012
;
491
:
119
24
.

51.

Jones
DT
,
Osterman
MT
,
Bewtra
M
,
Lewis
JD.
Passive smoking and inflammatory bowel disease: a meta-analysis
.
Am J Gastroenterol
2008
;
103
:
2382
93
.

52.

Agrawal
M
,
Sabino
J
,
Frias-Gomes
C
, et al. .
Early life exposures and the risk of inflammatory bowel disease: systematic review and meta-analyses
.
eClinicalMedicine
2021
;
36
:
100884
.

53.

Venkateswaran
S
,
Somineni
HK
,
Matthews
JD
, et al. .
Longitudinal DNA methylation profiling of the rectal mucosa identifies cell-specific signatures of disease status, severity and clinical outcomes in ulcerative colitis cell-specific DNA methylation signatures of UC
.
Clin Epigenet
2023
;
15
:
50
.

54.

Howell
KJ
,
Kraiczy
J
,
Nayak
KM
, et al. .
DNA methylation and transcription patterns in intestinal epithelial cells from pediatric patients with inflammatory bowel diseases differentiate disease subtypes and associate with outcome
.
Gastroenterology
2018
;
154
:
585
98
.

55.

Edgar
RD
,
Perrone
F
,
Foster
AR
, et al. .
Culture-associated DNA methylation changes impact on cellular function of human intestinal organoids
.
Cell Mol Gastroenterol Hepatol
2022
;
14
:
1295
310
.

56.

Kraiczy
J
,
Nayak
KM
,
Howell
KJ
, et al. .
DNA methylation defines regional identity of human intestinal epithelial organoids and undergoes dynamic changes during development
.
Gut
2019
;
68
:
49
61
.

57.

Rheenen
PF
,
Vijver
EV
,
Fidler
V.
Faecal calprotectin for screening of patients with suspected inflammatory bowel disease: diagnostic meta-analysis
.
BMJ
2010
;
341
:
c3369
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected]