A Transcriptome-Wide Association Study Identifies Novel Candidate Susceptibility Genes for Pancreatic Cancer

Table 1.

Statistically significant expression–trait associations for genes at loci not previously identified by pancreatic cancer GWAS

Region	Gene name	Lead GWAS variant (±1 Mb)^†	GWAS P^†	Approach	Training tissue	R²^‡	TWAS Z^§	TWAS P^¶	TWAS P after conditioning on lead GWAS variant^#
1p36.12	CELA3B*	rs61132601	2.27 × 10⁻⁷	FUSION	GTEx pancreas	0.06	−3.98	6.89 × 10⁻⁵	.08
				FUSION	Combined pancreas	0.04	−4.62	3.80 × 10⁻⁶*	.03
				MetaXcan	GTEx pancreas	0.05	−4.43	9.38 × 10⁻⁶*	.21
				MetaXcan	Combined pancreas	0.05	−4.29	1.83 × 10⁻⁵	.03
9q31.1	SMC2*	rs147699343	8.77 × 10⁻⁸	FUSION	Combined pancreas	0.04	4.95	7.52 × 10⁻⁷*	.08
				MetaXcan	Combined pancreas	0.02	4.93	8.19 × 10⁻⁷*	.06
				SMulTiXcan	Cross-tissue	0.02–0.61	−3.34 to 5.35	8.50 × 10⁻⁶	.66
9q31.1	SMC2-AS1	rs147699343	8.70 × 10⁻⁸	SMulTiXcan	Cross-tissue	0.04–0.18	−4.9 to 4.8	1.39 × 10⁻⁵	.61
10q23.31	RP11-80H5.9	rs7083351	5.22 × 10⁻⁵	SMulTiXcan	Cross-tissue	0.02–0.20	−2.21 to 4.4	8.23 × 10⁻⁶	.04
12q13.13	SMUG1	rs4759336	1.39 × 10⁻⁴	FUSION	GTEx pancreas	0.28	−4.04	5.40 × 10⁻⁵	.06
14q32.33	BTBD6	rs10638535	2.73 × 10⁻⁵	FUSION	GTEx pancreas	0.07	4.06	4.98 × 10⁻⁵	.94
				FUSION	Combined pancreas	0.05	4.00	6.30 × 10⁻⁵	.73
15q23	HEXA	rs11636684	2.35 × 10⁻⁵	FUSION	Combined pancreas	0.02	−4.02	5.68 × 10⁻⁵	5.31 × 10⁻³
15q26.1	RCCD1	rs8028409	3.77 × 10⁻⁵	FUSION	LTG pancreas	0.44	−3.98	6.94 × 10⁻⁵	.87
				FUSION	GTEx pancreas	0.28	−4.04	5.38 × 10⁻⁵	.86
				FUSION	Combined pancreas	0.37	−3.99	6.52 × 10⁻⁵	.95
17q12	PNMT*	rs12951693	6.17 × 10⁻⁷	FUSION	LTG pancreas	0.02	4.86	1.20 × 10⁻⁶*	4.01 × 10⁻⁵
17q12	CDK12	rs12951693	6.17 × 10⁻⁷	FUSION	GTEx pancreas	0.02	−4.05	5.15 × 10⁻⁵	1.37 × 10⁻³
17q12	PGAP3	rs12951693	6.17 × 10⁻⁷	FUSION	LTG pancreas	0.10	3.91	9.11 × 10⁻⁵	1.44 × 10⁻³
				FUSION	GTEx pancreas	0.25	3.98	6.96 × 10⁻⁵	2.16 × 10⁻⁴
				MetaXcan	GTEx pancreas	0.24	4.11	3.03 × 10⁻⁵	1.03 × 10⁻⁴
				MetaXcan	Combined pancreas	0.18	4.17	2.98 × 10⁻⁵	1.04 × 10⁻⁴
17q22	SUPT4H1	rs6503868	2.15 × 10⁻⁵	FUSION	GTEx pancreas	0.08	4.12	3.72 × 10⁻⁵	3.32 × 10⁻³
				MetaXcan	GTEx pancreas	0.07	4.11	3.90 × 10⁻⁵	6.50 × 10⁻³
18q11.22	RP11-888D10.3	rs28637808	1.30 × 10⁻⁵	MetaXcan	GTEx pancreas	0.09	−4.07	4.67 × 10⁻⁵	.07
19p13.11	PGPEP1	rs12985909	3.48 × 10⁻⁵	MetaXcan	Combined pancreas	0.06	−4.13	3.67 × 10⁻⁵	.85

* Genes and corresponding TWAS P values that are statistically significant after Bonferroni correction for multiple testing in each of the analyses. GTEx = Genotype-Tissue Expression; GWAS = genome-wide association studies; LTG = Laboratory of Translational Genomics; SMulTiXcan = Summary-MulTiXcan; TWAS = transcriptome-wide association study.

† The lead GWAS variant and GWAS P value indicate the most statistically significant GWAS variant within ±1 Mb of each gene listed.

‡ R²: model prediction performance.

§ TWAS Z: effect size and direction. Effect sizes for SMulTiXcan results in individual tissues are shown in Supplementary Figure 7 (available online).

¶ TWAS P: P value from the TWAS for genes that passed the false discovery rate–corrected P value threshold of ≤ .05 in each of the analyses.

# TWAS P values after conditioning on the lead GWAS variant within ±1 Mb of each gene are shown in the last column.
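As a reading aid for the TWAS Z and TWAS P columns, the short sketch below converts a Z score into a two-sided P value. It assumes the reported P values come from a two-sided test against a standard normal null, which the table footnotes do not state explicitly; the function name `two_sided_p` is illustrative only. Small discrepancies from the reported values are expected because the Z scores in the table are rounded to two decimals.

```python
import math


def two_sided_p(z: float) -> float:
    """Two-sided P value for a Z score, assuming a standard normal null."""
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)) for the standard normal CDF Phi.
    return math.erfc(abs(z) / math.sqrt(2.0))


# (label, TWAS Z, reported TWAS P) copied from FUSION rows of Table 1.
rows = [
    ("CELA3B, GTEx pancreas", -3.98, 6.89e-5),
    ("SMC2, Combined pancreas", 4.95, 7.52e-7),
    ("PNMT, LTG pancreas", 4.86, 1.20e-6),
]

for label, z, reported in rows:
    computed = two_sided_p(z)
    print(f"{label}: Z = {z:+.2f}, computed P = {computed:.2e}, reported P = {reported:.2e}")
```

The sign of Z carries the direction of association (negative: lower predicted expression associated with higher risk), whereas the P value depends only on its magnitude.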

Table 2.

Statistically significant expression–trait associations for genes at known pancreatic cancer risk loci previously identified by GWAS

Region	Gene name	Lead GWAS variant (±1 Mb)^†	GWAS P^†	Approach	Training tissue	R²^‡	TWAS Z^§	TWAS P^¶	TWAS P after conditioning on lead GWAS variant^#
5p15.33	TERT*	rs31490	1.28 × 10⁻¹⁷	SMulTiXcan	Cross-tissue	0.05–0.11	−8.24 to 4.20	5.80 × 10⁻¹⁸*	3.37 × 10⁻⁴
5p15.33	CLPTM1L*	rs31490	1.28 × 10⁻¹⁷	SMulTiXcan	Cross-tissue	0.02–0.06	−8.33 to 0.91	1.48 × 10⁻¹⁶*	6.10 × 10⁻⁴
5p15.33	ZDHHC11B	rs31490	1.28 × 10⁻¹⁷	SMulTiXcan	Cross-tissue	0.02–0.19	−1.16 to 3.13	3.18 × 10⁻⁶	1.56 × 10⁻⁵
7p14.1	INHBA*	rs12701838	3.59 × 10⁻⁹	MetaXcan	GTEx pancreas	0.04	−5.11	3.20 × 10⁻⁷*	.81
				SMulTiXcan	Cross-tissue	0.04–0.34	−5.11 to −0.72	4.10 × 10⁻⁶	.02
9q34.2	ABO*	rs687621	2.37 × 10⁻²⁷	FUSION	LTG pancreas	0.37	9.38	6.71 × 10⁻²¹*	.49
				FUSION	GTEx pancreas	0.56	6.96	3.44 × 10⁻¹²*	.21
				FUSION	Combined pancreas	0.50	7.55	4.34 × 10⁻¹⁴*	.23
				MetaXcan	LTG pancreas	0.30	10.72	8.07 × 10⁻²⁷*	.98
				MetaXcan	GTEx pancreas	0.55	7.08	1.41 × 10⁻¹²*	.08
				MetaXcan	Combined pancreas	0.49	7.65	2.05 × 10⁻¹⁴*	.07
13q12.2	PDX1*	rs2297316	4.43 × 10⁻¹³	MetaXcan	GTEx pancreas	0.05	−7.18	6.85 × 10⁻¹³*	.64
				SMulTiXcan	Cross-tissue	0.03–0.05	−7.18 to −6.59	4.87 × 10⁻¹²*	.45
13q22.1	KLF5*	rs9573166	1.51 × 10⁻²⁵	FUSION	GTEx pancreas	0.05	4.91	9.17 × 10⁻⁷*	4.91 × 10⁻⁴
				FUSION	Combined pancreas	0.03	5.92	3.15 × 10⁻⁹*	.02
16q23.1	WDR59*	rs72802357	1.32 × 10⁻¹⁶	FUSION	Combined pancreas	0.01	−4.70	2.54 × 10⁻⁶*	3.42 × 10⁻³
16q23.1	CFDP1*	rs72802357	1.32 × 10⁻¹⁶	FUSION	LTG pancreas	0.03	6.07	1.26 × 10⁻⁹*	.06
				FUSION	Combined pancreas	0.12	5.76	8.47 × 10⁻⁹*	.03
				MetaXcan	Combined pancreas	0.17	5.58	2.40 × 10⁻⁸*	.05
				SMulTiXcan	Cross-tissue	0.03–0.20	2.50 to 6.89	2.02 × 10⁻⁸*	.12
16q23.1	BCAR1*	rs72802357	1.32 × 10⁻¹⁶	SMulTiXcan	Cross-tissue	0.02–0.21	−5.60 to 6.49	1.94 × 10⁻⁷*	.22
16q23.1	TMEM170A	rs72802357	1.32 × 10⁻¹⁶	SMulTiXcan	Cross-tissue	0.02–0.22	−3.69 to 2.86	1.21 × 10⁻⁵	.41

* Genes and corresponding TWAS P values that are statistically significant after Bonferroni correction for multiple testing in each of the analyses. GTEx = Genotype-Tissue Expression; GWAS = genome-wide association studies; LTG = Laboratory of Translational Genomics; SMulTiXcan = Summary-MulTiXcan; TWAS = transcriptome-wide association study.

† The lead GWAS variant and GWAS P value indicate the most statistically significant GWAS variant within ±1 Mb of each gene listed.

‡ R²: model prediction performance.

§ TWAS Z: effect size and direction. Effect sizes for SMulTiXcan results in individual tissues are shown in Supplementary Figure 7 (available online).

¶ TWAS P: P value from the TWAS for genes that passed the false discovery rate–corrected P value threshold of ≤ .05 in each of the analyses.

# TWAS P values after conditioning on the lead GWAS variant within ±1 Mb of each gene are shown in the last column.

Using pancreatic tissue models, we performed conditional tests at two loci containing more than one TWAS gene to determine whether the genes represented conditionally independent signals. At chr17q12, three adjacent genes (Table 1; Figure 2A) were identified by TWAS: PNMT, CDK12, and PGAP3. The TWAS signals for PNMT and PGAP3 dropped substantially after conditioning the analysis on predicted CDK12 expression in the GTEx pancreas dataset (PNMT TWAS P value changed from 5.10 × 10⁻⁴ to .53 and PGAP3 TWAS P value from 6.96 × 10⁻⁵ to .09). The GWAS signal at this locus also dropped markedly after conditioning on predicted expression of CDK12 (Figure 2A), indicating that CDK12 may explain a large part of the signal at this locus. The gene expression correlation for the three genes was low (CDK12 and PNMT, Pearson r = 0.09 in both LTG and GTEx) to moderate (PNMT and PGAP3, Pearson r = 0.33 and r = 0.27; CDK12 and PGAP3, Pearson r = 0.66 and r = 0.29 in the LTG and GTEx pancreas datasets, respectively) (Supplementary Table 2, available online). In contrast, the associations with PDAC risk for two adjacent genes at chr16q23.1 (Table 2; Figure 2B), WDR59 and CFDP1, appeared largely independent (TWAS P value changed from 2.54 × 10⁻⁶ to 5.60 × 10⁻⁴ for WDR59 and from 8.47 × 10⁻⁹ to 1.70 × 10⁻⁶ for CFDP1 in a joint analysis in the combined LTG + GTEx pancreatic dataset). The GWAS signal at this locus dropped dramatically after conditioning on predicted expression of WDR59 and CFDP1, indicating that genetically predicted expression of the two genes together explains most of the signal at this locus (Figure 2B). The expression of these two genes was moderately to strongly correlated in the two datasets (Pearson r = 0.52–0.80) (Supplementary Table 2, available online).

To determine whether the associations between predicted gene expression and PDAC risk were independent of the lead GWAS-identified variants at each locus, we performed conditional analyses adjusting for the most statistically significant risk variants within ±1 Mb of TWAS-identified genes in the PanScan and PanC4 GWAS datasets. Among the 25 TWAS-identified genes, the associations for three genes at novel loci (PNMT, CDK12, and PGAP3) and four genes at known loci (TERT, CLPTM1L, ZDHHC11B, and KLF5) remained statistically significant at the Bonferroni-corrected P value threshold (P < .05/25 genes, ie, P < .002; Tables 1 and 2), indicating that these genes may be associated with PDAC risk independently of the GWAS-identified lead risk variants. Interestingly, at chr5p15.33, substantial loss of the TWAS signals for both TERT and CLPTM1L was seen after conditioning on three of the four GWAS-identified variants that mark independent pancreatic cancer risk signals at this locus (Table 3; Supplementary Table 3, available online), indicating that the underlying biology at this locus may involve both genes.
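The Bonferroni filter described above can be reproduced from the last column of Tables 1 and 2. The sketch below is only a bookkeeping check, not the authors' analysis code. It assumes that a gene counts as remaining significant when at least one of its models has a conditional P value below the threshold; the text does not spell out this rule, but it recovers the seven genes named in the paragraph.

```python
# Conditional TWAS P values per gene, copied from the last column of
# Tables 1 and 2 (one value per approach/training-tissue row).
conditional_p = {
    "CELA3B": [0.08, 0.03, 0.21, 0.03],
    "SMC2": [0.08, 0.06, 0.66],
    "SMC2-AS1": [0.61],
    "RP11-80H5.9": [0.04],
    "SMUG1": [0.06],
    "BTBD6": [0.94, 0.73],
    "HEXA": [5.31e-3],
    "RCCD1": [0.87, 0.86, 0.95],
    "PNMT": [4.01e-5],
    "CDK12": [1.37e-3],
    "PGAP3": [1.44e-3, 2.16e-4, 1.03e-4, 1.04e-4],
    "SUPT4H1": [3.32e-3, 6.50e-3],
    "RP11-888D10.3": [0.07],
    "PGPEP1": [0.85],
    "TERT": [3.37e-4],
    "CLPTM1L": [6.10e-4],
    "ZDHHC11B": [1.56e-5],
    "INHBA": [0.81, 0.02],
    "ABO": [0.49, 0.21, 0.23, 0.98, 0.08, 0.07],
    "PDX1": [0.64, 0.45],
    "KLF5": [4.91e-4, 0.02],
    "WDR59": [3.42e-3],
    "CFDP1": [0.06, 0.03, 0.05, 0.12],
    "BCAR1": [0.22],
    "TMEM170A": [0.41],
}

# Bonferroni threshold for 25 genes: .05 / 25 = .002.
threshold = 0.05 / len(conditional_p)

# Assumption: a gene "remains significant" if any of its models passes.
survivors = sorted(g for g, ps in conditional_p.items() if min(ps) < threshold)

print(f"threshold = {threshold:.3f}")
print(f"{len(survivors)} genes remain: {', '.join(survivors)}")
```

Genes such as HEXA, SUPT4H1, and WDR59 have conditional P values between .002 and .01 and therefore fall just short of the corrected threshold.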

Table 3.

TWAS results on chr5p15.33 before and after conditional analyses for SNPs that mark independent GWAS risk signals on chr5p15.33

Gene name	TWAS P* (unconditioned)	GWAS conditioned on rs31490	GWAS conditioned on rs2736098	GWAS conditioned on rs36115365	GWAS conditioned on rs35226131
TERT	5.80 × 10⁻¹⁸	3.37 × 10⁻⁴	7.48 × 10⁻⁹	9.86 × 10⁻⁶	3.88 × 10⁻¹⁶
CLPTM1L	1.48 × 10⁻¹⁶	6.10 × 10⁻⁴	4.83 × 10⁻⁸	4.85 × 10⁻⁵	4.04 × 10⁻¹⁴
ZDHHC11B	3.18 × 10⁻⁶	1.56 × 10⁻⁵	8.41 × 10⁻³	3.00 × 10⁻²	6.50 × 10⁻³

* Transcriptome-wide association study (TWAS) P values are shown before and after conditioning the Pancreatic Cancer Cohort Consortium–Pancreatic Cancer Case-Control Consortium genome-wide association study (GWAS) analysis on four independent pancreatic cancer risk signals at chr5p15.33 per Wang et al. (50), Petersen et al. (7), Wolpin et al. (8), Zhang et al. (10), and Fang et al. (12).

Transcriptome Changes Associated With High and Low Expression of Genes Identified by TWAS

To begin unravelling the potential consequences associated with different expression levels of TWAS-identified genes, we assessed transcriptome differences between samples in the top vs bottom quartiles of expression for each gene in the GTEx and LTG pancreatic datasets (see Supplementary Methods and Tables 4 and 5, available online) as previously described (40). Because this analysis may be most relevant for well-expressed genes that are highly differentially expressed between samples in the top vs bottom quartile of expression, we focused on CELA3B, which was highly expressed and showed a large difference in median expression (GTEx = 76%; LTG = 91%) between samples in the top and bottom quartiles (Supplementary Table 6 and Figure 8, available online). Pathway enrichment analyses for genes differentially expressed in samples at the top vs bottom quartile of CELA3B gene expression showed a negative correlation between expression of CELA3B and inflammatory and immune response genes (Table 4), indicating that low CELA3B expression may be associated with an inflammatory state in the pancreas.
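The quartile comparison can be illustrated with a toy example. The sketch below is not the procedure from the Supplementary Methods, which is not reproduced here; it assumes a simple median-based fold change and borrows the twofold cutoff from the Table 4 footnote. The gene names (`TARGET`, `GENE_INV`, `GENE_FLAT`), the synthetic expression values, and the helper functions are all hypothetical.

```python
import statistics


def quartile_groups(expr, target):
    """Return (bottom, top) sample IDs by quartile of `target` expression."""
    ordered = sorted(expr, key=lambda sample: expr[sample][target])
    k = len(ordered) // 4
    return ordered[:k], ordered[-k:]


def fold_changes(expr, target):
    """Per gene: median in bottom-quartile samples / median in top-quartile samples."""
    bottom, top = quartile_groups(expr, target)
    genes = [g for g in next(iter(expr.values())) if g != target]
    return {
        g: statistics.median(expr[s][g] for s in bottom)
        / statistics.median(expr[s][g] for s in top)
        for g in genes
    }


# Synthetic data: 40 samples; GENE_INV falls as TARGET rises, GENE_FLAT is constant.
expr = {
    f"sample_{i}": {"TARGET": float(i), "GENE_INV": 1000.0 / i, "GENE_FLAT": 50.0}
    for i in range(1, 41)
}

fc = fold_changes(expr, "TARGET")
# Genes at least twofold higher when the target gene is low.
inverse_genes = sorted(g for g, ratio in fc.items() if ratio >= 2.0)
print(inverse_genes)
```

In a real analysis, the resulting gene list would then be passed to a pathway enrichment tool such as DAVID, as described for Table 4.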

Table 4.

Pathway enrichment analysis for genes expressed at higher levels in samples with low vs high CELA3B expression in the GTEx and LTG transcriptome datasets

Pathway enrichment for genes inversely associated with CELA3B expression
Category	Term	DE genes, No.*	GTEx fold enrichment†	GTEx P‡	LTG fold enrichment†	LTG P‡
GO biological process	Inflammatory response	72	6.3	3.8 × 10⁻³³	3.2	6.0 × 10⁻⁵⁵
GO biological process	Immune response	70	5.6	1.2 × 10⁻²⁸	3	2.7 × 10⁻⁵⁰
KEGG	Staphylococcus aureus infection	24	11.5	7.9 × 10⁻¹⁷	4.5	3.5 × 10⁻²¹
GO biological process	Cell adhesion	57	4.1	1.4 × 10⁻¹⁶	2.7	1.2 × 10⁻⁴⁰
GO biological process	Innate immune response	52	4	1.6 × 10⁻¹⁴	2.4	8.6 × 10⁻²⁵

* Genes expressed at twofold or higher levels in samples in the bottom vs top quartile of CELA3B (chymotrypsin-like elastase 3B) mRNA expression in the GTEx pancreas and LTG histologically normal pancreatic transcriptome datasets were included in a pathway enrichment analysis using DAVID. GTEx = Genotype-Tissue Expression; LTG = Laboratory of Translational Genomics; DE = differentially expressed; GO = Gene Ontology; KEGG = Kyoto Encyclopedia of Genes and Genomes.

† Fold enrichment for these genes in the pathways listed.

‡ Benjamini-Hochberg false discovery rate–corrected P values.

Discussion

To identify novel susceptibility loci and putative causally relevant genes for pancreatic cancer development, we integrated eQTL datasets derived from pancreatic and other tissues with the largest currently available pancreatic cancer GWAS dataset (11) and identified 25 genes whose genetically predicted expression was associated with pancreatic cancer risk. These genes localize to 17 genomic regions, of which 11 do not overlap with known PDAC risk loci.
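The gene and region counts in this summary can be checked against Tables 1 and 2 with a simple tally. The region-to-gene mapping below is transcribed from those tables; the variable names are illustrative.

```python
# Region -> TWAS genes at loci not previously identified by GWAS (Table 1).
novel_loci = {
    "1p36.12": ["CELA3B"],
    "9q31.1": ["SMC2", "SMC2-AS1"],
    "10q23.31": ["RP11-80H5.9"],
    "12q13.13": ["SMUG1"],
    "14q32.33": ["BTBD6"],
    "15q23": ["HEXA"],
    "15q26.1": ["RCCD1"],
    "17q12": ["PNMT", "CDK12", "PGAP3"],
    "17q22": ["SUPT4H1"],
    "18q11.22": ["RP11-888D10.3"],
    "19p13.11": ["PGPEP1"],
}

# Region -> TWAS genes at known pancreatic cancer risk loci (Table 2).
known_loci = {
    "5p15.33": ["TERT", "CLPTM1L", "ZDHHC11B"],
    "7p14.1": ["INHBA"],
    "9q34.2": ["ABO"],
    "13q12.2": ["PDX1"],
    "13q22.1": ["KLF5"],
    "16q23.1": ["WDR59", "CFDP1", "BCAR1", "TMEM170A"],
}

n_genes = sum(len(genes) for genes in novel_loci.values()) + sum(
    len(genes) for genes in known_loci.values()
)
n_regions = len(novel_loci) + len(known_loci)

print(f"{n_genes} genes in {n_regions} regions; {len(novel_loci)} regions are novel")
```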

Several TWAS genes identified at novel loci function in DNA repair, chromosome organization, and cell division. SMC2 (9q31.1) encodes structural maintenance of chromosomes protein 2, a core component of the condensin complex, which regulates chromosome organization during mitosis and meiosis and plays a critical role in single-strand break DNA repair (41–43). SMUG1 (12q13.13) encodes a base excision repair enzyme (single-strand-selective monofunctional uracil-DNA glycosylase 1) that repairs several DNA-pyrimidine oxidation products, some of which are mutagenic (44). RCCD1 (15q26.1) encodes RCC1 domain-containing protein 1, a partner of histone H3K36 demethylase KDM8; this complex is important for spindle organization, chromosome segregation, and accurate mitotic division (45). CDK12 (cyclin-dependent kinase 12, 17q12) belongs to the cyclin-dependent kinase (CDK) family of serine and threonine protein kinases that regulate transcriptional and posttranscriptional processes, including DNA damage response, splicing, pre-mRNA processing, development, and differentiation (46,47). CDK12 is mutated in some tumors and overexpressed in others, indicating that it may have context-dependent oncogenic and tumor suppressor functions (46). Decreased genetically predicted expression of SMUG1, RCCD1, and CDK12 was associated with increased risk of pancreatic cancer, in agreement with their roles in maintaining genome stability. Conversely, increased SMC2 expression was associated with pancreatic cancer risk, which is less consistent with its role in cell division and DNA repair but aligns with reports showing that its expression is regulated by the transcription factors β-catenin-TCF4 and that it is important for WNT-mediated cell proliferation in intestinal cells (48).

At chr1p36, another locus not previously reported in GWAS, genetically predicted CELA3B expression was inversely associated with risk of pancreatic cancer. This gene encodes chymotrypsin-like elastase family member 3B, which, along with other pancreatic serine proteases, has a digestive function (49). Pathway enrichment analysis indicated that low expression of CELA3B may be associated with an inflammatory state in the pancreas, which is notable given that inflammatory conditions, including pancreatitis, increase the risk of pancreatic cancer (3).

Chr5p15.33 is a well-known multicancer risk locus with multiple independent signals reported in the TERT-CLPTM1L gene region for more than 10 different cancers, including pancreatic cancer (5,10,12,50–53). TERT encodes the catalytic subunit of the telomerase reverse transcriptase complex, whose major function is to maintain chromosome ends and overall chromosomal integrity (54–58). The CLPTM1L gene, relatively unknown until a few years ago, is now known to encode a multipass transmembrane protein that promotes growth and is frequently overexpressed in pancreatic and lung cancers (59–61). It is important for the endoplasmic reticulum stress response, apoptosis, cytokinesis, and KRAS-driven lung cancer (61). Using cross-tissue prediction models, we identified both TERT and CLPTM1L as pancreatic cancer TWAS genes, with the direction of effect being positive or negative depending on the tissue. This type of pleiotropy for chr5p15.33 has been previously described by us and others (5,10,12,50–53).

Some of the genes identified in our study have been reported in TWAS for breast cancer (RCCD1, KLF5), ovarian cancer (RCCD1), and type 2 diabetes (RCCD1) (20,22,62,63). KLF5 is located at chr13q22.1, a pancreatic cancer risk locus in a large, nongenic region flanked by KLF5 and KLF12 (13). It encodes Kruppel-like factor 5, a zinc finger transcription factor that is frequently overexpressed in pancreatic cancer and is important for Kras-mediated pancreatic tumorigenesis in mouse models (64). We have previously shown that DIS3, a gene that encodes a catalytic subunit of the nuclear RNA exosome complex that mediates RNA processing and decay, represents a functional gene at chr13q22.1 (13); our current findings indicate that KLF5 may also play a role at this risk locus. None of the suggestive genes (unadjusted P < .05) reported in a recent but much smaller TWAS for pancreatic cancer (65) overlap with the genes reported in our study. Three loci overlap with our recent pathway-based analysis of pancreatic cancer GWAS data (chr9q31.1: SMC2; chr15q23: HEXA; and chr17q12: PNMT, CDK12, and PGAP3) and are suggestive in the GWAS analysis (66).

TWAS is an attractive method for mapping risk loci that influence gene expression, but the approach has both strengths and limitations. Benefits include a reduced multiple testing burden and the nomination of plausible candidate risk genes. However, identification of trait-associated gene expression differences by TWAS does not imply causality, and functional studies are needed to comprehensively determine underlying mechanisms of risk. Furthermore, coregulated genes can present as multiple associated genes at the same locus, even when only one of them underlies the signal. Finally, only cis-eQTLs are assessed, and genes whose genetically regulated expression cannot be predicted using SNPs are not evaluated. In the future, larger transcriptome and GWAS datasets for pancreatic cancer are likely to further improve statistical power for gene identification using this approach. Likewise, transcriptome datasets from specific cell types within the pancreas, such as acinar and ductal cells, could also improve future pancreatic cancer TWAS approaches.
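The last limitation can be made concrete with the summary-statistic form of the TWAS test described for FUSION-style analyses (17), in which cis-SNP expression weights w, GWAS Z scores z, and the SNP correlation (LD) matrix V are combined as Z_TWAS = w'z / sqrt(w'Vw). The sketch below is illustrative only: the function name and all numeric inputs are hypothetical, and it is not the pipeline used in this study. A gene whose prediction model has no nonzero weights yields no test statistic, so it cannot be evaluated.

```python
import math
from typing import List, Optional


def twas_z(weights: List[float], gwas_z: List[float],
           ld: List[List[float]]) -> Optional[float]:
    """Summary-statistic TWAS Z score: w'z / sqrt(w' V w).

    weights: cis-SNP weights from an expression prediction model
    gwas_z:  GWAS Z scores for the same SNPs, in the same order
    ld:      SNP correlation (LD) matrix V, given as a list of rows
    Returns None when the predicted-expression variance is zero,
    i.e. the gene has no usable SNP weights and is not testable.
    """
    n = len(weights)
    numerator = sum(w * z for w, z in zip(weights, gwas_z))
    variance = sum(weights[i] * ld[i][j] * weights[j]
                   for i in range(n) for j in range(n))
    if variance <= 0.0:
        return None
    return numerator / math.sqrt(variance)


# Hypothetical three-SNP gene model (all values invented for illustration).
toy_weights = [0.3, -0.1, 0.2]
toy_gwas_z = [2.5, -1.0, 3.0]
toy_ld = [
    [1.0, 0.2, 0.1],
    [0.2, 1.0, 0.3],
    [0.1, 0.3, 1.0],
]

print(twas_z(toy_weights, toy_gwas_z, toy_ld))      # testable gene
print(twas_z([0.0, 0.0, 0.0], toy_gwas_z, toy_ld))  # no predictive SNPs: None
```

The same form also shows why coregulated genes can produce several signals at one locus: models that share SNPs in LD reuse the same GWAS Z scores, so their test statistics are correlated.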

In summary, we report 25 genes whose genetically predicted expression was associated with pancreatic cancer risk (FDR < .05), including 14 genes at 11 novel genomic loci. Twelve of these genes remained statistically significant after Bonferroni correction. Our findings provide new insights into the genetic basis of pancreatic cancer risk and identify target genes for future functional studies to thoroughly explore the mechanistic underpinnings of risk at each locus.
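The gap between the FDR-significant and Bonferroni-significant gene counts reflects how the two corrections set their thresholds. The sketch below contrasts them, assuming a Benjamini-Hochberg step-up procedure for FDR control; this section does not name the exact FDR method, so that choice is an assumption. The function names and P values are hypothetical and are not the study's data.

```python
from typing import List


def bonferroni(pvals: List[float], alpha: float = 0.05) -> List[bool]:
    """Flag tests with P <= alpha / m (family-wise error control)."""
    m = len(pvals)
    return [p <= alpha / m for p in pvals]


def benjamini_hochberg(pvals: List[float], alpha: float = 0.05) -> List[bool]:
    """Flag tests passing the Benjamini-Hochberg step-up rule (FDR control)."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    # Find the largest rank k with p_(k) <= alpha * k / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if pvals[idx] <= alpha * rank / m:
            k_max = rank
    # Every test ranked at or below k_max is declared significant.
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= k_max:
            significant[idx] = True
    return significant


# Invented P values for ten hypothetical gene-level tests.
toy_pvals = [0.001, 0.008, 0.012, 0.03, 0.2, 0.5, 0.7, 0.9, 0.04, 0.6]

print("Bonferroni hits:", sum(bonferroni(toy_pvals)))
print("Benjamini-Hochberg hits:", sum(benjamini_hochberg(toy_pvals)))
```

Bonferroni holds every test to the same strict threshold, whereas the step-up rule relaxes the threshold with rank, so the FDR-significant set contains the Bonferroni-significant set.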

Funding

This work was supported by the Intramural Research Program (IRP) of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health (NIH).

Notes

The funders had no role in the design of the study; the collection, analysis, and interpretation of the data; the writing of the manuscript; and the decision to submit the manuscript for publication. The authors have no conflicts of interest to disclose. Acknowledgements, data access, and additional funding information are listed in the Supplementary Material (available online).

The content of this publication does not necessarily reflect the views or policies of the US Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US government.

References

1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. 2018;68(1):7–30.

2. Bray F, Ferlay J, Soerjomataram I, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.

3. Stolzenberg-Solomon RZ, Amundadottir LT. Epidemiology and inherited predisposition for sporadic pancreatic adenocarcinoma. Hematol Oncol Clin North Am. 2015;29(4):619–640.

4. Petersen GM. Familial pancreatic cancer. Semin Oncol. 2016;43(5):548–553.

5. Amundadottir LT. Pancreatic cancer genetics. Int J Biol Sci. 2016;12(3):314–325.

6. Amundadottir L, Kraft P, Stolzenberg-Solomon RZ, et al. Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer. Nat Genet. 2009;41(9):986–990.

7. Petersen GM, Amundadottir L, Fuchs CS, et al. A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nat Genet. 2010;42(3):224–228.

8. Wolpin BM, Rizzato C, Kraft P, et al. Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer. Nat Genet. 2014;46(9):994–1000.

9. Childs EJ, Mocci E, Campa D, et al. Common variation at 2p13.3, 3q29, 7p13 and 17q25.1 associated with susceptibility to pancreatic cancer. Nat Genet. 2015;47(8):911–916.

10. Zhang M, Wang Z, Obazee O, et al. Three new pancreatic cancer susceptibility signals identified on chromosomes 1q32.1, 5p15.33 and 8q24.21. Oncotarget. 2016;7(41):66328–66343.

11. Klein AP, Wolpin BM, Risch HA, et al. Genome-wide meta-analysis identifies five new susceptibility loci for pancreatic cancer. Nat Commun. 2018;9(1):556.

12. Fang J, Jia J, Makowski M, et al.; PanScan Consortium. Functional characterization of a multi-cancer risk locus on chr5p15.33 reveals regulation of TERT by ZNF148. Nat Commun. 2017;8(1):15034.

13. Hoskins JW, Ibrahim A, Emmanuel MA, et al. Functional characterization of a chr13q22.1 pancreatic cancer risk locus reveals long-range interaction and allele-specific effects on DIS3 expression. Hum Mol Genet. 2016;25(21):4726–4738.

14. Zheng J, Huang X, Tan W, et al. Pancreatic cancer risk variant in LINC00673 creates a miR-1231 binding site and interferes with PTPN11 degradation. Nat Genet. 2016;48(7):747–757.

15. Schaid DJ, Chen W, Larson NB. From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat Rev Genet. 2018;19(8):491–504.

16. Gamazon ER, Wheeler HE, Shah KP, et al.; GTEx Consortium. A gene-based association method for mapping traits using reference transcriptome data. Nat Genet. 2015;47(9):1091–1098.

17. Gusev A, Ko A, Shi H, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48(3):245–252.

18. Pasaniuc B, Price AL. Dissecting the genetics of complex traits using summary association statistics. Nat Rev Genet. 2017;18(2):117–127.

19. Zhang T, Choi J, Kovacs MA, et al. Cell-type-specific eQTL of primary melanocytes facilitates identification of melanoma susceptibility genes. Genome Res. 2018;28(11):1621–1635.

20. Wu L, Shi W, Long J, et al. A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer. Nat Genet. 2018;50(7):968–978.

21. Mancuso N, Gayther S, Gusev A, et al. Large-scale transcriptome-wide association study identifies new prostate cancer risk regions. Nat Commun. 2018;9(1):4079.

22. Lu Y, Beeghly-Fadiel A, Wu L, et al. A transcriptome-wide association study among 97,898 women to identify candidate susceptibility genes for epithelial ovarian cancer risk. Cancer Res. 2018;78(18):5419–5430.

23. Gusev A, Mancuso N, Won H, et al. Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nat Genet. 2018;50(4):538–548.

24. Theriault S, Gaudreault N, Lamontagne M, et al. A transcriptome-wide association study identifies PALMD as a susceptibility gene for calcific aortic valve stenosis. Nat Commun. 2018;9(1):988.

25. Zhang M, Lykke-Andersen S, Zhu B, et al. Characterising cis-regulatory variation in the transcriptome of histologically normal and tumour-derived pancreatic tissues. Gut. 2018;67(3):521–533.

26. Li X, Kim Y, Tsang EK, et al. The impact of rare variation on gene expression across tissues. Nature. 2017;550(7675):239–243.

27. GTEx Consortium. Genetic effects on gene expression across human tissues. Nature. 2017;550(7675):204–213.

28. Dobin A, Davis CA, Schlesinger F, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21.

29. Das S, Forer L, Schonherr S, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–1287.

30. Zheng X, Levine D, Shen J, et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics. 2012;28(24):3326–3328.

31. Stegle O, Parts L, Durbin R, et al. A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies. PLoS Comput Biol. 2010;6(5):e1000770.

32. Barbeira AN, Dickinson SP, Bonazzola R, et al.; GTEx Consortium. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9(1):1825.

33. Barbeira AN, Pividori MD, Zheng J, et al. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 2019;15(1):e1007889.

34. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140.

35. McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012;40(10):4288–4297.

36. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.

37. Huang da W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37(1):1–13.

38. The GTEx Consortium. The genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science. 2015;348(6235):648–660.

39. Wheeler HE, Shah KP, Brenner J, et al. Survey of the heritability and sparse architecture of gene expression traits across human tissues. PLoS Genet. 2016;12(11):e1006423.

40. Cobo I, Martinelli P, Flandez M, et al. Transcriptional regulation by NR5A2 links differentiation and inflammation in the pancreas. Nature. 2018;554(7693):533–537.

41. Takemoto A, Kimura K, Yokoyama S, et al. Cell cycle-dependent phosphorylation, nuclear localization, and activation of human condensin. J Biol Chem. 2004;279(6):4551–4559.

42. Schmiesing JA, Gregson HC, Zhou S, et al. A human condensin complex containing hCAP-C-hCAP-E and CNAP1, a homolog of Xenopus XCAP-D2, colocalizes with phosphorylated histone H3 during the early stage of mitotic chromosome condensation. Mol Cell Biol. 2000;20(18):6996–7006.

43. Kong X, Stephens J, Ball AR Jr, et al. Condensin I recruitment to base damage-enriched DNA lesions is modulated by PARP1. PLoS One. 2011;6(8):e23548.

44. Wood RD, Mitchell M, Sgouros J, et al. Human DNA repair genes. Science. 2001;291(5507):1284–1289.

45. Marcon E, Ni Z, Pu S, et al. Human-chromatin-related protein interactions identify a demethylase complex required for chromosome segregation. Cell Rep. 2014;8(1):297–310.

46. Paculova H, Kohoutek J. The emerging roles of CDK12 in tumorigenesis. Cell Div. 2017;12:7. doi: 10.1186/s13008-017-0033-x.

47. Dubbury SJ, Boutz PL, Sharp PA. CDK12 regulates DNA repair genes by suppressing intronic polyadenylation. Nature. 2018;564(7734):141–145.

48. Davalos V, Suarez-Lopez L, Castano J, et al. Human SMC2 protein, a core subunit of human condensin complex, is a novel transcriptional target of the WNT signaling pathway and a new therapeutic target. J Biol Chem. 2012;287(52):43472–43481.

49. O’Leary NA, Wright MW, Brister JR, et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44(D1):D733–D745.

50. Wang Z, Zhu B, Zhang M, et al. Imputation and subset-based association analysis across different cancer types identifies multiple independent risk loci in the TERT-CLPTM1L region on chromosome 5p15.33. Hum Mol Genet. 2014;23(24):6616–6633.

51. Mocellin S, Verdi D, Pooley KA, et al. Telomerase reverse transcriptase locus polymorphisms and cancer risk: a field synopsis and meta-analysis. J Natl Cancer Inst. 2012;104(11):840–854.

52. Bojesen SE, Pooley KA, Johnatty SE, et al. Multiple independent variants at the TERT locus are associated with telomere length and risks of breast and ovarian cancer. Nat Genet. 2013;45(4):371–384, 384e1–384e2.

53. Kote-Jarai Z, Saunders EJ, Leongamornlert DA, et al. Fine-mapping identifies multiple prostate cancer risk loci at 5p15, one of which associates with TERT expression. Hum Mol Genet. 2013;22(12):2520–2528.

54. Armanios M, Blackburn EH. The telomere syndromes. Nat Rev Genet. 2012;13(10):693–704.

55. Janknecht R. On the road to immortality: HTERT upregulation in cancer cells. FEBS Lett. 2004;564(1-2):9–13.

56. Cheung AL, Deng W. Telomere dysfunction, genome instability and cancer. Front Biosci. 2008;13(13):2075–2090.

57. Kim NW, Piatyszek MA, Prowse KR, et al. Specific association of human telomerase activity with immortal cells and cancer. Science. 1994;266(5193):2011–2015.

58. Shay JW, Bacchetti S. A survey of telomerase activity in human cancer. Eur J Cancer. 1997;33(5):787–791.

59. Jia J, Bosley AD, Thompson A, et al. CLPTM1L promotes growth and enhances aneuploidy in pancreatic cancer cells. Cancer Res. 2014;74(10):2785–2795.

60. Clarke WR, Amundadottir L, James MA. CLPTM1L/CRR9 ectodomain interaction with GRP78 at the cell surface signals for survival and chemoresistance upon ER stress in pancreatic adenocarcinoma cells. Int J Cancer. 2019;144(6):1367–1378.

61. James MA, Vikis HG, Tate E, et al. CRR9/CLPTM1L regulates cell survival signaling and is required for RAS transformation and lung tumorigenesis. Cancer Res. 2014;74(4):1116–1127.

62. Hoffman JD, Graff RE, Emami NC, et al. Cis-eQTL-based trans-ethnic meta-analysis reveals novel genes associated with breast cancer risk. PLoS Genet. 2017;13(3):e1006690.

63. Torres JM, Barbeira AN, Bonazzola R, et al. Integrative cross tissue analysis of gene expression identifies 2 novel type 2 diabetes genes. BioRxiv. 2017. doi:10.1101/108134.

64. He P, Yang JW, Yang VW, et al. Kruppel-like factor 5, increased in pancreatic ductal adenocarcinoma, promotes proliferation, acinar-to-ductal metaplasia, pancreatic intraepithelial neoplasia, and tumor growth in mice. Gastroenterology. 2018;154(5):1494–1508.e13.

65. Gong L, Zhang D, Lei Y, et al. Transcriptome-wide association study identifies multiple genes and pathways associated with pancreatic cancer. Cancer Med. 2018;7(11):5727–5732.

66. Walsh N, Zhang H, Hyland PL, et al. Agnostic pathway/gene set analysis of genome-wide association data identifies associations for pancreatic cancer. J Natl Cancer Inst. 2019;111(6):557–567.