DeLIVR: a deep learning approach to IV regression for testing nonlinear causal effects in transcriptome-wide association studies

Simulations: Empirical Type I error rates and power (over} 100 runs for DeLIVR and DeepIV, 1000 runs for TWAS); For TWAS, the numbers in parentheses are obtained on the test set with the Cauchy Combination Test (see Section3.1); “global” refers to (2.7) and “nonlinear” refers to (2.8)

		TWAS-L	TWAS-LQ		DeLIVR		DeLIVR + TWAS-L	DeepIV
Gene			Global	Nonlinear	Global	Nonlinear	Global	Global	Nonlinear
CDK2AP1	Null	0.06 (0.05)	0.05 (0.05)	0.05 (0.05)	0.05	0.05	0.05	0.08	0.07
	Linear	0.98 (0.95)	0.92 (0.91)	0.04 (0.07)	0.78	0.01	0.84	0.49	0.02
	Quadratic	0.06 (0.09)	0.55 (0.53)	0.64 (0.62)	0.64	0.57	0.63	0.27	0.24
	Cubic	0.91 (0.87)	0.84 (0.78)	0.07 (0.09)	1.0	0.95	1.0	0.84	0.51
NT5DC2	Null	0.05 (0.05)	0.03 (0.06)	0.04 (0.06)	0.05	0.04	0.05	0.05	0.04
	Linear	0.93 (0.90)	0.88 (0.85)	0.05 (0.06)	0.69	0	0.77	0.33	0.05
	Quadratic	0.13 (0.14)	0.84 (0.81)	0.88 (0.84)	0.81	0.71	0.81	0.42	0.37
	Cubic	0.13 (0.14)	0.15 (0.16)	0.10 (0.12)	0.47	0.37	0.45	0.08	0.04

		TWAS-L	TWAS-LQ		DeLIVR		DeLIVR + TWAS-L	DeepIV
Gene			Global	Nonlinear	Global	Nonlinear	Global	Global	Nonlinear
CDK2AP1	Null	0.06 (0.05)	0.05 (0.05)	0.05 (0.05)	0.05	0.05	0.05	0.08	0.07
	Linear	0.98 (0.95)	0.92 (0.91)	0.04 (0.07)	0.78	0.01	0.84	0.49	0.02
	Quadratic	0.06 (0.09)	0.55 (0.53)	0.64 (0.62)	0.64	0.57	0.63	0.27	0.24
	Cubic	0.91 (0.87)	0.84 (0.78)	0.07 (0.09)	1.0	0.95	1.0	0.84	0.51
NT5DC2	Null	0.05 (0.05)	0.03 (0.06)	0.04 (0.06)	0.05	0.04	0.05	0.05	0.04
	Linear	0.93 (0.90)	0.88 (0.85)	0.05 (0.06)	0.69	0	0.77	0.33	0.05
	Quadratic	0.13 (0.14)	0.84 (0.81)	0.88 (0.84)	0.81	0.71	0.81	0.42	0.37
	Cubic	0.13 (0.14)	0.15 (0.16)	0.10 (0.12)	0.47	0.37	0.45	0.08	0.04

Table 1

Open in new tab Download slide

Simulations: Empirical Type I error rates and power (over} 100 runs for DeLIVR and DeepIV, 1000 runs for TWAS); For TWAS, the numbers in parentheses are obtained on the test set with the Cauchy Combination Test (see Section3.1); “global” refers to (2.7) and “nonlinear” refers to (2.8)

		TWAS-L	TWAS-LQ		DeLIVR		DeLIVR + TWAS-L	DeepIV
Gene			Global	Nonlinear	Global	Nonlinear	Global	Global	Nonlinear
CDK2AP1	Null	0.06 (0.05)	0.05 (0.05)	0.05 (0.05)	0.05	0.05	0.05	0.08	0.07
	Linear	0.98 (0.95)	0.92 (0.91)	0.04 (0.07)	0.78	0.01	0.84	0.49	0.02
	Quadratic	0.06 (0.09)	0.55 (0.53)	0.64 (0.62)	0.64	0.57	0.63	0.27	0.24
	Cubic	0.91 (0.87)	0.84 (0.78)	0.07 (0.09)	1.0	0.95	1.0	0.84	0.51
NT5DC2	Null	0.05 (0.05)	0.03 (0.06)	0.04 (0.06)	0.05	0.04	0.05	0.05	0.04
	Linear	0.93 (0.90)	0.88 (0.85)	0.05 (0.06)	0.69	0	0.77	0.33	0.05
	Quadratic	0.13 (0.14)	0.84 (0.81)	0.88 (0.84)	0.81	0.71	0.81	0.42	0.37
	Cubic	0.13 (0.14)	0.15 (0.16)	0.10 (0.12)	0.47	0.37	0.45	0.08	0.04

		TWAS-L	TWAS-LQ		DeLIVR		DeLIVR + TWAS-L	DeepIV
Gene			Global	Nonlinear	Global	Nonlinear	Global	Global	Nonlinear
CDK2AP1	Null	0.06 (0.05)	0.05 (0.05)	0.05 (0.05)	0.05	0.05	0.05	0.08	0.07
	Linear	0.98 (0.95)	0.92 (0.91)	0.04 (0.07)	0.78	0.01	0.84	0.49	0.02
	Quadratic	0.06 (0.09)	0.55 (0.53)	0.64 (0.62)	0.64	0.57	0.63	0.27	0.24
	Cubic	0.91 (0.87)	0.84 (0.78)	0.07 (0.09)	1.0	0.95	1.0	0.84	0.51
NT5DC2	Null	0.05 (0.05)	0.03 (0.06)	0.04 (0.06)	0.05	0.04	0.05	0.05	0.04
	Linear	0.93 (0.90)	0.88 (0.85)	0.05 (0.06)	0.69	0	0.77	0.33	0.05
	Quadratic	0.13 (0.14)	0.84 (0.81)	0.88 (0.84)	0.81	0.71	0.81	0.42	0.37
	Cubic	0.13 (0.14)	0.15 (0.16)	0.10 (0.12)	0.47	0.37	0.45	0.08	0.04

3.3. DeLIVR provided more accurate and stable estimates of E(g(X)|Z) than DeepIV

Figure 2 shows the fitted models of DeLIVR and DeepIV for gene NT5DC2. The black dashed lines are the true |$E(g(\boldsymbol{X})|\boldsymbol{Z})$| as a function of |$\boldsymbol{\mu}_{\boldsymbol{Z}}$|⁠, and the grey points are |$\boldsymbol{Y}$| for each unique value of |$\hat{\boldsymbol{\mu}}_{\boldsymbol{Z}}$|⁠. The red dashed lines are the average of |$\hat{E}(g(\boldsymbol{X})|\boldsymbol{Z})$| over 100 runs given by DeLIVR. The red shaded area is the “empirical point-wise confidence intervals”—the 97.5th and 2.5th percentiles from the 100 runs. The estimates for DeepIV were calculated by MC simulations. DeLIVR was almost unbiased for the true |$E(g(\boldsymbol{X})|\boldsymbol{Z})$|⁠. On the other hand, DeepIV could only capture the general shape of the true function, and the estimates were very unstable with large variability, even after averaging over 100 runs, which explained its low power.

$Estimates of $E(g(\boldsymbol{X})|\boldsymbol{Z})$ given by DeLIVR left) and DeepIV (right): the dash-dotted line is the true $E(g(\boldsymbol{X})|\boldsymbol{Z})$; the dashed line is the average of $\hat{E}(g(\boldsymbol{X})|\boldsymbol{Z})$ over 100 runs; the shaded area is the empirical point wise $95\%$ CI of $\hat{E}(g(\boldsymbol{X})|\boldsymbol{Z})$.$

Figure 2

Estimates of |$E(g(\boldsymbol{X})|\boldsymbol{Z})$| given by DeLIVR left) and DeepIV (right): the dash-dotted line is the true |$E(g(\boldsymbol{X})|\boldsymbol{Z})$|⁠; the dashed line is the average of |$\hat{E}(g(\boldsymbol{X})|\boldsymbol{Z})$| over 100 runs; the shaded area is the empirical point wise |$95\%$| CI of |$\hat{E}(g(\boldsymbol{X})|\boldsymbol{Z})$|⁠.

Section S2 of the Supplementary material available at Biostatistics online presents the results for gene CDK2AP1 and the estimated |$g(\boldsymbol{X})$| given by DeepIV.

3.4. Robust test of DeLIVR maintained correct Type I errors in the presence of invalid IVs

Table 2 shows the Type I error rates and power of the robust test of DeLIVR for gene NT5DC2 with both uncorrelated and correlated pleiotropy. DeLIVR could control the Type I error rate at the nominal level of 0.05 with the robust test whereas the default nonlinearity test had dramatically inflated Type I error rates of nearly 1 if there were any invalid IVs. On the other hand, we observed slightly decreasing power of the robust test as the number of invalid IVs increased.

Table 2

Empirical Type I error rates and power of DeLIVR over 100 runs with both correlated and uncorrelated pleiotropy. Gene: NT5DC2; Tests: “nonlinear” refers to (2.8), “robust” refers to (2.12)

	Models	DeLIVR
	No. of invalid IVs	0		1		2		3
Horizontal pleiotropy	Tests	Nonlinear	Robust	Nonlinear	Robust	Nonlinear	Robust	nonlinear	robust
Correlated and uncorrelated	Null	0.03	0.07	0.84	0.07	0.99	0.03	.96	.05
	Linear	0.01	0.03	—	0.08	—	0.09	-	.06
	Quadratic	0.74	0.52	—	0.60	—	0.47	-	.34
	Cubic	0.47	0.59	—	0.38	—	0.33	-	.40
Uncorrelated	Null	0.04	0.08	0.86	0.06	0.88	0.07	.98	.07
	Linear	0.02	0.06	—	0.01	—	0.06	-	.02
	Quadratic	0.80	0.57	—	0.62	—	0.51	-	.27
	Cubic	0.45	0.53	—	0.31	—	0.23	-	.14

	Models	DeLIVR
	No. of invalid IVs	0		1		2		3
Horizontal pleiotropy	Tests	Nonlinear	Robust	Nonlinear	Robust	Nonlinear	Robust	nonlinear	robust
Correlated and uncorrelated	Null	0.03	0.07	0.84	0.07	0.99	0.03	.96	.05
	Linear	0.01	0.03	—	0.08	—	0.09	-	.06
	Quadratic	0.74	0.52	—	0.60	—	0.47	-	.34
	Cubic	0.47	0.59	—	0.38	—	0.33	-	.40
Uncorrelated	Null	0.04	0.08	0.86	0.06	0.88	0.07	.98	.07
	Linear	0.02	0.06	—	0.01	—	0.06	-	.02
	Quadratic	0.80	0.57	—	0.62	—	0.51	-	.27
	Cubic	0.45	0.53	—	0.31	—	0.23	-	.14

Table 2

Open in new tab Download slide

Empirical Type I error rates and power of DeLIVR over 100 runs with both correlated and uncorrelated pleiotropy. Gene: NT5DC2; Tests: “nonlinear” refers to (2.8), “robust” refers to (2.12)

	Models	DeLIVR
	No. of invalid IVs	0		1		2		3
Horizontal pleiotropy	Tests	Nonlinear	Robust	Nonlinear	Robust	Nonlinear	Robust	nonlinear	robust
Correlated and uncorrelated	Null	0.03	0.07	0.84	0.07	0.99	0.03	.96	.05
	Linear	0.01	0.03	—	0.08	—	0.09	-	.06
	Quadratic	0.74	0.52	—	0.60	—	0.47	-	.34
	Cubic	0.47	0.59	—	0.38	—	0.33	-	.40
Uncorrelated	Null	0.04	0.08	0.86	0.06	0.88	0.07	.98	.07
	Linear	0.02	0.06	—	0.01	—	0.06	-	.02
	Quadratic	0.80	0.57	—	0.62	—	0.51	-	.27
	Cubic	0.45	0.53	—	0.31	—	0.23	-	.14

	Models	DeLIVR
	No. of invalid IVs	0		1		2		3
Horizontal pleiotropy	Tests	Nonlinear	Robust	Nonlinear	Robust	Nonlinear	Robust	nonlinear	robust
Correlated and uncorrelated	Null	0.03	0.07	0.84	0.07	0.99	0.03	.96	.05
	Linear	0.01	0.03	—	0.08	—	0.09	-	.06
	Quadratic	0.74	0.52	—	0.60	—	0.47	-	.34
	Cubic	0.47	0.59	—	0.38	—	0.33	-	.40
Uncorrelated	Null	0.04	0.08	0.86	0.06	0.88	0.07	.98	.07
	Linear	0.02	0.06	—	0.01	—	0.06	-	.02
	Quadratic	0.80	0.57	—	0.62	—	0.51	-	.27
	Cubic	0.45	0.53	—	0.31	—	0.23	-	.14

When there was no invalid IV, the robust test lost power, as compared to the (default) nonlinearity test, in the quadratic case but not necessarily in the cubic case. The results for rDeLIVR and CDK2AP1 were similar and thus relegated to Section S2 of the Supplementary material available at Biostatistics online.

3.5. Results for PMR-Egger and VC-TWAS

Due to the space limit, we put the results in the Supplementary material available at Biostatistics online. The general conclusions were the same as discussed in the Introduction.

4. Real data: DeLIVR identified additional genes associated with lipids

For real-data analysis, we will only report the results of the Cauchy Combination Test as the significant genes identified by the Hommel’s method were a subset of those identified by the Cauchy Combination Test. The quantile–quantile plot for p-values is shown for TWAS-L and DeLIVR (Figure 3), and we found that the p-values for DeLIVR were better calibrated than TWAS-L. DeepIV did not identify any significant genes for either traits, so we will focus on comparing DeLIVR with TWAS-L and TWAS-LQ. Figure 3 shows the number of significant genes identified for each method. For HDL, TWAS-L discovered 87 significant genes, and with the global test, DeLIVR discovered 58 genes, 49 of which overlapped with those discovered by TWAS-L. Compared to using the entire data set, testing TWAS-L on the test set only identified 58 genes, showing that the power difference between TWAS-L and DeLIVR was mainly due to the difference in the test sample sizes. Additionally, combining the p-values of DeLIVR and TWAS-L on test data improved the number of significant genes to 65, 61 of which overlapped with those identified by TWAS-L. The nonlinearity test for DeLIVR discovered 10 significant genes, 2 of which were identified by the TWAS-L model. TWAS-LQ identified four genes with the global test, one of which was also identified by DeLIVR. As for the nonlinearity test, TWAS-LQ identified two genes, one of which was identified by DeLIVR. There were eight novel genes solely discovered by both the global test and the nonlinearity test of DeLIVR, among which BUD13 was previously found to be associated with HDL. For example, many variants in BUD13 have been found to be related to HDL and other metabolic traits in both European and Asian populations (Johansen and others, 2010; Zhang and others, 2017; Lin and others, 2016; Oh and others, 2020).

Figure 3

Venn diagrams (left) and Q–Q plots (right) for the numbers of the significant genes for HDL (top) and LDL (bottom); the results of DeLIVR were given by the Cauchy Combination Test over 21 repeated runs for each gene.}

For LDL, TWAS-L discovered 51 significant genes, and with the global test, DeLIVR discovered 40 genes, 29 of which overlap with those discovered by TWAS-L. Testing TWAS-L on the test set identified 35 genes, and combining the p-values of DeLIVR and TWAS-L on test set improved the number of significant genes to 45. The nonlinearity test for DeLIVR discovered 12 significant genes, 5 of which were also identified by TWAS-L. Compared to TWAS-L and TWAS-LQ, the nonlinearity and global test of DeLIVR discovered seven novel genes associated with LDL. Out of the newly identified genes, SLC44A2 and GMIP were identified as significant contributors to the LDL level in many ethnic groups by various studies (Sinnott-Armstrong and others, 2021; De Vries and others, 2019).

The analysis above was based on the assumption that all IVs were valid, which might not be true. Therefore, we applied the robust test in DeLIVR to the significant genes discovered by the nonlinearity test (12 for HDL and 11 for LDL). We used the same Bonferroni corrected cutoff as before (⁠|$0.05/4701 = 1.1\mathrm{e}-5$|⁠). For HDL, none of the genes were significant, though gene BUD13 on chromosome 11 was almost significant with a p-value of |$3.6\mathrm{e}-4$|⁠. For LDL, MAU2 and YJEFN3 on chromosome 19 were significant with p-values of |$6.8\mathrm{e}-14$| and |$2.3\mathrm{e}-7$|⁠, respectively. The plots for the fitted models of these genes are given in Section S3 of the Supplementary material available at Biostatistics online. We can see that for all three genes, DeLIVR showed some nonlinear trends not captured by TWAS-LQ.

5. Discussion

In this article, we have proposed a DL-based IV regression method, DeLIVR, to discover potentially nonlinear causal effects for a more flexible and nonparametric TWAS analysis. In addition, a general hypothesis testing framework was also proposed, along with a robust test in the presence of invalid IVs. In our simulation study, we found that DeLIVR had much higher power than DeepIV, an existing and perhaps best-known DL-based IV regression method. DeLIVR also outperformed the existing TWAS models, such as TWAS-L and TWAS-LQ, when the parametric models were mis-specified. For example, when the true causal relationship between the gene expression/exposure and the trait/outcome was cubic, the nonlinearity test of TWAS-LQ had power of 0.07, dramatically lower than 0.95 of DeLIVR. The simulation study also revealed that DeepIV was unstable in estimating the true causal function |$g(\boldsymbol{X})$|⁠, leading to its low statistical power. Applying DeLIVR to the GTEx and UK Biobank data led to some new discoveries. For example, DeLIVR identified eight genes associated nonlinearly with HDL, which were missed by both parametric TWAS-L, TWAS-LQ, and nonparametric DeepIV. Some of these findings were supported by previous studies. For example, out of the newly discovered genes, BUD13 has been reported to have significant associations with HDL (Johansen and others, 2010; Zhang and others, 2017; Lin and others, 2016; Oh and others, 2020). Similarly, DeLIVR uniquely identified seven putative causal genes for LDL, all of which were missed by TWAS-L, TWAS-LQ, and DeepIV. Some of these genes were still deemed significant by the robust test of DeLIVR after accounting for the possible presence of linear pleiotropic effects of invalid IVs.

Although TWAS has been extensively applied in the last few years, we are not aware of any studies aiming to detect the nonlinear causal relationships with a nonparametric method. Some recently proposed methods addressing the nonlinearity issue, such as TWAS-LQ and PolyMR (Sulc and others, 2021), were however parametric, which may lose power when mis-specified as discussed above. On the other hand, an existing popular nonparametric method, DeepIV, was slow and gave unstable estimates when applied to TWAS data, while hypothesis testing was not studied before. We also note that, although we have focused on TWAS, the proposed method can be broadly applied to other problems, especially if the goal is for nonlinear association testing.

There are a few limitations in this study. First, the major innovation of DeLIVR is to estimate |$E(g(\boldsymbol{X})|\boldsymbol{Z})$|⁠, instead of the causal function |$g(\boldsymbol{X})$|⁠, to serve the purpose of association analysis in TWAS. Hence, DeLIVR does not offer an estimate of |$g(\boldsymbol{X})$|⁠, which may be important in some applications. As a result, DeLIVR is more relevant to association testing (e.g., metabolome- or proteome-wide association studies, in addition to TWAS) than to other IV regression problems. Second, DeLIVR is not applicable to the widely available GWAS summary data, as it requires individual-level data to perform the analysis. Third, the robust test in DeLIVR only works if the underlying causal function is highly nonlinear—cannot be well approximated by a linear function; otherwise, we would lose power. It also depends on the assumption of linear pleiotropic effects of SNPs on a trait, which is however reasonable given small effect sizes of SNPs on complex traits and/or often a small sample size in stage 1, though it is possible and interesting to extend it to nonlinear. Finally, while the power of DL is for multivariate data, we have only considered univariate TWAS with a single gene (or exposure); extensions to multivariate TWAS (or multivariate regression) with multiple genes (or exposures) (Knutson and others, 2020) will be useful.

Data and code availability statement

The GTEx data are available to the approved user at https://www.ncbi.nlm.nih.gov/gap/, and the UKB data are available to the approved user at https://www.ukbiobank.ac.uk/. The R and Python code can be found at https://github.com/RuoyuHe/DeLIVR. For a gene with a total sample size of 200 000, it took approximately 90–120 min to run DeLIVR for all 21 repeats on the AMD Rome.

Supplementary material

Supplementary material is available at http://biostatistics.oxfordjournals.org.

Acknowledgments

The authors thank the reviewers for many helpful comments and suggestions.

Conflict of Interest: The authors declare no conflicts of interest.

Funding

National Institutes of Health (NIH) (U01AG073079, R01AG065636, RF1AG067924, R01AG069895, R01AG074858, R01GM126002, and R01HL116720); and The Minnesota Supercomputing Institute (MSI). The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by National Cancer Institute (NCI), National Human Genome Research Institute (NHGRI), National Heart, Lung, and Blood Institute (NHLBI), National Institute on Drug Abuse (NIDA), National Institute of Mental Health (NIMH), and National Institute of Neurological Disorders and Stroke (NINDS). The data used for the analyses described in this article were obtained from dbGaP Project #26511. The access to the UK Biobank (UKB) data was approved through UKB Application #35107.

References

Abadi,

M.

,

Agarwal,

A.,

Barham,

P.,

Brevdo,

E.,

Chen,

Z.,

Citro,

C.,

Corrado,

G. S.

,

Davis,

A.,

Dean,

J.,

Devin,

M.

and others. (

2015

).

TensorFlow: large-scale machine learning on heterogeneous systems

.

12th USENIX symposium on operating systems design and implementation (OSDI 16)

,

265

–

283

.

Software available from tensorflow.org

.

Chernozhukov,

V.

,

Chetverikov,

D.

,

Demirer,

M.

,

Duflo,

E.

,

Hansen,

C.

,

Newey,

W.

and

Robins,

J.

(

2018

).

Double/debiased machine learning for treatment and structural parameters

.

The Econometrics Journal

21

,

C1

–

C68

.

De Vries,

P. S.

,

Brown,

M. R.,

Bentley,

A. R.,

Sung,

Y.J.,

Winkler,

T.W.,

Ntalla,

I.,

Schwander,

K.,

Kraja,

A.T.,

Guo,

X.,

Franceschini,

N.

and others. (

2019

).

Multiancestry genome-wide association study of lipid levels incorporating gene-alcohol interactions

.

American Journal of Epidemiology

188

,

1033

–

1054

.

Deng,

Y.

and

Pan,

W.

(

2021

).

Model checking via testing for direct effects in Mendelian randomization and transcriptome-wide association studies

.

PLoS Computational Biology

17

,

e1009266

.

Gamazon,

E. R.

,

Wheeler,

H. E.

,

Shah,

K. P.

,

Mozaffari,

S. V.

,

Aquino-Michaels,

K.

,

Carroll,

R. J.

,

Eyler,

A. E.

,

Denny,

J. C.

, GTEx Consortium,

Nicolae,

D. L.

,

Cox,

N. J.

and

Im,

H. K.

(

2015

).

A gene-based association method for mapping traits using reference transcriptome data

.

Nature Genetics

47

,

1091

–

1098

.

GTEx Consortium. (

2020

).

The GTEx consortium atlas of genetic regulatory effects across human tissues

.

Science

369

,

1318

–

1330

.

Gusev,

A.

,

Ko,

A.

,

Shi,

H.

,

Bhatia,

G.

,

Chung,

W.

,

Penninx,

B. W. J. H.

,

Jansen,

R.

,

De Geus,

E. J. C.

,

Boomsma,

D. I.

,

Wright,

F. A.

and others. (

2016

).

Integrative approaches for large-scale transcriptome-wide association studies

.

Nature Genetics

48

,

245

–

252

.

Hartford,

J.

,

Lewis,

G.

,

Leyton-Brown,

K.

and

Taddy,

M.

(

2017

).

Deep IV: a flexible approach for counterfactual prediction

. In:

Jebara,

T.

and others (editors),

Proceedings of the 34th International Conference on Machine Learning

,

PMLR

70

,

pp

.

1414

–

1423

.

ML Research Press

.

Google Preview

Hartford,

J.

,

Veitch,

V.

,

Sridhar,

D.

and

Leyton-Brown,

K.

(

2021

).

Valid causal inference with (some) invalid instruments

. In:

Balcan,

M. F.

and others (editors),

Proceedings of the 38th International Conference on Machine Learning

,

PMLR

139

,

pp

.

4096

–

4106

.

ML Research Press

.

Google Preview

Hemani,

G.

,

Bowden,

J.

and

Davey Smith,

G.

(

2018

).

Evaluating the potential role of pleiotropy in Mendelian randomization studies

.

Human Molecular Genetics

27

,

R195

–

R208

.

Hommel,

G.

(

1983

).

Tests of the overall hypothesis for arbitrary dependence structures

.

Biometrical Journal

25

,

423

–

430

.

Johansen,

C. T.

,

Wang,

J.

,

Lanktree,

M. B.

,

Cao,

H.

,

McIntyre,

A. D.

,

Ban,

M. R.

,

Martins,

R. A.

,

Kennedy,

B. A.

,

Hassell,

R. G.

,

Visser,

M. E.

and others. (

2010

).

Excess of rare variants in genes identified by genome-wide association study of hypertriglyceridemia

.

Nature Genetics

42

,

684

–

687

.

Kim,

J.

,

Pan,

W.

and Alzheimer’s Disease Neuroimaging Initiative. (

2015

).

Highly adaptive tests for group differences in brain functional connectivity

.

NeuroImage: Clinical

9

,

625

–

639

.

Kingma,

D. P.

and

Ba,

J.

(

2015

).

Adam: a method for stochastic optimization

. In:

Bengio,

Y.

and others (editors),

Proceedings of the 3rd International Conference on Learning Representations (ICLR 2015)

.

Knutson,

K. A.

,

Deng,

Y.

and

Pan,

W.

(

2020

).

Implicating causal brain imaging endophenotypes in Alzheimer’s disease using multivariable IWAS and GWAS summary data

.

NeuroImage

223

,

117347

.

Kress,

R.

,

Maz’ya,

V.

and

Kozlov,

V.

(

1989

).

Linear Integral Equations

,

Volume 82

.

Heidelberg

:

Springer Berlin

.

Lin,

E.

,

Kuo,

P.-H.

,

Liu,

Y.-L.

,

Yang,

A. C.

,

Kao,

C.-F.

and

Tsai,

S.-J.

(

2016

).

Association and interaction of APOA5, BUD13, CETP, LIPA and health-related behavior with metabolic syndrome in a Taiwanese population

.

Scientific Reports

6

,

1

–

9

.

PubMed

Lin,

Z.

,

Xue,

H.

,

Malakhov,

M. M.

,

Knutson,

K.

and

Pan,

W.

(

2021

).

Accounting for non-linear effects of gene expression identifies additional associated genes in transcriptome-wide association studies

.

Human Molecular Genetics

31

,

2462

–

2470

.

Liu,

Y.

and

Xie,

J.

(

2020

).

Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures

.

Journal of the American Statistical Association

115

,

393

–

402

.

Newey,

W. K.

(

2013

).

Nonparametric instrumental variables estimation

.

American Economic Review

103

,

550

–

556

.

Oh,

S.-W.

,

Lee,

J.-E

,

Shin,

E.

,

Kwon,

H.

,

Choe,

E. K.

,

Choi,

S.-Y.

,

Rhee,

H.

and

Choi,

S. H.

(

2020

).

Genome-wide association study of metabolic syndrome in Korean populations

.

PLoS One

15

,

e0227357

.

Pan,

W.

(

2011

).

Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing

.

Genetic Epidemiology

35

,

211

–

216

.

Sinnott-Armstrong,

N.

,

Tanigawa,

Y.

,

Amar,

D.

,

Mars,

N.

,

Benner,

C.

,

Aguirre,

M.

,

Venkataraman,

G. R.

,

Wainberg,

M.

,

Ollila,

H. M.

,

Kiiskinen,

T.

and others. (

2021

).

Genetics of 35 blood and urine biomarkers in the UK Biobank

.

Nature Genetics

53

,

185

–

194

.

Sudlow,

C.

,

Gallacher,

J.

,

Allen,

N

,

Beral,

V.

,

Burton,

P.

,

Danesh,

J.

,

Downey,

P.

,

Elliott,

P.

,

Green,

J.

,

Landray,

M.

and others. (

2015

).

UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age

.

PLoS Medicine

12

,

e1001779

.

Sulc,

J.

,

Sjaarda,

J.

and

Kutalik,

Z.

(

2021

).

Polynomial Mendelian randomization reveals widespread non-linear causal effects in the UK biobank

.

Human Genetics and Genomics Advances

3

,

100124

.

Tang,

S.

,

Buchman,

A. S.

,

de Jager,

P. L.

,

Bennett,

D. A.

,

Epstein,

M. P.

and

Yang,

J.

(

2021

).

Novel variance-component TWAS method for studying complex human diseases with applications to Alzheimer’s dementia. PLOS Genetics

17

,

e1009482

.