Abstract

Aim

A recent study has reported that anti-reflux surgery reduced the risk of lung cancer. However, the exact causal association between gastro-esophageal reflux disease (GORD) and lung cancer remains obscure. Therefore, we conducted a multivariable and network Mendelian randomization (MR) study to explore this potential association and mediation effect.

Methods

Independent single nucleotide polymorphisms (SNPs) strongly associated with GORD were selected as instrumental variables (IVs) from the corresponding genome-wide association studies (GWAS). The summary statistics were obtained from the largest GORD GWAS meta-analysis of 367 441 (78 707 cases) European individuals, and the summary statistics of lung cancer and pathological subtypes came from International Lung Cancer Consortium (ILCCO) and FinnGen databases. Univariable and multivariable MR analyses were performed to investigate and verify the causal relationship between genetically predicted GORD and lung cancer. Network MR analysis was conducted to reveal the mediating role of GORD between smoking initiation and lung cancer.

Results

The univariable MR analysis demonstrated that GORD was associated with an increased risk of total lung cancer in both ILCCO [inverse variance weighted (IVW): odds ratio (OR) = 1.37, 95% confidence interval (CI) 1.16–1.62, P = 1.70E-04] and FinnGen database (IVW: OR = 1.25, 95% confidence interval CI 1.03–1.52, P = 2.27E-02). The consistent results were observed after adjusting the potential confounders [smoking traits, body mass index (BMI) and type 2 diabetes] in multivariable MR analyses. In subtype analyses, GORD was associated with lung adenocarcinoma (IVW: OR = 1.27, 95% CI 1.02–1.59, P = 3.48E-02) and lung squamous cell carcinomas (IVW: OR = 1.50, 95% CI 1.22–1.86, P = 1.52E-04). Moreover, GORD mediated 32.43% (95% CI 14.18–49.82%) and 25.00% (95% CI 3.13–50.00%) of the smoking initiation effects on lung cancer risk in the ILCCO and FinnGen databases, respectively.

Conclusion

This study provides credible evidence that genetically predicted GORD was significantly associated with an increased risk of total lung cancer, lung adenocarcinoma and lung squamous cell carcinomas. Furthermore, our results suggest GORD is involved in the mechanism of smoking initiation-induced lung cancer.

Key Messages
  • Recent study has reported that anti-reflux surgery reduced the risk of lung cancer. However, due to bias by uncontrolled confounders, the exact causal association and mediation remain elusive.

  • Genetically predicted gastro-esophageal reflux disease (GORD) was found to increase the risk of total lung cancer, lung adenocarcinoma and lung squamous cell carcinomas in both International Lung Cancer Consortium and FinnGen databases.

  • These findings were further confirmed by multivariable Mendelian randomization (MR) analysis adjusting for potential confounding factors (smoking traits, body mass index and type 2 diabetes).

  • Genetically predicted GORD mediated the increased risk of total lung cancer caused by smoking initiation, suggesting that GORD may be part of the mechanism for smoking-induced lung cancer.

Introduction

Lung cancer is one of the most prevalent malignancies, with a high incidence rate, and is a leading cause of cancer-related deaths worldwide.1 The 5-year overall survival of lung cancer is nearly 19%, partly because more than one-half of cases are diagnosed at an advanced stage.1 Therefore, early detection and diagnosis are of great importance to prevent cancer and improve prognosis. Recently, genetically predicted causal inference between complex risk factors and diseases has been popularized in epidemiological studies, which provide new insights into early screening and prevention of disease.2–4 This study aims to investigate gastro-esophageal reflux disease (GORD) as the potential risk factor for lung cancer, using genetic instruments, and to explore the potential mediation effect, providing new insights for the early prevention and screening of lung cancer.

GORD is a gastrointestinal disorder characterized by frequent regurgitation of stomach acid and bile, which is known as the predominant risk factor of Barrett’s esophagus and esophageal adenocarcinoma.5 Airway aspiration is an important symptom of GORD, which could cause a series of pulmonary chemical irritation symptoms and inflammation.6 However, there are only a few studies, with insufficient evidence, which focus on the potential association between GORD and lung cancer.7 A recent multicentre study from Northern Europe demonstrated that anti-reflux surgery for patients with GORD decreased the risk of lung cancer, but the result might be biased due to the confounder of smoking.8 Additionally, an Asian retrospective study found that GORD was significantly associated with lung cancer.9 Due to the inevitable potential reverse causality and residual confounding in observational studies, whether there is a causal association between GORD and lung cancer remains obscure.

Two-sample Mendelian randomization (MR) analysis is a popular method using single nucleotide polymorphisms (SNPs) as instrumental variables (IVs) to infer credible causal relationship when randomized controlled trials (RCTs) are not feasible.10 As genetic variations were randomly assigned at meiosis and were independent of environmental and other acquired factors, MR design provides a randomization procedure and minimizes the interference of residual confounding and reverse causality.3 To achieve a credible causal inference, IVs of MR analysis must satisfy three assumptions. (i) The instrumental variables must be strongly related to exposure. (ii) The instrumental variables are not associated with any confounders. (iii) The instrumental variables are not directly related to outcomes. Recently, MR studies have revealed a causal association between several risk factors [e.g. smoking, body mass index (BMI), diabetes] and lung cancer.11–14 Nonetheless, there is as yet no MR evidence to prove the causal association and mediation effect between GORD and lung cancer.

Considering that tobacco smoking is a common risk factor for both GORD and lung cancer,15,16 we used mediation analysis to investigate the mediation role of GORD between smoking initiation and lung cancer. Mediation analysis is a statistical approach that aims to identify intermediate factors as potential intervention targets and to improve the cognition of aetiological composition.17 Traditional non-IV mediation analysis is limited by the lack of strong causal assumptions. MR design overcomes these limitations by using the instrumental variable technique. Therefore, network MR analysis is widely used to investigate potential mediation effects in a causal pathway.2,4

In this study, we conducted two-sample Mendelian randomization (MR) analyses utilizing genome-wide association study (GWAS) summary data to investigate the causal effect of GORD on lung cancer susceptibility, based on two large databases. Moreover, we further conducted network MR analysis to explore the potential mediator role of GORD between smoking initiation and lung cancer. These analyses provide causal evidence for the role of GORD in lung cancer genesis, which provides a new perspective on the prediction and prevention of lung cancer.

Methods

We conducted two-sample MR analyses based on the largest GORD dataset and two large lung cancer datasets, to infer the causal effects of GORD on overall lung cancer and its pathological subtypes. Multivariable MR analysis was performed to reveal the direct causal effect of GORD on lung cancer by adjusting potential confounders. Furthermore, we performed network MR analysis to investigate the potential mediation role of GORD between smoking initiation and lung cancer. A detailed flowchart summarizes the study design (Figure 1). This study is designed and reported according to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline, specific for MR (STROBE-MR) (Checklist).18

Flowchart of the study design. GORD, gastro-esophageal reflux disease; TS-MR, two-sample Mendelian randomization; BMI, body mass index; ILCCO, International Lung Cancer Consortium; IVs, instrument variables
Figure 1

Flowchart of the study design. GORD, gastro-esophageal reflux disease; TS-MR, two-sample Mendelian randomization; BMI, body mass index; ILCCO, International Lung Cancer Consortium; IVs, instrument variables

GWAS data sources

GWAS for GORD

We identified genetic instruments for GORD from the largest GORD GWAS meta-analysis in Europe, which included a total of 367 441 (78 707 cases) European individuals from the UK Biobank study (354 285 individuals) and QSKIN study (13 156 individuals).19 UK Biobank is a population-based cohort study of more than 500 000 individuals and biological samples from across Great Britain. GORD cases from UK Biobank were defined as a mixture of self-reported GORD symptoms, ICD-9/10 diagnosis and GORD-related medication use.20 QSKIN study is conducted by the QIMR Berghofer Medical Research Institute and aims to investigate risk factors for skin cancers and other complex diseases, which includes 43 794 participants between 40 and 69 years old from Queensland, Australia. GORD in the QSKIN cohort was defined as individuals who self-reported heartburn and took one or more types of GORD-related medications.21 More detail on the meta-analysis method may be obtained from the supplemental method and the original research (Supplementary Table S1 and Supplementary Method, available as Supplementary data at IJE online).

GWAS for smoking traits

To adjust for potential confounders and investigate the mediation effect of GORD between smoking and lung cancer, we obtained genetic instruments for smoking traits including smoking initiation, smoking duration (age of smoking initiation) and smoking frequency (cigarettes smoked per day) from the Sequencing Consortium of Alcohol and Nicotine use (GSCAN) project. GSCAN provided the largest smoking-related GWAS meta-analysis and identified 566 genetic variants associated with four smoking phenotypes in 1 232 091 individuals of European ancestry.22 More information on the study design and GWASs in this meta-analysis study can be found elsewhere.22

GWAS for other traits

To validate the effectiveness of the GORD instruments, we obtained the GWAS meta-analysis data of Barrett's esophagus, which was derived from four large European ancestry cohorts.23 Besides, to avoid the interference of potential confounders and explore the direct causal effect of GORD on lung cancer, we obtained the genetic instruments for BMI and type 2 diabetes from the most recent and largest studies. GWAS summary statistics for BMI were obtained from the Genetic Investigation of Anthropometric Traits (GIANT) consortium, which included 152 893 participants.24 Summary statistics for type 2 diabetes were obtained from Diabetes Genetics Replication and Meta-analysis (DIAGRAM), which included 149 821 (34 840 cases) participants.25 The details of traits involved in this study are presented in Supplementary Table S1).

GWAS for outcomes

The GWAS data for lung cancer were obtained from two large databases, International Lung Cancer Consortium (ILCCO) and FinnGen. ILCCO is a multicentre project dedicated to the study of genetic variants associated with lung cancer.26 FinnGen database is a combined project that integrates digital health records from the Finnish Health Registry with genetic data from the Finnish biobank (https://www.finngen.fi/en). In this study, we obtained two GWAS summary statistics for total lung cancer from ILCCO (11 348 cases and 15 861 controls) and FinnGen (4030 cases and 238 678 controls) databases as the primary outcomes. We further used GWAS for lung adenocarcinoma (3442 cases and 14 894 controls) and lung squamous cell carcinoma (3275 cases and 15 038 controls) from ILCCO, and the GWAS for small cell lung carcinoma (461 cases and 238 678 controls) from FinnGen, as secondary outcomes to reveal the association between GORD and lung cancer pathological subtypes (Supplementary Table S2, available as Supplementary data at IJE online). All summary statistics were filtered at the minimum variant allele (MAF) frequency >0.01. UK Biobank database is excluded in the selection of outcome data due to the consideration of sample overlap between exposure and outcome.

Genetic instruments selection criteria

Genetic instruments (SNPs) for GORD, Barrett's esophagus, smoking traits, BMI and type 2 diabetes were all selected at genome-wide significant threshold (P <5E-8). We identified independent SNPs by performing linkage disequilibrium (LD) pruning for each trait based on the 1000 Genomes LD reference panel in European ancestry. The threshold of LD was set to r2 >0.01 and clump window <10 kb. In order to infer correct causal estimates from the MR analysis, the effect of an SNP on an outcome and exposure must be harmonized to be relative to the same allele. Thus, we used the ‘TwoSampleMR’ R package and performed variant harmonization to identify and exclude palindromic SNPs for which the correct orientation of the allele could not be determined. To avoid the confounding effect, SNPs significantly associated with outcome (P < 5E-8) were excluded. Also we excluded SNPs pleiotropically associated with second traits which might confound the causal estimate, through PhenoScanner v2 [PhenoScanner (cam.ac.uk), P <1E-5]27 (Supplementary Table S3, available as Supplementary data at IJE online). Besides, we calculated F statistics to evaluate the validity of instrument variables. A value less than 10 suggests a weak instrumental variable.

Statistical analyses

Main Mendelian randomization analyses

We used the random effect inverse variance weighted (IVW) method as the main method for all MR analyses. IVW model provided a weighted regression of the IV-specific causal estimation and could provide a stable causal inference even in the presence of horizontal pleiotropy.28 Additionally, we performed analyses in both ILCCO and FinnGen databases to ensure the consistency of the inference. Considering smoking traits (smoking initiation, smoking duration and smoking frequency), BMI and type 2 diabetes were potential risk factors for lung cancer, we adjusted these confounders by using multivariate MR analyses and explored the direct effect of GORD on lung cancer. Moreover, we further explored the causal relationship between GORD and lung cancer pathological subtypes to refine the risk of GORD for lung cancer.

Network Mendelian randomization analyses

To investigate the potential mediator role of GORD between smoking initiation and lung cancer, we conducted network MR analysis. Three estimates were performed for network MR analysis: (i) the total effect of smoking initiation on lung cancer; (ii) the direct effect α of smoking initiation on GORD; and (iii) the direct effect β of GORD on lung cancer. Mediation effect can be calculated by the follwing equation: Mediation effect=α*β. The calculation process is described in detail in Supplementary Method (available as Supplementary data at IJE online).29 The proportion of the mediation effect was estimated as the total causal effect of smoking on lung cancer divided by the mediation effect. To ensure the reliability of the results, we performed network MR analyses in both ILCCO and FinnGen databases and compared them for consistency.

Sensitivity analyses

Sensitivity analyses were performed by weighted median, MR-Egger regression and MR-Pleiotropy Residual Sum and Outlier (MR-PRESSO) methods to verify the basic assumptions for univariable MR analyses. Multivariable MR-Egger regression was used to detect the pleiotropy for multivariable MR analyses.30 The weighted median model calculates a weighted value of the median of the IV-specific causal estimation. When more than half of the analytical weights are derived from valid IVs, the weighted median model can provide a consistent causal estimation.31 MR-Egger regression allows free estimation of the horizontal pleiotropic effects, whereas it compromises statistical power.32 MR-PRESSO is developed to detect and correct horizontal pleiotropic outliers. The MR-PRESSO distortion test demonstrates the difference in estimation before and after outlier elimination.33 Cochran’s Q test is used to estimate heterogeneity among SNPs for exposure and checks the consistency between MR assumption and analyses. Based on the methods of MR estimate and sensitivity analysis mentioned above, we considered a credible causal inference must meet the following criteria. (i) The results of MR estimate and sensitivity analysis presented a consistent direction among the three methods. (ii) Intercept test of MR-Egger suggested no horizontal pleiotropic effect. We calculated F statistics to evaluate the strength of IVs for exposures in univariable MR analysis (Supplementary Method).34 Two‐sample conditional F statistic (FTS) was calculated by the ‘MVMR’ R package to evaluate the strength of IVs for exposures in multivariable MR analyses.35 F statistic >10 suggests a strong instrumental variable. Statistical power of each MR analysis was calculated by the mRnd power calculator for binary outcomes: mRnd: power calculations for Mendelian Randomization [cnsgenomics.com].36

All the above processes were carried out in R 4.1.0 [https://www.R-project.org/]. R package ‘TwoSampleMR’,37 ‘MRPRESSO’33 and ‘MendelianRandomization’38 were used to perform MR and sensitivity analyses.

Results

Genetic instruments

The process of IV selection for this study is presented in Supplementary Figure S1 (available as Supplementary data at IJE online). According to the criteria of genetic instrument selection, 77 and 76 independent SNPs were finally selected as IVs for GORD in the analyses of ILCCO and FinnGen databases, respectively, and 74 and 75 SNPs were selected as IVs for smoking initiation, respectively. The detail for IVs of each exposure is reported in Supplementary Tables S4 and S5 (available as Supplementary data at IJE online). The selected IVs overall explained 3.58% of the GORD variance and 2.15% of the smoking initiation variance. All F statistics for IVs used in this univariable MR analyses = >10, which suggested strong instrument variables (Supplementary Table S6, available as Supplementary data at IJE online). However in multivariable MR analysis, the FTS statistic of the GORD instrument variable was less than 10 after adjusting for smoking duration and BMI, indicating weak IVs (Supplementary Table S7, available as Supplementary data at IJE online).

In order to validate the effectiveness of the GORD IVs, we performed a test MR analysis on GORD and Barrett's esophagus. The results demonstrated that genetically predicted GORD was associated with an increased risk of Barrett's esophagus [IVW: odds ratio (OR) = 1.26, 95% confidence interval (CI) 1.05–1.52, P = 1.28E-02), and sensitivity analysis indicated consistent findings. This suggests that the instrumental variables for GORD used in this study are valid (Supplementary Table S8, available as Supplementary data at IJE online). Additionally, the statistical power of all MR analyses was 1.00 (>0.80; Supplementary Table S9, available as Supplementary data at IJE online).

Causal effects of GORD on lung cancer and pathological subtypes

By using the genetic IVs for GORD, univariate MR analysis provided credible evidence that GORD was associated with an increased risk of total lung cancer in the ILCCO database (IVW: OR = 1.37, 95% CI 1.16–1.62, P = 1.70E-04), and the estimate was consistent in the FinnGen database (IVW: odds ratio (OR) = 1.25, 95% confidence interval (CI) 1.03–1.52, P = 2.27E-02). In the subgroup analysis of lung cancer pathological subtypes, GORD increased the risk of lung adenocarcinoma (IVW: OR = 1.27, 95% CI 1.02–1.59, P = 3.48E-02) and lung squamous cell carcinomas (IVW: OR = 1.50, 95% CI 1.22–1.86, P = 1.52E-04). However, only a weak causal association was observed between GORD and small cell lung cancer (IVW: OR = 2.15, 95% CI 0.87–5.31, P = 0.10) (Figure 2).

Univariable MR analyses of genetically predicted GORD with risk of lung cancer and pathological subtypes. GORD, gastro-esophageal reflux disease; OR, odds ratio; CI, confidence interval; IVW, inverse variance weighted method; ILCCO, International Lung Cancer Consortium
Figure 2

Univariable MR analyses of genetically predicted GORD with risk of lung cancer and pathological subtypes. GORD, gastro-esophageal reflux disease; OR, odds ratio; CI, confidence interval; IVW, inverse variance weighted method; ILCCO, International Lung Cancer Consortium

Consistent directions of the causal estimates were observed in most sensitivity analyses, but the direction of ORs in MR-Egger was inconsistent in the analyses of adenocarcinoma and small cell lung cancer (0.84 and 0.03, respectively) (Figure 2). One outlier in the analysis of total lung cancer in the ILCCO database was detected by MR-PRESSO, but the result was still consistent after eliminating the outlier variant (OR = 1.33, 95% CI 1.31–1.35, P = 6.90E-04). Moderate heterogeneity was observed in the analysis of total lung cancer in the ILCCO database. Furthermore, no horizontal pleiotropy was detected in MR-Egger intercept tests for any of the MR analyses conducted (Table 1).

Table 1

Sensitivity analysis of GORD with risk of lung cancer in univariable TS-MR analyses

MethodTotal lung cancer
Adenocarcinoma
Squamous cell carcinomas
Small cell lung cancer
FinnGen consortium
ILCCO
OR95% CIPOR95% CIPOR95% CIPOR95% CIPOR95% CIP
IVW method1.251.03-1.522.27E-021.371.16–1.621.70E-041.271.02–1.593.48E-021.51.22–1.861.52E-042.150.87–5.310.10
Weighted median method1.090.85-1.390.511.321.06–1.620.011.280.94–1.720.111.250.92–1.690.151.130.54–2.370.75
MR-Egger regression1.160.37-3.620.801.940.73–5.140.190.840.23–3.090.792.210.63–7.740.220.590.03–13.570.74
MR-PRESSO methodana1.331.31–1.356.90E-04nanana
HeterogeneitybI2=21.3%; Cochrane’s Q=91; Phet=0.10I2=51.3%; Cochrane’s Q=115; Phet=3.12E-03I2=31.6%; Cochrane’s Q=90; Phet=0.13I2=3.9%; Cochrane’s Q=79; Phet=0.37I2=26.7%; Cochrane’s Q=95; Phet=0.06
PleiotropycIntercept=0.002; Pple=0.90Intercept=0.011; Pple=0.48Intercept=0.014; Pple=0.53Intercept=0.013; Pple=0.54Intercept=0.142; Pple=0.11
MethodTotal lung cancer
Adenocarcinoma
Squamous cell carcinomas
Small cell lung cancer
FinnGen consortium
ILCCO
OR95% CIPOR95% CIPOR95% CIPOR95% CIPOR95% CIP
IVW method1.251.03-1.522.27E-021.371.16–1.621.70E-041.271.02–1.593.48E-021.51.22–1.861.52E-042.150.87–5.310.10
Weighted median method1.090.85-1.390.511.321.06–1.620.011.280.94–1.720.111.250.92–1.690.151.130.54–2.370.75
MR-Egger regression1.160.37-3.620.801.940.73–5.140.190.840.23–3.090.792.210.63–7.740.220.590.03–13.570.74
MR-PRESSO methodana1.331.31–1.356.90E-04nanana
HeterogeneitybI2=21.3%; Cochrane’s Q=91; Phet=0.10I2=51.3%; Cochrane’s Q=115; Phet=3.12E-03I2=31.6%; Cochrane’s Q=90; Phet=0.13I2=3.9%; Cochrane’s Q=79; Phet=0.37I2=26.7%; Cochrane’s Q=95; Phet=0.06
PleiotropycIntercept=0.002; Pple=0.90Intercept=0.011; Pple=0.48Intercept=0.014; Pple=0.53Intercept=0.013; Pple=0.54Intercept=0.142; Pple=0.11

GORD, gastro-esophageal reflux disease; TS-MR, two-sample Mendelian randomization; ILCCO, International Lung Cancer Consortium; IVR, inverse variance weighted; MR-PRESSO, MR-Pleiotropy Residual Sum and Outlier; OR, odds ratio; CI, confidence interval; na, not available.

a

na means there is no outlier that needed to be corrected. One outlier was removed from the ILCCO analysis, PDistortion Test =0.64.

b

Significant heterogeneity was observed in the single nucleotide polymorphisms of ILCCO analysis.

c

MR-Egger was used to detect pleiotropy. No pleiotropy was observed across all five analyses.

Table 1

Sensitivity analysis of GORD with risk of lung cancer in univariable TS-MR analyses

MethodTotal lung cancer
Adenocarcinoma
Squamous cell carcinomas
Small cell lung cancer
FinnGen consortium
ILCCO
OR95% CIPOR95% CIPOR95% CIPOR95% CIPOR95% CIP
IVW method1.251.03-1.522.27E-021.371.16–1.621.70E-041.271.02–1.593.48E-021.51.22–1.861.52E-042.150.87–5.310.10
Weighted median method1.090.85-1.390.511.321.06–1.620.011.280.94–1.720.111.250.92–1.690.151.130.54–2.370.75
MR-Egger regression1.160.37-3.620.801.940.73–5.140.190.840.23–3.090.792.210.63–7.740.220.590.03–13.570.74
MR-PRESSO methodana1.331.31–1.356.90E-04nanana
HeterogeneitybI2=21.3%; Cochrane’s Q=91; Phet=0.10I2=51.3%; Cochrane’s Q=115; Phet=3.12E-03I2=31.6%; Cochrane’s Q=90; Phet=0.13I2=3.9%; Cochrane’s Q=79; Phet=0.37I2=26.7%; Cochrane’s Q=95; Phet=0.06
PleiotropycIntercept=0.002; Pple=0.90Intercept=0.011; Pple=0.48Intercept=0.014; Pple=0.53Intercept=0.013; Pple=0.54Intercept=0.142; Pple=0.11
MethodTotal lung cancer
Adenocarcinoma
Squamous cell carcinomas
Small cell lung cancer
FinnGen consortium
ILCCO
OR95% CIPOR95% CIPOR95% CIPOR95% CIPOR95% CIP
IVW method1.251.03-1.522.27E-021.371.16–1.621.70E-041.271.02–1.593.48E-021.51.22–1.861.52E-042.150.87–5.310.10
Weighted median method1.090.85-1.390.511.321.06–1.620.011.280.94–1.720.111.250.92–1.690.151.130.54–2.370.75
MR-Egger regression1.160.37-3.620.801.940.73–5.140.190.840.23–3.090.792.210.63–7.740.220.590.03–13.570.74
MR-PRESSO methodana1.331.31–1.356.90E-04nanana
HeterogeneitybI2=21.3%; Cochrane’s Q=91; Phet=0.10I2=51.3%; Cochrane’s Q=115; Phet=3.12E-03I2=31.6%; Cochrane’s Q=90; Phet=0.13I2=3.9%; Cochrane’s Q=79; Phet=0.37I2=26.7%; Cochrane’s Q=95; Phet=0.06
PleiotropycIntercept=0.002; Pple=0.90Intercept=0.011; Pple=0.48Intercept=0.014; Pple=0.53Intercept=0.013; Pple=0.54Intercept=0.142; Pple=0.11

GORD, gastro-esophageal reflux disease; TS-MR, two-sample Mendelian randomization; ILCCO, International Lung Cancer Consortium; IVR, inverse variance weighted; MR-PRESSO, MR-Pleiotropy Residual Sum and Outlier; OR, odds ratio; CI, confidence interval; na, not available.

a

na means there is no outlier that needed to be corrected. One outlier was removed from the ILCCO analysis, PDistortion Test =0.64.

b

Significant heterogeneity was observed in the single nucleotide polymorphisms of ILCCO analysis.

c

MR-Egger was used to detect pleiotropy. No pleiotropy was observed across all five analyses.

Multivariable MR analyses adjusting potential confounders

In most multivariable MR analyses, the causal effect of GORD on lung cancer and pathological subtypes remained consistent after adjusting the five potential confounders (the three smoking traits, BMI and type 2 diabetes). However, the causal effect of GORD on lung adenocarcinoma was attenuated after adjusting smoking initiation (IVW: OR = 1.17, 95% CI 0.98–1.36, P = 0.11) and smoking frequency (IVW: OR = 1.02, 95% CI 0.81–1.28, P = 0.88) (Figure 3).

The direct causal effect of GORD on lung cancer and pathological subtypes by adjusting smoking traits, BMI and type 2 diabetes. The reported values were calculated by the random effects IVW method. GORD, gastro-esophageal reflux disease; OR, odds ratio; CI, confidence interval; IVW, inverse variance weighted method
Figure 3

The direct causal effect of GORD on lung cancer and pathological subtypes by adjusting smoking traits, BMI and type 2 diabetes. The reported values were calculated by the random effects IVW method. GORD, gastro-esophageal reflux disease; OR, odds ratio; CI, confidence interval; IVW, inverse variance weighted method

In sensitivity analyses, no horizontal pleiotropy was detected by MR-Egger regression of the multivariable MR analyses, and mild to moderate heterogeneity was observed in the analysis of total lung cancer and lung adenocarcinoma (Table 2).

Table 2

Sensitivity analysis of adjusted GORD with risk of lung cancer in multivariable MR analyses

ExposureMethodTotal lung cancerAdenocarcinomaSquamous cell carcinomasSmall cell lung cancer
GORD adjusted by smoking initiationHeterogeneityaI2=21.6%; Cochrane’s Q=194; Phet=0.01I2=18.7%; Cochrane’s Q=187; Phet=2.91E-02I2=8.4%; Cochrane’s Q=166; Phet=0.20I2=13.1%; Cochrane’s Q=175; Phet =0.10
MR-Egger pleiotropybIntercept=–0.008; Pple=0.09Intercept=–0.003; Pple=0.70Intercept=–0.003; Pple=0.64Intercept=0.016; Pple=0.52
GORD adjusted by age of smoking initiationHeterogeneitycI2=41.7%; Cochrane’s Q=151; Phet=3.10E-05I2=20.7%; Cochrane’s Q=111; Phet=4.82E-02I2=8.3%; Cochrane’s Q=96; Phet=0.25I2=18.7%; Cochrane’s Q=107; Phet =0.07
MR-Egger pleiotropyIntercept=–0.004; Pple=0.78Intercept=0.015; Pple=0.42Intercept=–0.004; Pple=0.81Intercept=0.082; Pple=0.27
GORD adjusted by cigarettes smoked per dayHeterogeneityI2=41.7%; Cochrane’s Q=163; Phet=1.60E-05I2=18.8%; Cochrane’s Q=117; Phet=0.06I2=16.7%; Cochrane’s Q=114; Phet=0.09I2=13.0%; Cochrane’s Q=108; Phet =0.16
MR-Egger pleiotropyIntercept=–0.018; Pple=0.11Intercept=–0.007; Pple=0.64Intercept=–0.023; Pple=0.09Intercept=0.059; Pple=0.29
GORD adjusted by BMIHeterogeneityI2=15.4%; Cochrane’s Q=1099; Phet=9.80E-05I2=8.2%; Cochrane’s Q=1013; Phet=3.22E-02I2=9.5%; Cochrane’s Q=1028; Phet=1.34E-02I2=1.4%; Cochrane’s Q=913; Phet =0.62
MR-Egger pleiotropyIntercept=–0.002; Pple=0.14Intercept=0.001; Pple=0.79Intercept=–0.004; Pple=0.08Intercept=0.002; Pple=0.83
GORD adjusted by type 2 diabetesHeterogeneityI2=37.8%; Cochrane’s Q=148; Phet=1.76E-04I2=20.7%; Cochrane’s Q=116; Phet=4.75E-02I2=8.0%; Cochrane’s Q=100; Phet=0.26I2=14.0%; Cochrane’s Q=107; Phet =0.14
MR-Egger pleiotropyIntercept=–0.005; Pple=0.57Intercept=–0.011; Pple=0.34Intercept=–0.008; Pple=0.46Intercept=0.027; Pple=0.54
ExposureMethodTotal lung cancerAdenocarcinomaSquamous cell carcinomasSmall cell lung cancer
GORD adjusted by smoking initiationHeterogeneityaI2=21.6%; Cochrane’s Q=194; Phet=0.01I2=18.7%; Cochrane’s Q=187; Phet=2.91E-02I2=8.4%; Cochrane’s Q=166; Phet=0.20I2=13.1%; Cochrane’s Q=175; Phet =0.10
MR-Egger pleiotropybIntercept=–0.008; Pple=0.09Intercept=–0.003; Pple=0.70Intercept=–0.003; Pple=0.64Intercept=0.016; Pple=0.52
GORD adjusted by age of smoking initiationHeterogeneitycI2=41.7%; Cochrane’s Q=151; Phet=3.10E-05I2=20.7%; Cochrane’s Q=111; Phet=4.82E-02I2=8.3%; Cochrane’s Q=96; Phet=0.25I2=18.7%; Cochrane’s Q=107; Phet =0.07
MR-Egger pleiotropyIntercept=–0.004; Pple=0.78Intercept=0.015; Pple=0.42Intercept=–0.004; Pple=0.81Intercept=0.082; Pple=0.27
GORD adjusted by cigarettes smoked per dayHeterogeneityI2=41.7%; Cochrane’s Q=163; Phet=1.60E-05I2=18.8%; Cochrane’s Q=117; Phet=0.06I2=16.7%; Cochrane’s Q=114; Phet=0.09I2=13.0%; Cochrane’s Q=108; Phet =0.16
MR-Egger pleiotropyIntercept=–0.018; Pple=0.11Intercept=–0.007; Pple=0.64Intercept=–0.023; Pple=0.09Intercept=0.059; Pple=0.29
GORD adjusted by BMIHeterogeneityI2=15.4%; Cochrane’s Q=1099; Phet=9.80E-05I2=8.2%; Cochrane’s Q=1013; Phet=3.22E-02I2=9.5%; Cochrane’s Q=1028; Phet=1.34E-02I2=1.4%; Cochrane’s Q=913; Phet =0.62
MR-Egger pleiotropyIntercept=–0.002; Pple=0.14Intercept=0.001; Pple=0.79Intercept=–0.004; Pple=0.08Intercept=0.002; Pple=0.83
GORD adjusted by type 2 diabetesHeterogeneityI2=37.8%; Cochrane’s Q=148; Phet=1.76E-04I2=20.7%; Cochrane’s Q=116; Phet=4.75E-02I2=8.0%; Cochrane’s Q=100; Phet=0.26I2=14.0%; Cochrane’s Q=107; Phet =0.14
MR-Egger pleiotropyIntercept=–0.005; Pple=0.57Intercept=–0.011; Pple=0.34Intercept=–0.008; Pple=0.46Intercept=0.027; Pple=0.54

GORD, gastro-esophageal reflux disease; MR, Mendelian randomization; BMI, body mass index.

a

Multivariable MR-Egger was used to detect pleiotropy.

b

No pleiotropy was detected in any of the analyses.

c

Mild to moderate heterogeneity was observed in the analysis of total lung cancer and lung adenocarcinoma.

Table 2

Sensitivity analysis of adjusted GORD with risk of lung cancer in multivariable MR analyses

ExposureMethodTotal lung cancerAdenocarcinomaSquamous cell carcinomasSmall cell lung cancer
GORD adjusted by smoking initiationHeterogeneityaI2=21.6%; Cochrane’s Q=194; Phet=0.01I2=18.7%; Cochrane’s Q=187; Phet=2.91E-02I2=8.4%; Cochrane’s Q=166; Phet=0.20I2=13.1%; Cochrane’s Q=175; Phet =0.10
MR-Egger pleiotropybIntercept=–0.008; Pple=0.09Intercept=–0.003; Pple=0.70Intercept=–0.003; Pple=0.64Intercept=0.016; Pple=0.52
GORD adjusted by age of smoking initiationHeterogeneitycI2=41.7%; Cochrane’s Q=151; Phet=3.10E-05I2=20.7%; Cochrane’s Q=111; Phet=4.82E-02I2=8.3%; Cochrane’s Q=96; Phet=0.25I2=18.7%; Cochrane’s Q=107; Phet =0.07
MR-Egger pleiotropyIntercept=–0.004; Pple=0.78Intercept=0.015; Pple=0.42Intercept=–0.004; Pple=0.81Intercept=0.082; Pple=0.27
GORD adjusted by cigarettes smoked per dayHeterogeneityI2=41.7%; Cochrane’s Q=163; Phet=1.60E-05I2=18.8%; Cochrane’s Q=117; Phet=0.06I2=16.7%; Cochrane’s Q=114; Phet=0.09I2=13.0%; Cochrane’s Q=108; Phet =0.16
MR-Egger pleiotropyIntercept=–0.018; Pple=0.11Intercept=–0.007; Pple=0.64Intercept=–0.023; Pple=0.09Intercept=0.059; Pple=0.29
GORD adjusted by BMIHeterogeneityI2=15.4%; Cochrane’s Q=1099; Phet=9.80E-05I2=8.2%; Cochrane’s Q=1013; Phet=3.22E-02I2=9.5%; Cochrane’s Q=1028; Phet=1.34E-02I2=1.4%; Cochrane’s Q=913; Phet =0.62
MR-Egger pleiotropyIntercept=–0.002; Pple=0.14Intercept=0.001; Pple=0.79Intercept=–0.004; Pple=0.08Intercept=0.002; Pple=0.83
GORD adjusted by type 2 diabetesHeterogeneityI2=37.8%; Cochrane’s Q=148; Phet=1.76E-04I2=20.7%; Cochrane’s Q=116; Phet=4.75E-02I2=8.0%; Cochrane’s Q=100; Phet=0.26I2=14.0%; Cochrane’s Q=107; Phet =0.14
MR-Egger pleiotropyIntercept=–0.005; Pple=0.57Intercept=–0.011; Pple=0.34Intercept=–0.008; Pple=0.46Intercept=0.027; Pple=0.54
ExposureMethodTotal lung cancerAdenocarcinomaSquamous cell carcinomasSmall cell lung cancer
GORD adjusted by smoking initiationHeterogeneityaI2=21.6%; Cochrane’s Q=194; Phet=0.01I2=18.7%; Cochrane’s Q=187; Phet=2.91E-02I2=8.4%; Cochrane’s Q=166; Phet=0.20I2=13.1%; Cochrane’s Q=175; Phet =0.10
MR-Egger pleiotropybIntercept=–0.008; Pple=0.09Intercept=–0.003; Pple=0.70Intercept=–0.003; Pple=0.64Intercept=0.016; Pple=0.52
GORD adjusted by age of smoking initiationHeterogeneitycI2=41.7%; Cochrane’s Q=151; Phet=3.10E-05I2=20.7%; Cochrane’s Q=111; Phet=4.82E-02I2=8.3%; Cochrane’s Q=96; Phet=0.25I2=18.7%; Cochrane’s Q=107; Phet =0.07
MR-Egger pleiotropyIntercept=–0.004; Pple=0.78Intercept=0.015; Pple=0.42Intercept=–0.004; Pple=0.81Intercept=0.082; Pple=0.27
GORD adjusted by cigarettes smoked per dayHeterogeneityI2=41.7%; Cochrane’s Q=163; Phet=1.60E-05I2=18.8%; Cochrane’s Q=117; Phet=0.06I2=16.7%; Cochrane’s Q=114; Phet=0.09I2=13.0%; Cochrane’s Q=108; Phet =0.16
MR-Egger pleiotropyIntercept=–0.018; Pple=0.11Intercept=–0.007; Pple=0.64Intercept=–0.023; Pple=0.09Intercept=0.059; Pple=0.29
GORD adjusted by BMIHeterogeneityI2=15.4%; Cochrane’s Q=1099; Phet=9.80E-05I2=8.2%; Cochrane’s Q=1013; Phet=3.22E-02I2=9.5%; Cochrane’s Q=1028; Phet=1.34E-02I2=1.4%; Cochrane’s Q=913; Phet =0.62
MR-Egger pleiotropyIntercept=–0.002; Pple=0.14Intercept=0.001; Pple=0.79Intercept=–0.004; Pple=0.08Intercept=0.002; Pple=0.83
GORD adjusted by type 2 diabetesHeterogeneityI2=37.8%; Cochrane’s Q=148; Phet=1.76E-04I2=20.7%; Cochrane’s Q=116; Phet=4.75E-02I2=8.0%; Cochrane’s Q=100; Phet=0.26I2=14.0%; Cochrane’s Q=107; Phet =0.14
MR-Egger pleiotropyIntercept=–0.005; Pple=0.57Intercept=–0.011; Pple=0.34Intercept=–0.008; Pple=0.46Intercept=0.027; Pple=0.54

GORD, gastro-esophageal reflux disease; MR, Mendelian randomization; BMI, body mass index.

a

Multivariable MR-Egger was used to detect pleiotropy.

b

No pleiotropy was detected in any of the analyses.

c

Mild to moderate heterogeneity was observed in the analysis of total lung cancer and lung adenocarcinoma.

GORD mediates the causal effect of smoking initiation on lung cancer

We performed network MR analyses to investigate the potential mediating role of GORD in the association between smoking initiation and lung cancer, in both ILCCO and FinnGen databases. In the ILCCO analysis, we observed that smoking initiation (IVW: OR = 1.44, 95% CI 1.18–1.76, P = 2.66E-04) was associated with an increased total risk of lung cancer. For direct effects, smoking initiation (IVW: OR = 1.45, 95% CI 1.33–1.59, P = 2.74E-16) was associated with increased risk of GORD (direct effect α). In addition, GORD has exhibited a causal relationship with increased risk of lung cancer (IVW: OR = 1.37, 95% CI 1.16–1.62, P = 1.70E-04) (direct effect β). The proportion of the causal effect of smoking initiation on lung cancer mediated by GORD was 32.43% (95% CI 14.18%–49.82%). We repeated the network analysis in the FinnGen database and obtained consistent causal estimates of the mediating proportion of GORD in smoking initiation to lung cancer risk (25.00%, 95% CI 3.13%–50.00%). Moreover, no horizontal pleiotropy was detected in all network MR analyses in both ILCCO and FinnGen datasets (Table 3; Supplementary Table S10, available as Supplementary data at IJE online).

Table 3

Mediation effect of GORD in the association between smoking initiation and total lung cancer

ExposureMediatorOutcome data sourceTotal effecta
Direct effect αb
Direct effect βc
Mediation effectd
BetaeSEPBetaSEPBetaSEPEffect size (95% CI)fProportion % (95% CI)
Smoking initiationGORDILCCO0.370.102.66E-040.370.052.74E-160.320.081.70E-040.12 (0.05–0.18)32.43 (14.18–49.82)
FinnGen0.320.131.36E-020.220.102.27E-020.08 (0.01–0.16)25.00 (3.13–50.00)
ExposureMediatorOutcome data sourceTotal effecta
Direct effect αb
Direct effect βc
Mediation effectd
BetaeSEPBetaSEPBetaSEPEffect size (95% CI)fProportion % (95% CI)
Smoking initiationGORDILCCO0.370.102.66E-040.370.052.74E-160.320.081.70E-040.12 (0.05–0.18)32.43 (14.18–49.82)
FinnGen0.320.131.36E-020.220.102.27E-020.08 (0.01–0.16)25.00 (3.13–50.00)

GORD, gastro-esophageal reflux disease; ILCCO, International Lung Cancer Consortium; SE, standard error; TS-MR, two-sample Mendelian randomization; CI, confidence interval.

a

The causal effect of smoking on lung cancer in TS-MR analysis.

b

The causal effect of smoking on GORD in TS-MR analysis.

c

The causal effect of GORD on lung cancer in TS-MR analysis.

d

The effect of smoking on lung cancer mediated through GORD.

e

Beta of random effect inverse variance weighted method was used for mediation analysis.

f

The calculation formula is detailed in Supplementary Method (available as Supplementary data at IJE online).

Table 3

Mediation effect of GORD in the association between smoking initiation and total lung cancer

ExposureMediatorOutcome data sourceTotal effecta
Direct effect αb
Direct effect βc
Mediation effectd
BetaeSEPBetaSEPBetaSEPEffect size (95% CI)fProportion % (95% CI)
Smoking initiationGORDILCCO0.370.102.66E-040.370.052.74E-160.320.081.70E-040.12 (0.05–0.18)32.43 (14.18–49.82)
FinnGen0.320.131.36E-020.220.102.27E-020.08 (0.01–0.16)25.00 (3.13–50.00)
ExposureMediatorOutcome data sourceTotal effecta
Direct effect αb
Direct effect βc
Mediation effectd
BetaeSEPBetaSEPBetaSEPEffect size (95% CI)fProportion % (95% CI)
Smoking initiationGORDILCCO0.370.102.66E-040.370.052.74E-160.320.081.70E-040.12 (0.05–0.18)32.43 (14.18–49.82)
FinnGen0.320.131.36E-020.220.102.27E-020.08 (0.01–0.16)25.00 (3.13–50.00)

GORD, gastro-esophageal reflux disease; ILCCO, International Lung Cancer Consortium; SE, standard error; TS-MR, two-sample Mendelian randomization; CI, confidence interval.

a

The causal effect of smoking on lung cancer in TS-MR analysis.

b

The causal effect of smoking on GORD in TS-MR analysis.

c

The causal effect of GORD on lung cancer in TS-MR analysis.

d

The effect of smoking on lung cancer mediated through GORD.

e

Beta of random effect inverse variance weighted method was used for mediation analysis.

f

The calculation formula is detailed in Supplementary Method (available as Supplementary data at IJE online).

Discussion

Our Mendelian randomization study revealed that genetically predicted GORD was associated with an increased risk of total lung cancer, lung adenocarcinoma and lung squamous cell carcinomas, and this causal association was independent of smoking duration, BMI and type 2 diabetes. Besides, we provided causal evidence that GORD mediated a considerable proportion of smoking initiation effect on lung cancer risk. However, the study was analysed at the genetic level, and thus these results should be interpreted with caution.

The risk of GORD for lung cancer may be mediated by ongoing pulmonary aspiration and the resulting chronic inflammation.8 Previous study has proved that chronic inflammation induced by refluxate (i.e. stomach acid and bile salts) increases the risk of esophagus cancer and laryngeal/pharyngeal cancer,39 and here we revealed the genetically predicted association between GORD and lung cancer. Long-term exposure to stomach acid and bile salts, which aspirate into the airway, cause chronic inflammation and increase the release of reactive oxygen species (ROS).6 Accumulated ROS could cause ongoing oxidative DNA damage or mutation, whereas the mutant cells proliferating in an inflammatory environment potentiates tumorigenesis.40

Epidemiological evidence for the association between GORD and lung cancer is conflicting. Yanes et al.8 reported that anti-reflux surgery significantly reduced the risk of lung small cell carcinoma and squamous cell carcinoma, but not of adenocarcinoma. Another single preliminary report demonstrated the significant association between GORD and lung cancer irrespective of histology.41 Moreover, the credibility of epidemiological evidence is still under challenge due to the interference of smoking and other uncontrollable environmental variables.

Recently, Li et al. 42 reported that GORD was associated with an increased risk of lung cancer, which is consistent with some of our results. Compared with prior research, our research elucidated the causal impact of GORD on overall lung cancer, adenocarcinoma and squamous cell carcinoma, based on the ILCCO database, which was further validated using the FinnGen database. We also conducted multivariable MR analysis to adjust for potential confounders, and investigated the direct effect of this causal relationship. Moreover, we further explored the mediating effect of GORD in the causal pathway between smoking initiation and lung cancer. In our results, there is no causal association between GORD and small cell lung carcinoma. As a high-grade neuroendocrine carcinoma, the pathogenesis of small cell lung cancer is different from that of non-small cell lung cancer, which might be one of the reasons for the negative result.

Previous studies have found that smoking is a common risk factor for both GORD and lung cancer.15,43 Here we revealed a direct causal association between GORD and total lung cancer which is independent of smoking traits, BMI and type 2 diabetes. However, adjusting for smoking (especially smoking frequency) greatly attenuated the causal association between GORD and lung cancer risk. This suggests that when considering smoking status, the independent impact of GORD on lung risk is relatively small. In order to investigate the relationship between smoking, GORD and lung cancer risk, we further conducted and reported the proportion of the causal effect of smoking initiation on lung cancer mediated by GORD. This finding highlights that smoking cessation intervention is a vital measure for lung cancer prevention. Moreover, GORD was elucidated as part of the smoking initiation-induced lung cancer mechanism.

Our findings provided new perspectives on the early screening and prevention of lung cancer. Patients with severe GORD may need to consider the potential risk of developing lung cancer and take timely measures to intervene. Medical therapy for GORD might be a potential strategy for lung cancer prevention, although a study demonstrated that medical therapy could not reverse pulmonary aspiration.44 Anti-reflux surgery might be considered as the potential option to reduce the risk of lung cancer in patients with severe GORD.8 Furthermore, the results of the mediation analysis highlight the importance of smoking cessation in the prevention of both GORD and lung cancer.

This study has several highlights. The major merit of MR design is to minimize the biases caused by confounding and reverse causality. Second, we performed the causal estimate in two large databases to ensure consistency and thus obtained a credible causal inference. The population of our study was limited to European ancestry, which minimized the bias due to population stratification. The MR-Egger and MR-PRESSO analyses detected no evidence of pleiotropic effect, indicating that the observed causal estimates were not induced by confounding factors. Moreover, the appropriate sample size of exposures provided sufficient statistical power (>0.80) for our study.

There are also limitations in this work. We used outcomes from FinnGen and ILCCO databases to control the bias caused by sample overlap, but there is still potential overlap due to the large sample size and limiting the area to Europe.45 Some cases of GORD were not fully identified based on physician diagnosis, and case identification through self-report and medication use may lead to potential exposure misclassification and bias in the results. In the sensitivity analysis, the inconsistent direction of ORs was observed in the analyses of adenocarcinoma and small cell lung cancer. The possible reason is that using the MR-Egger method to estimate causality could be biased by inflating the type 1 error rates.46 Similar situations have been reported elsewhere.47 The exaggeration of type 1 error rates may lead to the bias of ORs. Besides, using the product method to calculate the mediation effect for binary outcome is challenging, due to the non-collapsibility of the odds ratio.48,49 A recent work simulates mediation analysis of rare binary outcomes on the log odds ratio scale, using network MR analysis. The results show the bias is small and typically would not affect the conclusions made.17 The direct causal effect of GORD adjusted by smoking duration and BMI should be interpreted with caution because of the weak instrument bias (FTS<10). The different definitions of lung cancer might cause heterogeneity in the analysis of the ILCCO database. Nonetheless, the use of the IVW random effects method and the absence of horizontal pleiotropy suggest that our results are unlikely to be disturbed by heterogeneity. Finally, due to this study restricting the population to Europe, generalization of our results to other populations should be done prudently.

Conclusion

The present study provides credible evidence that genetically predicted GORD was associated with an increased risk of total lung cancer, lung adenocarcinoma and lung squamous cell carcinoma, based on both ILCCO and FinnGen databases, which indicated GORD might be a novel screening signal for early-stage lung cancer. The mediation effect of GORD in the association between smoking initiation and lung cancer revealed that GORD might be a part of the mechanism for smoking-induced lung cancer. However, future randomized clinical trials will be still needed to confirm this genetic inference.

Ethics approval

This study used summary-level statistics from published studies and publicly available GWASs. No ethical approval was required for this study.

Data availability

GWAS summary statistic data for GORD was obtained from the previous study.19 GWAS data for total lung cancer and pathological subtypes may be obtained from the ILCCO at ILCCO: Research projects [ILCCO: Research projects (who.int)] and FinnGen consortia at [https://www.finngen.fi/en]. GWAS data for smoking traits,22 Barrett's esophagus,23 BMI24 and type 2 diabetes25 were all obtained from publicly available sources.

Supplementary data

Supplementary data are available at IJE online.

Author contributions

All authors contributed to the design of this study. Material preparation and data collection were performed by Y.L., R.Z. and L.X. Yi.L. and H.L. performed the analyses and wrote the manuscript. L.L. supervised the project.

Funding

This study was supported by the 1.3.5 Project for Disciplines of Excellence, West China Hospital, Sichuan University [ZYGD18021] (to L.L.), Key Projects of Sichuan Province (No. 2022YFS0208 for L.X.).

Acknowledgements

We thank the ILCCO at ILCCO: Home [iarc.fr], FinnGen [https://www.finngen.fi/en], GSCAN,22 GIANT24 and DIAGRAM25 consortia, James et al.,26 Ong et al. 19 and Gharahkhani et al. 23 for making the summary statistics involved in this study publicly available. Figure 1 was drawn partly with modified Servier Medical Art templates [http://smart.servier.com/], licensed under a Creative Common Attribution 3.0 Generic License [https://creativecommons.org/licenses/by/3.0/].

Conflict of interest

The authors declare no competing interests.

References

1

Siegel
RL
,
Miller
KD
,
Jemal
A.
Cancer statistics, 2019
.
CA Cancer J Clin
2019
;
69
:
7
34
.

2

Kachuri
L
,
Saarela
O
,
Bojesen
SE
et al.
Mendelian Randomization and mediation analysis of leukocyte telomere length and risk of lung and head and neck cancers
.
Int J Epidemiol
2019
;
48
:
751
66
.

3

Sekula
P
,
Del Greco
MF
,
Pattaro
C
,
Köttgen
A.
Mendelian randomization as an approach to assess causality using observational data
.
J Am Soc Nephrol
2016
;
27
:
3253
65
.

4

Xu
M
,
Zheng
J
,
Hou
T
et al.
SGLT2 inhibition, choline metabolites, and cardiometabolic diseases: a mediation Mendelian randomization study
.
Diabetes Care
2022
;
45
:
2718
28
.

5

Katzka
DA
,
Kahrilas
PJ.
Advances in the diagnosis and management of gastresophageal reflux disease
.
BMJ
2020
;
371
:
m3786
.

6

Lee
AS
,
Ryu
JH.
Aspiration pneumonia and related syndromes
.
Mayo Clin Proc
2018
;
93
:
752
62
.

7

Herbella
FA
,
Neto
SP
,
Santoro
IL
,
Figueiredo
LC.
Gastresophageal reflux disease and non-esophageal cancer
.
World J Gastroenterol
2015
;
21
:
815
19
.

8

Yanes
M
,
Santoni
G
,
Maret-Ouda
J
et al.
Antireflux surgery and risk of lung cancer by histological type in a multinational cohort study
.
Eur J Cancer
2020
;
138
:
80
88
.

9

Hsu
CK
,
Lai
CC
,
Wang
K
,
Chen
L.
Risk of lung cancer in patients with gastro-esophageal reflux disease: a population-based cohort study
.
PeerJ
2016
;
4
:
e2753
.

10

Bowden
J
,
Holmes
MV.
Meta-analysis and Mendelian randomization: a review
.
Res Synth Methods
2019
;
10
:
486
96
.

11

Larsson
SC
,
Carter
P
,
Kar
S
et al.
Smoking, alcohol consumption, and cancer: a Mendelian randomisation study in UK Biobank and international genetic consortia participants
.
PLoS Med
2020
;
17
:
e1003178
.

12

Pearson-Stuttard
J
,
Papadimitriou
N
,
Markozannes
G
et al.
Type 2 diabetes and cancer: an umbrella review of observational and Mendelian randomization studies
.
Cancer Epidemiol Biomarkers Prev
2021
;
30
:
1218
28
.

13

Yuan
S
,
Kar
S
,
Carter
P
et al.
Is type 2 diabetes causally associated with cancer risk? evidence from a two-sample Mendelian randomization study
.
Diabetes
2020
;
69
:
1588
96
.

14

Zhou
W
,
Liu
G
,
Hung
RJ
et al.
Causal relationships between body mass index, smoking and lung cancer: Univariable and multivariable Mendelian randomization
.
Int J Cancer
2021
;
148
:
1077
86
.

15

Jeon
J
,
Holford
TR
,
Levy
DT
et al.
Smoking and lung cancer mortality in the United States from 2015 to 2065: a comparative modeling approach
.
Ann Intern Med
2018
;
169
:
684
93
.

16

Maret-Ouda
J
,
Markar
SR
,
Lagergren
J.
Gastresophageal reflux disease: a review
.
JAMA
2020
;
324
:
2536
47
.

17

Carter
AR
,
Sanderson
E
,
Hammerton
G
et al.
Mendelian randomisation for mediation analysis: current methods and challenges for implementation
.
Eur J Epidemiol
2021
;
36
:
465
78
.

18

Skrivankova
VW
,
Richmond
RC
,
Woolf
BAR
et al.
Strengthening the reporting of observational studies in epidemiology using Mendelian randomization: the STROBE-MR statement
.
JAMA
2021
;
326
:
1614
21
.

19

Ong
JS
,
An
J
,
Han
X
et al. ;
Esophageal Cancer Consortium
.
Multitrait genetic association analysis identifies 50 new risk loci for gastro-esophageal reflux, seven new loci for Barrett's esophagus and provides insights into clinical heterogeneity in reflux diagnosis
.
Gut
2022
;
71
:
1053
61
.

20

Bycroft
C
,
Freeman
C
,
Petkova
D
et al.
The UK Biobank resource with deep phenotyping and genomic data
.
Nature
2018
;
562
:
203
09
.

21

Olsen
CM
,
Green
AC
,
Neale
RE
et al. ;
QSkin Study
.
Cohort Profile: The QSkin Sun and Health Study
.
Int J Epidemiol
2012
;
41
:
929
29i
.

22

Liu
M
,
Jiang
Y
,
Wedow
R
et al. ;
HUNT All-In Psychiatry
.
Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use
.
Nat Genet
2019
;
51
:
237
44
.

23

Gharahkhani
P
,
Fitzgerald
RC
,
Vaughan
TL
et al. ;
Wellcome Trust Case Control Consortium 2 (WTCCC2)
.
Genome-wide association studies in esophageal adenocarcinoma and Barrett's esophagus: a large-scale meta-analysis
.
Lancet Oncol
2016
;
17
:
1363
73
.

24

Yengo
L
,
Sidorenko
J
,
Kemper
KE
et al. ;
GIANT Consortium
.
Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry
.
Hum Mol Genet
2018
;
27
:
3641
49
.

25

Morris
AP
,
Voight
BF
,
Teslovich
TM
et al. ;
DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium
.
Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes
.
Nat Genet
2012
;
44
:
981
90
.

26

McKay
JD
,
Hung
RJ
,
Han
Y
et al. ;
SpiroMeta Consortium
.
Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes
.
Nat Genet
2017
;
49
:
1126
32
.

27

Kamat
MA
,
Blackshaw
JA
,
Young
R
et al.
PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations
.
Bioinformatics
2019
;
35
:
4851
53
.

28

Burgess
S
,
Butterworth
A
,
Thompson
SG.
Mendelian randomization analysis with multiple genetic variants using summarized data
.
Genet Epidemiol
2013
;
37
:
658
65
.

29

Sun
X
,
Chen
L
,
Zheng
L.
A Mendelian randomization study to assess the genetic liability of gastroesophageal reflux disease for cardiovascular diseases and risk factors
.
Hum Mol Genet
2022
;
31
:
4275
85
.

30

Rees
JMB
,
Wood
AM
,
Burgess
S.
Extending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy
.
Stat Med
2017
;
36
:
4705
18
.

31

Bowden
J
,
Davey Smith
G
,
Haycock
PC
,
Burgess
S.
Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator
.
Genet Epidemiol
2016
;
40
:
304
14
.

32

Bowden
J
,
Davey Smith
G
,
Burgess
S.
Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression
.
Int J Epidemiol
2015
;
44
:
512
25
.

33

Verbanck
M
,
Chen
CY
,
Neale
B
,
Do
R.
Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases
.
Nat Genet
2018
;
50
:
693
98
.

34

Palmer
TM
,
Lawlor
DA
,
Harbord
RM
et al.
Using multiple genetic variants as instrumental variables for modifiable risk factors
.
Stat Methods Med Res
2012
;
21
:
223
42
.

35

Sanderson
E
,
Spiller
W
,
Bowden
J.
Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization
.
Stat Med
2021
;
40
:
5434
52
.

36

Brion
MJ
,
Shakhbazov
K
,
Visscher
PM.
Calculating statistical power in Mendelian randomization studies
.
Int J Epidemiol
2013
;
42
:
1497
501
.

37

Hemani
G
,
Zheng
J
,
Elsworth
B
et al.
The MR-Base platform supports systematic causal inference across the human phenome
.
Elife
2018
;
7
:
e34408
.

38

Yavorska
OO
,
Burgess
S.
MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data
.
Int J Epidemiol
2017
;
46
:
1734
39
.

39

Parsel
SM
,
Wu
EL
,
Riley
CA
,
McCoul
ED.
Gastresophageal and laryngopharyngeal reflux associated with laryngeal malignancy: a systematic review and meta-analysis
.
Clin Gastroenterol Hepatol
2019
;
17
:
1253
64.e5
.

40

Wei
J
,
Gronert
K.
Eicosanoid and specialized proresolving mediator regulation of lymphoid cells
.
Trends Biochem Sci
2019
;
44
:
214
25
.

41

Vereczkei
A
,
Horvath
OP
,
Varga
G
,
Molnar
TF.
Gastresophageal reflux disease and non-small cell lung cancer. Results of a pilot study
.
Dis Esophagus
2008
;
21
:
457
60
.

42

Li
L
,
Ren
Q
,
Zheng
Q
et al.
Causal associations between gastresophageal reflux disease and lung cancer risk: A Mendelian randomization study
.
Cancer Med
2023
;
12
:
7552
59
.

43

Yuan
S
,
Larsson
SC.
Adiposity, diabetes, lifestyle factors and risk of gastresophageal reflux disease: a Mendelian randomization study
.
Eur J Epidemiol
2022
;
37
:
747
54
.

44

Burton
L
,
Falk
GL
,
Parsons
S
,
Cusi
M
,
Van Der Wall
H.
Benchmarking of a simple scintigraphic test for gastro-esophageal reflux disease that assesses esophageal disease and its pulmonary complications
.
Mol Imaging Radionucl Ther
2018
;
27
:
113
20
.

45

Burgess
S
,
Davies
NM
,
Thompson
SG.
Bias due to participant overlap in two-sample Mendelian randomization
.
Genet Epidemiol
2016
;
40
:
597
608
.

46

Burgess
S
,
Thompson
SG.
Interpreting findings from Mendelian randomization using the MR-Egger method
.
Eur J Epidemiol
2017
;
32
:
377
89
.

47

Fadista
J
,
Kraven
LM
,
Karjalainen
J
et al. ;
COVID-19 Host Genetics Initiative
.
Shared genetic etiology between idiopathic pulmonary fibrosis and COVID-19 severity
.
EBioMedicine
2021
;
65
:
103277
.

48

Pang
M
,
Kaufman
JS
,
Platt
RW.
Studying noncollapsibility of the odds ratio with marginal structural and logistic regression models
.
Stat Methods Med Res
2016
;
25
:
1925
37
.

49

Vanderweele
TJ
,
Vansteelandt
S.
Odds ratios for mediation analysis for a dichotomous outcome
.
Am J Epidemiol
2010
;
172
:
1339
48
.

Author notes

Yi Liu and Hongjin Lai have contributed equally to this work.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/pages/standard-publication-reuse-rights)

Supplementary data