Body mass index and incidence of lung cancer in the HUNT study: using observational and Mendelian randomization approaches
BMC Cancer volume 22, Article number: 1152 (2022)
Traditional observational studies have shown an inverse association between body mass index (BMI) and lung cancer risk. Mendelian randomization (MR) analysis using genetic variants as instruments for BMI may clarify the nature of the association.
We studied the causal association between BMI and lung cancer incidence using observational and MR approaches.
We followed up 62,453 cancer-free Norwegian adults from 1995–97 (HUNT2) until 2017. BMI at baseline in HUNT2 was classified as < 25.0, 25.0–29.9 and ≥ 30.0 kg/m2. BMI change over ten years between HUNT1 (1984–86) and HUNT2 was calculated and classified into quartiles. Seventy-five genetic variants were included as instruments for BMI (among which 14 also associated with smoking behavior). Incident lung cancer cases were ascertained from the Cancer Registry of Norway. Cox regression models were used to estimate hazard ratios (HRs) with 95% confidence intervals (CIs). Multivariable MR was used to examine the effect of BMI after genetically controlling for smoking.
During a median follow-up of 21.1 years, 1009 participants developed lung cancer including 327 with lung adenocarcinoma. The HRs and 95% CIs for incidence of adenocarcinoma were 0.73 (0.58–0.92) for BMI 25.0–29.9 kg/m2 and 0.53 (0.37–0.76) for BMI ≥ 30 kg/m2 compared with BMI < 25.0 kg/m2 in HUNT2 (P for trend < 0.001). However, there was little evidence of a dose–response relationship between the BMI change from HUNT1 to HUNT2 in quartiles and the incidence of adenocarcinoma (P for trend = 0.08). Furthermore, multivariable MR approach suggested a positive association between genetically determined 1 kg/m2 increase in BMI and the incidence of adenocarcinoma (HR 1.25, 95% CI 1.02–1.53). No associations were found with other lung cancer histologic types.
Our study suggests that the inverse association between baseline BMI and lung adenocarcinoma in observational analysis may not be causal. More MR studies are needed to confirm our finding of a positive association between BMI and lung adenocarcinoma.
Body mass index (BMI) is inversely associated with incidence of lung cancer especially with adenocarcinoma as a major histologic type in previous traditional observational studies [1, 2]. Given that tobacco smoking accounts for around 80–90% of the risk of lung cancer  and that there is a complex interrelationship between smoking and BMI , residual confounding by smoking may explain for this inverse association [5, 6]. In addition, reverse causation due to weight loss before the lung cancer diagnosis  and competing risk due to death associated with obesity  may also contribute to the observed inverse association.
On the other hand, the measurements of BMI in most observational studies are attained at one time point [1, 2, 8, 9]. Although BMI change between at least two time points can better examine the potential causal association, BMI change in relation to the incidence of lung cancer has not been well investigated. One case–control study suggested a dose–response inverse association between BMI gain in adulthood and risk of lung cancer, but more evidence from prospective cohort studies is required .
Mendelian randomization (MR) approach uses genetic variants as instrumental variables for the risk factor of interest. The advantage of MR is that genetic variants are randomly assigned at conception . Bias due to reverse causation can be avoided and the influence of residual confounding is reduced . Furthermore, genetically determined BMI reflects the level of BMI across the lifespan and therefore is more accurate than a single measurement .
However, pleiotropy via smoking may lead to bias in the univariable MR since some of the genetic variants for BMI have been suggested to link with smoking . Multivariable MR, as an extension of univariable MR, can estimate the causal effects of multiple risk factors simultaneously [14, 15]. To date, there is only one such study on this topic showing an inverse association between BMI and incidence of adenocarcinoma but a positive association of BMI with small cell lung cancer . The finding of the inverse association with lung adenocarcinoma in this multivariable MR study is inconsistent with results from the previous univariable MR studies which demonstrated no association [12, 16, 17]. Thus, more multivariable MR studies are needed to clarify the potential associations and thereby to improve our understanding on the complex biological mechanisms underlying different lung cancer histologic types.
To study the causal relationships between BMI and incidence of lung cancer overall and histologic types, we first applied observational analyses using both BMI and BMI change as exposures in a large homogenous population of Norwegian adults who were followed up for over 20 years. Secondly, the possible causal associations were examined using a one-sample multivariable MR approach genetically controlling for smoking.
Study design and population
We used data from the second survey of The Trøndelag Health Study (HUNT2, 1995–97) as the baseline. In total, 65,227 adults (69% of the invited) participated in HUNT2  and were followed up until the date of first diagnosis of lung cancer, the date of death or emigration from Norway or the end of follow-up on December 31, 2017, whichever came first. Lung cancer diagnoses across histologic types were obtained from the Cancer Registry of Norway. Information on vital status and emigration was obtained from the National Population Registry.
We first excluded 2053 participants with previous cancer diagnoses before the baseline based on information from the Cancer Registry of Norway (Fig. 1). Afterwards 721 participants with missing information on BMI in HUNT2 were excluded, leaving 62,453 adults for the analysis between BMI in HUNT2 and lung cancer incidence. We additionally excluded 18,060 participants with missing information on BMI from the first survey of The Trøndelag Health Study (HUNT1, 1984–1986), leaving 44,393 participants for the analysis between BMI change from HUNT1 to HUNT2 and lung cancer incidence. Among the 18,060 participants who had missing information on BMI in HUNT1, about 50% of them were not eligible to participate in HUNT1 due to age < 20 years. In the multivariable MR analysis, 7942 participants without information on genetic variants for BMI in HUNT2 were excluded and 54,511 participants were included (Fig. 1).
BMI, BMI change and genetic variants for BMI
In both HUNT1 and HUNT2, weight and height were measured by health professionals at clinical examination. Height was measured to the nearest centimeters and weight to the nearest 0.5 kg. BMI was calculated as weight in kilograms divided by height squared in meter (kg/m2). BMI in HUNT2 was initially categorized into < 18.5, 18.5–24.9, 25.0–29.9 30.0–34.9 and ≥ 35.0 kg/m2 according to the recommendations of the World Health Organization (WHO) . Due to limited lung cancer cases in the BMI categories < 18.5 (n = 11) and ≥ 35.0 kg/m2 (n = 30), the BMI categories were collapsed into three groups such as: < 25.0, 25.0–29.9 and ≥ 30.0 kg/m2. BMI change from HUNT1 to HUNT2 was categorized into quartiles [1st (-21.3–0.5), 2nd (0.6–1.7), 3rd (1.8–3.1) and 4th (3.2–18.6) kg/m2].
DNA samples were extracted from blood samples that were collected in HUNT2 and stored in the HUNT Biobank. Genome-wide genotyping and imputation were carried out for all participants in HUNT2 with sample and variant quality control by using Illumina Humina HumanCoreExome arrays . Seventy-seven single nucleotide polymorphisms (SNPs) were suggested as candidate instrumental variables for BMI with a P value < 5 × 10–8 based on European sex-combined analyses in a genome-wide association study of the Genetic Investigation of Anthropometric Traits consortium (GIANT) . Information on 2 SNPs (rs12016871 and rs2033732) was missing in the HUNT data since they did not pass imputation quality control, leaving 75 BMI-associated SNPs for our analysis. We then split the remaining 75 BMI-associated SNPs into two groups [13, 15]: 1) 61 SNPs that only affected BMI (BMI-only SNPs) and 2) 14 SNPs that affected both BMI and smoking (BMI & smoking SNPs). The 14 BMI & smoking SNPs were identified based on a P value < 0.05 for the association between each of the 75 BMI-associated SNPs and smoking status in our study population.
Other baseline variables
Regarding smoking, participants were classified into never, former [≤ 10.0, 10.1–20.0 and > 20.1 pack-years (pyrs)] and current (≤ 10.0, 10.1–20.0 and > 20.1 pyrs). Other covariates were categorized as: sex (women and men), passive smoking (never, only childhood, only adulthood, and both), alcohol consumption (never, 1–4, and ≥ 5 times/month), leisure physical activity (inactive and active), total sitting time daily (0–4, 5–7, and ≥ 8 h), occupational activity (most sedentary work, much walking or lifting at work, and heavy physical work), education (< 10 and ≥ 10 years), economic difficulty (yes and no), family history of cancer (yes and no), self-reported chronic obstructive pulmonary disease (COPD) (yes and no) and asthma (no asthma, non-active asthma, and active asthma). Missing information on each of the aforementioned variables was included in the analyses as an “unknown” category.
Ascertainment of lung cancer
By using the unique 11-digit personal identification number, participants’ information from HUNT2 was linked to the Cancer Registry of Norway . The International Classification of Diseases version 10 (ICD-10) codes used for registration of lung cancer are C33-C34 . Histologic types were classified according to International Classification of Diseases of Oncology (ICD-O) . Data from the Cancer Registry of Norway are reported to be reasonably accurate and complete .
Baseline characteristics of the participants were presented by BMI categories in HUNT2. Cox proportional hazard models were used to examine the associations with incidence of lung cancer. We assessed the proportional hazards assumption by Schoenfeld residuals for exposures and all covariates. Tvc option of the stcox command in Stata was used if hazards were non-proportional. Crude and adjusted hazard ratios (HRs) with 95% confidence intervals (CIs) were calculated with age as the underlying time variable. Potential confounders were selected based on previous knowledge [25,26,27]. Included confounders in the main model were sex, smoking status combined with pack-years, passive smoking, physical activity, total sitting time daily, education, economic difficulties, family history of cancer, and self-reported COPD. Asthma, alcohol consumption, and occupational activity were adjusted in an additional model.
We performed several sensitivity analyses to test the robustness of our observational findings: 1) To address reverse causality due to existing but undiagnosed lung cancer, we excluded the first five years of follow-up. 2) Potential competing risk due to death was investigated using the Fine-Gray model . 3) Residual confounding due to information missing was investigated by multivariable chained imputation with fully conditional specification (m = 10 imputed datasets) for all covariates based on the assumption of missing at random. 4) To further address residual confounding by smoking, we performed negative control exposure analysis by using migraine as an alternative exposure  (details described in Supplementary Text 1). Previous study suggested that migraine was associated with smoking [30, 31] but not with lung cancer.
Furthermore, we performed a multivariable MR study to assess the potential causal association between genetically predicted BMI and incidence of lung cancer genetically controlling for the influence of smoking status. All 75 BMI-associated SNPs (61 BMI-only SNPs and 14 BMI & smoking SNPs) were used as instrument variables and r2 measure of linkage disequilibrium among instruments were < 0.01 at a 10-MB window . BMI and smoking status in HUNT2 were both regarded as exposures in the multivariable MR. We obtained estimates on SNPs–BMI, SNPs–smoking (regarded as an ordinal variable) and SNPs–lung cancer associations from the same individuals and applied two-sample MR methods such as the inverse variance weighted (IVW) and MR-Egger methods. Both methods could be used in a one-sample setting . We calculated coefficients and standard errors with adjustment for sex, age and age-squared for the SNPs-BMI associations . No adjustments were made for the SNPs-smoking and SNPs-lung cancer associations since no associations between the 75 SNPs and the other confounders were found except for smoking. Sanderson-Windmeijer conditional F-statistics was used to estimate the strength of the instruments for BMI conditional on smoking . Cochran’s Q tests for both IVW and MR-Egger were used to detect heterogeneity between the ratio estimates of SNPs. The intercept test of the MR-Egger was used to assess possibility of horizontal pleiotropy. The outliers in both multivariable IVW and MR-Egger regression methods were identifed using the MR-Pleiotropy Residual Sum and Outlier (MR-PRESSO)  method.
We also applied different univariable MR methods based on the 61 BMI-only SNPs as sensitivity analyses to test the robustness of the results from the multivariable MR analysis: 1) Two-stage method based on the weighted BMI genetic risk score (details described in Supplementary Text 2). 2) IVW and MR-Egger methods using summary data of the 61 individual genetic variants (Supplementary Text 2). All statistical analyses were performed with STATA/SE 16.1 (College Station, TX, USA) or R (4.0.2). The package “MendelianRandomization” was used for the multivariable MR in R.
In total, 1009 out of the 62,453 participants developed lung cancer during a median follow-up of 21.1 years, among which 327 were adenocarcinoma. Compared to those with BMI < 25.0 kg/m2, participants with BMI 25.0–29.9 and ≥ 30.0 kg/m2 were older, more likely to be former smokers or non-drinkers, less active or lower educated and were more likely to have a family history of cancer at baseline (Table 1).
In the observational analyses, BMI in HUNT2 was inversely associated with the incidence of lung cancer overall after adjustment for smoking and other confounders (Table 2). The HR was 0.79 (95% CI 0.69–0.91) for BMI 25.0–29.9 kg/m2 and 0.75 (95% CI 0.62–0.91) for BMI ≥ 30.0 kg/m2 compared with BMI < 25.0 kg/m2. There was a stronger inverse association between BMI and lung adenocarcinoma (P for trend < 0.001), and the corresponding HRs were 0.73 (95% CI 0.58–0.92) and 0.53 (95% CI 0.37–0.76). The corresponding HRs for 1 kg/ m2 increase in BMI was 0.95 (95% CI 0.92–0.98) for adenocarcinoma. No clear association was found for incidence of small cell or squamous cell lung cancer (Table 2). Additional adjustment for asthma, alcohol consumption, and occupational activity did not change the results markedly.
Results of the sensitivity analyses provided supportive evidence for the above findings. 1) After excluding the first five years of follow-up, the association estimate of BMI in HUNT2 was slightly attenuated for lung cancer overall (n = 854) but remained similar for adenocarcinoma (n = 281) (Supplementary Table S1). 2) The competing risk analysis showed similar results to our primary results even if there were many cases of death (n = 15,472) (Supplementary Table S2). 3) Multiple imputation for all covariates including smoking showed comparable association estimates between BMI in HUNT2 and adenocarcinoma. 4) In the analysis of using migraine as a negative control exposure (Supplementary Text 1, Supplementary Fig. 1, and Supplementary Table S3), no clear association between migraine and incidence of adenocarcinoma was demonstrated, suggesting that our observed inverse association of BMI in HUNT2 with adenocarcinoma was less likely to be explained by residual confounding from smoking.
Despite an inverse linear trend between BMI measured at one time point in HUNT2 and adenocarcinoma, there was little evidence of a dose–response relationship between the BMI change from HUNT1 to HUNT2 in quartiles and the incidence of adenocarcinoma (P for trend = 0.08, Table 3). Compared to participants with BMI change in the 1st quartile (-21.3–0.5), participants with BMI change of 0.6–1.7, 1.8–3.1 and 3.2–18.6 kg/m2 had a HR of 0.68 (95% CI 0.49–0.92), 0.68 (95% CI 0.49–0.94), and 0.78 (95% CI 0.56–1.09), respectively. Compared with participants who had information on BMI change from HUNT1 to HUNT2 (n = 44,393), those without information on the BMI change (n = 18,060) were younger, more physically active and had higher socio-economic status (data not presented).
The MR analyses were performed in 54,511 of the 62,453 participants with available BMI-associated SNPs. There were no major differences in most of the baseline characteristics between participants with (n = 54,511) and without (n = 7942) information on the SNPs, except for alcohol consumption and physical activity level showing relatively larger difference in the two populations (Supplementary Table S4). The F statistic value of the genetic risk score including 75 BMI-associated SNPs was 1174 and it explained 2.0% of the variance of BMI in HUNT2. The conditional F statistic was 242, which suggested that the SNPs used in the multivariable MR were good instruments for BMI conditional on smoking. Results from the Cochran’s Q tests for adenocarcinoma suggested possible heterogeneity of the ratio estimates (P for Q = 0.01, Table 4). After removing one outlier SNP (rs2121279) identified by the MR-PRESSO method, there was no heterogeneity between the remaining ratio estimates (P for Q = 0.09, Table 4). Each 1 kg/m2 increment in genetically determined BMI increased the incidence of adenocarcinoma by 25% (HR 1.25, 95% CI 1.02–1.53) in the multivariable MR analysis after removal of the outlier (Table 4). Multivariable MR-Egger method showed similar result (HR 1.52, 95% CI 1.09–2.11). There was no evidence of association between genetically determined BMI and incidence of other histologic types. The intercept test from MR-Egger suggested the pleiotropy was balanced as the P value for the intercept test was > 0.05. Finally, results from the univariable MR analyses using the 61 BMI-only SNPs showed similar results as in the multivariable MR analysis with wider 95% CIs (Supplementary Text 2, Supplementary Table S5 and Supplementary Table S6).
In the observational analyses, we found BMI at baseline was inversely associated with the incidence of lung adenocarcinoma in a dose–response fashion. However, the dose–response relationship was not supported by the results of the BMI change. In the multivariable MR analysis, we observed a positive association between genetically determined BMI and the incidence of adenocarcinoma after taking smoking into account. There was no evidence of associations with other histologic types in either the observational or the MR analysis.
Our observed inverse association between BMI and incidence of adenocarcinoma was consistent with previous observational studies [1, 2]. However, many previous studies might suffer from insufficient adjustment for smoking, reverse causation due to preclinical weight loss and/or competing risk of death. We performed several sensitivity analyses such as multiple imputation and negative control exposure by migraine to address the possibility of residual confounding by smoking. We also excluded the first five years of follow-up to address the possibility of reverse causation and performed analysis to address competing risk of death. Results from these sensitivity analyses appeared to support the inverse association between BMI at baseline and adenocarcinoma.
However, this observed inverse association may not be interpreted as a causal association for the following reasons: 1) Unlike the results of BMI at baseline, our results of BMI change over ten years in the adulthood did not show a dose–response relationship with the incidence of adenocarcinoma. 2) We were not able to completely exclude the residual confounding by smoking although we had attempted to address it in several ways. This is supported by the findings from two large cohort studies of 1.2 million women in the UK [5, 6], in which no clear association between BMI and incidence of lung cancer overall was found in never smokers. 3) Unmeasured or unknown confounders always exist in observational studies. Obesity is accompanied with many other lifestyle factors, some of which may not be measured or are unknown. 4) Our multivariable MR analysis suggested a positive association between BMI and the incidence of adenocarcinoma instead of an inverse association.
Using genetic variants as proxies of BMI, MR studies avoid reverse causation and have a better control for residual confounding . Additionally, in contrast to observational studies, MR studies can reduce bias due to measurement errors of BMI and reflect the BMI level across the lifespan. To date, there have been limited MR studies and the findings are inconsistent [12, 13, 16, 17]. Most of these studies have applied an univariable MR approach [12, 16, 17]. The interaction between BMI and smoking in the development of lung cancer seems complicated [13, 16]. Thus, the assumption of no horizontal pleiotropy may be violated in the univariable MR without taking account of smoking. By using multivariable MR approach, however, the effects of BMI and smoking on lung cancer can be jointly examined . Unlike our finding, Zhou et al. found an inverse association between BMI and risk of adenocarcinoma after controlling for smoking in a two-sample multivariable MR . However, the study by Zhou et al. might be subjected to weak instruments bias since the value of conditional F statistics was below 10 . In contrast, our result of conditional F statistics was 242 and thereby the genetic variants in our study seemed to be better instruments for BMI after controlling for smoking. In addition, it was not possible to check the assumption of independence in the study by Zhou et al. using a two-sample MR . In our one-sample MR study, we had the possibility to thoroughly check for the associations between BMI-associated SNPs and the important confounders and further adjusted for self-reported COPD when calculating the effect estimates between the BMI-only SNPs and adenocarcinoma. The effect estimates showed similar trends in our multivariable MR using all the BMI-associated SNPs and in the univariable MR using the BMI-only SNPs.
The mechanisms for a positive association between BMI and lung adenocarcinoma are unknown but some possibilities have been suggested: 1) High insulin resistance related to obesity might contribute to the lung carcinogenesis [35, 36]. 2) Adipokines (leptin and adiponectin) secreted by the adipose tissue might affect the progress of carcinogenesis through irregular immunomodulation or chronic inflammation [36, 37]. Although we did not find associations, previous MR studies reported positive associations between BMI and small cell or squamous cell lung cancer [12, 16]. Thus, the impact of BMI on the incidence of lung cancer histologic types and the underlying mechanisms warrants further investigation. The latest lung cancer genome-wide association studies identified distinct genetic variants for different lung histologic types [38, 39]. For example, the identified variants specifically for lung adenocarcinoma are near genes related to lung function, telomere regulation and endogenous DNA damage. Therefore, different mechanisms may exist for different histologic types. If positive associations were confirmed in future studies, control of body weight could be another preventive measure to reduce the incidence of lung cancer in addition to smoking cessation.
Our study is the first study that has explored the association between BMI and the incidence of lung cancer using different approaches in a large and homogenous population. A large and homogeneous study population can minimize population stratification bias . A long follow-up duration over 20 years made it possible to include a large number of lung cancer cases and the statistical power was ample. We also had objective measurements of BMI and validated information of lung cancer cases from the Cancer Registry of Norway  which reduced bias due to misclassification. The F statistic values including the conditional F statistic suggested that the BMI-associated SNPs were good instruments.
However, our study has several limitations. Firstly, selection bias cannot be completely ruled out for the MR results since 13% (7942/62453) of the participants had no genetic information on BMI. The excluded participants in the MR analyses had lower alcohol consumption and lower physical activity level than the included participants. However, the proportion of BMI ≥ 25.0 kg/m2 (60.0% & 59.9%) and the proportion of lung cancer cases (1.7% & 1.6%) were similar between the excluded and included participants. Secondly, we could not exclude the possibility of pleiotropy for the association between BMI and adenocarcinoma since there was evidence of heterogeneity in BMI-associated SNPs in both the multivariable and univariable MR analyses before the outlier was removed. Nevertheless, our results from the multivariable MR-Egger, which was relatively robust for horizontal pleiotropy, suggested the pleiotropic effects were balanced around the overall effect. After removing the outlier, no heterogeneity was shown.
Overall, our study suggests that the inverse association between BMI and the incidence of lung adenocarcinoma in the observational analyses may not be causal. Reverse causation and residual confounding by smoking were less likely to explain for the observed inverse association. More multivariable MR studies using large individual data are needed to confirm or refute our finding of a positive association between BMI and lung adenocarcinoma.
Availability of data and material
Data from the HUNT Study is available on request to the HUNT Data Access Committee (hunt@ medisin.ntnu.no) when is used in research projects. The HUNT data access information describes the policy regarding data availability (https://www.ntnu.edu/hunt/data).
Body mass index
Chronic obstructive pulmonary disease
Genetic Investigation of Anthropometric Traits consortium
The Trøndelag Health Study
International Classification of Diseases version 10
International Classification of Diseases of Oncology
Inverse variance weighted
MR- Pleiotropy Residual Sum and Outlier
Seventy-seven single nucleotide polymorphisms
World Health Organization
Yu D, Zheng W, Johansson M, Lan Q, Park Y, White E, et al. Overall and central obesity and risk of lung cancer: a pooled analysis. J Natl Cancer Inst. 2018;110(8):831–42.
Yang Y, Dong JY, Sun KK, Zhao L, Zhao F, Wang LL, et al. Obesity and incidence of lung cancer: a meta-analysis. Int J Cancer. 2013;132(5):1162–9.
Khuder SA. Effect of cigarette smoking on major histological types of lung cancer: a meta-analysis. Lung Cancer. 2001;31(2):139–48.
Taylor AE, Richmond RC, Palviainen T, Loukola A, Wootton RE, Kaprio J, et al. The effect of body mass index on smoking behaviour and nicotine metabolism: a Mendelian randomization study. Hum Mol Genet. 2018;28(8):1322–30.
Pirie K, Peto R, Green J, Reeves GK, Beral V. Lung cancer in never smokers in the UK Million Women Study. Int J Cancer. 2016;139(2):347–54.
Reeves GK, Pirie K, Beral V, Green J, Spencer E, Bull D. Cancer incidence and mortality in relation to body mass index in the Million Women Study: cohort study. BMJ. 2007;335(7630):1134.
Kanashiki M, Sairenchi T, Saito Y, Ishikawa H, Satoh H, Sekizawa K. Body mass index and lung cancer: a case-control study of subjects participating in a mass-screening program. Chest. 2005;128(3):1490–6.
Zhu H, Zhang S. Body mass index and lung cancer risk in never smokers: a meta-analysis. BMC Cancer. 2018;18(1):635.
Sanikini H, Yuan JM, Butler LM, Koh WP, Gao YT, Steffen A, et al. Body mass index and lung cancer risk: a pooled analysis based on nested case-control studies from four cohort studies. BMC Cancer. 2018;18(1):220.
Tarleton HP, Park SL, Zhu WM, Lee YCA, Hashibe M, Morgenstern H, et al. Body mass index change in adulthood and lung and upper aerodigestive tract cancers. Int J Cancer. 2012;131(6):1407–16.
Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey SG. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med. 2008;27(8):1133–63.
Mariosa D, Carreras-Torres R, Martin RM, Johansson M, Brennan P. Commentary: What can Mendelian randomization tell us about causes of cancer? Int J Epidemiol. 2019;48(3):816–21.
Zhou W, Liu G, Hung RJ, Haycock PC, Aldrich MC, Andrew AS, et al. Causal relationships between body mass index, smoking and lung cancer: univariable and multivariable Mendelian randomization. Int J Cancer. 2021;148(5):1077–86.
Sanderson E, Davey Smith G, Windmeijer F, Bowden J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int J Epidemiol. 2019;48(3):713–27.
Burgess S, Thompson SG. Multivariable Mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. 2015;181(4):251–60.
Carreras-Torres R, Johansson M, Haycock PC, Wade KH, Relton CL, Martin RM, et al. Obesity, metabolic factors and risk of different histological types of lung cancer: A Mendelian randomization study. PLoS ONE. 2017;12(6):e0177875.
Gao C, Patel CJ, Michailidou K, Peters U, Gong J, Schildkraut J, et al. Mendelian randomization study of adiposity-related traits and risk of breast, ovarian, prostate, lung and colorectal cancer. Int J Epidemiol. 2016;45(3):896–908.
Krokstad S, Langhammer A, Hveem K, Holmen TL, Midthjell K, Stene TR, et al. Cohort Profile: the HUNT Study. Norway Int J Epidemiol. 2013;42(4):968–77.
World Health Organization. Preventing and managing the global epidemic [report of a WHO Consultation on Obesity]. Geneva: World Health Organization; 2004.
Ferreira MA, Vonk JM, Baurecht H, Marenholz I, Tian C, Hoffman JD, et al. Shared genetic origin of asthma, hay fever and eczema elucidates allergic disease biology. Nat Genet. 2017;49(12):1752–7.
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518(7538):197–206.
Cancer Registry of Norway. Cancer in Norway 2014-Cancer incidence, mortality, survival and prevalence in Norway. Oslo: Cancer Registry of Norway; 2015.
International classification of diseases for oncology (ICD-O). 3rd ed., 1st revision. World Health Organization; 2013. https://apps.who.int/iris/handle/10665/96612.
Larsen IK, Småstuen M, Johannesen TB, Langmark F, Parkin DM, Bray F, et al. Data quality at the Cancer Registry of Norway: an overview of comparability, completeness, validity and timeliness. Eur J Cancer. 2009;45(7):1218–31.
Jiang L, Sun YQ, Brumpton BM, Langhammer A, Chen Y, Nilsen TIL, et al. Prolonged Sitting, Its Combination With Physical Inactivity and Incidence of Lung Cancer: Prospective Data From the HUNT Study. Front Oncol. 2019;9:101.
Sun YQ, Langhammer A, Wu C, Skorpen F, Chen Y, Nilsen TIL, et al. Associations of serum 25-hydroxyvitamin D level with incidence of lung cancer and histologic types in Norwegian adults: a case-cohort analysis of the HUNT study. Eur J Epidemiol. 2018;33(1):67–77.
Jiang L, Sun YQ, Langhammer A, Brumpton BM, Chen Y, Nilsen TI, et al. Asthma and asthma symptom control in relation to incidence of lung cancer in the HUNT study. Sci Rep. 2021;11(1):4539.
Austin P, Fine J. Practical recommendations for reporting Fine-Gray model analyses for competing risk data. Stat Med. 2017;36(27):4391–400.
Lipsitch M, Tchetgen ET, Cohen T. Negative controls: a tool for detecting confounding and bias in observational studies. Epidemiology. 2010;21(3):383.
López-Mesonero L, Márquez S, Parra P, Gamez-Leyva G, Munoz P, Pascual J. Smoking as a precipitating factor for migraine: a survey in medical students. J Headache Pain. 2009;10(2):101.
Le H, Tfelt-Hansen P, Skytthe A, Kyvik KO, Olesen J. Association between migraine, lifestyle and socioeconomic factors: a population-based cross-sectional study. J Headache Pain. 2011;12(2):157–72.
Minelli C, Fabiola Del Greco M, van der Plaat DA, Bowden J, Sheehan NA, Thompson J. The use of two-sample methods for Mendelian randomization analyses on single large datasets. Int J Epidemiol. 2021;50(5):1651–9.
Sanderson E, Windmeijer F. A weak instrument F-test in linear IV models with multiple endogenous variables. J Econom. 2016;190(2):212–21.
Verbanck M, Chen CY, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8.
Petridou ET, Sergentanis TN, Antonopoulos CN, Dessypris N, Matsoukis IL, Aronis K, et al. Insulin resistance: an independent risk factor for lung cancer? Metabolism. 2011;60(8):1100–6.
Khandekar MJ, Cohen P, Spiegelman BM. Molecular mechanisms of cancer development in obesity. Nat Rev Cancer. 2011;11(12):886–95.
Ribeiro R, Araújo A, Lopes C, Medeiros R. Immunoinflammatory mechanisms in lung cancer development: is leptin a mediator? J Thorac Oncol. 2007;2(2):105–8.
McKay JD, Hung RJ, Han Y, Zong X, Carreras-Torres R, Christiani DC, et al. Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes. Nat Genet. 2017;49(7):1126–32.
Byun J, Han Y, Li Y, Xia J, Long E, Choi J, et al. Cross-ancestry genome-wide meta-analysis of 61,047 cases and 947,237 controls identifies new susceptibility loci contributing to lung cancer. Nat Genet. 2022;54(8):1167–77.
Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65.
The HUNT Study is collaboration between HUNT Research Centre (Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology—NTNU), Nord-Trøndelag County Council, Central Norway Regional Health Authority and the Norwegian Institute of Public Health.
Open access funding provided by Norwegian University of Science and Technology. LJ was supported by funding from the Stiftelsen Dam through the Norwegian Cancer Society (project no. 90539400) and top-up funding from the collaboration partner between the Liaison Committee for Education, Research and Innovation in Central Norway and Central Norway Regional Health Authority. YQS was supported by a researcher grant from The Liaison Committee for Education, Research and Innovation in Central Norway (project ID 2018/42794). BMB works in a research unit funded by Stiftelsen Kristian Gerhard Jebsen; Faculty of Medicine and Health Sciences, NTNU; The Liaison Committee for Education, Research and Innovation in Central Norway; and Medical Research Council Integrative Epidemiology Unit at the University of Bristol. None of the funding sources was involved in any aspect of the study design, conduct, analysis, interpretation of data, or writing the report.
Ethics approval and consent to participate
The study was approved by the Regional Committee for Medical and Health Research Ethics of South-East Norway (2015/78/REC South-East). All participants signed informed written consent on participation in HUNT. The study was performed in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.
Consent for publication
The authors declare no potential conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The association of BMI in HUNT2 with incidence of lung cancer overall and different histologic types after excluding the first five years’ follow-up, the HUNT Study, 1995-97 to2017 (N = 59,711). Supplementary Table S2. The associations of BMI in HUNT2 with lung cancer incidence taking account of competing risk due to death, the HUNT Study, 1995-97 to 2017 (N = 62,453). Supplementary Text 1. Analysis using negative control exposure. Supplementary Figure 1. DAG for body mass index(BMI) (as the main exposure), migraine (as the negative control exposure) and incidence of lung cancer (as the outcome). Supplementary Table S3. Negative control using migraine as an alternative exposure to address residual confounding by smoking for the association of BMI in HUNT2 with lung cancer incidence across different lung cancer subtypes, the HUNT Study, 1995-97 to 2017(N = 49,969). Supplementary Table S4. Comparison of baseline characteristics of participants with complete and missing information on 75 BMI SNPs in HUNT2. Supplementary Text 2. Univariable Mendelian randomization (MR) analyses using the 61 single-nucleotide polymorphisms only for body mass index (BMI-Only SNPs). Supplementary Table S5. Associations of externally weighted BMI GRS1 based on 61 BMI-Only SNPs with potential confounders in HUNT2, 1995-1997 (N = 54,511). Supplementary Table S6. The association of BMI with incidence of lung cancer overall and different histologic types based on the univariable MR analyses using 61 BMI-Only SNPs,the HUNT Study, 1995-97 to 2017 (N = 54,511).
About this article
Cite this article
Jiang, L., Sun, YQ., Brumpton, B.M. et al. Body mass index and incidence of lung cancer in the HUNT study: using observational and Mendelian randomization approaches. BMC Cancer 22, 1152 (2022). https://doi.org/10.1186/s12885-022-10215-0