- Research article
- Open Access
Discovery of breast cancer risk genes and establishment of a prediction model based on estrogen metabolism regulation
BMC Cancer volume 21, Article number: 194 (2021)
Multiple common variants identified by genome-wide association studies have shown limited evidence of the risk of breast cancer in Chinese individuals. In this study, we aimed to uncover the relationship between estrogen levels and the genetic polymorphism of estrogen metabolism-related enzymes in breast cancer (BC) and establish a risk prediction model composed of estrogen-metabolizing enzyme genes and GWAS-identified breast cancer-related genes based on a polygenic risk score.
Unrelated BC patients and healthy subjects were recruited for analysis of estrogen levels and single nucleotide polymorphisms (SNPs) in genes encoding estrogen metabolism-related enzymes. The polygenic risk score (PRS) was used to explore the combined effect of multiple genes, which was calculated using a Bayesian approach. An independent sample t-test was used to evaluate the differences between PRS scores of BC and healthy subjects. The discriminatory accuracy of the models was compared using the area under the receiver operating characteristic (ROC) curve.
The estrogen homeostasis profile was disturbed in BC patients, with parent estrogens (E1, E2) and carcinogenic catechol estrogens (2/4-OHE1, 2-OHE2, 4-OHE2) significantly accumulating in the serum of BC patients. We then established a PRS model to evaluate the role of SNPs in multiple genes. PRS model 1 (M1) was established from SNPs in 6 GWAS-identified high-risk genes. On the basis of M1, we added SNPs from 7 estrogen metabolism enzyme genes to establish PRS model 2 (M2). The independent sample t-test showed no difference in PRS between BC and healthy subjects under M1 (P = 0.17) but a significant difference under M2 (P = 4.9 × 10⁻⁵). The ROC analysis showed that M2 (AUC = 62.18%) discriminated breast cancer risk better than M1 (AUC = 54.56%).
Estrogen and related metabolic enzyme gene polymorphisms are closely related to BC. The model constructed by adding estrogen metabolic enzyme gene SNPs has a good predictive ability for breast cancer risk, and the accuracy is greatly improved compared with that of the PRS model that only includes GWAS-identified gene SNPs.
Breast cancer is the most common malignant disease among women worldwide, accounting for 24% of new cancer cases and 15% of cancer deaths in 2018. According to the GLOBOCAN Cancer Tomorrow prediction tool, incident cases are expected to increase by more than 46% by 2040, which will seriously endanger women’s lives and health. The understanding of breast cancer is deepening, and new treatment strategies for tumors, including breast cancer, are continually emerging [2, 3]. With continuous improvements in diagnosis and treatment methods, the survival rate of breast cancer patients has improved greatly. Early prediction, early detection, and early treatment of high-risk groups remain key issues that urgently need to be solved in the clinic.
The occurrence and development of breast cancer are closely related to genetic and environmental factors. In 1989, Gail et al. proposed a breast cancer risk prediction model that included factors such as age at evaluation, age at menarche, age at first live birth, race, number of previous breast biopsies, and family history of breast cancer [4, 5]. Some subsequent prediction models also involved BRCA1/2, estrogen replacement therapy, mammography screening times, and genetic polymorphisms. Rare high-risk mutations, particularly in the BRCA1 and BRCA2 genes, explain less than 20% of the twofold familial relative risk (FRR) and account for a small proportion of breast cancer cases in the general population. Low-frequency variants conferring intermediate risk, such as those in CHEK2, ATM, and PALB2, explain 2 to 5% of the FRR. Genome-wide association studies (GWASs) have led to the discovery of multiple common, low-risk variants (single nucleotide polymorphisms [SNPs]) associated with breast cancer risk. Recently, genetic factors were estimated to account for 31% of breast cancer risk, which indicates that breast cancer is a multifactorial disease and that genetic factors are important etiological factors in its occurrence and development. An increasing number of researchers now aim to develop a comprehensive genetic risk score that evaluates the polygenic effects of GWAS-identified SNPs [9,10,11]. In one well-known study, Mavaddat et al. used 77 GWAS-selected SNPs to construct a PRS for BC. Compared with women in the middle quintile of the polygenic score, women in the highest 1% had a threefold higher risk.
GWASs also have their own limitations. First, a major limitation of genome-wide approaches is the need to adopt a stringent significance threshold to account for multiple testing. Second, GWASs explain only a modest fraction of the missing heritability. Estrogen is an important risk factor for breast cancer. With long-term exposure, supraphysiological concentrations of estrogen can bind to estrogen receptors, mediate the overexpression of various growth factors, and promote the growth and proliferation of cells. In addition, various metabolites of estrogen can form adducts with DNA, induce genetic mutations and produce direct genotoxicity. Thus, the abnormal accumulation of estrogen and its toxic metabolites in breast tissue is an important risk factor for breast cancer development. Estrogen homeostasis is regulated by estrogen-related metabolic enzymes. Endogenous estrogens are metabolized to 2-, 4- and 16α-hydroxy estrogens in reactions catalyzed by the phase I metabolizing enzymes cytochrome P450 CYP1A1, CYP1B1 and CYP3A4, respectively [14,15,16]. Hydroxyestrogens are detoxified by conjugation reactions catalyzed by phase II metabolizing enzymes such as COMT, UGTs and SULTs. Thus, the levels of estrogen and its toxic metabolites can, to a certain extent, be considered a comprehensive reflection of the activity of these estrogen metabolic enzymes. Polymorphisms in genes encoding these estrogen-related metabolic enzymes are reported to be closely related to differences in enzyme activities and to alter the levels of DNA-damaging species, thereby influencing an individual’s susceptibility to breast cancer [14, 17, 18]. Genetic epidemiological studies have suggested that there is a correlation between polymorphisms in estrogen metabolism genes and breast cancer risk; however, these results are not consistent [18,19,20].
An important reason for the inconsistency of existing results is that previous studies examined the correlation between individual estrogen metabolic enzyme gene polymorphisms and breast cancer in isolation. Currently, breast cancer risk gene prediction models have not taken estrogen metabolic enzyme genes into consideration; therefore, further optimization is needed from the perspective of overall estrogen metabolism levels.
Based on the above analysis, this study aimed to characterize the pattern of estrogen homeostasis disturbance in breast cancer and to explore the association between metabolic enzyme gene polymorphisms and breast cancer occurrence at the level of overall estrogen metabolism. Furthermore, we developed a risk score comprising GWAS-selected SNPs and estrogen metabolic enzyme gene SNPs to optimize the breast cancer risk prediction model.
The standards and other chemical reagents were described in our previously published study.
Clinical sample collection
Serum samples were collected during the follicular and luteal phases of 64 premenopausal women (mean age: 45.5 ± 5.04 years) first diagnosed with BC and 49 matched healthy women (mean age: 43.7 ± 8.80 years) to detect the level of estrogens. Blood samples were also collected from 140 premenopausal women (mean age: 43.3 ± 6.24 years) first diagnosed with BC and 140 matched healthy women (mean age: 40.2 ± 3.52 years) to extract DNA and analyze SNP genotypes. All samples and related data were obtained from the Affiliated Hospital of Xuzhou Medical University, Xuzhou, China, from June 2017 to May 2019. Patients with BC were enrolled from the Department of Nail Surgery, whereas healthy subjects were enrolled from the physical examination center. Blood samples were collected before any therapy.
The enrollment criteria were as follows: no history of smoking; BMI ranging from 19 to 26 kg/m²; and no history of chemotherapy, radiotherapy, or estrogen-related endocrine therapy before blood sample collection. Baseline characteristics of the patients are shown in Table 1. This protocol was approved by the Ethics Committee of the Affiliated Hospital of Xuzhou Medical University. Written informed consent was obtained from each subject before the study.
Quantification of estrogens using the LC-MS/MS method
The LC-MS/MS analysis was performed according to our previously published method.
DNA was extracted from peripheral whole blood with a Tiangen DNA extraction kit (Tiangen Biotech, Beijing, China). The main metabolic enzymes CYP19A1, CYP1A1, CYP1B1, HSD17B1, COMT, UGTs, and SULTs are involved in the regulation of estrogen metabolism. Based on a previous study and a pharmacogenomic database, we selected, for each metabolic enzyme gene, loci that are common or that affect the function and activity of the enzyme. We also included GWAS-identified breast cancer-related SNPs reported in a previous study. All selected SNPs were potentially functional variants, with minor allele frequencies (MAFs) of more than 10%. The allelic discrimination of the following SNPs was performed by SNaPshot assay (Applied Biosystems Inc., Waltham, MA, USA): estrogen metabolic enzyme gene SNPs including CYP19A1 (rs700519), CYP1A1 (rs1048943), CYP1A1 (rs4646903), CYP1B1 (rs1056827), CYP1B1 (rs1056836), COMT (rs4680), HSD17B1 (rs605059), SULT1A1 (rs1042028), and UGT2B7 (rs7439366) and the GWAS-identified high-risk breast cancer gene SNPs including ZNF365 (rs10822013), FGFR2 (rs2981579), RAD51B (rs3784099), TOX3 (rs3803662), MAP3K1 (rs889312), and HCN1 (rs981782). The allelic discrimination analysis was performed by Genesky Biotechnologies Inc., Shanghai, China (http://www.geneskybiotech.com). Basic information on the SNPs can be found in Table 2. To assure genotyping quality, detailed quality control (QC) procedures, including the duplicate identification of genotypes and a Hardy–Weinberg equilibrium (HWE) test, were carried out. All 15 SNPs were successfully genotyped in 280 subjects with call rates of 100%.
SPSS 22.0 software was used to perform statistical analysis. We used the mean ± SEM to express all estrogen data and Student’s t-test to test differences between the two groups. Multivariate analysis was performed using SIMCA 14.0 software.
HWE was examined among controls using a goodness-of-fit chi-squared test. The odds ratio (OR) and 95% confidence interval (CI) were calculated using a logistic regression model to assess the association between the SNPs and the risk of breast cancer.
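As an illustration of the HWE check, the following minimal Python sketch runs a goodness-of-fit chi-squared test (1 degree of freedom) on the genotype counts of one biallelic SNP. The function name `hwe_chi_square` and the genotype counts are hypothetical and are not taken from Table 2; the study itself used SPSS, so this is only a sketch of the underlying calculation.

```python
import math

def hwe_chi_square(n_aa, n_ab, n_bb):
    """Goodness-of-fit chi-squared test for Hardy-Weinberg equilibrium.

    Takes observed genotype counts (AA, AB, BB) for one biallelic SNP and
    returns (chi2, p_value) for a test with 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # estimated frequency of allele A
    q = 1.0 - p                      # estimated frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical genotype counts for 140 controls (illustrative only).
chi2, p_value = hwe_chi_square(70, 56, 14)
in_equilibrium = p_value > 0.05  # same threshold as used in the text
```

A P value above 0.05 means the observed genotype frequencies do not deviate detectably from Hardy–Weinberg proportions.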
We established a PRS to estimate the multigene contribution of estrogen-metabolic enzyme gene loci to breast cancer susceptibility. The PRS was built from SNPs marginally associated with breast cancer risk in the per-allele model. For SNPs in strong linkage disequilibrium located on the same gene or chromosome, we chose the variant with the lowest P value in the per-allele model as the candidate. The basic formula is as follows:
$$\mathrm{PRS} = \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_k x_k + \cdots + \beta_n x_n$$
where βk is the per-allele OR for breast cancer associated with the minor allele for SNP k, xk is the number of minor alleles for the same SNP (0, 1, or 2), and n is the total number of SNPs in the model.
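The weighted sum described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. It assumes that each weight is the natural logarithm of the per-allele OR, which is the common convention for PRS construction; the text describes βk simply as the per-allele OR, so the log transform is an assumption here. The function name `polygenic_risk_score`, the OR values, and the genotype are all hypothetical.

```python
import math

def polygenic_risk_score(allele_counts, per_allele_or):
    """Sum of beta_k * x_k over all SNPs in the model.

    allele_counts: dict mapping SNP id -> number of minor alleles (0, 1, or 2)
    per_allele_or: dict mapping SNP id -> per-allele odds ratio
    Assumption: beta_k = ln(OR_k), so a neutral SNP (OR = 1) contributes 0.
    """
    score = 0.0
    for snp, odds_ratio in per_allele_or.items():
        count = allele_counts[snp]
        if count not in (0, 1, 2):
            raise ValueError(f"allele count for {snp} must be 0, 1, or 2")
        score += math.log(odds_ratio) * count
    return score

# Hypothetical per-allele ORs (NOT the values reported in Table 4).
example_or = {"rs1048943": 1.5, "rs1056827": 2.0, "rs4680": 0.8}
# Hypothetical genotype of one subject, coded as minor-allele counts.
example_genotype = {"rs1048943": 2, "rs1056827": 1, "rs4680": 0}

score = polygenic_risk_score(example_genotype, example_or)
```

With log-OR weights, risk alleles raise the score and protective alleles lower it, and a subject carrying no minor alleles scores 0.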
Disorders of estrogen expression in breast cancer patients
Using LC-MS/MS quantitative analysis, we measured the levels of 11 serum estrogens and metabolites in 64 patients with premenopausal BC (mean age: 45.5 ± 5.04 years) and 49 matched normal controls (NC; mean age: 43.7 ± 8.80 years). There was no significant difference in age between the BC group and the NC group. As shown in Fig. 1a, compared with the NC group, the BC group exhibited significantly increased estrogen levels, especially E1, E2, 2-OHE2, 4-OHE2 (P < 0.01) and 2/4-OHE1 (P < 0.05). OPLS-DA, a supervised multivariate method, was used to identify potential changes in estrogen homeostasis between the two groups. As shown in Fig. 1b, the metabolic profile of the NC group was clearly separated from that of the BC group, indicating a considerable difference in metabolites between the two groups. The potential biomarkers with VIP values higher than 1.0 in the OPLS-DA model were E1, E2, 2-OHE2, 4-OHE2 and 2/4-OHE1 in the serum of BC patients (Fig. 1c). Overall, these results supported the view that the disturbance of estrogen homeostasis was closely related to increased risk of BC.
Cohort description and Hardy–Weinberg equilibrium testing
We enrolled 140 patients first diagnosed with breast cancer and 140 matched healthy women in this study. The mean age at diagnosis (for patients with cancer) was 43.3 ± 6.24 years, and the mean age of healthy women at enrollment was 40.2 ± 3.52 years. Blood samples were collected from these participants to extract DNA and analyze the SNP genotype. There was no significant difference in age between the BC group and NC group. The chi-square test was used to test HWE, and P > 0.05 was taken to indicate that the enrolled samples were representative of the population. As seen in Table 2, all polymorphisms were in Hardy–Weinberg equilibrium, which indicated that the observed genotype frequencies of the case and control groups were representative.
Association of estrogen-metabolizing enzyme genetic variants with breast cancer risk
Table 3 shows the univariate analysis and ORs for each metabolizing enzyme SNP. The genotype distributions of CYP1A1 rs1048943 (P = 0.007), CYP1B1 rs1056827 (P = 0.004), CYP1B1 rs1056836 (P = 0.002) and SULT1A1 rs1042028 (P = 0.029) differed significantly between groups. Compared with the wild-type genotypes of CYP1A1 rs1048943 (TT) or SULT1A1 rs1042028 (CC), the heterozygous variant genotypes of CYP1A1 rs1048943 (TC) or SULT1A1 rs1042028 (CT) showed a significantly higher risk of breast cancer, with ORs of 2.37 (95% confidence interval [CI] = 1.27–4.43) and 2.21 (95% CI = 1.20–4.05), respectively. Compared with the wild-type genotype of CYP1B1 rs1056827 (CC), the homozygous variant genotype (AA) showed a significantly higher risk of breast cancer, yielding an OR of 6.90 (95% CI = 1.50–31.76). Compared with the wild-type genotype of CYP1B1 rs1056836 (GG), the heterozygous variant genotype significantly reduced the risk of breast cancer, yielding an OR of 0.37 (95% CI = 0.21–0.67). In addition, no associations with breast cancer risk were observed for the estrogen metabolic enzyme gene SNPs CYP19A1 (rs700519), HSD17B1 (rs605059), COMT (rs4680), or UGT2B7 (rs7439366) or the GWAS-selected SNPs ZNF365 (rs10822013), FGFR2 (rs2981579), RAD51B (rs3784099), TOX3 (rs3803662), MAP3K1 (rs889312), or HCN1 (rs981782).
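To show how a genotype-level OR and its 95% CI of the kind reported above can be obtained, the sketch below computes an unadjusted OR from a 2 × 2 table with a Wald interval on the log scale. For a single binary predictor, this is equivalent to the estimate from an unadjusted logistic regression. The function name `odds_ratio_ci` and the counts are hypothetical and are not the Table 3 data; any covariate adjustment used in the study is not reproduced here.

```python
import math

def odds_ratio_ci(case_variant, case_wild, ctrl_variant, ctrl_wild, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    Compares carriers of a variant genotype with wild-type carriers.
    z = 1.96 gives an approximate 95% interval.
    """
    odds_ratio = (case_variant * ctrl_wild) / (case_wild * ctrl_variant)
    # Standard error of ln(OR) from the four cell counts.
    se = math.sqrt(1 / case_variant + 1 / case_wild
                   + 1 / ctrl_variant + 1 / ctrl_wild)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

# Hypothetical counts: heterozygous vs. wild-type in cases and controls.
or_value, ci_low, ci_high = odds_ratio_ci(40, 90, 20, 110)
```

An interval that excludes 1 corresponds to a significant association at the 5% level; small cell counts produce wide intervals, as seen for the AA genotype of rs1056827 in the text.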
PRS breast cancer risk prediction model establishment and evaluation
The binary logistic regression method was used to calculate the OR of the per-allele model, and the detailed results are shown in Table 4. We used SNPs in the GWAS-identified high-risk breast cancer genes, namely, ZNF365 (rs10822013), FGFR2 (rs2981579), RAD51B (rs3784099), TOX3 (rs3803662), MAP3K1 (rs889312), and HCN1 (rs981782), to create PRS model 1 (M1) in the per-allele model. On the basis of M1, we also added estrogen metabolic enzyme gene SNPs, namely, CYP1A1 (rs1048943), CYP1B1 (rs1056827), SULT1A1 (rs1042028), CYP19A1 (rs700519), COMT (rs4680), HSD17B1 (rs605059), and UGT2B7 (rs7439366), to create PRS model 2 (M2). For SNPs in strong linkage disequilibrium located on the same gene or chromosome, we chose the variant with the lowest P value in CYP1A1 (rs1048943). Because rs1056836 is a protective locus, we chose the risk variant rs1056827 in CYP1B1. PRS values are expressed as the mean ± SEM. Under M1 and M2, the PRS data of the two groups followed a normal distribution; therefore, we used an independent sample t-test to evaluate the difference between the two groups. As shown in Table 5 and Fig. 2, the PRS values in the NC group were significantly lower than those in the BC group under M2 (P = 4.9 × 10⁻⁵); however, there was no significant difference between NC and BC under M1 (P = 0.17). Finally, the ROC curve was calculated to evaluate how well the risk models discriminated between women with and without breast cancer (Fig. 3). The area under the ROC curve (AUC) for M2 was 62.18% (95% CI = 56–69%), whereas that for M1 was only 54.56% (95% CI = 48–61%). Therefore, the accuracy of M2 in breast cancer risk identification was better than that of M1.
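The two evaluation steps above (group comparison of PRS values and discrimination by AUC) can be sketched with the Python standard library. This is an illustrative sketch only: the study used SPSS, the PRS values below are hypothetical, and the AUC is computed through its equivalence to the Mann–Whitney statistic, that is, the probability that a randomly chosen case scores higher than a randomly chosen control, with ties counted as one half. The function names `student_t` and `auc` are assumptions made for this example.

```python
import math
from statistics import mean, variance

def student_t(group_a, group_b):
    """Independent-sample Student's t statistic with pooled variance."""
    na, nb = len(group_a), len(group_b)
    pooled = ((na - 1) * variance(group_a)
              + (nb - 1) * variance(group_b)) / (na + nb - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled * (1 / na + 1 / nb))

def auc(case_scores, control_scores):
    """Area under the ROC curve via pairwise comparison of scores."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5  # ties count as half a win
    return wins / (len(case_scores) * len(control_scores))

# Hypothetical PRS values (not the study data).
bc_prs = [1.2, 0.9, 1.5, 0.7]
nc_prs = [0.8, 0.6, 1.0, 0.9]

t_stat = student_t(bc_prs, nc_prs)
auc_value = auc(bc_prs, nc_prs)
```

An AUC of 0.5 corresponds to no discrimination and 1.0 to perfect separation, which is the scale on which the 54.56% and 62.18% values for M1 and M2 should be read.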
Breast cancer (BC) is an estrogen-dependent tumor, and the occurrence of BC is closely related to the imbalance of estrogen homeostasis. The accumulation of estrogen and its toxic metabolites in vivo is a significant risk factor for BC development. Different types of estrogens have different physiological and pathological activities and can play an important role in the process of cancer development through different mechanisms. Parent estrogens are postulated to promote tumorigenesis directly through the stimulation of the estrogen receptor (ER). The endogenous conversion of estrogen to genotoxic metabolites has been reported as an alternative, potentially ER-independent mechanism for estrogen-dependent breast tumorigenesis. Catechol estrogens can form adducts with DNA, causing gene mutations and producing direct genotoxicity. Methoxyestrogens, including 2-methoxyestradiol, have been shown to inhibit carcinogenesis by suppressing cell proliferation and estrogen oxidation due to their effects on microtubule stabilization.
In this study, the LC-MS/MS quantitative analysis method was used to determine the serum estrogens in the BC group and NC group. Comparing serum estrogen levels in the follicular and luteal phases between premenopausal breast cancer patients and healthy female volunteers, we found that the levels of parent and hydroxylated estrogens in the BC group were significantly higher than those in the NC group, which indicated that estrogen metabolism disorder is closely related to the occurrence and development of breast cancer. Using OPLS-DA, we also identified E1, E2, 4-OHE2, 2-OHE2, and 2/4-OHE1 as potential BC-related markers. This result was consistent with the epidemiologic characteristics of patients with BC.
A large number of studies have confirmed that breast cancer exhibits heritability [27, 28]. However, high-risk genes such as BRCA1 and BRCA2 account for less than 15% of breast cancer cases [29, 30], which suggests that numerous breast cancer-related risk genes have not been discovered, and these gene polymorphisms influence susceptibility to breast cancer.
Estrogen is an important risk factor for breast cancer. However, no research has incorporated estrogens into the breast cancer risk prediction model. A possible major reason is that there is no clinically effective estrogen evaluation method because the steady state of estrogen is affected by various physiological and pathological factors, such as menstrual cycle fluctuations. However, estrogen homeostasis is regulated by various metabolic enzymes. Therefore, we believe that estrogen metabolic enzyme gene polymorphisms are closely related to estrogen homeostasis and the occurrence and development of breast cancer. In this study, univariate logistic regression analysis showed that CYP1A1, CYP1B1, and SULT1A1 gene polymorphisms are closely related to the occurrence of breast cancer. It is worth noting that these gene polymorphisms are also associated with other estrogen-dependent tumors such as endometrial cancer and ovarian cancer. Hirata et al. found that SULT1A1 rs9282861 (rs1042028) was related to endometrial cancer. A meta-analysis of the association between CYP1A1 gene polymorphism and ovarian cancer risk showed that the Ile/Val polymorphism (rs1048943) was significantly associated with ovarian cancer, with homozygous carriers at increased risk (Val/Val vs. Ile/Ile: OR = 2.64; 95% CI: 1.63–4.28).
CYP1A1 and CYP1B1 are the major phase I drug metabolism enzymes that catalyze the hydroxylation of estrogens. The increasing polarity of estrogens may be related to the risk of breast cancer. Our experiments also supported this view. In this study, we found that the variant allele of CYP1B1 rs1056836 was associated with a reduced risk of breast cancer, although the exact mechanism of this protection is not clear. We assumed that the heterozygous genotype of CYP1B1 rs1056836 (GC vs. GG: OR = 0.37, 95% CI: 0.21–0.67, P = 0.001) may result in decreased function of the CYP1B1 enzyme, reducing the production of 4-hydroxy estrogen and of catechol estrogen-3,4-quinone (CE-3,4-Q), which forms adducts with DNA. At the same time, this study also showed that the variant alleles of CYP1A1 rs1048943 (TC vs. TT: OR = 2.37, 95% CI: 1.27–4.43, P = 0.003) and CYP1B1 rs1056827 (AA vs. CC: OR = 6.90, 95% CI: 1.50–31.76, P = 0.001) are closely related to the risk of breast cancer, which is consistent with most research [34, 35]. A possible reason is that these variants increase the activity of the CYP1A1 and CYP1B1 enzymes, thereby increasing the production of hydroxylated estrogens or the individual’s susceptibility to estrogen.
SULTs catalyze the sulfate conjugation of a broad range of substrates and play an important role in the metabolism of endogenous and exogenous compounds, including thyroid and steroid hormones, neurotransmitters, drugs and procarcinogens. SULTs catalyze the sulfation of estrogens (E1 and E2) and their metabolites (such as catechol estrogens) and eliminate estrogenic activity, because sulfated estrogens cannot bind to estrogen receptors (ERs). Sulfation also promotes the rapid excretion of sulfated estrogens from cells, which can reduce the level of estrogen exposure in the circulation and target tissues. SULT1A1 rs1042028 is the most widely studied polymorphism of this gene. Its variant allele can reduce enzyme activity and thermal stability, resulting in increased estrogen accumulation and increased individual susceptibility to breast cancer. In this study, heterozygous carriers of rs1042028 had a 2.21-fold higher risk of breast cancer than wild-type carriers. This is consistent with the results of multiple studies [38, 39].
Previous studies investigated associations between the PRS of multiple SNPs and breast cancer risk to study the cumulative effect of genes on the disease. Mavaddat et al. constructed a 77-SNP PRS for breast cancer and found a threefold increase in risk when comparing women in the highest 1% of the polygenic score with those in the middle quintile. Harlid et al. investigated the combined effect of ten low-penetrance SNPs on breast cancer and found that the cumulative effect was strongly correlated with breast cancer. However, most PRS research is based on samples from Caucasian populations. Although Sueta, Chan and others have conducted similar studies in Asian populations, the evidence is still limited [7, 41]. To date, there have been no reports on the establishment of a breast cancer PRS risk prediction model from the perspective of estrogen-metabolizing enzymes. Thus, a multigene PRS model including estrogen metabolic enzyme gene SNPs and GWAS-selected SNPs was constructed in this study to evaluate the comprehensive effects of multiple estrogen metabolic enzyme SNPs on breast cancer.
In this study, we evaluated possible relationships between breast cancer risk and both estrogen metabolic enzyme gene SNPs and GWAS-identified gene SNPs in an Asian population. The GWAS-identified SNPs were not associated with breast cancer risk in the per-allele model or dominant model in our study. This finding was inconsistent with a previous study. Further, we established PRS model 1 (M1), including only GWAS-identified SNPs, and PRS model 2 (M2), which added estrogen metabolic enzyme gene SNPs to M1. By calculating the PRS of each individual under M1 and M2 and performing a t-test on the PRS values of the BC and NC groups, we found that the P value of M2 (4.9 × 10⁻⁵) was far lower than that of M1 (0.17). Moreover, the AUC of M2 (62.18%) was higher than that of M1 (54.56%). Therefore, the model constructed by adding estrogen metabolic enzyme gene SNPs had a good ability in breast cancer risk prediction, and the accuracy was greatly improved.
There are several limitations of this study that should be noted. First, the sample size was relatively small. Only 140 premenopausal women first diagnosed with BC and 140 matched healthy women were recruited based on our criteria; thus, we did not have enough statistical power to detect the effect of the genetic variants on some of the parameters. Second, because funding was limited, the study did not include a comprehensive set of metabolic enzymes or an adequate number of breast cancer risk gene loci. For these reasons, the AUC was small, and the model has not been validated in an independent sample. In the future, we will study additional estrogen-metabolizing enzyme genes and other breast cancer risk genes. We will also include recognized breast cancer risk factors such as age at evaluation, age at menarche, age at first live birth, race, number of previous breast biopsies, and family history of breast cancer and construct a breast cancer risk prediction model composed of phenotype and genotype to obtain a higher AUC. In addition, the sample size needs to be further expanded, preferably with data from different ethnic groups.
Estrogens and related metabolic enzyme gene polymorphisms are closely related to BC. The model constructed by adding estrogen metabolic enzyme gene SNPs has good predictive ability for breast cancer risk, and the accuracy is greatly improved compared with that of the PRS model that only includes GWAS-identified gene SNPs.
Availability of data and materials
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Abbreviations
BMI: Body mass index
ESI: Electrospray ionization source
FRR: Familial relative risk
GWAS: Genome-wide association study
LC-MS/MS: Liquid chromatography-tandem mass spectrometry
M1: PRS model 1
M2: PRS model 2
MRM: Multiple reaction monitoring
OPLS-DA: Orthogonal Partial Least Squares Discriminant Analysis
PRS: Polygenic risk score
ROC: Receiver operating characteristic curve
SNPs: Single nucleotide polymorphisms
WHO: World Health Organization
Heer E, Harper A, Escandor N, Sung H, McCormack V, Fidler-Benaoudia MM. Global burden and trends in premenopausal and postmenopausal breast cancer: a population-based study. Lancet Glob Health. 2020;8(8):e1027–37.
Muhammad N, Steele R, Isbell TS, Philips N, Ray RB. Bitter melon extract inhibits breast cancer growth in preclinical model by inducing autophagic cell death. Oncotarget. 2017;8(39):66226–36.
Mohammad N, Malvi P, Meena AS, et al. Cholesterol depletion by methyl-β-cyclodextrin augments tamoxifen induced cell death by enhancing its uptake in melanoma. Mol Cancer. 2014;13:204.
Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst. 1989;81(24):1879–86.
Crispo A, D'Aiuto G, De Marco M, et al. Gail model risk factors: impact of adding an extended family history for breast cancer. Breast J. 2008;14(3):221–7.
Bonache S, Gutierrez-Enriquez S, Tenés A, Masas M, Balmaña J, Diez O. Mutation analysis of the BCCIP gene for breast cancer susceptibility in breast/ovarian cancer families. Gynecol Oncol. 2013;131(2):460–3.
Chan M, Ji SM, Liaw CS, et al. Association of common genetic variants with breast cancer risk and clinicopathological characteristics in a Chinese population. Breast Cancer Res Treat. 2012;136(1):209–20.
Möller S, Mucci LA, Harris JR, et al. The heritability of breast cancer among women in the Nordic Twin Study of Cancer. Cancer Epidemiol Biomark Prev. 2016;25(1):145–50.
Mavaddat N, Pharoah PD, Michailidou K, et al. Prediction of breast cancer risk based on profiling with common genetic variants. J Natl Cancer Inst. 2015;107(5):djv036.
Warren Andersen S, Trentham-Dietz A, Gangnon RE, et al. The associations between a polygenic score, reproductive and menstrual risk factors and breast cancer risk. Breast Cancer Res Treat. 2013;140(2):427–34.
Reeves GK, Travis RC, Green J, et al. Incidence of breast cancer and its subtypes in relation to individual and multiple low-penetrance genetic susceptibility loci. JAMA. 2010;304(4):426–34.
Tam V, Patel N, Turcotte M, Bossé Y, Paré G, Meyre D. Benefits and limitations of genome-wide association studies. Nat Rev Genet. 2019;20(8):467–84.
Warner M, Gustafsson JA. On estrogen, cholesterol metabolism, and breast cancer. N Engl J Med. 2014;370(6):572–3.
Zhang Y, Gaikwad NW, Olson K, et al. Cytochrome P450 isoforms catalyze formation of catechol estrogen quinones that react with DNA. Metabolism. 2007;56:887–94.
Kiruthiga PV, Kannan MR, Saraswathi C, et al. CYP1A1 gene polymorphisms: lack of association with breast cancer susceptibility in the southern region (Madurai) of India. Asian Pac J Cancer Prev. 2011;12:2133–8.
Crooke PS, Ritchie MD, Hachey DL, et al. Estrogens, enzyme variants and breast cancer: a risk model. Cancer Epidemiol Biomark Prev. 2006;15:1620–9.
Ghisari M, Eiberg H, Long M, et al. Polymorphisms in phase I and phase II genes and breast cancer risk and relations to persistent organic pollutant exposure: a case-control study in Inuit women. Environ Health. 2014;13:19.
Qiu J, Du Z, Liu J, et al. Association between polymorphisms in estrogen metabolism genes and breast cancer development in Chinese women: a prospective case-control study. Medicine. 2018;97(47):e13337.
Sangrajrang S, Sato Y, Sakamoto H, et al. Genetic polymorphisms of estrogen metabolizing enzyme and breast cancer risk in Thai women. Int J Cancer. 2009;125(4):837–43.
Ghisari M, Long M, Røge DM, et al. Polymorphism in xenobiotic and estrogen metabolizing genes, exposure to perfluorinated compounds and subsequent breast cancer risk: a nested case-control study in the Danish National Birth Cohort. Environ Res. 2017;154:325–33.
Zhao F, Wang X, Wang Y, et al. The function of uterine UDP-glucuronosyltransferase 1A8 (UGT1A8) and UDP-glucuronosyltransferase 2B7 (UGT2B7) is involved in endometrial cancer based on estrogen metabolism regulation. Hormones (Athens). 2020;19(3):403–12.
Hsieh YC, Tu SH, Su CT, et al. A polygenic risk score for breast cancer risk in a Taiwanese population. Breast Cancer Res Treat. 2017;163(1):131–8.
Eliassen AH, Spiegelman D, Xu X, Keefer LK, Veenstra TD, Barbieri RL, et al. Urinary estrogens and estrogen metabolites and subsequent risk of breast cancer among premenopausal women. Cancer Res. 2012;72:696–706.
Newbold RR, Liehr JG. Induction of uterine adenocarcinoma in CD-1 mice by catechol estrogens. Cancer Res. 2000;60:235–7.
Nehal J, Laldmni, Mohamadi A, et al. 2-Methoxyestradiol, a promising anticancer agent. Pharmacotherapy. 2003;23(2):165–72.
Sampson JN, Falk RT, Schairer C, et al. Association of estrogen metabolism with breast cancer risk in different cohorts of postmenopausal women. Cancer Res. 2017;77:918–25.
Blazer KR, Slavin T, Weitzel JN. Increased reach of genetic cancer risk assessment as a tool for precision management of hereditary breast cancer. JAMA Oncol. 2016;2:723–4.
Doherty J, Bonadies DC, Matloff ET. Testing for hereditary breast cancer: panel or targeted testing? Experience from a clinical cancer genetics practice. J Genet Counsel. 2015;24:683–7.
Bogdanova N, Helbig S, Dork T. Hereditary breast cancer: ever more pieces to the polygenic puzzle. Hered Cancer Clin Pract. 2013;11:12.
El Saghir NS, Zgheib NK, Assi HA, et al. BRCA1 and BRCA2 mutations in ethnic Lebanese Arab women with high hereditary risk breast cancer. Oncologist. 2015;20:357–64.
Hirata H, Hinoda Y, Okayama N, et al. CYP1A1, SULT1A1, and SULT1E1 polymorphisms are risk factors for endometrial cancer susceptibility. Cancer. 2008;112(9):1964–73.
Huang M, Chen Q, Xiao J, Zhao X, Liu C. CYP1A1 Ile462Val is a risk factor for ovarian cancer development. Cytokine. 2012;58(1):73–8.
Gajjar K, Martin-Hirsch PL, Martin FL. CYP1B1 and hormone-induced cancer. Cancer Lett. 2012;324:13–30.
Martínez-Ramírez OC, Pérez-Morales R, Castro C, et al. Polymorphisms of catechol estrogens metabolism pathway genes and breast cancer risk in Mexican women. Breast. 2013;22:335–43.
Reding KW, Weiss NS, Chen C, et al. Genetic polymorphisms in the catechol estrogen metabolism pathway and breast cancer risk. Cancer Epidemiol Biomark Prev. 2009;18:1461–7.
Xiao J, Zheng Y, Zhou Y, Zhang P, Wang J, Shen F, et al. Sulfotransferase SULT1A1 Arg213His polymorphism with cancer risk: a meta-analysis of 53 case–control studies. PLoS One. 2014;9(9):e106774.
Nagar S, Walther S, Blanchard RL. Sulfotransferase (SULT) 1A1 polymorphic variants *1, *2, and *3 are associated with altered enzymatic activity, cellular phenotype, and protein degradation. Mol Pharmacol. 2006;69:2084–92.
Lee H, Wang Q, Yang F, Tao P, Li H, Huang Y, et al. SULT1A1 Arg213His polymorphism, smoked meat, and breast cancer risk: a case-control study and meta-analysis. DNA Cell Biol. 2012;31(5):688–99.
Forat-Yazdi M, Jafari M, Kargar S, Abolbaghaei SM, Nasiri R, et al. Association between SULT1A1 Arg213His (rs9282861) polymorphism and risk of breast cancer: a systematic review and meta-analysis. J Res Health Sci. 2017;17(4):e00396.
Harlid S, Ivarsson MI, Butt S, et al. Combined effect of low-penetrant SNPs on breast cancer risk. Br J Cancer. 2012;106(2):389–96.
Sueta A, Ito H, Kawase T, et al. A genetic risk predictor for breast cancer using a combination of low-penetrance polymorphisms in a Japanese population. Breast Cancer Res Treat. 2012;132(2):711–21.
This study was supported by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China [No. 18KJA350002]; the Natural Science Foundation of Jiangsu Province [No. BK20181470]; the Science and Technology Foundation of Xuzhou [No. KC18044]; the Natural Science Foundation of China [No. 81403001]; the Six Talent Peaks Project in Jiangsu Province [YY-045]; the Qing Lan Project in Jiangsu Province; the Provincial Commission of Health and Family Planning in Jiangsu Province [No. H2017079]; the Natural Science Foundation General Project of Jiangsu Province [No. BK20171173]; and the Science and Technology Planning Project of Jiangsu Province [No. BE2019636]. The role of the funding bodies was to provide the essential funding to pay for personnel effort.
Ethics approval and consent to participate
All procedures performed in studies involving human participants followed the ethical standards of the institutional and/or national research committee and complied with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. This study is registered on the Clinical Test Public Management Platform (Registration number: ChiCTR1800014658). The protocol was approved by the Ethics Committee of the Affiliated Hospital of Xuzhou Medical University (XYFY2017-KL008–01). Informed consent was obtained from all individual participants included in the study.
Consent for publication
Competing interests
The authors declare that they have no competing interests.
About this article
Cite this article
Zhao, F., Hao, Z., Zhong, Y. et al. Discovery of breast cancer risk genes and establishment of a prediction model based on estrogen metabolism regulation. BMC Cancer 21, 194 (2021). https://doi.org/10.1186/s12885-021-07896-4
- Breast cancer
- Risk prediction
- Estrogen-metabolizing enzyme
- Gene polymorphism
- Polygenic risk score