- Research article
- Open Access
- Open Peer Review
Coordinated expression and genetic polymorphisms in Grainyhead-like genes in human non-melanoma skin cancers
BMC Cancer volume 18, Article number: 23 (2018)
The Grainyhead-like (GRHL) transcription factors have been linked to many different types of cancer. However, no previous study has attempted to investigate potential correlations in expression of different GRHL genes in this context. Furthermore, there is very little information concerning damaging mutations and/or single nucleotide polymorphisms in GRHL genes that may be linked to cancer.
DNA and RNA were extracted from human non-melanoma skin cancers (NMSC) and adjacent normal tissues (n = 33 pairs of samples). The expression of GRHL genes was measured by quantitative real time PCR. Regulation of GRHL expression by miRNA was studied using cell transfection methods and dual-luciferase reporter system. Targeted deep sequencing of GRHL genes in tumor samples and control tissues were employed to search for mutations and single nucleotide polymorphisms. Single marker rs141193530 was genotyped with pyrosequencing in additional NMSC replication cohort (n = 176). Appropriate statistical and bioinformatic methods were used to analyze and interpret results.
We discovered that the expression of two genes – GRHL1 and GRHL3 – is reduced in a coordinated manner in tumor samples, in comparison to the control healthy skin samples obtained from the same individuals. It is possible that both GRHL1 and GRHL3 are regulated, at least to some extent, by different strands of the same oncogenic microRNA – miR-21, what would at least partially explain observed correlation. No de novo mutations in the GRHL genes were detected in the examined tumor samples. However, some single nucleotide polymorphisms in the GRHL genes occur at significantly altered frequencies in the examined group of NMSC patients.
Non-melanoma skin cancer growth is accompanied by coordinated reduced expression of epidermal differentiation genes: GRHL1 and GRHL3, which may be regulated by miR-21–3p and -5p, respectively. Some potentially damaging single nucleotide polymorphisms in GRHL genes occur with altered frequencies in NMSC patients, and they may in particular impair the expression of GRHL3 gene or functioning of encoded protein. The presence of these polymorphisms may indicate an increased risk of NMSC development in affected people.
Redundancy is recognized as one of the greatest challenges in developing new approaches to combat cancer . One example of redundancy concerns enzyme function, where enzymes encoded by different genes can catalyze the same chemical reaction. Another example of such phenomenon is when different transcription factors can regulate the expression of common target genes.
In mammals, proper structure and regeneration of various epithelia are dependent on three members of the Grainyhead-like (GRHL) family of transcription factors, which are currently termed GRHL1, GRHL2 and GRHL3. It was already shown that there are many genetic interactions between different Grhl genes. The best studied example concerns the roles of Grhl2 and Grhl3 in neural tube closure in mouse models . In that case, the two genes exhibit partially redundant roles, without being fully functionally equivalent, what is explained by partially overlapping target gene specificity, as GRHL2 and GRHL3 share some of their target genes, while other target genes are unique to each factor .
In the maintenance of adult skin barrier, genetic redundancy involves the Grhl1 and Grhl3 genes, which was recently demonstrated using mouse models. The epidermis of Grhl1-null mice is impermeable to extrinsic dyes and these mice are viable . Similarly, Grhl3 conditional knockout mice, in which the Grhl3 gene has been selectively inactivated in the epidermis after birth, display no apparent skin barrier defects and survive well into adulthood . However, when both Grhl1 and Grhl3 genes are simultaneously inactivated in the epidermis in adult mice, this leads to a complete loss of barrier impermeability which is lethal . This phenotype can be explained by the regulation of expression of different cross-linking enzymes by different GRHL transcription factors.
Transglutaminases (TGM) are enzymes which catalyze the formation of extensively cross-linked, insoluble protein polymers to establish the epidermal cornified envelope . In the mouse epidermis, GRHL3 regulates the expression of Tgm1 while GRHL1 regulates the expression of Tgm5 and, to a lesser extent, Tgm1. Consequently, in Grhl1-null mice and Grhl3 conditional knockout mice the total transglutaminase activity remains sufficiently high to ensure impermeability of skin barrier. However, concomitant loss of both Grhl1 and Grhl3 in the epidermis results in greatly reduced levels of both TGM1 and TGM5, markedly reduced total transglutaminase activity and loss of skin barrier impermeability which is incompatible with life .
All of the GRHLs have been implicated in various types of cancer (these findings have been summarized in a recent review ). Two of them, GRHL1 and GRHL3, have been linked to the development of cutaneous carcinoma of the skin. When subjected to the standard chemical skin carcinogenesis protocol, the Grhl1-null mice develop more squamous cell carcinomas (SCC), with an earlier onset, than their control wildtype littermates . Similar phenotype has been observed in conditional Grhl3 knockout mice with epidermis-specific ablation of Grhl3 after birth . However, the underlying molecular mechanisms are different: in the Grhl3 conditional knockout epidermis, there is a reduction in the level of tumor suppressor phosphatase and tensin homolog (PTEN), a direct target of GRHL3 regulation, which triggers dysregulation of the PI3K/AKT/mTOR signaling pathway ; in the Grhl1−/− mice, the underlying molecular mechanism involves aberrant keratinocyte terminal differentiation and subacute skin barrier defects which, by inducing tumor-promoting mild chronic inflammatory microenvironment in the skin, increase the risk of skin cancer [9, 10].
On the basis of the latest discoveries that: (i) the role of barrier genes in cutaneous carcinoma formation is well documented ; (ii) the Grhl genes are important for the epidermal barrier maintenance and they display a degree of redundancy in this context ; (iii) both Grhl1 and Grhl3 serve protective roles against the development of SCC of the skin in mouse models [5, 9]; we hypothesize that simultaneous downregulation of different GRHL genes, whose genetic redundancy and genetic interactions are evident, is relevant in skin carcinoma formation. Reduced expression of GRHL3 in human basal cell carcinoma (BCC) and squamous cell carcinoma (SCC) of the skin has already been observed, but other GRHL genes were not investigated in those studies [5, 12]. To test this hypothesis we assayed the expression of all the GRHL1–3 genes in two subtypes of human non-melanoma skin cancer samples – BCCs and SCCs. We also looked for common regulatory factors (such as microRNA) that may affect GRHLs expression. Furthermore, we searched for damaging mutations in all the GRHL1–3 genes in NMSC samples and in control healthy tissues from the same patients. In addition, we looked for single nucleotide polymorphisms in GRHL genes that might predispose human subjects to NMSC.
Patient cohort and sample collection
Thirty-three Polish patients with NMSC were enrolled in this study (22 with BCC and 11 with SCC). The patients were surgically treated in the Department of Soft Tissue/Bone Sarcoma and Melanoma in the Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology in Warsaw, Poland. The study was approved by local Bioethical Committee (permit number 13/2008). After surgical resection of an entire diseased area with a margin, small tissue samples were resected from both: the core of lesion and control normal skin from the border of excision and stored at −80 °C until use. General information about the examined group of patients and histopathological classification of carcinomas is provided in Additional file 1: Table S1.
Expression of GRHL genes in cancer samples
Total RNA was extracted from all collected fresh tissues (carcinoma and normal tissue from the border of excision) using RNeasy® Fibrous Tissue Mini Kit (Qiagen, cat. no. 74704) according to manufacturer’s instructions. The purity of RNA was determined using NanoDrop 2000 UV–vis spectrophotometer (Thermo Fisher Scientific). Assessment of RNA quality was performed with 2100 Bioanalyzer instrument and RNA 6000 Nano Kit (Agilent Technologies). Good quality samples (n = 27 pairs, cancer and normal tissue from the same patient) with RNA Integrity Number (RIN) higher than 5 were included in further analyses. RNA concentration was determined using the Qubit® 2.0 Fluorimeter and RNA BR Assay (Thermo Fisher Scientific, cat. no. Q10210).
Reverse transcription and real-time PCR
cDNA was synthesized from 250 ng of total RNA with SuperScript® VILO™ Master Mix (Invitrogen, cat. no. 11755050). The levels of expression of GRHL genes were assayed using Applied Biosystems chemistry: TaqMan® Fast Universal PCR Master Mix No AmpErase UNG (cat. no. 4352042) and TaqMan Gene Expression Assays (Assay ID: Hs01119372_m1 for GRHL1, Hs00227745_m1 for GRHL2, Hs00297962_m1 for GRHL3, Hs03929098_m1 for HPRT1 control). Real-time quantitative PCR was performed in 7900HT Fast Real-Time PCR System (Applied Biosystems). Gene expression was normalized to HPRT1 housekeeping gene and statistical differences were determined for relative expressions (2-∆∆Ct) with two-tailed Mann–Whitney U test with significance level < 0.05.
Interaction of miR-21–3p with 3′ untranslated region (UTR) of GRHL1
HaCaT cell line was bought from Cell Lines Service (cat. no. 300493). HEK293T cells were a kind gift from Ewelina Szymanska; the origin of this cell line is explained in a recent review article . HaCaT and HEK293T cells were routinely cultured in DMEM GlutaMAX medium (Thermo Fisher Scientific, cat. no. 10566–016) supplemented with 10% fetal bovine serum (Thermo Fisher Scientific, cat. no. 10270–106) and 100 IU/mL penicillin-streptomycin (Thermo Fisher Scientific, cat. no. 15140122) in a humidified incubator in an atmosphere of 95% air and 5% CO2 at 37 °C.
HaCaT cells treated with miR-21–3p mimic or hairpin inhibitor
60 nM miR-21–3p mimic or negative control (Ambion, mirVana cat. no. MC12979 and cat. No. 4464058) or 180 nM miR-21–3p hairpin inhibitor or negative control (Dharmacon, miRIDIAN cat. no. IH-301023-02 and IN-001005-01) were transfected into HaCaT cells using Lipofectamine 2000 or Lipofectamine 3000 transfection reagent (Invitrogen, cat. no. 11668019 or L3000–008). After 24 h cells were harvested with RNeasy Mini Kit (Qiagen, cat. no. 74104) or lysed in lysis buffer (50 mM Tris–HCl, pH 7.4, 150 mM NaCl, 1 mM EDTA, 1% Triton X-100 and 1× Complete™ Protease Inhibitor Cocktail from Roche) for Western blot analysis. cDNA was synthesized using High Capacity cDNA Reverse Transcription Kit (Applied Biosystems, cat. no. 4368814). Real Time PCR was carried out with TaqMan Probes (Assay ID: Hs01119372_m1 for GRHL1, Hs00227745_m1 for GRHL2, Hs00297962_m1 for GRHL3, Hs00427620_m1 for TBP and Hs99999901_s1 for 18S). Histone deacetylase 8 – HDAC8 (Assay ID: Hs00954359_m1), which expression is regulated by miR-21–3p , was used as a control target. Each transfection was performed in duplicates and repeated three times. Gene expression was normalized to TBP housekeeping gene and statistical differences for relative expression (2-∆∆Ct) were determined with two-tailed Student’s t-test. For Western blot analysis 20 μg of total protein was separated on 12% SDS-PAGE gels and subsequently transferred to PVDF membrane. Membranes were blocked with 5% non-fat milk and incubated with primary antibody in blocking buffer. The following antibodies were used for immunoblotting: anti-GRHL1 (Sigma, cat. no. HPA005798), anti-rabbit IgG, HRP-linked (Cell Signaling, cat. no. 7074) and anti-β-actin (Sigma, cat. no. A3854). Quantification of protein abundance was carried out by ImageJ software and the relevant bands were normalized against the corresponding β-actin levels. Statistical analysis was performed using Student’s t-test.
Luciferase 3’UTR reporter assay in HEK293T cells
HEK293T cells were plated in 24-well plates and transiently transfected with 50 ng of GRHL1_3’UTR, GRHL1_3’UTR_mut or negative control vector (GeneCopoeia, cat. no. HmiT055586-MT01 and cat. no. CmiT000001-MT01,) using Lipofectamine 2000 (Invitrogen, cat. no. 11668019) following the manufacturer’s protocol. At the same time, miR-21–3p mimics or negative control were co-transfected with reporter vector in a final concentration of 60 nM. The 3’UTR of human GRHL1 was mutated using a QuikChange Site-Directed Mutagenesis Kit (Agilent Technologies, cat. no. 200518). Cells were harvested 24 h after transfection using the reporter lysis buffer (Promega). Firefly and Renilla luciferase activities were analyzed at room temperature in a multimode reader Infinite M1000Pro (Tecan) using the Dual-Luciferase Reporter Assay System (Promega, cat. No. E1910). Relative luciferase activity was defined as the mean value of the firefly/Renilla normalized ratios obtained from 3 independent biological replicates. Statistical differences were indicated with 2-tailed Student’s t-test.
Expression of miR-21–3p and GRHL1 in cancer cell lines
Squamous cell carcinoma cell lines were purchased from the American Type Culture Collection: A-431 (cat. no. CRL-1555), CAL-27 (cat. no. CRL-2095), SCC-15 (cat. no. CRL-1623), SCC-25 (cat. no. CRL-1628). HaCaT cell line was bought from Cell Lines Service (cat. no. 300493). SCC-351 cell line was a kind gift from Agnieszka Kobielak; this cell line is also known as USC-HN1 and originates from the laboratory of Alan L. Epstein, Department of Pathology, Keck School of Medicine of the University of Southern California, Los Angeles, CA, USA, and was first described by the members of his laboratory . All cell lines were cultured as described above for the HaCaT cell line. RNA extraction, cDNA synthesis and TaqMan assays were carried out as described above.
Mutations and polymorphisms in GRHL genes
DNA extraction, target enrichment and next generation sequencing
DNA from 10 to 15 mg of homogenized tissues (Bio-Gen PRO homogenizer) was extracted using QIAamp® DNA Mini Kit (Qiagen, cat. no. 51304) according to manufacturer’s instructions. The purity of DNA was determined with the NanoDrop 2000 UV–vis spectrophotometer (Thermo Fisher Scientific). Accurate DNA concentration was quantitated using the Qubit® 2.0 Fluorimeter and dsDNA BR Assay (Invitrogen; cat. no. Q32853). SureDesign HaloPlex Standard Wizard was employed to select the custom probe sequences based on target regions of GRHL genes, according to the hg19/GRCh37 assembly from UCSC database ( Feb, 2009 version); the list of analyzable regions is provided in Additional file 1: Table S2. Capture of the targeted regions was performed with reagent set from a custom design HaloPlex Target Enrichment System 1–500 kb (Agilent Technologies), according to the Protocol Version D (August 2012). Briefly, the protocol consisted of the four following steps: 1) digestion of genomic DNA (250 ng) by restriction enzymes in eight parallel reactions; 2) hybridization resulting in circularization of digested DNA fragments with complementary probes which incorporated indexes and Illumina sequencing motifs; 3) capture of targeted DNA using streptavidin beads and ligation of circularized fragments; 4) PCR amplification of captured target libraries. Paired-end sequencing of samples was performed on a MiSeq instrument (Illumina) in Genomics Core Facility, European Molecular Biology Laboratory, Heidelberg, Germany.
NGS data processing
Sequence reads were resynchronized and trimmed to remove Illumina adapter sequences and only reads longer than 36 bp were kept. Sequences were further filtered with Trimmomatic  for low quality leading/trailing bases with phred quality lower than 20. Subsequently sequences were aligned to the human reference genome (version hg19) with Stampy . Additionally, the initial 5 bases were trimmed due to potential allele bias in case of single nucleotide polymorphisms (SNP) present in restriction enzyme cutting sites. SNPs were called with SAMtools mpileup algorithm with default parameters . Coverage cutoff value was 20.
SNPs of association
Distribution of SNPs in examined NMSC population was compared to the European population (data derived from 1000Genomes database ) and association p-value was determined with a χ2 – test or Fisher’s exact test. The p-value threshold for significance was adjusted with multiple-comparison correction (Bonferroni correction).
Predicted effects of SNPs in TF binding motifs
Mapping of SNPs to Encyclopedia of DNA Elements Consortium (ENCODE) regions  and transcription factor binding motifs was performed with the Nencki Genomics Database-Ensembl funcgen . The motif matches against DNA sequences were scored as the log-odds of the respective position weight matrix of the motif derived from the JASPAR database . For each SNP, the original reference sequence and the sequence modified by a single SNP were considered (interactions of multiple SNPs in the same motif were not considered). The differences between the log-odds scores were interpreted as the logarithm of the fold change of the binding energy. In cases where the odd scores of the mutated sequence were 0 (no motif match at all); the log odds differences were interpreted as infinite (log(x) approaches negative infinity as x approaches 0).
Genotyping of exonic SNPs rs141193530 and rs41268753 in a replication cohort
Formalin-fixed, paraffin-embedded (FFPE) materials from 177 Polish patients with non-melanoma skin cancers (144 with BCC and 32 with SCC) were randomly selected. The patients were surgically treated in the Department of Soft Tissue/Bone Sarcoma and Melanoma in the Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Warsaw, Poland and the study was approved by the Bioethical Committee; permit number 13/2008.
DNA extraction from FFPE samples (5–10 slices of 10-μm-thick sections) was performed using NucleoSpin DNA FFPE XS (Macherey-Nagel), all steps were performed according to manufacturer’s instructions. DNA quantitation was carried out with Qubit™ dsDNA HS Assay Kit (Invitrogen, cat. no. Q32854). To obtain useful information, including high-quality sequence data, FFPE DNA samples were treated with NEBNext FFPE DNA Repair Mix (New England BioLabs, cat. no. M6630 L) according to manufacturer’s instructions.
To identify DNA samples with potential SNPs in their sequences, preliminary selection was performed with PCR-RFLP (restriction fragment length polymorphism). PCR amplification of genomic DNA was conducted using forward primer: 5’–CTTCAGGGGCAATGAGACGAC–3′ and reverse primer 5’–GCACATTGGGGATGAACAGC–3′, the annealing temperature for PCR reactions was 65 °C. For accurate replication of template, Q5® High-Fidelity DNA Polymerase was used (NEB, cat. no. M0492S) and PCR products (size 80 bp) were digested with BsaHI restriction enzyme (NEB, cat. no. R0556S), which digests only templates without examined SNPs (the presence of either rs141193530 but also rs41268753 will abolish the restriction site 5’-GRCGYC-3′). The digested PCR products were resolved in 2.5–3% High Resolution Agarose (EurX, Poland, cat. no. E0302–50) gels, stained with SimplySafe™ (EurX, Poland, cat. no. E4600–01) and visualized with G:Box (Syngene). The digested PCR product of each sample was compared to the same amount of non-digested product. Samples with positive outcome (product not digested or partially digested) were additionally genotyped by pyrosequencing.
To specify exact single nucleotide polymorphism in selected DNA samples, a pyrosequencing assay was designed to measure the relative quantification of nucleotide incorporation at the SNP sites: rs41268753 (C/T) and rs141193530 (C/G). PCR amplification primers for the genomic DNA templates were as follows: Fw_5’–biotinylated–CTTCAGGGGCAATGAGACGAC–3′ and Rv_5’–GCACATTGGGGATGAACAGC–3′, with primer annealing at 65 °C. The biotinylated PCR products (80 bp) were subject to pyrosequencing using internal pyrosequencing primer 5’-ATTGGGGATGAACAGCAC–3′. The sequence content analyzed was GGGTG[G/C]C[G/A]TCTCC. Pyrosequencing service was provided by A&A Biotechnology (Gdynia, Poland).
For analysis of single marker association, SNP frequencies in coding regions in control European non-Finnish population were obtained from the Exome Aggregation Consortium database (ExAC) . For calculating odds ratio, relative risk, confidence interval, significance level and other parameters, statistical methods appropriate for medical research were employed . General information about the considered group of patients is provided in Additional file 1: Table S3.
Reduced expression of GRHL1 and GRHL3 genes in human non-melanoma skin cancers
We analyzed the expression of GRHL1–3 genes by Real-Time PCR in n = 27 NMSC samples (17 BCCs and 10 SCCs), including tumors and the adjacent histologically normal tissue from the border of excision. Expression levels of two of the examined genes – GRHL1 and GRHL3 – were significantly reduced in basal cell carcinoma (BCC) samples as well as in squamous cell carcinoma (SCC) samples, in comparison with the control healthy tissue from the same patient. We did not detect any significant changes in GRHL2 expression in either BCC or SCC cases. Interestingly, in both BCC and SCC, we observed statistically significant correlation between the expression of GRHL1 and GRHL3 genes, with the Spearman correlation coefficient R2 = 0.685 for BCC and 0.825 for SCC, with p < 0.001 for both (Fig. 1).
miR-21–3p may directly target GRHL1 by interaction with its 3’UTR
Coordinated decrease in the levels of expression of GRHL1 and GRHL3 in NMSC samples may indicate a common mechanism of regulation. We decided to look for miRNAs that may target mRNAs of both GRHL1 and GRHL3 using target microRNA and TargetScanHuman prediction algorithms [26, 27]. Multiple sequence alignment detected seed sequences with good SVR score for miR-21–3p in the 3’UTR of GRHL1 (Fig. 2a) which indicated that GRHL1 is a potential target of miR-21–3p regulation. It was previously demonstrated that miR-21-5p regulates the expression of GRHL3  hence in the present study we focused on the regulation of GRHL1 by miR-21–3p. When pMT01-GRHL1_3’UTR was co-transfected with miR-21–3p mimic into HEK293T cells, the relative luciferase activity of firefly reporter was significantly reduced compared with the transfection of negative control as well as with mutant reporter or empty vector (Fig. 2b). Furthermore, we observed decreased level of GRHL1 mRNA in human keratinocytes treated with miR-21–3p mimic (Fig. 2c). However, the decrease of the amount of GRHL1 protein was not statistically significant (Additional file 1: Fig. S1A). We transfected HaCaT cells with a miR-21–3p inhibitor, but we did not detect a significant change in the GRHL1 transcript or protein level, even though we tested a wide range of inhibitor concentrations, up to 180 nM (Additional file 1: Fig. S1B-C).
Expression of miR-21–3p and GRHL1 in SCC cell lines
We assayed the expression of miR-21–3p and GRHL1 in five SCC cell lines (Fig. 3). In all these lines we observed an increase in miR-21–3p levels, in comparison with the control HaCaT cell line. In almost all cases this increase was accompanied by a reduction in GRHL1 expression.
Polymorphisms in GRHL genes in examined cohort of patients with NMSC
We performed targeted deep sequencing in 33 human NMSC and control normal skin from the border of excision. The following regions of GRHL genes were sequenced: all the coding sequences, intron/exon boundaries, 5′ and 3’ UTRs, as well as all the potential regulatory regions as determined by ENCODE . These included CpG islands, regions with increased sensitivity to DNase I, regions with chromatin containing histone modifications specific for open chromatin, or binding regions for general transcription factors. The full list of analyzable regions is provided in Additional file 1: Table S2. This experiment did not detect any de novo mutations in these genes, that is, mutations that would be present in a tumor sample but absent from healthy control tissue from the same individual. Subsequently we analyzed SNP distribution in the examined group of patients. To record the most strongly associated SNPs we initially compared the frequencies of single nucleotide variants present in the examined NMSC patients to allele frequencies for European population available in 1000Genomes database . It was previously shown that genetic differences between extant European subpopulations are small , therefore European population was taken as a reference for SNP frequency comparison. SNPs with significantly altered frequencies are summarized in Table 1. In all cases the frequency of less common variant was increased in the patient cohort.
Single nucleotide polymorphisms in non-coding regions of GRHL2 and GRHL3 genes
Single nucleotide changes may contribute towards the risk of cancer development through their biological consequences. SNPs in the region upstream of the GRHL3 gene (rs55927162 and rs56256719) may possibly affect binding of transcription factors to potential enhancers or other regulatory elements. However, our bioinformatic analyses revealed that these SNPs do not significantly alter known transcription factors binding motifs provided in the JASPAR database . SNPs in intronic regions of GRHL2 and GRHL3 genes (rs151171718 and rs548650) may potentially affect splicing of the primary transcripts.
Single nucleotide polymorphisms in the coding region of GRHL3 gene
Two non-synonymous SNPs occur with significantly altered frequencies in our NMSC patient cohort: rs141193530 in exon 11 of the GRHL3 gene, which causes P455A amino acid residue substitution, and rs151326764 in alternative exon 16 of the GRHL3 gene, which causes R573H substitution (Table 1). We employed PROVEAN (Protein Variation Effect Analyzer) v1.1  to assess whether the presence of these SNPs may affect protein function. This analysis predicted that the R573H substitution is either “neutral” (PROVEAN prediction) or “tolerated” (SIFT prediction). The P455A substitution was predicted to be either “neutral” (PROVEAN prediction) or “damaging” (SIFT prediction).
Other than introducing amino acid residue substitutions in the encoded protein, the presence of these SNPs may be damaging for the functioning of the GRHL3 gene by affecting the processing of primary transcript. We employed two algorithms to search for exonic splicing enhancers (ESE). RESCUE-ESE  did not detect any potential ESEs that would overlap with the location of SNP rs141193530 or rs151326764. ESEfinder3.0  predicted some changes in binding of serine and arginine rich splicing factors (SRSF) to the region containing SNP rs141193530, which were: loss of one SRSF5 potential binding site and creation of a novel SRSF2 potential binding site. However, given multiple caveats of ESEfinder algorithm  it is difficult to ascertain whether these changes may influence pre-mRNA processing.
Interestingly, SNP rs141193530 was reported in a recent publication describing variants of the GRHL3 gene in patients with nonsyndromic cleft lip with/without cleft palate (nsCL/P) as well as nonsyndromic cleft palate only . The authors’ analyses concluded that this variant is either “tolerated” or “benign”. We re-analyzed frequency data provided by the authors in Additional file 1: Table S3  using appropriate statistical methods . As reference, we took the frequency data for European non-Finnish population provided by the ExAC database  because genetic differences between extant European subpopulations are small with the possible exception of population isolates as observed for the Finns . The frequency of rs141193530 in nsCL/P patient cohort was 0.016, compared to the control population of 0.00796. This difference in frequency was statistically significant, as indicated by OR = 2.0136; p = 0.0038; 95%CI: 1.2542–3.2329. Similarly, we observed increased frequency of this variant for NMSC patients (Table 2), what may indicate some effect of this single nucleotide change on GRHL3 functioning and increased risk of disease.
To further investigate whether this SNP is likely to be associated with NMSC occurrence, we carried out genotyping of SNP rs141193530 in a replication cohort, which included 176 additional patients with NMSC (Additional file 1: Table S3). The results are presented in Table 2.
Our research has demonstrated that the expression of two genes from the Grainyhead-like family – GRHL1 and GRHL3 – is reduced in a coordinated fashion in human NMSCs. In earlier work it was shown that the expression of GRHL3 is regulated by miR-21-5p . Here we provide evidence that the expression of GRHL1 may be regulated by miR-21–3p, even though our findings are not entirely consistent in this regard. This mechanism is supported by the results of GRHL1 and miR-21–3p expression studies in SCC cancer cell lines (Fig. 3) and, at the GRHL1 transcript level as well as in luciferase reporter assays, by transfections with miR-21–3p mimic (Fig. 2). However, we did not detect a statistically significant reduction in GRHL1 protein level following cell transfection with miR-21–3p mimic (p = 0.1; Additional file 1: Fig. S1A). This discrepancy could be the result of differences in sensitivity between the two methods, especially as in our experiments transfection with miR-21–3p mimic induced relatively small – by about 30% – decrease in GRHL1 transcript levels, as well as in luciferase reporter activity (Fig. 2). The interpretation of results of Western blotting is complicated further by the risk of cross-reactivity of antibodies with other proteins, as the three GRHL1–3 proteins display very high degrees of similarity, they are all expressed in HaCaT cells, and they have almost identical molecular weights so they cannot be separated by electrophoresis prior to Western blotting. Also, we did not observe changes in GRHL1 transcript or protein levels following cell transfection with miR-21–3p inhibitor, despite using very high inhibitor concentration (Additional file 1: Fig. S1B-C). The most likely explanation is that the effect of miR-21–3p on GRHL1 expression is small, hence the treatment of cells with miR-21–3p inhibitor is insufficient to induce statistically significant changes in GRHL1 expression. It is also possible that, since miR-21–3p levels are relatively low in the normal HaCaT cell line (Fig. 3), the effect of miR-21–3p inhibition is much weaker and thus less evident than the effect of miR-21–3p overexpression. Although it is generally assumed that “passenger” strands of miRNAs are degraded upon miRNA processing, for several miRNAs it was shown that the -5p/−3p ratio varies depending on cell type, developmental stage or different disease states, what suggests that strand selection is a tightly controlled process. There is experimental evidence that miR-21–3p is present at high levels in the UV-induced epidermis enhancing inflammation that may facilitate or contribute to tumor growth . It is thus possible that the expression of both GRHL1 and GRHL3 is regulated by the same oncogenic miRNA (miR-21), albeit by its different strands.
The oncogenic function of miR-21 is well documented. miR-21-deficient mice display decreased susceptibility to chemically-induced skin carcinogenesis . The expression of miR-21-5p  and miR-21–3p  is induced by UV irradiation, a known risk factor in NMSC. Interestingly, miR-21 expression is suppressed by GRHL3, suggesting the existence of a regulatory loop . The authors of that study proposed the following mechanism: upon malignant transformation the expression of GRHL3 is reduced, which leads to increased expression of miR-21 and further enhancement of tumorigenesis. It would be tempting to speculate that, as a result of GRHL3 decrease, the level of miR-21 increases which may subsequently lead to the reduction in the amount of GRHL1 mRNA. However, in the light of a different publication this mechanism is unlikely, as GRHL3 knock-down did not induce any changes in GRHL1 expression in HaCaT cells . It is thus more probable that the causes of reduced GRHL1 and GRHL3 expression in NMSC are environmental, occurring in response to UV irradiation, which triggers an increase in miR-21-5p and miR-21–3p levels in the epidermis, and in turn causes a decrease in the levels of GRHL1 and GRHL3 mRNA. Interestingly, in UVB-induced keratinocytes, there is a reduction of both transcript and protein level of desmoglein 1 , which is a direct target of GRHL1 regulation .
Our results shed further light on the importance of redundancy in studying carcinogenesis . The functions of Grhl1 and Grhl3 genes are redundant in the maintenance of epidermal barrier in adult mice; both these genes have to be concomitantly inactivated in order for the impermeable skin barrier to be compromised . Furthermore, it is known that barrier genes play important protective role in NMSC formation . It is thus likely that, in the context of human NMSC, the expression of both GRHL1 and GRHL3 must be simultaneously reduced to delay epidermal cells differentiation and/or induce tumor promoting inflammatory microenvironment in the skin, which would enable skin tumor progression. This hypothesis is supported by our observation.
We did not detect any de novo mutations in any of the GRHL genes in NMSC samples, that is, mutations that would be present in tumors but absent from healthy tissue from the same patient. This result is consistent with earlier analyses of GRHL3 gene in human SCC of the skin, where no de novo mutations were found . It would be premature to speculate that GRHL genes do not undergo mutations in cancer, especially as they have been associated with many different types of cancer , but no such mutations have been reported to date.
We marked several polymorphisms with significantly altered frequencies in the NMSC patient cohort, in comparison with the European population (Table 1). Some of these variants are rare (allele frequency < 0.01). It is still a matter of debate whether rare or common variants are more likely to contribute to disease risk , hence we decided to include both types of alleles in our further analyses.
Non-synonymous SNP rs141193530 is located in exon 11 of the GRHL3 gene and it introduces P455A amino acid residue substitution in the encoded GRHL3 protein. It seems likely that the presence of this SNP may be indeed deleterious for the functioning of the GRHL3 gene, as this polymorphism is associated with two very different diseases: nsCL/P and NMSC ( and present study). However, in both these diseases the GRHL3 gene serves a protective role. The reason why this SNP may be deleterious is related to the fact that the P455A substitution may nevertheless be damaging for the functioning of GRHL3 protein, despite some predictions to the contrary made by algorithms used by us and other researchers ( and present study). Alternatively, the presence of SNP rs141193530 may affect processing of the primary transcript, although our bioinformatic analyses did not provide any evidence to support this possibility. Thus the precise molecular mechanism responsible for the impact of SNP rs141193530 on the functioning of the GRHL3 gene and/or the encoded GRHL3 protein remains to be elucidated.
Akin to the impairment of GRHL3 protein function or defective splicing of GRHL3 pre-mRNA, reduced levels of GRHL gene expression in healthy human epidermis (prior to the development of NMSC) could also increase the risk of NMSC in affected people. This phenomenon is distinct from the reduction of expression of GRHL1 and GRHL3 genes in tumor samples in comparison with the adjacent healthy epidermis, as reduced Grhl3 mRNA levels were observed in SCC samples in the Grhl3+/+ (wild-type) mice, which did not have any mutations or polymorphisms in the Grhl3 gene . Instead, it is related to the levels of expression of GRHL genes in histologically normal epidermis before the onset of NMSC. Previously it was shown that mouse models with significantly reduced epidermal expression levels of Grhl1 or Grhl3 display an increased susceptibility to SCC of the skin [5, 9]. In human subjects, such reduction in expression levels may be brought about by altered binding of transcription factors to the regulatory regions of GRHL3 gene, which in turn may be caused by the presence of SNPs in binding motifs. Furthermore, the presence of SNPs in intronic regions may impact splicing efficiency, which may also lead to the reduction in mRNA levels (Table 1). These possibilities will require further studies.
Presented results show that the growth of human non-melanoma skin cancers is accompanied by reduced levels of GRHL1 and GRHL3 mRNA. This decrease occurs in a coordinated manner, which suggests a common mechanism of regulation. It is possible that the expression of both genes is regulated, at least in part, by the same oncogenic microRNA: miR-21, which expression is induced by UV, the main causal factor of skin cancers formation. We did not find any de novo mutations in GRHL genes in human NMSC samples compared to adjacent normal skin. However, we observed that in the examined cohort of NMSC patients, some single nucleotide polymorphisms occur at significantly altered frequencies, in comparison with the general European population. The presence of these SNPs may affect the functioning of encoded proteins, splicing of primary transcripts, or may alter the binding of transcription factors to the regulatory regions of GRHL genes. Thus presence of these polymorphisms may indicate an altered risk of NMSC development in affected people.
Basal cell carcinoma
Encyclopedia of DNA Elements Consortium
Exonic splicing enhancer
Exome aggregation consortium
Non-melanoma skin cancer
Nonsyndromic cleft lip with/without cleft palate
Restriction fragment length polymorphism
Squamous cell carcinoma
Single nucleotide polymorphism
Serine and arginine rich splicing factors
TATA box binding protein
Lavi O. Redundancy: a critical obstacle to improving cancer therapy. Cancer Res. 2015;75(5):808–12.
Rifat Y, Parekh V, Wilanowski T, Hislop NR, Auden A, Ting SB, Cunningham JM, Jane SM. Regional neural tube closure defined by the grainy head-like transcription factors. Dev Biol. 2010;345(2):237–45.
Boglev Y, Wilanowski T, Caddy J, Parekh V, Auden A, Darido C, Hislop NR, Cangkrama M, Ting SB, Jane SM. The unique and cooperative roles of the grainy head-like transcription factors in epidermal development reflect unexpected target gene specificity. Dev Biol. 2011;349(2):512–22.
Wilanowski T, Caddy J, Ting SB, Hislop NR, Cerruti L, Auden A, Zhao LL, Asquith S, Ellis S, Sinclair R, et al. Perturbed desmosomal cadherin expression in grainy head-like 1-null mice. EMBO J. 2008;27(6):886–97.
Darido C, Georgy SR, Wilanowski T, Dworkin S, Auden A, Zhao Q, Rank G, Srivastava S, Finlay MJ, Papenfuss AT, et al. Targeting of the tumor suppressor GRHL3 by a miR-21-dependent proto-Oncogenic network results in PTEN loss and tumorigenesis. Cancer Cell. 2011;20(5):635–48.
Cangkrama M, Darido C, Georgy SR, Partridge D, Auden A, Srivastava S, Wilanowski T, Jane SM. Two ancient gene families are critical for maintenance of the mammalian skin barrier in postnatal life. J Invest Dermatol. 2016;136(7):1438–48.
Griffin M, Casadio R, Bergamini CM. Transglutaminases: nature's biological glues. Biochem J. 2002;368(Pt 2):377–96.
Mlacki M, Kikulska A, Krzywinska E, Pawlak M, Wilanowski T. Recent discoveries concerning the involvement of transcription factors from the Grainyhead-like family in cancer. Exp Biol Med (Maywood). 2015;240(11):1396–401.
Mlacki M, Darido C, Jane SM, Wilanowski T. Loss of grainy head-like 1 is associated with disruption of the epidermal barrier and squamous cell carcinoma of the skin. PLoS One. 2014;9(2):e89247.
Demehri S, Turkoz A, Kopan R. Epidermal Notch1 loss promotes skin tumorigenesis by impacting the stromal microenvironment. Cancer Cell. 2009;16(1):55–66.
Darido C, Georgy SR, Jane SM. The role of barrier genes in epidermal malignancy. Oncogene. 2016;35(44):5705–12.
Di Girolamo D, Ambrosio R, De Stefano MA, Mancino G, Porcelli T, Luongo C, Di Cicco E, Scalia G, Vecchio LD, Colao A, et al. Reciprocal interplay between thyroid hormone and microRNA-21 regulates hedgehog pathway-driven skin tumorigenesis. J Clin Invest. 2016;126(6):2308–20.
Thomas P, Smart TG. HEK293 cell line: a vehicle for the expression of recombinant proteins. J Pharmacol Toxicol Methods. 2005;51(3):187–200.
Yan M, Chen C, Gong W, Yin Z, Zhou L, Chaugai S, Wang DW. miR-21-3p regulates cardiac hypertrophic response by targeting histone deacetylase-8. Cardiovasc Res. 2015;105(3):340–52.
Liebertz DJ, Lechner MG, Masood R, Sinha UK, Han J, Puri RK, Correa AJ, Epstein AL. Establishment and characterization of a novel head and neck squamous cell carcinoma cell line USC-HN1. Head Neck Onco. 2010;2:5.
Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res. 2002;12(6):996–1006.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Lunter G, Goodson M. Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Res. 2011;21(6):936–9.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. Genome project data processing S: the sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.
Genomes Project C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA: An integrated map of genetic variation from 1,092 human genomes. Nature 2012, 491(7422):56–65.
Dunham I, Kundaje A, Aldred SF, Collins PJ, Davis C, Doyle F, Epstein CB, Frietze S, Harrow J, Kaul R, et al. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74.
Krystkowiak I, Lenart J, Debski K, Kuterba P, Petas M, Kaminska B, Dabrowski M. Nencki genomics database--Ensembl funcgen enhanced with intersections, user data and genome-wide TFBS motifs. Database (Oxford). 2013;2013(bat069):bat069.
Mathelier A, Zhao X, Zhang AW, Parcy F, Worsley-Hunt R, Arenillas DJ, Buchman S, Chen CY, Chou A, Ienasescu H, et al. JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic Acids Res. 2014;42(Database issue):D142–7.
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, O'Donnell-Luria AH, Ware JS, Hill AJ, Cummings BB, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91.
Altman DG. Practical statistics for medical research. London: Chapman and Hall; 1991.
Betel D, Wilson M, Gabow A, Marks DS, Sander C. The microRNA.Org resource: targets and expression. Nucleic Acids Res. 2008;36(Database issue):D149–53.
Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120(1):15–20.
Lao O, Lu TT, Nothnagel M, Junge O, Freitag-Wolf S, Caliebe A, Balascakova M, Bertranpetit J, Bindoff LA, Comas D, et al. Correlation between genetic and geographic structure in Europe. Curr Biol. 2008;18(16):1241–8.
Choi Y, Sims GE, Murphy S, Miller JR, Chan AP. Predicting the functional effect of amino acid substitutions and indels. PLoS One. 2012;7(10):e46688.
Yeo G, Hoon S, Venkatesh B, Burge CB. Variation in sequence and organization of splicing regulatory elements in vertebrate genes. Proc Natl Acad Sci U S A. 2004;101(44):15700–5.
Cartegni L, Wang J, Zhu Z, Zhang MQ, Krainer AR. ESEfinder: a web resource to identify exonic splicing enhancers. Nucleic Acids Res. 2003;31(13):3568–71.
Mangold E, Bohmer AC, Ishorst N, Hoebel AK, Gultepe P, Schuenke H, Klamt J, Hofmann A, Golz L, Raff R, et al. Sequencing the GRHL3 coding region reveals rare truncating mutations and a common susceptibility variant for Nonsyndromic cleft palate. Am J Hum Genet. 2016;98(4):755–62.
Degueurce G, D'Errico I, Pich C, Ibberson M, Schutz F, Montagner A, Sgandurra M, Mury L, Jafari P, Boda A, et al. Identification of a novel PPARbeta/delta/miR-21-3p axis in UV-induced skin inflammation. EMBO Mol Med. 2016;8(8):919–36.
Ma X, Kumar M, Choudhury SN, Becker Buscaglia LE, Barker JR, Kanakamedala K, Liu MF, Li Y. Loss of the miR-21 allele elevates the expression of its target genes and reduces tumorigenesis. Proc Natl Acad Sci U S A. 2011;108(25):10144–9.
Guo L, Huang ZX, Chen XW, Deng QK, Yan W, Zhou MJ, Ou CS, Ding ZH. Differential expression profiles of microRNAs in NIH3T3 cells in response to UVB irradiation. Photochem Photobiol. 2009;85(3):765–73.
Bhandari A, Gordon W, Dizon D, Hopkin AS, Gordon E, Yu Z, Andersen B. The Grainyhead transcription factor Grhl3/Get1 suppresses miR-21 expression and tumorigenesis in skin: modulation of the miR-21 target MSH2 by RNA-binding protein DND1. Oncogene. 2013;32(12):1497–507.
Johnson JL, Koetsier JL, Sirico A, Agidi AT, Antonini D, Missero C, Green KJ. The desmosomal protein desmoglein 1 aids recovery of epidermal differentiation after acute UV light exposure. J Invest Dermato. 2014;134(8):2154–62.
Gibson G. Rare and common variants: twenty arguments. Nat Rev Genet. 2011;13(2):135–45.
Schmittgen TD, Livak KJ. Analyzing real-time PCR data by the comparative ct method. Nat Protoc. 2008;3(6):1101–8.
Mapping of SNPs to ENCODE regions and transcription factor binding motifs was performed with the help of PhD Michal Dabrowski, the head of Laboratory of Bioinformatics in Neurobiology Center at Nencki Institute of Experimental Biology, Polish Academy of Sciences, Warsaw, Poland. The authors would like to thank the Exome Aggregation Consortium and the groups that provided exome variant data for comparison.
This work was supported by: National Science Centre of Poland grant No. 2011/03/N/NZ5/01518 to A. Kikulska; European Social Fund – scholarship for PhD student in the Mazovia Region to A. Kikulska; the European Molecular Biology Organization Installation Grant No. 2131 to T. Wilanowski; the Marie Curie International Reintegration Grant No. 256096 (Seventh Framework Programme, European Union) to T. Wilanowski. The funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
Raw data from next generation sequencing datasets generated and analyzed during the current study are available from the corresponding author on reasonable request. All other data generated or analyzed during this study are included in this published article and its supplementary information files.
Ethics approval and consent to participate
This study complied with the Declaration of Helsinki and was approved by the Bioethical Committee of the Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Warsaw, Poland; permit number 13/2008. Written informed consent was obtained from all patients.
Consent for publication
AK, PR and TW have a patent pending. This patent has the following numbers: WO2015093998-A1; IT1421294-B; PL410049-A1.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Description of individuals with NMSC (fresh tissues, NGS). Table S2. SureSelect HaloPlex Design (analyzable regions). Table S3. Total number of NMSC patients including the replication cohort – general description. Fig. S1. Levels of GRHL1 protein in HaCaT cells transfected with miR-21–3p mimic or inhibitor. (DOC 240 kb)
About this article
- Non-melanoma skin cancer
- Molecular genetics
- Gene expression
- Single nucleotide polymorphism
- Transcription factor