The abnormal expression of HSPGs in cancer and stromal cells can serve as a biomarker for tumor progression and patient survival [25]. HS fine structure is determined by the cell-type specific expression of only certain isoforms of some of the biosynthetic enzymes, notwithstanding the existence in some specific cases of regulation at the level of translation or enzymatic catalysis [28,29,30]. In a previous work, we have described the alterations that take place in RSCRCs which affect both the core proteins of HSPGs and the different enzymes responsible for the synthesis of the GAG chains, as well as the differences in these changes depending on the presence or not of metastasis in malignances [21]. In the present study, we provide a similar analysis focused on LSCRCs in order to determine whether there are any differences between left- and right sided pathologies. As in the previous study, here we have considered the comparative analysis of tumors at the T3 stage, where the muscularis propria is affected, and classified the tumors depending on the presence or absence of lymph node metastasis.
Two gene families, syndecans and glypicans, account for most cell surface HSPGs in humans, along with a few part-time proteins [22]. Transcripts for all syndecan species were detected in LSCRCs, but only syndecan-1 mRNA appeared overexpressed in most tumors, independent of the presence of metastasis. Although some previous studies using colon carcinoma cells have described alterations in the transcriptions of syndecan-2 and -4 [16], our previous work on RSCRCs was only able to detect overexpression of syndecan-1 in metastatic tumors [21]. Interestingly, in this work the analysis of the expression of syndecan-1 protein using immunohistochemistry provided the noteworthy finding that non-metastatic LSCRCs displayed lower immunoreactivity to that detected in normal tissues from the same patients, while the metastatic tumors showed a dramatic decrease in staining. Furthermore, non-metastatic tumors have a certain level of staining in the extracellular matrix, suggesting the shedding of cell membrane-bound proteoglycans. These results are very similar to those previously described in RSCRCs, where we suggested that the expression of syndecan-1 involves additional post-transcriptional mechanisms, such as protein translation, degradation, inhibition by feedback loops or miRNA regulation [21]. There is evidence for the post-transcriptional regulation of syndecan-1 expression in, for example, pancreatic cancer and peritoneal macrophages [30, 31]. Our data also correlate well with other previous immunohistochemical studies which have described a loss of expression of syndecan-1 in CRCs, some of which have been found to correlate with tumor stage and metastasis [32,33,34,35]. Upregulation of syndecan-1 has been described in some types of tumors, and it has been postulated that this aberrant expression may play a key role in promoting growth factor signaling in cancer cells [36]. In contrast, other malignances have been found to show downregulation of this molecule, indicating that this HSPG could well serve as a prognostic marker in a cancer-type-specific manner [36].
The glypican family comprises six cell surface HSPGs that are involved in the regulation of several signaling pathways, where, depending on biological context, they either stimulate or inhibit activity [37]. As such, tumor progression is affected by their activity, with abnormal expression being linked to various human tumors [25]. In the samples analyzed in this study, their relative expression patterns were quite similar to those observed in RSCRCs [21]. However, very few transcriptional changes were detected: in isoforms 4 and 6 in metastatic tumors, where, moreover, these alterations were markedly different from those observed in ascending tumors, where there is a great underexpression of glypican 1 in all types of tumors; and in isoforms − 3 and − 6 in non-metastatic tumors [21]. Unlike the other isoforms, relatively little is known concerning the expression or functional roles of glypican-4 and -6 in tumors. However, the ability of glypican-4 to uncouple pluripotent stem cell differentiation from tumorigenic potential has been recorded [38], while the reduced expression or loss of function of glypican-6 has been described in retinoblastoma and autosomal-recessive omodysplasia [39, 40].
Betaglycan and CD44v3 are part time membrane HSPGs, meaning that they occur either with or without HS chains [22]. Although the expression of CD44v3 in CRCs has been described as being related to more advanced pathological stage and poorer prognosis [41], in this study no statistically significant differences in any type of LSCRCs were found, mirroring our previous findings for RSCRCs [21]. The other part time HSPG analyzed was betaglycan, whose expression in tumor cells appears to play an important role in the progression of the pathology [42]. However, in relation to CRCs, although this molecule appeared underexpressed in non-metastatic RSCRCs [21], in this study no significant differences between tumor and healthy tissues was detected in LSCRCs.
Another cell-associated HSPG is serglycin, which constitutes a separate category since it has the peculiarity of being located intracellularly [23]. Transcript levels of this gene in this work were significantly reduced, both in metastatic and non-metastatic tumors, following a similar pattern to that previously observed in RSCRCs [21]. Serglycin is mainly found in hematopoietic and endothelial cells, and the principal GAG chains found bound to this core protein are CS, except in mast cells where CS type E or heparin may be present, depending on the cell’s origin [23, 43]. Mast cells in LSCRCs were drastically reduced in tumors compared to non-tumor colon mucosa, which could be, at least in part, the reason for the decrease in protein expression. A number of previous studies have described alterations in serglycin in different tumors [44,45,46], and it is also worth noting that results analogous to those described in this work, involving both downregulation of transcription and reduction in the population of mast cells, have also been obtained in RSCRCs [21], suggesting that this is a common feature of both CRC types.
Three HSPG species are located at the ECM: agrin, perlecan and collagen XVIII, and the latter two showed significant downregulation in tumoral samples, which is interesting considering that these two species also appeared modified in RSCRCs [21]. Perlecan expression, both at the transcription and the protein level, was diminished in tumors, independent of the presence or absence of lymph node metastasis. Perlecan, a critical regulator of growth factor-mediated signaling and angiogenesis, is an essential element in maintaining basement membrane homeostasis [47], likely indicating that it has a role to play in the progression of CRCs.
Collagen XVIII appeared downregulated to a statistically significant extent only in non-metastatic LSCRCs, while in RSCRCs its expression was significantly reduced in both metastatic and non-metastatic tumors [21]. However, decreases of about 50% were found in 70% of metastatic LSCRCs, a result that approached significance (p = 0.07), leading us to suggest that the results observed could be dependent on the individual sample analyzed, and that the real effect might occur similarly in all CRCs, regardless of their location. Several reports in other malignances describe different types of alterations in collagen XVIII depending on type of tumor, e.g., its expression increases in ovarian and pancreatic cancer, while it diminishes in liver and oral cancer [25].
In summary the patterns of alterations in the levels of expression of HSPG core proteins in CRCs is quite similar for ECM molecules, syndecans and serglycin, independent of tumor location, while glypicans display differences between RS- and LS- malignances.
The tissue-specific expression of individual HSPGs will determine when and where HS chains are expressed. For GAG chain generation, it is necessary to regulate the activity of many different enzymes, mainly GTs, located in the lumen of the Golgi apparatus [10]. The initial step in the biosynthesis of the chains is the creation of a tetrasaccharide linkage region, followed by polymerization through the consecutive addition of alternating GlcA and GlcNAc. A number of works have described variations in HS levels, both increases and decreases, in different tumor types [15, 48], including for CRCs, where decreases have been reported [17, 49]. However, in this work, it was not possible to determine the existence of significant differences in the transcription levels of any of these genes in LSCRCs. This finding contrasts with the results previously described for metastatic RSCRCs, in which B3GAT1 expression decreased, particularly in non-metastatic RSCRCs, where several genes responsible for the synthesis of the linker (XYLT1, XYLT2, B3GAT1) and the polymerization of the chain (EXT1) were downregulated [21].
During HS biosynthesis, various sulfation and epimerization reactions take place which are responsible for the fine structure of the saccharide chain. The first reaction involved in polymer modification is the removal of acetyl groups from GlcNAc residues, after which the amino group is sulfated, catalyzed by four different isoforms of N-deacetylase/N-sulfotransferases [10]. The tissue distribution of NDST1 and NDST2 broadly overlap [50], and transcripts for both were detected in all samples analyzed in this study, with NDST2 appearing downregulated in all LSCRCs, while NDST1 transcription was downregulated in all metastatic tumors and in 60% of non-metastatic. In contrast, previous work with RSCRCs has shown that both isoforms were underexpressed, but only in the non-metastatic patients [21]. NDST4 was undetectable in most samples in the current work, while NDST3 transcripts were detected in only a small percentage of tumors. Expression of NDST3 and NDST4 is principally restricted to the period of embryonic development [51]. That said, in certain tumor types, expression of these molecules has been described, for example, NDST4 in breast cancer [51], although in RSCRCs neither was detected [21].
The next steps in the synthesis of the HS chains include the epimerization of GlcA into IdoA, an action catalyzed by the enzyme C5-GlcA epimerase, along with O-sulfation at C2 of uronic acid [11, 12]. An overexpression in the transcription levels of the two genes involved, GLCE and HS2ST1, was detected in non-metastatic LSCRCs in this study, although not in metastatic tumors, in contrast to our previous study in RSCRCs which found no alterations in the expression of these genes [21].
The addition of an O-sulfate group at C6 is mediated by enzymes encoded by the genes HS6ST1–3, each of which is specific to a particular substrate and differs in its tissue expression [52]. Transcripts for the three isoforms were identified in the healthy tissue studied here, but HS6ST3 mRNAs were not detected in tumor samples, neither metastatic nor non-metastatic. Meanwhile, HS6ST1 appeared downregulated in metastatic LSCRCs. These results show a pattern different from that previously described in RSCRCs, where HS6ST2 was not detected in either tumor type or in healthy tissue, and HS6ST1 appeared deregulated in non-metastatic tumors [21].
The last, and largest, family of enzymes involved in the biosynthesis of HS is the 3-O-sulfotransferases, which comprises seven different members (HS3ST1–6). In LSCRCs, only isoform − 6 expression appeared altered, its expression being diminished in non-metastatic tumors, although it was not detectable in metastatic forms. Again the data differ from those obtained in RSCRCs, where none of the isoforms appeared modified in metastatic tumors, and HS3STB1 and HS3ST5 were underexpressed only in non-metastatic ones [21]. 3-O-sulfation is a relatively rare modification, and it has only been found to influence a small number of proteins thus far [53]. That said, several studies have reported 3-O-sulfation alterations in different tumors [15, 48], and it has been suggested that certain patterns of 3-O-sulfation may be responsible for the appearance of cancerous phenotypics [25].
Once the HS biosynthetic process has been completed, 6S groups present at the glucosamine residues can be post-synthetically edited from the chain, suggesting that it may have special regulatory importance. The reaction is carried out by two endosulfatases located on the cell surface, SULF1 and SULF2 [27], and alterations in the expression of these genes in various tumor types, either up- or downregulation, have been reported [15, 48]. In the case of LSCRCs, transcript levels of none of these genes were found to be altered, mirroring the results previously observed in RSCRCs, where there were also no alterations [21].
Analysis of the HS structure by immunohistochemistry showed differences between normal mucosa and tumors as regards the intensity and distribution of the molecules. It is possible that these differences are caused by structural changes brought about by alterations in the transcription levels of HS biosynthetic enzymes.
Some of the HSPGs analyzed in this study are hybrid molecules, with both HS and CS side chains [54], making it interesting to extend the study to the genes involved in the biosynthesis of this GAG. In addition, changes in CSPGs associated with CRCs, such as versican and decorin, have been found [55], as well as alterations of the CS chains in RSCRCs [21].
CS chain extension takes place through the sequential addition of alternative GlcA and GalNAc residues. Five genes, CSGALNACT1, CSGALNACT2, CHSY1, CHPF and CHSY3 encode the GTs involved in this process [56]. CSGALNACT2 transcription was downregulated in non-metastatic LSCRCs, which coincides with what has been described in RSCRCs, although in the latter the rest of the genes involved, except CHPF, also experienced a decrease in their transcription levels [21]. Interestingly, CHPF appeared overexpressed in all LSCRCs, as it was in metastatic RSCRCs [21]. In other previous studies of CRCs, different changes have been described, although it must be stressed that these studies involved colorectal cancers from different locations and not only RSCRCs [57].
CS repeating disaccharide building units can be modified by epimerization of GlcA residues and by various sulfations [57]. O-sulfation at C4 of GalNAc residues is carried out by enzymes encoded by four genes (CHST11–14), although in this work only CHST11 and − 12 appeared underexpressed in all LSCRCs. This concurs with our previous study in RSCRCs that showed a very similar expression pattern for these genes, although in that case CHST14 was also underexpressed in all tumors, irrespective of their metastatic features [21].
O-sulfation at C6 of GalNAc residues is performed by three different genes, and CHST3 translation was downregulated in non-metastatic tumors, although immunohistochemistry showed the opposite result, i.e. that it was upregulated. This apparent contradiction between the two sets of results is similar to the case described above for syndecan 1, and suggests the involvement of additional post-transcriptional mechanisms [21]. Discordances between mRNA and protein in complex biological samples have been widely analyzed and discussed [58], and subsets of proteins displaying negative correlation with mRNA expression values have been described in some tumors [59]. In addition, CHST7 appeared underexpressed in metastatic LSCRCs in this work. Although very similar patterns of expression for the three genes were previously found in RSCRCs, no alterations in transcription were observed [21].
Both genes encoding the enzymes involved in the modification reactions of uronic acid residues, DSE and UST, were significantly altered, although DSE to a lesser extent, with the difference not reaching significance in metastatic tumors. The alterations observed once again followed a pattern similar to that previously observed in the ascending tumors, where both enzymes appeared downregulated [21]. The alterations in the CS chains as a result of the differences in expression of biosynthetic genes were analyzed by immunohistochemistry using the specific antibody CS-56, with clear differences found in the amount and location of the staining, although it must be taken into account that CS-56 antibody reacts preferentially with CS-D (sulfated at C-2 and C-6), but is also able to recognize other types of structures, including CS-A, -B, -C, and -E [60].
In terms of survival, the small sample size and the retrospective character of this study should be recognized as a considerable limitation, but, despite this, statistically significant differences in the underexpression of two genes were detected, along with a trend in two additional genes, and this behavior seems to be maintained regardless of lymph node involvement or not. Some of the other genes found to be significantly dysregulated in this work might also show a relationship with survival in a bigger sample, therefore prospective studies with a larger study population are necessary.