Extracellular matrix signatures of human primary metastatic colon cancers and their metastases to liver
BMC Cancer volume 14, Article number: 518 (2014)
Colorectal cancer is the third most frequently diagnosed cancer and the third cause of cancer deaths in the United States. Despite the fact that tumor cell-intrinsic mechanisms controlling colorectal carcinogenesis have been identified, novel prognostic and diagnostic tools as well as novel therapeutic strategies are still needed to monitor and target colon cancer progression. We and others have previously shown, using mouse models, that the extracellular matrix (ECM), a major component of the tumor microenvironment, is an important contributor to tumor progression. In order to identify candidate biomarkers, we sought to define ECM signatures of metastatic colorectal cancers and their metastases to the liver.
We have used enrichment of extracellular matrix (ECM) from human patient samples and proteomics to define the ECM composition of primary colon carcinomas and their metastases to liver in comparison with normal colon and liver samples.
We show that robust signatures of ECM proteins characteristic of each tissue, normal and malignant, can be defined using relatively small samples from small numbers of patients. Comparisons with gene expression data from larger cohorts of patients confirm the association of subsets of the proteins identified by proteomic analysis with tumor progression and metastasis.
The ECM protein signatures of metastatic primary colon carcinomas and metastases to liver defined in this study, offer promise for development of diagnostic and prognostic signatures of metastatic potential of colon tumors. The ECM proteins defined here represent candidate serological or tissue biomarkers and potential targets for imaging of occult metastases and residual or recurrent tumors and conceivably for therapies. Furthermore, the methods described here can be applied to other tumor types and can be used to investigate other questions such as the role of ECM in resistance to therapy.
With more than 140,000 new cases diagnosed in 2012, colorectal cancer is the third most commonly diagnosed cancer type in both men and women in the United States. Thanks to prevention and, particularly, early detection, there has been a steady decrease in the number of deaths due to colorectal cancer over the last two decades. And yet, in 2012 it was estimated that colorectal cancer would claim the lives of 50,000 patients. Several genes have been directly implicated in the etiology of colorectal cancer and, despite the fact that tumor-intrinsic molecular mechanisms controlling colorectal carcinogenesis have been identified [1, 2], novel prognostic and diagnostic tools as well as novel therapeutic strategies are still needed to prevent colon cancer progression.
Proteomics has become a method of choice to identify cancer-related biomarkers . Within the last five years, over 35 studies published in peer-reviewed journal applied global proteomics techniques to the study of colorectal samples from patients (reviewed in [4, 5]). These studies revealed a certain number of proteins (including extracellular matrix proteins, see Results and discussion section) up or down-regulated in cancer samples as compared with normal samples, which represent potential biomarkers. However, as discussed in the review by De Wit and colleagues, these studies have not yet been successfully translated to the clinic .
The extracellular matrix (ECM) is a complex meshwork of cross-linked proteins providing architectural support for cells. In addition, ECM proteins bind and present growth factors to cells, thus providing both biophysical and biochemical cues that are major regulators of cellular behavior [6, 7]. The ECM is a major component of the tumor microenvironment and exerts many roles during tumor progression: it supports proliferation and survival of tumor cells; it contributes to the formation of the cancer stem cell niche and thus sustains primary tumor growth; it participates by its nature and/or architecture in the formation of a pro-invasive environment; and, finally, it contributes to the invasion of distant sites by participating in the formation of a microenvironment that will support tumor cell seeding and growth [8–10]. Classical pathology has used excessive ECM deposition (desmoplasia) as a marker of tumors with poor prognosis long before the composition and the complexity of the ECM was even uncovered. Recent studies have also suggested that the ECM can act as a barrier to drug delivery and can confer chemo-resistance to tumors [11, 12]. The ECM thus appears of great interest for discovery of ways to predict, diagnose and cure cancer.
In order to characterize the ECM composition of tumors, we have developed a proteomics-based approach and have shown, using mouse models, that we can identify 100–150 ECM proteins in any given tissue or tumor sample . Using human melanoma and mammary carcinoma xenograft models, we have demonstrated that tumors of different metastatic ability differ in both tumor- and stroma-derived ECM components [13, 14]. Moreover, we showed that several tumor-derived ECM proteins, characteristic of highly metastatic tumors, play important causal roles in metastatic dissemination .
Having developed these systematic methods, we now wished to analyze the composition of the ECM of human patient samples. We report here the characterization of the ECM composition of metastatic colorectal cancer samples (both primary tumors and metastases to liver) and paired normal tissues (normal colon and liver tissue). We have been able to identify consistent changes in the ECM of i) colon tumors as compared with normal colon ECM; ii) primary tumors as compared with metastases derived from them. Based on these changes, we derived ECM protein signatures of primary colon carcinoma and primary colon tumor metastasis to liver. Comparisons of these signatures with available clinical gene expression array data show that subsets of these signatures correlate well with tumor progression and metastasis. We believe that these data sets will lead to the identification of more precise predictive signatures and to the development of assays, in particular serological measurements or immunohistochemical assays, which could be used by pathologists to improve cancer patient management and care.
For each of the three patients included in this study, we obtained a set of three samples: normal colon, colon tumor and its metastasis to liver. None of the patients had received chemotherapy prior to surgery and sample collection and were all diagnosed with stage IV metastatic colon cancer. When available, we obtained duplicate samples of some tissues. Samples were between 25 mg and 85 mg. To obtain enough material to characterize the composition of the normal liver ECM, we generated two pools of samples from 4 and 5 healthy patients respectively (reaching a total of approximately 450 mg per pool). Informed consent was obtained from all of the patients and none of the specimens came from minors. The anonymized specimens were obtained from the MGH tissue bank and were removed for medical reasons unrelated to this project. The specimens were analyzed in accordance with a protocol approved by the Massachusetts General Hospital’s Institutional Review Board (IRB).
Tissue preparation and ECM protein enrichment
The tissue and tumor samples were homogenized with a Bullet Blender (Next Advance, Averill Park, NY) according to the manufacturer’s instructions. Sequential extractions of frozen samples of tumors were performed using the CNMCS compartmental protein extraction kit (Millipore, Billerica, MA) as previously described . In brief, frozen samples were homogenized and extracted sequentially to remove preferentially (1) cytosolic proteins, (2) nuclear proteins, (3) membrane proteins, (4) cytoskeletal proteins leaving a final insoluble fraction enriched for ECM proteins, although different tissues behave somewhat differently in the ease of extraction so that proteins sometimes appear in more than one fraction. The effectiveness of extraction of specific proteins was monitored by immunoblotting using the following antibodies: rabbit anti-collagen I, mouse anti-GAPDH and rabbit anti-histones (Millipore, Billerica, MA), the rabbit anti-actin antibody (serum 14–1) was generated in our laboratory.
Protein digestion, peptide fractionation, and mass spectrometry
The ECM-enriched samples were solubilized in urea, reduced, digested with PNGaseF, Lys-C, and trypsin as previously described . The resulting peptides (~50 μg) were separated into 11 fractions by off-gel electrophoresis (OGE) according to isoelectric point over a pH range of 3–10 . Each OGE fraction was analyzed by LC-MS/MS with an LTQ Orbitrap XL mass spectrometer (Thermo Fisher Scientific, San Jose, CA). Mass spectra were interpreted with Spectrum Mill. MS/MS spectra were searched against a UniProt database containing human (78,369 entries) sequences downloaded from the UniProt web site on June 30, 2010 with a set of common laboratory contaminant proteins (73 entries) appended. Peptides identified with a false discovery rate < 1.6% were assembled into identified proteins, and annotated as being ECM-derived or not as previously described . Detailed information is provided as Additional file 1: Supplementary Methods. The raw mass spectrometry data have been deposited in the public proteomics repository MassIVE (http://massive.ucsd.edu) using the identifier: MSV000078555. The data should be accessible at ftp://MSV000078555:firstname.lastname@example.org.
Gene Set enrichment analysis
Gene Set Enrichment Analysis was performed using GSEA v2.0.12 (http://www.broadinstitute.org/gsea). We identified four clinical gene expression datasets which reported measurements on both primary colon tumors and metastases to liver. The sample type was used to define phenotypic classes for comparison. Probe sets were collapsed to unique gene symbols. Gene sets corresponding to the proteins in our ECM signatures of primary metastatic colon tumors (37 genes) or metastases to liver (7 genes) were created and the distributions of the genes for each of these signatures against the rank-ordered metastasis vs. primary colon tumor comparisons were characterized using GSEA with the default settings. A positive normalized enrichment score indicates enrichment in metastasis samples. A negative normalized enrichment score indicates enrichment in primary colon tumor samples. Each gene in the proteomics-derived gene set is ordered by its position in the ranked list of genes from the dataset and is assigned a rank metric score reflecting its position and the number of probes in the expression dataset. The leading-edge subsets of genes are those genes that appear in the ranked list before (for positive enrichment scores) or after (for negative enrichment scores) the point at which the running sum reaches its maximum deviation from zero. The leading-edge subset can thus be interpreted as the core that accounts for the gene set’s enrichment signal. Note that the analysis includes only those genes in the gene set that are also in the expression dataset. The raw GSEA data may be downloaded from: http://rowley.mit.edu/Hynes/Naba_GSEA_ColonCancer/.
Results and discussion
Proteomic analyses of ECM from normal tissues and tumors from colorectal patient samples
We obtained from Massachusetts General Hospital’s tissue bank patient-matched metastatic colorectal cancer samples (primary tumor and paired metastases to liver) and normal colonic tissue from three patients. We also obtained normal liver tissue from healthy donors (see Methods). ECM proteins were enriched from normal tissues or tumors using the subcellular fractionation protocol previously described . Figure 1A shows the efficiency of the sequential extraction protocol leading to significant enrichment of collagen I (other ECM proteins were similarly retained in the final insoluble fraction; data not shown) and concomitant depletion of intracellular proteins (actin, GAPDH, histones) in the final ECM-enriched samples.
The composition of the ECM-enriched fractions obtained was subsequently characterized by mass spectrometry. The complete proteomic data sets and the matrisome subsets (ECM and ECM-associated proteins) are presented in Additional files 2 and 3.
Evaluation of intra-patient reproducibility
To evaluate the reproducibility of our approach, we conducted analyses on two distinct pieces from each tissue or tumor from patient 1 (Figure 1B-D). We observed good overlap between the ECM proteins detected in each of the duplicate samples analyzed. This was true not only for the duplicate normal colon samples analyzed (Figure 1B) but also for the duplicate colon tumor samples (Figure 1C) and the duplicate metastasis samples (Figure 1D). We observed similar reproducibility for normal colon and colon tumor samples from patient 2 (Additional file 4). Of note, we had observed similar results from normal murine lung and colon tissues . These results argue against significant intra-tumoral heterogeneity detectable at this level of analysis; it appears that the sample size (25-85 mg of tissue) was sufficient to average out any spatial heterogeneity in the ECM. It is also worth noting that we detected more ECM proteins in the tumor samples (primary colon tumor and metastasis to liver) than in the normal colon samples, which may reflect the desmoplasia that often accompanies tumor progression.
Comparison of the ECM composition of tissues and tumors across different patients
We next wanted to compare the ECM composition of samples (normal tissues or tumors) from different patients. Therefore, we extended our analyses to samples from two additional patients and compared the compositions of the ECM of normal colon (Figure 2A), primary colorectal tumor (Figure 2B), and metastases to liver (Figure 2C) with the data obtained for patient 1. When we compared the ECM composition of the normal colon samples from three patients, we identified a set of 89 ECM proteins present in all three samples (Figure 2A). In addition we identified subsets of proteins present in two out of three samples (representing 10% to 12% of the proteins) and, finally, about 12% of the ECM proteins detected were patient-specific. The comparison of three primary metastatic colorectal tumor samples from three patients revealed that, again, the majority of proteins detected were found in all three samples (122 proteins; Figure 2B). We also identified subsets of proteins present in two out of three samples and finally, depending on the sample, 8% to 15% of the ECM proteins detected were patient-specific. The inter-patient variability was greater for the metastasis samples (Figure 2C). Although we detected a set of 71 proteins common to all three metastasis samples analyzed, the number of ECM proteins detected in the metastasis from patient 1 was twice the number detected in patient 2 and 1.5 times the number of proteins detected in patient 3. For each of the three tissue types analyzed, we observed a striking overlap, although it is worth noting that the inter-patient overlap is notably greater for normal colon samples than it is for colon tumor and metastasis samples, which may reflect the heterogeneity among tumor samples (primary tumors or metastases) as compared with normal tissues (see Conclusions).Because of the very small amount of extracellular matrix in the normal liver, we were not able to analyze reliably the liver ECM from individual patient samples (on average 50 – 75 mg). Instead, and because of the high inter-patient overlap observed with normal colon samples, we generated two pools of normal liver samples from healthy donors (pools were composed of 4 and 5 liver samples, respectively, and were approximately 450 mg each). ECM proteins could successfully be enriched from these pooled liver samples (Figure 1A). Moreover, this strategy allowed us to obtain enough ECM material to be analyzed by mass spectrometry. We identified 115 and 174 ECM proteins in each pool, and 105 of them were detected in both pools (Figure 2D).
After evaluating the similarity among samples obtained from different patients, we wanted to compare the ECM compositions of normal tissues with those of primary and secondary tumors. We therefore defined, for each tissue or tumor type, its “matrisome” as the ensemble of proteins detected in at least two of the three patients studied. According to this definition, the matrisome of normal human colon comprises 122 proteins, the matrisome of primary colon tumors 187 proteins and the matrisome of metastases 135 proteins (grey areas in Figure 2A-C). To define the normal liver matrisome, we took the intersection (105 proteins) of the two pools analyzed (Figure 2D). We further subdivided each matrisome list into the protein categories we previously defined [13, 15]: ECM glycoproteins, collagens and proteoglycans for core matrisome proteins and ECM-affiliated proteins, ECM regulators and secreted factors for ECM-associated proteins (Figure 3) [13, 15].
Definition of signatures of metastatic colorectal cancer and associated metastasis to liver
The comparison of the matrisome compositions of normal tissues (colon and liver) and colorectal tumors (primary tumors and metastases) revealed that a large fraction of proteins (69) are ubiquitously expressed and detected in all four tissue types (Figure 2E, Figure 3). We observed that half of the glycoproteins detected (Figure 3A) as well as most of the collagens (Figure 3B) and proteoglycans (Figure 3C) are common to the four tissue types. Components associated with the extracellular matrix, such as ECM regulators (that include ECM remodeling enzymes) or ECM-affiliated proteins and ECM-associated secreted factors (growth factors, cytokines etc.) are present at lower abundance in the ECM-rich samples (Additional file 2B) and are, for the most part, restricted to certain tissues (Figure 3, Additional file 2B). This comparison also revealed that the ECM composition of metastases to liver resembles more the ECM of primary colorectal tumors than that of normal liver. Importantly, we identified subsets of tumor-specific proteins: 37 proteins were characteristic of the colon tumor matrisome, 7 proteins were characteristic of the metastasis matrisome and 23 proteins were characteristic of both primary tumors and metastases (Figure 2E, Figure 3, and Additional file 2C).
Gene Set enrichment analyses identify subsets of ECM-encoding genes strongly correlated with primary colon tumors or their metastases to liver
We next sought to explore potential correlations between our data and other clinical data sets. Accordingly, we used Gene Set Enrichment Analysis  to evaluate the relationship between the proteomics-derived ECM signatures for i) colon primary tumors and ii) metastases (Figure 2E, Figure 3; see Methods for further details) and microarray-based gene expression studies involving large cohorts of patients. The four relevant clinical gene expression data sets analyzed [17–20] represent a total of 289 primary colon tumor samples and 120 metastasis samples (Additional file 5). Comparisons were set up between colon tumor metastases to liver and primary tumor samples; hence a positive enrichment score will indicate enrichment in metastasis samples and a negative enrichment score, enrichment in primary colon tumor samples (Figure 4A,B and Additional file 5). The distributions of genes in our two signatures within these comparisons were characterized with GSEA and we consistently observed enrichment of our proteomics-based signatures in their corresponding phenotypic classes in the gene expression experiments. The metastasis signature was consistently enriched in the metastasis samples in all four comparisons, with enrichment scores ranging between 1.112 and 1.815 (Figure 4A, Additional file 6, left panels). The colon cancer signature set was enriched in the colon cancer class in all four comparisons, with enrichment scores ranging from -0.876 and -1.481 (Figure 4B, Additional file 6, right panels).
Next, we evaluated which of the genes from our ECM signatures consistently ranked most highly with regard to class association by comparing the composition of the various leading edge sets (Figure 4C,D, see Methods). Despite the fact that it is well known that protein levels do not necessarily correlate with mRNA levels and one would not expect one-to-one concordance, we identified a subset of three out of the seven genes from the metastasis signature (HPX, SPP1 and COMP) that were closely associated with the metastasis class in three out of the four clinical expression datasets (Figure 4C). We also identified a subset of four genes from the primary colon tumor signature (MMP1, MMP2, MMP11, and LEFTY1) that was associated with the colon cancer class in at least three out of the four clinical expression datasets tested (Figure 4D). An additional six genes (EMID1, MMP12, LEPRE1, MFGE8, MMP9 and FMOD) were strongly associated with primary colon cancers in two out of four clinical gene expression data sets. A recent gene expression profiling study by Lin and colleagues also identified osteopontin (SPP1, Secreted Phospho-Protein1) as being up-regulated in colorectal liver metastases as compared to primary tumors and normal liver, in accordance with our data . The same study also identified SPARC (Secreted Protein Acidic and Rich in Cysteine or osteonectin) in colorectal metastases to liver whereas we have detected SPARC mostly in primary colorectal tumors and only in the metastasis of one patient; our results agree concerning the absence of SPARC in normal liver . Periostin has also been reported to be associated with colon cancer metastasis [22, 23]; we detected this protein in all the tissue samples studied – it was not specific to malignant tissue but that does not rule out a role in malignant progression. Of note, SPP1, SPARC and periostin have all been implicated in other types of metastatic cancer and are often invoked as possible biomarkers of aggressive tumor types [24–27].
In addition to these concordances with clinical gene expression data, our proteomics-based discovery pipeline identified other proteins that are potential serological biomarkers for colorectal cancer patients: a recent study by Yao and colleagues  identified EFEMP2 (EGF-containing fibulin-like extracellular matrix protein 2 or Fibulin 4) and thrombospondin 2 (THBS2) as a potential biomarkers detected in the serum of colorectal cancer patients. In accord with these data, we detected both EFEMP2 and THBS2 in both primary and secondary colon tumors but not in normal tissues. Another example is Tissue Inhibitor of Metalloproteinase-1 (TIMP1), detected in both primary metastatic tumors and metastases to liver in our study, and previously found to be elevated in the serum of colorectal cancer patients and proposed to be not only a good diagnostic factor but also a good predictor and indicator of response to chemotherapy [29, 30]. Both EFEMP2 and TIMP1 are also proposed to be of superior value than carcinoembryonic-antigen (CEA), the only biomarker currently used to diagnose and monitor the treatment of colorectal cancer patients .
We demonstrate in this study that we can characterize in detail the composition of the extracellular matrix of normal tissues and tumors using small samples from human colorectal cancer patients. We show that our proteomics approach can robustly define the matrisomes of tumors and matched normal samples. Based on this analysis, we established ECM signatures characteristic of primary metastatic colorectal tumors and their metastases to liver. Further work is needed to determine whether the proteins identified in our study play any functional roles in colorectal tumor progression and metastasis. Future work will aim to characterize the ECM composition of poorly metastatic tumors and of tumors and metastases that do or do not respond to chemotherapy.Although we focused here on proteins common to the three patients studied, it is worth noting that we also identified patient-specific sets of ECM proteins (white areas of the Venn diagrams presented in Figure 2). We hypothesize that these may correlate with some property of the tumors (particular region sampled, stage, etc.) that is beyond the scope of this study to determine due to the small number of tumors examined. Nonetheless, one can postulate that within the signatures defined here are sets of ECM proteins that could serve as novel biomarkers of metastatic potential in primary tumor biopsies and could, furthermore, be used to detect small disseminated metastases that remain undetected by current imaging methods. In order to test these postulates, one needs screening of larger cohorts of patients for presence or absence of these ECM proteins. Our initial comparisons between the ECM protein signatures and mRNA expression datasets support the correlation with metastatic progression of some of the proteins defined – others may well correlate when examined at the protein level (e.g., by immunohistochemistry of tumor tissue microarrays or by serological measurements). ECM proteins are particularly favorable candidate biomarkers since they are abundant, are laid down in characteristic patterns and are readily accessible. ECM protein levels may indeed be more appropriate indicators of tumor properties than mRNA levels since proteins are the operative molecules in the tumor microenvironment and changes occur at that level that are not reflected at the mRNA level because of post-transcriptional processes (translation, stability etc.).
Methods developed by others have already been used effectively to target tumor vascular-specific ECM proteins (splice variants of fibronectin or tenascin) for use in imaging tumors and metastases in mouse models and patients and also for targeting isotopes, drugs and cytokines to tumors for therapeutic applications . A recent study demonstrated that administration of interleukin 12 coupled to an antibody directed against a tumor-specific spliced isoform of fibronectin (a major ECM protein) led to the regression of various tumors including subcutaneous CT26 colon carcinoma tumors . Such approaches could provide sorely needed new strategies for the treatment and management of metastatic colon cancers and we hope that the definition of larger numbers of ECM biomarkers will contribute to improvements in colon cancer patients’ diagnosis, prognosis, treatment and survival.
Availability of supporting data
In addition to the supporting data included as additional files, the raw GSEA data may be downloaded from: http://rowley.mit.edu/Hynes/Naba_GSEA_ColonCancer/. The raw mass spectrometry data accompanying this publication have been deposited in the public proteomics repository MassIVE (http://massive.ucsd.edu) using the identifier: MSV000078555. The data should be accessible at ftp://MSV000078555:email@example.com.
Liquid chromatography and tandem mass spectrometry
Rustgi AK: The genetics of hereditary colon cancer. Genes Dev. 2007, 21: 2525-2538.
Network CGA: Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012, 487: 330-337.
Hanash S, Taguchi A: The grand challenge to decipher the cancer proteome. Nat Rev Cancer. 2010, 10: 652-660.
De Wit M, Fijneman RJA, Verheul HMW, Meijer GA, Jimenez CR: Proteomics in colorectal cancer translational research: biomarker discovery for clinical applications. Clin Biochem. 2013, 46: 466-479.
Luo Y, Wang L, Wang J: Developing proteomics-based biomarkers for colorectal neoplasms for clinical practice: opportunities and challenges. Proteomics Clin Appl. 2013, 7: 30-41.
Hynes RO: The extracellular matrix: not just pretty fibrils. Science. 2009, 326: 1216-1219.
Hynes RO, Naba A: Overview of the matrisome–an inventory of extracellular matrix constituents and functions. Cold Spring Harb Perspect Biol. 2012, 4: a004903-
Cretu A, Brooks PC: Impact of the non-cellular tumor microenvironment on metastasis: potential therapeutic and imaging opportunities. J Cell Physiol. 2007, 213: 391-402.
Egeblad M, Nakasone ES, Werb Z: Tumors as organs: complex tissues that interface with the entire organism. Dev Cell. 2010, 18: 884-901.
Lu P, Weaver VM, Werb Z: The extracellular matrix: a dynamic niche in cancer progression. J Cell Biol. 2012, 196: 395-406.
Jacobetz MA, Chan DS, Neesse A, Bapiro TE, Cook N, Frese KK, Feig C, Nakagawa T, Caldwell ME, Zecchini HI, Lolkema MP, Jiang P, Kultti A, Thompson CB, Maneval DC, Jodrell DI, Frost GI, Shepard HM, Skepper JN, Tuveson DA: Hyaluronan impairs vascular function and drug delivery in a mouse model of pancreatic cancer. Gut. 2013, 62: 112-120.
Chauhan VP, Stylianopoulos T, Boucher Y, Jain RK: Delivery of molecular and nanoscale medicine to tumors: transport barriers and strategies. Annu Rev Chem Biomol Eng. 2011, 2: 281-298.
Naba A, Clauser KR, Hoersch S, Liu H, Carr SA, Hynes RO: The matrisome: in silico definition and in vivo characterization by proteomics of normal and tumor extracellular matrices. Mol Cell Proteomics. 2012, 11: M111.014647-
Naba A, Clauser KR, Lamar JM, Carr SA, Hynes RO: Extracellular matrix signatures of human mammary carcinoma identify novel metastasis promoters. eLife. 2014, 3: e01308-
Naba A, Hoersch S, Hynes RO: Towards definition of an ECM parts list: an advance on GO categories. Matrix Biol. 2012, 31: 371-372.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005, 102: 15545-15550.
Del Rio M, Mollevi C, Vezzio-Vie N, Bibeau F, Ychou M, Martineau P: Specific extracellular matrix remodeling signature of colon hepatic metastases. PLoS One. 2013, 8: e74599-
Ki DH, Jeung H-C, Park CH, Kang SH, Lee GY, Lee WS, Kim NK, Chung HC, Rha SY: Whole genome analysis for liver metastasis gene signatures in colorectal cancer. Int J Cancer J Int Cancer. 2007, 121: 2005-2012.
Tsuji S, Midorikawa Y, Takahashi T, Yagi K, Takayama T, Yoshida K, Sugiyama Y, Aburatani H: Potential responders to FOLFOX therapy for colorectal cancer by random forests analysis. Br J Cancer. 2012, 106: 126-132.
Sheffer M, Bacolod MD, Zuk O, Giardina SF, Pincas H, Barany F, Paty PB, Gerald WL, Notterman DA, Domany E: Association of survival and disease progression with chromosomal instability: a genomic exploration of colorectal cancer. Proc Natl Acad Sci U S A. 2009, 106: 7131-7136.
Lin AY, Chua M-S, Choi Y-L, Yeh W, Kim YH, Azzi R, Adams GA, Sainani K, van de Rijn M, So SK, Pollack JR: Comparative profiling of primary colorectal carcinomas and liver metastases identifies LEF1 as a prognostic biomarker. PLoS One. 2011, 6: e16636-
Ben Q-W, Zhao Z, Ge S-F, Zhou J, Yuan F, Yuan Y-Z: Circulating levels of periostin may help identify patients with more aggressive colorectal cancer. Int J Oncol. 2009, 34: 821-828.
Bao S, Ouyang G, Bai X, Huang Z, Ma C, Liu M, Shao R, Anderson RM, Rich JN, Wang X-F: Periostin potently promotes metastatic growth of colon cancer by augmenting cell survival via the Akt/PKB pathway. Cancer Cell. 2004, 5: 329-339.
Chlenski A, Cohn SL: Modulation of matrix remodeling by SPARC in neoplastic progression. Semin Cell Dev Biol. 2010, 21: 55-65.
Ahmed M, Behera R, Chakraborty G, Jain S, Kumar V, Sharma P, Bulbule A, Kale S, Kumar S, Mishra R, Raja R, Saraswati S, Kaur R, Soundararajan G, Kumar D, Thorat D, Sanyal M, Ramdasi A, Ghosh P, Kundu GC: Osteopontin: a potentially important therapeutic target in cancer. Expert Opin Ther Targets. 2011, 15: 1113-1126.
Rangaswami H, Bulbule A, Kundu GC: Osteopontin: role in cell signaling and cancer progression. Trends Cell Biol. 2006, 16: 79-87.
Liu AY, Zheng H, Ouyang G: Periostin, a multifunctional matricellular protein in inflammatory and tumor microenvironments. Matrix Biol J Int Soc Matrix Biol. 2014
Yao L, Lao W, Zhang Y, Tang X, Hu X, He C, Hu X, Xu LX: Identification of EFEMP2 as a serum biomarker for the early detection of colorectal cancer with lectin affinity capture assisted secretome analysis of cultured fresh tissues. J Proteome Res. 2012, 11: 3281-3294.
Holten-Andersen MN, Christensen IJ, Nielsen HJ, Stephens RW, Jensen V, Nielsen OH, Sørensen S, Overgaard J, Lilja H, Harris A, Murphy G, Brünner N: Total levels of tissue inhibitor of metalloproteinases 1 in plasma yield high diagnostic sensitivity and specificity in patients with colon cancer. Clin Cancer Res. 2002, 8: 156-164.
Sørensen NM, Byström P, Christensen IJ, Berglund A, Nielsen HJ, Brünner N, Glimelius B: TIMP-1 is significantly associated with objective response and survival in metastatic colorectal cancer patients receiving combination of irinotecan, 5-fluorouracil, and folinic acid. Clin Cancer Res. 2007, 13: 4117-4122.
McPherson RA, Pincus MR: Henry’s Clinical Diagnosis and Management by Laboratory Methods. 2011, Philadelphia, PA, U.S.A: Elsevier Health Sciences, 22
Pasche N, Neri D: Immunocytokines: a novel class of potent armed antibodies. Drug Discov Today. 2012, 17: 583-590.
Pasche N, Wulhfard S, Pretto F, Carugati E, Neri D: The antibody-based delivery of interleukin-12 to the tumor neovasculature eradicates murine models of cancer in combination with paclitaxel. Clin Cancer Res. 2012, 18: 4092-4103.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2407/14/518/prepub
The authors wish to thank Dr. Jie Wu from the Barbara K. Ostrom Bioinformatics and Computing facility at the Swanson Biotechnology Center for assistance with Gene Set Enrichment Analyses, Dr. John Lamar for critical reading of the manuscript and the members of the Hynes Lab for helpful discussions.
This work was supported by grants from the National Cancer Institute – Tumor Microenvironment Network (U54 CA126515/CA163109), the Broad Institute of MIT and Harvard, the Howard Hughes Medical Institute, of which ROH is an Investigator and in part by Support Grant to the Koch Institute (P30-CA14051 from the National Cancer Institute). AN was supported by postdoctoral fellowships from the Howard Hughes Medical Institute and the Ludwig Center for Cancer Research.
The authors declare that they have no competing interests.
Conception and design of the experiments: AN, KRC, SAC, KKT, ROH. Development of proteomics methodology: AN, KRC. Acquisition of data: AN, KRC. Analysis and interpretation of data: AN, KRC, CAW, ROH. Writing of the manuscript: AN, KRC, CAW, SAC, KKT, ROH. Study supervision: ROH. All authors have read and approved the submitted manuscript.
Electronic supplementary material
Additional file 2: A. Complete proteomics data set. Proteins are sorted by matrisome divisions (i.e. core matrisome, matrisome associated or other, column A), matrisome categories (column B) and overall confidence score (column BS). B. Subset of extracellular matrix proteins detected in any samples. Data were extracted from Additional file 2A. C. Abundance of ECM proteins characteristic of metastatic colon tumors, metastases or metastatic colon tumors and metastases (see Figure 2E). Numbers represent the abundance of each protein and correspond to the sum of the abundance across the independent samples for each protein (calculated from Additional file 2B, columns F, N and R for normal colon samples; columns V, AD and AL for colon tumor samples; from columns AT, BB and BF for colon tumor metastasis to liver samples and columns BJ and BN for normal liver samples). Dash (-) indicates that the protein was detected in none of the independent samples. White cells are indicative of proteins that were detected in one patient only and thus do not qualify to be part of the matrisome of a given tissue. (XLSX 3 MB)
Additional file 3: Detailed list of all of the confidently identified peptide spectrum matches (PSMs) from the LC-MS/MS runs of each of the normal tissues and tumor samples analyzed. Due to its large size, the excel file has been deposited in the public proteomics repository MassIVE and is accessible in the Results directory at ftp://MSV000078555@massive.ucsd.edu/. (DOCX 14 KB)
Additional file 4: Intra-patient reproducibility (Patient 2). Venn diagrams represent the intra-patient reproducibility assessed by comparing the ECM composition of two distinct pieces of the same colon tumor (A) or metastasis (B) from patient 2. (PDF 37 KB)
Additional file 5: A. List of publicly available clinical gene expression datasets used for GSEA. B. Values indicate for each gene of the signatures its rank metric scores (used to build Figure 4C and D). (XLSX 15 KB)
Additional file 6: GSEA Enrichment Plots. Enrichment plots generated by comparing the metastasis ECM gene set (left panels) and the primary metastatic colon cancer gene set (right panels) defined in this study with four publicly available clinical gene expression data sets (see Additional file 5). (PDF 590 KB)
About this article
Cite this article
Naba, A., Clauser, K.R., Whittaker, C.A. et al. Extracellular matrix signatures of human primary metastatic colon cancers and their metastases to liver. BMC Cancer 14, 518 (2014). https://doi.org/10.1186/1471-2407-14-518