The Colorectal cancer disease-specific transcriptome may facilitate the discovery of more biologically and clinically relevant information
- Wendy L Allen†1,
- Puthen V Jithesh†1,
- Gavin R Oliver2,
- Irina Proutski1,
- Daniel B Longley1,
- Heinz-Josef Lenz3,
- Vitali Proutski2,
- Paul Harkin1, 2 and
- Patrick G Johnston1, 2
© Allen et al; licensee BioMed Central Ltd. 2010
Received: 8 December 2009
Accepted: 20 December 2010
Published: 20 December 2010
To date, there are no clinically reliable predictive markers of response to the current treatment regimens for advanced colorectal cancer. The aim of the current study was to compare and assess the power of transcriptional profiling using a generic microarray and a disease-specific transcriptome-based microarray. We also examined the biological and clinical relevance of the disease-specific transcriptome.
DNA microarray profiling was carried out on isogenic sensitive and 5-FU-resistant HCT116 colorectal cancer cell lines using the Affymetrix HG-U133 Plus2.0 array and the Almac Diagnostics Colorectal cancer disease specific Research tool. In addition, DNA microarray profiling was also carried out on pre-treatment metastatic colorectal cancer biopsies using the colorectal cancer disease specific Research tool. The two microarray platforms were compared based on detection of probesets and biological information.
The results demonstrated that the disease-specific transcriptome-based microarray was able to out-perform the generic genomic-based microarray on a number of levels including detection of transcripts and pathway analysis. In addition, the disease-specific microarray contains a high percentage of antisense transcripts and further analysis demonstrated that a number of these exist in sense:antisense pairs. Comparison between cell line models and metastatic CRC patient biopsies further demonstrated that a number of the identified sense:antisense pairs were also detected in CRC patient biopsies, suggesting potential clinical relevance.
Analysis from our in vitro and clinical experiments has demonstrated that many transcripts exist in sense:antisense pairs including IGF2BP2, which may have a direct regulatory function in the context of colorectal cancer. While the functional relevance of the antisense transcripts has been established by many studies, their functional role is currently unclear; however, the numbers that have been detected by the disease-specific microarray would suggest that they may be important regulatory transcripts. This study has demonstrated the power of a disease-specific transcriptome-based approach and highlighted the potential novel biologically and clinically relevant information that is gained when using such a methodology.
Response rates for advanced colorectal cancer (CRC) remain disappointingly low at 40-50% for 5-FU-based combination therapies [1, 2]. The poor response rates are due to drug resistance, which is either inherent or acquired in nature. A number of predictive markers of response to these therapies have been proposed; however, the results are controversial [3–16] and, to date, outside of KRAS testing, no predictive markers have made the transition to routine clinical use. Given this lack of clinical implementation of molecular markers, there is a need to identify robust predictive markers of response to ultimately increase response rates to treatment in these patients.
Many studies have identified predictive markers or cassettes of predictive markers using gene expression measurements [3, 17–21]. Within the current study we have utilized the leading generic microarray and compared it to a disease-specific transcriptome-based microarray. It is of interest to assess the content of the unique information present in the disease-specific microarray in relation to drug treatment and in the identification of potential predictive markers in this disease setting. Recently, the ENCODE pilot project published its findings on the detailed characterization of 1% of the human genome. The study observed a much higher level of transcription than was originally thought to occur, including a high level of non-protein encoding transcripts. Indeed, several studies have suggested that up to 20% of all protein-encoding genes could have an associated natural antisense transcript (NAT). The aim of the present study was to assess the benefit of a disease-specific transcriptome-based profiling approach compared to a generic genomic-based microarray. In addition, we examined the composition of the disease-specific transcripts and found a high level of NAT expression both in vitro and clinically in this disease setting. These may have a functional role in response to drug treatment in colorectal cancer and warrant further investigation.
Microarray Profiling and Experiment Design
We have previously carried out microarray profiling experiments using HCT116 colorectal cancer cells on the Affymetrix HGU133 Plus2.0 array (Plus2.0 array) and the Almac Diagnostics Colorectal cancer DSA (Colorectal DSA). HCT116 parental cells and 5-FU-resistant daughter cells were either untreated (0 h control) or treated with 5 μM 5-FU (IC50(72 h) for the parental cell line) for 24 hours (Additional File 1A). The comparison between parental control and parental treated with 5-FU is referred to as the 'sensitive' experiment, while the comparison between the resistant control and the resistant treated with 5-FU is referred to as the 'resistant' experiment. Microarray profiling was carried out on 28 pre-treatment (Irinotecan/5-FU) metastatic biopsies using the Colorectal DSA. All patients provided written fully informed consent as per IRB guidelines in the University of Southern California and approval was granted from this body. These patients underwent biopsy of colorectal liver metastases prior to commencing irinotecan/5-FU chemotherapy on the IFL schedule. Detailed experimental protocols and raw expression data are available at http://www.ebi.ac.uk/arrayexpress/ (Accession numbers E-MEXP-1691 (in vitro) and E-MEXP-1692 (Clinical) for Colorectal DSA analysis and Accession number E-MEXP-390 for Affymetrix Plus2.0 analysis).
Quantitative reverse transcription-PCR analysis
Total RNA was isolated using RNA STAT-60 (Tel-Test, Inc.) according to the manufacturer's instructions. Reverse transcription was carried out on 2 μg of RNA using a Moloney murine leukemia virus-based reverse transcriptase kit (Invitrogen) according to the manufacturer's instructions. Quantitative reverse transcription-PCR (RT-PCR) amplification was carried out in a final volume of 10 μL containing 5 μL of 2×SYBR green master mix (Qiagen), 4 μL of primers (2 μM), and 1 μL of cDNA using an Opticon DNA Engine Thermal Cycler (Bio-Rad Laboratories, Inc., Waltham, MA) using methods previously described. All amplifications were primed by pairs of chemically synthesized 18- to 22-mer oligonucleotides designed using freely available primer design software (Primer3) http://frodo.wi.mit.edu/primer3/ (Additional file 2).
Derivation of unique microarray content lists
HG-U133 Plus2.0 full sequences and probes were downloaded from the Affymetrix website http://www.affymetrix.com/ in FASTA format. Probe and full sequences used in the design of the Colorectal DSA were obtained from Almac Diagnostics in FASTA format.
Probe sequences from the Colorectal DSA probesets were aligned against the Plus2.0 array full length sequences using BLAST. Where 6 or more probes from a probeset (usually 11 probes) aligned to the same sequence with 100% identity over their entire length, the DSA probeset and the Affymetrix sequence were considered 'common'. Full length sequences representing the DSA probesets not considered common at this stage were extracted and the Plus2.0 array probesets were BLASTed against them. Where 6 or more probes from a probeset (usually 11 probes) aligned to the same sequence with 100% identity over their entire length, the DSA sequence and the Affymetrix probeset were again considered 'common'. Those sequences/probesets not considered common at this stage formed the 'unique' groupings.
Data analysis was conducted using either Genespring GX v 7.3.1 (Agilent Technologies, UK) or the R statistical package and Bioconductor. Background correction, scaling and summarization of the raw data to generate expression values were done with the MAS5 algorithm. The experiment was set up to measure the ability of each microarray platform to detect probesets, to detect differentially expressed probesets and to detect biologically relevant (cancer-related) probesets.
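The 'common' rule described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the pipeline used in the study: it assumes the BLAST hits have already been parsed into records, and the names ProbeHit and common_pairs, as well as the toy probeset and sequence identifiers, are hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class ProbeHit(NamedTuple):
    """One parsed alignment of a single probe (hypothetical record layout)."""
    probeset: str    # probeset the probe belongs to
    probe: str       # probe identifier within the probeset
    target: str      # full-length sequence the probe aligned to
    identity: float  # percent identity of the alignment
    align_len: int   # aligned length in bases
    probe_len: int   # full probe length in bases


def common_pairs(hits, min_probes=6):
    """Return (probeset, target) pairs supported by at least `min_probes`
    distinct probes that align at 100% identity over their entire length."""
    perfect = defaultdict(set)
    for h in hits:
        if h.identity == 100.0 and h.align_len == h.probe_len:
            perfect[(h.probeset, h.target)].add(h.probe)
    return {pair for pair, probes in perfect.items() if len(probes) >= min_probes}


# Toy data: ps1 has 6 perfect probes; ps2 has 5 perfect probes and 1 imperfect one.
hits = [ProbeHit("ps1", f"p{i}", "seqA", 100.0, 25, 25) for i in range(6)]
hits += [ProbeHit("ps2", f"p{i}", "seqB", 100.0, 25, 25) for i in range(5)]
hits.append(ProbeHit("ps2", "p5", "seqB", 96.0, 25, 25))
print(common_pairs(hits))
```

Probesets that never appear in the returned set after both alignment directions would correspond to the 'unique' groupings.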
The detection of probesets was measured based on the MAS5 present, marginal and absent flag calls. For all the replicates, probesets passing the flag call filter as present or marginal were counted using data from the whole microarrays in both Plus2.0 array and Colorectal DSA. The number of probesets consistently detected across the 3 replicates in each condition, i.e., untreated parental, 5-FU treated parental, untreated 5-FU resistant and 5-FU treated 5-FU resistant, was calculated by selecting the probesets passing the flag filter in all the 3 replicates in each case.
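As an illustration of the detection filter, the sketch below counts probesets whose MAS5 flag call is present ('P') or marginal ('M') in every replicate of a condition. The input layout (a dictionary from probeset ID to a list of flag calls) and the function name consistently_detected are assumptions made for this example only.

```python
def consistently_detected(calls_by_probeset):
    """Return probesets flagged present ('P') or marginal ('M') in all replicates."""
    return {
        probeset
        for probeset, calls in calls_by_probeset.items()
        if calls and all(call in ("P", "M") for call in calls)
    }


# Toy flag calls for three replicates of one condition.
calls = {
    "ps1": ["P", "P", "M"],  # passes: P/M in all three replicates
    "ps2": ["P", "A", "P"],  # fails: absent in one replicate
    "ps3": ["M", "M", "M"],  # passes: marginal counts as detected
}
detected = consistently_detected(calls)
print(len(detected), sorted(detected))
```

Running the same function once per condition (untreated parental, treated parental, untreated resistant, treated resistant) gives the per-condition counts described above.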
Differential Expression Filtering
For both the Colorectal DSA and the Plus2.0 complete microarray data, following detection filtering, probesets were further filtered based on fold change in expression and a statistical filter in the case of both HCT116 parental and 5-FU resistant cell line data. Differential expression was measured between untreated and 5-FU treated samples in both the sensitive and resistant experiments. All probesets passing the fold change filter of 1.3 fold and also with a t-test p-value less than 0.05 were counted for differentially expressed transcripts.
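The combined fold-change and statistical filter can be sketched as follows. This is a simplified illustration, not the Genespring procedure: it assumes that expression values are positive and on a linear scale, that the t-test p-value has already been computed elsewhere, that the 1.3-fold threshold applies symmetrically to up- and down-regulation, and that the threshold is inclusive. The function name and toy values are hypothetical.

```python
def differentially_expressed(rows, min_fold=1.3, max_p=0.05):
    """Select probesets passing the fold-change and p-value filters.

    rows: iterable of (probeset, control_mean, treated_mean, p_value),
    with positive linear-scale means (assumption for this sketch).
    """
    selected = []
    for probeset, control, treated, p_value in rows:
        ratio = treated / control
        # Symmetric fold change: a 2-fold decrease counts the same as a 2-fold increase.
        fold = ratio if ratio >= 1 else 1 / ratio
        if fold >= min_fold and p_value < max_p:
            selected.append(probeset)
    return selected


rows = [
    ("a", 100.0, 140.0, 0.010),  # 1.4-fold up, significant
    ("b", 100.0, 120.0, 0.001),  # only 1.2-fold
    ("c", 200.0, 100.0, 0.040),  # 2-fold down, significant
    ("d", 100.0, 300.0, 0.200),  # 3-fold up, not significant
]
print(differentially_expressed(rows))
```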
All pathway analysis was carried out using Genespring v7.3.1 (Agilent Technologies, UK) using both KEGG and GenMAPP pathways. Pathway analysis was carried out using the complete content of each microarray platform for those probesets that were detected (present/marginal) and differentially expressed (1.3-fold + t test) in the sensitive and resistant experiments and pathways were selected that contained greater than 15 genes (sensitive experiment) and 10 genes (resistant experiment) per pathway. Statistical analysis for each pathway was carried out using hypergeometric statistics. The number of genes per pathway cut-off was selected based on the total number of genes contained within a given experiment.
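To make the hypergeometric step concrete, the sketch below computes an over-representation p-value for a single pathway. It is a generic illustration and not the Genespring implementation: the use of the upper tail, P(X >= overlap), is an assumption, and the function name and the toy numbers are hypothetical.

```python
from math import comb


def hypergeom_pvalue(universe, pathway_size, selected, overlap):
    """Upper-tail hypergeometric probability P(X >= overlap).

    universe:     number of genes represented on the array
    pathway_size: number of those genes annotated to the pathway
    selected:     number of differentially expressed genes
    overlap:      number of differentially expressed genes in the pathway
    """
    upper = min(pathway_size, selected)
    total = comb(universe, selected)
    tail = sum(
        comb(pathway_size, k) * comb(universe - pathway_size, selected - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Toy example: 10 genes in total, 4 in the pathway, 3 selected, 2 of them in the pathway.
p = hypergeom_pvalue(universe=10, pathway_size=4, selected=3, overlap=2)
print(round(p, 4))
```

A minimum pathway size, such as the 15-gene and 10-gene cut-offs described above, would be applied before this calculation so that very small pathways are not tested.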
Analysis of sense and antisense probesets
The total number of sense and antisense transcripts in the unique content (23,089 probesets) of the Colorectal DSA was assessed. In addition to the Colorectal DSA-specific probesets, the numbers of sense and antisense probesets within this group that were detected in the in vitro (sensitive and resistant) and clinical experiments were also assessed independently. Detection was determined by the present and marginal flag calls as described earlier. Finally, probesets that passed both the detection filter and the differential expression filter were classified into sense and antisense orientations and counted.
In all of the cases above, we further investigated whether sense:antisense (SAS) pairs exist.
Genomic Alignment of SAS pairs
Full sequences corresponding to the Colorectal DSA probesets were aligned to the human genome (Ensembl release 51.36 m; NCBI build 36) using BLAT via the Ensembl website http://www.ensembl.org/index.html. The highest scoring alignments were viewed using the 'region in detail' view of the Ensembl genome browser. Tracks were customized to include known Ensembl genes and GENSCAN predicted genes using the 'configure this page' option.
All probesets from the colorectal DSA analysis of metastatic CRC patient samples were initially filtered using detection flag calls, with present or marginal calls in > 50% of all samples. Subsequently, probesets were filtered using differential expression with a change of 1.5-fold in at least one condition (CR, PR, SD and PD). Sense and antisense probes were isolated from each list and only the probes with associated annotation were taken forward for SAS pair analysis.
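The two clinical filters can be sketched together as below. Several details are assumptions made only for this illustration: the text does not state what the 1.5-fold change is measured against, so the caller supplies a baseline value (for example a median across all samples); the fold change is treated as symmetric; and the function name and toy values are hypothetical.

```python
def passes_clinical_filter(calls, group_means, baseline,
                           min_fraction=0.5, min_fold=1.5):
    """Apply the detection and fold-change filters to one probeset.

    calls:       MAS5 flag calls ('P', 'M' or 'A'), one per patient sample
    group_means: mean linear-scale expression per response group (CR, PR, SD, PD)
    baseline:    reference expression level (assumed, not stated in the text)
    """
    detected_fraction = sum(c in ("P", "M") for c in calls) / len(calls)
    detected = detected_fraction > min_fraction  # strictly more than 50%

    def fold(value):
        ratio = value / baseline
        return ratio if ratio >= 1 else 1 / ratio

    changed = any(fold(mean) >= min_fold for mean in group_means.values())
    return detected and changed


# Toy probeset across 28 samples: detected in 15 of 28, with PD at 1.6-fold.
calls = ["P"] * 15 + ["A"] * 13
means = {"CR": 100.0, "PR": 110.0, "SD": 95.0, "PD": 160.0}
print(passes_clinical_filter(calls, means, baseline=100.0))
```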
Comparison of the Affymetrix HGU133 Plus2.0 microarray with the Almac Diagnostics Colorectal Cancer DSA
Content of Plus2.0 array and Colorectal DSA
To compare the two microarray platforms, we compared the complete content of each array based on detection (Affymetrix MAS5 present (P) or marginal (M) flag calls) and detection + differential expression (1.3-fold change and t-test p-value <0.05).
Validation of In Vitro Microarray Analyses
In order to validate the microarray results, we measured the expression of a representative number of genes from the in vitro Colorectal DSA experiment by quantitative RT-PCR; we have previously validated the Plus2.0 array experiment. For the Colorectal DSA, 13 genes (Additional file 2) were selected for validation; all validations were carried out in three independent experiments (Additional file 3). The genes were selected based on fold-induction, with both highly and more moderately induced genes chosen and both up-regulated and down-regulated genes analyzed.
For the selected genes, the average fold-changes by both microarray and quantitative RT-PCR were log transformed and the correlation between the expression values was examined using Pearson's product-moment correlation coefficient (r). For the 13 genes acutely altered in the HCT116 parental cells following 5-FU treatment over 24 h, the Pearson's correlation (r) was 0.75, with r² = 0.57 (p = 0.0032). In terms of the basal alterations between parental and 5-FU-resistant cells, Pearson's correlation (r) of the 13 genes was 0.78, with r² = 0.61 (p = 0.0017) (Additional file 3). Taken together, these results demonstrate that there is a strong overall concordance between the real-time PCR validation and the microarray experiment, and therefore highlight the robustness of the original microarray experiment.
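The concordance calculation can be sketched as follows: fold changes from both methods are log transformed and Pearson's r is computed on the transformed values. The fold-change values below are invented toy numbers, not data from the study, and the choice of log base 2 is an assumption; r is unaffected by the base because changing it only rescales both variables.

```python
from math import log2, sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Toy fold changes (treated / control) for the same genes measured by both methods.
array_fc = [2.0, 4.0, 0.5, 8.0, 1.5]
pcr_fc = [3.0, 9.0, 0.4, 20.0, 1.2]

log_array = [log2(v) for v in array_fc]
log_pcr = [log2(v) for v in pcr_fc]

r = pearson_r(log_array, log_pcr)
print(f"r = {r:.2f}, r^2 = {r * r:.2f}")
```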
Analysis of the total content of the microarray platforms
The complete content of each microarray was also compared based on detection and differential expression. The colorectal DSA detected a higher number of probesets compared to the Plus2.0 array when detection and differential expression were taken into account. The colorectal DSA detected 3713 differentially expressed probesets in the sensitive and 1660 differentially expressed probesets in the resistant experiments while the Plus2.0 array detected only 3296 differentially expressed probesets in the sensitive and 564 differentially expressed probesets in the resistant experiments (Figure 1B). Taken together, these results suggest that the Colorectal DSA consistently detects a higher number of differentially expressed probesets and displays a lower variance between sample replicates.
Pathway analysis of the microarray platforms
Pathway analysis from Plus2.0 array
- Starch and sucrose metabolism
- Ubiquitin mediated proteolysis
- Wnt signaling pathway
Pathway analysis from Colorectal DSA
- Biosynthesis of steroids
- Fatty acid metabolism
- Fructose and mannose metabolism
- Insulin signaling pathway
- Starch and sucrose metabolism
- Valine, leucine and isoleucine degradation
We also examined which pathways were differentially regulated in the resistant experiment between the Plus2.0 array and the Colorectal DSA. In the resistant experiment, following filtering (Flags, 1.3-fold and t-test), 1660 genes were identified as altered following 5-FU treatment using the Colorectal DSA, while only 564 genes were identified as altered following 5-FU treatment using the Plus2.0 array. Pathway analysis revealed that 19 pathways were altered following 5-FU treatment using the Colorectal DSA, while only 3 pathways were altered following 5-FU treatment using the Plus2.0 array (Additional File 4). The 3 pathways (Focal adhesion, MAPK signaling and regulation of the actin cytoskeleton) that were identified using the Plus2.0 array were also identified using the Colorectal DSA, therefore, the Plus2.0 array was not identifying any unique information. In addition, there was no overlap in the identified pathways between the sensitive and the resistant experiments using the Plus2.0 array. Using the Colorectal DSA, 16 unique pathways were identified that were not identified by the Plus2.0 array and 4 pathways (Cell cycle, Insulin signaling, Purine metabolism and Pyrimidine metabolism) were identified in both the sensitive and the resistant experiments (Additional File 5). These pathways may play an important role not only in drug response, but also in drug resistance. Overall, it appears that compared to the Plus2.0 array, the Colorectal DSA is providing more biologically relevant information, both in the sensitive and resistant experiments.
Composition of the specific Colorectal DSA content
Sense and antisense in vitro analysis (sensitive and resistant experiments; detection and detection + DE)
Sense:Antisense (SAS) probe pair analysis
In vitro analysis
Of the 1299 Colorectal DSA-specific probesets detected in this experiment, 661 were in the antisense orientation and 638 were in the sense orientation; 45 were common to both the sense and antisense probesets and were termed SAS probe pairs (Additional File 6). Gene ontology analysis revealed that the SAS probesets were involved in a plethora of biological processes; of these, the most statistically robust terms were oxidative phosphorylation, JAK-STAT signaling, phosphorylation, metabolism, cell death and splicing (data not shown).
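One way to picture the identification of SAS probe pairs is as a match between sense and antisense probesets that share the same gene annotation. The sketch below is an assumption about how such a match could be made, not a description of the study's exact procedure; the function name sas_pairs, the probeset IDs and the unpaired gene label are hypothetical, and unannotated probesets are skipped.

```python
from collections import defaultdict


def sas_pairs(sense, antisense):
    """Pair sense and antisense probesets that share a gene annotation.

    sense, antisense: dicts mapping probeset ID -> gene name (or None if unannotated).
    Returns a dict: gene name -> (sense probeset IDs, antisense probeset IDs).
    """
    sense_by_gene, antisense_by_gene = defaultdict(list), defaultdict(list)
    for probeset, gene in sense.items():
        if gene:
            sense_by_gene[gene].append(probeset)
    for probeset, gene in antisense.items():
        if gene:
            antisense_by_gene[gene].append(probeset)
    shared = sense_by_gene.keys() & antisense_by_gene.keys()
    return {g: (sorted(sense_by_gene[g]), sorted(antisense_by_gene[g])) for g in shared}


# Toy annotation tables with hypothetical probeset IDs.
sense = {"s1": "IGF2BP2", "s2": "SOCS6", "s3": "GENE_X", "s4": None}
antisense = {"a1": "IGF2BP2", "a2": "SOCS6", "a3": None}

pairs = sas_pairs(sense, antisense)
print(sorted(pairs))
```

Intersecting the gene sets returned for two experiments (for example, in vitro and clinical) would then give the SAS pairs they have in common.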
Sense and antisense clinical analysis
Detection + DE
When comparing the probesets in the sense orientation, antisense orientation and those that exist in SAS pairs, it was observed that 244 sense probesets are common between the sensitive in vitro and clinical experiments, while 247 sense probesets are common between the resistant in vitro and clinical experiments and 565 sense probesets were found to be common between the sensitive and resistant in vitro experiments. Further analysis demonstrated that 147 antisense probesets were shared between the sensitive in vitro and clinical experiments, 150 antisense probesets were common between the resistant in vitro and clinical experiments, while 582 antisense probesets were common between the sensitive and resistant in vitro experiments. Finally, in terms of those probesets that were detected as SAS pairs, 7 were common between the sensitive in vitro and clinical experiments, 5 SAS pairs were shared between the resistant in vitro and clinical experiments and 34 SAS pairs were common between the sensitive and resistant in vitro experiments (Additional File 7).
The aim of this study was to compare transcriptional profiling data generated from colorectal cancer cell lines following treatment with 5-FU using either a leading generic genomic-based microarray (Plus2.0 array) or a disease-specific transcriptomic-based microarray (Colorectal DSA). The Colorectal DSA was developed based on the colorectal transcriptome, which was generated from large-scale in-house sequencing, public data mining and experimental investigation. The DSA is a transcriptome-based array, as opposed to the Plus2.0 array, which is a genomic-based array. Given the greater complexity of the transcriptome in comparison to the genome, it would be expected that an array of this type would detect a greater number of transcripts. When comparing the Colorectal DSA to the Plus2.0 array, the Colorectal DSA contains 37.5% unique information (23,089 probesets) that is not contained on the Plus2.0 array, and the aim of the current study was to assess how important this unique information is. One of the benefits of the Colorectal DSA is that it is also based on the Affymetrix GeneChip technology, meaning that cross-platform comparisons are possible.
The same experimental design was used for each microarray study, consisting of parental or 5-FU-resistant HCT116 cells either untreated or treated with 5-FU for 24 h. The resultant expression profile generated from the parental cells following treatment with 5-FU was termed as the sensitive experiment, while the expression profile generated from the resistant cells following 5-FU was termed as the resistant experiment. To assess the performance of each microarray platform we compared the complete content (all probesets) of the arrays based on detection (Flags, present or marginal) and detection plus differential expression.
Following analysis of the complete content of the microarrays, the Colorectal DSA outperformed the Plus2.0 array in terms of probesets detected and detected plus differentially expressed, and also displayed a lower variance between sample replicates. In addition, the Colorectal DSA identified more pathways in both the sensitive and the resistant experiments when compared to the Plus2.0 array, and also identified common pathways important for both drug response and drug resistance: cell cycle, insulin signaling, purine metabolism and pyrimidine metabolism. Indeed, it is not surprising that cell cycle, purine and pyrimidine metabolism pathways were altered following 5-FU treatment in sensitive and 5-FU-resistant cells given the mechanism of action of the drug. Interestingly, insulin signaling was also altered following 5-FU treatment in both sensitive and resistant settings. Previous studies have demonstrated that insulin signaling has an important role in colorectal cancer progression [33, 34]. Dallas et al. demonstrated that colorectal cancer cells that are resistant to 5-FU and oxaliplatin, by repeated exposure to drug, are more responsive to IGF-1R inhibition than the parental cells, suggesting that insulin signaling is deregulated during the process of acquiring drug resistance. There are a number of reasons that can account for the observed differences in pathway identification between the two platforms. Firstly, in terms of the 'complete' probeset analysis, the Colorectal DSA detected more probesets and also more differentially expressed probesets than the Plus2.0 array. More importantly, in terms of those probesets that are unique to each array platform, our analysis suggested that the Plus2.0 array detected more probesets than the Colorectal DSA.
In terms of pathway analysis we are interested in specific genes, so when we assessed the percentage of probesets that coded for a single gene name, we found that the Colorectal DSA identified many more individual genes than the Plus2.0 array, which identified multiple probesets that coded for the same gene name. Overall, this suggests that the Colorectal DSA was identifying more differentially expressed 'unique' genes than the Plus2.0 array and this accounts for the observed differences in pathway identification between the two array platforms.
We also wanted to examine the microarray-specific content of the Colorectal DSA, which was not present on the Plus2.0 array. We found that approximately 50% of the Colorectal DSA-specific probesets are in the antisense orientation, which is much higher than expected. Upon further examination of the microarray-specific probesets, we demonstrated that some are expressed in either the sense or antisense orientations only, while a portion (up to 8.9%) are detected in sense:antisense (SAS) pairs. Recently, the publication of the ENCODE pilot project, which aimed to provide a detailed characterization of 1% of the human genome, demonstrated that there is a much higher level of transcription than originally thought and this includes the generation of a high number of non-protein encoding transcripts. In addition, the literature suggests that approximately 20% of human protein-encoding genes have an associated natural antisense transcript (NAT); however, recent studies suggest that this figure could be much higher [23, 37–40]. NATs can be divided into either cis-acting or trans-acting in nature. Cis-acting NATs are transcribed from the opposing DNA strand at the same genomic locus, while trans-acting NATs are transcribed from separate loci. The cis-NATs can also be further categorized according to their relative orientation and degree of overlap, either 5' to 5' (head to head), 3' to 3' (tail to tail) or fully overlapping [37, 41]. NATs have been proposed to regulate the expression of their target genes at several levels, but as yet no experimental data has been provided to assign a definite function to NATs. However, some studies using RT-PCR, northern blotting or microarray profiling have validated the expression of antisense transcripts [23, 38, 39, 42]. Interestingly, some SAS pairs are flanked by the same transcription factor binding sites, suggesting that the SAS pairs may be co-regulated.
Analysis has demonstrated that SAS pairs can display concordant expression patterns, or discordant expression patterns. In addition, studies have demonstrated that targeting an antisense transcript using a siRNA approach can alter the levels of the sense transcript, by either up-regulating sense transcription or down-regulating sense transcription [40, 43], so the results are not always as expected. However, the same studies have demonstrated that alterations of the sense transcript do not affect the antisense expression levels [40, 43].
As previously described, the functional role of these antisense transcripts is currently unknown, but they have been implicated in transcriptional and translational interference, RNA masking, dsRNA-dependent mechanisms, alternative splicing, stability, cellular transport and chromatin remodeling [37, 40, 41, 44]. However, the functional relevance of antisense transcripts is something that is now commonly accepted [45–47]. Studies have demonstrated that long antisense transcripts function as epigenetic regulators of transcription in human cells. In addition, studies that have validated the functional relevance of antisense transcripts suggest that they are not a uniform group of regulatory RNAs, but rather that they carry out a wide variety of biological roles. The utility of a transcriptome-based approach has been demonstrated in the detection of these non-coding antisense transcripts, as this information could be important when examining pathway regulation. Further examination of these NATs may answer a number of important questions, such as why, when an upstream regulator of a pathway is highly up-regulated at the mRNA level, we do not see downstream mediators up-regulated, or why the changes observed at the RNA level do not always correlate with protein expression. Obviously, a great deal of experimental work would need to take place to assess whether NATs do play a role in gene regulation, but if, as we suspect, at least some do, we need to examine not only the sense transcripts but also the antisense transcripts at the same time to get a true view of what is happening in the cell, for example, following drug treatment.
We further examined the 45 SAS pairs that were detected as either present or marginal in the 5-FU sensitive experiment; we decided not to include a fold change filter at this stage as it is not necessary to have both the sense and the antisense transcript altered to a certain level to see a functional effect. For example, the antisense may be up-regulated, which leads to the suppression of the sense, resulting in no change in the sense probeset. Overall, when we examined the intensities/expression of the probesets contained within the SAS pairs it was found that ~50% displayed similar intensities, therefore displaying no differential intensities between sense and antisense probesets. However, ~50% displayed discordant or differential intensities; this group of SAS pairs may therefore be the most functionally relevant, although this will require more experimental testing. Gene ontology analysis demonstrated that these SAS pairs were involved in diverse biological processes, with the most statistically robust involved in oxidative phosphorylation, JAK-STAT signaling, phosphorylation, metabolism, cell death and splicing. We further chose two SAS pairs to examine at the sequence level: SOCS6 and IGF2BP2. Sequence alignment demonstrated that the full length SOCS6 transcript aligned exactly with the SOCS6 gene on the forward strand of chromosome 18. In addition, the full length antisense transcript aligned to the reverse strand of chromosome 18 and demonstrated good tail to tail sequence overlap with the full length sense sequence and the SOCS6 gene. In terms of IGF2BP2, the full length sense sequence aligned completely with the IGF2BP2 gene on the reverse strand of chromosome 3. The full length antisense sequence aligned to the forward strand of chromosome 3 and again demonstrated good tail to tail overlap with the full length sense sequence and the IGF2BP2 gene.
The sequence alignment results demonstrate that the SAS pairs show good overlap in sequence and appear to be cis-NATs that are transcribed from the opposing DNA strand in the same genomic locus. Numerous novel SAS pairs have previously been identified on DSA microarrays and their existence validated with alternative technologies including strand-specific RT-PCR. Functional relevance has also been suggested through analysis of SAS pair expression patterns. Full characterization of the IGF2BP2 and SOCS6 antisense transcripts will require further work, which forms the basis of future studies; however, inspection of the sequences with the Ensembl Human Genome Browser supports their existence. Extensive EST evidence exists and appears to suggest a regular exonic structure. Numerous currently unclassified regulatory elements also occur in the region surrounding the sequences. Since both the EST sequencing used in DSA design and the experimental labelling process are polyA-based, it would suggest that the transcripts are polyadenylated, but since the ESTs represent only a fragment of the full transcript, analysis of precise polyA signal location and constitution (i.e. canonical or non-canonical) is difficult.
To investigate the clinical relevance of SAS pairs we utilized microarray data generated from pre-treatment (irinotecan/5-FU) metastatic colorectal biopsies with full response data. Following detection filtering we demonstrated that 8 SAS pairs existed (4.8% of total antisense and 3% of total sense probesets). In addition, we demonstrated that 3 SAS pairs existed following detection plus differential expression filtering (4.5% total antisense and 3.4% total sense probesets). Upon examination of the probesets in the sense orientation, antisense orientation and those existing in SAS pairs between in vitro experiments and clinical experiments, the results demonstrate that there is a high percentage of sense, antisense and SAS pairs that exist between in vitro and clinical samples. The clinical experiments generated fewer sense, antisense and SAS pairs than the in vitro experiments, however, a high percentage of those detected in the clinical experiment were also detected in the in vitro experiments. Taken together, these results suggest that in vitro experiments do highlight potentially clinically relevant information; however, these types of analysis would require further independent validation. These in vitro and clinical analyses demonstrate in this disease setting that potentially up to 8.9% of all probesets could exist in SAS pairs; currently there is little investigation to the functional role that these SAS pairs may play. Interestingly, one SAS pair, IGF2BP2, was found to be common between the in vitro and the clinical analysis. IGF2BP2 has been demonstrated to regulate translation of IGF2 by binding to its 5'UTR . In addition, IGF2 is known to be overexpressed in cancer [50, 51] and specifically, insulin signaling has been demonstrated to play a role in colorectal cancer [35, 52–55]. 
Given the results from the pathway analysis also identifying the significance of insulin signaling, further experimental investigation into the identified SAS pairs, in particular IGFBP2, should discover if some or all have functional relevance in this disease setting and whether they are disease-specific or have more widespread effects. The focus of future studies examining the SAS pairs identified from this study will also include questions such as what is their exact function within the cell, are they all functioning in the same way in this disease setting or is it dependent on the specific SAS pair.
One limitation of this analysis is that we compared the power of the two microarray platforms using data generated from a single 5-FU-sensitive and -resistant cell line model. The main focus of the study was to directly compare the data generated from the two microarray platforms based on detected transcripts and pathways, and for this a single model cell line would be appropriate. However, a secondary aim was to assess the biological relevance of the colorectal transcriptome and compare this to a generic genomic approach. In this respect, the use of a number of CRC cell line models would have given greater insight into the power of such an approach, as the problem of tissue homogeneity would have been addressed to some degree. It is widely accepted that cell line models are not very representative of the primary tumour. To address this to some extent, we took the unique biological information generated using the colorectal transcriptome-based approach, the SAS pairs, and assessed whether these occurred in metastatic (liver) CRC patient biopsies. The cell line models identified 45 SAS pairs, whereas fewer SAS pairs, 8 in total, were detected in the data generated from the clinical biopsies. When we compared the SAS pairs from the cell lines and patient biopsies, we found that 7 were in common; therefore ~87% of the clinical SAS pairs were also contained within the cell line SAS pairs list. This would suggest that many of the cell line SAS pairs are lost in the clinical samples, probably due to the homogeneity of the cell line model, and that those that occur in the clinical samples may be the most biologically relevant; however, further analysis of these SAS pairs would be required.
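The overlap figure quoted above (7 of 8 clinical SAS pairs also present among the 45 cell line SAS pairs) can be reproduced with a simple set comparison. The following is only a minimal sketch: the pair identifiers are hypothetical placeholders, not the pairs reported in the study, and the helper `overlap_fraction` is an illustrative name rather than part of the original analysis.

```python
def overlap_fraction(reference, query):
    """Return the fraction of items in `query` that are also in `reference`."""
    query = set(query)
    if not query:
        raise ValueError("query set must not be empty")
    return len(query & set(reference)) / len(query)


# Hypothetical placeholder identifiers (NOT the real SAS pairs from the study):
# 45 pairs from the cell line model, 8 from the clinical biopsies, 7 shared.
cell_line_pairs = {f"pair_{i:02d}" for i in range(1, 46)}
clinical_pairs = {f"pair_{i:02d}" for i in range(1, 8)} | {"clinical_only_pair"}

shared_pairs = cell_line_pairs & clinical_pairs
fraction = overlap_fraction(cell_line_pairs, clinical_pairs)

print(f"{len(shared_pairs)} of {len(clinical_pairs)} clinical pairs shared "
      f"({fraction:.1%})")
```

With these counts the fraction is 7/8 = 0.875, in line with the approximate figure given in the text. The comparison is deliberately one-directional: it asks what share of the clinical pairs is recovered in the cell line list, not the reverse.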
In conclusion, we have carried out transcriptional profiling using the Plus2.0 array and the Colorectal DSA and compared their overall performance. We observed that the transcriptome-based Colorectal DSA outperformed the genome-based Plus2.0 array, as demonstrated by the detection and differential expression of the entire microarray content. This study has demonstrated that the strength of a disease-specific transcriptome-based approach lies in the amount of biologically relevant information gained, as noted from the pathway analysis. When analyzing the results from the Colorectal DSA, a number of pathways (cell cycle, insulin signaling, purine metabolism and pyrimidine metabolism) were highlighted as important regulators of drug response and drug resistance; these were not identified using the Plus2.0 array. In addition, the novel biologically relevant information gained from the Colorectal DSA contained a number of antisense probesets that exist in SAS pairs, including IGF2BP2, again highlighting the potential importance of insulin signaling, which was also highlighted by pathway analysis. It is currently unclear what the functionality of the identified NATs is, but the literature suggests that they may be involved in diverse gene regulatory mechanisms. However, the numbers of antisense probesets detected and differentially expressed by the Colorectal DSA suggest that these may be very important regulatory transcripts. Finally, if this disease-specific transcriptome-based approach had not been utilized in this setting, important biologically relevant information, including the regulation of SAS pairs, could potentially have been overlooked.
The following abbreviations were used throughout the text:
DSA: Disease specific array
NAT: Natural antisense transcript
We would like to acknowledge funding from Cancer Research UK (C212/A7402) and Staff Training and Development Unit (STDU), Queen's University Belfast.
- Douillard JY, Cunningham D, Roth AD, Navarro M, James RD, Karasek P, Jandik P, Iveson T, Carmichael J, Alakl M, et al: Irinotecan combined with fluorouracil compared with fluorouracil alone as first-line treatment for metastatic colorectal cancer: a multicentre randomised trial. Lancet. 2000, 355 (9209): 1041-1047. 10.1016/S0140-6736(00)02034-1.
- Giacchetti S, Perpoint B, Zidani R, Le Bail N, Faggiuolo R, Focan C, Chollet P, Llory JF, Letourneau Y, Coudert B, et al: Phase III multicenter randomized trial of oxaliplatin added to chronomodulated fluorouracil-leucovorin as first-line treatment of metastatic colorectal cancer. J Clin Oncol. 2000, 18 (1): 136-147.
- Salonga D, Danenberg KD, Johnson M, Metzger R, Groshen S, Tsao-Wei DD, Lenz HJ, Leichman CG, Leichman L, Diasio RB, et al: Colorectal tumors responding to 5-fluorouracil have low gene expression levels of dihydropyrimidine dehydrogenase, thymidylate synthase, and thymidine phosphorylase. Clin Cancer Res. 2000, 6 (4): 1322-1327.
- Johnston SJ, Ridge SA, Cassidy J, McLeod HL: Regulation of dihydropyrimidine dehydrogenase in colorectal cancer. Clin Cancer Res. 1999, 5 (9): 2566-2570.
- Wei X, McLeod HL, McMurrough J, Gonzalez FJ, Fernandez-Salguero P: Molecular basis of the human dihydropyrimidine dehydrogenase deficiency and 5-fluorouracil toxicity. J Clin Invest. 1996, 98 (3): 610-615. 10.1172/JCI118830.
- Schneider HB, Becker H: Dehydropyrimidine dehydrogenase deficiency in a cancer patient undergoing 5-fluorouracil chemotherapy. Anticancer Res. 2004, 24 (2C): 1091-1092.
- Matsuyama R, Togo S, Shimizu D, Momiyama N, Ishikawa T, Ichikawa Y, Endo I, Kunisaki C, Suzuki H, Hayasizaki Y, et al: Predicting 5-fluorouracil chemosensitivity of liver metastases from colorectal cancer using primary tumor specimens: three-gene expression model predicts clinical response. Int J Cancer. 2006, 119 (2): 406-413. 10.1002/ijc.21843.
- Vallbohmer D, Iqbal S, Yang DY, Rhodes KE, Zhang W, Gordon M, Fazzone W, Schultheis AM, Sherrod AE, Danenberg KD, et al: Molecular determinants of irinotecan efficacy. Int J Cancer. 2006, 119 (10): 2435-2442. 10.1002/ijc.22129.
- Ichikawa W, Uetake H, Shirota Y, Yamada H, Nishi N, Nihei Z, Sugihara K, Hirayama R: Combination of dihydropyrimidine dehydrogenase and thymidylate synthase gene expressions in primary tumors as predictive parameters for the efficacy of fluoropyrimidine-based chemotherapy for metastatic colorectal cancer. Clin Cancer Res. 2003, 9 (2): 786-791.
- Kornmann M, Schwabe W, Sander S, Kron M, Strater J, Polat S, Kettner E, Weiser HF, Baumann W, Schramm H, et al: Thymidylate synthase and dihydropyrimidine dehydrogenase mRNA expression levels: predictors for survival in colorectal cancer patients receiving adjuvant 5-fluorouracil. Clin Cancer Res. 2003, 9 (11): 4116-4124.
- Lenz HJ, Hayashi K, Salonga D, Danenberg KD, Danenberg PV, Metzger R, Banerjee D, Bertino JR, Groshen S, Leichman LP, et al: p53 point mutations and thymidylate synthase messenger RNA levels in disseminated colorectal cancer: an analysis of response and survival. Clin Cancer Res. 1998, 4 (5): 1243-1250.
- Longley DB, Boyer J, Allen WL, Latif T, Ferguson PR, Maxwell PJ, McDermott U, Lynch M, Harkin DP, Johnston PG: The role of thymidylate synthase induction in modulating p53-regulated gene expression in response to 5-fluorouracil and antifolates. Cancer Res. 2002, 62 (9): 2644-2649.
- Bunz F, Hwang PM, Torrance C, Waldman T, Zhang Y, Dillehay L, Williams J, Lengauer C, Kinzler KW, Vogelstein B: Disruption of p53 in human cancer cells alters the responses to therapeutic agents. J Clin Invest. 1999, 104 (3): 263-269. 10.1172/JCI6863.
- Liang JT, Huang KC, Cheng YM, Hsu HC, Cheng AL, Hsu CH, Yeh KH, Wang SM, Chang KJ: P53 overexpression predicts poor chemosensitivity to high-dose 5-fluorouracil plus leucovorin chemotherapy for stage IV colorectal cancers after palliative bowel resection. Int J Cancer. 2002, 97 (4): 451-457. 10.1002/ijc.1637.
- Paradiso A, Simone G, Petroni S, Leone B, Vallejo C, Lacava J, Romero A, Machiavelli M, De Lena M, Allegra CJ, et al: Thymidilate synthase and p53 primary tumour expression as predictive factors for advanced colorectal cancer patients. Br J Cancer. 2000, 82 (3): 560-567. 10.1054/bjoc.1999.0964.
- Brett MC, Pickard M, Green B, Howel-Evans A, Smith D, Kinsella A, Poston G: p53 protein overexpression and response to biomodulated 5-fluorouracil chemotherapy in patients with advanced colorectal cancer. Eur J Surg Oncol. 1996, 22 (2): 182-185. 10.1016/S0748-7983(96)90827-6.
- Barrier A, Boelle PY, Roser F, Gregg J, Tse C, Brault D, Lacaine F, Houry S, Huguier M, Franc B, et al: Stage II colon cancer prognosis prediction by tumor gene expression profiling. J Clin Oncol. 2006, 24 (29): 4685-4691. 10.1200/JCO.2005.05.0229.
- Del Rio M, Molina F, Bascoul-Mollevi C, Copois V, Bibeau F, Chalbos P, Bareil C, Kramar A, Salvetat N, Fraslon C, et al: Gene expression signature in advanced colorectal cancer patients select drugs and response for the use of leucovorin, fluorouracil, and irinotecan. J Clin Oncol. 2007, 25 (7): 773-780. 10.1200/JCO.2006.07.4187.
- Wang Y, Jatkoe T, Zhang Y, Mutch MG, Talantov D, Jiang J, McLeod HL, Atkins D: Gene expression profiles and molecular markers to predict recurrence of Dukes' B colon cancer. J Clin Oncol. 2004, 22 (9): 1564-1571. 10.1200/JCO.2004.08.186.
- Johnston PG, Lenz HJ, Leichman CG, Danenberg KD, Allegra CJ, Danenberg PV, Leichman L: Thymidylate synthase gene and protein expression correlate and are associated with response to 5-fluorouracil in human colorectal and gastric tumors. Cancer Res. 1995, 55 (7): 1407-1412.
- Shirota Y, Stoehlmacher J, Brabender J, Xiong YP, Uetake H, Danenberg KD, Groshen S, Tsao-Wei DD, Danenberg PV, Lenz HJ: ERCC1 and thymidylate synthase mRNA levels predict survival for colorectal cancer patients receiving combination oxaliplatin and fluorouracil chemotherapy. J Clin Oncol. 2001, 19 (23): 4298-4304.
- The ENCODE (ENCyclopedia Of DNA Elements) Project. Science. 2004, 306 (5696): 636-640. 10.1126/science.1105136.
- Chen J, Sun M, Kent WJ, Huang X, Xie H, Wang W, Zhou G, Shi RZ, Rowley JD: Over 20% of human transcripts might form sense-antisense pairs. Nucleic Acids Res. 2004, 32 (16): 4812-4820. 10.1093/nar/gkh818.
- Boyer J, Allen WL, McLean EG, Wilson PM, McCulla A, Moore S, Longley DB, Caldas C, Johnston PG: Pharmacogenomic identification of novel determinants of response to chemotherapy in colon cancer. Cancer Res. 2006, 66 (5): 2765-2777. 10.1158/0008-5472.CAN-05-2693.
- Boyer J, McLean E, Aroori S, Wilson P, McCulla A, Carey P, Longley D, Johnston P: Characterization of p53 wild-type and null isogenic colorectal cancer cell lines resistant to 5-fluorouracil, oxaliplatin, and irinotecan. Clin Cancer Res. 2004, 10 (6): 2158-2167. 10.1158/1078-0432.CCR-03-0362.
- Allen WL, Coyle VM, Jithesh PV, Proutski I, Stevenson L, Fenning C, Longley DB, Wilson RH, Gordon M, Lenz HJ, et al: Clinical determinants of response to irinotecan-based therapy derived from cell line models. Clin Cancer Res. 2008, 14 (20): 6647-6655. 10.1158/1078-0432.CCR-08-0452.
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410.
- R Development Core Team: R: A language and environment for statistical computing. 2008, Vienna, Austria: R Foundation for Statistical Computing.
- Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, et al: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5 (10): R80. 10.1186/gb-2004-5-10-r80.
- Flicek P, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, et al: Ensembl 2008. Nucleic Acids Res. 2008, 36 (Database issue): D707-714.
- Burge C, Karlin S: Prediction of complete gene structures in human genomic DNA. J Mol Biol. 1997, 268 (1): 78-94. 10.1006/jmbi.1997.0951.
- Tanney A, Oliver GR, Farztdinov V, Kennedy RD, Mulligan JM, Fulton CE, Farragher SM, Field JK, Johnston PG, Harkin DP, et al: Generation of a non-small cell lung cancer transcriptome microarray. BMC Med Genomics. 2008, 1: 20. 10.1186/1755-8794-1-20.
- Reinmuth N, Fan F, Liu W, Parikh AA, Stoeltzing O, Jung YD, Bucana CD, Radinsky R, Gallick GE, Ellis LM: Impact of insulin-like growth factor receptor-I function on angiogenesis, growth, and metastasis of colon cancer. Lab Invest. 2002, 82 (10): 1377-1389.
- Reinmuth N, Liu W, Fan F, Jung YD, Ahmad SA, Stoeltzing O, Bucana CD, Radinsky R, Ellis LM: Blockade of insulin-like growth factor I receptor function inhibits growth and angiogenesis of colon cancer. Clin Cancer Res. 2002, 8 (10): 3259-3269.
- Dallas NA, Xia L, Fan F, Gray MJ, Gaur P, van Buren G, Samuel S, Kim MP, Lim SJ, Ellis LM: Chemoresistant colorectal cancer cells, the cancer stem cell phenotype, and increased sensitivity to insulin-like growth factor-I receptor inhibition. Cancer Res. 2009, 69 (5): 1951-1957. 10.1158/0008-5472.CAN-08-2023.
- Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, et al: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007, 447 (7146): 799-816. 10.1038/nature05874.
- Katayama S, Tomaru Y, Kasukawa T, Waki K, Nakanishi M, Nakamura M, Nishida H, Yap CC, Suzuki M, Kawai J, et al: Antisense transcription in the mammalian transcriptome. Science. 2005, 309 (5740): 1564-1566. 10.1126/science.1112009.
- Yelin R, Dahary D, Sorek R, Levanon EY, Goldstein O, Shoshan A, Diber A, Biton S, Tamir Y, Khosravi R, et al: Widespread occurrence of antisense transcription in the human genome. Nat Biotechnol. 2003, 21 (4): 379-386. 10.1038/nbt808.
- Vallon-Christersson J, Staaf J, Kvist A, Medstrand P, Borg A, Rovira C: Non-coding antisense transcription detected by conventional and single-stranded cDNA microarray. BMC Genomics. 2007, 8: 295. 10.1186/1471-2164-8-295.
- Wahlestedt C: Natural antisense and noncoding RNA transcripts as potential drug targets. Drug Discov Today. 2006, 11 (11-12): 503-508. 10.1016/j.drudis.2006.04.013.
- Lapidot M, Pilpel Y: Genome-wide natural antisense transcription: coupling its regulation to its different regulatory mechanisms. EMBO Rep. 2006, 7 (12): 1216-1222. 10.1038/sj.embor.7400857.
- Rosok O, Sioud M: Systematic identification of sense-antisense transcripts in mammalian cells. Nat Biotechnol. 2004, 22 (1): 104-108. 10.1038/nbt925.
- Faghihi MA, Wahlestedt C: RNA interference is not involved in natural antisense mediated regulation of gene expression in mammals. Genome Biol. 2006, 7 (5): R38. 10.1186/gb-2006-7-5-r38.
- Galante PA, Vidal DO, de Souza JE, Camargo AA, de Souza SJ: Sense-antisense pairs in mammals: functional and evolutionary considerations. Genome Biol. 2007, 8 (3): R40. 10.1186/gb-2007-8-3-r40.
- Morrissy AS, Morin RD, Delaney A, Zeng T, McDonald H, Jones S, Zhao Y, Hirst M, Marra MA: Next-generation tag sequencing for cancer gene expression profiling. Genome Res. 2009, 19 (10): 1825-1835. 10.1101/gr.094482.109.
- Morris KV: Long antisense non-coding RNAs function to direct epigenetic complexes that regulate transcription in human cells. Epigenetics. 2009, 4 (5): 296-301. 10.4161/epi.4.5.9282.
- Faghihi MA, Wahlestedt C: Regulatory roles of natural antisense transcripts. Nat Rev Mol Cell Biol. 2009, 10 (9): 637-643. 10.1038/nrm2738.
- Grigoriadis A, Oliver GR, Tanney A, Kendrick H, Smalley MJ, Jat P, Neville AM: Identification of differentially expressed sense and antisense transcript pairs in breast epithelial tissues. BMC Genomics. 2009, 10 (1): 324. 10.1186/1471-2164-10-324.
- Nielsen J, Christiansen J, Lykke-Andersen J, Johnsen AH, Wewer UM, Nielsen FC: A family of insulin-like growth factor II mRNA-binding proteins represses translation in late development. Mol Cell Biol. 1999, 19 (2): 1262-1270.
- Cariani E, Lasserre C, Seurin D, Hamelin B, Kemeny F, Franco D, Czech MP, Ullrich A, Brechot C: Differential expression of insulin-like growth factor II mRNA in human primary liver cancers, benign liver tumors, and liver cirrhosis. Cancer Res. 1988, 48 (23): 6844-6849.
- Reeve AE, Eccles MR, Wilkins RJ, Bell GI, Millow LJ: Expression of insulin-like growth factor-II transcripts in Wilms' tumour. Nature. 1985, 317 (6034): 258-260. 10.1038/317258a0.
- Donovan EA, Kummar S: Role of insulin-like growth factor-1R system in colorectal carcinogenesis. Crit Rev Oncol Hematol. 2008, 66 (2): 91-98. 10.1016/j.critrevonc.2007.09.003.
- Hewish M, Chau I, Cunningham D: Insulin-like growth factor 1 receptor targeted therapeutics: novel compounds and novel treatment strategies for cancer medicine. Recent Pat Anticancer Drug Discov. 2009, 4 (1): 54-72. 10.2174/157489209787002515.
- Kaulfuss S, Burfeind P, Gaedcke J, Scharf JG: Dual silencing of insulin-like growth factor-I receptor and epidermal growth factor receptor in colorectal cancer cells is associated with decreased proliferation and enhanced apoptosis. Mol Cancer Ther. 2009, 8 (4): 821-833. 10.1158/1535-7163.MCT-09-0058.
- Min Y, Adachi Y, Yamamoto H, Imsumran A, Arimura Y, Endo T, Hinoda Y, Lee CT, Nadaf S, Carbone DP, et al: Insulin-like growth factor I receptor blockade enhances chemotherapy and radiation responses and inhibits tumour growth in human gastric cancer xenografts. Gut. 2005, 54 (5): 591-600. 10.1136/gut.2004.048926.
- The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2407/10/687/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.