Anticancer drug clustering in lung cancer based on gene expression profiles and sensitivity database
© Gemma et al; licensee BioMed Central Ltd. 2006
Received: 20 November 2005
Accepted: 30 June 2006
Published: 30 June 2006
The effect of current therapies in improving the survival of lung cancer patients remains far from satisfactory. It is consequently desirable to find more appropriate therapeutic opportunities based on informed insights. A molecular pharmacological analysis was undertaken to design an improved chemotherapeutic strategy for advanced lung cancer.
We related the cytotoxic activity of each of commonly used anti-cancer agents (docetaxel, paclitaxel, gemcitabine, vinorelbine, 5-FU, SN38, cisplatin (CDDP), and carboplatin (CBDCA)) to corresponding expression pattern in each of the cell lines using a modified NCI program.
We performed gene expression analysis in lung cancer cell lines using cDNA filter and high-density oligonucleotide arrays. We also examined the sensitivity of these cell lines to these drugs via MTT assay. To obtain our reproducible gene-drug sensitivity correlation data, we separately analyzed two sets of lung cancer cell lines, namely 10 and 19. In our gene-drug correlation analyses, gemcitabine consistently belonged to an isolated cluster in a reproducible fashion. On the other hand, docetaxel, paclitaxel, 5-FU, SN-38, CBDCA and CDDP were gathered together into one large cluster.
These results suggest that chemotherapy regimens including gemcitabine should be evaluated in second-line chemotherapy in cases where the first-line chemotherapy did not include this drug. Gene expression-drug sensitivity correlations, as provided by the NCI program, may yield improved therapeutic options for treatment of specific tumor types.
While various anti-cancer drugs have been developed, many patients with solid tumors still exhibit poor prognosis. Accordingly, it is now important to determine the appropriate use of such drugs clinically. With respect to treatment of lung cancer, there are many anti-cancer agents in use, such as cisplatin (CDDP), carboplatin (CBDCA), docetaxel, paclitaxel, vinorelbine, gemicitabine, 5-fluorouracil (5-FU), CPT-11, etc. A number of combination therapy regimens employing platinum compounds have proven to be effective and are widely applied to initial treatment for unresected non-small cell lung cancer (NSCLC). In addition, docetaxel and pemetrexed have been reported to be effective in the context of second-line chemotherapy for NSCLC[3, 4]. However, at present, the effect of these therapies on improving patient survival remains far from satisfactory [1–3]. It is consequently desirable to find more appropriate therapeutic opportunities based on informed insights. With the recent near-completion of the human genome sequence, genome-wide gene expression profiling through both cDNA and oligonucleotide arrays has been greatly facilitated [5–7]. There are many reports associated with isolation of molecules involved in drug sensitivity [8–10]. Of particular relevance was the use of DNA array-based methodology by the National Cancer Institute (NCI) to assess the gene expression profiles of 60 human cancer cell lines of diverse tissue origin (NCI60 set), with a view to determining associations with the extensive drug sensitivity data accumulated on this cell line cohort so far. The NCI60 gene expression study was analogous in some respects to assessment of clinical tumors for markers that predict sensitivity to therapy. The essential aim of this study was to utilize similar advanced gene expression profiling technologies and drug sensitivity assays to aid in the selection of appropriate drug combinations for the treatment of lung cancer. We performed gene expression analysis in lung cancer cell lines using cDNA filter and high-density oligonucleotide arrays. We also examined the sensitivity of these cell lines to commonly used anti-cancer agents (docetaxel, paclitaxel, gemcitabine, vinorelbine, 5-FU, SN38, cisplatin (CDDP), and carboplatin (CBDCA)) via MTT assay. We related the cytotoxic activity of each of these agents to the corresponding expression pattern in each of the cell lines using a modified NCI program. To obtain our reproducible gene-drug sensitivity correlation data, we separately analyzed two sets of lung cancer cell lines, namely 10 and 19.
Clustering on the basis of drug activity and gene expression patterns
We analyzed the expression profiles and sensitivity to anti-cancer drugs of separate two sets of lung cancer cell lines. The first set consisted of PC9, PC7, PC14, A549, Lu65, LK2, H69, N231, Lu135, and SBC3 (Set 1). The second consisted of RERF-LC-KJ, RERF-LC-MS, RERF-LC-AI, PC1, PC3, PC6, PC10, Lu130, Lu139, Lu165, ABC-1, EBC-1, LC2/ad, LC1/sq, LC-1F, SQ5, QG56, MS-1, and SBC5 (Set 2). The PC1, PC3, PC6, PC7, PC9, PC10, PC14, and QG56 cell lines were obtained from IBL (Gumma, Japan). The A549, NCI-H69, and NCI-N231 cell lines were obtained from the American Type Culture Collection (Rockville, MD). The Lu65 and Lu135 cell lines were provided by Y. Shimosato and T. Terasaki (National Cancer Center Research Institute, Tokyo, Japan). The LK-2 and SBC-3 cell lines were obtained from the Health Science Research Resources Bank (Osaka, Japan). PC1, PC3, PC6, PC10, Lu130, Lu139, and Lu165 cell lines were provided by S. Hirohashi (National Cancer Center Research Institute, Tokyo, Japan). RERF-LC-KJ, LC2/ad, SQ5, LC1/sq, LC-1F, RERF-LC-AI, and MS-1 cell lines were obtained from the RIKEN Cell Bank (Ibaraki, Japan). RERF-LC-MS, EBC-1, SBC5, and ABC-1 cell lines were purchased from the Health Science Research Resources Bank (Osaka, Japan). PC7, PC9, PC14, A549, Lu65, RERF-LC-KJ, RERF-LC-MS, PC3, ABC-1, and LC2/ad are adenocarcinoma cell lines. LK-2, RERF-LC-AI, PC1, PC10, EBC-1, LC1/sq, LC-1F, SQ5, and QG56 are squamous cell cancer cell lines. NCI-H69, NCI-N231, Lu135, SBC-3, PC6, Lu130, Lu139, Lu165, MS-1, and SBC5 are small cell lung cancer cell lines.
Assay for drug activity
Estimation of cytotoxicity in the above-mentioned cell types was mediated by a rapid colorimetric assay for mitochondrial dehydrogenase activity, as previously described. Briefly, cells were seeded into 12-well plates (Falcon, Lincoln Park, NJ). Following 24 hr exposure to particular anti-cancer agents, the cells were washed twice and incubated for a further 24 hr in drug-free medium. Subsequently, the cells were incubated with 0.5 mg/mL 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl-tetrazolium bromide (MTT) for 4 hr. The blue formazan crystals, formed by viable cells, were solubilized by the addition of 10% n-dodecylsulfate sodium salt (SDS) in 0.01N HCl followed by overnight incubation. Samples were then subjected to spectrophotometric analysis at 560 nm (Ultraspec 4050; LKB, Bromma, Sweden).
RNA isolation, cDNA array hybridization and analysis of hybridization signals
Total RNA was isolated from each cell line using standard protocols described previously[14, 15]. To avoid variations due to cell culture conditions, we cultured each untreated cell line separately in 6 different flasks. mRNA was then purified from total RNA by incubation with oligo-dT-magnetic beads (Toyobo Co., Osaka, Japan). The ElectorGene Array System (GeneticLab. Co., Ltd. Sapporo, Japan) was used for filter-based cDNA array analysis, as previously reported. Thirteen hundred species of human DNA fragments are spotted in duplicate on a filter. The genes represented on this filter included cancer-related and drug resistance-associated genes, as well as housekeeping genes and non-mammalian genes as negative controls. To prepare the probes, reverse transcription was performed using Reverse Transcriptase, ReverTraAce (Toyobo Co., Osaka, Japan), together with a random 9 mer (Toyobo Co., Osaka, Japan) as the primer and 5 μg of polyA RNA. The probes were labelled with biotin by incorporation of biotin-16-deoxyuracil triphosphate (dUTP) during the synthesis of cDNA. The filters were pre-incubated in 20 ml of PerfectHyb (Toyobo Co., Osaka, Japan) at 68°C for 30 min. The biotin-labeled probes were denatured and added to the pre-hybridization solution. The filters were incubated overnight at 68°C in the hybridization mixture. After washing, specific signals on the filters were detected by the Imaging High – Chemilumi – Detection kit (Toyobo Co., Osaka, Japan). CDP-Star substrate (Tropix, Bedford, MA) was used as the chemiluminescence substrate. A chemiluminescence image of the filter was acquired by Fluor-S (Bio-Rad, Hercules, CA). The gene expression images were quantified by measuring the intensity of the signals using Imagene (BioDiscovery, Los Angeles, CA). The signal intensity among filters was analyzed by ElectorGene Finding System (GeneticLab, Sapporo, Japan). The background threshold was set at a level of 3-fold higher than the negative control. Signal intensities were normalized by comparison with the average values of all probe. We also performed high-density oligonucleotide array analysis using Affymetrix GeneChip technology (Affymetrix, Santa Clara, CA). This oligonucleotide microarray contains 22,282 transcripts (HG-U133A, Affymetrix, Santa Clara, CA). Total RNA was used to synthesize double-strand cDNA with ReverTraAce and a T7-(dT)24 primer (Metabion, Germany). Then, biotinylated cRNA was synthesized from the double-stranded cDNA using the RNA Transcript Labeling kit (Enzo Life Sciences, Farmingdale, NY) and was purified and fragmented. The fragmented cRNA was hybridized to the oligonucleotide microarray, which was washed and stained with streptavidine-phycoerythrin. Scanning was performed with an Agilent Microarray Scanner (Agilent Technologies, Palo Alto, CA). GeneChip analysis was performed based on the Affymetrix GeneChip Mannual (Affymetrix Inc., Santa Clara, CA) with Microarray Analysis Suite (MAS) 5.0, Data Mining Tool (DMT) 2.0, and Microarray Database software. The data we generated by GeneChip was deposited in Gene Expression Omnibus (GEO)(GEO accession: GSE4127)(17).
We performed data cleansing for filter arrays as follows. Firstly, the gene expression matrix [T] was scaled by using the average of all probe sets. Each of the filter arrays contained three spots of negative control (pUC), so we figured out their average signal value M. We defined 3 M as the threshold value, and transformed the numerical signal values < 3 M to "Nan" (not a number). After omitting the rows holding "Nan" more than one, we selected 600 genes for this analysis. Data analysis for the correlation coefficients that related the drug activity patterns to the expression patterns of the genes was principally performed by a modified NCI program. The symbol [A] (GI50) refers to the drug activity matrix in which the rows represent the anti-cancer drugs and the columns represent the human lung cancer cell lines. The symbol [T] (gene expression) refers to the gene expression matrix in which the rows represent individual genes and the columns represent the cell lines. In order to analyze the relationship between gene expression and drug activity, we generated the gene-drug correlation matrix [AT] (correlation coefficient) in which the rows represent the genes and the columns represent the drugs. Firstly, we subtracted its mean value from the matrix [A] in the direction of row and columns for a pre-treatment. Secondly, we normalized each element in the matrix [A] by subtracting its row-wise mean and dividing by its row-wise standard deviation; normalized [T] was generated in a similar way. Finally, we took the inner product of the matrix [A] and the transpose of the matrix [T]. The resulting matrix [AT] implied the Pearson correlation coefficients ( 1) that reflected the relationship between drug activity and gene expression.
Hierarchical clustering helps to comprehend a characteristic of huge volumes of data. With cluster analysis, the elements are divided into groups that show similar patterns by calculating the distances between their respective rows and columns. The AT-clustered image map (CIM), indicating the correlation coefficients between gene expression and drug sensitivity in the 10 human lung cancer cell lines, was obtained by the linkage-average clustering method, also known as UPGMA (un-weighted pair-group method using arithmetic average). The statistical algorithms and the graphical outputs described here were implemented in MATLAB 6.5 Release 13 (the MathWorks, Inc., US).
Clustering on the basis of drug activity and gene expression patterns
Growth inhibitory activities (GI50)(μg/ml) of various anti-cancer agents against 10 human lung cancer cell lines – Set 1
Here, we used a DNA array-based gene expression profiling approach, together with assessment of the cytotoxic activity of several widely applied anti-cancer agents, in two collections of human lung cancer cell lines. In particular, we related gene expression and drug sensitivity patterns in these cell lines. According to our separate two combined cytotoxicity and transcriptomic analyses, gemcitabine belonged to an isolated cluster. These results would suggest that combination chemotherapy regimens including gemcitabine could be a candidate for initial treatment, because combinations of drugs belonging to different clusters could expand the spectrum of the chemotherapy. Gemcitabine was deemed from our studies to be a good candidate for the treatment of recurrent or refractory NSCLC. Recently, an in silico search was performed to identify genes whose expression was positively or negatively correlated with sensitivity to four platinum compounds (CDDP, CBDCA, oxaliplatin and tetraplatin); the publicly available databases of the National Cancer Institute (NCI)(21) were used for this purpose. CDDP, CBDCA, oxaliplatin and tetraplatin are platinum-based compounds that are classically thought to have a similar spectrum of activities, allowing for one agent to be substituted for the other. Important similarities were noticed between CDDP and CBDCA on one hand, and tetraplatin and oxaliplatin on the other hand. The gene-drug correlations using NCI program in these study may be a valuable tool for the identification of determinants of anticancer drug activity in tumors and for the design of cancer chemotherapy.
Vekris et al. described several limitations to the type of study that they have developed. 1. The evaluation of gene expression was performed on a subset of 1416 genes and molecular markers. 2. The level of expression of the 1416 molecular markers was determined with a technique that was still under development and not fully validated. 3. The criterion for drug cytotoxicity that has been retained by the NCI is the 50% growth inhibitory concentration (GI50) rather than the overall number of cells killed. There were several differences between the study of Vekris et al. and ours, most notably with respect to the cell lines analyzed in each case. Our study focused on lung cancer cell lines, whereas Vekris et al. utilized available information on the NCI60 set, a wide range of cancer cell lines. The evaluation of gene expression by Vekris et al. was performed on a subset of 1,416 genes, which represents a relatively small fraction of the total transcriptome. We analyzed gene expression using two different DNA array formats, namely spotted filter (data not shown) and genome-wide GeneGhip arrays, with similar results being obtained. In addition, we separately analyzed two sets of lung cancer cell lines, 10 and 19 lines to obtain our reproducible gene-drug sensitivity correlation data.
Using cDNA array technique and clinical response data, it is sometimes difficult to consistently reproduce gene-drug sensitivity correlation data. These data were often influenced by sampling methods, sample preservation status, tumor size, tumor environment status including tumor vessels and inflammation, etc. In the study of Vekris et al. and ours, these influences were small because cancer cell lines were used. However, cell lines differ from tumor cells and should therefore be considered as surrogates that may contain information on the molecular cell biology and molecular pharmacology of cancer.
In the treatment of lung cancer, a number of combination therapy regimens employing platinum compounds have proven to be effective and are widely applied as first-line treatment for unresected NSCLC; for example, CDDP + docetaxel, CBDCA + paclitaxel, CDDP + gemcitabine, CDDP + CPT-11, CDDP + paclitaxel, CDDP + vinorelbine, etc. In addition, docetaxel and pemetrexed have been reported to be effective in the context of second-line chemotherapy for NSCLC[3, 4]. However, how were the anti-cancer agents in these reports selected? It is consequently desirable to find more appropriate therapeutic opportunities based on informed insights.
The results of our molecular pharmacological analysis suggest that chemotherapy regimens including gemcitabine should be evaluated in second-line chemotherapy if the initial chemotherapy does not include this drugs. A total design approach to cancer chemotherapy through the gene-drug correlations using NCI program may yield improved therapeutic options.
Supported in part by a Grant-in-Aid from the Ministry of Education, Culture, Sports, Science, and Technology of Japan and Japan Society for the Promotion of Science. There are not any financial or other interests with regard to the submitted manuscript that might be construed as a conflict of interest.
- Schiller JH, Harrington D, Belani CP, Langer C, Sandler A, Krook J, Zhu J, Johnson DH: Comparison of four chemotherapy regimens for advanced non-small-cell lung cancer. N Engl J Med. 2002, 346: 92-98. 10.1056/NEJMoa011954.View ArticlePubMedGoogle Scholar
- Bonomi P, Kim K, Fairclough D, Cella D, Kugler J, Rowinsky E, Jiroutek M, Johnson D: Comparison of survival and quality of life in advanced non-small-cell lung cancer patients treated with two dose levels of paclitaxel combined with cisplatin versus etoposide with cisplatin: Results of an Eastern Cooperative Oncology Group trial. J Clin Oncol. 2000, 18: 623-631.PubMedGoogle Scholar
- Shepherd FA, Dancey J, Ramlau R, Mattson K, Gralla R, O'Rourke M, Levitan N, Gressot L, Vincent M, Burkes R, Coughlin S, Kim Y, Berille J: Prospective randomized trial of docetaxel versus best supportive care in patients with non-small-cell lung cancer previously treated with platinum-based chemotherapy. J Clin Oncol. 2000, 18: 2095-2103.PubMedGoogle Scholar
- Smit EF, Mattson K, von Pawel J, Manegold C, Clarke S, Postmus PE: ALIMTA (pemetrexed disodium) as second-line treatment of non-small-cell lung cancer: a phase II study. Ann Oncol. 2003, 14: 455-460. 10.1093/annonc/mdg099.View ArticlePubMedGoogle Scholar
- Eisen MB, Brown PO: DNA arrays for analysis of gene expression. Methods. 1999, 303: 179-205.Google Scholar
- Ross DT, Scherf U, Eisen MB, Perou CM, Rees C, Spellman P, Iyer V, Jeffrey SS, Van de Rijn M, Waltham M, Pergamenschikov A, Lee JC, Lashkari D, Shalon D, Myers TG, Weinstein JN, Botstein D, Brown PO: Systematic variation in gene expression patterns in human cancer cell lines. Nat Genet. 2000, 24: 227-35. 10.1038/73432.View ArticlePubMedGoogle Scholar
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.View ArticlePubMedPubMed CentralGoogle Scholar
- Kihara C, Tsunoda T, Tanaka T, Yamana H, Furukawa Y, Ono K, Kitahara O, Zembutsu H, Yanagawa R, Hirata K, Takagi T, Nakamura Y: Prediction of sensitivity of esophageal tumors to adjuvant chemotherapy by cDNA microarray analysis of gene-expression profiles. Cancer Res. 2001, 61: 6474-6479.PubMedGoogle Scholar
- Chang JC, Wooten EC, Tsimelzon A, Hilsenbeck SG, Gutierrez MC, Elledge R, Mohsin S, Osborne CK, Chamness GC, Allred DC, O'Connell P: Gene expression profiling for the prediction of therapeutic response to docetaxel in patients with breast cancer. Lancet. 2003, 362 (9381): 362-369. 10.1016/S0140-6736(03)14023-8.View ArticlePubMedGoogle Scholar
- Mariadason JM, Arango D, Shi Q, Wilson AJ, Corner GA, Nicholas C, Aranes MJ, Lesser M, Schwartz EL, Augenlicht LH: Gene expression profiling-based prediction of response of colon carcinoma cells to 5-fluorouracil and camptothecin. Cancer Res. 2003, 63 (24): 8791-8812.PubMedGoogle Scholar
- Scherf U, Ross DT, Waltham M, Smith LH, Lee JK, Tanabe L, Kohn KW, Reinhold WC, Myers TG, Andrews DT, Scudiero DA, Eisen MB, Sausville EA, Pommier Y, Botstein D, Brown PO, Weinstein JN: A gene expression database for the molecular pharmacology of cancer. Nat Genet. 2000, 24: 236-244. 10.1038/73439.View ArticlePubMedGoogle Scholar
- Gemma A, Seike M, Seike Y, Uematsu K, Hibino S, Kurimoto F, Yoshimura A, Shibuya M, Harris CC, Kudoh S: Somatic mutation of the hBUB1 mitotic checkpoint gene in primary lung cancer. Genes Chromosomes Cancer. 2000, 29: 213-218. 10.1002/1098-2264(2000)9999:9999<::AID-GCC1027>3.0.CO;2-G.View ArticlePubMedGoogle Scholar
- Kobayashi K, Kudoh S, Takemoto T, Hino M, Hayashihara K, Nakahiro K, Ando M, Niitani H: In vitro investigation of a combination of two drugs, cisplatin and carboplatin, as a function of the area under the c/t curve. J Cancer Res Clin Oncol. 1995, 121: 715-720. 10.1007/BF01213317.View ArticlePubMedGoogle Scholar
- Gemma A, Hagiwara K, Vincent F, Hancock AR, Nagashima M, Bennett WP, Harris CC: hSmad5 gene, a human hSmad family member: its full length cDNA, genomic structure, promoter region and mutation analysis in human tumors. Oncogene. 1998, 16: 951-956. 10.1038/sj.onc.1201614.View ArticlePubMedGoogle Scholar
- Gemma A, Takenoshita S, Hagiwara K, Okamoto A, Spillare EA, McMemamin MG, Hussain SP, Forrester K, Zariwala M, Xiong Y, Harris CC: Molecular analysis of the cyclin dependent kinase inhibitor genes p15INK4B/MTS2, p16 INK4/MTS1, p18 and p19 in human cancer cell lines. Int J Cancer. 1996, 68: 605-611. 10.1002/(SICI)1097-0215(19961127)68:5<605::AID-IJC9>3.0.CO;2-2.View ArticlePubMedGoogle Scholar
- Gemma A, Takenaka K, Hosoya Y, Matuda K, Seike M, Kurimoto F, Ono Y, Uematsu K, Takeda Y, Hibino S, Yoshimura A, Shibuya M, Kudoh S: Altered expression of several genes in highly metastatic subpopulations of a human pulmonary adenocarcinoma cell line. Eur J Cancer. 2001, 37: 1554-1561. 10.1016/S0959-8049(01)00154-X.View ArticlePubMedGoogle Scholar
- Website title. [http://www.ncbi.nlm.nih.gov/projects/geo]
- Ko J, Lee YH, Hwang SY, Lee YS, Shin SM, Hwang JH, Kim J, Kim YW, Jang SW, Ryoo ZY, Kim IK, Namkoong SE, Kim JW: Identification and differential expression of novel human cervical cancer oncogene HCCR-2 in human cancers and its involvement in p53 stabilization. Oncogene. 2003, 22: 4679-4689. 10.1038/sj.onc.1206624.View ArticlePubMedGoogle Scholar
- Wu Q, Xu M, Cheng C, Zhou Z, Huang Y, Zhao W, Zeng L, Xu J, Fu X, Ying K, Xie Y, Mao Y: Molecular cloning and characterization of a novel Dehydrogenase/reductase (SDR family) member 1 genea from human fetal brain. Mol Biol Rep. 2001, 28: 193-198. 10.1023/A:1015722001960.View ArticlePubMedGoogle Scholar
- Son YS, Park JH, Kang YK, Park JS, Choi HS, Lim JY, Lee JE, Lee JB, Ko MS, Kim YS, Ko JH, Yoon HS, Lee KW, Seong RH, Moon SY, Ryu CJ, Hong HJ: Heat shock 70-kDa protein 8 isoform 1 is expressed on the surface of human embryonic stem cells and downregulated upon differentiation. Stem Cells. 2005, 23: 1502-1513. 10.1634/stemcells.2004-0307.View ArticlePubMedGoogle Scholar
- Website title. [http://dtp.nci.nih.gov]
- Vekris A, Meynard D, Haaz MC, Bayssas M, Bonnet J, Robert J: Molecular determinants of the cytotoxicity of platinum compounds: the contribution of in silico research. Cancer Res. 2004, 64: 356-362. 10.1158/0008-5472.CAN-03-2258.View ArticlePubMedGoogle Scholar
- Klastersky J, Sculier JP, Lacroix H, Dabouis G, Bureau G, Libert P, Richez M, Ravez P, Vandermoten G, Thiriaux J: A randomized study comparing cisplatin or carboplatin with etoposide in patients with advanced non-small-cell lung cancer: European Organization for Research and Treatment of Cancer Protocol 07861. J Clin Oncol. 1990, 8: 1556-1562.PubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2407/6/174/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.