We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
The impact of pharmacokinetic gene profiles across human cancers
BMC Cancervolume 18, Article number: 577 (2018)
The Correction to this article has been published in BMC Cancer 2018 18:743
The right drug to the right patient at the right time is one of the ideals of Individualized Medicine (IM) and remains one of the most compelling promises of the post-genomic age. The addition of genomic information is expected to increase the precision of an individual patient’s treatment, resulting in improved outcomes. While pilot studies have been encouraging, key aspects of interpreting tumor genomics information, such as somatic activation of drug transport or metabolism, have not been systematically evaluated.
In this work, we developed a simple rule-based approach to classify the therapies administered to each patient from The Cancer Genome Atlas PanCancer dataset (n = 2858) as effective or ineffective. Our Therapy Efficacy model used each patient’s drug target and pharmacokinetic (PK) gene expression profile; the specific genes considered for each patient depended on the therapies they received. Patients who received predictably ineffective therapies were considered at high-risk of cancer-related mortality and those who did not receive ineffective therapies were considered at low-risk. The utility of our Therapy Efficacy model was assessed using per-cancer and pan-cancer differential survival.
Our simple rule-based Therapy Efficacy model classified 143 (5%) patients as high-risk. High-risk patients had age ranges comparable to low-risk patients of the same cancer type and tended to be later stage and higher grade (odds ratios of 1.6 and 1.4, respectively). A significant pan-cancer association was identified between predictions of our Therapy Efficacy model and poorer overall survival (hazard ratio, HR = 1.47, p = 6.3 × 10− 3). Individually, drug export (HR = 1.49, p = 4.70 × 10− 3) and drug metabolism (HR = 1.73, p = 9.30 × 10− 5) genes demonstrated significant survival associations. Survival associations for target gene expression are mechanism-dependent. Similar results were observed for event-free survival.
While the resolution of clinical information within the dataset is not ideal, and modeling the relative contribution of each gene to the activity of each therapy remains a challenge, our approach demonstrates that somatic PK alterations should be integrated into the interpretation of somatic transcriptomic profiles as they likely have a significant impact on the survival of specific patients. We believe that this approach will aid the prospective design of personalized therapeutic strategies.
The right drug to the right patient at the right time is one of the promises of Precision or Individualized Medicine (IM) . The integration of genetic information and targeted therapies into patient care is expected to increase the precision of individual patient’s treatment, resulting in improved outcomes [2,3,4]. Previous work has reported the successful use of genomics information to discover causal or driver mutations , guide treatment [6,7,8,9] in oncology, and investigated the feasibility of relating somatic alterations to known druggable targets [5, 10,11,12,13,14,15]. These studies mostly focused on drug target identification. However, there are additional factors that influence the fate of drug therapies. Pharmacokinetic (PK) genes mediate transport of drugs into and out of cells and their metabolic processing. For example, PK genes can be activated within tumor tissues and thereby compromise the efficacy of an administered therapy independent of the target. While interest in PK genes has recently increased for interpreting the effects of epigenetic and copy number alterations in recurrent ovarian cancer [16, 17], systematic pan-cancer investigation of the prevalence and impact of PK alterations using patient-level data has not been performed, despite being a well-understood mechanism.
The basic mechanisms of PK genes have been studied extensively , as have PK gene alterations’ impact on drug response [19,20,21,22]. It has been shown that specific mechanisms of PK activation are associated with resistant cancer cell lines and patient tissues [23,24,25]. Moreover, cancer cell lines may increase expression of drug metabolism genes active against administered therapies . As a specific example for a commonly administered chemotherapy, the efficacy of 5-florouracil (5-FU) has been shown to be diminished by activation of the gene DPD, which degrades 5-FU , by altered expression levels of export genes , or by inhibited expression of metabolically activating enzymes , while greater efficacy is observed with genomic deletions of DPD . Thus, the systematic integration of PK knowledge, which we defined as the known relationships between genes and therapies, into the interpretation of individual patient’s tumor genomics may improve IM in oncology.
In this study, we investigated the potential contribution of PK knowledge, specifically the individualized prediction of drug efficacy, to the interpretation of tumor gene expression from The Cancer Genome Atlas (TCGA). We focused on therapies administered early in each patient’s course of treatment, because TCGA data was derived from untreated primary tumors. We assessed patient data from multiple tumor types (pan-cancer). First, we developed a simple rule-based Therapy Efficacy model (TEM) using a small number of features that correspond to well-established PK mechanistic processes [31,32,33]. We used the TEM to evaluate the therapies administered to each patient, accounting for their corresponding transcriptomics data. Therapies were predicted to be effective or ineffective. Next, we used survival analysis to validate predictions made by our model. To perform survival analysis, we first defined two groups of patients. Patients who received a predicted ineffective therapy were considered to be at higher risk of cancer-related mortality, due to the lack of expected drug efficacy. Those who did not receive a predicted ineffective therapy were considered at lower risk of cancer-related mortality. We refer to these groups as high-risk and low-risk, respectively. We hypothesized that high-risk patients would experience poorer therapy efficacy and thereby shorter survival compared to low-risk patients. Therefore, we validated our TEM predictions using differential overall and event-free survival. Our TEM categorized 5% of the patients as high-risk who exhibit poorer survival compared to those who were low-risk. Our data indicated that the altered PK gene expression does not significantly impact the majority of cancer patients in this cohort. However, altered PK gene expression has significant impact on patients with these alterations and therefore could be taken into account when designing personalized treatment strategies.
Data and normalization
Tumor genomic and patient datasets
Data released during the Pan-Cancer initiative [34, 35] was downloaded from synapse.org. We used normalized gene expression from syn1695384. We refer to each cancer type by TCGA abbreviation. Patient’s clinical characteristics, follow-up, and treatment administration data were gathered from the public access portal (https://tcga-data.nci.nih.gov/tcga/dataAccessMatrix.htm). According to TCGA policies, genomic data were collected from pre-treatment tissues, limiting our knowledge of post-treatment genomics. No embargoed or limited access data was used. See Supplemental Text for further detail on how clinical covariates were coded. Patients with verified missing data (e.g. no administration times for the therapies administered) were excluded from our analysis.
Gene expression normalization
In order to quantify the level of gene expression in each tumor as high or low, we utilized a normal-tissue reference (Fig. 1). We considered the differential expression of an individual tumor sample compared to the expression level seen in a collection of normal-tissue samples. In order to achieve a more comprehensive (applicable to all cancer types) yet conservative estimate of aberrant gene expression, all normal-tissue samples within the dataset (n = 295) were combined to generate composite-tissue gene expression reference ranges. Tumor gene expression levels were transformed to their Z-score by median-centering each gene and normalizing by its median absolute difference , using the composite-tissue reference ranges. As a balance between the number of affected samples and effect size, unless otherwise stated, we used a threshold of Z ≥ 2 to define a “high” and Z ≤ − 2 a “low” gene expression level .
Gene-drug relationships for administered therapies were extracted from CPIC guidelines, PharmGKB pathways , and DrugBank [39, 40]. We reviewed the literature supporting each relationship in order to determine the genes most likely to be dominant for each effect.
Therapy administration information from TCGA could not be used without additional processing since drugs were referenced by generic name, brand name, or abbreviations. Therapy names were manually corrected for spelling mistakes and normalized to their US generic name. When a therapy refers to a combination of drugs, the therapy name was de-convoluted into a normalized set of therapy names. For instance, “Macdonald” was converted to fluorouracil, leucovorin and radiotherapy. Our manual review process leveraged many sources including: uptodate.com, chemocare.com, chemoregimen.com, dtp.nci.nih.gov, DGIdb , DrugBank , NCCN guidelines [41,42,43,44], RxNorm [45, 46], and the NCI Thesaurus. 
Defining eligible therapies for evaluation
Because TCGA samples are untreated, we focused on first-line therapies. NCCN guidelines for multiple cancer types were manually reviewed to identify common administration schedules for standard-of-care chemotherapies. After this review, we determined that any group of therapies initiated within the first 1.5 years after diagnosis was a reasonable pan-cancer approximation to first-line therapies, but will likely miss cases that experienced early disease progression. Of note, the majority of therapies administered after 1.5 years are after each patient’s event-free interval (defined below). Therefore, because our event-free survival analysis accounted for changes in disease status, it did not evaluate therapies administered after disease progression. Patients were considered low-risk until they received a predicted ineffective therapy, according to our model. When they did, their status was updated to high-risk.
It has been shown that early response to treatment can predict long-term outcomes [48,49,50,51,52]. Therefore, it is likely that the effects of early ineffective therapies will be evident, on average, in long-term outcomes data. Previous studies have demonstrated that tumor evolution is associated with chemotherapy resistance [53,54,55,56]. Our eligibility criteria have the added benefit of minimizing the potential effects of treatment-induced tumor evolution on our analysis. Determination of the most informative duration of pre-treatment genomics data for predicting therapy efficacy is beyond the scope of the current study.
Deriving the therapy efficacy model
Defining tumor genomics features
We developed our Therapy Efficacy model (TEM) using a simple rule-based approach that leveraged each patient’s tumor genomics features. Three features were considered, each hypothesized to negatively affect drug efficacy: one feature for drug transport, one for drug metabolism, and one for drug targets. Specifically: 1) low gene expression of drug import genes that bring the therapy into tumor cells or high expression of drug export genes that pump the therapy out of tumor cells (e.g. ABCC1 exports paclitaxel from tumor cells); 2) high expression of genes known to metabolically degrade the therapy, or, when pro-drug activation occurs within target cells, low expression of pro-drug activating genes (e.g. CYP3A4 breaks down paclitaxel within tumor cells); 3) low expression of target genes (Fig. 2; e.g. paclitaxel binds to tubulin proteins). We list additional and specific drug-gene examples that were important for our TEM, in our results.
Defining rules from tumor genomic features
Each tumor genomics feature corresponds to a different molecular mechanism (transport, metabolism, and target) and could be activated independently of the others, for a particular patient. Therefore, the assessment of each feature individually and their combinations is of interest. For individual-feature rules, we classified a therapy as ineffective if any gene within the patient’s tumor genomics profile fit the features’ criteria as described above. We considered three rules that utilized combinations of features. The first combination was an “Any-Hit” rule which was a simple Boolean combination of all individual-feature rules. The second combination was an “Any-PK-Hit” rule where a therapy is predicted to be ineffective if any gene met the criteria for either of the two PK features. Finally, we considered the features from statistically significant rules (see below for details) and combined them to generate our TEM. Our TEM is therefore derived from an empirical and iterative process from a small number of well-established features. Our TEM used two features: high expression of any exporter gene or high expression of at least two drug metabolism genes.
Validating the therapy efficacy model using survival analysis
Classifying patients using rules from tumor genomic features
Patients who received a predicted ineffective therapy were considered high-risk, while patients who did not receive a predicted ineffective therapy were considered low-risk. We compared the overall and event-free survival of high-risk patients to low-risk patients to validate the predictions of tumor genomics rules, including our Therapy Efficacy model.
Survival analysis models
Using Cox regression, we first evaluated survival differences between high-risk and low-risk patients, per cancer. We computed univariate survival models that only used our patient classification, as well as multivariate survival models that adjusted for clinical covariates such as patient age and tumor stage. See Table 1 for the full list of covariates used for each cancer type. Statistical model results were summarized by hazard ratio (HR), the lower and upper bounds of the 95% confidence interval, and p-value. HRs greater than 1 indicated increased risk and values less than 1 indicated a protective association. We reported HRs of high-risk patients relative to low-risk patients. We used Kaplan-Meyer (KM) plots for visualization of differences between high- and low-risk patients.
Because we considered tumor genomic information, features may be transferrable across cancer types. However, due to many factors (including small numbers of cases for each cancer), observations of each rule’s HR may be variable. Therefore, we used random-effects meta-analysis to combine information across cancer types and derive a more robust estimate of the pan-cancer HR [57, 58]. In meta-analysis, it is assumed that each cancer type is an observation of the “true” pan-cancer HR. The number of high- and low-risk patients, the HR, and the confidence interval for the HR, from each cancer type were used in the meta-analysis. In addition to meta-analysis, we also computed a pan-cancer Cox regression, stratified by cancer type. Statistical models produced by meta-analysis and Cox regression were compared. Finally, we used 5-fold cross-validation repeated 50 times to assess variability in risk associated with each feature.
Overall survival times were calculated using the time to observed cancer-related mortality. Event-free survival was calculated using the time to any cancer-related event (cancer-related mortality, disease progression, recurrence, or beginning a new treatment) [59,60,61,62]. Subjects were censored when lost to follow-up. In all survival models, we required at least 5 patients to be in each group, at least 10 total observed events, and at least 20 total patients. Models were compared using likelihood ratio tests.
Analyses were conducted using the R programming language [59, 60, 63] (version 3.1.1), leveraging the survival (version 2.38.3), coxme (version 2.2.5), meta (version 4.3.0), and forestplot (version 1.1.0) packages.
Classification of therapies and patients using rules from tumor genomic features
Our Therapy Efficacy model classified 229 therapies (9.0%) as ineffective within the context presented by the corresponding patient’s tumor genomics profile. These therapies were administered to 143 (5.5%) patients, leading to classification of these patients as high-risk. While high-risk patients were distributed across cancer types (Table 2), a higher proportion was observed in BRCA and OV. Differences in prevalence between cancer types also corresponded with moderate differences in clinical covariates; high-risk patients trended towards higher grade, later stage, and younger age at cancer diagnosis. Further details on the cohort characteristics, number of genes annotated for each therapy, and the fraction of administered therapies affected by PK gene expression, are available in Additional file 1: Figures S1, S2.
Per-Cancer survival analysis
We utilized survival analysis comparing high- and low-risk patients to validate the predictions of our simple TEM. Survival analysis was first applied to each cancer type independently. Differential survival was assessed using the TEM classification (univariate) and with adjustment for clinical covariates (multivariate; see Table 3). For univariate analysis, only KIRC showed a statistically significant association. However, the majority of cancer types exhibited survival patterns consistent with poorer survival for high-risk patients. In multivariate analysis, four out of eight cancer types demonstrated statistically significant poorer survival for high-risk patients: GBM, BRCA, KIRC, and HNSC. Comparing univariate and multivariate models, GBM was the only cancer type for which the risk associated with the TEM significantly differed, but there were substantial clinical differences (e.g. age and radiation). While each per-cancer model identified a small number of high-risk patients, a significant association with survival was observed in multiple cancers.
Kaplan-Meyer (KM) plots were used to visualize differential survival between high- and low-risk patient cohorts (Fig. 3 and Additional file 1: Figure S3). The high- and low-risk cohorts were not balanced with respect to clinical characteristics. KM plots do not provide sufficient resolution to account for the differences in survival due to different covariate balance. Therefore, we calculated unadjusted and adjusted KM survival curves to more clearly represent the survival associations for the TEM.
Pan-Cancer survival analysis
Across all patients, we first tested single-feature rules (Table 4 and Additional file 1: Figure S4). Drug export (HR = 1.49 [1.13, 1.96], p = 4.7 × 10− 3, Fig. 4 and Additional file 1: Figure S5) and metabolism (HR = 1.73 [1.31, 2.28], p = 9.3 × 10− 5, Additional file 1: Figures. S6 and S7) both demonstrated statistically significant associations between high-risk patients and poorer survival. Neither therapy import nor target gene expression demonstrated significant survival associations.
We next constructed rules from combinations of features and tested them using survival models (Fig. 3 and Table 4). The Any-Hit model was consistent with poorer survival, but was not statistically significant. Both the Any-PK-Hit and TEM demonstrated statistically significant associations between high-risk patients and poorer survival (HR = 1.27 [0.96, 1.67], p = 9.4 × 10− 2 and 1.47 [1.11, 1.93] 6.3 × 10− 3, respectively). Associations were consistent between univariate and multivariate models. KM survival curves for the high-risk and low-risk cohorts were used to visualize these associations and are presented in Fig. 3, Additional file 1: Figures S1 and S3.
We tested if inclusion of scores from the TEM was a significant improvement. Details are presented in our Supplemental Information. Briefly, inclusion of the TEM classification was significant by the likelihood ratio test (Χ2(1) = 5.4, p = 0.02) over a multivariate model including all covariates.
Analysis of tumor genomics features
We analyzed the prevalence of each genomics feature across the cohort and investigated which specific drug-gene relationships were most commonly affected by altered PK gene expression levels for the PK genes relevant to each patient’s treatment.
Across cancer types, 4.2% (n = 109) of patients were affected by high exporter gene expression for a therapy they received. As described above, this feature was associated with increased cancer mortality risk (Table 4 and Fig. 4; HR = 1.49 [1.13, 1.96], p = 4.7 × 10− 3) and event-free survival (HR = 1.43 [1.11, 1.85], p = 5.9 × 10− 3; Additional file 1: Figure S5). Lowering the threshold used to classify gene expression as “high” lead to more patients identified at a lower HR and increasing the threshold lead to a similarly high HR. This indicated that different expression levels of drug export genes may indicate different levels of impact on drug efficacy.
The poorer survival associated with high expression of drug export genes was influenced by many individual drug-gene relationships. The most recurrent alteration observed was elevation of ABCC1 expression in 48 patients receiving doxorubicin and/or paclitaxel from across six cancer types. The next most recurrent alterations were observed for three further independent groups of patients exhibited high expression of ABCC2 (17 patients), ABCG2 (12 patients), and ABCC10 (8 patients). While these patients have high expression of one of these four genes, their treatment regimen may differ, impacting different therapies as ineffective across patients. Our model is able to integrate across these multiple drug-gene relationships. Specific patient cases are highlighted in Table 5. Thus, increased exporter gene expression is a pan-cancer feature affecting a modest fraction of cancer patients, but which may convey a significant impact on drug efficacy for individual patients.
Across cancer types, 7.6% (n = 216) of patients were affected by increased gene expression of any drug metabolism gene active against the therapies they received, while 1.7% (n = 49) of patients were affected by concurrent high gene expression of at least two drug metabolism genes. Activation of any drug metabolism gene was consistent with poorer overall (HR = 1.27 [0.94, 1.71], p = 0.12, Additional file 1: Figure S6) and event-free survival (HR = 1.32 [0.93, 1.88], p = 0.11, Additional file 1: Figure S7). However, high expression of at least two drug metabolism genes was associated with poorer overall survival (HR = 1.73 [1.31, 2.28], p = 9.3 × 10− 5, Table 4). Varying the threshold for calling gene expression “high,” lead to an increased HR. This suggests that different expression levels of drug metabolism genes may indicate different levels of impact on drug efficacy.
Many individual relationships contributed to the association between administered therapies and the drug metabolism genes that degrade them within cancer cells. For example, we identified 41 patients in two cancer types receiving tamoxifen where the genes UGT1A10, UGT2B15, or CYP2D6 were highly expressed. These genes are well established components of tamoxifen’s metabolic pathway . Across 5 cancer types, 109 patients who received paclitaxel exhibited activation of NR1I2, many with concurrent activation of CYP2C8 or CYP3A4 – known members of paclitaxel’s metabolic pathway . Further relationships are identified including high expression of the gene NQO1 in 33 patients receiving doxorubicin and the gene NT5C in 17 patients receiving gemcitabine. These and other relationships between administered therapies and the genes that metabolize them within tumor cells contribute to the survival association implicated by our pan-cancer survival model.
Across cancer types, 241 (8.4%) of patients were affected by low gene expression of any drug target gene for their administered chemotherapy, while 2.7% (n = 78) of patients were affected by concurrently low gene expression of at least two drug targets. We tested the association between survival and low drug target gene expression for administered therapies, but found no overall statistical association, either for one gene (HR = 0.99, p = 0.92) or two genes (HR = 1.12, p = 0.56). However, specific therapies did exhibit statistically significant associations with overall survival. The direction of association was mechanism-dependent (Fig. 5). Among the most frequent drug target associations were with MAPT, MAP2, and MAP4 for patients receiving paclitaxel or docetaxel (Table 5). In total, 110 patients across 6 cancer types were affected by this mechanism. These three MAP proteins associate with microtubules and function to stabilize them in a phosphorylation-dependent manner [65, 66]. Increased MAP4 expression has been previously shown to enhance sensitivity to paclitaxel . Thus, our analysis has recapitulated this known mechanism and demonstrated its pan-cancer prevalence. Another frequent alteration was low expression of growth factor receptors for patients receiving therapies that target them. For example, eight patients across four cancer types were identified with low expression of EGFR when receiving cetuximab or lapatinib, or of ERBB2 when receiving erlotinib. Additional individual patient examples included 14 patients receiving kinase inhibitors for which the primary target (KIT, RET, MTOR) was lowly expressed and 12 patients receiving tamoxifen when ESR2 was lowly expressed. These cases demonstrate the potential for individualized tumor genomics to inform the expected efficacy of administered therapies.
Platinum-based therapies are cytotoxic agents that nonspecifically target cellular DNA and their efficacy has been previously associated with the expression level of multiple genes including HMGB1 [68, 69], a critical chromatin modifier. HMGB1 has been shown to recognize and bind to platinum-induced damage [70, 71] and affect repair . We investigated the association of HMGB1 expression levels in OV and identified a negative association with overall survival (Additional file 1: Figure S8; HR = 2.07 [1.13, 3.79]; p = 1.6 × 10− 2). Thus, our analysis supports the mechanism of HMGB1 counteracting the action of platinum therapies.
We used repeated 5-fold cross-validation to assess variability in outcome associations. Therapy export was included in all cross-validated models with a similar effect size to our overall model (median HR = 1.47 ± 0.12). Therapy metabolism was included in 64% of models and both it and our TEM showed consistent effect sizes (median HR = 1.70 ± 0.13 and 1.52 ± 0.11, respectively). These results suggest that combinations of PK features, expected a priori to complement one another [31,32,33], validate from a data-driven perspective and our model is robust to variation in cohort.
Interactions between concurrently administered therapies
In our analysis, the association between expression of metabolic genes and survival in BRCA (Fig. 6) was the opposite as expected and for which was observed in other cancer types. For this reason, BRCA was excluded from the analysis of therapy metabolism presented above. We believe that this observation can be interpreted by considering the other concurrently administered therapies (in the same regimen). In BRCA, patients are administered a combination of anti-hormone therapy, typically CYP19A1 (aromatase) inhibitors, and cytotoxic chemotherapy, usually taxanes. Taxanes such as paclitaxel are metabolized by CYP19A1 and this relationship is representative of the majority of drug metabolism cases identified in BRCA. We believe the protective effect associated with drug metabolism in BRCA is conferred by anti-hormone therapy activity, rather than metabolic activity of CYP19A1 on taxanes. Analogous trends can be seen for CYP3A4. Specific roles for CYP3A4 and CYP19A1 in breast cancer therapy response have been reviewed [73, 74], but only preliminary considerations have been made for modeling the concurrent and opposing activities of these enzymes on therapies within the same treatment regimen. These types of interrelationships between multiple therapies and genes will require further development so that genomic effects on concurrently administered drugs may be properly interpreted.
Individual patient cases exhibiting multiple alterations
The associations between PK gene expression features and survival, presented above, defined a cohort of high-risk patients. These patients’ pre-treatment tumors exhibited features likely to indicate intrinsic resistance to the therapies they received. We summarize in Table 5, selected cases per cancer type where we believe prospective PK considerations during the interpretation of tumor genomics could have been of clinical benefit. Cases were selected to represent a diversity of clinical presentations. For example, we identified gene expression changes that indicated poor chemotherapy efficacy in 6 BRCA patients. These example patients had different hormone receptor status and therefore received differing anti-hormone therapies. However, we identified transcriptomic alterations that potentially affected the chemotherapies administered to each of these patients. The full list of cases is available in Additional file 2: Table S1. These observed alterations require additional investigation. However, we believe they demonstrate that PK considerations will be a critical component of interpreting tumor genomics for IM in oncology.
In this study, we have demonstrated that tumor genomics can be used to inform a deeper understanding of therapy efficacy. We used patient-level transcriptomic profiles to predict if administered therapies were rendered ineffective by somatic PK gene expression. Our Therapy Efficacy model (TEM) could be used to help select drugs that are the most appropriate for a patient, or used to identify patients that are less likely to respond to a therapy. In this study, we assumed that patients who are less likely to respond to their therapies would be at higher risk of cancer-related mortality. Therefore, we used survival analysis to validate predictions made by our TEM. While the level of detail in clinical annotations and follow-up time in TCGA data are not ideal, we believe they are sufficient to identify trends in overall and event-free survival as indirect measurements of therapy efficacy. The genomics data used was derived from pre-treatment tissue. It is expected that the occurrence of PK gene expression alterations will be higher for post-treatment samples due to therapy-induced selection. Scenarios where these assumptions do not apply represent limitations to our current study. While we did not detect altered somatic PK gene expression for the majority of patients, when present, it was significantly associated with poorer survival. We therefore believe that the features leveraged by our TEM will be important for interpreting tumor genomics profiles for individual patients.
The TEM is based on a small set of well-established PK features. They were individually tested using cross-validation which showed consistency in outcome associations. As the number of patients affected by each rule is relatively small, each fold may have too few cases to generate a robust estimate. Inclusion of the PK features we have considered in cross-validated models at comparable effect sizes to our overall model indicates a robust association for these rare events. As cancer genomics datasets increase, the number of cases with altered somatic PK gene expression is also likely to increase. Additionally, recurrent disease tends to be more resistant to baseline therapies, making adaptation through altered PK genes more likely. Therefore, assessing therapy efficacy using PK features such as we have considered, will be increasingly important.
The strongest PK features by survival analysis were high expression of genes that export or metabolize administered therapies. Influx genes were considered, but no significant association with survival was identified. We believe the difference in cancer mortality risk (quantified by HRs) between therapy importers and exporters can be simply explained by the inherent nature of the two systems; activation of a single exporter may convey sufficient efflux to impair the efficacy of a therapy, while inactivation of all importer genes may be required to sufficiently block influx of a therapy. Associations with target-gene expression depended on each target’s mechanism-of-action. Importantly, the risk estimates we have identified are comparable in incidence and magnitude to well-established germline mutations in cancer susceptibility genes  and to those observed for canonical cancer driver mutations . Thus, we have established that specific PK mechanisms may be altered within human tumors and that a subset of them may lead to decreased therapy efficacy, independent of the drug target. Additionally, these features can be identified in pre-treatment tumor data and could be used to generate individualized predictions of therapy efficacy.
In predicting if a patient was high- or low-risk using our TEM, we assumed that receiving any ineffective therapy would negatively affect long-term survival outcomes. Treatment regimen are diverse [76,77,78] and have often been developed by balancing efficacy and toxicity, and relying on different individual therapies that have synergistic or complementary effects. Accounting for the relative contribution of each therapy within a regimen is left for future study. Further, the relative impact of altered PK gene expression on each therapy within a regimen could differ and may depend on each therapy’s mechanism-of-action. For example, their impact on cytotoxic agents may differ from molecularly targeted agents or hormone therapies. Additionally, contributions to PK activity from epigenetic regulation, somatic and germline mutation, and copy number variation may be integrated with gene expression levels to form a more comprehensive model of therapy efficacy. That is, we have considered the downstream readout of genomic regulation in pre-treatment samples: gene expression. However, cancer cells’ ability to resist chemotherapies may depend upon their ability to change their gene expression after treatment initiation. Their ability to change gene expression will depend on the epigenetic state of the drug’s corresponding PK genes. Additionally, somatic activating and inactivating mutations will modulate the expected positive correlation between gene expression and gene activity. Therefore, we are in the process of building a more advanced TEM using machine learning techniques that can integrate data types and weigh the relative contribution of each gene, leading to greater resolution and a more detailed interpretation of each patient’s tumor genomics data.
In this study, we have established a simple rule-based Therapy Efficacy model for interpreting if a therapy will be ineffective for a specific patient by accounting for the patient’s somatic PK transcriptomic data. We validated the utility of these predictions by demonstrating that patients who were administered predicted ineffective therapies exhibited poorer survival than patients who were not administered predicted ineffective therapies, across cancer types. Thus, we suggest that PK-based guidelines could be integrated into clinical decision making to more effectively evaluate treatment regimen for individual cancer patients. These analyses demonstrate that somatic PK activity is likely to be important for IM in oncology.
Clinical Pharmacogenetics Implementation Consortium
National Comprehensive Cancer Network
The Cancer Genome Atlas
Therapy Efficacy Model
Bielinski SJ, Olson JE, Pathak J, Weinshilboum RM, Wang L, Lyke KJ, Ryu E, Targonski PV, Van Norstrand MD, Hathcock MA, et al. Preemptive genotyping for personalized medicine: design of the right drug, right dose, right time-using genomic data to individualize treatment protocol. Mayo Clin Proc. 2014;89(1):25–33.
Pinto N, Black M, Patel K, Yoo J, Mymryk JS, Barrett JW, Nichols AC. Genomically driven precision medicine to improve outcomes in anaplastic thyroid Cancer. J Oncol. 2014;2014:7. https://doi.org/10.1155/2014/936285.
Borad MJ, Egan JB, Condjella RM, Liang WS, Fonseca R, Ritacca NR, McCullough AE, Barrett MT, Hunt KS, Champion MD, et al. Clinical implementation of integrated genomic profiling in patients with advanced cancers. Sci Rep. 2016;6(1):25.
Lazaridis KN, McAllister TM, Babovic-Vuksanovic D, Beck SA, Borad MJ, Bryce AH, Chanan-Khan AA, Ferber MJ, Fonseca R, Johnson KJ, et al. Implementing individualized medicine into the medical practice. Am J Med Genet C Semin Med Genet. 2014;166C(1):15–23.
Johnson DB, Dahlman KH, Knol J, Gilbert J, Puzanov I, Means-Powell J, Balko JM, Lovly CM, Murphy BA, Goff LW, et al. Enabling a genetically informed approach to cancer medicine: a retrospective evaluation of the impact of comprehensive tumor profiling using a targeted next-generation sequencing panel. Oncologist. 2014;19(6):616–22.
Shih AH, Abdel-Wahab O, Patel JP, Levine RL. The role of mutations in epigenetic regulators in myeloid malignancies. Nat Rev Cancer. 2012;12(9):599–612.
Piccart-Gebhart MJ, Procter M, Leyland-Jones B, Goldhirsch A, Untch M, Smith I, Gianni L, Baselga J, Bell R, Jackisch C, et al. Trastuzumab after adjuvant chemotherapy in HER2-positive breast cancer. N Engl J Med. 2005;353(16):1659–72.
Romond EH, Perez EA, Bryant J, Suman VJ, Geyer CE Jr, Davidson NE, Tan-Chiu E, Martino S, Paik S, Kaufman PA, et al. Trastuzumab plus adjuvant chemotherapy for operable HER2-positive breast cancer. N Engl J Med. 2005;353(16):1673–84.
Katsanis SH, Javitt G, Hudson K. Public health. A case study of personalized medicine. Science. 2008;320(5872):53–4.
Hopkins AL, Groom CR. The druggable genome. Nat Rev Drug Discov. 2002;1(9):727–30.
Griffith M, Griffith OL, Coffman AC, Weible JV, McMichael JF, Spies NC, Koval J, Das I, Callaway MB, Eldred JM, et al. DGIdb: mining the druggable genome. Nat Methods. 2013;10(12):1209–10.
Fang H, Gough J. The 'dnet' approach promotes emerging research on cancer patient survival. Genome Med. 2014;6(8):64.
Relling MV, Klein TE. CPIC: clinical pharmacogenetics implementation consortium of the pharmacogenomics research network. Clin Pharmacol Ther. 2011;89(3):464–7.
Green RC, Berg JS, Grody WW, Kalia SS, Korf BR, Martin CL, McGuire AL, Nussbaum RL, O'Daniel JM, Ormond KE, et al. ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing. Genet Med. 2013;15(7):565–74.
Tuna MA, Christopher I. Chapter 14: activating mutations and targeted therapy in Cancer. In: Cooper PD, editor. Mutations in human genetic disease. London: InTech; 2012.
Patch AM, Christie EL, Etemadmoghadam D, Garsed DW, George J, Fereday S, Nones K, Cowin P, Alsop K, Bailey PJ, et al. Whole-genome characterization of chemoresistant ovarian cancer. Nature. 2015;521(7553):489–94.
Vaidyanathan A, Sawers L, Gannon AL, Chakravarty P, Scott AL, Bray SE, Ferguson MJ, Smith G. ABCB1 (MDR1) induction defines a common resistance mechanism in paclitaxel- and olaparib-resistant ovarian cancer cells. Br J Cancer. 2016;115(4):431–41. Published online 2016 Jul 14. https://doi.org/10.1038/bjc.2016.203.
Bussey KJ, Chin K, Lababidi S, Reimers M, Reinhold WC, Kuo WL, Gwadry F, Ajay K-MH, Fridlyand J, et al. Integrating data on DNA copy number with gene expression levels and drug sensitivities in the NCI-60 cell line panel. Mol Cancer Ther. 2006;5(4):853–67.
Yamaguchi K, Arai Y, Kanda Y, Akagi K. Germline mutation of dihydropyrimidine dehydrogenese gene among a Japanese population in relation to toxicity to 5-fluorouracil. Jpn J Cancer Res. 2001;92(3):337–42.
Davies J, Davies D. Origins and evolution of antibiotic resistance. Microbiol Mol Biol Rev. 2010;74(3):417–33.
Dever LA, Dermody TS. Mechanisms of bacterial resistance to antibiotics. Arch Intern Med. 1991;151(5):886–95.
Atkinson AJ Jr, Kushner W. Clinical pharmacokinetics. Annu Rev Pharmacol Toxicol. 1979;19:105–27.
Housman G, Byler S, Heerboth S, Lapinska K, Longacre M, Snyder N, Sarkar S. Drug resistance in cancer: an overview. Cancers (Basel). 2014;6(3):1769–92.
Shen DW, Pouliot LM, Hall MD, Gottesman MM. Cisplatin resistance: a cellular self-defense mechanism resulting from multiple epigenetic and genetic changes. Pharmacol Rev. 2012;64(3):706–21.
AbuHammad S, Zihlif M. Gene expression alterations in doxorubicin resistant MCF7 breast cancer cell line. Genomics. 2013;101(4):213–20.
Harmsen S, Meijerman I, Beijnen JH, Schellens JH. Nuclear receptor mediated induction of cytochrome P450 3A4 by anticancer drugs: a key role for the pregnane X receptor. Cancer Chemother Pharmacol. 2009;64(1):35–43.
Mochinaga K, Tsuchiya T, Nagasaki T, Arai J, Tominaga T, Yamasaki N, Matsumoto K, Miyazaki T, Nanashima A, Hayashi T, et al. High expression of dihydropyrimidine dehydrogenase in lung adenocarcinoma is associated with mutations in epidermal growth factor receptor: implications for the treatment of non--small-cell lung cancer using 5-fluorouracil. Clin Lung Cancer. 2014;15(2):136–44. e134
Hagmann W, Jesnowski R, Faissner R, Guo C, Lohr JM. ATP-binding cassette C transporters in human pancreatic carcinoma cell lines. Upregulation in 5-fluorouracil-resistant cells. Pancreatology. 2009;9(1–2):136–44.
Ishihama H, Chida M, Araki O, Karube Y, Seki N, Tamura M, Umezu H, Honma K, Masawa N, Miyoshi S. Comparison of 5-fluorouracil-related gene expression levels between adenocarcinomas and squamous cell carcinomas of the lung. Jpn J Clin Oncol. 2009;39(1):33–6.
Gross E, Meul C, Raab S, Propping C, Avril S, Aubele M, Gkazepis A, Schuster T, Grebenchtchikov N, Schmitt M, et al. Somatic copy number changes in DPYD are associated with lower risk of recurrence in triple-negative breast cancers. Br J Cancer. 2013;109(9):2347–55.
Riddick DS, Lee C, Ramji S, Chinje EC, Cowen RL, Williams KJ, Patterson AV, Stratford IJ, Morrow CS, Townsend AJ, et al. Cancer chemotherapy and drug metabolism. Drug Metab Dispos. 2005;33(8):1083–96.
Sahoo S, Haraldsdottir HS, Fleming RM, Thiele I. Modeling the effects of commonly used drugs on human metabolism. FEBS J. 2015;282(2):297–317.
Szakacs G, Paterson JK, Ludwig JA, Booth-Genthe C, Gottesman MM. Targeting multidrug resistance in cancer. Nat Rev Drug Discov. 2006;5(3):219–34.
Kandoth C, McLellan MD, Vandin F, Ye K, Niu B, Lu C, Xie M, Zhang Q, McMichael JF, Wyczalkowski MA, et al. Mutational landscape and significance across 12 major cancer types. Nature. 2013;502(7471):333–9.
Weinstein JN, Collisson EA, Mills GB, Shaw KR, Ozenberger BA, Ellrott K, Shmulevich I, Sander C, Stuart JM. The Cancer genome atlas pan-Cancer analysis project. Nat Genet. 2013;45(10):1113–20.
Larsen RJ, Marx ML. An introduction to mathematical statistics and its applications. 4th ed. Upper Saddle River, N.J: Pearson Prentice Hall; 2006.
Parmigiani G, Garett ES, Irizarry RA, Zeger SL. The analysis of gene expression data: methods and software. New York: Springer; 2006.
Whirl-Carrillo M, McDonagh EM, Hebert JM, Gong L, Sangkuhl K, Thorn CF, Altman RB, Klein TE. Pharmacogenomics knowledge for personalized medicine. Clin Pharmacol Ther. 2012;92(4):414–7.
Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(Database issue):D1091–7.
Drug Interactions: Cytochrome P450 Drug Interaction Table. Indiana University School of Medicine. http://medicine.iupui.edu/clinpharm/ddis.
NCCN Guidelines in Oncology: Breast Cancer (Version 1.2014). http://www.nccn.org/professionals/physician_gls/pdf/breast.pdf.
NCCN Guidelines in Oncology: Central Nervous System (Version 1.2014). http://www.nccn.org/professionals/physician_gls/pdf/cns.pdf.
NCCN Guidelines in Oncology: Colon Cancer (Version 3.2014). http://www.nccn.org/professionals/physician_gls/pdf/colon.pdf.
NCCN Guidelines in Oncology: Ovarian Cancer (Version 2.2014). http://www.nccn.org/professionals/physician_gls/pdf/ovarian.pdf.
Nelson SJ, Zeng K, Kilbourne J, Powell T, Moore R. Normalized names for clinical drugs: RxNorm at 6 years. J Am Med Inform Assoc. 2011;18(4):441–8.
RxNorm [dataset on the Internet]. Release 2014_02_03. http://www.nlm.nih.gov/research/umls/rxnorm/docs/rxnormfiles.html.
de Coronado S, Wright LW, Fragoso G, Haber MW, Hahn-Dantona EA, Hartel FW, Quan SL, Safran T, Thomas N, Whiteman L. The NCI thesaurus quality assurance life cycle. J Biomed Inform. 2009;42(3):530–9.
Cremolini C, Loupakis F, Antoniotti C, Lonardi S, Masi G, Salvatore L, Cortesi E, Tomasello G, Spadi R, Zaniboni A, et al. Early tumor shrinkage and depth of response predict long-term outcome in metastatic colorectal cancer patients treated with first-line chemotherapy plus bevacizumab: results from phase III TRIBE trial by the Gruppo Oncologico del Nord Ovest. Ann Oncol. 2015;26(6):1188–94.
Huang K, Sun H, Chen Z, Li X, Wang S, Zhao X, Tang F, Jia Y, Hu T, Du X, et al. Optimal pathological response indicated better long-term outcome among patients with stage IB2 to IIB cervical cancer submitted to neoadjuvant chemotherapy. Sci Rep. 2016;6:28278.
Jain P, Kantarjian H, Nazha A, O'Brien S, Jabbour E, Romo CG, Pierce S, Cardenas-Turanzas M, Verstovsek S, Borthakur G, et al. Early responses predict better outcomes in patients with newly diagnosed chronic myeloid leukemia: results with four tyrosine kinase inhibitor modalities. Blood. 2013;121(24):4867–74.
Maas M, Nelemans PJ, Valentini V, Das P, Rodel C, Kuo LJ, Calvo FA, Garcia-Aguilar J, Glynne-Jones R, Haustermans K, et al. Long-term outcome in patients with a pathological complete response after chemoradiation for rectal cancer: a pooled analysis of individual patient data. Lancet Oncol. 2010;11(9):835–44.
Piessevaux H, Buyse M, Schlichting M, Van Cutsem E, Bokemeyer C, Heeger S, Tejpar S. Use of early tumor shrinkage to predict long-term outcome in metastatic colorectal cancer treated with cetuximab. J Clin Oncol. 2013;31(30):3764–75.
Almendro V, Cheng YK, Randles A, Itzkovitz S, Marusyk A, Ametller E, Gonzalez-Farre X, Munoz M, Russnes HG, Helland A, et al. Inference of tumor evolution during chemotherapy by computational modeling and in situ analysis of genetic and phenotypic cellular diversity. Cell Rep. 2014;6(3):514–27.
Andor N, Graham TA, Jansen M, Xia LC, Aktipis CA, Petritsch C, Ji HP, Maley CC. Pan-cancer analysis of the extent and consequences of intratumor heterogeneity. Nat Med. 2016;22(1):105–13.
Keenan T, Moy B, Mroz EA, Ross K, Niemierko A, Rocco JW, Isakoff S, Ellisen LW, Bardia A. Comparison of the genomic landscape between primary breast Cancer in African American versus white women and the Association of Racial Differences with Tumor Recurrence. J Clin Oncol. 2015;33(31):3621–7.
Mroz EA, Rocco JW. MATH, a novel measure of intratumor genetic heterogeneity, is high in poor-outcome classes of head and neck squamous cell carcinoma. Oral Oncol. 2013;49(3):211–5.
Walker E, Hernandez AV, Kattan MW. Meta-analysis: its strengths and limitations. Cleve Clin J Med. 2008;75(6):431–9.
Haidich AB. Meta-analysis in medical research. Hippokratia. 2010;14(Suppl 1):29–37.
Oncology endpoints in a changing landscape. Manag Care. 2016;11(suppl):–12. https://www.managedcaremag.com/sites/default/files/graphics/OncoEndpoints_MC.pdf.
Gourgou-Bourgade S, Cameron D, Poortmans P, Asselain B, Azria D, Cardoso F, A'Hern R, Bliss J, Bogaerts J, Bonnefoi H, et al. Guidelines for time-to-event end point definitions in breast cancer trials: results of the DATECAN initiative (definition for the assessment of time-to-event endpoints in CANcer trials)dagger. Ann Oncol. 2015;26(5):873–9.
Kogan AJ, Haren M. Translating cancer trial endpoints into the language of managed care. Biotechnol Healthc. 2008;5(1):22–35.
Pazdur R. Endpoints for assessing drug activity in clinical trials. Oncologist. 2008;13(Suppl 2):19–21.
R Core Team. R. A language and environment for statistical computing: R Foundation for Statistical Computing; 2014. http://www.R-project.org
Klein DJ, Thorn CF, Desta Z, Flockhart DA, Altman RB, Klein TE. PharmGKB summary: tamoxifen pathway, pharmacokinetics. Pharmacogenet Genomics. 2013;23(11):643–7.
Oshiro C, Marsh S, McLeod H, Carrillo MW, Klein T, Altman R. Taxane pathway. Pharmacogenet Genomics. 2009;19(12):979–83.
Yu Y, Gaillard S, Phillip JM, Huang TC, Pinto SM, Tessarollo NG, Zhang Z, Pandey A, Wirtz D, Ayhan A, et al. Inhibition of spleen tyrosine kinase potentiates paclitaxel-induced cytotoxicity in ovarian Cancer cells by stabilizing microtubules. Cancer Cell. 2015;28(1):82–96.
Zhang CC, Yang JM, White E, Murphy M, Levine A, Hait WN. The role of MAP4 expression in the sensitivity to paclitaxel and resistance to vinca alkaloids in p53 mutant cells. Oncogene. 1998;16(12):1617–24.
Wang D, Lippard SJ. Cellular processing of platinum anticancer drugs. Nat Rev Drug Discov. 2005;4(4):307–20.
He Y, Ding Y, Wang D, Zhang W, Chen W, Liu X, Qin W, Qian X, Chen H, Guo Z. HMGB1 bound to cisplatin-DNA adducts undergoes extensive acetylation and phosphorylation in vivo. Chem Sci. 2015;6(3):2074–8.
Park S, Lippard SJ. Redox state-dependent interaction of HMGB1 and cisplatin-modified DNA. Biochemistry. 2011;50(13):2567–74.
Marsh S, McLeod H, Dolan E, Shukla SJ, Rabik CA, Gong L, Hernandez-Boussard T, Lou XJ, Klein TE, Altman RB. Platinum pathway. Pharmacogenet Genomics. 2009;19(7):563–4.
Ugrinova I, Zlateva S, Pashev IG, Pasheva EA. Native HMGB1 protein inhibits repair of cisplatin-damaged nucleosomes in vitro. Int J Biochem Cell Biol. 2009;41(7):1556–62.
Philips S, Zhou J, Li Z, Skaar TC, Li L. A translational bioinformatic approach in identifying and validating an interaction between vitamin a and CYP19A1. BMC Genomics. 2015;16(Suppl 7):S17.
Westbrook K, Stearns V. Pharmacogenomics of breast cancer therapy: an update. Pharmacol Ther. 2013;139(1):1–11.
Lu C, Xie M, Wendl MC, Wang J, McLellan MD, Leiserson MD, Huang KL, Wyczalkowski MA, Jayasinghe R, Banerjee T, et al. Patterns and functional implications of rare germline variants across 12 cancer types. Nat Commun. 2015;6:10086.
Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74.
Wang L, Liu H, Chute CG, Zhu Q. Cancer based pharmacogenomics network supported with scientific evidences: from the view of drug repurposing. BioData Min. 2015;8:9.
Schilsky RL. Personalized medicine in oncology: the future is now. Nat Rev Drug Discov. 2010;9(5):363–6.
We thank Gavin Oliver for his insights into state-of-the-art IM initiatives, Eric C. Polley for guidance on statistical methodologies, Aminah Jatoi, M.D., and James N. Ingle, M.D., for their insights into clinical treatment considerations. The results published here are in whole or part based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov/. This work was funded by the Center for Individualized Medicine, Mayo Clinic.
Mayo Clinic Center for Individualized Medicine.
Availability of data and materials
Data were downloaded from the TCGA Research Network. We refer to data by their TCGA identifiers throughout the manuscript. We include the full list of participants and their classification by our method in the supplemental materials.
Ethics approval and consent to participate
The results shown here are based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov [34, 35]. All data downloaded from TCGA is publically accessible and de-identified. Patients were consented by the TCGA Research Network. Documentation about consent and sample acquisition is publically posted: https://cancergenome.nih.gov/abouttcga/policies.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original version of this article was revised as it was highlighted that in the original article Figure 1 was omitted and figure 2 appeared twice.
PK gene profiles across human cancers. This file contains additional text and figures that support the analyses presented in the main text. (DOCX 1543 kb)
Table S1. This data file contains the full list of TCGA cases for which the TEM identified a potentially relevant interaction between PK gene expression levels and at least one administered therapy. (XLSX 41 kb)