Gene expression of PMP22 is an independent prognostic factor for disease-free and overall survival in breast cancer patients
© Tong et al; licensee BioMed Central Ltd. 2010
Received: 9 April 2010
Accepted: 15 December 2010
Published: 15 December 2010
Gene expression of peripheral myelin protein 22 (PMP22) and the epithelial membrane proteins (EMPs) was found to be differentially expressed in invasive and non-invasive breast cell lines in a previous study. We want to evaluate the prognostic impact of the expression of these genes on breast cancer.
In a retrospective multicenter study, gene expression of PMP22 and the EMPs was measured in 249 primary breast tumors by real-time PCR. Results were statistically analyzed together with clinical data.
In univariable Cox regression analyses PMP22 and the EMPs were not associated with disease-free survival or tumor-related mortality. However, multivariable Cox regression revealed that patients with higher than median PMP22 gene expression have a 3.47 times higher risk to die of cancer compared to patients with equal values on clinical covariables but lower PMP22 expression. They also have a 1.77 times higher risk to relapse than those with lower PMP22 expression. The proportion of explained variation in overall survival due to PMP22 gene expression was 6.5% and thus PMP22 contributes equally to prognosis of overall survival as nodal status and estrogen receptor status. Cross validation demonstrates that 5-years survival rates can be refined by incorporating PMP22 into the prediction model.
PMP22 gene expression is a novel independent prognostic factor for disease-free survival and overall survival for breast cancer patients. Including it into a model with established prognostic factors will increase the accuracy of prognosis.
Breast cancer is by far the most frequent cancer of women with about one million new cases every year worldwide. Even though the prognosis for breast cancer patients is rather good, it is still the leading cause of cancer mortality in women causing about 400,000 annual deaths . So far, the most important prognostic factor is lymph node status, which indicates disease-free survival and overall survival in breast cancer. The well defined predictors include the presence of hormone receptors that predict the response to endocrine therapy and the HER2 status that predicts the response to Tratuzumab. However, there is no predictive factor for chemotherapy that can be clinically used . The prognosis of breast cancer is also far from being precise. Identification of new prognostic and predictive markers will not only help patients to receive the proper treatment, it can also provide new therapeutic targets.
The invasive potential of tumor cells reflects the intrinsic characteristics of tumor cells. Genes involved in the invasive process might therefore correlate with outcome of the disease and have certain prognostic and predictive values. In a previous study, we characterized the cell lines derived from breast cancer or normal breast tissues by their invasive ability to penetrate into a collagen-fibroblast matrix and compared gene expression profiles of invasive and non-invasive cell lines using Affymetrix GeneChip technology . Several genes, which had not been described in the context of breast cancer, were identified and validated by RT-PCR. Two of these genes code for members of a subfamily of small hydrophobic membrane proteins, namely EMP3 and PMP22. Both are highly expressed in most of the invasive cell lines and had very low expression levels in non-invasive cell lines.
The whole family consists of the peripheral myelin protein 22 (PMP22) and the epithelial membrane proteins (EMP1, -2, and -3), which are expressed in many tissues, and have functions in cell growth, differentiation, and apoptosis .
We hypothesize that these genes can have prognostic impacts on breast cancer. The objectives of the study are defined as the measurement of the expression of the EMPs and PMP22 in tumor tissues from 249 breast cancer patients using real-time RT-PCR, statistical evaluation of their prognostic impacts, and assessment of their added values to already established prognostic factors.
Breast cancer patients
Patients' age and histopathological characteristics of tumors
Mean gene expression
Sample number (%)
invasive ductal carcinoma
invasive lobular carcinoma
others and unknown
pT1 (<2 cm)
pT2 (2-5 cm)
pT3 (>5 cm)
others and unknown
p = 0.005
p = 0.019
P < 0.0001
p < 0.001
p < 0.001
p < 0.001
Fresh tumor biopsies from breast carcinomas were collected during surgery and snap frozen immediately after the histologic examination of frozen sections. Only samples consisting of at least 90% tumor tissues were collected. Clinical pathological parameters were determined at the Department of Pathology, Medical University of Vienna. Characteristics of tumors are shown in Table 1. Tumor biopsies were frozen in liquid nitrogen until further processed.
Total RNA preparation
Tissues were homogenized using a microdismembrator and dissolved in GI lysis buffer (4 M Guanidine Isothiocyanate, 0.5% N-lauroyl-Sarcosine, 10 mM EDTA, 5 mM Sodium Citrate, and 100 μM β-mercaptoethanol). Total RNA was extracted from tumor biopsy lysates by isopycnic centrifugation as described previously  followed by a DNA digestion step of incubation with RNase-free DNase I (Roche Diagnostic, Mannheim, Germany) at 37°C for 15 minutes. The quality of the RNA was examined with RNA 6000 Nano Chips and RNA 6000 Nano Reagent & Supplies on a 2100 Bioanalyzer (Agilent Technologies, Waldbronn, Germany). RNA concentrations were determined spectrophotometrically.
Reverse transcription (RT)
RT was carried out using Omniscript Reverse Transcriptase kit (Qiagen, Hilden, Germany). The total reaction volume was 20 μl including 500 ng RNA. The reaction mixture was incubated at 37°C for 60 min, heated at 95°C for 10 min and then cooled on ice. The reaction was diluted 1:4 with water and aliquoted for further analysis.
The primers and probes for beta-2-microglobulin were included in TaqMan PDAR B2 M RNA Control Reagent (Applied Biosystems, Foster City, CA). For the quantifications of EMPs, PMP22 and ER, "Assay-on-Demand" kits (Applied Biosystems) were used (ER: Hs00174860_m1; EMP1: Hs00608055_m1; EMP2: Hs00171315_m1; EMP3: Hs00171319_m1 and PMP22: Hs00165556_m1). 5700 Sequence Detection System (Applied Biosystems) was used for real-time analysis. 4 μl diluted cDNA aliquot was used as template for PCR in a total volume of 25 μl including TaqMan Universal PCR Master Mix and the corresponding probes and primers. The mixture was pre-incubated at 95°C for 10 min followed by 40 cycles of two step incubations at 95°C for 15 s and 60°C for 1 min. All samples were measured in duplicates.
Quantitation of gene expression
The relative quantitation method with standard curves was used for the calculation of the relative amounts of mRNAs. A sample with a high expression level of a certain gene was chosen as calibrator. Its expression was defined as 1. A standard curve using serial dilutions of the calibrator was used to calculate the amount of RNAs in other samples. Target quantities of all other samples were expressed as n-fold in relation to the calibrator. To correct the quantity differences in the starting RNA samples, the target quantity of certain mRNA was normalized to that of the constitutively expressed house keeping gene beta-2-microglobulin in the same sample.
Estrogen receptor status by expression analysis
Protein levels of estrogen receptor (ER) in tumors were primarily determined using immunohistochemistry. Since ER status was missing for 56 samples (22.5%), we re-determined it using mRNA gene expression values. A similar procedure was suggested and used in a previous study . We measured the ER gene expression in a cohort of breast cancer tissues with known clinical ER status obtained by immunohistochemistry and used that value of ER gene expression as cutoff point for ER status, which minimized the sum of false positive and false negative rates.
For model building, the following parameters were considered besides the expression values of the markers analysed: age at diagnosis, histological type, nodal status, tumor size, differentiation grade, and estrogen receptor status.
Mean values and 95% confidence intervals for genes expression were calculated on a logarithmic scale (log2) and then transformed back to the original scale (Table 1). In order to compare the gene expression between two or more groups, T-test or one-way ANOVA, respectively, were performed using the log-transformed expression as independent variable with subsequent Bonferroni-Holm correction for multiple testing.
Disease-free survival is defined as time between diagnosis of disease and recurrence or distant metastasis. Overall survival is defined as time from diagnosis of disease to death of patients of breast cancer. Patients who died of causes unrelated to breast cancer were treated as censored in disease-specific survival analysis. Median follow-up time was computed by the Kaplan-Meier method with reverse status indicator as proposed by Schemper and Smith . For analysis of disease-free and overall survival, tumors with differentiation grade 1 and 2 were combined for comparison with those with differentiation grade 3 and tumors with pT1 were compared with those combining pT2, pT3 and pT4. These groupings were necessary because of the low number of cases in some subgroups.
The assumptions of the Cox models (additivity of effects, proportional hazards, linearity of effects) were assessed as follows. First, interactions of pairs of variables were evaluated by including and testing corresponding product terms. Second, time-dependency of hazard ratios was accounted for by testing correlation of scaled Schoenfeld residuals with time . Third, gene expression was entered into analysis by using the log2-values instead of categories, and a potential non-linear effect of gene-expression was tested by including additional model terms that were derived from a linear-tail restricted cubic spline. These sensitivity analyses uncovered a time-dependent effect of estrogen receptor status on tumor-specific survival, which was accounted for by including a time-dependent covariate defined as the product of estrogen receptor status (1 or 0) and the logarithm of survival time. However, this did not alter any conclusion on the other variables.
The predictive ability of the multivariable models was assessed by computing the proportion of explained variation due to Schemper and Henderson . Furthermore, relative importance of variables was assessed by omitting variables one-by-one from the multivariable model as suggested by Heinze and Schemper . Furthermore, we evaluated the predictive ability using ten-fold cross-validation as follows: first, the data set was randomly split into 10 approximately equally-sized subsamples. Second, nine of the ten subsamples were merged to form a training sample. Regression coefficients were estimated for the multivariable model including all variables using the training sample, and risk scores were predicted for the remaining tenth subsample (the test sample). These risk scores were obtained by inserting the estimated regression coefficients and the observed variable values of the test sample into the linear model equation. This second step was repeated ten times in turn such that each subject once appeared in the test sample and such was assigned a cross-validated risk score. Third, the cross-validated risk scores were stratified into quartiles and Kaplan-Meier curves, 5-year survival rates and a log-rank test were used to describe the association of risk scores with survival. The process was repeated, omitting gene expression variables from the model to assess if and to which extent prediction worsens if gene-expression was not accounted for. This ten-fold cross-validation was seen as more appropriate than a single split-up into a training set and a test set, as the former yields more accurate results than the latter . P-values < 0.05 were considered as indicating statistical significance. The statistical software packages R 2.4.2 http://www.r-project.org and SAS 9.1.3 (2003 SAS Institute Inc., Cary, NC) were used for statistical graphics and analyses, respectively.
Estrogen receptor status was determined as follows: Using 207 samples, of which we had both, immunohistochemical and gene expression results of ER, we determined a cut-off value of 0.4984 for the gene expression to differentiate ER positive and ER negative tumors. Using this value to judge the status of ER, the original immunohistochemical data had a 19.2% false positive and 26.5% false negative rate, respectively. All samples in this study were re-evaluated for their ER status using this cut-off value, which generated 116 negative and 133 positive tumors.
Estimates of hazard ratios for tumor related-death
95% Confidence Interval Of Hazard Ratio
95% Confidence Interval of Hazard Ratio
0.69 - 1.77
0.69 - 1.77
0.81 - 2.07
1.82 - 6.62
1.10 - 2.26
1.15 - 2.47
2.00 - 6.77
1.80 - 6.65
0.72 - 1.47
0.70 - 1.62
0.33 - 0.84
0.34 - 0.98
0.33 - 0.84
0.17 - 0.55
Estimates of hazard ratios for recurrent disease
95% Confidence Interval of Hazard Ratio
95% Confidence interval of Hazard Ratio
0.69 - 1.47
0.70 - 1.50
0.64 - 1.36
0.69 - 1.46
1.09 - 2.87
0.99 - 1.78
0.93 - 1.74
1.73 - 4.27
1.69 - 4.38
0.78 - 1.44
0.70 - 1.38
0.43 - 0.92
0.31 - 0.78
Proportion of explained variation (PEV)
5-year survival rates calculated by cross validation
5-year DFS rate
5-year OS rate
Quartile (Risk Scores)
1 (lowest risk)
2 (intermediate low risk)
3 (intermediate high risk)
4 (highest risk)
Rate difference (lowest to highest risk)
p = 0.001
PMP22 and EMPs were selected for the evaluation of their prognostic values based on their higher expression levels in invasive breast cell lines compared to non-invasive ones. The invasiveness of these cell lines was determined by the ability of the cells to penetrate into a collagen-fibroblast matrix . Cell motility and the capacity to invade into the surrounding tissues are preconditions for tumor cells to metastasize. Genes that are not expressed or expressed to less extent in non-invasive cells but are highly activated in invasive cells could be markers for prediction of tumor metastases. They could also indirectly indicate the outcome of patients. Indeed, we showed that patients with higher expression of PMP22 in their tumors have both, worse DFS and OS, suggesting that PMP22 is involved directly or indirectly in the invasion process. Our study also suggests that invasive and non-invasive cell lines provide a useful model for searching for prognostic factors.
In this study, we did not only show that PMP22 gene expression has prognostic value on DFS and OS, we also showed that PMP22 gene expression is as powerful as nodal status and ER status to predict mortality of breast cancer patients by calculating the proportion of explained variation. Traditionally, the prognostic values of gene expression were only evaluated by multivariable Cox regression model. The gain of including additional prognostic factors was not well addressed. Even though many new prognostic biomarkers have been reported, quite often they don't increase the predictive accuracy when added to the established clinical predictive factors . By leaving out one of the three important prognostic factors, namely PMP22 gene expression, pN, or ER, the proportion of explained variations decreased equally, demonstrating that PMP22 expression contributes equally to prognosis as pN or ER status does. Therefore, PMP22 has potential use in clinical practice.
Using cross validation, we compared the ranges of 5-year survival rates of breast cancer patients between models including and excluding PMP22 gene expression. In these analyses, risk scores were stratified into quartiles. The results show that by including PMP22 gene expression into the prediction model, the accuracy of the prediction was significantly increased. This indicates that including of PMP22 gene expression into clinical risk evaluation can refine the prognosis, again showing the added value of PMP22 gene expression to prognosis. It is of great interests to establish a prognostic score including PMP22 gene expression values and other know independent factors.
So far, expression and functions of PMP22 have been well investigated in neuroscience. Abnormalities in PMP22 can lead to various peripheral neuropathies . Increased PMP22 expression was found in other pre-malignant or malignant tissues, like pancreatic tissues , osteosarcoma, and glioblastoma tissues [16, 17]. However, little is known about the functions of PMP22 in human cancer. It is very interesting to further investigate the possible functions of PMP22 in breast cancer and to clarify its roles in tumor invasion, so that we can better understand its prognostic impact.
In this study we show that breast cancer patients with higher than median PMP22 gene expression had a 3.47 times higher risk to die of cancer than patients with lower than median PMP22 expression. They also had a 1.77 times higher risk to relapse than those with lower than median PMP22 expression levels. The analysis of the proportion of explained variation suggests that gene expression of PMP22, ER status and pN variables are equally important to predict mortality of breast cancer. Cross validation of the 5-year survival rates showed that the model including PMP22 expression has a broader range of prediction, therefore a better discrimination of different risk groups than models excluding PMP22 expression. This is true for both DFS and OS, showing that PMP22 has an additive value in predicting survival after the diagnosis of breast cancer.
Taking together, PMP22 gene expression is an independent prognostic factor for disease-free and overall survival for breast cancer patients. It contributes equally to the prediction of cancer related death as estrogen receptor status and nodal status. Including PMP22 gene expression into a multivariable model including ER and nodal status, the accuracy of the prediction can be increased.
Functional studies on PMP22 in breast cancer should be investigated to elucidate its roles in the progression of breast cancer and to explain if it could be a therapy target.
We thank Mrs. Eva Schuster and Mrs. Barbara Holzer for their technical supports.
- Parkin DM, Bray F, Ferlay J, Pisani P: Global cancer statistics, 2002. CA Cancer J Clin. 2005, 55: 74-108. 10.3322/canjclin.55.2.74.View ArticlePubMedGoogle Scholar
- Lønning PE: Breast cancer prognostication and prediction: are we making progress?. Ann Oncol. 2007, 18 (Suppl 8): viii3-7.PubMedGoogle Scholar
- Evtimova V, Zeillinger R, Weidle UH: Identification of genes associated with the invasive status of human mammary carcinoma cell lines by transcriptional profiling. Tumour Biol. 2003, 24: 189-198. 10.1159/000074429.View ArticlePubMedGoogle Scholar
- Jetten AM, Suter U: The peripheral myelin protein 22 and epithelial membrane protein family. Prog Nucleic Acid Res Mol Biol. 2000, 64: 97-129. full_text.View ArticlePubMedGoogle Scholar
- Kury FD, Schneeberger C, Sliutz G, Kubista E, Salzer H, Medl M, Leodolter S, Swoboda H, Zeillinger R, Spona J: Determination of HER-2/neu amplification and expression in tumor tissue and cultured cells using a simple, phenol free method for nucleic acid isolation. Oncogene. 1990, 5: 1403-1408.PubMedGoogle Scholar
- Tutt A, Wang A, Rowland C, Gillett C, Lau K, Chew K, Dai H, Kwok S, Ryder K, Shu H, Springall R, Cane P, McCallie B, Kam-Morgan L, Anderson S, Buerger H, Gray J, Bennington J, Esserman L, Hastie T, Broder S, Sninsky J, Brandt B, Waldman F: Risk estimation of distant metastasis in node-negative, estrogen receptor-positive breast cancer patients using an RT-PCR based prognostic expression signature. BMC Cancer. 2008, 8: 339--10.1186/1471-2407-8-339.View ArticlePubMedPubMed CentralGoogle Scholar
- Schemper M, Smith TA: Note on Quantifying Follow-up in Studies of Failure Time. Controll Clin Trials. 1996, 17: 343-346. 10.1016/0197-2456(96)00075-X.View ArticleGoogle Scholar
- Grambsch PM, Therneau T: Proportional Hazards Tests and Diagnostics based on Weighted Residuals. Biometrika. 1994, 81: 515-526. 10.1093/biomet/81.3.515.View ArticleGoogle Scholar
- Schemper M, Henderson R: Predictive Accuracy and Explained Variation in Cox Regression. Biometrics. 2000, 56: 249-255. 10.1111/j.0006-341X.2000.00249.x.View ArticlePubMedGoogle Scholar
- Heinze G, Schemper M: Comparing the importance of prognostic factors in Cox and logistic regression using SAS. Comput Methods Programs Biomed. 2003, 71: 155-163. 10.1016/S0169-2607(02)00077-9.View ArticlePubMedGoogle Scholar
- Harrell FE: Resampling: Validating, Describing, and Simplifying the Model. Regression Modeling Strategies. Edited by: Harrell FE Jr. 2001, New York: Springer New York, 93-94.View ArticleGoogle Scholar
- Tong D, Czerwenka K, Sedlak J, Schneeberger C, Schiebel I, Concin N, Leodolter S, Zeillinger R: Association of in vitro invasiveness and gene expression of estrogen receptor, progesterone receptor, pS2 and plasminogen activator inhibitor-1 in human breast cancer cell lines. Breast Cancer Res Treat. 1999, 56: 91-97. 10.1023/A:1006262501062.View ArticlePubMedGoogle Scholar
- Dunkler D, Michiels S, Schemper M: Gene expression profiling: does it add predictive accuracy to clinical characteristics in cancer prognosis?. Eur J Cancer. 2007, 43: 745-751. 10.1016/j.ejca.2006.11.018.View ArticlePubMedGoogle Scholar
- Douglas DS, Popko B: Mouse forward genetics in the study of the peripheral nervous system and human peripheral neuropathy. Neurochem Res. 2009, 34: 124-137. 10.1007/s11064-008-9719-4.View ArticlePubMedGoogle Scholar
- Li J, Kleeff J, Esposito I, Kayed H, Felix K, Giese T, Büchler MW, Friess H: Expression analysis of PMP22/Gas3 in premalignant and malignant pancreatic lesions. J Histochem Cytochem. 2005, 53: 885-893. 10.1369/jhc.4A6546.2005.View ArticlePubMedGoogle Scholar
- Hühne K, Park O, Liehr T, Rautenstrauss B: Expression analysis of the PMP22 gene in glioma and osteogenic sarcoma cell lines. J Neurosci Res. 1999, 58: 624-631.View ArticlePubMedGoogle Scholar
- van Dartel M, Leenstra S, Troost D, Hulsebos TJ: Infrequent but high-level amplification of 17p11.2 approximately p12 in human glioma. Cancer Genet Cytogenet. 2003, 140: 162-166. 10.1016/S0165-4608(02)00683-0.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2407/10/682/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.