Skip to main content

Two-dimensional electrophoretic comparison of metastatic and non-metastatic human breast tumors using in vitrocultured epithelial cells derived from the cancer tissues



Breast carcinomas represent a heterogeneous group of tumors diverse in behavior, outcome, and response to therapy. Identification of proteins resembling the tumor biology can improve the diagnosis, prediction, treatment selection, and targeting of therapy. Since the beginning of the post-genomic era, the focus of molecular biology gradually moved from genomes to proteins and proteomes and to their functionality. Proteomics can potentially capture dynamic changes in protein expression integrating both genetic and epigenetic influences.


We prepared primary cultures of epithelial cells from 23 breast cancer tissue samples and performed comparative proteomic analysis. Seven patients developed distant metastases within three-year follow-up. These samples were included into a metastase-positive group, the others formed a metastase-negative group. Two-dimensional electrophoretical (2-DE) gels in pH range 4–7 were prepared. Spot densities in 2-DE protein maps were subjected to statistical analyses (R/maanova package) and data-mining analysis (GUHA). For identification of proteins in selected spots, liquid chromatography-tandem mass spectrometry (LC-MS/MS) was employed.


Three protein spots were significantly altered between the metastatic and non-metastatic groups. The correlations were proven at the 0.05 significance level. Nucleophosmin was increased in the group with metastases. The levels of 2,3-trans-enoyl-CoA isomerase and glutathione peroxidase 1 were decreased.


We have performed an extensive proteomic study of mammary epithelial cells from breast cancer patients. We have found differentially expressed proteins between the samples from metastase-positive and metastase-negative patient groups.

Peer Review reports


Breast cancer is the most common cancer affecting women worldwide. Human breast carcinomas represent a heterogeneous group of tumors diverse in behavior, outcome, and response to therapy. Despite tremendous advances in screening, diagnosis, and treatment, causes of this disease remain elusive and complex.

It has been hypothesized that the clinical and genetic heterogeneity of breast cancer is a result of activation of different oncogenes or loss of different tumor suppressor genes in specific stem/progenitor cells [1]. The genetic and immunohistochemical analysis led to further clasification of human breast cacinomas as basal or luminal according to their cell type origin. To date, five types of breast carcinomas have been recognized according to the molecular genetics profiling [2, 3].

The nature of molecular changes varies between breast tumors and determines the characteristics of the disease. Current research priority is to develop methods to identify the most informative molecular changes, also known as disease markers. Thus the treatment strategy could be optimized and individualized using molecular-biological properties of the patient's tumor cells.

At present, several prognostic and predictive factors are commonly used in the breast carcinoma treatment. They include clinical factors such as tumor size, stage and histological type, histological grade, number and scale of regional lymph node involvement, hormone-receptor levels (ER, PR), HER-2/neu expression level and nuclear DNA ploidy. The significance of these factors has been clearly determined and together with the clinical state of the patient they are the main determinants in the process of selection of treatment modality [4]. Despite the research and treatment advances, the outcome of patients is still often poor. Clearly, there is a critical need to find new molecular parameters not only for detection, but also for classification and treatment of the breast cancer.

Proteomics is a rapidly developing field that can explore the heterogeneity of breast cancer and supplement the wealth of information gained from genomics. Breast cancer is one of the most studied cancers in proteomics. Studies investigating differential expression of proteins between normal and breast cancer cells revealed changes in the composition of cytoskeletal elements such as cytokeratin distribution and tropomyosin expression, the differential distribution of molecular chaperones (heat shock protein family members, protein folding enzymes, 14-3-3 σ) has been described together with elevated levels of glycolytic enzymes (aldolase, glyceraldehyde dehydrogenase) [5, 6]. Roles of lysozomal proteases (cathepsin D, cathepsin B) and matrix metalloproteases (MMPs) in the breast cancer development and progression have been explored [7].

However, proteomic analysis of larger amounts of clinical samples is so far a challenge [8]. Two-dimensional gel electrophoresis (2-DE) facilitates the separation of proteins from highly complex protein mixtures and has become a central method in proteomics in recent years. Unfortunately, the 2-DE methodology remains labor intensive and also the subsequent gel analysis is difficult. Although the 2-DE processing softwares are continuously developing, their full automation is immense [9, 10]. The methodology also puts demands on sample amount and composition. Selection of the most convenient samples containing sufficient amount of proteins suitable for 2-DE proteomic analyses is of crucial importance. Whereas differential proteomic analysis of breast tissue biopsies is complicated due to heterogeneity of cellular phenotypes contained in the sample [11], cells in culture represent a homogenous system, which can be to a certain extent defined and specifically altered.

Optimized feeder layer technique was adapted for cultivation of mammary gland epithelial cells [12]. Successful in vitro expansion of luminal cells together with myoepithelial cells in heterogeneous populations of human breast epithelial cells was achieved. It is assumed that among the bulk of cells forming the body of the tumor only a few drives the tumor outgrow. They are supposed to be derived from the so-called stem or progenitor cells [13, 14]. Recently, we have characterized a new cell line, EM-G3, possessing some characteristics of putative breast progenitor cells. The cell line was established from the primary culture of breast cancer biopsy sample using the optimized feeder layer technique [15, 16]. We believe that our method of temporal in vitro propagation of cells from breast cancer tissues could partially lead to the selection of cells relatively close to putative tumor stem cells [12, 17]. We performed the 2-DE protein analysis of malignant breast cancer cells cultivated from tissues of different patients in various stages of breast cancer. We tried to find association among possible variations in the expression of proteins and clinical outcome of breast cancer patients. R/computing environment was used to perform statistical analyses [18]. Namely, the analyses based on the R/maanova package [19] have been performed. We further employed the data-mining technique GUHA (General Unary Hypothesis Automaton) to reveal possible relations among protein spots and their impact on clinical image [20]. The GUHA is a method of exploratory data analysis with logical and statistical backgrounds. It automatically formulates and tests a huge amount of hypotheses on relations in data and reveals the "interesting" ones. Some potential candidates for protein markers ensue from these trials.



The samples were obtained in the years 1999 – 2002 from women who underwent partial breast resection or radical mastectomy at the General Faculty Hospital in Prague. Patients were chosen unselectively at the time of operation. The patient's written informed consent approved by the Ethical committee of the General Faculty Hospital in Prague was obtained prior to surgery. The morphology of tumors was determined and immunocytochemical staining for hormonal receptors (ER, PR), HER 2/neu and antigen Ki67 was performed. The patients were treated according to the stage-adjusted therapeutic standards. We estimated the clinical outcome of the patients. The patients with follow-up at least three years were chosen for further analysis. The patients were divided into two groups: distant metastase-free after three years and patients with proven distant metastases.


Paraffin sections 5 μm from formalin-fixed tissues were used. The tissue sections were incubated with primary antibodies ER, Dako (Glostrup, Denmark), clone 1D5, dilution 1 : 100; PR, Novocastra (Newcastle, UK) clone 16, dilution 1 : 100; Ki67, Novocastra (Newcastle, UK) clone MIB-1, dilution 1 : 50. Immunodetection was performed with the universal immuno-peroxidase polymer Histofine, Nichirei Biosciences INC (Tokyo, Japan). Detection of HER 2/neu was performed using HercepTest TM assay detection system, Dako (Glostrup, Denmark). Five percent 3,3'-diaminobenzidine tetrahydrochloride chromogen solution was used for visualization. Positive and negative controls were included in each run of slides.

Cell cultures

Primary cell cultures were isolated from biopsies of human breast carcinomas. The cells were cultured by the 3T3 feeder-layer technique [12, 17].

The cells in the second or third passage were grown to confluence, harvested and stored in liquid nitrogen in the culture medium containing 10% dimethylsulfoxid and 20% of bovine serum. The cells designated for 2-DE analyses were thawed, seeded, cultivated to confluent layers and harvested as described in Selicharova et al. [15]. Out of 120 cultivated samples, primary cultures from tumor tissue of 23 patients were suitable for further 2-DE based analysis because of a sufficient amount of cultivated cells (about five millions).

Two-dimensional gel electrophoresis

The cell lysate (70 μg of proteins) in rehydration buffer composed of 7 M urea, 2 M thiourea, 4% (w/v) CHAPS, 50 mM DTT, 0.8% (v/v) ampholytes (pH 3–10) was applied to 18 cm linear IPG strips pH 4–7, GE Healthcare (Uppsala, Sweden). 2-DE was performed exactly as described [15]. Briefly, the IEF of rehydrated strips was performed with stepwise increasing voltage as follows: 250 V for 1 h, 500 V for 1 h, 1000 V for 2 h and 10,000 V for the time period necessary to reach 70,000 Vh in total. The focused strips were equilibrated for 30 min in the solution containing 6 M urea, 20% (v/v) glycerol, 2% (w/v) SDS, 0.05 M Tris/HCl pH 8.8 and 2% (v/v) DTT with traces of Bromphenol Blue. Then free thiol groups were alkylated for 30 min in the same solution containing 2.5% (w/v) iodoacetamide instead of DTT. The SDS-PAGE on gradient gels (8–16%, 4% stacking gel, 19 × 22 cm) was performed in 0.025 M Tris/0.192 M Glycine with 0.1% (w/v) SDS running buffer for 1 h at 16 mA and for about 9 h at 24 mA per gel till the Bromphenol Blue line has reached the bottom of the gel. Three silver-stained analytical gels were prepared from each sample. All the common chemicals were from Sigma (St. Louis, USA) and Fluka (Buchs, Switzerland).

Image analysis

Gels were scanned by a GS-800 Calibrated Densitometer, Bio-Rad (Hercules, CA) at 700 dpi resolution. The images were further processed by PDQuest Advaced 8.0.1 2D Gel Analysis Software, Bio-Rad (Hercules, CA). For computational purposes the file size was reduced to 50% and the images were cropped to frame the same clusters of spots. One or two representative gels per each cell population were used to create a match-set. Spots were detected and matched automatically to a master gel selected by the software. The spot detection and matching were edited manually. The spot boundary tool was applied to detect large spots. The patterns in sections of the gels in appropriate magnification were checked and spots were added manually to the master gel to allow matching unique spots present in the individual gels. The spot quantity table containing all matched spots was generated. The quantity of missing spots was estimated by the software. The means of logarithmic ratios method was used for normalization. The mean of log ratios method of normalization calculates the normalization factor of a gel by calculating the mean of all log ratios (log spot quantity of gel/log spot quantity of master gel) of all matched spots (master gel – gel). The quantity table was exported to a spreadsheet .xls file and submitted to statistical analyses (Additional data file 1).

Statistical analysis and data mining – GUHA (General Unary Hypothesis Automaton)

Independent statistical tests were performed using R/computing environment in version 2.6.0 [18] and by adapting R/maanova package version 1.8.0. [19] which has been designed for processing microarray data. It implements sample shuffling. R/maanova provides a permutation method to calculate the nominal permutation p-values for each gene (i.e. spot intensity) using Fs test statistics. Because of multiple testing, the p-values were adjusted to false discovery rate [21].

The relations among spot intensities and clinical image were analyzed on the basis of data-mining technique GUHA [20]. The analyzed data were stored in a source database in the form of a table of n rows (objects = 2-DE gels) and m columns (variables = spot intensities). The variables were dichotomized. Each variable was categorized. Categories were actually subsets of ranges of variables given by cut points. The category was evaluated as 1 if the value of a variable dropped within a subset given by the respective cut point otherwise it was evaluated as 0. The settings of cut points were based on the specification and behavior of an impurity function [22]. In our application we employed the entropy function as impurity function for each intensity variable classified with respect to the clinical variable metastases. The impurity function takes its minimum if all objects are classified as 0 or 1. The maximum is reached if roughly a 50/50 mixture of classes is present in the group. The idea is to split the original group into two sub-groups in such a way that impurity decreases in a maximal way, i.e., that the sum of impurities of sub-groups is minimized. We identified optimal splits and corresponding optimal cut points by a script in MATLAB [23]. The cut points then enabled us to categorize spot intensities. A GUHA hypothesis is determined by the ordered pair of cedents (antecedent (A) and succedent (S)) and by a quantifier. Cedents are Boolean conjunctions formed from individual categories. The length of a cedent is given by the number of categories forming the conjunction. A cedent of length = 1 corresponds to a single category (simple cedent). Cedents of length > 1 are called compound cedents. For a given object a cedent can be evaluated as 1 (true) or 0 (false). The evaluation stems from the evaluation of single categories forming the cedent and rules for Boolean conjunction. For a given pair of cedents, we can construct a corresponding contingency table by evaluating cedents for all objects in the database and then perform statistical tests on this table. The employed quantifier determines the type of test. We used the Fisher quantifier corresponding to Fisher's exact test. A hypothesis is formally written A ~S, and if it is valid (statistically significant), it is revealed in the GUHA output. Q-values were calculated by q-val package [21]. Statistical tests were two-sided at the 5% level of significance.

Characterization of proteins

The spots generated from the statistical analyses as significantly changed were researched by their spot ID in the match set created by the PDQuest software. The relative molecular masses (Mr) and isoelectric points (pI) were estimated for each protein from their positions in the gels. The statistically important spots were considered for identification. The preparative 2-DE gels were prepared from cells with a relatively high content of the protein of interest using 400 μg of the cell lysate. They were stained with colloidal Coomassie stain [24].

Mass spectrometry and protein identification

Selected spots on the preparative gels were excised and destained using 50% acetonitrile in 25 mM ammonium bicarbonate, dehydrated with 200 μl of acetonitrile for 5 min at 30°C using thermomixer comfort, Eppendorf AG (Hamburg, Germany) at 30°C and then vacuum-dried in SpeedVac, Thermo Scientific (Waltham, Ma). Gel pieces were rehydrated and proteins were digested for 8 hours at 37°C in the thermomixer with 30 ng/μl trypsin (Trypsin Gold Mass Spectrometry Grade, Promega, Madison, WI) in 25 mM ammonium bicarbonate. After digestion, peptides were extracted from gel pieces using step by step extraction with acetonitrile gradient (15%–60% acetonitrile with 1% trifluoroacetic acid). The extraction was performed in sonicator, Elma (Singen, Germany) with ice cubes.

Extracted peptides were concentrated in SpeedVac, Thermo Scientific (Waltham, Ma). Tandem electrospray ionization mass spectrometry (ESI-MS/MS) was used to characterize the digests. The ESI-MS/MS was performed in a quadrupole-time of flight (Q-TOF) tandem Micro mass spectrometer (Waters-Micromass) equipped with nanoelectrospray source and coupled to 2-D capillary chromatography CapLC (Waters). Chromatographic separation was achieved using the symmetry 300 Å OPTI-PAC (1 cm × 5 μm) trap column (Waters) and Atlantis dC18 (75 μm × 10 cm × 3 μm) capillary column (Waters). Data were processed by proteomic software Proteinlynx global server 2.1 (Waters) (LC-MS/MS).


Clinico-pathological characteristics of the patients

Primary cultures of breast epithelial cells available for 2-DE analysis were cultivated from biopsy specimens of 23 breast cancer patients. An example of ER, PR and HER 2/neu positive immunocytochemical staining of tumor tissue of patient L116 together with in vitro outgrowths of the cells are shown in Figure 1(A–E). The example of positive staining for Ki67 with MIB-1 in Figure 1(F) is shown for sample L122. Dominating histology of tumors were invasive ductal carcinomas (16 patients), mostly high grade, five lobular carcinomas, one of them pleomorphic, one mucinous and one medulary carcinoma. Sixteen tumors were ER-positive, seven were ER-negative including three carcinomas, which were triple-negative (ER-negative, PR-negative, HER 2/neu negative). Four tumors were stained as HER 2/neu 3+ and five other were HER 2/neu 2+. The main tumor characteristics are given in Table 1. Metastatic spreading was proven in seven patients during postoperative follow-up. The pattern of metastases and time to relapse are given in Table 2.

Figure 1

Examples of positive staining of tumoral tissue sections. A) Invasive ductal carcinoma, patient L116. Hematoxylin-eosin stain in 200× magnification reveals poor differentiation with a predominantly solid infiltrative pattern. The nuclei are highly pleomorphic with notable nucleoli. B) Outgrowths of cells from tumor fragment in vitro phase contrast (400× magnified). C) Immunostaining for ER, patient L116 (100× magnified). D) Immunostaining for PR, patient L116 (100× magnified). E) Immunostaining for HER 2/neu, patient L116 (400× magnified). F) Immunostaining for Ki67 with MIB-1 antibody, patient L122 (400× magnified).

Table 1 Clinico-pathological characteristics of patients
Table 2 Metastatic pattern of patients in the metastase positive group

We intended to find differences in protein expression between the samples from metastase-positive and metastase-negative patient groups.

2-DE protein separation and image analysis

2-DE protein maps in the pH range 4–7 were prepared in triplicate. The 2-DE protein maps were very similar in all the cases and contained all the major spots that were identified in the normal mammary epithelial (NME) cell sample as described in [15]. A typical 2-DE gel is shown in Figure 2. The main protein spots are marked. The gels were analyzed by PDQuest Advanced 8.0.1 2D Gel Analysis Software. The software is not designed to compare multiple groups of samples that arise from our experimental setting (23 samples in triplicate). We compromised between the quantity of data and our ability to process them. The spot patterns in the area of the gels above Mr 50 kDa were not homogeneous and were not suitable for spot detection and matching by the PDQuest software due to huge protein clusters formed by cytokeratins and inconsistent streaks. Only spots in the pH range from 4.2 to 6.8 and Mr range from 15 kDa to 40 kDa were analyzed, matched and quantified. The area of the gel analyzed by PDQuest is outlined in Figure 2. A match-set containing 44 gels was created. Twenty-one samples were represented by two gels. Two samples were represented for capacity and gel quality reasons by one gel only. The correlation coefficients between technical replicates of gels ranged from 0.76 to 0.88. Matching of the most typical spots was checked and corrected manually. The spot patterns in distinct areas of gels were studied and spots unique to individual gels were added to the master gel to allow their analysis. Finally, 245 spots were matched and 70 of them were matched to all gels. The log-normalized spot quantity table containing all matched spots was generated and submitted to statistical analyses (Additional file 1).

Figure 2

Representative 2-DE map. Representative 2-DE map of proteins of primary culture of epithelial breast cancer cells (sample L116). The gels were silver-stained. Solubilized proteins were focused on IPG strips (pH 4–7) and separated in SDS polyacrylamide gradient gels (8–16%). The characteristic proteins are named according to [15], A = annexin, CK = cytokeratin, eIF = eukaryotic translation factor, FABP = fatty acid-binding protein, GST = glutathione transferase, HSP = heat-shock protein, PDI = protein disulfide isomerase, TCTP = translationally controled tumor protein, TPI = triosephosphate isomerase. Protein spots significantly changed between the metastase-positive and metastase-negative samples are marked and described in Tables 3, 4, 5. The area of gel analyzed using PDQuest is outlined (------).

Statistical analysis and data mining

We searched for protein spots changed between the seven samples of metastase-positive and sixteen samples of metastase-negative groups.

For the GUHA analysis we averaged the intensity values for each sample. Categorization of 245 spot variables was determined on the basis of specification of maximal impurity decrease generating cut points. The basic task was to search for statistically significant relations between the presence/absence of spots and their classifications (metastatic/non-metastatic). In GUHA task settings, the antecedent variables were spot variables, succedent variables clinic variables. The lengths of antecedent and succedent were set to 1 and Fisher quantifier was employed. We searched for hypotheses rejecting the null hypothesis on independence between the presence/absence of spots and classification of 2-DE gels. Tests were taken at the 0.05 probability level. However, because we performed many simultaneous tests we had to adjust the significance level using the q-value. The q-value was calculated from the value of corresponding Fisher statistics. Q-values represent the approximation of maximal false-positive rate among the proteins supposed to be significant. Spots 7305 and 1606 fulfilled the criteria of q-value ≤ 0.05. The data from the GUHA analysis are presented in Table 3. We are showing 10 spots with the lowest q-values. The complete data can be found in Additional file 2.

Table 3 GUHA data mining analysis

The false discovery rate adjusted p-value and the fold change were calculated using the R/maanova package [19]. All spots are presented in volcano plot (Huy) with log 2 fold change on x-axis and -log 10 (adjusted P-value) as y-axis (Figure 3). Spots 7305 and 5104 fulfilled statistical criteria of adjusted p-value ≤ 0.05 and log2 fold change ≥ 2 estimated by maanova. The data from maanova analysis are presented in Table 4. We are showing 10 spots with the lowest p-values and log2 fold change ≥ 1. The complete data can be found in Additional file 3.

Table 4 R/maanova
Figure 3

Volcano plot. The dots represent the individual spots from 2-DE analysis. The y-axis value is -log10(P-value) for the Fs test and the x-axis value is proportional to the fold changes (Huy, maanova [19]). The horizontal line represents the significance threshold 0.05 (Fs-adjusted). The red dots are the spots selected by the Fs test. Significantly changed spots are marked.

We considered the spots 7305, 5104 and 1606 that have been found by either type of computing as significantly changed in correlation with the metastases. The spots 5104 and 7305 were negatively correlated with the metastatic phenotype, the spot 1606 was correlated positively.

Characterization of proteins

The spots found by the statistical analyses were re-searched in the match-set created by the PDQuest software. The spots are marked in Figure 2 and enlarged sections of representative gels showing the respective spots in the metastatic and non-metastatic groups are presented in Figure 4. The data from protein identification are presented in Table 5. The identity of all peptides has been confirmed at least by three fragment ions in the MS/MS spectrum.

Table 5 Identification of significantly changed proteins between the metastase-positive and metastase-negative groups
Figure 4

Enlarged sections of representative 2-DE gels. Enlarged sections of representative 2-DE gels showing differential expression of spots 7305, 5104 and 1606 between the metastic and non-metastic groups. The averaged relative intensities of particular spots in individual samples from the PDQuest output are shown in the bar graphs. The bars corresponding to the representative samples are marked. Sample order in the bar graph is the same as in Table 1 and in Table 2, the patient groups being separated.

Spot 1606 had a diffused pattern in the 2-DE gel. It was identified as nucleophosmin, [Swiss-Prot:P06748]. Spots 7305 and 5104 were of very low intensity as could be estimated from the cut-off values and judged from Figures 2 and 4. Spot 7305 was identified as 2,3-trans-enoyl-CoA isomerase, [Swiss-Prot:P42126]. Spot 5104 was identified as glutathione peroxidase 1, [Swiss-Prot:P07203].


Breast cancer is one of the most intensively studied cancers. However, the breast cancer research proved to be extremely complicated due to the complex biology of mammary gland [11, 14]. We believe that our method of temporal in vitro propagation of cells from breast cancer tissues [12, 17] could partially lead to the selection of cells relatively close to putative tumor stem/progenitor cells. We suppose that through analysis of these cells we might indicate proteins responsible for the overall tumor behavior. We performed the 2-DE analysis of 23 primary cultures of epithelial cells derived from breast cancer tissues from which seven samples were metastase-positive.

All the 2-DE gels from different samples were similar to each other and were conformable to the normal mammary epithelial (NME) cell sample described in Selicharova et al [15]. The similarity of individual primary cultures of breast cancer cells was a good prerequisite for performing the comparative proteomic experiment. On the other hand, the experimental design based on the primary cultures substantially decreased the available amount of samples necessary for powerful statistics. From 120 human breast tumors we only obtained 23 usable cell cultures with a sufficient amount of cultured cells (about five millions).

Only few spots apparently varied among sets of gels from individual samples qualitatively or quantitatively. Some of the proteins have been identified (data not shown). We detected variations in quantity of cathepsin D, cathepsin B, squamous cell carcinoma antigen, γ synuclein, cytokeratin 19 and other proteins that have been reported to play a role in cancer etiology [2529]. We also found isoelectric variants of several proteins arising from common polymorphism that might have impact on the cancerogenesis (glutathione transferase ω, glyoxalase I) [30, 31]. On the other hand we have not observed variation in quantity of heat shock proteins (HSP 90, HSP 60, HSP 27) or the molecular chaperone 14-3-3 σ among our breast cancer cell cultures. These proteins have been reported to be altered in the breast cancer [5, 32]. Although these findings were exciting, we could not demonstrate without further validation the significance of above mentioned observations and their connectedness with the tumor characteristics.

We intended to perform computational quantitative analysis of our 2-DE data. Any computer software designed to align and compare 2-DE gels must somehow deal with distorted spot patterns that are pertinent to the methodology. So far, the spot detection and matching must be supervised by a researcher [10] which was another bottleneck of our experimental design. We have analyzed our gels by the PDQuest Advanced 8.0.1 2D Gel Analysis Software since it is available in our laboratory. The software is not designed to compare multiple groups of samples that have arisen from our experimental setting (23 samples in triplicate). We compromised between the quantity of data and our ability to process them. We have finally constructed a match set composed of 44 cropped gels yielding well distinguished spot patterns. The spot patterns were carefully studied and matching was adjusted in each gel. Finally, 245 spots were matched and their normalized quantities in each gel were subjected to statistical analyses and data mining. The quantification of proteins in 2-DE gels is relative and it is a matter of dynamic range versus sensitivity [8, 24]. The gels were silver-stained because we intended to achieve the utmost sensitivity to be able to detect possible changes in the expression of less abundant proteins in our samples. We are aware that this type of staining might be a source of inaccuracy. The correlation coefficients between technical replicates of gels ranged from 0.76 to 0.88 which is normal for the 2-DE analyses [9]. However, the overall variability within the data cut down the attainable statistical significance of our results. In spite of all the disputable issues we indicated spots correlated with metastases in the set of patients. We used two different statistical approaches to search for significant correlations between clinical data and spot intensities. The GUHA [20] enriched with cut-off points and q-values computes with categorical variables sorted according to the cut points. The R/maanova [19] computes with integers that correspond to the spot densities. The outputs of the methods slightly differ but in general the same spots were found with both the methods as significantly correlated with the clinical data.

Spots 7305, 5104 and 1606 fulfilled the statistical criteria in either analysis. Spot 7305 was identified as 2,3-trans-enoyl-CoA isomerase. The enzyme is involved in mitochondrial β-oxidation of unsaturated fatty acids [33]. The defects in distribution of polyunsaturated fatty acids in healthy and cancerous breast tissues have been documented [34]. Decreased levels of this enzyme might have impact on the aberrant behavior of cancer cells.

Spot 5104 was identified as glutathione peroxidase 1, a selenium dependent enzyme that detoxifies hydrogen and lipid peroxides. The protective function of selenium against cancer mortality has been documented. It remains unclear how selenium decreases cancer risk and whether glutathione peroxidase is involved in the action [35]. The lowered levels of the enzyme in our group of patients with metastases further support possible involvement of the glutathione peroxidase 1 in the anticancer defense.

Spot 1606 was increased in the group of patients with metastases and it appeared to be abundant. The spot had a diffused pattern in the 2-DE gel and the MS spectra were complicated. It was identified as nucleophosmin, a highly phosphorylated protein associated with nucleolar ribonucleoprotein structures [36]. The protein is known to be extensively post-translationally modified. It might be a reason for its 2-DE pattern but we cannot exclude that there might be other proteins contained in the spot. Nucleophosmin is overexpressed in many types of human solid tumors. It is a multifunctional protein and its physiological function in tumorigenesis is controversial [37].


We have performed an extensive proteomic study of mammary epithelial cells from the breast cancer patients. We found three proteins that were significantly altered in the group of patients who developed distant metastases within the three-year post-operative follow-up. After development of preferably an immunocytochemical detection methodology, our results need to be proven on a higher amount of samples (directly on tumor biopsies) to establish their value for prediction of tumor behavior, spreading or as potential treatment targets. It should be kept in mind that we were able to process only a small part of the 2-DE maps. Potentially, more proteins may be found if we analyze other portions of the proteomes of our samples. The work is in progress.



two-dimensional gel electrophoresis


estrogen receptor


electrospray ionization


General Unary Hypothesis Automaton

HER 2/neu:

human epidermal growth factor receptor 2


liquid chromatography-tandem mass spectrometry


quadrupole-time of flight


progesterone receptor


  1. 1.

    Li Y, Welm B, Podsypanina K, Huang SX, Chamorro M, Zhang XM, Rowlands T, Egeblad M, Cowin P, Werb Z, Tan LK, Rosen JM, Varmus HE: Evidence that transgenes encoding components of the Wnt signaling pathway preferentially induce mammary cancers from progenitor cells. Proceedings of the National Academy of Sciences of the United States of America. 2003, 100: 15853-15858. 10.1073/pnas.2136825100.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Charafe-Jauffret E, Ginestier C, Monville F, Fekairi S, Jacquemier J, Birnbaum D, Bertucci F: How to best classify breast cancer: Conventional and novel classifications (Review). International Journal of Oncology. 2005, 27: 1307-1313.

    CAS  PubMed  Google Scholar 

  3. 3.

    Sorlie T: Molecular portraits of breast cancer: tumour subtypes as distinct disease entities. European Journal of Cancer. 2004, 40: 2667-2675. 10.1016/j.ejca.2004.08.021.

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Kapoor A, Vogel VG: Prognostic factors for breast cancer and their use in the clinical setting. Expert Review of Anticancer Therapy. 2005, 5: 269-281. 10.1586/14737140.5.2.269.

    Article  PubMed  Google Scholar 

  5. 5.

    Hondermarck H, Vercoutter-Edouart AS, Revillion F, Lemoine J, El-Yazidi-Belkoura I, Nurcombe V, Peyrat JP: Proteomics of breast cancer for marker discovery and signal pathway profiling. Proteomics. 2001, 1: 1216-1232. 10.1002/1615-9861(200110)1:10<1216::AID-PROT1216>3.0.CO;2-P.

    CAS  Article  PubMed  Google Scholar 

  6. 6.

    Bini L, Magi B, Marzocchi B, Arcuri F, Tripodi S, Cintorino M, Sanchez JC, Frutiger S, Hughes G, Pallini V, Hochstrasser DF, Tosi P: Protein expression profiles in human breast ductal carcinoma and histologically normal tissue. Electrophoresis. 1997, 18: 2832-2841. 10.1002/elps.1150181519.

    CAS  Article  PubMed  Google Scholar 

  7. 7.

    Selicharova I, Sanda M, Mladkova J, Ohri SS, Vashishta A, Fusek M, Jiracek J, Vetvicka V: 2-DE Analysis of Breast Cancer Cell Lines 1833 and 4175 with Distinct Metastatic Organ-specific Potentials and Comparison with Parental Cell Line MDA-MB-231. Oncology Reports. 2008, 19: 1237-1244.

    CAS  PubMed  Google Scholar 

  8. 8.

    Mischak H, Apweiler R, Banks RE, Conaway M, Coon J, Dominiczak A, Ehrich JHH, Fliser D, Girolami M, Hermjakob H, Hochstrasser D, Jankowski J, Julian BA, Kolch W, Massy ZA, Neusuess C, Novak J, Peter K, Rossing K, Schanstra J, Semmes OJ, Theodorescu D, Thongboonkerd V, Weissinger EM, Van Eyk JE, Yamamoto T: Clinical proteomics: A need to define the field and to begin to set adequate standards. Proteomics Clinical Applications. 2007, 1: 148-156. 10.1002/prca.200600771.

    CAS  Article  PubMed  Google Scholar 

  9. 9.

    Challapalli KK, Zabel C, Schuchhardt J, Kaindl AM, Klose J, Herzel H: High reproducibility of large-gel two-dimensional electrophoresis. Electrophoresis. 2004, 25: 3040-3047. 10.1002/elps.200405979.

    CAS  Article  PubMed  Google Scholar 

  10. 10.

    Biron DG, Brun C, Lefevre T, Lebarbenchon C, Loxdale HD, Chevenet F, Brizard JP, Thomas F: The pitfalls of proteomics experiments without the correct use of bioinformatics tools. Proteomics. 2006, 6: 5577-5596. 10.1002/pmic.200600223.

    CAS  Article  PubMed  Google Scholar 

  11. 11.

    Hondermarck H: Breast cancer: when proteomics challenges biological complexity. Molecular & Cellular Proteomics. 2003, 2: 281-291.

    CAS  Google Scholar 

  12. 12.

    Matouskova E, Dudorkinova D, Krasna L, Vesely P: Temporal in vitro expansion of the luminal lineage of human mammary epithelial cells achieved with the 3T3 feeder layer technique. Breast Cancer Research and Treatment. 2000, 60: 241-249. 10.1023/A:1006409605067.

    CAS  Article  PubMed  Google Scholar 

  13. 13.

    Villadsen R: In search of a stem cell hierarchy in the human breast and its relevance to breast cancer evolution. Apmis. 2005, 113: 903-921. 10.1111/j.1600-0463.2005.apm_344.x.

    Article  PubMed  Google Scholar 

  14. 14.

    Polyak K: Breast cancer: origins and evolution. Journal of Clinical Investigation. 2007, 117: 3155-3163. 10.1172/JCI33295.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  15. 15.

    Selicharova I, Smutna K, Sanda M, Ubik K, Matouskova E, Bursikova E, Brozova M, Vydra J, Jiracek J: 2-DE analysis of a new human cell line EM-G3 derived from breast cancer progenitor cells and comparison with normal mammary epithelial cells. Proteomics. 2007, 7: 1549-1559. 10.1002/pmic.200600907.

    CAS  Article  PubMed  Google Scholar 

  16. 16.

    Brozova M, Kleibl Z, Netikova I, Sevcik J, Scholzova E, Brezinova J, Chaloupkova A, Vesely P, Dundr P, Zadinova M, Krasna L, Matouskova E: Establishment, growth and in vivo differentiation of a new clonal human cell line, EM-G3, derived from breast cancer progenitors. Breast Cancer Res Tr. 2006, 103: 247-257. 10.1007/s10549-006-9358-x.

    Article  Google Scholar 

  17. 17.

    Krasna L, Dudorkinova D, Vedralova J, Vesely P, Pokorna E, Kudlackova I, Chaloupkova A, Petruzelka L, Danes J, Matouskova E: Large expansion of morphologically heterogeneous mammary epithelial cells, including the luminal phenotype, from human breast tumours. Breast Cancer Research and Treatment. 2002, 71: 219-235. 10.1023/A:1014457731494.

    CAS  Article  PubMed  Google Scholar 

  18. 18.

    The R Project for Statistical computing . 2007, []

  19. 19.

    maanova . 2007, []

  20. 20.

    Coufal D, Vydra J, Selicharova I: GUHA analysis of proteomic oncological data. Neural Network World. 2007, 17: 447-457. []

    Google Scholar 

  21. 21.

    Storey JD, Taylor JE, Siegmund D: Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. Journal of the Royal Statistical Society Series B-Statistical Methodology. 2004, 66: 187-205. 10.1111/j.1467-9868.2004.00439.x. []

    Article  Google Scholar 

  22. 22.

    Breiman L, Friedman J, Olshen R, Stone C: Classification and regression trees. 1984, Boca Raton, FL: Chapman & Hall/CRC

    Google Scholar 

  23. 23.

    The MathWorks . 2007, []

  24. 24.

    Candiano G, Bruschi M, Musante L, Santucci L, Ghiggeri GM, Carnemolla B, Orecchia P, Zardi L, Righetti PG: Blue silver: A very sensitive colloidal Coomassie G-250 staining for proteome analysis. Electrophoresis. 2004, 25: 1327-1333. 10.1002/elps.200305844.

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Garcia M, Platet N, Liaudet E, Laurent V, Derocq D, Brouillet JP, Rochefort H: Biological and clinical significance of cathepsin D in breast cancer metastasis. Stem Cells. 1996, 14: 642-650.

    CAS  Article  PubMed  Google Scholar 

  26. 26.

    Mai J, Waisman DM, Sloane BF: Cell surface complex of cathepsin B/annexin II tetramer in malignant progression. Biochim Biophys Acta. 2000, 1477: 215-230.

    CAS  Article  PubMed  Google Scholar 

  27. 27.

    Torre GC: SCC antigen in malignant and nonmalignant squamous lesions. Tumor Biology. 1998, 19: 517-526. 10.1159/000030045.

    CAS  Article  PubMed  Google Scholar 

  28. 28.

    Guo J, Shou C, Meng L, Jiang B, Dong B, Yao L, Xie Y, Zhang J, Chen Y, Budman DR, Shi YE: Neuronal protein synuclein gamma predicts poor clinical outcome in breast cancer. International Journal of Cancer. 2007, 121: 1296-1305. 10.1002/ijc.22763.

    CAS  Article  PubMed  Google Scholar 

  29. 29.

    Gusterson BA, Ross DT, Heath VJ, Stein T: Basal cytokeratins and their relationship to the cellular origin and functional classification of breast cancer. Breast Cancer Research. 2005, 7: 143-148. 10.1186/bcr1041.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Marahatta SB, Punyarit P, Bhudisawasdi V, Paupairoj A, Wongkham S, Petmitr S: Polymorphism of glutathione S-transferase omega gene and risk of cancer. Cancer Letters. 2006, 236: 276-281. 10.1016/j.canlet.2005.05.020.

    CAS  Article  PubMed  Google Scholar 

  31. 31.

    Thornalley PJ: Glyoxalase I - structure, function and a critical role in the enzymatic defence against glycation. Biochemical Society Transactions. 2003, 31: 1343-1348.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Sarto C, Binz PA, Mocarelli P: Heat shock proteins in human cancer. Electrophoresis. 2000, 21: 1218-1226. 10.1002/(SICI)1522-2683(20000401)21:6<1218::AID-ELPS1218>3.0.CO;2-H.

    CAS  Article  PubMed  Google Scholar 

  33. 33.

    Janssen U, Fink T, Lichter P, Stoffel W: Human Mitochondrial 3,2-Trans-Enoyl-Coa Isomerase (Dci) - Gene Structure and Localization to Chromosome 16P13.3. Genomics. 1994, 23: 223-228. 10.1006/geno.1994.1480.

    CAS  Article  PubMed  Google Scholar 

  34. 34.

    He Q, Shkarin P, Hooley RJ, Lannin DR, Weinreb JC, Bossuyt VI: In vivo MR spectroscopic imaging of polyunsaturated fatty acids (PUFA) in healthy and cancerous breast tissues by selective multiple-quantum coherence transfer (Sel-MQC): A preliminary study. Magn Reson Med. 2007, 58: 1079-1085. 10.1002/mrm.21335.

    CAS  Article  PubMed  Google Scholar 

  35. 35.

    Lei XG, Cheng WH, McClung JP: Metabolic regulation and function of glutathione peroxidase-1. Annu Rev Nutr. 2007, 27: 41-61. 10.1146/annurev.nutr.27.061406.093716.

    CAS  Article  PubMed  Google Scholar 

  36. 36.

    Lim MJ, Wang XW: Nucleophosmin and human cancer. Cancer Detection and Prevention. 2006, 30: 481-490. 10.1016/j.cdp.2006.10.008.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  37. 37.

    Zhang Y: The ARF-B23 connection: implications for growth control and cancer treatment. Cell Cycle. 2004, 3: 259-262.

    CAS  PubMed  Google Scholar 

Pre-publication history

  1. The pre-publication history for this paper can be accessed here:

Download references


This work was supported by a grant NR/8323-3 from the Internal Grant Agency of the Ministry of Health of the Czech Republic (IGA MZCR) and Research Projects of the Academy of Sciences of the Czech Republic (Z40550506, AV0Z50520514, AV0Z10300504) and by Research Project of the Ministry of Education, Youth and Sports of the Czech Republic (MSM0021620808).

Author information



Corresponding author

Correspondence to Irena Selicharová.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

JV designed and coordinated the study, evaluated and interpreted the clinical data, contributed to statistical computations and drafted the manuscript. IS designed and performed the analysis of 2-DE gels, prepared the samples for identification of proteins, analyzed, evaluated and interpreted the proteomical data and drafted the manuscript. KS optimized the methodology for preparation of 2-DE gels and prepared the gels. MS performed the LC-MS/MS experiments and protein identifications. EM contributed to the study concept and design, was responsible for cell cultures establishment and handling and contributed to writing of the manuscript. EB and MP established and handled the cell cultures. ZV performed the imunohistochemistry analysis. DC was responsible for the GUHA and statistical analyses and contributed to writing of the manuscript. JJ contributed to the study concept and design and critically revised the manuscript for important intellectual content. All authors read and approved the final manuscript.

Jan Vydra, Irena Selicharová, Kateřina Smutná, Miloslav Šanda, Eva Matoušková, Eva Buršíková, Markéta Prchalová, Zuzana Velenská, David Coufal and Jiří Jiráček contributed equally to this work.

Electronic supplementary material

PDQuest quantity table

Additional file 1: . The table is in the excel spreadsheet format (PDQuest.xls). Spots are numbered in rows. Means of logarithmic ratios method normalized spot densities in each gel are in columns. The paired gels from each sample (named according to the Table 1) are marked (a) respective (b), for example L40-sp-a, L40-sp-b. (XLS 86 KB)

Additional file 2: Results of the GUHA analysis. The table is in the excel spreadsheet format (guha.xls). Hypotheses are numbered in rows. Antecedent column displays categorized spot intensities. Succedent column displays presence (= 1) or absence (= 0) of metastases at follow-up. Table column displays the contingency table evaluating cedents. Fisher quantifier column corresponds to Fisher's exact test. (XLS 176 KB)

Additional file 3: Results of the R/maanova analysis. The table is in the excel spreadsheet format (maanova.xls). For each spot we show in adequate columns the number given by PDQuest software, log2 difference of log transformed means between metastases and non-metastases groups (fold change), nominal permutation p-values using F1 test statistics (F1 Pvalperm) and false discovery rate adjusted p-values for multiple testing (F1 adjPvalperm). Nominal permutation p-values using Fs test statistics (Fs Pvalperm) and false discovery rate-adjusted p-values for multiple testing (Fs adjPvalperm). (XLS 44 KB)

Authors’ original submitted files for images

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Vydra, J., Selicharová, I., Smutná, K. et al. Two-dimensional electrophoretic comparison of metastatic and non-metastatic human breast tumors using in vitrocultured epithelial cells derived from the cancer tissues. BMC Cancer 8, 107 (2008).

Download citation


  • Breast Cancer
  • Protein Spot
  • Additional Data File
  • Spot Intensity
  • Spot Pattern