Skip to main content
  • Research article
  • Open access
  • Published:

New models and online calculator for predicting non-sentinel lymph node status in sentinel lymph node positive breast cancer patients



Current practice is to perform a completion axillary lymph node dissection (ALND) for breast cancer patients with tumor-involved sentinel lymph nodes (SLNs), although fewer than half will have non-sentinel node (NSLN) metastasis. Our goal was to develop new models to quantify the risk of NSLN metastasis in SLN-positive patients and to compare predictive capabilities to another widely used model.


We constructed three models to predict NSLN status: recursive partitioning with receiver operating characteristic curves (RP-ROC), boosted Classification and Regression Trees (CART), and multivariate logistic regression (MLR) informed by CART. Data were compiled from a multicenter Northern California and Oregon database of 784 patients who prospectively underwent SLN biopsy and completion ALND. We compared the predictive abilities of our best model and the Memorial Sloan-Kettering Breast Cancer Nomogram (Nomogram) in our dataset and an independent dataset from Northwestern University.


285 patients had positive SLNs, of which 213 had known angiolymphatic invasion status and 171 had complete pathologic data including hormone receptor status. 264 (93%) patients had limited SLN disease (micrometastasis, 70%, or isolated tumor cells, 23%). 101 (35%) of all SLN-positive patients had tumor-involved NSLNs. Three variables (tumor size, angiolymphatic invasion, and SLN metastasis size) predicted risk in all our models. RP-ROC and boosted CART stratified patients into four risk levels. MLR informed by CART was most accurate. Using two composite predictors calculated from three variables, MLR informed by CART was more accurate than the Nomogram computed using eight predictors. In our dataset, area under ROC curve (AUC) was 0.83/0.85 for MLR (n = 213/n = 171) and 0.77 for Nomogram (n = 171). When applied to an independent dataset (n = 77), AUC was 0.74 for our model and 0.62 for Nomogram. The composite predictors in our model were the product of angiolymphatic invasion and size of SLN metastasis, and the product of tumor size and square of SLN metastasis size.


We present a new model developed from a community-based SLN database that uses only three rather than eight variables to achieve higher accuracy than the Nomogram for predicting NSLN status in two different datasets.

Peer Review reports


Current practice guidelines recommend a completion axillary lymph node dissection for breast cancer patients whose SLN contains metastatic tumor [13]. The risk of morbidity that accompanies completion ALND seems justified for patients with NSLN metastases, because they would undergo excision of residual cancer [4]. However, 50 to 65% of patients with tumor-involved SLNs do not have additional nodal metastasis [5, 6]. For them, ALND offers no clear therapeutic benefit, provides no further information for staging, and increases the cost of medical care. Further, completion ALND is associated with substantial morbidity affecting up to 39% of patients, with a nearly three-fold increased risk of lymphedema or regional sensory loss [79]. Identifying SLN-positive patients without NSLN metastases who could forgo completion ALND would improve the quality of life and reduce costs for the majority of women with new diagnoses of breast cancer.

Previous investigations have not identified predictors of NSLN status with accuracy sufficient to change clinical practice. This failure may be due to limited sample sizes or single institution studies [5, 6, 10]. The majority of prior investigations include sample sizes of less than two hundred subjects, with the challenges of dealing with small sample sizes leading to decreased predictive accuracy when applied to the general population [5, 6, 1012]. However in 2003 Van Zee et al. proposed a nomogram to predict risk of NSLN metastasis based on an accrued population of 1075 cases of primary invasive breast cancer [13]. The Memorial Sloan-Kettering Cancer Center (MSKCC) Breast Cancer Nomogram (Nomogram) has since been successfully applied internationally and become the most commonly used predictive model for NSLN involvement [14]. Use of a predictive nomogram has been shown to be superior to expert opinion, to improve clinical decision making, and to be partially responsible for the decreasing frequency of ALNDs performed [15, 16]. However, use of the Nomogram is limited by its complexity, and inability to be applied if not all patient characteristics are known [17]. Although the Nomogram was based on a large sample size, its reported predictive accuracy and its generalizability to patient populations with dissimilar tumor characteristics or to non-academic, non-quaternary care hospitals has been questioned [1719].

Our goal was to identify characteristics of patients and their tumors that predict NSLN status within the Bay Area SLN Database, comprised of diverse patient populations from one academic and 15 community-based medical centers in Northern California and Oregon. We constructed three new models and contrasted their performance with the Nomogram. We provide a model that has simpler input than the Nomogram and shows higher accuracy for our diverse patient population and for another population of SLN-positive patients with different patient characteristics from Northwestern University. We have created an internet-based calculator, the Stanford Online Calculator, for validation testing and clinical application.


Study patients

The Bay Area SLN Study for Detection of Axillary Metastasis in Breast Cancer is a multi-institutional collaboration involving 16 institutions in the Greater Bay Area of Northern California and Oregon, of which 15 are community hospitals. A total of 1,040 patients underwent SLN biopsy for biopsy-proven breast cancer between 1996 and 2002. After excluding 256 patients (criteria shown in Additional file 1), we analyzed 784 prospectively accrued subjects with primary invasive breast carcinoma and clinically negative axilla who underwent SLN biopsy with completion axillary lymph node dissection. 285 (36.4%) had tumor-involved SLNs. Among the 285 SLN-positive patients, 213 had pathologic information regarding presence or absence of angiolymphatic invasion (lymphovascular invasion, LVI); 171 patients had complete pathologic information on both angiolymphatic invasion and hormone receptor status. The Northwestern test dataset was compiled by chart review of all patients who underwent a SLN biopsy at Northwestern Memorial Hospital in Chicago, IL, between 2002 and 2006. It is comprised of 77 consecutively identified sentinel node positive patients with invasive breast cancer who underwent completion ALND and had complete pathologic information on tumor type, tumor size, tumor grade, hormone receptor status, HER2/neu status, angiolymphatic invasion status, number of nodes removed, and size of sentinel node metastases. Inclusion and exclusion criteria are similar to that outlined for the Stanford patients in Additional file 1. The Northwestern database was compiled by physicians not involved in generation of the predictive models. The Bay Area SLN study was performed under a protocol approved by the Stanford University Administrative Panel on Human Subjects in Medical Research and the Institutional Review Boards of each participating institution. An independent protocol was approved by the Institutional Review Board of Northwestern University for retrospective chart review and data collection to test the Stanford Online Calculator and MSKCC Nomogram.

SLN biopsy and pathological evaluation

SLN biopsy has been described previously [20]. The SLN was identified using peritumoral injection of 1% isosulfan blue dye, filtered 99mTc sulfur colloid radioactive tracer, or both, as decided by the operating surgeon. All lymph nodes that were blue and/or focally radioactive and/or suspicious by intraoperative palpation were denoted SLNs. All SLNs were evaluated by step-sectioning with hematoxylin and eosin (H&E) staining; in the Bay Area SLN study, SLNs without metastasis detectable by H&E underwent staining by immunohistochemistry (IHC) [21]. IHC was performed on at least four levels of the SLN using anti-keratin antibodies AE1 and CAM5.2. One pathologist directed and interpreted IHC studies on every SLN excised at 14 of the 16 participating institutions in the Bay Area SLN Study. NSLNs were evaluated by H&E only, without serial sectioning. In the Northwestern series, negative SLNs did not undergo IHC testing and individual tumor cells or clusters were identified on H&E only.

Statistical analyses

Thirteen characteristics were studied individually for predicting NSLN status: patient age, tumor histology, tumor size (as a continuous variable and as T size by 6th edition AJCC criteria), tumor grade [22], estrogen receptor (ER) status, progesterone receptor (PR) status, HER2/neu status, presence of angiolymphatic invasion, number of SLNs excised, number of positive SLNs, size of nodal metastasis (recorded according to revised 6th edition AJCC criteria) [23], and method of detecting nodal metastasis (H&E or IHC). Univariate testing was done with χ2 statistics and Wilcoxon rank sums. For multivariate analyses, tree-based classification and logistic regression were performed [24, 25]. Recognizing that some characteristics can be interdependent, we performed multivariate analyses with two approaches whereby interactions among variables are emphasized: recursive partitioning via receiver operating characteristic (RP-ROC) [26] curves and (boosted) classification and regression trees (CART®) [24, 27, 28].

RP-ROC uses the relationship of sensitivity and specificity to calculate the "best value" of each variable for predicting NSLN status. It then chooses the variable with best value. Successive partitioning permits use of ROC curves to compare predictive accuracy and best cut point on "best selected variable." Partitioning of the population into subgroups continues until only patients with or without NSLN metastases are segregated to the group, or until the putative p value of the split exceeds 0.01. RP-ROC was performed as is described in detail by Kraemer [26] (software available from Sierra-Pacific MIRECC [29]).

CART as we applied it uses both cross-validation and voting methods (boosting) to assess the stability and improve the accuracy of the final model [24, 27], (software available from Salford Systems, v5 [30]). Splits are chosen by what is termed the Gini criterion, whose goal is to render nodes of the tree as "pure" as possible in terms of positive or negative NSLN status. Boosting is a method designed to focus on "hard to classify" observations. In all classifications, there is dependence on the products by class of priors and costs of misclassification. For all classification trees, mixed priors (an average of equal priors and prevalence-based priors) were used. After surveying eighteen breast surgeons expert in SLN biopsy and not associated with this study, the costs of a false-positive and false-negative NSLN were set at 3 and 10, respectively.

A third technique, multivariate logistic regression (MLR) informed by CART, was performed with variable selection based on paths from the root to the five terminal nodes of unboosted CART [31]. Odds ratios were calculated individually for all terms that were candidates for inclusion in subsequent analyses. Those terms retained were entered into the MLR by forward selection based on the likelihood ratio. Wald statistics and odds ratios were determined for variables significant at putative p < 0.01 within the regression model [32]. A cutoff p < 0.01 was chosen in the interest of our ending with a focused, concise, predictive model.

In constructing the predictive models of NSLN status, we used tumor characteristics that were significant by univariate testing (Table 1): tumor size, tumor grade, ER status, PR status, angiolymphatic invasion, size of SLN metastasis, and SLN metastasis identification method. Statistical modeling of NSLN status allowed calculation of both the predictive capacity of significant variables and the critical interactions between and among variables, such as increasing angiolymphatic invasion with increasing tumor size. All models used identical variables, although not identical patients. RP-ROC requires complete data, where no values of features are missing, whereas CART does not. Instead, CART relies on the subtle notion of "surrogate split" [24]. Thus, boosted CART analyses were performed on all 285 SLN-positive patients as well as subsets with more complete information, while RP-ROC and MLR analyses were performed on the 213 patients with complete data for angiolymphatic invasion and on the 171 patients with complete data for angiolymphatic invasion and hormone receptor status.

Table 1 Characteristics of NSLN- and NSLN+ cases among SLN+ patients (Bay Area SLN Database).

The MSKCC Breast Cancer Nomogram for Prediction of ALN Status [13] (Nomogram) was applied to our patient population and, to provide fair comparison, calculated for only the 171 patients with complete information on the eight variables required for its application (pathologic size of primary tumor, tumor type with nuclear grade if ductal, LVI, multifocality of primary tumor, ER status, method of detecting SLN metastasis, number of positive SLNs, and number of negative SLNs; a ninth variable, whether a frozen section was performed, was not applicable to our patients). ROC curves were constructed for the Nomogram and the other methods to compare the area under the curve (AUC). Internal validation was performed by 10-fold cross-validation, as previously described [27]. Data were divided at random into 10 parts, as equal as possible in size. CART (in this instance, but more generally any other procedure) was then computed successively for 9/10 of the data with the remaining piece held out as "test sample." This was repeated 10 times and results on the 10 test samples were averaged. Cross-validation is an internal validation method that estimates performance on subsequent subjects by eliminating bias that owes to using the same, or even a portion of the same, data for both modeling and testing. However, even with internal validation, bias and variability can be introduced into subsequent analyses if the prevalence of features that predict outcome is different in future datasets than in the dataset from which the model was developed. The differences in distribution of variables (and in synergistic interactions between variables) for an original and a subsequent test dataset impacts a model's performance on future datasets and applies both to our models and to that of the Nomogram. For this reason, we tested our model and the Nomogram on the Northwestern dataset that differed from our original dataset in its distribution of patient, tumor, and sentinel node variables.

ROC curves were constructed for the Nomogram and the MLR informed by CART model for the Bay Area SLN study dataset and the independent Northwestern dataset.

Statistical analyses were performed with R [33].


Table 1 and Additional files 2 and 3 describe in detail the SLN-negative and SLN-positive patients of the Bay Area SLN dataset. As expected, the incidence of SLN metastasis increased with increasing tumor size: 29% of T1, 51% of T2, and 80% of T3 tumors had SLN metastasis. As tumor size increased over 1 cm, the incidence of angiolymphatic invasion doubled for both SLN-positive and SLN-negative patients but was higher for SLN-positive patients (Additional file 3). Among all 784 patients, the total number of women with any axillary lymph node metastasis was 316 (40%), including 31 (9.8%) with a false negative SLN (Additional file 2).

Among SLN-positive cases, the average number of SLNs removed was 1.91, with metastatic disease limited to a single SLN in 73%. Among tumor-involved SLNs, 23% contained isolated tumor cells or clusters (ITCs, ≤ 0.2 mm); 70% contained micrometastases (>0.2 mm to 2 mm); and 7% contained macrometastases (>2 mm). All SLNs containing ITCs required IHC for detection. Only one of 200 cases with SLNs involved by micrometastasis was not observed on H&E and required IHC staining for identification. All 21 cases with SLN macrometastasis were identified by H&E staining (Additional file 2).

Of 285 patients with tumor-involved SLNs, 101 (35.4%) were found to have NSLN metastases, with tumor metastases to two or more NSLNs in the majority of cases (median number of positive NSLNs 2; mean 3.5; range 1–19) (Additional file 2). By univariate analyses, 8 variables were highly predictive of NSLN status: tumor size (in cm), tumor size by AJCC T classification, tumor grade, ER status, PR status, angiolymphatic invasion, size of SLN metastasis, and whether the nodal metastasis was identified by H&E or IHC (Table 1). Of patients whose SLN was identified by H&E, 45% had NSLN metastases, whereas only 4.6% of patients whose SLN was identified by IHC had NSLN metastases (p < 0.001). Size of SLN metastasis and staining method for metastasis identification are highly correlated (p = 0.02, by χ2 testing) and therefore are not independent predictors of NSLN status. Thus, staining method for identifying tumor-involvement was not included in the multivariate analysis shown in Table 1. By multivariate analysis, tumor size, angiolymphatic invasion, and size of SLN metastasis remained significantly predictive of NSLN status (p < 0.001 by unconditional testing). Of the 285 patients with SLN metastases, NSLN metastases were found in 25% of patients with T1 tumors; in 46% with T2 tumors; and in 60% with T3 tumors (Figure 1A). When angiolymphatic invasion was present, there was a 3.9-fold increase in NSLN metastases (74% vs. 19%, Figure 1B). Among patients with isolated tumor cells or clusters within the SLN, 4.7% had NSLN metastasis; whereas 42% of patients with micrometastasis and 71% with macrometastasis had NSLN involvement (Figure 1C and Table 1).

Figure 1
figure 1

Fraction of patients in Bay Area SLN Database with and without NSLN metastases in relation to (A) tumor stage, (B) angiolymphatic invasion, and (C) size of SLN metastasis.

The models generated by RP-ROC (Figure 2A) and CART (Figure 2B, Additional files 4 and 5) ultimately included tumor size, angiolymphatic invasion, and size of SLN metastasis. At the final split, likelihood of NSLN metastases partitioned into groups by level of risk. The significant predictors as selected by multivariate tree-based modeling were tested individually, as well as all iterations of predictors, in a MLR model. Variables entered were tumor size, angiolymphatic invasion, and size of SLN metastasis (Table 2). Size of SLN metastases interacts with the status of angiolymphatic invasion; that is, the impact of the size of SLN metastases upon the presence or absence of NSLN metastases depends on whether there was angiolymphatic invasion. The tree suggests that one might enter angiolymphatic invasion (scored as 1 if present, 0 if absent) not only multiplied by SLN metastasis size to the first power, but also as the product of angiolymphatic invasion and the square of SLN metastasis size (scored as an ordinal variable with values of 1, 2, and 3 corresponding to the size classification of isolated tumor cells, micrometastasis, or macrometastasis). The MLR model identified two highly predictive composite variables: the product of angiolymphatic invasion and size of SLN metastasis (p < 0.0001, odds ratio of 4.73 with approximate 95% confidence interval 3.11–7.20) as well as the product of tumor size and squared size of SLN metastasis (p < 0.0001, odds ratio of 1.18 with 95% confidence interval 1.10–1.26). We emphasize that p-values are only approximate because CART was used as preprocessor to manufacturing the predictive variables. However, these p-values are so small, and the clinical logic so compelling, that we do not doubt their practical, let alone statistical, significance.

Figure 2
figure 2

Tree diagrams for RP-ROC and CART. As CART is able to impute missing data, it was calculated for all SLN positive patients, n = 285. RP-ROC requires complete data and was calculated for patients with known angiolymphatic invasion status, n = 213 (Bay Area SLN Database).

Table 2 Multivariate Logistic Regression (MLR) analysis informed by CART for predicting NSLN metastasis among SLN+ patients (n = 213) (Bay Area SLN Database).

Table 3 compares the sensitivities, specificities, and predictive accuracies of our three models, RP-ROC, boosted CART, and MLR, all computed with 10-fold cross validation [26]. As different models require different information, we evaluated models for the entire group (n = 285, only possible for CART) and subsets that contained complete information on angiolymphatic invasion (n = 213), and alternatively, on angiolymphatic invasion and ER status (n = 171). Cross-validated sensitivities/specificities of the three technologies for the group with known angiolymphatic invasion status (n = 213) were 79%/76% for RP-ROC, 88%/71% for boosted CART, and 78%/86% for MLR. Cross-validated specificity of boosted CART when inferred for the entire dataset (n = 285) was lower than when calculated using known values for angiolymphatic invasion (n = 213), suggesting that angiolymphatic invasion is informative in our dataset. This is supported by the continued selection of angiolymphatic invasion in CART modeling when patients have known angiolymphatic invasion status (n = 213) and known angiolymphatic status and ER status (n = 171) (Additional files 4 and 5, respectively). Overall diagnostic accuracy, based on areas under the ROC curve [34] (AUC), for predicting NSLN metastasis among patients in our database was greatest by MLR (83% and 85%) for the subsets of patients for whom the computation was possible (n = 213 and n = 171, respectively). Further, we applied the Nomogram to our SLN-positive patients who had complete data available for entry of its eight variables (n = 171, all patients with known angiolymphatic invasion status and ER status). Figure 3 shows a graph of the ROC curve that devolves from our MLR using our two composite variables (n = 213) and the ROC curve that devolves from the Nomogram (n = 171). Because much preprocessing has gone into our computations, p-values we might report (regarding a null hypothesis that the "true" areas under the curves are equal) would be suspect. However, the diagnostic accuracy or area under the curve (AUC) for our MLR is 83% (95% confidence interval 0.81–0.86), and the AUC for the Nomogram is 77% (95% confidence interval 0.73–0.81). When we use the same patients as used in the Nomogram for the MLR calculation (n = 171), our model achieves cross-validated AUC of 85% (95% confidence interval 0.81–0.89). Given that only three variables were used to calculate our MLR, the difference is noteworthy.

Figure 3
figure 3

ROC curves for MLR informed by CART calculation in blue, AUC = 0.83, and Nomogram in green, AUC = 0.77, when applied to the Bay Area SLN Database. Note that MLR informed by CART calculation was done for larger group of patients (n = 213). When it was performed for the same patient group as the Nomogram (n = 171), AUC increased to 0.85.

Table 3 Model comparisons for predicting NSLN metastasis among SLN+ patients (Bay Area SLN Database).

Finally, the MLR and Nomogram were applied to a database of 77 patients who received ALND for positive SLNs at Northwestern University (Additional file 6). The SLN metastases in this dataset were identified by H&E stain without IHC. Among the 77 SLN positive patients, 61% had T1 tumors, 36% had T2 tumors, and 2.6% had T3 tumors. Angiolymphatic invasion was present in 68% of patients' tumors, and the SLN metastases in the Northwestern dataset were predominantly of large tumor burden with 56% having macrometastasis. NSLN metastases were present in 24 patients (31%). This is in contrast to the Bay Area SLN dataset with 55% T1 tumors, 38% T2 tumors, and 7% T3 tumors; 45% with angiolymphatic invasion (when angiolymphatic invasion status was known); 7% with macrometastasis; and 35% NSLN metastases (Table 1 and Figure 1). Although the Northwestern tumors were somewhat smaller, the higher percentage of angiolymphatic invasion and SLN macrometastases suggest more biologically aggressive disease in their dataset, yet they had a slightly lower percentage of NSLN metastasis.

Both the MLR model and the Nomogram performed less well when applied to the Northwestern dataset; however, the MLR model was supported with an AUC of 77% (95% confidence interval 0.67–0.80). This is superior to the performance of the Nomogram among this population, 62% (95% confidence interval 0.55–0.68) (Figure 4).

Figure 4
figure 4

ROC curves for MLR informed by CART calculation in blue, AUC = 0.74, and Nomogram in green, AUC = 0.62, when applied to the Northwestern test set (n = 77). 24 patients had NSLN metastasis in this dataset.


Sentinel lymph node biopsy is a major advance in the treatment of women with breast cancer [35]. If no SLN metastases are identified, the likelihood of additional NSLN involvement is 9.8% in our series. Though above the goal false-negative rate proposed by the American Society of Breast Surgeons, this is comparable to that reported in NSABP-32 and recently by both Lyman and Veronesi ranging 9.7%, 8.4%, and 8.8% respectively [1, 36, 37]. Of our patients with positive SLNs, the majority presented with micrometastasis, 70%, or isolated tumor cells, 22%. Thus, our population contains a predominance of limited SLN disease burden relative to prior reports, including van Rijk's reported rate of 23% for micrometastasis and 16% for isolated tumor cells [38]. This may be important as suggested by Alran et al. who showed lower performance of the Nomogram in patients with only micrometastases [19]. Despite the seemingly low sentinel node tumor burden, 35% had NSLN metastases upon completion ALND. Unfortunately, no combination of clinical and/or pathologic characteristics enabled identification of all SLN-positive patients at risk for NSLN metastases. Although SLN-positive patients will receive systemic chemotherapy and/or hormone therapy, it is unknown whether occult NSLN metastases are eradicated by adjuvant treatment. Until results of large prospective clinical trials can demonstrate no long-term increase in mortality from omitting ALND in the setting of systemic therapy, prophylactic ALND for patients with tumor-involved SLNs, including those with and without NSLN involvement, remains standard surgical care [3943]. However, in practice, it is the patient and her physician who decide whether or not a completion axillary dissection is performed. This decision may be informed using online calculators such as the Nomogram and the one presented here.

Based on a multi-institutional sample set larger than most prior studies, we found that univariate predictors of NSLN status include tumor size (in cm and by AJCC T size classification), tumor grade, hormone receptor status (ER and PR), angiolymphatic invasion, size of SLN metastasis, and whether nodal tumor involvement is identified by H&E. By multivariate analyses, tumor size, angiolymphatic invasion, size of SLN metastases, and products of these variables predict NSLN tumor involvement. Others have also discovered the predictive strength of each of the three simple characteristics [5, 6, 10, 4450], although here we confirm their collective power in a unique way. Additionally, we found that angiolymphatic invasion is as strong a predictor of NSLN metastasis as is size of SLN metastasis.

For women with isolated tumor cells in the SLN, we found a 4.7% chance of NSLN involvement, similar to Calhoun and Giuliano's reported NSLN-involvement rate of 4.9% for the same subset of patients, and comparable to or lower than that found previously, 10–15% [47, 51, 52]. The benefits of no further axillary dissection must be weighed against the risk of harboring axillary metastasis that may potentially seed occult metastatic disease. Clinical context, with consideration of a patient's expected life-span and associated health problems, may impact the definition of a "minimal acceptable risk." Recommendations for clinical practice are difficult because the risk of NSLN metastasis in SLN-positive patients with isolated tumor cells is comparable or lower than the risk of NSLN metastases in patients without SLN metastases (9.8% in our study) [53]. These issues are being studied in large-scale prospective clinical trials [40, 41]. Future molecular technologies may also provide guidance [54, 55].

Our goal was to identify patients with tumor-free NSLNs who, with near certainty, may be spared completion ALND. Using multivariate tree-based modeling by RP-ROC, boosted CART, and MLR informed by CART, we identified tumor size, angiolymphatic invasion, and size of SLN metastasis as characteristics that optimized stratification of NSLN status. These refined, statistical analyses demonstrated a highly synergistic interaction between size of SLN metastasis and angiolymphatic invasion on risk of NSLN metastasis. Our models (Figures 2, Additional files 4 and 5) stratified patients with tumor-involved SLNs into four risk groups for having NSLN metastasis: low risk (10% or less), moderate risk (30–45%), high risk (about 60%), and very high risk (greater than 90%).

MLR modeling of NSLN status that was informed by CART in its selection of predictors provided the most accurate cross-validated technique for predicting NSLN metastases for patients with known angiolymphatic invasive status, with accuracy superior to boosted CART, RP-ROC, and the Memorial Sloan-Kettering Breast Cancer Nomogram. When applied to the Bay Area SLN Database, the Nomogram had an AUC of 77%. This compares with the accuracy of the Nomogram for the original MSKCC population (76%) and for a prospective cohort at MSKCC (77%) [13]. Seven subsequent studies have tested the Nomogram and show an accuracy of 63% to 86%, though as low as 54% when applied only to patients with SLN micrometastases [14, 1719, 5660]. In contrast, our MLR informed by CART model performed equally well among patients with isolated tumor cells, micrometastases, or macrometastases. Relative importance of size of SLN metastasis in the Nomogram is determined by method of detection including IHC, serial H&E, routine analysis, versus frozen section (among the subset for which this is performed) [13]. The improved predictive accuracy of the MLR model informed by CART, particularly among patients with isolated tumor cells or micrometastasis, may be due to the relative weight ascribed to the specific size of SLN metastasis in our model. Application of our MLR model to other patient populations is required to validate its performance.

Considering the risk of potential bias due to low sentinel node tumor burden in our dataset, we applied both our model and the MSKCC Nomogram to an independent dataset of 77 SLN positive patients who underwent completion ALND. These cases were not identified by IHC and this dataset contained cases with a much larger tumor burden: 56% of cases contained macrometastasis in the SLN compared to 7% of the Bay Area SLN dataset. Again, the MLR model showed superior performance to the Nomogram. However, the performance of both models decreased compared to the Bay Area SLN Database: the Stanford Online Calculator generated an AUC of 0.74, or 74%, and the Nomogram generated an AUC of 0.62, or 62%. This raises concern regarding the generalizable nature of any model. An underlying reason why neither model performed as well as anticipated is that when a model is developed based on data from one group of patients, and the model is subsequently applied to data from a different group of patients, performance is generally diminished [24, 27]. This is due to differences in the distributions of predictive features and differences in the synergistic impact (interactions) between and among these features in different groups of patients. Thus a model developed in one group of patients would not be expected to perform as well for a different group of patients, even if the performance of the model was validated internally (cross-validated) on the original group. Table 1 and Additional file 6 shows differences in the distribution of the three variables in our model – tumor size, size of SLN metastasis, and angiolymphatic invasion – between patients in the Bay Area SLN Database and Northwestern series. As we would also expect the interactions of these variables to be different for both groups, we believe these factors in aggregate may be responsible for our findings.

The predictive accuracy of the Nomogram requires assessment of eight tumor characteristics [13, 14, 17, 18, 56]. A hazard of multi-variable modeling is that its overall accuracy is dependent upon the accuracy and precision with which each individual variable is determined. Our MLR confirmed the importance of two composite variables from only three tumor characteristics: 1) the size of SLN metastasis when angiolymphatic invasion is present, and 2) tumor size times the square of the size of SLN metastasis. The first composite variable reflects the synergism between angiolymphatic invasion and size of SLN metastasis; the second involves tumor burden. Using these two composite variables, AUC is 83% or 85% compared to the Nomogram's AUC of 77% that relies on eight variables. By using statistical methods which allow assessment of the variable-variable interactions we demonstrate superior accuracy with fewer required variables. Our model is the first proposed which emphasizes the synergistic interactions among patient characteristics. By reducing the required variables, we are hopeful the MLR model may be applied a larger population of patients, without excluding those with incomplete, unavailable, or pending pathologic data.

Missing pathologic data is problematic for breast cancer patients nationwide. Though the generalizability of our model may have benefited from the diverse population represented, obtaining complete clinicopathologic information was partially limited by enrollment across 16 institutions during the years of our study, 1996–2002. Approximately 25% of our 285 SLN-positive patients had no histologic analysis for angiolymphatic invasion. Of the 213 patients with angiolymphatic data present, another 19.7% had no ER status performed or recorded. This is comparable to the 17.1% of invasive breast cancer patients without recorded ER status in 13 registries of the national Surveillance, Epidemiology, and End Results (SEER) database from 1999–2003 [61] (unpublished data, Jeffrey lab); presence of angiolymphatic invasion status was not requested by SEER. Thus, we analyzed our data using three patient groups: the entire SLN-positive dataset of 285 patients; 213 SLN-positive patients who had complete information on angiolymphatic invasion; and 171 SLN-positive patients who had complete information on angiolymphatic invasion and ER status. Even applying the smallest dataset, more SLN-positive patients are analyzed than in most other published studies.

Though not directly compared among identical patient populations, the AUC of our model is also superior to that of M.D. Anderson Cancer Center scoring system (70%) and the Hôpital Tenon scoring system derived in Paris, France (68%), as recently reported by Dauphine et al. [59, 62, 63]. Calculations using our MLR model are easily done over the internet with the Stanford Online Calculator [64]. We encourage others to access and test our model and directly compare it with other models for evaluating risk of NSLN metastasis.

Although no modeling technique has been able to identify patients without any risk of NSLN metastasis, Park et al. recently argued that ALND may be reasonably eliminated among patients with approximately 9% or less predicted risk of NSLN involvement [16]. A low risk subset of 287 patients with SLN metastasis were followed in a non-randomized study with a 2% observed rate of local recurrence. This recommendation, however, is limited by a follow-up of only 23 months. We expect that data from two large prospective clinical studies, NSABP-32 and American College of Surgeons trial Z0011, will more definitively resolve questions regarding the optimal surgical management of SLN-positive patients [40, 41]. In the meantime, we hope that our calculator may provide further guidance for risk evaluation.


Fewer than half of women undergoing completion axillary lymph node dissection (ALND) for breast cancer will have non-sentinel node (NSLN) metastasis. We present a new model and the Stanford Online Calculator developed from a Northern California and Oregon database with superior accuracy and simplicity (three versus eight required patient variables) compared to the Memorial Sloan-Kettering Breast Cancer Nomogram for our dataset and another independent dataset. We hope that other institutions will test our model using their datasets, which will contain different patient demographics, to validate its accuracy and to refine in which populations it may be best used. Further investigation of predictive models to stratify risk of non-sentinel lymph node metastasis will better define their role in guiding clinical decision-making, while we await the results of larger randomized trials.



axillary lymph node dissection


lymph node


sentinel lymph node


non-sentinel lymph node


recursive partitioning with receiver operating characteristic curves


boosted classification and regression trees


multivariate logistic regression


Memorial Sloan-Kettering Cancer Center


MSKCC Breast Cancer Nomogram


estrogen receptor


progesterone receptor


area under ROC curve


lymphovascular invasion


hematoxylin and eosin




isolated tumor cells.


  1. Lyman GH, Giuliano AE, Somerfield MR, Benson AB, Bodurka DC, Burstein HJ, Cochran AJ, Cody HS, Edge SB, Galper S, Hayman JA, Kim TY, Perkins CL, Podoloff DA, Sivasubramaniam VH, Turner RR, Wahl R, Weaver DL, Wolff AC, Winer EP: American Society of Clinical Oncology guideline recommendations for sentinel lymph node biopsy in early-stage breast cancer. J Clin Oncol. 2005, 23 (30): 7703-7720. 10.1200/JCO.2005.08.001.

    Article  PubMed  Google Scholar 

  2. Rubio IT, Korourian S, Cowan C, Krag DN, Colvert M, Klimberg VS: Sentinel lymph node biopsy for staging breast cancer. Am J Surg. 1998, 176 (6): 532-537. 10.1016/S0002-9610(98)00264-5.

    Article  CAS  PubMed  Google Scholar 

  3. Carlson RW, McCormick B: Update: NCCN breast cancer Clinical Practice Guidelines. J Natl Compr Canc Netw. 2005, 3 Suppl 1: S7-11.

    PubMed  Google Scholar 

  4. Sosa JA, Diener-West M, Gusev Y, Choti MA, Lange JR, Dooley WC, Zeiger MA: Association between extent of axillary lymph node dissection and survival in patients with stage I breast cancer. Ann Surg Oncol. 1998, 5 (2): 140-149. 10.1007/BF02303847.

    Article  CAS  PubMed  Google Scholar 

  5. Chu KU, Turner RR, Hansen NM, Brennan MB, Bilchik A, Giuliano AE: Do all patients with sentinel node metastasis from breast carcinoma need complete axillary node dissection?. Ann Surg. 1999, 229 (4): 536-541. 10.1097/00000658-199904000-00013.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Turner RR, Chu KU, Qi K, Botnick LE, Hansen NM, Glass EC, Giuliano AE: Pathologic features associated with nonsentinel lymph node metastases in patients with metastatic breast carcinoma in a sentinel lymph node. Cancer. 2000, 89 (3): 574-581. 10.1002/1097-0142(20000801)89:3<574::AID-CNCR12>3.0.CO;2-Y.

    Article  CAS  PubMed  Google Scholar 

  7. Rietman JS, Dijkstra PU, Geertzen JH, Baas P, De Vries J, Dolsma W, Groothoff JW, Eisma WH, Hoekstra HJ: Short-term morbidity of the upper limb after sentinel lymph node biopsy or axillary lymph node dissection for Stage I or II breast carcinoma. Cancer. 2003, 98 (4): 690-696. 10.1002/cncr.11545.

    Article  PubMed  Google Scholar 

  8. Hack TF, Cohen L, Katz J, Robson LS, Goss P: Physical and psychological morbidity after axillary lymph node dissection for breast cancer. J Clin Oncol. 1999, 17 (1): 143-149.

    CAS  PubMed  Google Scholar 

  9. Mansel RE, Fallowfield L, Kissin M, Goyal A, Newcombe RG, Dixon JM, Yiangou C, Horgan K, Bundred N, Monypenny I, England D, Sibbering M, Abdullah TI, Barr L, Chetty U, Sinnett DH, Fleissig A, Clarke D, Ell PJ: Randomized multicenter trial of sentinel node biopsy versus standard axillary treatment in operable breast cancer: the ALMANAC Trial. J Natl Cancer Inst. 2006, 98 (9): 599-609.

    Article  PubMed  Google Scholar 

  10. Reynolds C, Mick R, Donohue JH, Grant CS, Farley DR, Callans LS, Orel SG, Keeney GL, Lawton TJ, Czerniecki BJ: Sentinel lymph node biopsy with metastasis: can axillary dissection be avoided in some patients with breast cancer?. J Clin Oncol. 1999, 17 (6): 1720-1726.

    CAS  PubMed  Google Scholar 

  11. Abdessalam SF, Zervos EE, Prasad M, Farrar WB, Yee LD, Walker MJ, Carson WB, Burak WE: Predictors of positive axillary lymph nodes after sentinel lymph node biopsy in breast cancer. Am J Surg. 2001, 182 (4): 316-320. 10.1016/S0002-9610(01)00719-X.

    Article  CAS  PubMed  Google Scholar 

  12. Rahusen FD, Torrenga H, van Diest PJ, Pijpers R, van der Wall E, Licht J, Meijer S: Predictive factors for metastatic involvement of nonsentinel nodes in patients with breast cancer. Arch Surg. 2001, 136 (9): 1059-1063. 10.1001/archsurg.136.9.1059.

    Article  CAS  PubMed  Google Scholar 

  13. Van Zee KJ, Manasseh DM, Bevilacqua JL, Boolbol SK, Fey JV, Tan LK, Borgen PI, Cody HS, Kattan MW: A nomogram for predicting the likelihood of additional nodal metastases in breast cancer patients with a positive sentinel node biopsy. Ann Surg Oncol. 2003, 10 (10): 1140-1151. 10.1245/ASO.2003.03.015.

    Article  PubMed  Google Scholar 

  14. Smidt ML, Kuster DM, van der Wilt GJ, Thunnissen FB, Van Zee KJ, Strobbe LJ: Can the Memorial Sloan-Kettering Cancer Center Nomogram predict the likelihood of nonsentinel lymph node metastases in breast cancer patients in the Netherlands?. Ann Surg Oncol. 2005, 12 (12): 1066-1072. 10.1245/ASO.2005.07.022.

    Article  PubMed  Google Scholar 

  15. Specht MC, Kattan MW, Gonen M, Fey J, Van Zee KJ: Predicting nonsentinel node status after positive sentinel lymph biopsy for breast cancer: clinicians versus nomogram. Ann Surg Oncol. 2005, 12 (8): 654-659. 10.1245/ASO.2005.06.037.

    Article  PubMed  Google Scholar 

  16. Park J, Fey JV, Naik AM, Borgen PI, Van Zee KJ, Cody HS: A declining rate of completion axillary dissection in sentinel lymph node-positive breast cancer patients is associated with the use of a multivariate nomogram. Ann Surg. 2007, 245 (3): 462-468. 10.1097/01.sla.0000250439.86020.85.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Kocsis L, Svebis M, Boross G, Sinko M, Maraz R, Rajtar M, Cserni G: Use and limitations of a nomogram predicting the likelihood of non-sentinel node involvement after a positive sentinel node biopsy in breast cancer patients. Am J Surg. 2004, 70 (11): 1019-1024.

    Google Scholar 

  18. Degnim AC, Reynolds C, Pantvaidya G, Zakaria S, Hoskin T, Barnes S, Roberts MV, Lucas PC, Oh K, Koker M, Sabel MS, Newman LA: Nonsentinel node metastasis in breast cancer patients: assessment of an existing and a new predictive nomogram. Am J Surg. 2005, 190 (4): 543-550. 10.1016/j.amjsurg.2005.06.008.

    Article  PubMed  Google Scholar 

  19. Alran S, De Rycke Y, Fourchotte V, Charitansky H, Laki F, Falcou MC, Benamor M, Freneaux P, Salmon RJ, Sigal-Zifrani B: Validation and limitations of use of a breast cancer nomogram predicting the likelihood of non-sentinel node involvement after positive sentinel node biopsy. Ann Surg Oncol. 2007, 14 (8): 2195-2201. 10.1245/s10434-006-9331-2.

    Article  PubMed  Google Scholar 

  20. Cody HS, Borgen PI: State-of-the-art approaches to sentinel node biopsy for breast cancer: study design, patient selection, technique, and quality control at Memorial Sloan-Kettering Cancer Center. Surg Oncol. 1999, 8 (2): 85-91. 10.1016/S0960-7404(99)00029-8.

    Article  PubMed  Google Scholar 

  21. Cserni G, Amendoeira I, Apostolikas N, Bellocq JP, Bianchi S, Bussolati G, Boecker W, Borisch B, Connolly CE, Decker T, Dervan P, Drijkoningen M, Ellis IO, Elston CW, Eusebi V, Faverly D, Heikkila P, Holland R, Kerner H, Kulka J, Jacquemier J, Lacerda M, Martinez-Penuela J, De Miguel C, Peterse JL, Rank F, Regitnig P, Reiner A, Sapino A, Sigal-Zafrani B, Tanous AM, Thorstenson S, Zozaya E, Wells CA: Pathological work-up of sentinel lymph nodes in breast cancer. Review of current data to be considered for the formulation of guidelines. Eur J Cancer. 2003, 39 (12): 1654-1667. 10.1016/S0959-8049(03)00203-X.

    Article  CAS  PubMed  Google Scholar 

  22. Elston CW, Ellis IO: Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991, 19 (5): 403-410. 10.1111/j.1365-2559.1991.tb00229.x.

    Article  CAS  PubMed  Google Scholar 

  23. Singletary SE, Greene FL, Sobin LH: Classification of isolated tumor cells: clarification of the 6th edition of the American Joint Committee on Cancer Staging Manual. Cancer. 2003, 98 (12): 2740-2741. 10.1002/cncr.11865.

    Article  PubMed  Google Scholar 

  24. Breiman L, Friedman JH, Olshen RA, Stone CJ: Classification and Regression Trees. 1984, Belmont, CA , Wadsworth

    Google Scholar 

  25. Dalgaard P: Introductory Statistics with R. 2002, USA , Springer

    Google Scholar 

  26. Kraemer HC: Evaluating medical tests: objective and quantitative guidelines. 1992, Newbury Park, CA. , Sage Publications

    Google Scholar 

  27. Hastie T, Tibshirani R, Friedman JH: The Elements of Statistical Learning. 2001, USA , Springer

    Chapter  Google Scholar 

  28. Kwak LW, Halpern J, Olshen RA, Horning SJ: Prognostic significance of actual dose intensity in diffuse large-cell lymphoma: results of a tree-structured survival analysis. J Clin Oncol. 1990, 8 (6): 963-977.

    CAS  PubMed  Google Scholar 

  29. ROC software, Sierra-Pacific MIRECC. []

  30. CART software, Salford Systems. []

  31. McLachlan G: Discriminant Analysis and Statistical Pattern Recognition. 1992, USA , John Wiley and Sons, Inc.

    Chapter  Google Scholar 

  32. Stone CJ: A Course in Probability and Statistics. 1995, Belmont, CA , Duxbury Press

    Google Scholar 

  33. Team RDC: R: A Language and Environment for Statistical Computing. 2006, Vienna, Austria , R Foundation for Statistical Computing, []

    Google Scholar 

  34. DeLong ER, DeLong DM, Clarke-Pearson DL: Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988, 44 (3): 837-845. 10.2307/2531595.

    Article  CAS  PubMed  Google Scholar 

  35. Veronesi U, Paganelli G, Viale G, Luini A, Zurrida S, Galimberti V, Intra M, Veronesi P, Robertson C, Maisonneuve P, Renne G, De Cicco C, De Lucia F, Gennari R: A randomized comparison of sentinel-node biopsy with routine axillary dissection in breast cancer. N Engl J Med. 2003, 349 (6): 546-553. 10.1056/NEJMoa012782.

    Article  PubMed  Google Scholar 

  36. Esserman L, Sepucha K: Practice implications of the high false negative rate of sentinel lymph node biopsy reported in NSABP-32. J Clin Oncol. 2005, 23 (16S): 812-

    Google Scholar 

  37. Veronesi U, Paganelli G, Viale G, Luini A, Zurrida S, Galimberti V, Intra M, Veronesi P, Maisonneuve P, Gatti G, Mazzarol G, De Cicco C, Manfredi G, Fernandez JR: Sentinel-lymph-node biopsy as a staging procedure in breast cancer: update of a randomised controlled study. Lancet Onc. 2006, 7 (12): 983-990. 10.1016/S1470-2045(06)70947-0.

    Article  Google Scholar 

  38. van Rijk MC, Peterse JL, Nieweg OE, Oldenburg HS, Rutgers EJ, Kroon BB: Additional axillary metastases and stage migration in breast cancer patients with micrometastases or submicrometastases in sentinel lymph nodes. Cancer. 2006, 107 (3): 467-471. 10.1002/cncr.22069.

    Article  PubMed  Google Scholar 

  39. Orr RK: The impact of prophylactic axillary node dissection on breast cancer survival--a Bayesian meta-analysis. Ann Surg Oncol. 1999, 6 (1): 109-116. 10.1007/s10434-999-0109-1.

    Article  CAS  PubMed  Google Scholar 

  40. Krag DN, Julian TB, Harlow SP, Weaver DL, Ashikaga T, Bryant J, Single RM, Wolmark N: NSABP-32: Phase III, randomized trial comparing axillary resection with sentinal lymph node dissection: a description of the trial. Ann Surg Oncol. 2004, 11 (3 Suppl): 208S-10S.

    Article  PubMed  Google Scholar 

  41. Grube BJ, Giuliano AE: Observation of the breast cancer patient with a tumor-positive sentinel node: implications of the ACOSOG Z0011 trial. Semin Surg Oncol. 2001, 20 (3): 230-237. 10.1002/ssu.1038.

    Article  CAS  PubMed  Google Scholar 

  42. Fant JS, Grant MD, Knox SM, Livingston SA, Ridl K, Jones RC, Kuhn JA: Preliminary outcome analysis in patients with breast cancer and a positive sentinel lymph node who declined axillary dissection. Ann Surg Oncol. 2003, 10 (2): 126-130. 10.1245/ASO.2003.04.022.

    Article  PubMed  Google Scholar 

  43. Wada N, Imoto S, Yamauchi C, Hasebe T, Ochiai A: Predictors of tumour involvement in remaining axillary lymph nodes of breast cancer patients with positive sentinel lymph node. Eur J Surg Oncol. 2006, 32 (1): 29-33. 10.1016/j.ejso.2005.08.010.

    Article  CAS  PubMed  Google Scholar 

  44. Kamath VJ, Giuliano R, Dauway EL, Cantor A, Berman C, Ku NN, Cox CE, Reintgen DS: Characteristics of the sentinel lymph node in breast cancer predict further involvement of higher-echelon nodes in the axilla: a study to evaluate the need for complete axillary lymph node dissection. Arch Surg. 2001, 136 (6): 688-692. 10.1001/archsurg.136.6.688.

    Article  CAS  PubMed  Google Scholar 

  45. Yu JC, Hsu GC, Hsieh CB, Sheu LF, Chao TY: Prediction of metastasis to non-sentinel nodes by sentinel node status and primary tumor characteristics in primary breast cancer in Taiwan. World J Surg. 2005, 29 (7): 813-8; discussion 818-9. 10.1007/s00268-005-7744-x.

    Article  PubMed  Google Scholar 

  46. Viale G, Maiorano E, Pruneri G, Mastropasqua MG, Valentini S, Galimberti V, Zurrida S, Maisonneuve P, Paganelli G, Mazzarol G: Predicting the risk for additional axillary metastases in patients with breast carcinoma and positive sentinel lymph node biopsy. Ann Surg. 2005, 241 (2): 319-325. 10.1097/01.sla.0000150255.30665.52.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Bolster MJ, Peer PGM, Bult P, Thunnissen FBJM, Schapers RFM, Meijer JWR, Strobbe LJA, van Berlo CLH, Klinkenbijl JHG, Beex LVAM, Wobbes T, Tjan-Heijnen VCG: Risk factors for non-sentinel lymph node metastases in patients with breast cancer. The outcome of a multi-institutional study. Ann Surg Oncol. 2007, 14 (1): 181-189. 10.1245/s10434-006-9065-1.

    Article  PubMed  Google Scholar 

  48. Carcoforo P, Maestroni U, Querzoli P, Lanzara S, Maravegias K, Feggi L, Soliani G, Basaglia E: Primary breast cancer features can predict additional lymph node involvement in patients with sentinel node micrometastases. World J Surg. 2006, 30 (9): 1653-1657. 10.1007/s00268-005-0083-0.

    Article  CAS  PubMed  Google Scholar 

  49. Rivers AK, Griffith KA, Hunt KK, Degnim AC, Sabel MS, Diehl KM, Cimmino VM, Chang AE, Lucas PC, Newman LA: Clinicopathologic features associated with having four or more metastatic axillary nodes in breast cancer patients with a positive sentinel lymph node. Ann Surg Oncol. 2006, 13 (1): 36-44. 10.1245/ASO.2006.03.080.

    Article  PubMed  Google Scholar 

  50. Ozmen V, Karanlik H, Cabioglu N, Igci A, Kecer M, Asoglu O, Tuzlali S, Mudun A: Factors predicting the sentinel and non-sentinel lymph node metastases in breast cancer. Breast Cancer Res Treat. 2006, 95 (1): 1-6. 10.1007/s10549-005-9007-9.

    Article  CAS  PubMed  Google Scholar 

  51. Calhoun KE, Hansen NM, Turner RR, Giuliano AE: Nonsentinel node metastases in breast cancer patients with isolated tumor cells in the sentinel node: implications for completion axillary node dissection. Am J Surg. 2005, 190 (4): 588-591. 10.1016/j.amjsurg.2005.06.018.

    Article  PubMed  Google Scholar 

  52. Cserni G, Gregori D, Merletti F, Sapino A, Mano MP, Ponti A, Sandrucci S, Baltas B, Bussolati G: Meta-analysis of non-sentinel node metastases associated with micrometastatic sentinel nodes in breast cancer. Br J Surg. 2004, 91 (10): 1245-1252. 10.1002/bjs.4725.

    Article  CAS  PubMed  Google Scholar 

  53. Cserni G, Bianchi S, Vezzosi V, Arisio R, Bori R, Peterse JL, Sapino A, Castellano I, Drijkoningen M, Kulka J, Eusebi V, Foschini MP, Bellocq JP, Marin C, Thorstenson S, Amendoeira I, Reiner-Concin A, Decker T, Lacerda M, Figueiredo P, Fejes G: Sentinel lymph node biopsy in staging small (up to 15 mm) breast carcinomas. Results from a European multi-institutional study. Path Onc Res. 2007, 13 (1): 5-14.

    Article  Google Scholar 

  54. Kohrt HE, Nouri N, Nowels K, Johnson D, Holmes S, Lee PP: Profile of immune cells in axillary lymph nodes predicts disease-free survival in breast cancer. PLoS Med. 2005, 2 (9): e284-10.1371/journal.pmed.0020284.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Jeffrey SS, Lonning PE, Hillner BE: Genomics-based prognosis and therapeutic prediction in breast cancer. J Natl Compr Canc Netw. 2005, 3 (3): 291-300.

    PubMed  Google Scholar 

  56. Soni NK, Carmalt HL, Gillett DJ, Spillane AJ: Evaluation of a breast cancer nomogram for prediction of non-sentinel lymph node positivity. Eur J Surg Oncol. 2005, 31 (9): 958-964. 10.1016/j.ejso.2005.04.011.

    Article  CAS  PubMed  Google Scholar 

  57. Lambert LA, Ayers GD, Hwang RF, Hunt KK, Ross MI, Kuerer HM, Singletary SE, Babiera GV, Ames FC, Feig B, Lucci A, Krishnamurthy S, Meric-Bernstam F: Validation of a breast cancer nomogram for predicting nonsentinel lymph node metastases after a positive sentinel node biopsy. Ann Surg Oncol. 2006, 13 (3): 310-320. 10.1245/ASO.2006.03.078.

    Article  PubMed  Google Scholar 

  58. Cripe MH, Beran LC, Liang WC, Sickle-Santanello BJ: The likelihood of additional nodal disease following a positive sentinel lymph node biopsy in breast cancer patients: validation of a nomogram. Am J Surg. 2006, 192 (4): 484-10.1016/j.amjsurg.2006.06.016.

    Article  PubMed  Google Scholar 

  59. Dauphine CE, Haukoos JS, Vargas MP, Isaac NM, Khalkhali I, Vargas HI: Evaluation of three scoring systems predicting non sentinel node metastasis in breast cancer patients with a positive sentinel node biopsy. Ann Surg Oncol. 2007, 14 (3): 1014-1019. 10.1245/s10434-006-9223-5.

    Article  PubMed  Google Scholar 

  60. Ponzone R, Maggiorotto F, Mariani L, Jacomuzzi ME, Magistris A, Mininanni P, Biglia N, Sismondi P: Comparison of two models for the prediction of nonsentinel node metastases in breast cancer. Am J Surg. 2007, 193 (6): 686-692. 10.1016/j.amjsurg.2006.09.031.

    Article  PubMed  Google Scholar 

  61. Surveillance Epidemiology and End Results Statistics. []

  62. Hwang RF, Krishnamurthy S, Hunt KK, Mirza N, Ames FC, Feig B, Kuerer HM, Singletary SE, Babiera G, Meric F, Akins JS, Neely J, Ross MI: Clinicopathologic factors predicting involvement of nonsentinel axillary nodes in women with breast cancer. Ann Surg Oncol. 2003, 10 (3): 248-254. 10.1245/ASO.2003.05.020.

    Article  PubMed  Google Scholar 

  63. Barranger E, Coutant C, Flahault A, Delpech Y, Darai E, Uzan S: An axilla scoring system to predict non-sentinel lymph node status in breast cancer patients with sentinel lymph node involvement. Breast Cancer Res Treat. 2005, 91 (2): 113-119. 10.1007/s10549-004-5781-z.

    Article  PubMed  Google Scholar 

  64. Stanford Online Calculator. []

Pre-publication history

Download references


The authors thank Mme. Sophie DuLac for her generous support of the Bay Area SLN Study. RAO was supported in part by NIH grant 2-R01-EB-002784-30. We are indebted to Elizabeth Spence, Sunita Jones, Ph.D., Maureen Chang, and Susan Overholser for their skilled work in database management. We thank Bonnie Chung, Balasubramanian Narasimhan, and Adam Kapelner for their assistance in creating and improving the Web site and Stanford Online Calculator. We also thank David Siegmund, Ph.D., of Stanford's Department of Statistics for assistance in the development and initial analysis of the Bay Area Sentinel Node Database, as well as Dr. Don Goffinet of Stanford's Department of Radiation Oncology and current and former members of Stanford's Division of Nuclear Medicine, Drs. Michael Goris, Ross McDougall, William Pace, and H. William Strauss for facilitating this investigation.

We acknowledge all physicians and institutions participating in the Bay Area SLN Study including: Stanford University Medical Center, Stanford, CA: Stefanie Jeffrey, MD (Principal Investigator); James Badger, MD, Robyn Birdwell, MD, Martin Bronk, MD, Robert Carlson, MD, Frederick Dirbas, MD, Jocelyn Dunn, MD, Don Goffinet, MD, David Gregg, MD, Debra Ikeda, MD, Denise Johnson, MD, Gail Lebovic, MD, Mary Ellen Mahoney, MD, Richard Olshen, PhD, William Pace, MD, Robert Rouse, MD, Melanie Smitt, MD, Lynn Smolik, MD, Frank Stockdale, MD, H. William Strauss, MD, and Ward Trueblood, MD; Alta Bates Summit Medical Center, Berkeley, CA: Charles Jenkins, MD, and Lisa Bailey, MD; California Pacific Medical Center, SF, CA: William Goodson III, MD; Doctor's Medical Center, San Pablo, CA: Stuart J Gourlay, MD; Dominican Santa Cruz Hospital, Santa Cruz, CA: David Albritton, MD, and David Rose, MD; El Camino Hospital/El Camino Surgery Center, Mountain View, CA: Alfred Butner, MD; Enloe Hospital, Chico, CA: William Battinich, MD, F. David Collins, MD, and Eugene Cleek, MD; Mercy Medical Center, Redding, CA: Vicki Philben, MD; Mt. Diablo Hospital, Concord, CA: Burton Baker, MD, and Don Beerline, MD; Queen of the Valley Hospital, Napa, CA: Kristen Engle, MD, Leland Raymond, MD, and Wendell Wenneker, MD; San Ramon Regional Center, San Ramon, CA: Mary Estakhri, MD, and Michael Wynn, MD; Seton Hospital, Daly City, CA: Henry Chin, MD, and Dorothy McNoble, MD; St. Josephs Hospital, Stockton, CA: Dean Sloan, MD; The Sutter Cancer Center, Sacramento, CA: Vincent Caggiano, MD, Joyce Eaker, MD, Jay Owens, MD, and Mark Roberts, MD; Sutter Roseville Medical Center, Roseville, CA: Peter Krone, MD, Alan McNabb, MD, and Frederick Weiland, MD; Willamette Valley Medical Center, McMinnville, OR: Harold Hoover, MD, Harry McCulley, MD, and Erik Swensson, MD.

Author information

Authors and Affiliations



Corresponding author

Correspondence to Stefanie S Jeffrey.

Additional information

Competing interests

Richard A. Olshen is one of the original developers of CART®, the software for which is distributed by Salford Systems of San Diego, CA. The remaining authors have no competing interests to declare.

Authors' contributions

HEK, RAO, and SSJ conceived the study, performed data analysis, and drafted the manuscript with critical input from RWC, FMD, WHG, DLJ, RVR, FES, and ILW. LB, RWC, FMD, JJD, WHG, SSJ, DLJ, VJP, and FES developed the Bay Area SLN Study Database and assisted in patient accrual, clinical data acquisition, or data review. RVR processed and analyzed sentinel node tissue blocks. DJW and SH wrote the software for the Stanford Online Calculator and created the website. HRB obtained IRB approval and compiled all Northwestern SLN test data under the guidance of NMH. All authors read and approved the final manuscript.

Electronic supplementary material


Additional file 1: Schematic of patients accrued to Bay Area SLN Database. An overview of the entire Bay Area SLN Database and exclusion criteria for this study. (PDF 131 KB)


Additional file 2: Patient, primary tumor, and lymph node characteristics among SLN-negative and SLN-positive patients from the Bay Area SLN Database. This table describes the demographics of the patient, primary tumor, and lymph node characteristics among SLN-negative and SLN-positive patients from the Bay Area SLN Database. (DOC 205 KB)


Additional file 3: The relationship of angiolymphatic invasion and size of SLN metastasis to tumor size (Bay Area SLN Database). This table shows A. the occurrence of angiolymphatic invasion with increasing tumor size for SLN-negative and SLN-positive patients, and B. size of SLN metastasis with increasing tumor size (Bay Area SLN Database). (DOC 74 KB)


Additional file 4: CART decision tree for patients with complete data on angiolymphatic invasion status, n = 213 (Bay Area SLN Database). The figure shows the CART decision tree for 213 patients with complete data on angiolymphatic invasion status. (PDF 147 KB)


Additional file 5: CART decision tree for patients with complete data on angiolymphatic invasion status and ER status, n = 171 (Bay Area SLN Database). The figure shows the CART decision tree for patients with complete data on angiolymphatic invasion status and ER status. (PDF 181 KB)


Additional file 6: Patient, primary tumor, and lymph node characteristics among SLN-positive and SLN-negative patients from the Northwestern dataset. The table describes the demographics of the patient, primary tumor, and lymph node characteristics among SLN-negative and SLN-positive patients from the Northwestern dataset. (DOC 114 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Kohrt, H.E., Olshen, R.A., Bermas, H.R. et al. New models and online calculator for predicting non-sentinel lymph node status in sentinel lymph node positive breast cancer patients. BMC Cancer 8, 66 (2008).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: