Skip to main content

Advertisement

Artificial neural network models to predict nodal status in clinically node-negative breast cancer

Abstract

Background

Sentinel lymph node biopsy (SLNB) is standard staging procedure for nodal status in breast cancer, but lacks therapeutic benefit for patients with benign sentinel nodes. For patients with positive sentinel nodes, individualized surgical strategies are applied depending on the extent of nodal involvement. Preoperative prediction of nodal status is thus important for individualizing axillary surgery avoiding unnecessary surgery. We aimed to predict nodal status in clinically node-negative breast cancer and identify candidates for SLNB omission by including patient-related and pathological characteristics into artificial neural network (ANN) models.

Methods

Patients with primary breast cancer were consecutively included between January 1, 2009 and December 31, 2012 in a prospectively maintained pathology database. Clinical- and radiological data were extracted from patient’s files and only clinically node-negative patients constituted the final study cohort. ANN-based models for nodal prediction were constructed including 15 risk variables for nodal status. Area under the receiver operating characteristic curve (AUC) and Hosmer-Lemeshow goodness-of-fit test (HL) were used to assess performance and calibration of three predictive ANN-based models for no lymph node metastasis (N0), metastases in 1–3 lymph nodes (N1) and metastases in ≥ 4 lymph nodes (N2). Linear regression models for nodal prediction were calculated for comparison.

Results

Eight hundred patients (N0, n = 514; N1, n = 232; N2, n = 54) were included. Internally validated AUCs for N0 versus N+ was 0.740 (95% CI = 0.723–0.758); median HL was 9.869 (P = 0.274), for N1 versus N0, 0.705 (95% CI = 0.686–0.724; median HL: 7.421; P = 0.492) and for N2 versus N0 and N1, 0.747 (95% CI = 0.728–0.765; median HL: 9.220; P = 0.324). Tumor size and vascular invasion were top-ranked predictors of all three end-points, followed by estrogen receptor status and lobular cancer for prediction of N2. For each end-point, ANN models showed better discriminatory performance than multivariable logistic regression models. Accepting a false negative rate (FNR) of 10% for predicting N0 by the ANN model, SLNB could have been abstained in 27.25% of patients with clinically node-negative axilla.

Conclusions

In this retrospective study, ANN showed promising result as decision-supporting tools for estimating nodal disease. If prospectively validated, patients least likely to have nodal metastasis could be spared SLNB using predictive models.

Trial registration

Registered in the ISRCTN registry with study ID ISRCTN14341750.

Date of registration 23/11/2018. Retrospectively registered.

Background

Sentinel lymph node biopsy (SLNB) is the standard axillary staging procedure for patients with clinically node-negative primary breast cancer. In the majority of patients, SLNB will prove negative and no nodal metastasis is diagnosed [1]. Moreover, in approximately half of the SLNB-positive cases, no further metastatic lymph nodes will be harvested during routine completion axillary lymph node dissection (ALND) [2]. While a prospective randomized trial [3] questioned the value of axillary surgical staging in selected low-risk patients, the American College of Surgeons Oncology Group (ACOSOG) Z0011 trial suggested that patients with 1–2 sentinel node metastasis were eligible for minimalistic axillary surgical interventions without completion ALND, and reported no negative consequences for survival or locoregional recurrence after 10 years of follow-up [4, 5]. However, patients with heavy-burden axillary disease (stage N2) could benefit from preoperative selection for neoadjuvant therapy or direct ALND. An improvement in breast cancer management has been to seek an individualized surgical approach to the axilla, and an accurate prediction of the axillary status preoperatively would facilitate individualized surgical decisions.

However, validation of prediction models for nodal status have shown diverse accuracies in estimating nodal involvement [6, 7] and mirror the complexity of factors related to axillary metastasis, and the paucity of models to analyze nonlinear dynamics between relevant variables. Artificial neural networks (ANNs) are nonlinear machine learning methods proposed as supplements to standard statistical models for predicting multifaceted biological events [8, 9], and help in the exploration of underlying nonlinear interactions of interconnected predictors [10]. ANNs have gained utility in various clinical settings, and are being used as diagnostic and prognostic tools in cancer [11, 12], and for prediction of surgical outcomes in various disease conditions [13, 14].

The primary aim of this study was to utilize commonly available patient-related and clinicopathological characteristics in ANN modeling to predict nodal axillary status. The end-points were chosen to reflect the extent of nodal metastatic burden, with an aim to designate no lymph node metastasis (N0), metastases in 1–3 lymph nodes (N1) and metastases in ≥ 4 lymph nodes (N2), respectively. A secondary aim was to assess possible clinical benefit in detecting disease-free axilla (N0). Patient stratification preoperatively using the ANN model applying nodal predictive variables would help identify patients least likely to benefit from SLNB, consequently reducing the rate of unbeneficial surgery. In the clinical setting, the models may be useful tools for risk-benefit analysis of axillary treatment and contribute to improved patient stratification for surgical axillary interventions.

Methods

Patient selection

Patients (n = 995) were included in a prospectively maintained pathological database and the following eligibility criteria were applied: consecutive patients diagnosed with primary breast cancer between January 2009 and December 2012 at the Skåne University Hospital (Lund, Sweden). Exclusion criteria were: male sex, previous ipsilateral breast or axillary surgery, previous neoadjuvant therapy, palpable axillary lymphadenopathy (palpable adenopathy or matted lymph nodes at the time of diagnoses) and omission of standard axillary staging procedure by SLNB or ALND (Fig. 1). Presence of micro (> 0.2 mm and/or more than 200 cells, but none > 2.0 mm)- or macrometastases (> 2.0 mm) on SLNB indicated axillary node-positivity. Patients gave verbal informed consent to participate at time of diagnosis and the ethics committee at Lund University approved this procedure (LU 2013/340). Patients were informed that they had the opportunity to opt-out if they were not willing to participate in the study.

Fig. 1
figure1

Study population. The flow chart shows the original patient population, excluded patients, and details of the surgical axillary nodal staging procedures. Abbreviations: SLNB, Sentinel lymph node biopsy; ALND, Axillary lymph node dissection. * Palpable adenopathy or matted lymph nodes at the time of diagnosis

Data collection

Data regarding previous breast or axillary surgery and mode of detection (mammography screening or symptomatic presentation) were obtained from The Swedish National Quality Registry for Breast Cancer and from the public mammography screening program records. Medical records were reviewed for age, menopausal status, clinical axillary status, and body mass index (BMI) data. A breast pathologist extracted the following histopathological variables: synchronous bilateral malignancy status, tumor localization in the breast (centrally or in quadrants, overlapping lesions were allocated equally into adjacent quadrants for analysis), multifocality (two or more tumor foci separated by benign breast tissue; multicentricity was not a separate entity), tumor size, histological type (ductal carcinoma of no special type, invasive lobular carcinoma, or other invasive carcinoma), histological grade, biomarker status (estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2)), Ki-67 positivity, and lymphovascular invasion (LVI) status.

Acquisition of the ANN classifiers

Three ANN models were defined, each containing multi-layer perceptrons (MLPs) with three layers: an input layer corresponding to the number of risk variables (patient-related and clincopathological), one hidden layer, and a single node output layer. The chosen output reflected the extent of metastatic involvement: disease-free axilla (N0), N1 (1–3 metastatic nodes versus N0), and N2 (N ≥ 4 metastatic nodes versus N0 and N1).

An ensemble technique was applied; several ANNs were averaged into a single prediction model for each classification output. Each individual ANN in the ensemble was trained by standard back-propagation techniques using a cross-entropy error function [8] to learn the association to a given nodal status output. To avoid overfitting, the dropout technique [15, 16] was employed on the input layer. Internal model validation strategy was performed by 4-fold cross-validation, which was repeated five times. Missing data were handled by multiple random imputations for each of the five repetitions. This model validation strategy generated (5 × 4) 20 derivation sets and 20 test sets. For each of the derivation sets, a model selection procedure was carried out independently of the corresponding test set. The model selection strategy was based on a 5-fold cross-validation, which was repeated seven times. Model selection identified the best set of variables (dropout probability and number of hidden nodes) using a grid search. The model validation and selection procedures (Fig. 2) minimized information leakage as each test set did not influence the model selection in any way.

Fig. 2
figure2

Model selection and internal validation strategies. Internal validation was performed by 4-fold cross-validation, which was repeated five times. Each round of cross-validation involved partitioning the data into a test set and a derivation set. The model selection was carried out for each of the derivation sets, independent of the corresponding test set, and was aimed to minimize information leakage. Model selection strategy was based on a 5-fold cross-validation, which was repeated seven times. Abbreviations: D, Different parts of the derivation set in each round of cross-validation. T, The test set in each round of cross-validation

In total, 20 ANN ensemble models were trained and evaluated for each of the nodal status outputs. An ANN ensemble consisted of 15 individually trained MLPs, and the average of these networks was used as the ensemble output. Each of the MLPs was allowed to vary in size during model selection.

Predictive performance and statistical analysis

The performance of the ANN-based models to classify nodal status outputs was assessed by area under the receiver operating characteristic curve (AUC). Discriminatory performances were compared with multivariable logistic regression, using identical model test sets and risk variables, where differences were evaluated by comparing mean validation AUC values (Wilcoxon signed-rank test). Logistic regression analysis was performed and odds ratios (mean odds ratios calculated as per Lippman et al) [17], were used to quantify the association of a risk variable with the outcome. The overall importance of selected risk variables in each classification model was assessed by means of a permutation technique [18]. In short, a predictor variable was randomized across the evaluation cohort and the effect of this randomization on the estimated performance was measured. The predictor associated with the largest decrease in performance was assigned an importance value of 1. All other variables were assigned a position in this list based on the associated decrease in performance upon randomization.

Negative predictive value (NPV) and false negative rate (FNR) are two separate but related principles to assess the usefulness of a predictive tool. While the NPV indicates the probability that a patient with predicted disease-free axilla will be truly free of axillary disease, the FNR depicts the deficiency of the model in predicting nodal spread as a ratio relative to all pathology-defined nodal metastases [19]. A cut-off for classification of nodal-negativity was set based on maximized NPV in the ANN model for N0. As previously described [18], this threshold was aimed at identifying individuals with a very low probability of axillary disease. True positive (TP), true negative (TN), false positive (FP), and false negative (FN) results were assessed to evaluate the following: SLNB reduction rate = (TN + FN)/(TN + FN + TP + FP), and FNR = FN/FN + TP. Alternative cut-offs at FNR 5 and 10% were also applied and corresponding possible SLNB reduction rates were calculated.

The distribution of clinicopathological characteristics across the nodal classification outputs were evaluated by the Jonckheere-Terpstra test, Chi Square test for trend, Pearson Chi Square test, or the Fisher’s exact test, as appropriate. Hosmer-Lemeshow goodness-of-fit (HL) statistic was used to assess calibration. All analyses were performed with IBM SPSS Statistics for Windows (version 24.0) and with custom-made software written in C (gcc version 4.8.5), and Perl (version 5.18.2).

The developing of the predictive models and the reporting of the findings were in accordance with an EQUATOR Guideline for reporting machine learning predictive models [20], which supported the STROBE statement for the reporting of observational studies [21].

Results

Study population

Figure 1 displays the axillary surgical procedures performed for the overall study cohort (n = 800). The nodal status distribution was as follows: N0: 514 (64%); N1: 232 (29%); and N2: 54 (7%). Clinical and histopathological characteristics are summarized in Table 1.

Table 1 Patient and tumor characteristics

Predictive Clinicopathological variables

Table 1 displays the 15 potential clinicopathological risk variables for designation of nodal status (as N0, N1, or N2). Although the discriminatory effect of each variable cannot be expressed in terms of straightforward coefficients, mean odds ratios and sensitivity analysis can facilitate the interpretation of the relationship between an independent variable and the output. Table 2 displays selected variables, ranks, and mean odds ratios used for classification of each nodal status output. The ANN structure for predicting disease-free axilla status was characterized by a complex integration of predictors as input variables. The top ten variables were tumor size, LVI, multifocality, ER status, histological type, PR status, mode of detection, age, tumor localization in the breast, and Ki-67 positivity. To discriminate low-burden disease (N1), the same top variables were selected, with two exceptions: omission of mode of detection, and inclusion of menopausal status. While tumor size and LVI remained the top two variables most strongly associated with any nodal status output, other variables varied in rank of association with N0, N1, and N2 disease. Only six input variables were found to be predictive in the ANN structure for heavy-burden disease (N2): tumor size, LVI, ER status, histological type, and multifocality. A simplified illustration of the importance and relations of the different input variables as regression trees was constructed, for each of the models, and depicted in Fig. 3. Each tree was trained to predict the output (probabilities) of the corresponding ANN model. Sensitivity analysis of the assigned importance of the top rank predictive variables linearly scaled into a summation of 1 for the three models are given in Additional file 1.

Table 2 Top rank predictive clinicopathological variables in the ANN models for each of the axillary nodal status outcome
Fig. 3
figure3

Simplified illustration of the decision made by the ANN model. The regression trees were trained to predict the output probabilities made by the ANN model, given the identified top-ranked variables. Each tree was only allowed to grow to depth 4 and the full dataset was used to construct the trees. The numbers in the green boxes indicate average ANN output and the size of the data at the given node. A Decision tree N0 vs. N+; B Decision tree N1 vs. N0; C Decision tree N2 vs. N0 and N1

Discriminatory ability and calibration

Mean training AUC from the derivation sets was 0.735 for disease-free axilla. The corresponding internally validated AUC for N0 was 0.740 (95% confidence interval (CI) = 0.723–0.758) with an observed median HL statistic of 9.869 (P = 0.274). The ANN model to distinguish low-burden disease (N1 versus N0) showed an AUC of 0.706 in the training set, and an internally validated AUC of 0.705 (95% CI = 0.686–0.724). For high-burden disease (N2 versus N0 and N1), the training AUC was 0.735, while the internally validated AUC was 0.747 (95% CI = 0.728–0.765), with the corresponding median HL statistic values of 7.421 (P = 0.492) and 9.220 (P = 0.324), respectively. This indicated that the number of N0, N1, and N2 cases observed were not significantly different from those predicted by the models, and that the overall model calibration was good.

Performances in comparison to linear multivariable logistic regression

The discriminative abilities of internally validated ANN models and cross-validated multivariable logistic regression models (MLR) were compared. To distinguish N0, MLR models achieved a mean AUC of 0.727 (95% CI = 0.708–0.746). In 17 out of 20 test sets for N0, the AUC values from ANN models were greater than those obtained from the corresponding MLR models (P < 0.001). For N2 classification, MLR models obtained a mean AUC of 0.723 (95% CI = 0.694–0.750). Here, AUC values from the ANN models were greater than those obtained from the matching MLR models in 16 out of 20 test sets (P = 0.003). Likewise, the ANN models for N1 classification achieved greater discriminatory ability than did the corresponding MLR models in the majority (14 out of 20) of test sets (P = 0.040). However, an equivalent mean AUC of 0.700 (95% CI = 0.678–0.720) was obtained from the MLR models. Comparing ANN and MLR models based on the HL statistics, the ANN models were significantly more calibrated for N1 and N2 models (P = 0.003 and P = 0.006, respectively). However, for the N0 model the difference, in favor of the ANN model, was not significant (P = 0.09).

Clinical utility for SLNB reduction

To assess the clinical utility of ANN models in reducing unnecessary SLNB procedures, prediction of N0 status using a NPV-oriented cut-off was assessed. The maximized NPV was 95%. If the N0 model with this threshold were to be used in a preoperative setting to identify N0 patients, the ANN model would reduce SLNB procedures by 7.50%, with a corresponding FNR of 1.05%. If an alternative FNR of 5–10% was to be accepted, in accordance with the FNR for sentinel node biopsy technique, the corresponding SLNB reduction rate would be 17.75–27.25% (Table 3).

Table 3 SLNB reduction rates using the ANN model to predict disease-free axilla. Possible SLNB reduction rate corresponding to cut-offs at maximum negative predictive value, false negative rate 5 and 10%, respectively

Discussion

The current study presents ANN-based models for the prediction of nodal status based on routinely available clinicopathological characteristics, in a cohort of primary breast cancer patients consecutively and prospectively included in a pathology database. Internally validated performances displayed AUCs ranging from 0.705–0.747, with good calibration. These models highlighted the utility of nonlinear assessments of clinical characteristics and histological variables for prediction of axillary nodal status, especially in distinguishing disease-free axilla (N0) from high-burden disease (N2).

ANN models have been useful in detecting nodal metastasis on histopathological slides [22], and in evaluating risk of non-sentinel node involvement in breast malignancies [23]. Previous studies have proposed ANN-based algorithms for predicting nodal metastasis [24,25,26], though these studies were either based on small sample sizes or were conducted in selected patient cohorts. To the best of our knowledge, this is the largest study to present ANN-based algorithms predicting the extent of nodal metastatic burden in a population-based, contemporary, breast cancer cohort.

To predict nodal metastasis, our model integrates a complex set of input variables which reflect the multifactorial nature of the axillary metastatic process [27]. Variables in ANN-based models should not be taken for independent since the cause and effect reflect a dynamic process. Nevertheless, an attempt was made to better comprehend the importance of each variable by sensitivity analysis. While mean odds ratios were used for simplicity, the corresponding percentiles emphasized the dynamic nature of the input variables.

The present results reinforced tumor size [28] and LVI [29] as the most significant predictors of axillary metastasis. Age was significant in predicting disease-free axilla and low-burden disease. A nonlinear association between age and nodal status has previously been shown, with a low probability of nodal metastasis in those aged< 70 years, and increased probability in those aged > 70 years [30, 31]. In this study, positive ER and PR status was predictive of nodal metastasis, in agreement with literature; the TNBC subtype, although more aggressive, infrequently metastasizes to the axilla [32]. While a negative PR status has been shown to independently lower the risk of nodal metastasis [33], Ki-67 positivity has been associated with nodal metastasis [34, 35], and the present results are in agreement. Interestingly, alterations in the distribution of the breast cancer intrinsic subtypes has been reported to occur from the premenopausal state to the postmenopausal state [36, 37]. It is also noteworthy that menopausal status, in addition to hormone receptor status and age, was predictive of disease burden. Some publications have suggested a higher proportion of nodal metastasis in lobular cancer than that in the breast carcinoma of no special (ductal) type [38]; however, others have either found no significant differences [39], or have implied a lower incidence of nodal metastasis in the lobular than that in the ductal type [40]. The present results revealed a nonlinear association between histological type and nodal status. As supported by previous reports, the upper-outer quadrant localization of the tumor was the most common [41], and multiple lesions were predictive of axillary metastasis [33]. In accordance with published data [42], a tumor location in the inner quadrants, in comparison with that in the upper-outer quadrants, was predictive of disease-free axilla. However, medial tumor localization has also been related to increased risk of relapse [43]. Differences in lymphatic drainage patterns from the breast have been reported between palpable and nonpalpable lesions [44]. Of note, about 63% of the cases in the current cohort were diagnosed by mammography screening, and mode of detection was a significant factor in the prediction of axillary metastasis. Although the value of mammography screening is extensively debated [45], the current findings supported the notion that mode of detection complements information on tumor features and biology [46].

The ACOSOG Z0011 trial results [4, 5], were supportive of less extensive axillary surgery in patients with 1–2 metastatic sentinel nodes [47], and underlined the importance of distinguishing between low- and high-burden metastatic involvement; accordingly, ALND was omitted in women with cT1-2 N0 disease, and those with < 3 positive sentinel nodes underwent breast-conserving therapy with adjuvant treatments. As with the ACOSOG Z0011 trial, a negative clinical axillary status was a criterion in the present study. However, eligibility criteria for the current study were independent of the surgical intervention to the breast (mastectomy or breast-conserving surgery).

Predicting low-burden disease (≤3 metastatic nodes) was more challenging than was predicting disease-free axilla or high-burden disease. Nevertheless, identifying presence of metastatic burden is valuable. On an average, 2–3 lymph nodes are removed if SLNB alone is performed for nodal staging [48] and most metastatic nodes are identified with the excision of the first three sentinel nodes [49]. To improve the accuracy of predicting 1–3 metastatic lymph nodes, inclusion of imaging features from magnetic resonance imaging into the model might be beneficial. However, MRI is not always available in the preoperative setting whereas the chosen predictors in our model are. An accurate preoperative prediction of ≤3 metastatic nodes could provide clinicians with important information supporting SLNB staging procedure but spare a majority patients from completion ALND. For patients predicted to have N2 disease, the option of neoadjuvant therapy or upfront ALND can be discussed with the patient. The proportion of patients with node-positive disease is declining, and alternative non-invasive methods to surgical staging are increasingly being explored and our prediction model of N0 aims to add knowledge in this field. With a cut-off at maximized NPV to identify those with disease-free axilla, 7.50% of patients would be spared unnecessary SLNB. In comparison, the reported FNR for the SLNB procedure has been 5–10% [1, 48] and applying these FNR cut-offs for current N0 prediction would bring the SLNB reduction rate to 17.75–27.25%. While adopting the clinically accepted false negative rate of 10% for the SLNB procedure, nearly one third of all node negative patients with a predicted N0 by the model could have been spared a surgical staging procedure.

The present study has several limitations. Besides its retrospective nature, the models were developed from a single-center cohort. Furthermore, high-burden axillary metastasis was uncommon, which impacts the generalizability of the outcome. However, the cohort originated from a prospectively maintained database, which represents a contemporary population with access to a well-established public mammography-screening program. On the other hand, the possibility to obtain information on the risk variables from a monitored source strengthened the study. Unlike results relying on diverse registries, all histopathological characteristics analyzed in the study were managed by a single breast pathologist, which helped to minimize inaccuracies. However, the optimal preoperative utility of the models requires key variables such as LVI, which may not always be achievable on core-needle biopsy [50]. Although meticulous internal validation was performed with results supporting model robustness, further external validation in an independent cohort is necessary to confirm the utility of the models as guidance tools.

Conclusions

The current study showed that nodal status is related to several independent patient and tumor characteristics, and that a nonlinear association exists between preoperatively obtainable clinicopathological variables and degree of axillary metastatic involvement. ANN models proved especially favorable in distinguishing high-burden disease and disease-free axilla and could thus be useful as a clinical decision tool in the preoperative setting imputing selected risk variables. If a threshold for classification of node-negativity were applied for high NPV and low FNR, individuals with a very low probability of axillary disease would not have been selected for SLNB by the model, and would be spared from unbeneficial axillary surgery.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ACOSOG:

American College of Surgeons Oncology Group

ALND:

Axillary lymph node dissection

ANN:

Artificial neural network

AUC:

Area under the receiver operating characteristic curve

BMI:

Body mass index

ER:

Estrogen receptor

FN:

False negative

FNR:

False negative rate

FP:

False positive

HER2:

Human epidermal growth factor receptor 2

HL:

Hosmer-Lemeshow goodness-of-fit test

LVI:

Lymphovascular invasion

MLPs:

Multi-layer perceptrons

MLR:

Multivariable logistic regression models

NPV:

Negative predictive value

PR:

Progesterone receptor

SLNB:

Sentinel lymph node biopsy

TN:

True negative

TP:

True positive

References

  1. 1.

    Giuliano AE, Kirgan DM, Guenther JM, Morton DL. Lymphatic mapping and sentinel lymphadenectomy for breast cancer. Ann Surg. 1994;220:391–8.

  2. 2.

    Kim T, Giuliano AE, Lyman GH. Lymphatic mapping and sentinel lymph node biopsy in early-stage breast carcinoma: a metaanalysis. Cancer. 2006;106:4–16.

  3. 3.

    Gentilini O, Veronesi U. Abandoning sentinel lymph node biopsy in early breast cancer? A new trial in progress at the European Institute of Oncology of Milan (SOUND: sentinel node vs observation after axillary UltraSouND). Breast. 2012;21:678–81.

  4. 4.

    Giuliano AE, Hunt KK, Ballman KV, Beitsch PD, Whitworth PW, Blumencranz PW, et al. Axillary dissection vs no axillary dissection in women with invasive breast cancer and sentinel node metastasis: a randomized clinical trial. JAMA. 2011;305:569–75.

  5. 5.

    Giuliano AE, Ballman KV, McCall L, Beitsch PD, Brennan MB, Kelemen PR, et al. Effect of axillary dissection vs no axillary dissection on 10-year overall survival among women with invasive breast cancer and sentinel node metastasis: the ACOSOG Z0011 (Alliance) randomized clinical trial. JAMA. 2017;318:918–26.

  6. 6.

    Hessman CJ, Naik AM, Kearney NM, Jensen AJ, Diggs BS, Troxell ML, et al. Comparative validation of online nomograms for predicting nonsentinel lymph node status in sentinel lymph node-positive breast cancer. Arch Surg. 2011;146:1035–40.

  7. 7.

    Coutant C, Olivier C, Lambaudie E, Fondrinier E, Marchal F, Guillemin F, et al. Comparison of models to predict nonsentinel lymph node status in breast cancer patients with metastatic sentinel lymph nodes: a prospective multicenter study. J Clin Oncol. 2009;27:2800–8.

  8. 8.

    Bishop CM. Neural networks for pattern recognition. Oxford: Clarendon Press; Oxford University Press; 1995. p. 482.

  9. 9.

    Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol. 1996;49:1225–31.

  10. 10.

    Sargent DJ. Comparison of artificial neural networks with other statistical approaches: results from medical data sets. Cancer. 2001;91:1636–42.

  11. 11.

    Burke HB, Goodman PH, Rosen DB, Henson DE, Weinstein JN, Harrell FE Jr, et al. Artificial neural networks improve the accuracy of cancer survival prediction. Cancer. 1997;79:857–62.

  12. 12.

    Lisboa PJ, Taktak AFG. The use of artificial neural networks in decision support in cancer: a systematic review. Neural Netw. 2006;19:408–15.

  13. 13.

    Doyle HR, Dvorchik I, Mitchell S, Marino IR, Ebert FH, McMichael J, et al. Predicting outcomes after liver transplantation. A connectionist approach. Ann Surg. 1994;219:408–15.

  14. 14.

    Esteva H, Nunez TG, Rodriguez RO. Neural networks and artificial intelligence in thoracic surgery. Thorac Surg Clin. 2007;17:359–67.

  15. 15.

    Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR. Improving neural networks by preventing co-adaptation of feature detectors. 2012. Report No.: arXiv:1207.0580.

  16. 16.

    Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014;15:1929–58.

  17. 17.

    Lippmann RP, Shahian DM. Coronary artery bypass risk prediction using neural networks. Ann Thorac Surg. 1997;63:1635–43.

  18. 18.

    Mocellin S, Thompson JF, Pasquali S, Montesco MC, Pilati P, Nitti D, et al. Sentinel node status prediction by four statistical models: results from a large bi-institutional series (n = 1132). Ann Surg. 2009;250:964–9.

  19. 19.

    Nieweg OE, Estourgie SH. What is a sentinel node and what is a false-negative sentinel node? Ann Surg Oncol. 2004;11:169S–73S.

  20. 20.

    Luo W, Phung D, Tran T, Gupta S, Rana S, Karmakar C, et al. Guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view. J Med Internet Res. 2016;18:e323.

  21. 21.

    von Elm E, Altman DG, Egger M, Pocock SJ, Gotzsche PC, Vandenbroucke JP, et al. Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ. 2007;335:806–8.

  22. 22.

    Ehteshami Bejnordi B, Veta M, Johannes van Diest P, van Ginneken B, Karssemeijer N, Litjens G, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA. 2017;318:2199–210.

  23. 23.

    Nowikiewicz T, Wnuk P, Malkowski B, Kurylcio A, Kowalewski J, Zegarski W. Application of artificial neural networks for predicting presence of non-sentinel lymph node metastases in breast cancer patients with positive sentinel lymph node biopsies. Arch Med Sci. 2017;13:1399–407.

  24. 24.

    Mattfeldt T, Kestler HA, Sinn HP. Prediction of the axillary lymph node status in mammary cancer on the basis of clinicopathological data and flow cytometry. Med Biol Eng Comput. 2004;42:733–9.

  25. 25.

    Karakis R, Tez M, Kihc YA, Kuru B, Guler I. A genetic algorithm model based on artificial neural network for prediction of the axillary lymph node status in breast cancer (vol 26, pg 945, 2013). Eng Appl Artif Intell. 2013;26:1641.

  26. 26.

    Mojarad S, Venturini B, Fulgenzi P, Papaleo R, Brisigotti M, Monti F, et al. Prediction of nodal metastasis and prognosis of breast cancer by ANN-based assessment of tumour size and p53, Ki-67 and steroid receptor expression. Anticancer Res. 2013;33:3925–33.

  27. 27.

    Nathanson SD, Shah R, Rosso K. Sentinel lymph node metastases in cancer: causes, detection and their role in disease progression. Semin Cell Dev Biol. 2015;38:106–16.

  28. 28.

    Bevilacqua JL, Kattan MW, Fey JV, Cody HS 3rd, Borgen PI, Van Zee KJ. Doctor, what are my chances of having a positive sentinel node? A validated nomogram for risk estimation. J Clin Oncol. 2007;25:3670–9.

  29. 29.

    Gajdos C, Tartter PI, Bleiweiss IJ. Lymphatic invasion, tumor size, and age are independent predictors of axillary lymph node metastases in women with T1 breast cancers. Ann Surg. 1999;230:692–6.

  30. 30.

    Wildiers H, Van Calster B, van de Poll-Franse LV, Hendrickx W, Roislien J, Smeets A, et al. Relationship between age and axillary lymph node involvement in women with breast cancer. J Clin Oncol. 2009;27:2931–7.

  31. 31.

    Lodi M, Scheer L, Reix N, Heitz D, Carin AJ, Thiebaut N, et al. Breast cancer in elderly women and altered clinico-pathological characteristics: a systematic review. Breast Cancer Res Treat. 2017;166:657–68.

  32. 32.

    Yang ZJ, Yu Y, Hou XW, Chi JR, Ge J, Wang X, et al. The prognostic value of node status in different breast cancer subtypes. Oncotarget. 2017;8:4563–71.

  33. 33.

    Viale G, Zurrida S, Maiorano E, Mazzarol G, Pruneri G, Paganelli G, et al. Predicting the status of axillary sentinel lymph nodes in 4351 patients with invasive breast carcinoma treated in a single institution. Cancer. 2005;103:492–500.

  34. 34.

    Jiang Y, Xu H, Zhang H, Ou X, Xu Z, Ai L, et al. Nomogram for prediction of level 2 axillary lymph node metastasis in proven level 1 node-positive breast cancer patients. Oncotarget. 2017;8:72389–99.

  35. 35.

    Tawfik K, Kimler BF, Davis MK, Fan F, Tawfik O. Ki-67 expression in axillary lymph node metastases in breast cancer is prognostically significant. Hum Pathol. 2013;44:39–46.

  36. 36.

    Prat A, Cheang MC, Martin M, Parker JS, Carrasco E, Caballero R, et al. Prognostic significance of progesterone receptor-positive tumor cells within immunohistochemically defined luminal a breast cancer. J Clin Oncol. 2013;31:203–9.

  37. 37.

    Prat A, Martin M, Nielsen TO, Perou CM. Reply to Y.Yamamoto et al. J Clin Oncol. 2013;31:2517–8.

  38. 38.

    Wasif N, Maggard MA, Ko CY, Giuliano AE. Invasive lobular vs. ductal breast cancer: a stage-matched comparison of outcomes. Ann Surg Oncol. 2010;17:1862–9.

  39. 39.

    Adachi Y, Ishiguro J, Kotani H, Hisada T, Ichikawa M, Gondo N, et al. Comparison of clinical outcomes between luminal invasive ductal carcinoma and luminal invasive lobular carcinoma. BMC Cancer. 2016;16:248.

  40. 40.

    Vandorpe T, Smeets A, Van Calster B, Van Hoorde K, Leunen K, Amant F, et al. Lobular and non-lobular breast cancers differ regarding axillary lymph node metastasis: a cross-sectional study on 4,292 consecutive patients. Breast Cancer Res Treat. 2011;128:429–35.

  41. 41.

    Sohn VY, Arthurs ZM, Sebesta JA, Brown TA. Primary tumor location impacts breast cancer survival. Am J Surg. 2008;195:641–4.

  42. 42.

    Chen K, Liu J, Li S, Jacobs L. Development of nomograms to predict axillary lymph node status in breast cancer patients. BMC Cancer. 2017;17:561.

  43. 43.

    Lohrisch C, Jackson J, Jones A, Mates D, Olivotto IA. Relationship between tumor location and relapse in 6,781 women with early invasive breast cancer. J Clin Oncol. 2000;18:2828–35.

  44. 44.

    Estourgie SH, Nieweg OE, Olmos RA, Rutgers EJ, Kroon BB. Lymphatic drainage patterns from the breast. Ann Surg. 2004;239:232–7.

  45. 45.

    Welch HG, Prorok PC, O’Malley AJ, Kramer BS. Breast-cancer tumor size, overdiagnosis, and mammography screening effectiveness. N Engl J Med. 2016;375:1438–47.

  46. 46.

    Drukker CA, Schmidt MK, Rutgers EJ, Cardoso F, Kerlikowske K, Esserman LJ, et al. Mammographic screening detects low-risk tumor biology breast cancers. Breast Cancer Res Treat. 2014;144:103–11.

  47. 47.

    Poodt IGM, Spronk PER, Vugts G, van Dalen T, Peeters M, Rots ML, et al. Trends on axillary surgery in nondistant metastatic breast cancer patients treated between 2011 and 2015: a Dutch population-based study in the ACOSOG-Z0011 and AMAROS Era. Ann Surg. Ann Surg. 2018;268:1084–1090.

  48. 48.

    Krag DN, Anderson SJ, Julian TB, Brown AM, Harlow SP, Ashikaga T, et al. Technical outcomes of sentinel-lymph-node resection and conventional axillary-lymph-node dissection in patients with clinically node-negative breast cancer: results from the NSABP B-32 randomised phase III trial. Lancet Oncol. 2007;8:881–8.

  49. 49.

    McCarter MD, Yeung H, Fey J, Borgen PI, Cody HS 3rd. The breast cancer patient with multiple sentinel nodes: when to stop? J Am Coll Surg. 2001;192:692–7.

  50. 50.

    Harris GC, Denley HE, Pinder SE, Lee AH, Ellis IO, Elston CW, et al. Correlation of histologic prognostic factors in core biopsies and therapeutic excisions of invasive breast carcinoma. Am J Surg Pathol. 2003;27:11–5.

Download references

Acknowledgements

The authors acknowledge the late Dr. Dorthe Aamand Grabau for meticulous pathological review of all specimens.

Funding

This study was supported by grants from The Swedish Breast Cancer Association (BRO), the Skåne County Councils Research and Developmental Foundation, the Governmental Funding of Clinical Research within the National Health Service (ALF), The Erling Persson Family Foundation and the Strategic Centre for Translational Cancer Research-CREATE Health. The funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Study concept and design: LD, MO, LR. Acquisition, analysis, or interpretation of data: LD, POB, PE, MO, LR. Drafting of the manuscript: LD, MO, LR. Critical revision of the manuscript for important intellectual content: LD, POB, PE, MO, LR. Statistical analysis: LD, POB, PE, MO. Obtained funding: LR. Administrative, technical, or material support: MO, LR. Study supervision: MO, POB, LR. All authors read and approved the final manuscript.

Authors’ information

Looket Dihge is a board-certified general surgeon and registrar in plastic surgery at Skåne University Hospital holding a PhD from Lund University. She has a specific interest in predictive risk models for nodal metastasis and aims to apply this approach also in the field of plastic surgery predicting postoperative results after reconstructive surgery.

Correspondence to Lisa Rydén.

Ethics declarations

Ethics approval and consent to participate

Patients gave verbal informed consent to participate at time of diagnosis and the ethics committee at Lund University approved this procedure (LU 2013/340). Patients were informed that they had the opportunity to opt-out if they were not willing to participate in the study.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Sensitivity analysis of the assigned importance of the top rank predictive variables for the three models. Sensitivity analysis of the assigned importance of the top rank predictive variables for N0 vs. N+, N1 vs. N0 and N2 vs N0 and N1 linearly scaled into a summation of 1. (PDF 1083 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Keywords

  • Breast cancer
  • Sentinel lymph node biopsy
  • Nodal status
  • Artificial neural networks
  • Prediction models