Skip to main content
  • Research article
  • Open access
  • Published:

Development and validation of a new predictive model for breast cancer survival in New Zealand and comparison to the Nottingham prognostic index



The only available predictive models for the outcome of breast cancer patients in New Zealand (NZ) are based on data in other countries. We aimed to develop and validate a predictive model using NZ data for this population, and compare its performance to a widely used overseas model, the Nottingham Prognostic Index (NPI).


We developed a model to predict 10-year breast cancer-specific survival, using data collected prospectively in the largest population-based regional breast cancer registry in NZ (Auckland, 9182 patients), and assessed its performance in this data set (internal validation) and in an independent NZ population-based series of 2625 patients in Waikato (external validation). The data included all women with primary invasive breast cancer diagnosed from 1 June 2000 to 30 June 2014, with follow up to death or Dec 31, 2014. We used multivariate Cox proportional hazards regression to assess predictors and to calculate predicted 10-year breast cancer mortality, and therefore survival, probability for each patient. We assessed observed survival by the Kaplan Meier method. We assessed discrimination by the C statistic, and calibration by comparing predicted and observed survival rates for patients in 10 groups ordered by predicted 10-year survival. We compared this NZ model with the Nottingham Prognostic Index (NPI) in this validation data set.


Discrimination was good: C statistics were 0.84 for internal validity and 0.83 for an independent external validity. For calibration, for both internal and external validity the predicted 10-year survival probabilities in all groups of patients, ordered by predicted survival, were within the 95% confidence intervals (CI) of the observed Kaplan-Meier survival probabilities. The NZ model showed good discrimination even within the prognostic groups defined by the NPI.


These results for the New Zealand model show good internal and external validity, transportability, and potential clinical value of the model, and its clear superiority over the NPI. Further research is needed to assess other potential predictors, to assess the model’s performance in specific subgroups of patients, and to compare it to other models, which have been developed in other countries and have not yet been tested in NZ.

Peer Review reports


Currently clinicians in New Zealand (NZ) estimate a woman’s likely mortality or survival after a diagnosis of breast cancer (her prognosis) based on experience and clinical judgement. They may use some of the statistical models developed in other countries, such as the Nottingham Prognostic Index (NPI) [1]. A recent review of published data up to July 2012 found 996 articles, from which six prognostic models were identified based on clinical and pathological features [2]. These models were the NPI, (the earliest, published in 1982) [1], Adjuvant! [3], BC Nomogram [4], Options [5], Predict (and Predict+) [6,7,8], and CancerMath [9]. Validation studies are limited. All these models were developed in European or United States populations, and the few validation studies in other populations show less accurate prediction [10,11,12]. The only published validation of these models in NZ is a small study involving one of the current authors [13], and no work in Maori or Pacific populations has been done. The models are less accurate in younger and in older patients, e.g., under 40 years and over 75 years [14, 15]. These models use accepted clinical and pathological indicators, such as tumour size, nodal involvement, and receptor status. In addition to these factors, studies in NZ show that there are significant differences in breast cancer outcomes by social deprivation, rural residence, comorbidity, type of health care [16,17,18]. There are important differences by ethnicity, with Maori and Pacific women being at greater risks of death and recurrence than European NZ women, and these ethnic differences are complex and related to both clinic-pathological and demographic factors [16, 18,19,20]. Thus we aimed to develop and validate a model, based on NZ patient experience, to predict breast cancer outcomes. If such a model were shown to have acceptable accuracy, it would help clinicians as well as patients and their families, and would facilitate patient-doctor communication and clinical decision making. In this paper, we present a model, the New Zealand Model (NZM), developed to reliably categorise NZ patients into groups by their probability of breast cancer-specific survival within 10 years of an initial breast cancer diagnosis, and compare it to the NPI.


Patients and data

We used the data collected prospectively through the two largest and longest-established population-based regional breast cancer registries in NZ, in the Auckland and Waikato regions. These two regional registries are linked to include over 40% of all patients with breast cancer in NZ, and are representative of NZ women in terms of socioeconomic, demographic and ethnic background [16, 21]. The registries are linked to national mortality data and to the legally-mandated national cancer registry [22] and to other hospital discharge data to assess co-morbidity [22]; comparisons show that the registries are very complete (over 95%) [22]. Mortality and recurrences were documented from regular hospital follow-up, or for patients discharged from regular hospital follow up, from information provided by primary care and private practice physicians, updated annually or more frequently.

The data used are for all women diagnosed with a first primary invasive breast cancer between 1 January 2000 and 30 June 2014, followed up to death or to 31 Dec 2014. Data were missing or incomplete for up to 5% of several items, and 10% for numbers of involved nodes, so we used complete case analysis rather than imputation techniques. There were 10,586 such women in the Auckland registry, of whom 9182 had complete data; their data were used to develop the prediction model and assess its internal validity. Data from the Waikato registry, on 3071 total women and 2625 with complete data, were used to assess the model’s external validity.

Predictors of breast cancer mortality

Predictors of breast cancer mortality (and therefore of disease specific survival) were selected based on their accepted clinical importance, and then their empirical performance as predictors of 10-year breast cancer mortality [18, 23, 24]. The predictors considered were age at diagnosis, ethnicity, tumour size, number of positive lymph nodes, tumour grade, presence of metastasis at diagnosis (Stage 4), estrogen (ER) and progesterone (PR) receptor status, human epidermal growth factor receptor2 (HER2) status, histological type of tumour, and lymphovascular invasion (LVI). Menopausal status was not retained as it is strongly linked to age. We assessed factors as both continuous and categorical variables, but used categories in the final model. Patient ethnicity was identified from the breast cancer registries or where not available, from the national cancer registry or mortality data following NZ Ministry of Health ethnicity data protocols [25]. Ethnicity was categorized into NZ European, Māori, Pacific, and Other. Cancer stage at diagnosis was defined according to the Tumour, Node, and Metastasis (TNM) system [26]. Invasive tumour grade was defined according to the Elston and Ellis modified Scarff-Bloom-Richardson breast cancer grading system [27]. Estrogen (ER) and progesterone (PR) receptor status was based on the results of immunohistochemistry tests and classified as positive with 1% or more receptor positive cells [28], and grouped as both ER and PR positive, both negative, and either positive/negative or negative/positive status. HER-2 status was based on a Fluorescent In-Situ Hybridization (FISH) test or when this was not available, on immunohistochemistry [29], and categorised as positive, negative/equivocal, or not tested. HER2 assessment was introduced only in 2006.

Statistical methods

We performed Cox multivariate proportional hazards regression analysis. The outcome variable was defined as time from diagnosis until death due to breast cancer, with censoring at date of death from other causes, or 31 Dec 2014. Analyses were performed with SAS version IC.11 [30] and R version 3.2.5 [31].

The New Zealand model (NZM) was built using the Auckland data base. Various models were developed using continuous or categorical variables, with an improvement in goodness-of-fit assessed by a reduction in the Akaike Information Criterion (AIC) [32]. The models fit a mortality function, S(t) as the probability of mortality for time t, dependent on S0(t), the baseline mortality probability for time t, and Xβ, the linear combination of predictors of breast cancer mortality [32]. We obtained the estimated 10-year mortality probability at baseline, by setting the predictors to their reference levels and fitting the Cox multivariate regression. Then we used the mortality function [32], and computed 10-year mortality for each patient in the Auckland database. We tested the proportional hazards assumption using graphical log-log survival plots and the method of weighted residuals [33], and we tested for goodness-of-fit using the procedure of May and Hosmer [34].

We fitted a model with continuous variables of age, tumour size, and number of positive lymph nodes, and categorical variables of ethnicity, tumour stage, tumour grade, ER and PR receptors, HER2 status, histological type of tumour, and LVI. Then we fitted the model with categorical variables of all the predictors, and found that the fit improved, omitting LVI which was no longer significant (p > 0.05). Graphical representation of the model was done with a regression nomogram, enhanced with distribution of covariates shown by scaled-to-frequency boxes [35] and produced with R function regplot [36].

We assessed internal validity using the Auckland database. Internal validity was assessed with bootstrapping (200 replications). Bootstrapping samples were created by drawing random samples with replacement from the Auckland database. To assess discrimination, we used the C statistic [32]. The prediction model was fitted on each bootstrap sample and tested on the original sample. To assess calibration, we divided patients in the Auckland database into ten groups, ordered by their predicted 10-year breast cancer survival. We then compared for each group the mean of the predicted 10-year breast cancer survival with the observed 10-year breast cancer survival [32, 37] calculated by the Kaplan-Meier method [38].

To assess external validity and transportability of the model, we applied the Auckland-developed model to an independent data set, the Waikato registry. External validity of the model was assessed by bootstrapping (200 replications), using the Waikato database. For the Waikato data, we assessed the C statistic, and compared predicted and observed 10-year breast cancer survival in groups ordered by predicted survival. Since there were few patients with predicted survival under 30%, we combined patients with 10-year breast cancer survivals of 0–30% in one group, leaving 8 groups.

There are no simple calculations of statistical power in predictive models, but assessing external validity, a minimum of 100 events has been recommended for mortality analysis [39, 40]. Another study suggests a minimum of 10 events per predictor for proportional hazards regression [41]. We estimated that in the smaller external validation data registry there were 282 breast cancer specific deaths, giving 31 events per predictor variable, which was adequate.

Comparison with NPI

The NPI was calculated for patients in the Waikato data, based on tumour size, pathological grade, and number of positive nodes [42], which replaces nodal stage used originally [1], as in other validation studies of the NPI [43, 44]. In keeping with the development of the NPI, only patients with stage 1–3 breast cancer, and tumour size> 0 cm were included [1]. Thus 46 patients with Stage IV tumours and 2 with missing tumour grade were excluded, so the comparison was done in 2579 patients. Following NPI methods, we classified patients into three NPI prognostic groups, defined as good (NPI < 3.4), moderate (3.4 to 5.4), and poor (NPI > 5.4) [45]. Within these subgroups, subdivided by deciles of breast cancer-specific survival predicted by the NZ model, we compared the predicted and observed breast cancer-specific mortality. The mortality probability predicted by the NZ model in a subgroup was the mean of all the predicted probabilities generated for all patients in that subgroup. The model was considered accurate if the predicted outcome was within the 95% confidence interval (95%CI) of the observed outcome. An a priori alpha level of 0.05 was used. The difference of means of breast cancer-specific mortality predicted by the NZM in the three NPI groups was tested using one-way ANOVA. Because of the unequal sample size between the groups, a Tukey post hoc test was used for pairwise comparisons of means [46].


For the 9182 eligible women in the Auckland database, there were 864 breast cancer specific deaths over the 14-year time period; median follow up time was 67.6 months, and mean age of patients 56.9 years (Table 1). Patients were predominantly Stage 1 (43%) and 2 (39%), ER and PR positive (79 and 68%), HER-2 negative (69%), without lymphovascular invasion (73%), and with ductal tumours (81%). Of the patients, 71% were of NZ European ethnic group, with 8% Maori, 7% Pacific, and 14% other (such as Asian countries).

Table 1 Features of patients included in the derivation data set (Auckland) and the independent validation data set (Waikato)

The predictors in the final model are shown in Table 2. The risk of breast cancer mortality within 10 years of diagnosis increased significantly with age being over 70 years; higher tumour grade, larger tumour size, greater number of positive lymph nodes, presence of metastases at diagnosis, and with ER or PR negative tumours. Mortality risk was reduced with HER2 status positive, and histological types other than ductal or lobular. NZ European, Maori, and Pacific women had similar risks of mortality, but it was lower in patients in the ‘Other’ ethnic group. When tested for proportional hazards, most of the covariates met the criterion, but overall, the global criterion was not met. Nevertheless when the model was tested for goodness-of-fit [34] it was found to be an adequate fit (p = 0.26).

Table 2 Predictors of 10 year breast cancer mortality, Auckland data, 1 Jan 2000–31 Dec 2014

Fitting the model to the Auckland data, assessing internal validity, gave a C statistic of 0.84. Calibration is shown as survival rather than mortality as that is usually used clinically, comparing predicted and observed 10-year disease specific survival (1-mortality). Figure 1 (and Additional file 1: Table S1) show that for patients divided into ten groups, based on predicted 10-year breast cancer survival of 0–9%, 10–19%, etc., the predicted survivals were within the 95% confidence interval (CI) of observed survival for all groups.

Fig. 1
figure 1

Internal validity: 10-year breast cancer specific survival as predicted by the NZ model (horizontal axis) for 10 groups of patients in the derivation data set (Auckland), grouped by predicted survival,compared to observed (Kaplan-Meier) survival and its 95% confidence limits (vertical axis). Line of identity between predicted and observed survival shown

To assess external validity, the model developed in the Auckland data was applied to the independent data set, 2625 patients in the Waikato registry, with 282 deaths. These patients were of similar mean age to those in Auckland, but with relatively more Maori, fewer Pacific, and fewer of ‘other’ ethnic background (other than Maori, Pacific, and European NZ). The distributions of clinico-pathological factors was generally similar (Table 1). The Auckland-derived model showed good discrimination in this independent data set, with the C statistic being 0.83. For calibration (Fig. 2), from 10 groups ordered by predicted survival, the three groups with the lowest survival had few patients and were combined into a 0–30% group. In all the eight groups, the predicted 10-year breast cancer survivals were within the 95% confidence interval of the observed 10-year breast cancer survival.

Fig. 2
figure 2

External validity: 10-year breast cancer specific survival as predicted by the NZ model (horizontal axis) for 8 groups of patients in the validation data set (Waikato), grouped by predicted survival, using Auckland derived model, compared to observed (Kaplan-Meier) survival and its 95% confidence limits (vertical axis). Line of identity between predicted and observed survival shown

Representation of the final model by a nomogram is shown in Fig. 3. It is ranked showing the greatest contribution to the regression from top downwards. The figure also shows the total score, and its distribution, and 10-year (120 month) risk for a person with the indicated set of predictors (2.5% estimated mortality risk).

Fig. 3
figure 3

A nomogram of the fitted model, showing the relative contribution of variables to the model and also, by relative sizes of the boxes, the distribution of each. The distribution of the total score is also shown and the dots show a particular person with 10-year mortality risk of 2.5% (95% CI 1.3–3.6%)

The NPI was also applied to the independent data set, dividing patients into three prognostic groups (Table 3). While the mean predicted 10-year breast cancer survival showed significant differences and a trend, from 96.1% in ‘good’, 84.0% in ‘moderate’, to 57.8% in the ‘poor’ NPI groups, the range of predictions for individual patients was very large within each group. Within each of these NPI subgroups, the NZ model shows good discrimination and predictive ability (Table 4). In all assessable subgroups, the predicted breast cancer deaths were within the 95% CI of the observed deaths. Within the ‘good’ prognosis NPI group, patients fall into the highest 8 of the 10 deciles by predicted survival using the NZ model, with predicted survival rates ranging from 80 to 99%. Within the NPI ‘poor’ prognosis group, all but 3 patients are in the lowest 5 deciles, with predicted survival rates ranging from 42 to 88%.

Table 3 10-year survival predicted by the NZ model in three NPI prognostic groups
Table 4 Observed and predicted breast cancer survival by NPI prognostic groups and decile of risk predicted by the NZ model


In this study we used demographic, clinical and pathologic factors to build a statistical model, the NZ model, to estimate the probability of breast-cancer specific survival, or equivalently death, within 10 years of diagnosis in women diagnosed with primary invasive breast cancer in NZ. Our results confirmed that many factors affect a woman’s prognosis; age at diagnosis, number of positive lymph nodes, tumour size, tumour grade, presence of metastases at diagnosis, histological type of tumour, ER and PR receptors status, and HER2 status, all had significant associations with breast cancer specific survival. These factors have been used to predict breast cancer outcomes by several studies [4, 6, 7, 9, 14, 17, 47].

In terms of validity and reliability, we found that our predictive model performed well even in external validation on an independent data set. The C statistic was 0.83 for independent validation, compared to 0.84 for internal validation in the data from which it was derived, showing good discrimination [37]. Calibration assessment indicated good agreement between predicted and observed 10-year breast cancer specific survival. In both internal and external validation,predicted survivals were within the 95% CI of observed survival probabilities in all groups of patients. These are accepted approaches for assessing discrimination and calibration of prediction models of breast cancer outcomes [7, 14, 32, 37, 47]. Power was satisfactory: we had 282 breast cancer specific deaths, and 31 events per predictor variable in the smaller validation cohort, compared to recommendations of 100 events [39, 40], and 10 events per predictor variable being adequate [41]. In future work, we will explore the model’s performance in further groups of patients, and ultimately its validity at the individual patient level.

The optimum presentation of the results of any model is a complex issue. In Fig. 3 the model is represented by a regression nomogram; this way of representing risk models is becoming common and has been recommended [48]. The nomogram is enhanced with covariate and total distributions. When active on a computer it can present interactive calculations of individuals’ risks. We see the applications of our prognostic model in assisting informed decision-making by women with breast cancer, their doctors and their carers. In further research we will assess the most effective ways to use the prognostic estimates. Many current models use overly complex language in the presentation of results, such that one study found that fewer than half of patients understood their prognosis after an oncologist’s consultation [49]. Health literacy issues need to be considered as they contribute to health inequities [50, 51].

There are few comparisons of more than one risk prediction model applied to the same independent patient population. A recent study showed similar performance of three models in patients recorded in an international tissue bank, but these patients are not representative of all incident patients [47]. A 2009 review [52] concluded that the Nottingham index has been the most tested, and only two other models had any published validation in an independent population. Most validation studies have used 1000 patients or fewer [2]. Several other models are based on genetic profiles or novel biomolecular factors, e.g. Oncotype DX [11, 53], but not also including clinical and pathological factors.

In New Zealand, the patient’s ethnicity, with the three largest groups being NZ European, Maori, and Pacific, is of great general importance, so we kept ethnicity in the prognostic model. Maori and Pacific women in NZ with breast cancer have a worse prognosis and higher mortality [23, 54], however, the current results show that Maori or Pacific ethnicity have no independent predictive value on 10-year survival, once detailed clinical and pathological factors are taken into account. This suggests the ethnicity factors are mediated through these clinical and pathological variables, and agrees with other detailed analyses [16]. Thus, while at the population level different approaches may be needed to overcome the disparities in outcomes of the different ethnic groups, at a clinical level the assessment of prognosis in each women may not be affected by her ethnic group. Thus, for example, in women whose breast cancers have been screen-detected, outcomes are equivalent in Maori and non-Maori women in NZ [55]. However, there may be ethnic differences in total mortality and in morbidity or recurrence, so in further work we will assess whether ethnic-specific prognostic models have any advantages.

Among the strengths of our modelling process is that we have used large datasets for both development (over 9000 patients) and validation (over 2600 patients), and both are population-based, including virtually all diagnosed patients [22]. These women have undergone routine clinical treatment and their survival experience will reflect this and depend on their diagnosis date, from 2000 to 2014. As one comparison, the UK developed ‘Predict’ model was based on 5738 patients with complete data diagnosed from 1999 to 2003 [8], but has been shown to be applicable to several other populations. All predictive models based on actual patient experience will be limited by the assessment and treatments available at the time they were diagnosed.

We developed our model based on the patients with data available on all relevant factors, as in its application to new patients all information is likely to be available as it can be actively sought. Incomplete data were more common in patients with more advanced disease; probably because documentation of features of the primary disease such as tumour size or number of involved nodes is less relevant clinically in these patients. It implies that the prognostic model may be less accurate in patients with metastatic disease. For patients with some missing data, associations between known factors and 10-year survival were generally similar to those of patients with complete data.

We did not include treatment variables in our models. The models are derived from data on the experience of unselected population-based series of breast cancer patients. Their treatment will have been guided by international best practice, summarised in NZ clinical guidelines [56] and standards of service [57]. Many patients may not receive the standard recommended treatment for various reasons including patient choice, comorbidity, and barriers to access of health care. In future work, we will explore effects of treatment factors, both in regard to not receiving recommended treatments, and also receiving new treatments; to do this, we will include in the model estimates of treatment effects, where possible based on the results of randomised trials.

The NPI was the first breast cancer prognostic model published [1], has had the most extensive validation [2], and is still widely used. It does not predict survival for each patient, but divides patients into prognostic groups, usually three groups. It is based on three factors incorporated in the NZ model and so the two indices are related. We have shown here that the NPI is only very approximate, there being wide variations in survival within each NPI prognostic group. We have demonstrated that within each NPI group the NZ model subdivides patients into smaller groups efficiently, with good correlation between predicted and observed ten-year breast cancer specific survival rates. The NZ model could replace the NPI in those situations were the NPI is being used.

We will do further work to compare our model’s performance with that of other available models, developed in other countries. We will also assess other potential predictive factors and their effects on the performance of our model. These will include treatments, and comorbidity to account for associated health conditions of breast cancer patients. We will also assess other outcomes of breast cancer, such as overall mortality, local recurrence rate, and recurrence rate.


We have developed a NZ specific predictive model, which has good validity to predict breast cancer mortality in women with primary invasive breast cancer in NZ. The model is clearly superior to the widely used NPI, and categorises patients by predicted survival even within categories of the NPI. The NZ model shows potential to have important clinical value.



Estrogen receptor


Fluorescent in-situ hybridization


Human epidermal growth factor receptor 2


Hazard ratio




Lympho-vascular invasion


Nottingham prognostic index


New Zealand


New Zealand model


Progesterone receptor


Tumour, Node, Metastasis


  1. Haybittle JL, Blamey RW, Elston CW, Johnson J, Doyle PJ, Campbell FC, et al. A prognostic index in primary breast cancer. Br J Cancer. 1982;45:361–6.

    Article  CAS  Google Scholar 

  2. Engelhardt EG, Garvelink MM, de Haes JH, van der Hoeven JJ, Smets EM, Pieterse AH, et al. Predicting and communicating the risk of recurrence and death in women with early-stage breast cancer: a systematic review of risk prediction models. J Clin Oncol. 2014;32:238–50.

    Article  Google Scholar 

  3. Ravdin PM, Siminoff LA, Davis GJ, Mercer MB, Hewlett J, Gerson N, et al. Computer program to assist in making decisions about adjuvant therapy for women with early breast cancer. J Clin Oncol. 2001;19:980–91.

    Article  CAS  Google Scholar 

  4. Kattan MW, Giri D, Panageas KS, Hummer A, Cranor M, Van Zee KJ, et al. A tool for predicting breast carcinoma mortality in women who do not receive adjuvant therapy. Cancer. 2004;101:2509–15.

    Article  Google Scholar 

  5. Campbell HE, Gray AM, Harris AL, Briggs AH, Taylor MA. Estimation and external validation of a new prognostic model for predicting recurrence-free survival for early breast cancer patients in the UK. Br J Cancer. 2010;103:776–86.

    Article  CAS  Google Scholar 

  6. Wishart GC, Bajdik CD, Azzato EM, Dicks E, Greenberg DC, Rashbass J, et al. A population-based validation of the prognostic model PREDICT for early breast cancer. Eur J Surg Oncol. 2011;37:411–7.

    Article  CAS  Google Scholar 

  7. Wishart GC, Bajdik CD, Dicks E, Provenzano E, Schmidt MK, Sherman M, et al. PREDICT plus: development and validation of a prognostic model for early breast cancer that includes HER2. Br J Cancer. 2012;107:800–7.

    Article  CAS  Google Scholar 

  8. Candido Dos Reis FJ, Wishart GC, Dicks EM, Greenberg D, Rashbass J, Schmidt MK, et al. An updated PREDICT breast cancer prognostication and treatment benefit prediction model with independent validation. Breast Cancer Res. 2017;19:58–0852.

    Article  Google Scholar 

  9. Michaelson JS, Chen LL, Bush D, Fong A, Smith B, Younger J. Improved web-based calculators for predicting breast carcinoma outcomes. Breast Cancer Res Treat. 2011;128:827–35.

    Article  Google Scholar 

  10. Bhoo-Pathy N, Yip CH, Hartman M, Saxena N, Taib NA, Ho GF, et al. Adjuvant! Online is overoptimistic in predicting survival of Asian breast cancer patients. Eur J Cancer. 2012;48:982–9.

    Article  Google Scholar 

  11. Yorozuya K, Takeuchi T, Yoshida M, Mouri Y, Kousaka J, Fujii K, et al. Evaluation of Oncotype DX recurrence score as a prognostic factor in Japanese women with estrogen receptor-positive, node-negative primary stage I or IIA breast cancer. J Cancer Res Clin Oncol. 2010;136:939–44.

    Article  CAS  Google Scholar 

  12. Ishitobi M, Goranova TE, Komoike Y, Motomura K, Koyama H, Glas AM, et al. Clinical utility of the 70-gene MammaPrint profile in a Japanese population. Jpn J Clin Oncol. 2010;40:508–12.

    Article  Google Scholar 

  13. Van Belle V, Van CB, Brouckaert O, Vanden Bempt I, Pintens S, Harvey V, et al. Qualitative assessment of the progesterone receptor and HER2 improves the Nottingham prognostic index up to 5 years after breast cancer diagnosis. J Clin Oncol. 2010;28:4129–34.

    Article  Google Scholar 

  14. Olivotto IA, Bajdik CD, Ravdin PM, Speers CH, Coldman AJ, Norris BD, et al. Population-based validation of the prognostic model ADJUVANT! For early breast cancer. J Clin Oncol. 2005;23:2716–25.

    Article  Google Scholar 

  15. Maishman T, Copson E, Stanton L, Gerty S, Dicks E, Durcan L, et al. An evaluation of the prognostic model PREDICT using the POSH cohort of women aged 40 years at breast cancer diagnosis. Br J Cancer. 2015;112:983–91.

    Article  CAS  Google Scholar 

  16. Tin Tin S, Elwood JM, Brown C, Sarfati D, Campbell I, Scott N, et al. Ethnic disparities in breast cancer survival in New Zealand: which factors contribute? BMC Cancer. 2018;18:58.

    Article  Google Scholar 

  17. Tin Tin S, Elwood JM, Lawrenson R, Campbell I, Harvey V, Seneviratne S. Differences in breast cancer survival between public and private care in New Zealand: which factors contribute? PLoS One. 2016;11:e0153206.

    Article  CAS  Google Scholar 

  18. Seneviratne S, Campbell I, Scott N, Shirley R, Peni T, Lawrenson R. Ethnic differences in breast cancer survival in New Zealand: contributions of differences in screening, treatment, tumor biology, demographics and comorbidities. Cancer Causes Control. 2015;26:1813–24.

    Article  Google Scholar 

  19. Brown C, Lao C, Lawrenson R, Tin Tin S, Schaaf M, Kidd J, et al. Characteristics and differences between Pasifika women and New Zealand European women diagnosed with breast cancer in New Zealand. N Z Med J. 2017;130:50–61.

    PubMed  Google Scholar 

  20. Campbell I, Scott N, Seneviratne S, Kollias J, Walters D, Taylor C, et al. Breast cancer characteristics and survival differences between Maori, Pacific and other New Zealand women included in the quality audit program of breast surgeons of Australia and New Zealand. Asian Pac J Cancer Prev. 2015;16:2465–72.

    Article  Google Scholar 

  21. Lawrenson R, Lao C, Campbell I, Harvey V, Seneviratne S, Edwards M, et al. Treatment and survival disparities by ethnicity in New Zealand women with stage I-III breast cancer tumour subtypes. Cancer Causes Control. 2017;28:1417–27.

    Article  Google Scholar 

  22. Seneviratne S, Campbell I, Scott N, Shirley R, Peni T, Lawrenson R. Accuracy and completeness of the New Zealand Cancer registry for staging of invasive breast cancer. Cancer Epidemiol. 2014;38:638–44.

    Article  Google Scholar 

  23. Seneviratne SA, Campbell ID, Scott N, Lawrenson RA, Shirley R, Elwood JM. Risk factors associated with mortality from breast cancer in Waikato, New Zealand: a case-control study. Public Health. 2015;129:549–54.

    Article  CAS  Google Scholar 

  24. Quintyne KI, Woulfe B, Coffey JC, Merrigan A, Gupta RK. Lymph node ratio in sentinel lymph node biopsy era: are we losing prognostic information? Clin Breast Cancer. 2017;17:117–26.

    Article  Google Scholar 

  25. Ministry of Health. Ethnicity Data Protocols for the Health and Disability Sector. Wellington: Ministry of Health; 2004.

    Google Scholar 

  26. American Joint Committee on Cancer. Cancer staging manual. 7th ed. New York: Springer; 2010.

    Book  Google Scholar 

  27. Elston CW, Ellis IO. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991;19:403–10.

    Article  CAS  Google Scholar 

  28. Hammond ME, Hayes DF, Dowsett M, Allred DC, Hagerty KL, Badve S, et al. American Society of Clinical Oncology/College of American Pathologists guideline recommendations for immunohistochemical testing of estrogen and progesterone receptors in breast cancer (unabridged version). Arch Pathol Lab Med. 2010;134:e48–72.

    PubMed  CAS  Google Scholar 

  29. Pauletti G, Godolphin W, Press MF, Slamon DJ. Detection and quantitation of HER-2/neu gene amplification in human breast cancer archival material using fluorescence in situ hybridization. Oncogene. 1996;13:63–72.

    PubMed  CAS  Google Scholar 

  30. Stata Corporation. Statistical Software: Release 14. College Station, Texas, USA: Stata Corporation; 2014.

    Google Scholar 

  31. CRAN CRAN. R: a language and environment for statistical computing. Version 3.2.5. Vienna, Austria: R Foundation for Statistical Computing; 2014.

    Google Scholar 

  32. Harrell FE, Jr.: Regression modeling strategies: with applications to linear models, logistic and ordinal regression, and survival analysis. New York: Springer; 2015.

  33. Grambsch PM, Therneau T. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81:515–26.

    Article  Google Scholar 

  34. May S, Hosmer DW. A simplified method of calculating an overall goodness-of-fit test for the cox proportional hazards model. Lifetime Data Anal. 1998;4:109–20.

    Article  CAS  Google Scholar 

  35. Marshall RJ. Are regression "nomograms" useful? J Clin Epidemiol. 2016;78:4–6. Epub@2016 Apr 7.: 4-6.

    Article  PubMed  Google Scholar 

  36. Marshall RJ. regplot: Enhanced Nomogram Plot R package version 0.1. . 2016.

    Google Scholar 

  37. Steyerberg EW. Clinical prediction models. New York: Springer; 2010.

  38. Jager KJ, van Dijk PC, Zoccali C, Dekker FW. The analysis of survival data: the Kaplan-Meier method. Kidney Int. 2008;74:560–5.

    Article  Google Scholar 

  39. Jinks RC, Royston P, Parmar MK. Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data. BMC Med Res Methodol. 2015;15(82) 82-0078.

  40. Vergouwe Y, Steyerberg EW, Eijkemans MJ, Habbema JD. Substantial effective sample sizes were required for external validation studies of predictive logistic regression models. J Clin Epidemiol. 2005;58:475–83.

    Article  Google Scholar 

  41. Peduzzi P, Concato J, Feinstein AR, Holford TR. Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. J Clin Epidemiol. 1995;48:1503–10.

    Article  CAS  Google Scholar 

  42. Galea MH, Blamey RW, Elston CE, Ellis IO. The Nottingham prognostic index in primary breast cancer. Breast Cancer Res Treat. 1992;22:207–19.

    Article  CAS  Google Scholar 

  43. D'Eredita G, Giardina C, Martellotta M, Natale T, Ferrarese F. Prognostic factors in breast cancer: the predictive value of the Nottingham prognostic index in patients with a long-term follow-up that were treated in a single institution. Eur J Cancer. 2001;37:591–6.

    Article  CAS  Google Scholar 

  44. Kollias J, Murphy CA, Elston CW, Ellis IO, Robertson JF, Blamey RW. The prognosis of small primary breast cancers. Eur J Cancer. 1999;35:908–12.

    Article  CAS  Google Scholar 

  45. Balslev I, Axelsson CK, Zedeler K, Rasmussen BB, Carstensen B, Mouridsen HT. The Nottingham prognostic index applied to 9,149 patients from the studies of the Danish breast Cancer cooperative group (DBCG). Breast Cancer Res Treat. 1994;32:281–90.

    Article  CAS  Google Scholar 

  46. Dunnett CW. Pairwise multiple comparisons in the unequal variance case. J Am Statistical Assoc. 1980;75:796–800.

    Article  Google Scholar 

  47. Laas E, Mallon P, Delomenie M, Gardeux V, Pierga JY, Cottu P, et al. Are we able to predict survival in ER-positive HER2-negative breast cancer? A comparison of web-based models. Br J Cancer. 2015;112:912–7.

    Article  CAS  Google Scholar 

  48. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1–73.

    Article  Google Scholar 

  49. Liu Y, Perez M, Aft RL, Massman K, Robinson E, Myles S, et al. Accuracy of perceived risk of recurrence among patients with early-stage breast cancer. Cancer Epidemiol Biomark Prev. 2010;19:675–80.

    Article  Google Scholar 

  50. Ministry of Health. Korero Marama Health Literacy and Maori: Results from the 2006 Adult Literacy and Life Skills Survey. 2010.

    Google Scholar 

  51. Batterham RW, Hawkins M, Collins PA, Buchbinder R, Osborne RH. Health literacy: applying current concepts to improve health services and reduce health inequalities. Public Health. 2016;132:3–12. Epub@2016 Feb 9.: 3-12.

    Article  PubMed  CAS  Google Scholar 

  52. Altman DG. Prognostic models: a methodological framework and review of models for breast cancer. Cancer Investig. 2009;27:235–43.

    Article  Google Scholar 

  53. Tang G, Cuzick J, Costantino JP, Dowsett M, Forbes JF, Crager M, et al. Risk of recurrence and chemotherapy benefit for patients with node-negative, estrogen receptor-positive breast cancer: recurrence score alone and integrated with pathologic and clinical factors. J Clin Oncol. 2011;29:4365–72.

    Article  CAS  Google Scholar 

  54. Seneviratne S, Lawrenson R, Harvey V, Ramsaroop R, Elwood M, Sarfati D, et al. Stage of breast cancer at diagnosis in New Zealand: impacts of socio-demographic factors, breast cancer screening and biology. BMC Cancer. 2016;16:129.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  55. Seneviratne S, Campbell I, Scott N, Shirley R, Lawrenson R. Impact of mammographic screening on ethnic and socioeconomic inequities in breast cancer stage at diagnosis and survival in New Zealand: a cohort study. BMC Public Health. 2015;15:46.

    Article  Google Scholar 

  56. Ministry of Health, New Zealand guidelines group. Management of early breast cancer 2009.

    Google Scholar 

  57. National Breast Cancer Tumour Standards Working Group. Standards of Service Provision for Breast Cancer Patients in New Zealand - Provisional. Wellington: Ministry of Health; 2013.

    Google Scholar 

Download references


We thank the New Zealand Breast Cancer Foundation, Auckland Breast Cancer Registry, Waikato Breast Cancer Trust, and Waikato Bay of Plenty Division of the Cancer Society for their support of these patient registries. We thank all patients who contributed their data, the staff of the registries, and the Ministry of Health (New Zealand) for facilitating data linkages.


This work was supported by the Auckland Medical Research Fund (AMRF) grant 1116017. The funder had no role in the study design, data collection or analysis, decision to publish, or preparation of the manuscript.

Availability of data and materials

The datasets used in this study contain personal information and are not publicly available, but may be requested from the Auckland Breast Cancer Registry and the Waikato Breast Cancer Trust, subject to ethical approvals.

Author information

Authors and Affiliations



ME and ST developed the concept, and ME, ET, ST, TP, and RM designed the study and analysed the data. IC, VH, and RL supervised data collection, ensured quality control of the data and contributed to the analysis. All authors contributed to the interpretation of the data and to drafts of the report, and reviewed and approved the submission.

Corresponding author

Correspondence to J. Mark Elwood.

Ethics declarations

Ethics approval and consent to participate

Ethical approval for this study and for the use of patient data from the Auckland and Waikato Breast Cancer Registers was given by the New Zealand Northern ‘A’ Health and Disability Ethics Committee (Ref. No. 16/NTA/22, approved 18 March 2016). The data were analysed anonymously. No patient consent was required for the analysis.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Table S1. 10-year breast cancer predicted and observed survival, Auckland and Waikato databases. (DOCX 14 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Elwood, J.M., Tawfiq, E., TinTin, S. et al. Development and validation of a new predictive model for breast cancer survival in New Zealand and comparison to the Nottingham prognostic index. BMC Cancer 18, 897 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: