Skip to main content

Development and validation of a clinicoradiomic nomogram to assess the HER2 status of patients with invasive ductal carcinoma



The determination of HER2 expression status contributes significantly to HER2-targeted therapy in breast carcinoma. However, an economical, efficient, and non-invasive assessment of HER2 is lacking. We aimed to develop a clinicoradiomic nomogram based on radiomics scores extracted from multiparametric MRI (mpMRI, including ADC-map, T2W1, DCE-T1WI) and clinical risk factors to assess HER2 status.


We retrospectively collected 214 patients with pathologically confirmed invasive ductal carcinoma between January 2018 to March 2021 from Fudan University Shanghai Cancer Center, and randomly divided this cohort into training set (n = 128, 42 HER2-positive and 86 HER2-negative cases) and validation set (n = 86, 28 HER2-positive and 58 HER2-negative cases) at a ratio of 6:4. The original and transformed pretherapy mpMRI images were treated by semi-automated segmentation and manual modification on the DeepWise scientific research platform v1.6 (, then radiomics feature extraction was implemented with PyRadiomics library. Recursive feature elimination (RFE) based on logistic regression (LR) and LASSO regression were adpoted to identify optimal features before modeling. LR, Linear Discriminant Analysis (LDA), support vector machine (SVM), random forest (RF), naive Bayesian (NB) and XGBoost (XGB) algorithms were used to construct the radiomics signatures. Independent clinical predictors were identified through univariate logistic analysis (age, tumor location, ki-67 index, histological grade, and lymph node metastasis). Then, the radiomics signature with the best diagnostic performance (Rad score) was further combined with significant clinical risk factors to develop a clinicoradiomic model (nomogram) using multivariate logistic regression. The discriminative power of the constructed models were evaluated by AUC, DeLong test, calibration curve, and decision curve analysis (DCA).


70 (32.71%) of the enrolled 214 cases were HER2-positive, while 144 (67.29%) were HER2-negative. Eleven best radiomics features were retained to develop 6 radiomcis classifiers in which RF classifier showed the highest AUC of 0.887 (95%CI: 0.827–0.947) in the training set and acheived the AUC of 0.840 (95%CI: 0.758–0.922) in the validation set. A nomogram that incorporated the Rad score with two selected clinical factors (Ki-67 index and histological grade) was constructed and yielded better discrimination compared with Rad score (p = 0.374, Delong test), with an AUC of 0.945 (95%CI: 0.904–0.987) in the training set and 0.868 (95%CI: 0.789–0.948; p = 0.123) in the validation set. Moreover, calibration with the p-value of 0.732 using Hosmer–Lemeshow test demonstrated good agreement, and the DCA verified the benefits of the nomogram.


Post largescale validation, the clinicoradiomic nomogram may have the potential to be used as a non-invasive tool for determination of HER2 expression status in clinical HER2-targeted therapy prediction.

Peer Review reports


Breast carcinoma is the most widespread and lethal tumor among women [1]. The most common histological type is invasive ductal carcinoma (IDC), which accounts for roughly 80% of all breast carcinomas, with 20% – 30% expressing human epidermal growth factor receptor 2 (HER2) positivity [2]. Numerous studies substantiated the association of HER2 gene overexpression with the division, proliferation, and nourishment of new vasculature of breast carcinoma cells. Overexpression of HER2 is significantly correlated with high aggressiveness and poor prognosis in breast carcinoma patients [3,4,5,6]. Consequently, advancement in HER2-targeted therapy was considered a prominent breakthrough in breast carcinoma treatment [7, 8]. Therefore, determining the HER2 expression status in breast carcinoma patients is an essential precondition to identifying potential candidates for treatment decisions. The immunohistochemistry (IHC) method for testing HER-2 expression is a standard procedure in our laboratory and a part of our pathology reports in invasive breast carcinomas, and FISH is the subsequent and gold method in equivocal (2 +) IHC cases [9]. These two methods are more time-consuming, and takes 5–12 days after specimens are provided. FISH is difficult to apply, and requires trained technicians. It is more expensive since it requires test kits and a special microscope, and stained preparations cannot be archived [10]. Radiomics allows inference of tumoral molecular status from medical image-derived features, and it allows the study of the tumoral heterogeneity both spatially and over time; radiomics have the potential to enable spatio-longitudinal monitoring of tumor biology before and during treatment.

Existing literature has recently documented a correlation between MRI image expression and breast carcinoma HER2 expression level [11,12,13,14,15]. Song et al. [11] reported that texture features derived from kinetic parameter maps, calculated based on breast DCE-T1W1, can be used as imaging biomarkers to distinguish HER2-positive and HER2-negative breast cancer. The best model with features extracted from the slope of signal intensity (SIslope) map yielded an AUC of 0.79 in the test set. Another radiomic study by Li et al. [12] selected ER status and radiomics features in DWI images to establish the nomogram, which yielded good discrimination (AUC: 0.883/0.848) and calibration. The radiomics nomogram showed favorable performance for evaluating HER2 status in breast cancer. To the best of our knowledge, the HER2 express status in breast carcinoma could be better predicted by radiomics signature established from the mpMRI compared with single-parametric signature. Zhou et al. found the prediction ability of the developed SVM model in HER2 status of breast carcinoma patients based on T2WI, DCE-T1W1, and a combination of the two sequences. The multiparameter-related model manifested the best efficiency in forecasting the HER2 status with an AUC of 0.86 and 0.81 in the learning and test cohorts, respectively [16]. However, no published study has discussed the MRI combining with clinical pathological risk factors, such as Ki-67 index, histological grade, and lymph node metastasis, in predicting HER2 expression, and no published study has applied clinicoradiomic models (nomogram) using multiple machine learning algorithms in predicting HER2 expression in invasive ductal carcinoma.

This paper aims to investigate and compare the performance of radiomics features from different types of MR images (fat-suppressed T2-weighted images (T2WI), diffusion coefficient map (ADC-map), and dynamic contrast-enhanced T1-weighted images (DCE-T1WI)) and the performance of their fusions using different machine learning algorithms. In addition, a nomogram incorporating the mpMRI-based radiomics signature and clinical predictors was developed to improve the discriminative and interpretable ability of the classifier for HER2 status prediction.

Materials and methods

Study population

The Fudan University Shanghai Cancer Center and Shaoxing Central Hospital Ethical Committee approved this study protocol, and the requirement for written informed consent was waived. Between January 2018 and March 2021, we retrospectively collected breast tumor cases confirmed by clinical examination and diagnosed by ultrasound examination in our center. Inclusion criteria: (I) the patient had proven invasive breast ductal carcinoma of no special type (IDC-NST), as determined by histology; (II) complete breast mpMRI data and pathological data, HER2 2 + status verified by FISH; Exclusion criteria: (I) pregnant or lactating females, or a plan to get pregnant within six months; (II) prosthesis implantation; (III) history of breast surgery that might affect imaging diagnosis. A total of 302 consecutive patients from Fudan University Shanghai Cancer Center were eligible. However, poor image quality or incomplete lesion presentation in 25 cases, neoadjuvant therapy in 46 cases, and a history of breast carcinoma or bilateral breast carcinoma in 17 cases were excluded. In the case of multicentric lesions, the largest major lesion of them was selected to enter the group. Patient demographic data, including age, tumor location, menopause status, and a family history of breast carcinoma in first-degree relatives, were retrieved from the Electronic Medical Record System. Pathological data were tumor pathological type and histological grade, the expression status of HER2, Ki-67 index, and lymph node metastasis. The final cohort comprised of 214 patients and was randomly split with a ratio of 6:4 [17, 18], to create the training set (n = 128, 42 HER2-positive and 86 HER2-negative cases) and validation set (n = 86, 28 HER2-positive and 58 HER2-negative cases), respectively for subsequent analyses. Figure 1 illustrates the flowchart of this study.

Fig. 1
figure 1

The flowchart of this study

Imaging examination

For all patients the breast MRI examination was performed on the Aurora Dedicated Breast MRI System and a dedicated phased-array coil. The patients were scanned in the prone position to allow both mammary glands in the concave hole of the phased-array coil with a natural overhanging effect. All these sequences were obtained back-to-back in one imaging session. Imaging protocols were as follows: (1) transverse T2WI with fat-suppression (FS-T2WI, TR 6680 ms, TE 68 ms), with the layer thickness and spacing of 3 mm and 1 mm, respectively; (2) transverse DWI (TR8 400 ms, TE 84 ms), DWI signal intensity at b = 0 s/\({\mathrm{mm}}^{2}\), b = 1,000 s/\({\mathrm{mm}}^{2}\), with the layer thickness and spacing of 4 mm, and 6 mm, respectively, and a DWI-derived ADC map; (3) 3D-DCE, transverse T1WI with fat and water suppression (TR 5 ms, TE 29 ms) was selected, with a layer thickness of 1.1 mm and a layer spacing of 0, FOV 360 mm × 360 mm, matrix 360 × 360 × 128. The number of scanned layers in a single phase was 160. Phase I mask scan was done prior to the contrast enhancement scan. In the contrast scan, Gd-DTPA (Bayer Medical Care Ltd) was administered intravenously at a dose of 0.2 mmol/kg and a flow rate of 2.0 mL/s. After injection, contrast images were obtained in 5 phases in a row, with a scan time of 120 s per phase.

Image analysis

The breast MRI data of all sequences were imported as a DICOM file into the DeepWise scientific research platform v1.6 ( Two radiologists with more than 10-year experience in breast imaging diagnosis semi-automatically segmented lesions layer-by-layer and analyzed after merging them into the three-dimensional region of interests (3D ROIs, VOIs). After carefully scrutinizing the tumor region, disagreements were resolved by consensus-based discussion or decided by superior physicians. The 3rd sequence during the dynamic enhancement course was selected after about 240 s of injecting contrast medium. At this point, malignant lesions generally show a peak enhancement to present a clear contrast with the surrounding normal breast parenchyma, which is conducive to more accurate VOI delineation and feature extraction. The selected VOI should conform to the following criteria: (1) Include cystic lesion, necrosis, and halo-sign; (2) Invasion of surrounding structures: areas connected to VOI and have the same enhancement pattern with the VOI; (3) The lesion VOI < 5 \({\mathrm{mm}}^{3}\) is waived. The VOIs of T1-DCE were registered and applied to the other two sequences, and the image slicers and orientation were precisely matched between T2WI and ADC-map. Supplemental Figure S1 provided the process of segmentation and VOI selection of two typical HER2-positive and HER2-negative cases. Then, B-spline interpolation was carried out to standard the image into the same spatial resolution (1 mm × 1 mm × 1 mm)[19,20,21] and followed by the absolute gray-level discretization with fixed bin size (FBS) set to 5 as previous studies suggested [20,21,22,23]. In particular, the processes of image interpolation and gray-level discretization were both aligned to IBSI [21].

Radiomics feature extraction and screening

To emphasize the imaging characteristics, image filters such as log (Laplacian of Gaussian), gradient, lbp-2d/3d, and four common point-level transforms were applied prior to feature extraction. In addition, wavelet decomposition was applied at each channel for images to obtain all possible combinations in high-pass or low-pass filters (LLH, LHL, LHH, HLL, HLH, HHL, HHH, LLL). For original and other-transformed images, first-order, shape, and texture features were extracted, respectively, and implemented with the open-source PyRadiomics library ( Subsequently, Z-score transformation was used to normalize the feature distribution in the training set, and the data in the validation set were then standardized by the same calculated parameters to avoid data leakage. The implementation of feature extraction and standardization was compliant with Imaging Biomarker Standardization Initiative (IBSI) [21]. For each sequence, we initially extracted 1906 radiomics features from VOIs, including 396 first-order features, 14 shape features (including three 2D features), and 1496 texture features. A suffix was added for the different series (i.e., ‘1’ represents ADC-map, ‘2’ represents T2WI, ‘3’ represents DCE-T1WI). Intra-class correlation coefficient (ICC) was calculated to evaluate the intra- and inter-observer reproducibility of radiomics features. Initially, reader 1 and reader 2 completed the VOI segmentation and radiomics feature extraction on 30 randomly selected patients’ mpMRI data. Then, Reader 1 repeated the same procedure 1 month later. Intra- and inter-observer ICC variabilities were studied using the one-way random single measures ICC (ICC(1,1)) and two-way random single measures ICC (ICC(2,1)), respectively. And features with intra- or inter-observer ICC values lower than 0.75 were discarded before subsequent feature selection. Moreover, we applied Pearson’s correlation with a threshold of 0.9 in training set to minimize the potential collinearity of variables.

Model establishment and evaluation

Given the extracted high-throughput radiomics features, dimension reduction of the retained features was further performed with the algorithms of recursive feature elimination (RFE) and LASSO. These feature selection algorithms were used in this study because of efficiency and popularity. To achieve high and robust performance of the developed classifiers, six machine learning algorithms, Logistic Regression (LR), Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), Random Forest (RF), Naive Bayesian (NB), XGBoost (XGB), were used to classifier construction. The reason for selecting and comparing these methods in this study was that they were common classifiers in the related study for breast in previous studies, such as undergoing mastectomy prediction [24], breast cancer prediction [25], axillary lymph node metastasis [26]. To avoid over-fitting in the modeling process, the hyper-parameter searching for optimal classifiers were completed by grid-search method using the tenfold cross-validation repeatedly. After the completion of radiomics classifiers, multivariate logistic regression analysis and backward stepwise regression analysis based on Akaike Information Criterion (AIC) were devised to establish a clinicoradiomic model (nomogram) incorporating significant clinical predictors and radiomics classifiers. Thus, we developed the visual nomogram of the clinicoradiomic model to calculate the probability of HER2-positive breast cancer.

Pathological analysis

Two pathologists with more than 15 years of expertise identified the histological type, histological grading, and immunohistochemical analysis. The histological type of breast carcinoma was defined according to the World Health Organization classification. The Elston–Ellis System was followed to estimate the tumor histological grade (Elston and Ellis 1991). IHC or FISH determined the HER2 status based on the clinical use instructions for HER2 experiment in breast carcinoma presented by the American Society of Clinical Oncology (ASCO)/College of American Pathologists (CAP) [27], IHC scores of 3 + and 0 or 1 + were considered positive, and negative, respectively, while an IHC score of 2 + of HER2 was regarded as indeterminate. To determine gene amplification, researchers used fluorescence in situ hybridization (FISH), and the ratio ≥ 2.0 was judged positive for HER2 [27, 28].

Statistical methods

Statistical analysis was conducted on R statistical software v3.6.1 ( Student’s t-test and Chi-square test were, respectively, used for continuous and categorical data with normal distribution, and the Mann–Whitney U test was applied for data with non-normal distribution. All tests were two-tailed, and p < 0.05 was considered statistically significant. The receiver operating characteristic (ROC) curve is defined for evaluating the performance of binary classifiers and is produced by calculating the true positive rate against the false positive rate for a binary classifier at a set of different thresholds. And we used Youden’s index (YI) to calculate the optimal threshold, which gives equal weight to specificity and sensitivity. The R package “glmnet” statistical software (R Foundation) was used to perform the modeling process of classifiers. “PROC” R package was mainly used in the ROC curve analysis. In addition, DeLong test was used to compare the performance of two different models. Calibration curves of the nomogram and Hosmer–Lemeshow test were used to validate the agreement between prediction and observation in validation set. Furthermore, we performed decision curve analysis (DCA) to visualize the net benefit for clinical decisions.


Clinical data and pathological diagnosis results

The detailed clinical and histopathologic characteristics of patients with IDC were summarized in Table 1. Of the 214 eligible cases, the age ranged from 16 to 86 years, with an average age of 50.77 ± 10.29 years. This cohort encompassed HER2-positive in 70 (32.71%, 70/214) cases and HER2-negative in 144 (67.29%,144/214) cases. There was no significant difference between the two groups in all clinical features (p > 0.05).

Table 1 Clinical and histopathologic characteristics of IDC patients

Predictive efficiency of the radiomics signatures

In total, 455 radiomics features were excluded through stability analysis (ICC ≤ 0.75). 345, 356, 449, and 1150 radiomic features selected from T2WI, ADC-map, DCE-T1WI, and mpMRI (including T2WI, ADC-map, DCE-T1WI) images with the |correlation coefficient|≤ 0.9 were used for next dimension reduction. The top 100 ranking of RFE remained features were further selected based on LR. Finally, 11 radiomics features were retained from the mpMRI images, T2WI (n = 2), ADC-map (n = 3), DCE-T1WI (n = 6), to construct the radiomics signature using the LASSO regression (Fig. 2). The relative importance of the 11 selected radiomics features for predicting HER2 expression status was shown in Fig. 2C. Figure 3 provided the qualitative visualizations of some of the top Radiomics features on the MRI sequences for a HER2-negative case and a HER2-positive case. We further verified that there was no statistically significant difference between the final selected radiomics features in the training and validation sets (Supplemental Table S1). For three sequences, the optimal performance was reported by the RF classifier (Rad score, Fig. 4) with the AUCs of 0.927 (95%CI: 0.876–0.978) (accuracy 0.851; sensitivity 0.881; specificity 0.837) in the training set and 0.826 (95%CI: 0.738–0.914) (accuracy 0.744; sensitivity 0.857; specificity 0.689) in the validation set (Table 2). The Rad score based on mpMRI images achieved higher AUC values than other single-parametric models (i.e., ADC-map, T2W1, and DCE-T1W1) in training (AUC: 0.816, 0.743 and 0.810) and validation sets (AUC: 0.719, 0.694 and 0.756) (Supplemental Figure S2).

Fig. 2
figure 2

A Feature coefficients corresponding to the value of parameter λ. Each curve represents the change trajectory of each independent variable. B The most valuable features were screened out by tuning λ using LASSO via minimum binomial deviation. The dotted vertical line represents the optimal log (λ) value. C The selected 11 radiomics features with the most discriminative value

Fig. 3
figure 3

Qualitative visualizations of 4 top Radiomics features on the DCE-T1W1 sequence between HER2-positive (a-d) and HER2-negative cases (eh). a,e Original_gldm_LargeDependenceHighGrayLevel Emphasis (b,f) wavelet-LLH_glcm_Idmn (c,g) wavelet-HLH_glcm_Idn (d,h) lbp-3D-k_gldm_DependenceEntropy

Fig. 4
figure 4

ROC curves of the six radiomics classifiers. A Training set. B Validation set

Table 2 Performances of the six machine learning classifiers for predicting HER2 status in the validation cohort

Nomogram establishment

The Rad score (OR = 58.909, 95%CI: 7.693 to 451.094, p < 0.01), histological grade (OR = 4.971, 95%CI: 2.297 to 10.757, p < 0.01), and ki-67 (OR = 4.435, 95%CI: 1.991 to 9.883, p < 0.01) were determined as independent predictors using univariate logistic regression analysis and were used to develop the nomogram (Fig. 5A, B) by multivariate logistic regression analysis (Table 3). The nomogram_RF obtained the AUC of 0.945 (95%CI: 0.904 to 0.987) in the training and 0.868 (95%CI: 0.789 to 948) in the validation cohorts. As comparison, the nomograms based on the other radiomics classifiers were constructed in the same approach (Fig. 5A). The nomogram_RF still produced the highest AUC though there was no significant added value between the radiomics classifiers and their corresponding nomograms in the training and validation sets (Supplemental Table S2). The calibration curves demonstrated that the nomogram_RF could provide an excellent calibration in the training cohort (Fig. 5C), in which the Hosmer–Lemeshow test showed a non-significant p-value of 0.732. The DCA curves claimed that the good application of the nomogram_RF in the clinical decisions, followed by considering the radiomics classifier alone (Fig. 5D). The performances in the validation set were shown in Supplemental Figure S3.

Fig. 5
figure 5

Nomogram performances for predicting HER2 status of breast carcinoma in the Training set. A. ROC curves of the six nomograms based on different radiomics classifiers. B. Nomogram_RF. C. Calibration curve of the nomogram_RF. D. Decision curves of the nomogram_RF and Rad score

Table 3 Results of univariate and multivariate logistic regression analysis for nomogram_RF


The present study nurtured the quantitative features from T2WI, ADC-map, and DCE-T1WI images to standardize radiomics models for the noninvasive evaluation of the HER2 status in breast carcinoma. The Rad score classifier based on the RF algorithm exhibited good efficiency in differentiating HER2-positive from HER2-negative disease, in which the AUCs, accuracy, true positive, and negative rates were 0.927 (95%CI: 0.876–0.978), 85.1%, 88.1% and 83.7% respectively, in the training cohort, and 0.826 (95%CI: 0.738–0.914), 74.4%, 85.7% and 68.9%, respectively, in the validation cohort. We then created a nomogram by combining the Rad score with the independent clinical predictors (histological grade and ki-67) with univariate and multivariate logistic regressions. The finding demonstrated that the nomogram could provide superior classification and recognition in HER2-positive patients (the AUCs of 0.945 and 0.868 in the training and validation sets). Thus, we concluded that applying radiomics features from mpMRI data incorporating machine learning algorithm contributed to the good discriminative performance in the HER2 status.

Recently, there has been a growing interest in applying quantitative imaging data to delineate intrinsic biological characteristics [29,30,31,32,33,34]. Diverse research has dissected the dependence between radiomic signatures and HER2 expression status [35,36,37,38,39,40]. Zhou et al. [35]. noninvasively evaluated the efficacy of mammography radiomics signatures in diagnosing the patients’ HER2 status with breast carcinoma, including mediolateral oblique (MLO) and cranial caudal (CC) views, with an AUC of 0.846 in the training set and an AUC of 0.74 in the testing set. Another similar study also highlighted the connection between radiomics signatures from multidetector computed breast mammography images with HER2 express status [36]. Daniele La Forgia et al. [41]. reported that radiomics combining contrast-enhanced spectral mammography performed well in predicting histological subtypes of breast cancer, with accuracies of 90.87%and 84.80% in discriminating HER2 + /HER2 − and Ki67 + /Ki67 − breast cancer, respectively. Those studies indicated that radiomics analysis was a useful analytical tool to predict HER2 status in breast cancer, which is consistent with the findings of this study.Another radiomic study by Bitencourt et al. [42].developed machine learning models (including both clinical and radiomics MRI features) to predict HER2 expression levels and pathologic response (pCR) after neoadjuvant chemotherapy in HER2 over-expressing breast cancer patients. The model predicted HER2 heterogeneity and pCR with diagnostic accuracy of 97.4% and 83.9% in the test set, respectively. More specifically, the HER2 express status in breast carcinoma could be better predicted by a radiomics signature established from the mpMRI compared with a single-parametric signature. Huang et al. [43].showed that multi-parametric MRI-based radiomics combined with different machine learning approaches could be a promising method to predict the molecular subtype and AR expression of breast cancer non-invasively. In this study, the results verified the conclusion with the superior performance of prediction, in which the Rad score based on mpMRI images achieved higher AUC values of 0.927 and 0.826 than other single-parametric models (i.e., ADC-map, T2W1, and DCE-T1W1) in training (AUC: 0.816, 0.743 and 0.810) and validation sets (AUC: 0.719, 0.694 and 0.756), respectively.

Many studies have reported the association between histological grade, Ki-67 index, and HER2 status; HER2-positive expression has a relationship with high nuclear grade and ki-67 index [44,45,46]. In contrast, our study showed that only the ki-67 index and histological grade could be used as an independent predictor of HER2 expression. The ki-67as proliferation marker has been successfully used as a tool for clinical decision-making, specifically how it can be used to select the optimal treatment for each individual patient [47]. Tumor histological grade has previously been linked to HER2 status. HER2 expression has previously been found to correlate with a higher nuclear grade but not with tumor stage [45]. However, none of these studies developed a predictive model that included these predictors. Our study is the first to develop a nomogram to perform this prediction. Recently, the added value of the nomogram model based on radiomics signature and clinical factors for pathological outcome prediction [48,49,50] has been investigated by some studies. Our results showed the highest performance compared with existing work, which is another clinical contribution of this work. In this study, 11 optimal features were finally retained for modeling, including morphological, first-order, GLCM, GLDM, and GLSZM characteristics, which are predominantly involved in tumor heterogeneity. Based on tumor size and morphology, a significant correlation existed between the characteristics of shape_Maximum 2D Diameter, shape_Maximum 3D Diameter, shape_Sphericity, and the HER2 expression of breast carcinoma. Consistent with the previous report [51], the morphology and size of lesions vary with the different expressions of hormone receptors (HR). Much irregular and larger lesions were reported in the cases of HR-negative plus HER2-positive compared with the hormone HR-positive plus HER2-negative. In the present study, high kurtosis was found in most HER2-positive cases, which is consistent with the existing work. Fan et al. [52] constructed a predictive model for four molecular subtypes classification of breast carcinoma based on the DCE-T1W1 images and two clinical risk factors, revealing the potential of kurtosis as an independent predictor for classifying molecular subtypes of breast carcinoma. Previous studies have shown that texture analysis can provide information on tumor cellular and histological components [53, 54]. The SizeZoneNonUniformityNormalized texture feature measures the variability of zone volumes throughout the image. A higher value indicates more inhomogeneity among zone volumes in the image; thus, we concluded the irregular volume distribution of the HER2-positive group, which may be caused by the unequal growth rate of tumors in various directions. The texture features of glcm_Idmn, glcm_Dmn, and gldm_Dependence from wavelet-transformed images can reflect the homogeneity and consistency of tumor texture characteristics, conducive to the better exploration of intra-tumor heterogeneity and subtle differences in grey and texture level feature [55]. The DependenceEntropy texture feature measures the intensity differences in the neighborhood; the higher value represents a more disordered internal density of the tumor; hence, we found the complex heterogeneity existed in the HER2-positive texture patterns.

The mpMRI facilitates a comprehensive analysis of radiomics features; the established nomogram can fully combine the characteristics of different sequences. Nonetheless, there are several limitations to this study. Firstly, the limited sample size and all the cases were from one center. Therefore, additional validation of the constructed models on multicentric studies with a large study population is required. Second, to avoid confounding influences associated with the pathological type, the study only enrolled patients with IDC-NST, the most common form of breast carcinoma. Thirdly, we only used the traditional machine learning classification algorithms to explore the predefined radiomics features instead of the more informative and abstract features extracted by deep learning algorithms from MR images.


This study showed that radiomics signature based on mpMRI could improve the discriminative performance of HER2 status prediction in patients with breast carcinoma. The nomogram with the radiomics signature and independent clinical risk factors is a key tool for noninvasive diagnosis.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the inside author on reasonable request. The study data can be made available publicly.



Least absolute shrinkage and selection operator


Recursive feature elimination

Rad score:

Radiomics score


Receiver operating characteristic


Decision curve analysis




Fluorescence in-situ hybridization


Invasive ductal carcinoma


T2-weighted images


Apparent diffusion coefficient map


Dynamic contrast-enhanced T1 weighted images


Invasive breast ductal carcinoma of no special type


Gray level co-occurrence matrix


Gray-level run-length matrix


Gray-level size zone matrix


Gray-level dependence matrix


Area under the curve


American Society of Clinical Oncology


College of American Pathologists


Logistic regression


Linear discriminant analysis


Support vector machine


Random forest


Naive Bayesian




  1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209–49.

    Article  PubMed  Google Scholar 

  2. Casey M, Prakash A, Holian E, McGuire A, Kalinina O, Shalaby A, et al. Quantifying Argonaute 2 (Ago2) expression to stratify breast cancer. BMC Cancer. 2019;19:1–12.

    Article  CAS  Google Scholar 

  3. Pradeep CR, Zeisel A, Köstler WJ, Lauriola M, Jacob-Hirsch J, et al. Modeling invasive breast cancer: growth factors propel progression of HER2-positive premalignant lesions. Oncogene. 2012;31(31):3569–83.

    Article  CAS  PubMed  Google Scholar 

  4. Molina Crespo Á, Cadete A, Sarrio D, Gámez-Chiachio M, Martinez L, et al. Intracellular Delivery of an Antibody Targeting Gasdermin-B Reduces HER2 Breast Cancer Aggressiveness. Clin Cancer Res. 2019;25(15):4846–58.

    Article  CAS  PubMed  Google Scholar 

  5. Bhattacharjee A, Rajendra J, Dikshit R, Dutt S. HER2 borderline is a negative prognostic factor for primary malignant breast cancer. Breast Cancer Res Treat. 2020;181:225–31.

    Article  CAS  PubMed  Google Scholar 

  6. Waks AG, Winer EP. Breast cancer treatment: a review. JAMA. 2019;321:288–300.

    Article  CAS  PubMed  Google Scholar 

  7. Wildiers H, Tryfonidis K, Dal Lago L, Vuylsteke P, Curigliano G, Waters S, et al. Pertuzumab and trastuzumab with or without metronomic chemotherapy for older patients with HER2-positive metastatic breast cancer (EORTC 75111–10114): an open-label, randomised, phase 2 trial from the Elderly Task Force/Breast Cancer Group. Lancet Oncol. 2018;19:323–36.

    Article  CAS  PubMed  Google Scholar 

  8. Cameron D, Piccart-Gebhart MJ, Gelber RD, Procter M, Goldhirsch A, de Azambuja E, et al. 11 years’ follow-up of trastuzumab after adjuvant chemotherapy in HER2-positive early breast cancer: final analysis of the HERceptin Adjuvant (HERA) trial. Lancet. 2017;389:1195–205.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Hartman AK, Gorman BK, Chakraborty S, Mody DR, Schwartz MR. Determination of HER2/neu status: a pilot study comparing HER2/neu dual in situ hybridization DNA probe cocktail assay performed on cell blocks to immunohistochemisty and fluorescence in situ hybridization performed on histologic specimens. Arch Pathol Lab Med. 2014;138(4):553–8.

    Article  PubMed  Google Scholar 

  10. Murthy SS, Sandhya DG, Ahmed F, et al. Assessment of HER2/Neu status by fluorescence in situ hybridization in immunohistochemistry-equivocal cases of invasive ductal carcinoma and aberrant signal patterns: a study at a tertiary cancer center. Indian J Pathol Microbiol. 2011;54(3):532–8.

    Article  PubMed  Google Scholar 

  11. Song L, Li C, Yin J. Texture Analysis Using Semiquantitative Kinetic Parameter Maps from DCE-MRI: Preoperative Prediction of HER2 Status in Breast Cancer. Front Oncol. 2021;8(11):675160.

    Article  Google Scholar 

  12. Li C, Yin J. Radiomics Nomogram Based on Radiomics Score from Multiregional Diffusion-Weighted MRI and Clinical Factors for Evaluating HER-2 2+ Status of Breast Cancer. Diagnostics. 2021;11(8):1491.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  13. Nagasaka K, Satake H, Ishigaki S, Kawai H, Naganawa S. Histogram analysis of quantitative pharmacokinetic parameters on DCE-MRI: correlations with prognostic factors and molecular subtypes in breast cancer. Breast Cancer. 2019;26:113–24.

    Article  PubMed  Google Scholar 

  14. Wang H, Hu Y, Li H, Xie Y, Wang X, Wan W. Preliminary study on identification of estrogen receptor-positive breast cancer subtypes based on dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) texture analysis. Gland Surg. 2020;9:622–8.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Leithner D, Bernard-Davila B, Martinez DF, et al. Radiomic Signatures Derived from Diffusion-Weighted Imaging for the Assessment of Breast Cancer Receptor Status and Molecular Subtypes. Mol Imaging Biol. 2020;22(2):453–61.

    Article  PubMed  Google Scholar 

  16. Zhou J, Tan H, Li W, Liu Z, Wu Y, Bai Y, et al. Radiomics signatures based on multiparametric MRI for the preoperative prediction of the her2 status of patients with breast cancer. Acad Radiol. 2021;28:1352–60.

    Article  PubMed  Google Scholar 

  17. Li S, Zhang D, Zeng S, et al. Prognostic Value of Preoperative Albumin-to-Fibrinogen Ratio in Patients with Bladder Cancer. J Cancer. 2021;12(19):5864–73.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Li B, Cui Y, Diehn M, Li R. Development and Validation of an Individualized Immune Prognostic Signature in Early-Stage Nonsquamous Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3(11):1529–37.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Vallières M, Freeman CR, Skamene SR, El Naqa I. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities. Phys Med Biol. 2015;60:5471–6.

    Article  PubMed  Google Scholar 

  20. Carré A, Klausner G, Edjlali M, et al. Standardization of brain MR images across machines and protocols: bridging the gap for MRI-based radiomics. Sci Rep. 2020;10:12340.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Zwanenburg A, Vallières M, Abdalah MA, Aerts HJ, Andrearczyk V, Apte A, et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology. 2020;295:328–38.

    Article  PubMed  Google Scholar 

  22. Duron L, Balvay D, Vande Perre S, et al. Gray-level discretization impacts reproducible MRI radiomics texture features. PLoS One. 2019;14:e0213459.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Hoebel KV, Patel JB, Beers AL, Chang K, Singh P, Brown JM, et al. Radiomics repeatability pitfalls in a scan-rescan MRI study of glioblastoma. Radiol Artif Intell. 2020;3:e190199.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Pfob A, Mehrara BJ, Nelson JA, Wilkins EG, Pusic AL, Sidey-Gibbons C. Machine learning to predict individual patient-reported outcomes at 2-year follow-up for women undergoing cancer-related mastectomy and breast reconstruction (INSPiRED-001). Breast. 2021;60:111–22.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Stark GF, Hart GR, Nartowt BJ, et al. Predicting breast cancer risk using personal health data and machine learning models. PLoS ONE. 2019;14(12):e226765.

    Article  CAS  Google Scholar 

  26. Chen C, Qin Y, Chen H, Zhu D, Gao F, Zhou X. A meta-analysis of the diagnostic performance of machine learning-based MRI in the prediction of axillary lymph node metastasis in breast cancer patients. Insights Imaging. 2021;12(1):156.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Wolff AC, Hammond MEH, Allison KH, Harvey BE, Mangu PB, Bartlett JM, et al. Human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline focused update. Arch Pathol Lab Med. 2018;142:1364–82.

    Article  PubMed  Google Scholar 

  28. Hanna WM, Slodkowska E, Lu F-I, Nafisi H, Nofech-Mozes S. Comparative analysis of human epidermal growth factor receptor 2 testing in breast cancer according to 2007 and 2013 American Society of Clinical Oncology/College of American Pathologists guideline recommendations. J Clin Oncol. 2017;35:3039–45.

    Article  CAS  PubMed  Google Scholar 

  29. Liu C, Ding J, Spuhler K, Gao Y, Serrano Sosa M, Moriarty M, et al. Preoperative prediction of sentinel lymph node metastasis in breast cancer by radiomic signatures from dynamic contrast-enhanced MRI. J Magn Reson Imag. 2019;49:131–40.

    Article  Google Scholar 

  30. Chamming’s F, Ueno Y, Ferré R, Kao E, Jannot A-S, Chong J, et al. Features from computerized texture analysis of breast cancers at pretreatment MR imaging are associated with response to neoadjuvant chemotherapy. Radiology. 2018;286:412–20.

    Article  PubMed  Google Scholar 

  31. Tan H, Gan F, Wu Y, Zhou J, Tian J, Lin Y, et al. Preoperative prediction of axillary lymph node metastasis in breast carcinoma using radiomics features based on the fat-suppressed T2 sequence. Acad Radiol. 2020;27:1217–25.

    Article  PubMed  Google Scholar 

  32. Han L, Zhu Y, Liu Z, Yu T, He C, Jiang W, et al. Radiomic nomogram for prediction of axillary lymph node metastasis in breast cancer. Eur Radiol. 2019;29:3820–9.

    Article  PubMed  Google Scholar 

  33. Fan M, Chen H, You C, Liu L, Gu Y, Peng W, et al. Radiomics of tumor heterogeneity during longitudinal dynamic contrast-enhanced magnetic resonance imaging for predicting response to neoadjuvant chemotherapy in breast cancer. Front Mol Biosci. 2021;8:119.

    Article  CAS  Google Scholar 

  34. Cain EH, Saha A, Harowicz MR, Marks JR, Marcom PK, Mazurowski MA. Multivariate machine learning models for prediction of pathologic response to neoadjuvant therapy in breast cancer using MRI features: a study using an independent validation set. Breast Cancer Res Treat. 2019;173:455–63.

    Article  CAS  PubMed  Google Scholar 

  35. Zhou J, Tan H, Bai Y, Li J, Lu Q, Chen R, et al. Evaluating the HER-2 status of breast cancer using mammography radiomics features. Eur J Radiol. 2019;121: 108718.

    Article  PubMed  Google Scholar 

  36. Li E, Guida JL, Tian Y, Sung H, Koka H, Li M, et al. Associations between mammographic density and tumor characteristics in Chinese women with breast cancer. Breast Cancer Res Treat. 2019;177:527–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Yang X, Wu L, Zhao K, Ye W, Liu W, Wang Y, et al. Evaluation of human epidermal growth factor receptor 2 status of breast cancer using preoperative multidetector computed tomography with deep learning and handcrafted radiomics features. Chin J Cancer Res. 2020;32:175–85.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Chang RF, Chen HH, Chang YC, Huang CS, Chen JH, Lo CM. Quantification of breast tumor heterogeneity for ER status, HER2 status, and TN molecular subtype evaluation on DCE-MRI. Magn Reson Imag. 2016;34:809–19.

    Article  CAS  Google Scholar 

  39. Ni M, Zhou X, Liu J, Yu H, Gao Y, Zhang X, Li Z. Prediction of the clinicopathological subtypes of breast cancer using a fisher discriminant analysis model based on radiomic features of diffusion-weighted MRI. BMC Cancer. 2020;20(1):1073.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Chen Y, Wang Z, Yin G, Sui C, Liu Z, Li X, et al. Prediction of HER2 expression in breast cancer by combining PET/CT radiomic analysis and machine learning. Ann Nucl Med. 2021;36:172–82.

    Article  PubMed  CAS  Google Scholar 

  41. La Forgia D, Fanizzi A, Campobasso F, et al. Radiomic Analysis in Contrast-Enhanced Spectral Mammography for Predicting Breast Cancer Histological Outcome. Diagnostics (Basel). 2020;10(9):708.

    Article  Google Scholar 

  42. Bitencourt AGV, Gibbs P, Rossi Saccarelli C, et al. MRI-based machine learning radiomics can predict HER2 expression level and pathologic response after neoadjuvant therapy in HER2 overexpressing breast cancer. EBioMedicine. 2020;61:103042.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Huang Y, Wei L, Hu Y, Shao N, Lin Y, He S, Shi H, Zhang X, Lin Y. Multi-Parametric MRI-Based Radiomics Models for Predicting Molecular Subtype and Androgen Receptor Expression in Breast Cancer. Front Oncol. 2021;11:706733.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Li C, Song L, Yin J. Intratumoral and Peritumoral Radiomics Based on Functional Parametric Maps from Breast DCE-MRI for Prediction of HER-2 and Ki-67 Status. J Magn Reson Imag. 2021;54:703–14.

    Article  Google Scholar 

  45. Guo Q, Chen K, Lin X, et al. A nomogram to predict HER2 status in breast cancer patients with HER2-borderline disease as determined via immunohistochemistry. Oncotarget. 2017;8(55):93492–501.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Zhang Y, Zhu Y, Zhang K, et al. Invasive ductal breast cancer: preoperative predict Ki-67 index based on radiomics of ADC maps. Radiol Med. 2020;125(2):109–16.

    Article  PubMed  Google Scholar 

  47. Ma W, Ji Y, Qi L, Guo X, Jian X, Liu P. Breast cancer Ki67 expression prediction by DCE-MRI radiomics features. Clin Radiol. 2018;73(10):909.e1-909.e5.

    Article  CAS  Google Scholar 

  48. Tagliafico AS, Piana M, Schenone D, Lai R, Massone AM, Houssami N. Overview of radiomics in breast cancer diagnosis and prognostication. Breast. 2020;49:74–80.

    Article  PubMed  Google Scholar 

  49. Tang TY, Li X, Zhang Q, Guo CX, Zhang XZ, Lao MY, et al. Development of a novel multiparametric MRI radiomic nomogram for preoperative evaluation of early recurrence in resectable pancreatic cancer. In: Imag JMR, editor. Wiley Online Library; 2020. p. 231–45.

  50. Nie P, Yang G, Wang Z, Yan L, Miao W, Hao D, et al. A CT-based radiomics nomogram for differentiation of renal angiomyolipoma without visible fat from homogeneous clear cell renal cell carcinoma. Eur Radiol. 2020;30:1274–84.

    Article  PubMed  Google Scholar 

  51. Chen JH, Baek HM, Nalcioglu O, Su MY. Estrogen receptor and breast MR imaging features: a correlation study. J Magn Reson Imag. 2008;27:825–33.

    Article  Google Scholar 

  52. Fan M, Li H, Wang S, Zheng B, Zhang J, Li L. Radiomic analysis reveals DCE-MRI features for prediction of molecular subtypes of breast cancer. PLoS ONE. 2017;12:e0171683.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Li TG, Wang SP, Zhao N. Gray-scale edge detection for gastric tumor pathologic cell images by morphological analysis. Com-put Biol Med. 2009;39:947–52.

    Article  Google Scholar 

  54. Weyn B, Jacob W, da Silva VD, et al. Data representation and reduction for chromatin texture in nuclei from premalig-nant prostatic, esophageal, and colonic lesions. Cytometry. 2000;41:133–8.

    Article  CAS  PubMed  Google Scholar 

  55. Waugh S, Purdie C, Jordan L, Vinnicombe S, Lerski R, Martin P, et al. Magnetic resonance imaging texture analysis classification of primary breast cancer. Eur Radiol. 2016;26:322–30.

    Article  CAS  PubMed  Google Scholar 

Download references


Not applicable.


This work was supported by the Medical and Health Research Project of Zhejiang Province (Grant Nos. 2021KY1161, 2022KY1316 and 2021KY1146); Zhejiang Province Chinese Medicine Science Research Fund Project (Grant No. 2021ZA138); and Institution from Key Laboratory of Functional Molecular Imaging of Tumor and Interventional Diagnosis and Treatment of Shaoxing City.

Author information

Authors and Affiliations



AX and XW conceived and designed this study. XC carried out to collect the clinical data. JZ collected Pathological data.AX and XC drafted the manuscript. FL performed the statistical analysis.DS performed image processing.SZ and SL put forward many opinions on the manuscript. All authors contributed to the article and approved the submitted version. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Aqiao Xu or Xiaobo Weng.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Institutional Review Board of the Fudan University Shanghai Cancer Center and Shaoxing Central Hospital (2020024). All methods were carried out in accordance with the Declaration of Helsinki guidelines and regulations. Multiple sequence MRI is a common, non-invasive and safe examination method, and it was a retrospective study. The requirement for written informed consent was waived by the Institutional Review Board of the Fudan University Shanghai Cancer Center and Shaoxing Central Hospital.

Consent for publication

Written informed consent was obtained from the patients for publication of this study. Documentation of the written consent will be provided to the journal upon request.

Competing interests

The authors declare that they have no competing interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Statistically analysis of the final selected radiomics features based on mpMRI in the training and validation sets. Table S2. Statistically analysis of the six radiomcis classifiers and their corresponding nomograms in the validation cohort. Figure S1. Two typical HER2-positive and HER2-negative cases. (a) Original DCE-T1WI image of the HER2-positive sample. (b) The semi-automated segmentation of lesion on Deepwise. (c) Manual modification. (d) Lesion segmentation in T2WI. (e) Lesion segmentation in ADC-map. (f) The corresponding 3D ROI. (g-l) The corresponding images for the HER2-negative sample. Figure S2. ROC curves of T2WI model, ADC-map model, DCE-T1WI model, and Rad score in evaluating HER2 status of breast carcinoma. A. Training cohort; B. Validation cohort. Figure S3. Nomogram performances for predicting HER2 status of breast carcinoma in the Validation set. A. ROC curves of the six nomograms based on different radiomics classifiers. B. Calibration curve of the nomogram_RF. C. Decision curves of the nomogram_RF and Rad score.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, A., Chu, X., Zhang, S. et al. Development and validation of a clinicoradiomic nomogram to assess the HER2 status of patients with invasive ductal carcinoma. BMC Cancer 22, 872 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: