Skip to main content

Using CT radiomic features based on machine learning models to subtype adrenal adenoma

Abstract

Background

Functioning and non-functioning adrenocortical adenoma are two subtypes of benign adrenal adenoma, and their differential diagnosis is crucial. Current diagnostic procedures use an invasive method, adrenal venous sampling, for endocrinologic assessment.

Methods

This study proposes establishing an accurate differential model for subtyping adrenal adenoma using computed tomography (CT) radiomic features and machine learning (ML) methods. Dataset 1 (289 patients with adrenal adenoma) was collected to develop the models, and Dataset 2 (54 patients) was utilized for external validation. Cuboids containing the lesion were cropped from the non-contrast, arterial, and venous phase CT images, and 1,967 features were extracted from each cuboid. Ten discriminative features were selected from each phase or the combined phases. Random forest, support vector machine, logistic regression (LR), Gradient Boosting Machine, and eXtreme Gradient Boosting were used to establish prediction models.

Results

The highest accuracies were 72.7%, 72.7%, and 76.1% in the arterial, venous, and non-contrast phases, respectively, when using radiomic features alone with the ML classifier of LR. When features from the three CT phases were combined, the accuracy of LR reached 83.0%. After adding clinical information, the area under the receiver operating characteristic curve increased for all the machine learning methods except for LR. In Dataset 2, the accuracy of LR was the highest, reaching 77.8%.

Conclusion

The radiomic features of the lesion in three-phase CT images can potentially suggest the functioning or non-functioning nature of adrenal adenoma. The resulting radiomic models can be a non-invasive, low-cost, and rapid method of minimizing unnecessary testing in asymptomatic patients with incidentally discovered adrenal adenoma.

Peer Review reports

Introduction

Adrenal adenomas are common tumors, and their prevalence in the general population is about 6% [1], although the incidence increases from about 1% for 40 year-olds to 7% for 70 year-olds [2]. Adrenal adenomas can be divided into functioning adrenocortical adenoma (FAA) and nonfunctioning adrenocortical adenoma (NAA). This classification is mainly based on whether the endocrine function is affected. FAA can disrupt hormone levels in patients, leading to Cushing’s syndrome (CS) and primary hyperaldosteronism (PHA) [3, 4]. Among CS patients, the incidence is high in 20–50 year-olds, and the male-to-female ratio is about 1:3 [5]. For PHA, the incidence is high in 30–50 year-olds, and there is no significant difference in prevalence between men and women [6].

Both FAA and NAA are benign adrenal adenomas. The clinical diagnosis of the two diseases is mainly through endocrine tests for hormone levels or medical imaging [7]. Endocrine analysis requires several blood tests and related hormone induction procedures, which are tedious, time-consuming, and invasive. If NAA is present and there is no hormone abnormality, the hormone test may be unnecessary and delay treatment. Radiologic diagnosis mainly uses morphological information such as the location and size of the lesion. Its accuracy cannot be guaranteed, and misdiagnosis may occur [8], leading to unnecessary adrenalectomy.

Intervention methods are different for the two types of adrenal adenomas. FAA can cause hormone abnormalities, so further examination of hormone levels is required and treatment usually involves surgery [9]. Most NAAs are asymptomatic, and the treatment method is often determined according to the size of the tumor. If the tumor is small and benign, surgical resection is relatively rare [10] and conservative treatment is usually adopted [11], for which no hormone examination is required. Therefore, it is necessary to develop a fast, accurate, and reliable diagnostic method to improve patient intervention and treatment while minimizing unnecessary testing in cases of NAA [12].

Non-invasive computed tomography (CT) can generate high-resolution images and has become a routine examination method. In addition, many features can be extracted from CT images for quantitative analysis [13]. In oncology, texture analysis is a new tool to help diagnose disease [14, 15]. Diagnosis based on machine learning performs as well as experienced doctors in many cases [16, 17]. CT images are assumed to contain valuable information that reflects underlying tumor pathophysiology and quantitative image features (semantic and agnostic), and machine learning models can reveal these relationships [18]. This approach reduces the workload of radiologists and improves the accuracy and efficiency of diagnosis [19,20,21].

Radiomic studies of adrenal adenomas have been carried out. Elmohr et al. [22] conducted texture analysis on venous phase CT images and established a binary random forest (RF) classification model for benign and malignant large adrenal tumors. Moawad et al. [23] established a random forest classifier to distinguish between benign and malignant uncertain adrenal tumors. Daye et al. [24] used the support vector machine (SVM) classification model to predict the prognosis of ablation patients based on texture features in CT images of adrenal metastases before ablation. In the previous studies of adrenal adenoma, time-consuming manual segmentation of the lesion region was usually required. Moreover, two-dimensional radiomic features were extracted from the largest section, while three-dimensional features that can be extracted from a cuboid containing the adrenal adenoma region were seldom utilized. To the best of our knowledge, differentiation between functioning and non-functioning subtypes of adrenal adenoma using radiomic features and machine learning models has not been reported.

The purpose of this study is to establish an accurate model to distinguish FAA from NAA using CT radiomic features and machine learning methods. This model may provide a non-invasive, low-cost, and rapid method of adrenal adenoma stratification to help patients with NAA avoid unnecessary invasive hormone tests. Our study contributes to this goal in three ways. First, a CT radiomic model is developed for differentiating functioning from non-functioning subtypes of adrenal adenoma in a non-invasive, low-cost, and rapid way. Second, the tedious step of manual segmentation of adrenal adenoma from CT images is replaced by simple cropping of a cuboid containing the target adrenal adenoma region to extract three-dimensional features. Third, a combination of radiomic features obtained from multiple phases of CT images improves classification performance.

Materials and methods

Patients and data

Two datasets were collected in this study. Dataset 1 was from The First Affiliated Hospital of China Medical University and was used to develop the machine learning model. Dataset 2 was from Shengjing Hospital of China Medical University and was utilized for external validation. The Ethics Committee of The First Affiliated Hospital of China Medical University and Shengjing Hospital of China Medical University approved this retrospective study, and informed consent was waived.

The experimental subjects were patients who had been diagnosed with adrenal adenomas. The data mainly included patient clinical records and contrast-enhanced CT images. Specifically, the clinical data were the patient’s age, gender, examination time, and pathological type. Each CT scan included images of arterial, venous, and non-contrast phases. The three-phase CT images were used in all cases.

After screening, a total of 289 patients with lesions larger than 2 cm were included in Dataset 1. There were 191 patients with FAA (89 CS, 102 PHA) and 98 with NAA. In subsequent experiments, Dataset 1 was randomly divided into training (n = 201; 133 FAA and 68 NAA) and test (n = 88; 58 FAA and 30 NAA) sets. The training and test sets contained different patients to ensure that they were independent. In Dataset 2, there were 54 patients (30 FAA and 24 NAA). Table 1 summarizes basic patient information and adrenal adenoma characteristics.

Figure 1 shows the overall workflow of this study. After the acquisition of CT images, the cuboid containing the adrenal adenoma region was cut from the CT images of the arterial, venous, and non-contrast phases. Next, feature extraction and selection were conducted. Finally, a binary classification model was established to determine the final prediction results.

Fig. 1
figure 1

Workflow of the experiment, including CT image acquisition, cuboid cropping, feature extraction and selection, and model building and validation steps

Cuboid cropping

In the process of cuboid cropping, the central point of the adrenal adenoma was determined first, and then a 64*64*32 cuboid was automatically cropped from the CT image to ensure that the cuboid completely contained the lesion area. In the experiment, the cuboid was cropped from the non-contrast, arterial, and venous phase CT images and used as the region of interest (ROI). One radiologist with 8 years of experience in abdominal imaging cropped the cuboids and another radiologist with 10 years of experience examined and confirmed the cropped results.

Feature extraction

Pyradiomics software [25] was used to extract features from the cropped cuboids from the three-phase CT images. The extracted features initially included the following seven groups: (a) First Order Features, (b) Shape Features, (c) Gray Level Co-occurrence Matrix (GLCM), (d) Gray Level Size Zone Matrix (GLSZM), (e) Gray Level Run Length Matrix (GLRLM), (f) Neighboring Gray Tone Difference Matrix (NGTDM), and (g) Gray Level Dependence Matrix (GLDM). The definitions and detailed explanations of all the above texture features can be found in the Pyradiomics documentation [26]. Finally, a total of 1,967 radiomic features were extracted from each ROI.

Feature selection

Feature selection is performed to identify critical and discriminative features and reduce over-fitting risk in the final prediction model [27]. First, an independent two-sample t-test was used to calculate whether each feature was significantly different between the two subtypes of adrenal adenoma. If p < 0.05, the feature was retained. Second, the Least Absolute Shrinkage and Selection Operator (LASSO) was employed to identify the final discriminative features. LASSO improves both prediction accuracy and model interpretability by combining the superior qualities of ridge regression and subset selection [28] and therefore is commonly used for feature selection [29, 30]. LASSO can reduce the coefficient of variables (that have little effect on the regression) to 0 during the fitting process, hence achieving variable screening and complexity adjustment [31, 32].

Only the training set data is used in feature selection. By calculating the coefficients of each variable in the LASSO, the features of the model can be sorted in order of importance from high to low. For every phase of the CT images, the top 10 features were used to train the subsequent two-category classifiers.

The radiomic features from the three phases of CT images were combined to improve the classification performance. First, a two-sample t-test was conducted for features from each phase. Second, the radiomic features that had significant differences were obtained from the three phases of CT images and concatenated into a vector. Third, LASSO regression was used to calculate the coefficients of each feature in the vector. Fourth, the features were sorted and the top 10 features were selected to build the models.

Machine-learning classification model

Five machine learning algorithms, RF [33, 34], SVM [35, 36], Logistic Regression [37, 38] (LR) Gradient Boosting Machine (GBM), and eXtreme Gradient Boosting (XGBoost), were used to establish two-category classification models. For hyperparameter optimization of the classification model, grid-search was used for 10-fold cross-validation to search for the optimal parameters of the model [39].

The models using single-phase CT features, multi-phase fusion CT features, and radiomic-clinical features were trained, tested, and compared. An independent test set in Dataset 1 was used to evaluate the performance of the model using measures of accuracy (ACC), specificity, sensitivity, and area under the curve (AUC). Receiver operating characteristic (ROC) curves were drawn. Moreover, models with and without the hyperparameter optimization were also compared. Dataset 2 with 54 patients served as the external validation to investigate the model’s generalization capability.

The machine learning algorithms were completed using the open-source Python Scikit-Learn library [40] (Version 0.24.1). All experiments were conducted on Pycharm (Version 2020.2, Python Version 3.7.9). An independent two-sample t-test was used for statistical analysis of the age, gender, and location of adrenal adenomas. A p-value less than 0.05 indicated significant differences between the two independent groups.

Results

Demographic and clinical characteristics

The demographic and clinical characteristics of patients with adrenal adenomas in Datasets 1 and 2 are presented in Table 1. In Dataset 1, there was a significant age difference between FAA and NAA patients (p = 0.041). The average age of NAA patients was slightly higher than that of FAA patients, and no significant gender difference was found (p = 0.419). In FAA, 100 patients had lesions on the left, 77 on the right, and 14 on both sides. In NAA, 51, 40, and 7 patients had lesions on the left, right, or both sides, respectively. In Dataset 2, there was no significant difference in gender or age between the two groups.

Table 1 Demographic and clinical characteristics

Radiomic characteristics

Figure 2 shows the mean square error (MSE) with Lambda in the LASSO and the variation of each feature coefficient with Lambda. A two-sample t-test showed there were 661 features with significant differences, including 146, 343, and 172 from the arterial, venous, and non-contrast phases, respectively. While MSE reached the minimum value marked in the dotted line in Fig. 2, the number of features was reduced to 37.

Fig. 2
figure 2

Variation of MSE and coefficient of each feature with Lambda in the LASSO. The features were retained when the three phases were combined (two-sample t-test)

Using the absolute value of the coefficients, the top 10 discriminate features in a fusion of the three CT image phases are summarized in Table 2. The features (mean and standard deviation) from the CT images of the three phases selected by LASSO are compared in Fig. 3. It is noteworthy that there is a significant difference between groups, as shown by a two-sample t-test before LASSO. In Table 2; Fig. 3, the last letter in the name of the feature indicates which phase the feature is from (A, arterial; V, venous; N, non-contrast).

Table 2 Discriminative features in two-category models
Fig. 3
figure 3

Comparison of features (mean and standard deviation) selected by LASSO from the three CT image phases. It is noteworthy that there is a significant difference between groups, as shown by a two-sample t-test before LASSO.

The mean values of SizeZoneNonUniformityNormalized_A (SZNN), Imc2_V, SZNN_N, and ClusterShade_N were greater in FAA than in NAA. By contrast, NAA had higher mean values for the other features.

Performance of models using CT features from single phases

Figure 4 shows the ROC curve of five classification models with different machine-learning methods for each phase. The AUC of the five models did not change significantly when using features from single phases. Specifically, the AUC range of the five machine learning models was 0.69–0.74 for arterial phase features, 0.65–0.72 for venous phase features, and 0.70–0.76 for non-contrast phase features.

Fig. 4
figure 4

ROC curve of two-category models in each phase. (a) Arterial phase; (b) Venous phase; (c) Non-contrast phase

Table 3 Prediction performance of five two-category models in each phase

Table 3 describes the performance measures of the five machine learning methods using 10 features from the single phases. The highest accuracy was 72.7% in the arterial phase, 72.7% in the venous phase, and 76.1% in the non-contrast phase. In summary, the mean accuracy of the final prediction was less than 80.0% for single phases.

Performance of models using combined features from all three CT phases

The experiment used a grid-search algorithm to optimize the model’s hyperparameters. As a comparison, we first used the models without parameter optimization for prediction analysis. Table 4 shows the performance measures of the five machine learning models without parameter optimization for 10 features in three phases. Figure 5(a) shows the ROC curves of the five models. The results show that LR yielded the best performance, with precision, recall, and accuracy of 76.5%, 74.0%, and 78.4%, respectively, and the AUC value was 0.78.

Table 4 Prediction performance of five models without parameter optimization using combined features from all three phases in the test set
Fig. 5
figure 5

ROC curves of models with combined features from all three CT image phases: (a) no parameter optimization; (b) parameter optimization; (c) clinical features included

Table 5 shows the performance measures of five machine learning methods with 10 features in three stages following parameter optimization, and Fig. 5(b) shows the ROC curves of these models. Among them, LR performed best, with precision, recall, accuracy, and AUC values of 85.3%, 76.6%, 83.0%, and 0.86, respectively. Parameter optimization improved the accuracy and AUC of the models.

Table 5 Prediction performance of five models with parameter optimization using combined features from all three phases in the test set

Meanwhile, the patients had statistically significant age differences, so this clinical information was added to the training feature set to train the models and evaluate the effect of age. Table 6 shows the test results of the models after adding this clinical feature, and Fig. 5(c) shows the ROC curve of the five models. The accuracy of the models did not increase significantly after adding the clinical information as a feature, but AUC did increase significantly. Among the models, LR still performed best, with precision, recall, and accuracy values of 85.3%, 76.6%, and 83.0%, respectively, and an AUC of 0.86.

Table 6 The prediction performance of five models after adding a clinical feature to the test set

The p-value of the Delong test from the ROC curve for the LR models among the four CT image sets (three single phases and a combination of the three phases) is shown in Fig. 6. For FAA vs. NAA, the AUC of the LR model after feature fusion was significantly higher than that of the model using only the venous phase features (Delong test, p = 0.049). Although the AUC was also higher than that of the arterial and non-contrast phase models, no significant differences were observed in the Delong test (p = 0.11, 0.27, respectively).

Fig. 6
figure 6

The Delong test compares the ROC curves of the LR model among four CT image sets (three single phases and a combination of the three phases)

Performance using the external validation dataset

In the external validation dataset, the performance of the five machine learning models using radiomic-clinical features is presented in Table 7. LR achieved the highest accuracy of 77.8% (42/54). SVM, RF, GBM, and XGBoost yielded accuracies of 77.8% (42/54), 74.1% (40/54), 74.1% (40/54), and 72.2% (39/54), respectively. Figure 7 shows the ROC curve of the five models using the external validation dataset.

Table 7 The prediction performance of five models using the external validation dataset
Fig. 7
figure 7

ROC curve of models using the external validation dataset

Discussion

In this study, CT radiomic features extracted from CT images have been utilized to differentiate between functioning and non-functioning adrenal adenomas using machine learning models. Differences in demographic and clinical characteristics were observed between the two groups. Ten discriminative features from three-phase CT images were identified and analyzed while differentiating between the two subtypes. The accuracy of the final prediction using LR can reach as high as 83.0%.

Clinical significance of differentiating between adrenal adenoma subtypes

Adrenal adenomas are common urinary tract tumors. For tumor-based diseases, patients and their families are concerned about effective treatment, and early diagnosis of the disease is of great importance for determining the proper method and timing of subsequent treatments [41].

FAA and NAA require different follow-up treatments, including the type of medication and the need for surgery with its attendant risks. There are several difficulties with the current methods of clinical diagnosis. Radiologists cannot always make an accurate diagnosis directly from CT images, which can lead to unnecessary treatment [10]. Currently, patients with FAA need to have multiple invasive blood tests, often including adrenal venous sampling, to measure hormone levels that serve to guide surgeons in resecting the functioning lesion(s). Therefore, advanced diagnosis of NAA without invasive blood tests could reduce risk and discomfort for patients while simplifying and accelerating testing and treatment.

Using our models described here, the accuracy of differentiating FAA from NAA was more than 80%. This method can help doctors quickly estimate the type of adrenal adenoma, rapidly determine the corresponding effective treatment plan, and minimize unnecessary lab testing of patients who do not present with hormonal imbalance. In addition, this method is completely non-invasive, simple, and fast. The only material required is the patient’s CT images, which are routinely collected in the clinic.

Discriminative features between the two subtypes

Most of our patients with FAA and NAA were female, which is consistent with the results of previous studies showing that the incidence of adrenal adenomas is higher in women [5, 42]. In terms of age, FAA typically develops earlier in life than NAA, while the average age is over 50 years, demonstrating that the incidence of adrenal adenomas increases with age [2, 43, 44].

Cluster Shade is a quantity that describes the skewness and uniformity of the GLCM, with higher values indicating greater asymmetry about the mean. The value of Cluster Shade in assessing FAA is higher than that for NAA, which indicates NAA has a more uniform gray level distribution in CT images. Because of the differences in tumor cells that make up FAA, tumor cell morphology may be heterogeneous, thus increasing the heterogeneity of FAA in histopathology [45, 46].

SZNN is a measure of the variability of size zone volumes throughout the image, with a lower value indicating more homogeneity among zone size volumes in the image. The SZNN feature value for FAA was greater than for NAA, and therefore NAA was more homogeneous. The etiology of FAA includes extensive adrenal hyperplasia, which may show clinical, morphological, and molecular heterogeneity [45]. Long Run Emphasis (LRE) is a measure of the distribution of long run lengths, with a greater value indicative of long run lengths and more coarse structural textures. [25]. LRE is higher for NAA than FAA, so NAA has a rougher texture. This may correlate with tumor micro-environment heterogeneity [47]. Therefore, these key radiomic features can assist in analyzing the texture characteristics of tumors and evaluating the symmetry and heterogeneity of tumors.

Model performance after feature fusion

From the experimental results, we have found that among the classification models trained after the fusion of equivalent features from the three phases, LR had the best performance, with an accuracy of 83.0%. Compared with the LR models that only used a single-phase feature, performance improved significantly (arterial phase: 72.7%, venous phase: 72.7%, and non-contrast phase: 76.1%). The Delong test [48] confirmed this observation. Therefore, regardless of the accuracy or AUC of the model, fusing features from the three CT image phases to train the model is better than using only features from a single phase.

Methodological advantages

Table 8 compares our methods with those in previous studies, where three methodological advantages are apparent. First, in our study, we cropped three-dimensional 64*64*32 cuboids containing entire lesions, and features were extracted from 3D ROIs in CT images. Our method contains more feature information than previous studies using features extracted from a single 2D representative section of the largest tumor [49]. Daye et al. [24] used SVM to predict the prognosis of patients with metastatic adrenal tumors. However, the ROI was delineated only on the largest section of the lesion, while the characteristics of the entire 3D tumor were not considered.

Second, more information about the tumor was obtained in our study by fusing features from non-contrast-, arterial- and venous-phase CT images. Koo et al. [50] analyzed the value of 15-minute-delayed contrast-enhanced CT and chemical shift magnetic resonance (CSMR) for identifying adrenal masses and discovered that the delay-enhanced images were more effective for diagnosing adrenal sebaceous adenoma. In the study of Feng et al. [19], feature analysis and selection were performed in three phases of CT images, and the optimal feature subset was finally selected for the construction of the machine learning classification model.

Third, the model we developed can diagnose both FAA and NAA. At the same time, we collected patients’ CT data from two hospitals for validation. Previous studies have primarily focused on the distinction between benign and malignant adrenal adenomas [50, 51], while few studies have distinguished between functioning and non-functioning adrenal adenomas [52]. Additionally, there is no relevant research on the automatic differentiation of adrenal adenomas based on CT images.

Table 8 Comparison of our methods to those of previous studies

Limitations and future work

This study has a few limitations. First, although the number of cases of each subtype was larger than in previous studies, the sample size was relatively small, with about 100 NAA patients. Second, all patients were retrospectively registered from hospitals in the same region, and therefore the generalizability of the model is unclear. In the future, more data from multiple hospitals in multiple regions needs to be collected for testing and verification. Third, the final LR accuracy of the three-phase classification is 83.0%. Although it is a satisfactory result, the prediction would have been in error for nearly one-fifth of the patients with adrenal adenoma. We expect more high-quality CT images from different hospitals to be collected shortly, and advanced deep-learning methods may further improve the model’s predictive power. Additionally, more clinical non-image information can be used to optimize the model.

Conclusion

The radiomic features from the lesion region in three-phase CT images can potentially suggest the functioning or non-functioning nature of adrenal adenoma. The discriminative features identified here may help in understanding the heterogeneity of adrenal adenoma. The resulting radiomic models can be a non-invasive, low-cost, and rapid method to reduce unnecessary testing in asymptomatic patients with incidentally discovered adrenal adenoma.

Availability of data and material

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

AUC:

Area under the curve

CS:

Cushing’s syndrome

CT:

Computed tomography

GLCM:

Gray level co-occurrence matrix

GLDM:

Gray level dependence matrix

GLSZM:

Gray level size zone matrix

GLRLM:

Gray level run length matrix

LASSO:

Least absolute shrinkage and selection operator

ML:

Machine learning

MSE:

Mean square error

FAA:

Functioning adrenocortical adenoma

NAA:

Non-functioning adrenocortical adenoma

NGTDM:

Neighboring gray tone difference matrix

PHA:

Primary hyperaldosteronism

RF:

Random Forest

ROC:

Receiver operating characteristic

SVM:

Support vector machine

GBM:

Gradient Boosting Machine

XGB:

eXtreme Gradient Boosting

References

  1. Davenport C, Liew A, Doherty B, Win HHN, Misran H, Hanna S, et al. The prevalence of adrenal incidentaloma in routine clinical practice. Endocrine. 2011;40(1):80–3.

    Article  CAS  Google Scholar 

  2. Barzon L, Sonino N, Fallo F, Palu G, Boscaro M. Prevalence and natural history of adrenal incidentalomas. Eur J Endocrinol. 2003;149(4):273–85.

    Article  CAS  Google Scholar 

  3. Saeger W. Histopathological classification of adrenal tumours. Eur J Clin Invest. 2000;30:58–62.

    Article  Google Scholar 

  4. Izaki H, Fukumori T, Takahashi M, Taue R, Kishimoto T, Tanimoto S, et al. Indications for laparoscopic adrenalectomy for non-functional adrenal tumor with hypertension: usefulness of adrenocortical scintigraphy. Int J Urol. 2006;13(6):677–81.

    Article  Google Scholar 

  5. Lacroix A, Feelders RA, Stratakis CA, Nieman LK. Cushing’s syndrome. Lancet. 2015;386(9996):913–27.

    Article  CAS  Google Scholar 

  6. Douma S, Petidis K, Doumas M, Papaefthimiou P, Zamboulis C. Prevalence of primary hyperaldosteronism in resistant hypertension: a retrospective observational study. Lancet. 2008;371(9628):1921–6.

    Article  CAS  Google Scholar 

  7. Findling JW, Raff H. Clinical review: Cushing’s syndrome: important issues in diagnosis and management. J Clin Endocrinol Metabolism. 2006;91(10):3746–53.

    Article  CAS  Google Scholar 

  8. Wiebke A, Michael B, Taylor AE, Stefanie H, Rossella L, Hughes BA et al. Urine Steroid Metabolomics as a Biomarker Tool for Detecting Malignancy in Adrenal Tumors.Journal of Clinical Endocrinology & Metabolism. (12):3775.

  9. Rosario, Pivonello, Monica D, Leo A et al. The Treatment of Cushing’s Disease.Endocrine reviews. 2015.

  10. Cawood TJ, Hunt PJ, O’Shea D, Cole D, Soule S. Recommended evaluation of adrenal incidentalomas is costly, has high false-positive rates and confers a risk of fatal cancer that is similar to the risk of the adrenal lesion becoming malignant; time for a rethink? Eur J Endocrinol. 2009;161(4):513.

    Article  CAS  Google Scholar 

  11. Fassnacht M, Arlt W, Bancos I, Dralle H, Newell-Price J, Sahdev A, et al. Management of adrenal incidentalomas: european society of Endocrinology Clinical Practice Guideline in collaboration with the European Network for the study of adrenal tumors. Eur J Endocrinol. 2016;175(2):G1.

    Article  CAS  Google Scholar 

  12. Crona J, Beuschlein F, Pacak K, Skogseid B. Advances in adrenal tumors 2018. Endocrine-related Cancer. 2018;25(7):R405–R20.

    Article  CAS  Google Scholar 

  13. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, Aerts HJWL. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;43(4):441–6.

    Article  Google Scholar 

  14. A MA, B JS CIEN. Beyond imaging: the promise of radiomics. Physica Med. 2017;38:122–39.

    Article  Google Scholar 

  15. Gillies RJ, Kinahan PE, Hricak H, Radiomics. Images Are More than Pictures, They Are Data. 2013.

  16. Jordan MI, Mitchell TM. Machine learning: Trends, perspectives, and prospects. Science. 2015;349(6245):255–60.

    Article  CAS  Google Scholar 

  17. Wang S, Summers RM. Machine learning and radiology. Med Image Anal. 2012;16(5):933–51.

    Article  CAS  Google Scholar 

  18. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology. 2016;278(2):563–77.

    Article  Google Scholar 

  19. Feng Z, Rong P, Peng C, Zhou Q, Zhu W, Yan Z, et al. Machine learning-based quantitative texture analysis of CT images of small renal masses: differentiation of angiomyolipoma without visible fat from renal cell carcinoma. Eur Radiol. 2018;28(4):1625–33.

    Article  Google Scholar 

  20. Radiomics machine-learning. Signature for diagnosis of hepatocellular carcinoma in cirrhotic patients with indeterminate liver nodules. Eur Radiol. 2020;30(1):558–70.

    Article  Google Scholar 

  21. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts H. Machine learning methods for quantitative Radiomic biomarkers. Rep. 2015;5:13087.

    CAS  Google Scholar 

  22. Mme A, Df A, Mah B, Prb C, Aaq C et al. A E,. Machine learning-based texture analysis for differentiation of large adrenal cortical tumours on CT. Clinical Radiology. 2019;74(10):818.e1-.e7.

  23. Moawad AW, Ahmed A, Fuentes DT, Hazle JD, Habra MA, Elsayes KM. Machine learning-based texture analysis for differentiation of radiologically indeterminate small adrenal tumors on adrenal protocol CT scans. Abdom Radiol. 2021;46(10):4853–63.

    Article  Google Scholar 

  24. Daye D, Staziaki PV, Furtado VF, Tabari A, Fintelmann FJ, Frenk NE, et al. CT texture analysis and machine learning improve post-ablation prognostication in patients with adrenal metastases: a proof of Concept. Cardiovasc Interv Radiol. 2019;42(12):1771–6.

    Article  Google Scholar 

  25. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational Radiomics System to Decode the Radiographic phenotype. Cancer Res. 2017;77(21):E104–E7.

    Article  Google Scholar 

  26. Pyradiomics Documentation. Available via https://pyradiomics.readthedocs.io/en/latest/. Accessed 6 July 2019

  27. van der Schaaf A, Xu CJ, van Luijk P, van’t Veld AA, Langendijk JA, Schilstra C. Multivariate modeling of complications with data driven variable selection: guarding against overfitting and effects of data set size. Radiother Oncol. 2012;105(1):115–21.

    Article  Google Scholar 

  28. Muthukrishnan R, Rohini R. LASSO: A Feature Selection Technique In Predictive Modeling For Machine Learning. In: IEEE International Conference on Advances in Computer Applications (ICACA): Oct 24 2016; Bharathiar Univ, Coimbatore, INDIA; 2016: 18–20.

  29. Chang R, Qi S, Zuo Y, Yue Y, Zhang X, Guan Y, Qian W. Predicting chemotherapy response in non-small-cell lung cancer via computed tomography radiomic features: Peritumoral, intratumoral, or combined? Front Oncol. 2022;12:915835.

    Article  Google Scholar 

  30. Liu Z, Wang S, Dong D, Wei J, Fang C, Zhou X, Sun K, Li L, Li B, Wang M, Tian J. The applications of Radiomics in Precision diagnosis and treatment of Oncology: Opportunities and Challenges. Theranostics. 2019;9(5):1303–22.

    Article  Google Scholar 

  31. Tibshirani\T R. the lasso method for variable selection in the cox model. 2017.

  32. Tibshirani R. Regression shrinkage and selection via the lasso: a retrospective. J Royal Stat Society: Ser B (Statistical Methodology). 2011;73(3):267–88.

    Google Scholar 

  33. Breiman L. Random forests, machine learning 45. J Clin Microbiol. 2001;2:199–228.

    Google Scholar 

  34. Liaw A, Wiener M. Classification and Regression by randomForest.R News. 2002;23(23).

  35. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.

    Article  Google Scholar 

  36. Sollich GP. Model selection for support vector machine classification.Neurocomputing. 2003.

  37. Wu S, Zheng J, Li Y, Yu H, Shi S, Xie W, et al. A Radiomics Nomogram for the preoperative prediction of Lymph Node Metastasis in bladder Cancer. Clin Cancer Res Official J Am Association Cancer Res. 2017;23(22):6904.

    Article  CAS  Google Scholar 

  38. Horeweg N, van Rosmalen J, Heuvelmans MA, van der Aalst CM, Vliegenthart R, Scholten ET, et al. Lung cancer probability in patients with CT-detected pulmonary nodules: a prespecified analysis of data from the NELSON trial of low-dose CT screening. Lancet Oncol. 2014;15(12):1332–41.

    Article  Google Scholar 

  39. Cawley GC, Talbot NLC. On over-fitting in Model Selection and subsequent selection Bias in performance evaluation. J Mach Learn Res. 2010;11:2079–107.

    Google Scholar 

  40. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O et al. Scikit-learn:Machine Learning in Python. 2012.

  41. Zeiger MA, Siegelman SS, Hamrahian AH. Medical and Surgical evaluation and treatment of adrenal incidentalomas. J Clin Endocrinol Metabolism. 2011;96(7):2004–15.

    Article  CAS  Google Scholar 

  42. Audenet F, Mejean A, Chartier-Kastler E, Roupret M. Adrenal tumours are more predominant in females regardless of their histological subtype: a review. World J Urol. 2013;31(5):1037–43.

    Article  Google Scholar 

  43. Young WF. Diagnosis and treatment of primary aldosteronism: practical clinical perspectives. J Intern Med. 2019;285(2):126–48.

    Article  Google Scholar 

  44. Duan K, Hernandez GK, Mete O. Clinicopathological correlates of adrenal Cushing’s syndrome (vol 68, pg 175, 2015).Journal of Clinical Pathology. 2015;68(7).

  45. Mete O, Duan K. The Many Faces of Primary Aldosteronism and Cushing Syndrome: A Reflection of Adrenocortical Tumor Heterogeneity.Frontiers in Medicine. 2018;5.

  46. Duan K, Mete O. Clinicopathologic correlates of primary Aldosteronism. Arch Pathol Lab Med. 2015;139(7):948–54.

    Article  CAS  Google Scholar 

  47. Junttila MR, de Sauvage FJ. Influence of tumour micro-environment heterogeneity on therapeutic response. Nature. 2013;501(7467):346–54.

    Article  CAS  Google Scholar 

  48. Molodianovitch K, Faraggi D, Reiser B. Comparing the areas under two correlated ROC curves: Parametric and non-parametric approaches. Biom J. 2006;48(5):745–57.

    Article  Google Scholar 

  49. Beig N, Khorrami M, Alilou M, Prasanna P, Braman N, Orooji M, et al. Perinodular and Intranodular Radiomic features on lung CT images distinguish adenocarcinomas from Granulomas. Radiology. 2019;290(3):783–92.

    Article  Google Scholar 

  50. Koo HJ, Choi HJ, Kim HJ, Kim SO, Cho KS. The value of 15-minute delayed contrast-enhanced CT to differentiate hyperattenuating adrenal masses compared with chemical shift MR imaging. Eur Radiol. 2014;24(6):1410–20.

    Article  Google Scholar 

  51. Vos EL, Grewal RK, Russo AE, Reidy-Lagunes D, Untch BR, Gavane SC, et al. Predicting malignancy in patients with adrenal tumors using(18)F-FDG-PET/CT SUVmax. J Surg Oncol. 2020;122(8):1821–6.

    Article  Google Scholar 

  52. Chen S, Lin H, Zhang H, Guo FC, Zhu SS, Cui XY, et al. Identifying functioning and nonfunctioning adrenal tumors based on blood serum surface-enhanced Raman spectroscopy. Anal Bioanal Chem. 2021;413(16):4289–99.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This work was partly supported by the National Natural Science Foundation of China under Grant (Nos. 82072008), the 69th China Postdoctoral Science Foundation (2021M693916), Liaoning Natural Science Foundation (2021-YGJC-21), and the Fundamental Research Funds for the Central Universities (N2119010, N2224001-10).

Author information

Authors and Affiliations

Authors

Contributions

SQ performed experiments and analyzed the data along with YZ. SQ, JL, and ZZ conceived the study, presented the results, and wrote the manuscript along with YZ. KH and JL collected and analyzed the data. RC supervised the algorithm development and analyzed the data. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jing Liu or Zhe Zhang.

Ethics declarations

Ethics approval and consent to participate

The Ethics Committee of The First Hospital of China Medical University and Shengjing Hospital of China Medical University approved this retrospective study, and the informed consent was waived. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qi, S., Zuo, Y., Chang, R. et al. Using CT radiomic features based on machine learning models to subtype adrenal adenoma. BMC Cancer 23, 111 (2023). https://doi.org/10.1186/s12885-023-10562-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12885-023-10562-6

Keywords