Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study

Zhan, Zhouwei; Chen, Bijuan; Cheng, Hui; Xu, Shaohua; Huang, Chunping; Zhou, Sijing; Chen, Haiting; Lin, Xuanping; Lin, Ruyu; Huang, Wanting; Ma, Xiaohuan; Fu, Yu; Chen, Zhipeng; Zheng, Hanchen; Shi, Songchang; Guo, Zengqing; Zhang, Lihui

doi:10.1186/s12885-024-12303-9

Research
Open access
Published: 30 April 2024

Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study

Zhouwei Zhan¹^na1,
Bijuan Chen²^na1,
Hui Cheng³^na1,
Shaohua Xu⁴,
Chunping Huang⁵,
Sijing Zhou¹,
Haiting Chen⁶,
Xuanping Lin⁶,
Ruyu Lin⁶,
Wanting Huang⁶,
Xiaohuan Ma⁶,
Yu Fu⁶,
Zhipeng Chen⁶,
Hanchen Zheng¹,
Songchang Shi⁷^na2,
Zengqing Guo¹^na2 &
…
Lihui Zhang⁷^na2

BMC Cancer volume 24, Article number: 547 (2024) Cite this article

279 Accesses
Metrics details

Abstract

Objective

The purpose of this study was to develop an individual survival prediction model based on multiple machine learning (ML) algorithms to predict survival probability for remnant gastric cancer (RGC).

Methods

Clinicopathologic data of 286 patients with RGC undergoing operation (radical resection and palliative resection) from a multi-institution database were enrolled and analyzed retrospectively. These individuals were split into training (80%) and test cohort (20%) by using random allocation. Nine commonly used ML methods were employed to construct survival prediction models. Algorithm performance was estimated by analyzing accuracy, precision, recall, F1-score, area under the receiver operating characteristic curve (AUC), confusion matrices, five-fold cross-validation, decision curve analysis (DCA), and calibration curve. The best model was selected through appropriate verification and validation and was suitably explained by the SHapley Additive exPlanations (SHAP) approach.

Results

Compared with the traditional methods, the RGC survival prediction models employing ML exhibited good performance. Except for the decision tree model, all other models performed well, with a mean ROC AUC above 0.7. The DCA findings suggest that the developed models have the potential to enhance clinical decision-making processes, thereby improving patient outcomes. The calibration curve reveals that all models except the decision tree model displayed commendable predictive performance. Through CatBoost-based modeling and SHAP analysis, the five-year survival probability is significantly influenced by several factors: the lymph node ratio (LNR), T stage, tumor size, resection margins, perineural invasion, and distant metastasis.

Conclusions

This study established predictive models for survival probability at five years in RGC patients based on ML algorithms which showed high accuracy and applicative value.

Peer Review reports

Introduction

Remnant gastric cancer (RGC), also known as gastric stump cancer, was initially reported by Balfour in 1922 as a cancer developing in the remnant stomach following previous gastric surgery for peptic ulcer disease (PUD)[1, 2]. More recently, the definition of RGC has evolved, and it is now described as any cancer occurring in the residual stomach following a previous partial gastrectomy for benign or malignant conditions[3]. In literature, the incidence of RGC ranges approximately from 1 to 7%[4,5,6,7,8]. Due to the absence of specific symptoms, RGC is often diagnosed at an advanced stage, resulting in low surgical resection rates and poor prognoses, making it an important clinical concern[4, 5]. The surgical outcomes for RGC vary across studies, with 5-year survival rates ranging from 7 to 80%[6, 9,10,11,12].

As the number of gastrectomies continues to rise, the incidence of RGC is escalating annually[13]. It’s crucial to identify relevant prognostic factors for RGC and develop effective follow-up treatment strategies. In clinical practice, the adjacent gastric mucosa in RGC demonstrates a lower degree of atrophy when compared to cases of primary gastric cancer (GC), which suggests a unique underlying pathological mechanism[14]. Furthermore, there is a significantly heightened incidence of serosal tumor invasion in RGC, affecting between 37 to 48% of patients, contrasting sharply with the rate of 19% seen in primary GC[15]. Additionally, surgical procedures for RGC result in a notably smaller total number of harvested lymph nodes compared to those in primary GC, particularly when the preceding surgery was for gastric malignancy, since the nodes would have already been removed. As such, the lymph node grouping applied in the TNM classification system for primary GC may not be suitable for staging RGC[16]. Moreover, RGC shows a significantly higher overall frequency of splenic hilar lymph node involvement when compared to primary GC. It is worth noting that jejunal mesentery lymph node involvement is predominantly observed following Billroth II reconstruction surgeries[17, 18].

RGC often exhibits a higher rate of invasion into adjacent organs, and lymph node metastasis is frequently observed[19], which can lead to a worse prognosis than primary GC[20]. However, some studies suggest that RGC prognoses are similar to primary GC[21]. Prior research has investigated the clinical characteristics of resectable RGC in small case studies, but the factors influencing patient outcomes remain unclear or controversial[22,23,24]. A meta-analysis disclosed that the significance of tumor location on survival varies among studies. Some literature indicates that tumor location does not significantly impact survival rates[25, 26], while other research reports that anastomotic site tumors may be a favorable prognostic factor[27]. Nonetheless, patients with anastomotic site tumors experience worse outcomes[23]. Thus, additional research is necessary to resolve this discrepancy.

Machine learning (ML) constitutes the bedrock of contemporary artificial intelligence advancements[28]. Although these algorithms have demonstrated substantial triumphs across various disciplines, their integration into the realms of medicine and healthcare is still in its nascent stages. The non-linear nature of real-world data impacts often challenges the effectiveness of traditional models like Linear Regression for classification forecasts and Cox Regression for predicting survival outcomes, as they are confined within a linear framework[29, 30]. In comparison with traditional mathematical models, ML excels notably in handling tasks related to classification and regression, finding broad application in developing predictive frameworks, determining tumor stages, and prognostic groupings[31,32,33,34].

ML can facilitate various problems, from patient-level observations to employing algorithms with numerous variables, seeking combinations, and ultimately reliably predicting risks and outcomes[35]. Numerous studies have developed valuable models utilizing ML techniques[36,37,38,39]. However, there is a dearth of research exploring the application of ML for predicting survival outcomes in RGC patients. Although ML presents significant benefits in constructing models to identify risk factors, the “black-box” nature of ML algorithms poses challenges in explaining why specific predictions are made for patients. In pursuit of these objectives, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced[40, 41]. The SHAP method allows for the recognition and prioritization of attributes that influence complex classification and activity forecasting utilizing any ML model. Developing a visual predictive model to assist healthcare professionals in identifying individuals with poor prognoses would be advantageous.

Consequently, a central objective of our research was to construct and evaluate ML-based survival prediction models for patients with remnant stomach cancer over a five-year period. This endeavor encompassed not only the development of multiple ML algorithms but also an emphasis on visualizing these models to gain deeper insights into their inner workings. Furthermore, our study aimed to juxtapose the efficacy of these ML models against that of traditional linear regression models, thereby shedding light on the distinctive contributions and potential superiority of ML approaches in forecasting survival probabilities for this patient population. Through visualization, we sought to enhance interpretability and transparency, enabling a comprehensive evaluation and understanding of the complex relationships learned by the ML models in the context of RGC survival prediction.

Data and methods

Patients

Patients with RGC were enrolled at two tertiary hospitals (Fujian Provincial Hospital from June 2008 to May 2022, and Fujian Cancer Hospital from June 1999 to August 2021). RGC was characterized as an adenocarcinoma originating in the remnant stomach subsequent to a gastric resection for either a benign or malignant condition[3, 14, 42]. A total of 366 individuals participated in this study. Inclusion criteria consisted of patients who underwent surgical treatment, including radical and palliative surgery, with a follow-up duration of > 5 years or those who died. Patients with a history of neoadjuvant therapy, R1/R2 resection in previous gastrectomy, other malignant diseases within the past 5 years, death within 3 months after surgery, different pathological types, or incomplete clinicopathological data were excluded. Furthermore, patients with a follow-up duration of less than 5 years, no endpoints observed, or missing values exceeding 20% were also excluded from the study. Based on the inclusion and exclusion criteria, 286 participants remained in the study. The study’s flow chart is presented in Fig. 1. The study protocol adhered to the ethical guidelines of the 1995 Declaration of Helsinki, and was approved by the ethics committee of Fujian Cancer Hospital (ethical approval number K2021-100–01) and Fujian Provincial Hospital (ethical approval number K2022-08–034).

Data collection

Follow-up procedures encompassed outpatient visits, hospital appointments, and telephone inquiries. The follow-up period concluded on December 31, 2023. Patients’ survival time (in months) was calculated from the date of surgery to the date of death or the end of follow-up. Retrospective analysis was conducted on preoperative information (age, initial gastric disease, initial reconstruction methods, and interval between the initial surgery and RGC resection), operative details (operative approaches, combined resections, and either curative (R0) or non-curative resections (R1/2)), and postoperative data (RGC tumor location, histopathological findings, lymph nodes ratio (LNR), venous and perineural invasions, follow-up duration, and adjuvant therapy). TNM staging was performed according to the AJCC/UICC staging criteria (8th edition) after RGC surgery[43]. Histological types were classified as highly differentiated, moderately differentiated, and lowly differentiated (including signet-ring cell carcinoma, poorly differentiated, or mucinous). Tumor locations were categorized as anastomotic and non-anastomotic sites.

Study outcomes

The primary endpoint of the study was all-cause mortality within the 5-year follow-up period. All-cause mortality was defined as death resulting from any cause.

Feature selection and data preprocessing

ML algorithms were implemented in Python software, and the data were organized in the format required for applying these algorithms. Samples were classified into healthy or sepsis groups based on the outcome indicators for the classification prediction model. The K-Nearest Neighbor (KNN) algorithm[44] was used to fill in missing data. To prevent non-normal distributed features from causing incorrect outcomes in ML estimators, logistic regression (with L2 penalty and c = 0.01) was employed as an external estimator, assigning weights to each feature. This approach facilitated accurate and reliable predictions in our study.

Model development

Nine ML algorithms, including Artificial Neural Network (ANN), CatBoost, Decision Tree, Gradient Boosting Machine (GBM), Gaussian Naive Bayes (GNB), K-Nearest Neighbor (KNN), Logistic Regression, Random Forest, and Support Vector Machine (SVM), were employed to develop prognostic models. These models were compared with Linear Regression[45]. To divide the 286 patients into a training and a testing set, stratified random sampling was utilized based on the occurrence of the endpoint. The 8:2 ratio resulted in a training set of 228 patients and a test set of 58 patients.

Model performance evaluation

Various metrics and scoring methods were employed to quantify the accuracy of predictions, including application to the evaluated estimators such as accuracy, precision, recall, and F1-score. The model’s discrimination capability was assessed using the receiver operating characteristic (ROC) curve. To prevent overfitting, repeated resampling, model fitting, and evaluation were utilized. Additionally, decision curve analysis (DCA) and calibration curves were applied to calibrate the model and provide support for probability predictions.

Model interpretation

The Shapley Additive explanation (SHAP) package[46], a method for uniformly measuring feature importance in ML models, was employed for visualizing and explaining the prediction model. SHAP-based explanations offer a solid theoretical foundation and are the only attribution method that satisfies local accuracy, missingness, and consistency requirements[47]. The SHAP beeswarm plot provides a visual overview of the entire model, while sorting feature variables and creating scatter plots help explain the model. The SHAP dependence plot is used to visualize feature interactions and SHAP values, while the SHAP force plot enables visualization of the model at an individual level. We utilized SHAP to offer an explanation for our predictive model, which includes relevant risk factors contributing to mortality in patients with gastric stump cancer. This interpretation helps to enhance understanding of the model’s predictions and the factors influencing patient outcomes.

Statistical analysis

Numerical variables with normal distributions were presented as mean ± SD, while those without normal distributions were represented by median (lower quartile, upper quartile). Categorical variables were expressed as the sum (percentage). Data preprocessing was performed using R software (version 3.6.3). For missing data imputation, KNN[44], Sklearn[48], and SHAP packages[46], in Python (version 3.7) were utilized respectively. The KNN package filled in missing data, while the Sklearn package built and verified the risk models. The SHAP package was used for model visualization and explanation. All models were constructed using the Sklearn package.

Result

Clinicopathological features of RGC

A final dataset consisting of 286 patients with RGC was obtained based on the inclusion criteria. This included 250 male patients (87.4%) and 36 female patients (12.6%). The average age of all patients was 64.3 ± 10.7 years. During a 5-year follow-up period, 142 patients (49.65%) passed away. The basic participant information is presented in Table 1. The dataset encompassed 19 clinical features, including those related to the outcome variable. To prevent later model construction from being influenced by significantly correlated features, the linear correlation between continuous numerical variables in the dataset was analyzed. As shown in Supporting Information 1, there were no significantly correlated variables (r < 0.8). This ensures that the constructed model is minimally affected by redundant or confounding factors.

Table 1 The basic information of participants

Full size table

Feature variable selection

The data was prepared in the required format for implementing the ML algorithm. Nineteen observation indices were assessed for missing values. Aside from three instances where T-stage information was absent, no other variables exhibited any missing data. To fill in missing data, the K-Nearest Neighbor method was employed. For feature selection, recursive feature elimination (RFE) was utilized to enhance estimators’ accuracy scores or improve their performance on highly dimensional datasets. Logistic regression (with L2 penalty, c = 0.01, n = 10) was used as an external estimator to assign weights to features. This approach ensures that the selected features contribute effectively to the model’s predictive accuracy and performance.

Model performance

The predictive performance of the model during both training and testing, as measured by the AUC value, is detailed within Supporting Information 2. The confusion matrices illustrating the performance of the models trained on the test dataset are presented in Fig. 2. Upon comparison with conventional methodologies, the ML-built models showcased enhanced performance. Among all the models, CatBoost models emerged as having the highest f1-scores. The AUC ranged from 0.60 to 0.76 for the test set (refer to Supporting Information 3). Other metrics and scoring methods for quantifying the quality of risk models, such as False Negative Rate (FNR), False Positive Rate (FPR), False Discovery Rate (FDR), and False Omission Rate (FOR), are presented in Supplementary Table 4. Cross-validation serves as a principal method for internal validation[49], and in this study, five-fold cross-validation was employed. Table 2 showcases the performance metrics of the ML algorithms after being subjected to five-fold cross-validation on the test data. Notably, the KNN models achieved the most outstanding test set and f1-scores. Figure 3 further illuminates that, aside from the decision tree model, all other models delivered commendable performances, with an average AUC of the ROC exceeding 0.7, indicating their robustness and predictive capabilities.

Table 2 Metrics and Scoring for Quantifying the Quality of Model Performance with 5-Fold Stratified Cross-Validation on Test Set

Full size table

DCA is a method to determine whether using a prediction model for clinical decision-making provides benefits[50, 51]. In DCA, the net benefit is compared between two strategies: “treat all” and “treat none”. The optimal strategy is the one with the highest net benefit at a specific threshold probability. For the majority of models, the net benefit of the decision curve was higher than that for either “treat all” or “treat none” across all likely threshold probabilities. The GNB model showed a significant decrease in net benefit when threshold probabilities exceeded 80%. For the other eight models, a high net benefit was observed over a wide range of threshold probabilities. Consequently, the DCA results indicated that the constructed models could aid clinical decision-making to improve patient outcomes (Fig. 4).

Furthermore, the calibration curve was assessed to evaluate another measure of discrimination. The reference line is diagonal, and the calibration curve aligns with the reference when the predicted value equals the observed value. The curve is below the reference when risk is overestimated, and above when risk is underestimated. Figure 5 demonstrates that except for the decision tree model, the predicted values of the other eight models exhibited good performance.

Visualization and explanation of models

The 5-year death prediction model based on ML techniques performed satisfactorily in terms of model validity and clinical net benefit. Nonetheless, the opaque nature of ML models creates a lack of transparency. SHAP values reveal the individual contributions of each feature to the final prediction, effectively clarifying and interpreting model predictions for specific patients. After sorting features, SHAP was applied to distinguish the feature values for the selected variable (Fig. 6A). To explain the CatBoost-based model, the SHAP summary plot was utilized. The study findings suggested that a high lymph node ratio (red) had a negative impact on prognosis, while a low lymph node ratio (blue) contributed positively. Concurrently, a high Tstage (red) showed a negative effect on prognosis, whereas a low Tstage (blue) had a positive influence on the patient’s outlook. The results corresponded to those concerning resection margins, positive metastasis, and perineural invasion.

After several years of development, traditional ML methods have become capable of displaying feature variables. However, these methods fail to demonstrate the positive and negative relationships between features within the model (Fig. 6B).

The SHAP Dependence Plot enables visualization of the effects within the model. Each dot represents a sample (Fig. 6C). It was observed that as the T stage increased, so did the SHAP values. The SHAP Force Plot illustrates the individual level within the model. Figure 6D demonstrates the significance of influencing factors for the three subjects in the RGC. In comparison to the first sample (SHAP, 1.03) and third sample (SHAP, 1.54), the second sample (SHAP, -0.41) belonged to the low-risk group, possessing a decreased risk of 5-year death. Variables influencing the model’s outcomes are listed below the horizontal axis. Different individuals might have identical or slightly varying key variables affecting their outcomes.

Discussion

Our research harnessed ML techniques to create a set of ML models skilled at forecasting five-year survival prognoses for RGC following surgery. This is the first investigation to examine prognostic risk factors for RGC utilizing ML models. Through the development and validation of this model, we have showcased its consistent performance and superior reproducibility. Significantly, our risk model not only demonstrates robust stability compared to conventional techniques but also addresses the ‘black box’ issue associated with ML models by incorporating model visualization techniques. By visualizing the model, we enable healthcare professionals to more effectively discern post-surgery survival outcomes. These predictive indicators potentially grant clinicians an enhanced ability to tailor care strategies, thereby optimizing risk factor management for high-risk patients.

The proficiency, user-friendliness, and resilience of ML models in recognizing complex data significantly surpass traditional statistical models, overcoming their limitations regarding statistical efficiency[49]. In ML models, classes can be utilized for feature selection or dimensionality reduction to enhance the model’s accuracy score or improve its performance on high-dimensional datasets[52]. Gradient boosted decision trees (GBDTs), including XGBoost, LightGBM, and CatBoost, are potent tools for big data classification tasks. Our method provides not only a precise and clinically feasible technique for predicting RGC patient survival outcomes but also enhances the interpretability of the predictions. The SHAP value quantifies each feature marker’s contribution to the model’s identification results, enabling comprehensive global explanations[46, 53, 54]. The predictive capacity of a clinical factor in the XGBoost model elevates as the average absolute SHAP value of each factor rises. To obtain a uniform perspective, these factors were consolidated, and SHAP interpretation drew from individual patients. SHAP effectively addresses multicollinearity issues and determines whether an influence is beneficial, thanks to its ability to consider both individual factor effects and their synergies[41]. According to the SHAP values, LNR, T stage, tumor size, resection margins, perineural invasion, and distant metastasis were determined as the most crucial factors in identifying five-year survival prognoses for RGC. In essence, these factors can be considered an optimal subset representing the key players in survival risk assessment for RGC patients. The interpretability of the optimal subset stems from capturing and visualizing the effect direction of each feature and its contribution size to the prediction. This enables clinicians to gain specific insights into how individual predictions are influenced by various variables, affording a personalized, fine-grained understanding of different patients’ prognoses.

Most reports indicate that RGC is often diagnosed at an advanced stage, leading to a relatively low rate of curative resection and unfavorable prognosis. This suggests that RGC may possess distinct biological characteristics from primary GC[1, 55, 56]. However, some researchers have compared RGC to primary GC and found no significant difference in survival rates between the two[57,58,59]. A few studies have investigated the clinicopathologic features and prognosis of RGC, but consensus has not been reached yet[1, 60, 61]. Similar to prior research[56, 62, 63], our study noted that more than 80% of RGC patients were male. This may be attributed to the fact that men are more susceptible to developing both gastroduodenal ulcers and GC[64, 65].

In the majority of studies, RGC lymph node staging adheres to the UICC/AJCC grading criteria. However, in first-time GC patients, postoperative lymph node drainage changes and the lymph nodes detected by RGC cannot comprehensively determine the N stage, particularly given the occurrence of RGC after GC. The total number of postoperative lymph node dissections during re-surgery typically does not exceed 10, which is significantly fewer than the number of lymph nodes dissected by RGC after surgery for benign lesions. This may lead to inaccurate staging. A study analyzed the prognostic significance of LNR in resectable RGC using retrospective propensity score matching and found that LNR served as an independent prognostic factor for RGC, while the number of positive lymph nodes did not act as an independent prognostic factor[42]. Our study reinforced this notion using an ML method. Therefore, LNR may be a more dependable prognostic factor for RGC patients. However, some studies suggest that LNR is not superior to the number of positive lymph nodes[66]. Further analysis incorporating data from multiple centers with larger sample sizes is necessary.

Another study identified lymphatic invasion and pathological T stage as risk factors for lymph node metastasis in RGC[67]. Many researchers have proposed that high rates of adjacent organ invasion and lymph node metastasis contribute to RGC’s poorer prognosis[19, 20]. Nonetheless, one study found pathological T stage and venous invasion to be significant independent risk factors for survival among RGC patients[68]; however, pathological N stage showed no significant association with long-term survival[68]. This contradicts our study’s findings. In our research, venous infiltration was not included in the prognostic model, suggesting it is not an independent prognostic factor, and nerve invasion plays a crucial role. Given their small sample size (65 cases) and single-center retrospective study, the prognostic value of venous infiltration deserves further examination. It has been demonstrated that tumor site affects RGC’s prognosis[22, 23, 27]. RGC’s tumor location is a vital factor for predicting recurrence patterns and overall survival[69]. However, in our study, tumor location at the anastomotic site did not act as an independent prognostic factor, which aligns with previous reports[70, 71].

The current study unavoidably has several limitations. Firstly, due to its retrospective nature, there was selection bias. Secondly, the sample size was relatively small. Thirdly, some crucial information was incomplete or missing, likely caused by difficulties in gathering data about the initial operation. Further prospective studies involving RGC patients are necessary to comprehensively explore the clinicopathological characteristics of RGC.

Given the primary aim of our research to optimize the use of pathological features in predicting mortality risks for post-gastrectomy GC patients, we intentionally confined our analysis to these specific characteristics. Consequently, we did not incorporate other potentially influential mortality risk factors, such as comorbidities, laboratory indices, and other clinical attributes for stratification purposes. This deliberate focus on pathology data alone may have limited the model's ability to achieve its maximum predictive capacity. Nonetheless, this study serves as a foundational step towards refining risk prediction. Moving forward, we plan to extend our work by integrating additional clinical indicators and biomarkers to construct a more refined and comprehensive predictive model. Such a holistic approach will likely enhance the precision and practicality of risk assessment in this patient population.

Conclusion

In summary, utilizing the CatBoost ML model to develop a prognostic risk model for RGC can effectively assist clinicians in predicting patient outcomes, outperforming traditional ML methods. Moreover, combining SHAP and ML may serve as a suitable approach to identify individuals with poor prognoses.

Availability of data and materials

For reproducibility and transparency, the original data and the code used in this study are made publicly available at: https://github.com/Xiaofan2023/FJMUZZW-ML-Diagnostic-Model.git.

Abbreviations

ML:: Machine learning
RGC:: Remnant gastric cancer
GC:: Gastric cancer
AUC:: Area under the curve
DCA:: Decision curve analysis
SHAP:: SHapley Additive exPlanations
LNR:: Lymph nodes ratio
PUD:: Peptic ulcer disease
AJCC:: American Joint Committee on Cancer
UICC:: Union for International Cancer Control
KNN:: K-nearest neighbor
ANN:: Artificial neural network
GBM:: Gradient boosting machine
GNB:: Gaussian NB
SVM:: Support vector machine
ROC:: Receiver operating characteristic
DCA:: Decision curve analysis
RFE:: Recursive feature elimination
FNR:: False Negative Rate
FDR :: False Discovery Rate
FOR:: False Omission Rate
GBDT:: Gradient boosted decision tree

References

Ohira M, Toyokawa T, Sakurai K, Kubo N, Tanaka H, Muguruma K, Yashiro M, Onoda N, Hirakawa K. Current status in remnant gastric cancer after distal gastrectomy. World J Gastroenterol. 2016;22(8):2424–33.
Article CAS PubMed PubMed Central Google Scholar
Balfour DC. Factors influencing the life expectancy of patients operated on for gastric ulcer. Ann Surg. 1922;76(3):405–8.
Article CAS PubMed PubMed Central Google Scholar
Mak TK, Guan B, Peng J, Chong TH, Wang C, Huang S, Yang J. Prevalence and characteristics of gastric remnant cancer: a systematic review and meta-analysis. Asian J Surg. 2021;44(1):11–7.
Article PubMed Google Scholar
Onodera H, Tokunaga A, Yoshiyuki T, Kiyama T, Kato S, Matsukura N, Masuda G, Tajiri T. Surgical outcome of 483 patients with early gastric cancer: prognosis, postoperative morbidity and mortality, and gastric remnant cancer. Hepatogastroenterology. 2004;51(55):82–5.
PubMed Google Scholar
Ikeda Y, Saku M, Kishihara F, Maehara Y. Effective follow-up for recurrence or a second primary cancer in patients with early gastric cancer. Br J Surg. 2005;92(2):235–9.
Article CAS PubMed Google Scholar
Thorban S, Böttcher K, Etter M, Roder JD, Busch R, Siewert JR. Prognostic factors in gastric stump carcinoma. Ann Surg. 2000;231(2):188–94.
Article CAS PubMed PubMed Central Google Scholar
Sons HU, Borchard F. Gastric carcinoma after surgical treatment for benign ulcer disease: some pathologic-anatomic aspects. Int Surg. 1987;72(4):222–6.
CAS PubMed Google Scholar
Mezhir JJ, Gonen M, Ammori JB, Strong VE, Brennan MF, Coit DG. Treatment and outcome of patients with gastric remnant cancer after resection for peptic ulcer disease. Ann Surg Oncol. 2011;18(3):670–6.
Article PubMed Google Scholar
Inomata M, Shiraishi N, Adachi Y, Yasuda K, Aramaki M, Kitano S. Gastric remnant cancer compared with primary proximal gastric cancer. Hepatogastroenterology. 2003;50(50):587–91.
PubMed Google Scholar
Chen CN, Lee WJ, Lee PH, Chang KJ, Chen KM. Clinicopathologic characteristics and prognosis of gastric stump cancer. J Clin Gastroenterol. 1996;23(4):251–5.
Article CAS PubMed Google Scholar
Santoro R, Ettorre GM, Santoro E. Subtotal gastrectomy for gastric cancer. World J Gastroenterol. 2014;20(38):13667–80.
Article PubMed PubMed Central Google Scholar
Ikeguchi M, Kondou A, Shibata S, Yamashiro H, Tsujitani S, Maeta M, Kaibara N. Clinicopathologic differences between carcinoma in the gastric remnant stump after distal partial gastrectomy for benign gastroduodenal lesions and primary carcinoma in the upper third of the stomach. Cancer. 1994;73(1):15–21.
Article CAS PubMed Google Scholar
Hanyu T, Wakai A, Ishikawa T, Ichikawa H, Kameyama H, Wakai T. Carcinoma in the remnant stomach during long-term follow-up after distal gastrectomy for gastric cancer: analysis of cumulative incidence and associated risk factors. World J Surg. 2018;42(3):782–7.
Article PubMed Google Scholar
Shukla A, Kalayarasan R, Gnanasekaran S, Pottakkat B. Appraisal of gastric stump carcinoma and current state of affairs. World J Clin Cases. 2023;11(13):2864–73.
Article PubMed PubMed Central Google Scholar
Tokunaga M, Sano T, Ohyama S, Hiki N, Fukunaga T, Yamada K, Yamaguchi T. Clinicopathological characteristics and survival difference between gastric stump carcinoma and primary upper third gastric cancer. J Gastrointest Surg. 2013;17(2):313–8.
Article PubMed Google Scholar
Wang H, Qi H, Liu X, Gao Z, Hidasa I, Aikebaier A, Li K. Positive lymph node ratio is an index in predicting prognosis for remnant gastric cancer with insufficient retrieved lymph node in R0 resection. Sci Rep. 2021;11(1):2022.
Article CAS PubMed PubMed Central Google Scholar
Shimada H, Fukagawa T, Haga Y, Oba K. Does remnant gastric cancer really differ from primary gastric cancer? A systematic review of the literature by the Task Force of Japanese Gastric Cancer Association. Gastric Cancer. 2016;19(2):339–49.
Article PubMed Google Scholar
Han SL, Hua YW, Wang CH, Ji SQ, Zhuang J. Metastatic pattern of lymph node and surgery for gastric stump cancer. J Surg Oncol. 2003;82(4):241–6.
Article PubMed Google Scholar
Tanigawa N, Nomura E, Lee SW, Kaminishi M, Sugiyama M, Aikou T, Kitajima M. Current state of gastric stump carcinoma in Japan: based on the results of a nationwide survey. World J Surg. 2010;34(7):1540–7.
Article PubMed PubMed Central Google Scholar
Tran TB, Hatzaras I, Worhunsky DJ, Vitiello GA, Squires MH 3rd, Jin LX, Spolverato G, Votanopoulos KI, Schmidt C, Weber S, et al. Gastric remnant cancer: a distinct entity or simply another proximal gastric cancer? J Surg Oncol. 2015;112(8):877–82.
Article PubMed Google Scholar
Hu X, Tian DY, Cao L, Yu Y. Progression and prognosis of gastric stump cancer. J Surg Oncol. 2009;100(6):472–6.
Article PubMed Google Scholar
An JY, Youn HG, Ha TK, Choi MG, Kim KM, Noh JH, Sohn TS, Kim S. Clinical significance of tumor location in remnant gastric cancers developed after partial gastrectomy for primary gastric cancer. J Gastrointest Surg. 2008;12(4):689–94.
Article PubMed Google Scholar
Namikawa T, Kitagawa H, Iwabu J, Okabayashi T, Kobayashi M, Hanazaki K. Tumors arising at previous anastomotic site may have poor prognosis in patients with gastric stump cancer following gastrectomy. J Gastrointest Surg. 2010;14(12):1923–30.
Article PubMed Google Scholar
Zhang DW, Dong B, Li Z, Dai DQ. Clinicopathologic features of remnant gastric cancer over time following distal gastrectomy. World J Gastroenterol. 2015;21(19):5972–8.
Article PubMed PubMed Central Google Scholar
Lee SB, Kim JH, Kim DH, Jeon TY, Kim DH, Kim GH, Park DY. Clinicopathological characteristics and prognosis of remnant gastric cancer. J Gastric Cancer. 2010;10(4):219–25.
Article PubMed PubMed Central Google Scholar
Ojima T, Iwahashi M, Nakamori M, Nakamura M, Naka T, Katsuda M, Iida T, Tsuji T, Hayata K, Takifuji K, et al. Clinicopathological characteristics of remnant gastric cancer after a distal gastrectomy. J Gastrointest Surg. 2010;14(2):277–81.
Article PubMed Google Scholar
Firat O, Guler A, Sozbilen M, Ersin S, Kaplan H. Gastric remnant cancer: an old problem with novel concerns. Langenbecks Arch Surg. 2009;394(1):93–7.
Article PubMed Google Scholar
Subasi O, Bel O, Manzano J, Barker KJA. The landscape of modern machine learning: a review of machine, distributed and federated learning. 2023, abs/2312.03120. https://doi.org/10.48550/arXiv.2312.03120.
Li S, Yi H, Leng Q, Wu Y, Mao Y. New perspectives on cancer clinical research in the era of big data and machine learning. Surg Oncol. 2024;52:102009.
Article PubMed Google Scholar
Gross AJ, Pisano CE, Khunsriraksakul C, Spratt DE, Park HS, Sun Y, Wang M, Zaorsky NG. Real-World Data: Applications and relevance to cancer clinical trials. Semin Radiat Oncol. 2023;33(4):374–85.
Article PubMed Google Scholar
Capobianco E. High-dimensional role of AI and machine learning in cancer research. Br J Cancer. 2022;126(4):523–32.
Article PubMed PubMed Central Google Scholar
Lotter W, Hassett MJ, Schultz N, Kehl KL, Van Allen EM, Cerami E. Artificial intelligence in oncology: current landscape, challenges, and future directions. Cancer Discov. 2024:Of1-of16. https://doi.org/10.1158/2159-8290.
Zhu L, Pan J, Mou W, Deng L, Zhu Y, Wang Y, Pareek G, Hyams E, Carneiro BA, Hadfield MJ, et al. Harnessing artificial intelligence for prostate cancer management. Cell Rep Med. 2024;5:101506.
Article PubMed PubMed Central Google Scholar
Wang H, Zhang C, Li Q, Tian T, Huang R, Qiu J, Tian R. Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches. BMC Cancer. 2024;24(1):427.
Article CAS PubMed PubMed Central Google Scholar
Obermeyer Z, Emanuel EJ. Predicting the future - big data, machine learning, and clinical medicine. N Engl J Med. 2016;375(13):1216–9.
Article PubMed PubMed Central Google Scholar
Peng ZH, Tian JH, Chen BH, Zhou HB, Bi H, He MX, Li MR, Zheng XY, Wang YW, Chong T, et al. Development of machine learning prognostic models for overall survival of prostate cancer patients with lymph node-positive. Sci Rep. 2023;13(1):18424.
Article CAS PubMed PubMed Central Google Scholar
Karabacak M, Jagtiani P, Carrasquilla A, Germano IM, Margetis K. Prognosis individualized: survival predictions for WHO grade II and III gliomas with a machine learning-based web application. NPJ Digit Med. 2023;6(1):200.
Article PubMed PubMed Central Google Scholar
Ji L, Zhang W, Huang J, Tian J, Zhong X, Luo J, Zhu S, He Z, Tong Y, Meng X, et al. Bone metastasis risk and prognosis assessment models for kidney cancer based on machine learning. Front Public Health. 2022;10:1015952.
Article PubMed PubMed Central Google Scholar
Kuwayama N, Hoshino I, Mori Y, Yokota H, Iwatate Y, Uno T. Applying artificial intelligence using routine clinical data for preoperative diagnosis and prognosis evaluation of gastric cancer. Oncol Lett. 2023;26(5):499.
Article CAS PubMed PubMed Central Google Scholar
Rodríguez-Pérez R, Bajorath J. Interpretation of compound activity predictions from complex machine learning models using local approximations and shapley values. J Med Chem. 2020;63(16):8761–77.
Article PubMed Google Scholar
Nohara Y, Matsumoto K, Soejima H, Nakashima N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed. 2022;214:106584.
Article PubMed Google Scholar
Yang B, Liu T, Cui H, Lu Z, Fang G, Xue X, Luo T. The value of lymph nodes ratios in the prognosis of resectable remnant gastric cancer through the retrospective propensity score matching analysis. World J Surg Oncol. 2023;21(1):245.
Article PubMed PubMed Central Google Scholar
O’Sullivan B, Brierley J, Byrd D, Bosman F, Kehoe S, Kossary C, Pineros M, Van Eycken E, Weir HK, Gospodarowicz M. The TNM classification of malignant tumours-towards common understanding and reasonable expectations. Lancet Oncol. 2017;18(7):849–51.
Article PubMed PubMed Central Google Scholar
Bania RK, Halder A. R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data. Comput Methods Programs Biomed. 2020;184: 105122.
Article PubMed Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.
Google Scholar
Lundberg SM, Lee S-I. Proceedings of the 31st International conference on neural information processing systems. In: A unified approach to interpreting model predictions. Long Beach: Curran Associates Inc; 2017. p. 4768–77.
Google Scholar
Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S-I: From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence 2020;2(1):56-67.
Tanaka T. [Fundamentals] 5. Python+scikit-learn for machine learning in medical imaging. Nihon Hoshasen Gijutsu Gakkai Zasshi. 2023;79(10):1189–93.
Article PubMed Google Scholar
Colmenarejo G. Machine learning models to predict childhood and adolescent obesity: a review. Nutrients. 2020;12(8):2466.
Article PubMed PubMed Central Google Scholar
Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565–74.
Article PubMed PubMed Central Google Scholar
Vickers AJ, Cronin AM, Elkin EB, Gonen M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. 2008;8:53.
Article PubMed PubMed Central Google Scholar
Abraham A, Pedregosa F, Eickenberg M, Gervais P, Mueller A, Kossaifi J, Gramfort A, Thirion B, Varoquaux G. Machine learning for neuroimaging with scikit-learn. Front Neuroinform. 2014;8:14.
Article PubMed PubMed Central Google Scholar
Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, Liston DE, Low DK, Newman SF, Kim J, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2(10):749–60.
Article PubMed PubMed Central Google Scholar
Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S-I. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67.
Article PubMed PubMed Central Google Scholar
Dhir M. Gastric Remnant Cancer: Is it different from primary gastric cancer? Insights into a unique clinical entity. Ann Surg Oncol. 2020;27(11):4079–81.
Article PubMed Google Scholar
Wang SH, Zhang JC, Zhu L, Li H, Hu KW. Does gastric stump cancer really differ from primary proximal gastric cancer? A multicentre, propensity score matching-used, retrospective cohort study. World J Gastrointest Surg. 2023;15(11):2553–63.
Article PubMed PubMed Central Google Scholar
Schaefer N, Sinning C, Standop J, Overhaus M, Hirner A, Wolff M. Treatment and prognosis of gastric stump carcinoma in comparison with primary proximal gastric cancer. Am J Surg. 2007;194(1):63–7.
Article PubMed Google Scholar
Galata C, Ronellenfitsch U, Weiß C, Blank S, Reißfelder C, Hardt J. Surgery for gastric remnant cancer results in similar overall survival rates compared with primary gastric cancer: a propensity score-matched analysis. Ann Surg Oncol. 2020;27(11):4196–203.
Article PubMed PubMed Central Google Scholar
Ramos M, Pereira MA, Dias AR, Dantas ACB, Szor DJ, Ribeiro U Jr, Zilberstein B, Cecconello I. Remnant gastric cancer: an ordinary primary adenocarcinoma or a tumor with its own pattern? World J Gastrointest Surg. 2021;13(4):366–78.
Article PubMed PubMed Central Google Scholar
Song XH, Liu K, Sun LF, Chen XL, Zhao LY, Zhang WH, Chen XZ, Yang K, Zhang B, Chen ZX, et al. Clinicopathological characteristics and prognostic factors of remnant gastric cancer: a single-center retrospective analysis of 90 patients. Int J Surg. 2018;51:97–103.
Article PubMed Google Scholar
Liao G, Wen S, Xie X, Wu Q. Laparoscopic gastrectomy for remnant gastric cancer: risk factors associated with conversion and a systematic analysis of literature. Int J Surg. 2016;34:17–22.
Article PubMed Google Scholar
Ubøe AAS, Våge C, Mjønes P, Bringeland EA, Fossmark R. Gastric remnant cancer and long-term survival in Central Norway 2001 to 2016 - a population-based study. Surg Oncol. 2023;51:102008.
Article PubMed Google Scholar
An JY, Choi MG, Noh JH, Sohn TS, Kim S. The outcome of patients with remnant primary gastric cancer compared with those having upper one-third gastric cancer. Am J Surg. 2007;194(2):143–7.
Article CAS PubMed Google Scholar
Di Leo A, Pedrazzani C, Bencivenga M, Coniglio A, Rosa F, Morgani P, Marrelli D, Marchet A, Cozzaglio L, Giacopuzzi S, et al. Gastric stump cancer after distal gastrectomy for benign disease: clinicopathological features and surgical outcomes. Ann Surg Oncol. 2014;21(8):2594–600.
Article PubMed Google Scholar
Sowa M, Kato Y, Onoda N, Kubo T, Maekawa H, Yoshikawa K, Nishimura M, Nakanishi I, Chung YS. Early cancer of the gastric remnant with special reference to the importance of follow-up of gastrectomized patients. Eur J Surg Oncol. 1993;19(1):43–9.
CAS PubMed Google Scholar
Nakagawa M, Choi YY, An JY, Hong JH, Kim JW, Kim HI, Cheong JH, Hyung WJ, Choi SH, Noh SH. Staging for remnant gastric cancer: the metastatic lymph node ratio vs. the UICC 7th Edition System. Ann Surg Oncol. 2016;23(13):4322–31.
Article PubMed Google Scholar
Hayashi M, Fujita T, Matsushita H. Evaluating the optimal treatment strategy for early and advanced remnant gastric cancer. ANZ J Surg. 2022;92(11):2907–14.
Article PubMed Google Scholar
Matsuo K, Lee SW, Tanaka R, Imai Y, Honda K, Taniguchi K, Tomiyama H, Uchiyama K. T stage and venous invasion are crucial prognostic factors for long-term survival of patients with remnant gastric cancer: a cohort study. World J Surg Oncol. 2021;19(1):291.
Article PubMed PubMed Central Google Scholar
Sun B, Zhang H, Wang J, Cai H, Xuan Y, Xu D. Tumor location causes different recurrence patterns in remnant gastric cancer. J Gastric Cancer. 2022;22(4):369–80.
Article PubMed PubMed Central Google Scholar
Takahashi M, Takeuchi H, Tsuwano S, Nakamura R, Takahashi T, Wada N, Kawakubo H, Saikawa Y, Kitagawa Y. Surgical resection of remnant gastric cancer following distal gastrectomy: a retrospective clinicopathological Study. Ann Surg Oncol. 2016;23(2):511–21.
Article PubMed Google Scholar
Irino T, Hiki N, Ohashi M, Nunobe S, Tokunaga M, Sano T, Yamaguchi T. Characteristics of gastric stump cancer: a single hospital retrospective analysis of 262 patients. Surgery. 2016;159(6):1539–47.
Article PubMed Google Scholar

Download references

Acknowledgements

We appreciate the collaboration and discussions with our co-authors, and we thank the funding agency for their support. We appreciate the valuable comments and suggestions from the anonymous reviewers.

Funding

This work was sponsored by National Clinical Key Specialty Construction Program and Key Clinical Specialty Discipline Construction Program of Fujian, China. This study was supported by grants from the National Clinical Key Specialty Construction Program; Fujian Provincial Clinical Research Center for Cancer Radiotherapy and Immunotherapy (grant number: 2020Y2012). This work was also sponsored by Joint Funds for the innovation of science and Technology, Fujian province (Grant number: 2021Y9204).

Author information

Zhouwei Zhan, Bijuan Chen and Hui Cheng contributed equally to this work and share first authorship.
Songchang Shi, Zengqing Guo and Lihui Zhang contributed equally to this work and share last authorship.

Authors and Affiliations

Department of Medical Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, No. 420 Fuma Road, Fuzhou, Fujian, 350014, People’s Republic of China
Zhouwei Zhan, Sijing Zhou, Hanchen Zheng & Zengqing Guo
Department of Radiation Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, Fujian, 350014, People’s Republic of China
Bijuan Chen
Department of Pathology, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, Fuzhou, Fujian, 350001, People’s Republic of China
Hui Cheng
Department of Hepatobiliary and Pancreatic Surgery, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, Fujian, 350014, People’s Republic of China
Shaohua Xu
Department of Pharmacy, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, Fujian, 350014, People’s Republic of China
Chunping Huang
School of Basic Medical Sciences of Fujian Medical University, Fuzhou, Fujian, 350004, People’s Republic of China
Haiting Chen, Xuanping Lin, Ruyu Lin, Wanting Huang, Xiaohuan Ma, Yu Fu & Zhipeng Chen
Department of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital South Branch, Fujian Provincial Hospital, Fuzhou, Fujian, 350001, People’s Republic of China
Songchang Shi & Lihui Zhang

Authors

Zhouwei Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Bijuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hui Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Shaohua Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chunping Huang
View author publications
You can also search for this author in PubMed Google Scholar
Sijing Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Haiting Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xuanping Lin
View author publications
You can also search for this author in PubMed Google Scholar
Ruyu Lin
View author publications
You can also search for this author in PubMed Google Scholar
Wanting Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohuan Ma
View author publications
You can also search for this author in PubMed Google Scholar
Yu Fu
View author publications
You can also search for this author in PubMed Google Scholar
Zhipeng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hanchen Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Songchang Shi
View author publications
You can also search for this author in PubMed Google Scholar
Zengqing Guo
View author publications
You can also search for this author in PubMed Google Scholar
Lihui Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.Z., B.C., and L.Z. were involved in conceptualization, formal analysis, writing the initial draft, and project management. S.S. and H.C. conducted formal analysis and developed the methodology. L.P., H.C., X.L., R.L., W.H., X.M., Y.F., Z.C., S.Z., S.X., and H.Z. gathered data. C.H., Z.G. and S.S. carried out critical revision of the manuscript. L.Z., Z.G., and S.S. act as guarantors, possess full access to all study data, and are responsible for data integrity and accuracy of analysis. Z.Z. provided financial support for the study.

Corresponding authors

Correspondence to Songchang Shi, Zengqing Guo or Lihui Zhang.

Ethics declarations

Ethics approval and consent to participate

Studies involving human participants were reviewed and granted approval by the ethics committee of Fujian Cancer Hospital and Fujian Provincial Hospital, Fuzhou, People’s Republic of China. The research adhered to the Declaration of Helsinki. Informed written consent was obtained from participants prior to study participation.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Supporting Information 1. Correlation Matrix of different variables.

Additional file 2:

Supporting Information 2. Model Evaluation. ROC Curves for Test and Training Sets.

Additional file 3:

Supporting Table 3. Metrics and scoring for quantifying the Performance Quality of Risk Models on Test Set

Additional file 4:

Supporting Table 4. Other metrics and scoring for quantifying the quality of risk models

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Zhan, Z., Chen, B., Cheng, H. et al. Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study. BMC Cancer 24, 547 (2024). https://doi.org/10.1186/s12885-024-12303-9

Download citation

Received: 16 January 2024
Accepted: 22 April 2024
Published: 30 April 2024
DOI: https://doi.org/10.1186/s12885-024-12303-9

Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study

Abstract

Objective

Methods

Results

Conclusions

Introduction

Data and methods

Patients

Data collection

Study outcomes

Feature selection and data preprocessing

Model development

Model performance evaluation

Model interpretation

Statistical analysis

Result

Clinicopathological features of RGC

Feature variable selection

Model performance

Visualization and explanation of models

Discussion

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1:

Additional file 2:

Additional file 3:

Additional file 4:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Cancer

Contact us