Online decision tools for personalized survival prediction and treatment optimization in elderly patients with lung squamous cell carcinoma: a retrospective cohort study

Background Despite major advances in cancer therapeutics, the therapeutic options of Lung Squamous Cell Carcinoma (LSCC)-specific remain limited. Furthermore, the current staging system is imperfect for defining a prognosis and guiding treatment due to its simplicity and heterogeneity. We sought to develop prognostic decision tools for individualized survival prediction and treatment optimization in elderly patients with LSCC. Methods Clinical data of 4564 patients (stageIB-IIIB) diagnosed from 2010 to 2015 were extracted from the Surveillance, Epidemiology, and End Results (SEER) database for prognostic nomograms development. The proposed models were externally validated using a separate group consisting of 1299 patients (stage IB-IIIB) diagnosed from 2012–2015 in China. The prognostic performance was measured using the concordance index (C-index), calibration curves, the average time-dependent area under the receiver operator characteristic curves (AUC), and decision curve analysis. Results Eleven candidate prognostic variables were identified by the univariable and multivariable Cox regression analysis. The calibration curves showed satisfactory agreement between the actual and nomogram-estimated Lung Cancer-Specific Survival (LCSS) rates. By calculating the c-indices and average AUC, our nomograms presented a higher prognostic accuracy than the current staging system. Clinical usefulness was revealed by the decision curve analysis. User-friendly online decision tools integrating proposed nomograms were created to estimate survival for patients with different treatment regimens. Conclusions The decision tools for individualized survival prediction and treatment optimization might facilitate clinicians with decision-making, medical teaching, and experimental design. Online tools are expected to be integrated into clinical practice by using the freely available website (https://loyal-brand-611803.framer.app/). Supplementary Information The online version contains supplementary material available at 10.1186/s12885-023-11309-z.


Introduction
Lung cancer remains the leading cause of cancer-related death worldwide, with an estimated 1.79 million deaths per year [1].Approximately 85% of total diagnoses are non-small cell lung cancer (NSCLC), of which lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LSCC) are the most common subtypes.While the ongoing discovery of novel oncogenic mutations and development of new targeted therapies have demonstrated efficacy in prolonging the progression-free survival (PFS) and overall survival (OS) in LUAD patients, therapies aimed at these pathways performed poorly among LSCC [2,3].The advent of immune checkpoint inhibitor (ICI) therapy provides a viable treatment option for LSCC [4].Despite showing significant benefits, immunotherapies were plagued by a relatively low response rate and a high rate of immunotherapy-related adverse effects in some cases.Furthermore, evidence suggests that adoption of ICIs has been linked to significant benefits for younger patients with NSCLC.Survival benefits in the old age group, on the other hand, were less substantial [5].Thus, the current treatment options for elderly patients with advanced LSCC remain limited.For these patients, postoperative cisplatin-based adjuvant therapy still needs to be actively considered.
The pathological staging system is a fundamental cornerstone of postoperative surveillance and clinical management.The current 8 th edition of the staging system for NSCLC includes 3 components: extent of the primary tumor (T), lymph node involvement (N, which is determined by the anatomical regions of metastatic lymph nodes) and distant metastasis (M) [6].Nevertheless, it remains imperfect to rely solely on this staging system for individual survival prediction and treatment decisionmaking.This might be explained for at least two reasons listed as follows: 1. the survival rates differ significantly within the same stage subgroup [7][8][9]; 2. Other potential prognostic factors were omitted, such as grade, the number of metastatic lymph nodes [10][11][12], age [13,14], etc.While several clinical models have been developed to improve risk stratification and survival prediction in NSCLC [15][16][17], few exist for the analysis of optimal postoperative treatment regimens (without adjuvant therapy [AT], adjuvant chemotherapy [ACT], adjuvant chemoradiotherapy [ACRT]) in elderly patients with LSCC.To address this gap, we sought to establish and validate online decision tools for individualized prognosis prediction and postoperative treatment guidance by incorporating the TNM staging system and other critical prognostic factors.We present this manuscript following the TRIPOD reporting checklist.2015).The screening criteria of the validation group were consistent with the training group, the related details are reported in Fig. 1.The outcome of interest in this study was Lung Cancer-Specific Survival (LCSS), which was defined as the interval between the time of diagnosis and lung cancer-related death.Individuals who die of causes other than the specified cause are considered to be censored.Written informed consent was waived due to retrospective anonymous analysis of data.

Prognostic nomograms development
Univariable Cox regression analyses were performed to screen candidate prognostic variables in the training group (n = 4564).Statistically significant variables (P < 0.05) were further analyzed in multivariable Cox regression model.We used restricted cubic splines to flexibly model and visualize the associations between age and cancer-specific survival based on Cox proportional hazard models.The optimal number of knots between 3 and 7 was chosen based on the Akaike Information Criterion (AIC).The AIC value is commonly used to balance model complexity and goodness of fit.A lower AIC value indicates that the model achieves sufficient goodness of fit with fewer parameters.Therefore, model with the smallest AIC value was finally chosen to reduce the risk of overfitting.The continuous variable (age at diagnosis) would be categorized if non-linearity is detected.Survival curves of different groups of adjuvant therapy (without AT, ACT, and ACRT) were plotted within each subgroup stratified by the eighth staging system using the Kaplan-Meier estimates and were compared using the log-rank test.Patients of training groups were also divided into 3 subgroups according to the different treatment regimens.Subsequently, in different subgroups, a backward stepwise selection based on AIC values was performed for significant variables identified in univariable or multivariable analysis.Cox proportional hazards modeling was used for multivariable regression analysis to develop the survival prediction model.

Nomograms validation and evaluation
Internal validation was performed using Bootstrapping with 500 resampling method and external validation was performed using an external validation group.
Discrimination and calibration of the proposed nomograms were evaluated by calculating the concordance index (C-index) and plotting calibration curves in the training groups and corresponding validation groups.Time-dependent receiver operating characteristic (ROC) curves were also plotted [19].The area under the curve (AUC) of time-dependent ROC was calculated every 2 months from the 1 st to the 60 th month.We used decision curve analysis (DCA) to estimate the net benefits at different threshold probabilities to assess the clinical usefulness of the proposed nomograms.C-indices and timedependent ROC curves were also employed to compare the performance between the nomograms and the 8 th TNM staging system.

Online prognostic tools establishment
To facilitate clinicians' usage of the established nomograms, we created a user-friendly website for individualized survival prediction and postoperative treatment optimization in elderly patients with LSCC.

Statistical analysis
The unpaired, χ 2 test or the Fisher exact test was used to assess differences in distributions between the categorical variables studied.The Shapiro-Wilk test was used to assess normality of the continuous data.Based on the test results, appropriate statistical tests (t-test or Mann-Whitney test) were used for comparing the continuous variables.All statistical analyses were performed using R software (version 4.2.0, http:// www.r-proji ect.org/) and SPSS for Windows (version 23.0, IBM, New York, USA).A 2-sided P value < 0.05 was considered significant.

Demographic and clinical characteristics
According to the inclusion and exclusion criteria (details of screening process were shown in Fig. 1), a total of 4564 patients were extracted from the SEER database and included as the training group.In the training group, 2247 deaths were observed during the follow-up time.The median follow-up was 31 months (IQR 9-49).The LCSS rates at 1, 3, and 5 years were 88.1%, 65.4%, and 52.4% (median survival time: 64 months).The external validation group comprised 1299 patients from the First Affiliated Hospital of Soochow University (n = 659) and Jinling Hospital of Nanjing Medical University (n = 640).In the validation group, there were 552 deaths observed during the follow-up time.The median follow-up was 28 months (IQR 10-47).The LCSS rates at 1, 3, and 5 years were 93.1%, 69.8%, and 59.1% (median survival time: 68 months).Detailed information of training group and validation group was illustrated in Table 1.

Development of prognostic nomograms in the training group
In univariable analysis, age, sex, primary site, grade, separate tumor nodules, pleural invasion record, T stage, N stage, Surgery, regional nodes positive, and adjuvant therapy were significantly associated with LCSS in the training group (all P < 0.05, Figure S1).Association between age and cancer-related mortality was depicted using the Restricted Cubic Spline Regression Model (Figure S2).The risk of cancer-specific mortality increased slowly until around 70 years of age at diagnosis and then started to increase rapidly afterward.Interestingly, our findings indicate that younger age may have a protective effect (hazard ratio, HR < 1) until approximately the age of 70.Subsequently, the HR exceeds 1, suggesting an increased risk of cancer-specific mortality.P for nonlinearity < 0.001, which implies a nonlinear relationship between age and cancer-specific mortality.Thus, age was then transformed into categorical variable as follows: 55-64, 65-74, and ≥ 75.According to the consequences of the multivariable analysis, the following variables remained statistically significant: age, sex, grade, pleural invasion record, T stage, N stage, and adjuvant therapy (P < 0.05, Fig. 2).Survival curves were plotted to depict survival differences between groups stratified by different treatment regimens in the training group (Figure S3).Patients receiving certain treatment regimens had significantly better LCSS compared with those without  3.To determine the optimal treatment regimen, use nomogram A, B, and C in sequence to calculate expected LCSS, then compare the 1-year, 3-year, and 5-year LCSS rates between these 3 groups.

Evaluation and validation of the proposed nomograms
As shown in Table 2, the discrimination of the proposed nomograms was superior to that of the TNM staging system in both the training groups (C-indices for LCSS estimates: without AT, 0.735 vs 0.693; ACT, 0.707 vs 0.662; ACRT, 0.718 vs 0.643; all P < 0.05) and validation groups (C-indices for LCSS estimates: without AT, 0.718 vs 0.658; ACT, 0.747 vs 0.705; ACRT, 0.701 vs 0.568; all P < 0.05).Similar results could be observed by calculating the time-dependent area under the receiver operator characteristic curves every 2 months from the 1 st to the 60 th month (Figure S4).The average AUCs of the proposed nomograms were higher than those of the TNM staging system in the training groups (without AT, 0.797 vs 0.745; ACT, 0.727 vs 0.665; ACRT, 0.84 vs 0.764), indicating the superior prognostic ability of these nomograms.Moreover, the calibration curves demonstrated good concordance between the nomogram-predicted and the actual 1-, 3-, and 5-year LCSS probability in the training groups (Figure S5) and validation groups (Figure S6).Decision curve analysis was performed to assess the net benefit of nomogram-assisted decisions at different threshold probabilities, compared with the net benefit of decisions made with the assumption that either all or no patient survive during the follow-up period [20].The decision analysis curves plotted for 1-, 3-, and 5-year LCSS in the training group demonstrated the potential benefits and clinical utility using these nomograms (Figure S7).

Online decision tools
We further created online decision-making tools with two main functions (https:// loyal-brand-611803.framer.app/).For individualized survival prediction, by entering the clinical information of the requested patient into the interface on the left side of the webpage, users could obtain the estimated Kaplan-Meier curve (top right) and the predicted survival probability (bottom right) of this case (Figure S8).The function of treatment optimization is implemented by using nomogram A, B, and C in sequence.Figure 4 displayed clinical information and predicted survival rates (using online decision tools above) of a hypothetical patient diagnosed with LSCC (T3N1M0, Stage III A).Detailed information of patient was shown in the diagram.For this patient, survival benefits (defined as patients receiving certain treatment regimens had significantly higher survival rate compared with those without adjuvant therapy) were observed with ACT, but not with ACRT.

Discussion
Despite significant progress in cancer therapeutics, there remains a lack of LSCC-specific treatment options, particularly for elderly patients with advanced diseases.For these patients, cisplatin-based postoperative adjuvant therapy remains the mainstream treatment option.However, the current TNM staging system is insufficient to serve as a treatment guideline due to its simplicity and heterogeneity.The formulation of individualized treatment regimens should take into account not only pathological stage, but also patient characteristics (e.g., age, comorbidities, individual will, performance status), medical assevssment, quality of resection [21] (grouped as complete, uncertain, and incomplete), and the balance of benefits and hazards [22,23].This large retrospective cohort study contributes significantly by developing and validating prognostic nomograms that incorporate the TNM staging system and other widely assessed prognostic variables for survival estimation.In addition, online decision tools that incorporate established nomograms for treatment optimization are expected to be integrated into clinical practice.
Nomogram A, B, and C were established and validated separately for patients with different treatment regimens.The higher c-indices indicated that these nomograms presented better discriminative capability than the TNM staging system both in the training and validation groups (all P < 0.05, Table 2).Similar superiority was also revealed by calculating time-dependent AUC Fig. 2 The forest plots showing the multivariable Regression Analysis of variables associated with Cancer-Specific Survival.Grade I, well differentiated; Grade II, moderately differentiated; Grade III, poorly differentiated; Grade IV, undifferentiated; PL0, tumor does not completely traverse the elastic layer of pleura; PL1 or PL2, invasion of visceral pleura present; PL3, tumor invades into or through the parietal pleura or chest wall; CT, chemotherapy; CRT, chemoradiotherapy in the training groups (Figure S4).Calibration curves (Figure S5) presented satisfactory consistency between actual observations and predicted CSS (Cancer-Specific Survival) probabilities in the training groups, indicating good repeatability of the nomograms.Similar outcomes (Figure S6) in the external validation groups proved that these nomograms could be widely used despite ethnic and geographical disparities.
A series of studies have demonstrated the feasibility of nomograms to predict survival for NSCLC [24][25][26][27].However, the analysis of their impact on treatment decision-making was largely overlooked.To date, only a few studies have applied nomograms for treatment recommendation.Zhang et al. created nomograms to guide clinicians in choosing the optimal treatment strategy for locoregional nasopharyngeal cancer [28].Jiang et al. developed a survival prediction model to guide individualized treatment recommendations for adjuvant chemotherapy in stage II/III gastric cancer [29].Meanwhile, the application and dissemination of these prediction models have been hindered due to the lack of user-friendly interactive interfaces.The online decision tools created in this study may fill the void to some extent.By intuitively displaying the predicted LCSS of patients with different treatment regimens, the online tools could facilitate clinicians with decision-making, improving patients' comprehension of the disease, medical teaching, experimental design, and so on.We displayed a hypothetical patient diagnosed with stage III A LSCC as example in Fig. 4. By comparing the CSS rates predicted by nomogram A, B, and C, we found that ACT might bring CSS benefits to this patient.However, no survival benefit was observed with ACRT.The ability of decision tools to predict the efficacy of AT is expected to be further used to guide related experimental design.
Several limitations existed in this study due to its retrospective nature, including but not limited to coding errors, selection bias (e.g., patients with missing clinical data were excluded, which may raise potential bias), and the absence of data in the SEER database (quality of surgery, targeted therapy information, immunotherapy information, radiotherapy dose, systemic therapy agents and so on).Moreover, performance status, a crucial variable influencing clinical decision-making, Determine the optimal treatment regimen by comparing the 1-year, 3-year, and 5-year CSS rates between these 3 groups.Grade I, well differentiated; Grade II, moderately differentiated; Grade III, poorly differentiated; Grade IV, undifferentiated; PL0, tumor does not completely traverse the elastic layer of pleura; PL1 or PL2, invasion of visceral pleura present; PL3, tumor invades into or through the parietal pleura or chest wall.To utilize the nomogram, start by drawing a vertical line to the top points row to assign points for each variable.Next, sum up the points from each variable and draw a vertical line from the total points row to determine the 1-year survival, 3-year survival, 5-year survival, and median survival time (in months) is also unavailable in the SEER database.It is worth mentioning that this study did not include patients who only received postoperative adjuvant radiotherapy due to its debated efficacy [22,30,31].The developed nomograms need to be further verified by external data form other centers.Further efforts are warranted to collect prospective data and investigate the possibility of including other prognostic factors to improve the predictive performance.Conducting comparative trials between patients whose treatment decisions were influenced by the online tool and those whose decisions were not could provide more insights into the impact of the tool on clinical decision making.Finally, as stated in the disclaimer of the web tools, these risk prediction tools are the subject of ongoing research and will continue to be refined.Unless under the guidance of relevant researchers or clinicians, we do not recommend patients to independently use the current version of the web-based prediction model to avoid anxiety about the predicted prognosis.

Conclusions
Prognostic nomograms that combine the TNM staging system with other prognostic variables were developed and validated in elderly patients with LSCC.The created online decision tools using these prognostic models are expected to be integrated into clinical practice for CSS estimation and postoperative treatment recommendation.• thorough peer review by experienced researchers in your field

Abbreviations
• rapid publication on acceptance • support for research data, including large and complex data types • gold Open Access which fosters wider collaboration and increased citations maximum visibility for your research: over 100M website views per year

•
At BMC, research is always in progress.

Learn more biomedcentral.com/submissions
Ready to submit your research Ready to submit your research ?Choose BMC and benefit from: ?Choose BMC and benefit from:

Fig. 1
Fig. 1 Flowchart of patient screening and study procedure

Fig. 3
Fig. 3 Nomograms for comparing expected Cancer-Specific Survival (CSS) with different postoperative treatment regimens (Nomogram A, without adjuvant therapy; Nomogram B, adjuvant chemotherapy; Nomogram C, adjuvant chemoradiotherapy).For an individual patient, use nomogram A, B, and C in sequence to calculate expected CSS with different regimens.Determine the optimal treatment regimen by comparing the 1-year, 3-year, and 5-year CSS rates between these 3 groups.Grade I, well differentiated; Grade II, moderately differentiated; Grade III, poorly differentiated; Grade IV, undifferentiated; PL0, tumor does not completely traverse the elastic layer of pleura; PL1 or PL2, invasion of visceral pleura present; PL3, tumor invades into or through the parietal pleura or chest wall.To utilize the nomogram, start by drawing a vertical line to the top points row to assign points for each variable.Next, sum up the points from each variable and draw a vertical line from the total points row to determine the 1-year survival, 3-year survival, 5-year survival, and median survival time (in months)

Fig. 4
Fig.4 Decision tools for treatment optimization by using online nomograms A, B, and C in sequence.Clinicians could compare the 1-year, 3-year, and 5-year CSS rates between these 3 groups to assist in clinical decision-making.CSS, cancer-specific survival; Without AT, without adjuvant therapy; ACT, adjuvant chemotherapy; ACRT, adjuvant chemoradiotherapy

Table 1
Characteristics of training group and validation group adjuvant therapy was defined as survival benefit.For ACT, survival benefit was observed in stage IIA, stage IIB, stage IIIA, and stage IIIB.For ACRT, survival benefit was observed in stage IIIB.

Table 1 (
Grade I, well differentiated; Grade II, moderately differentiated; Grade III, poorly differentiated; Grade IV, undifferentiated; PL0, tumor does not completely traverse the elastic layer of pleura; PL1 or PL2, invasion of visceral pleura present; PL3, tumor invades into or through the parietal pleura or chest wall; SD Standard deviation continued)

Table 2
Prognostic performance of 3 proposed nomograms and the 8 th edition of the UICC/AJCC staging system in the training groups and corresponding validation groups UICC Union for International Cancer Control, AJCC American Joint Committee on Cancer, C-index Concordance index, Without AT Without adjuvant therapy, ACT Adjuvant chemotherapy, ACRT Adjuvant chemoradiotherapy, TNM Tumor, node, and metastasis, p-value indicates the difference in the C-indices