- Research article
- Open Access
- Open Peer Review
A nomogram for determining the disease-specific survival in Ewing sarcoma: a population study
BMC Cancervolume 19, Article number: 667 (2019)
We aimed to develop and validate a nomogram for predicting the disease-specific survival of Ewing sarcoma (ES) patients.
The Surveillance, Epidemiology, and End Results (SEER) program database was used to identify ES from 1990 to 2015, in which the data was extracted from 18 registries in the US. Multivariate analysis performed using Cox proportional hazards regression models was performed on the training set to identify independent prognostic factors and construct a nomogram for the prediction of the 3-, 5-, and 10-year survival rates of patients with ES. The predictive values were compared by using concordance indexes (C-indexes), calibration plots, integrated discrimination improvement (IDI), net reclassification improvement (NRI), and decision curve analysis (DCA).
A total of 2,643 patients were identified. After multivariate Cox regression, a nomogram was established based on a new model containing the predictive variables of age, race, extent of disease, tumor size, and therapy of surgery. The new model provided better C-indexes (0.684 and 0.704 in the training and validation cohorts, respectively) than the model without therapy of surgery (0.661 and 0.668 in the training and validation cohorts, respectively). The good discrimination and calibration of the nomogram were demonstrated for both the training and validation cohorts. NRI and IDI were also improved. Finally, DCA demonstrated that the nomogram was clinically useful.
We developed a reliable nomogram for determining the prognosis and treatment outcomes of patients with ES in the US. However, the proposed nomogram still requires external data verification in future applications, especially for regions outside the US.
Ewing sarcoma (ES) is the second most common malignant primary osseous sarcoma in children and adolescents . Bone ES constitutes a family of malignant small round blue cell tumors with neuroectodermal origins, among which 85–90% have the classic t (11; 22) EWS/FLI1 translocation [1, 2]. The overall survival (OS) rate for ES has improved remarkably over the past two decades due to advances in multimodality therapies. In the US, the 5-year survival rates increased from 16% in the 1970s to 39% in the 1990s/early 2000s among patients with metastatic disease. The survival parameter in patients with localized disease increased from 44 to 68% . Despite these improvements, a large proportion of patients with ES still suffer from disease- or treatment-related morbidity or mortality. The early identification of high-risk patients can help provide adjuvant therapies or trial options. Given the clinical uniqueness of ES, prognostic tools are urgently needed to predict survival in ES patients accurately.
Nomograms are reliable and convenient tools for estimating tumor prognosis [4, 5]. In this study, we aimed to establish a comprehensive prognostic evaluation system. The data of ES patients in the Surveillance, Epidemiology, and End Results (SEER) program database registries during 1990–2015 were screened and extracted. We then analyzed the extracted data and subsequently created and validated a nomogram containing significant and reliable variables for quantifying the survival of ES patients.
Data source and inclusion criteria
We queried the SEER program database for ES records from 1990 to 2015 that covers approximately 30% of the US population and includes cases from 18 population-based registries . Utilizing data from the SEER program does not require informed patient consent, and no case-identifying information is provided by the SEER cancer registries.
We searched for patients with ES by using the histological subtype code of “Ewing sarcoma” (9260/3) in the third edition of the International Classification of Diseases for Oncology. The patient demographic variables of interest included age at diagnosis (categorized into ≤30 years old and > 30 years old), sex, race, and marital status (categorized into married, single/domestic partner, or divorced/separated/widowed). A composite socioeconomic status (SES) score corresponding to the percentage of persons in the country living below the national poverty threshold in the official 2000 census  was divided into three levels by using previously reported cutoff points [6, 7], namely, < 10% (low poverty), 10–19.99% (moderate poverty), and ≥ 20% (high poverty). The year of diagnosis (YOD) was categorized into 1990s, 2000s and 2010s. EOD was categorized into confined, local invasion, metastasis, and unknown . The primary site of ES was classified into extremity, axial skeleton, and others. Tumor size was grouped into ≤50 mm (small), > 50 and ≤ 100 mm (intermediate), and > 100 mm (large) . Surgery, radiotherapy, and chemotherapy were categorized into received and not received/unknown. Patients with missing or unknown of survival period were excluded.
Statistical analysis and nomogram construction
The categorical variables are expressed as frequencies and proportions and compared with the chi-square and Fisher’s exact tests. Multivariate analysis was performed by using Cox proportional hazards regression models to determine the factors associated with survival. On the basis of the predictive model with identified prognostic factors, a nomogram was constructed for predicting the 3-, 5-, and 10-year survival rates of ES patients.
Nomogram validation and performance evaluation
The nomogram was validated by measuring the discrimination and calibration curves both internally (training cohort) and externally (validation cohort). Receiver operating characteristic (ROC) curves were generated to evaluate the performance of the nomogram on the basis of the areas under the ROC curves. The agreement between the predicted probability and actual outcome was evaluated via calibration plotting. The nomogram was subjected to bootstrapping validation (1,000 bootstrap resamples) to calculate a relatively corrected concordance index (C-index). The improvement in the predictive accuracy of the models with and without prognostic therapies was estimated by calculating the relative integrated discrimination improvement (IDI) and the net reclassification improvement (NRI), as described by Cook . Finally, we evaluated the clinical usefulness and net benefit of the new predictive models by using decision curve analysis (DCA), as described by Vickers and Elkin .
Statistical analysis was conducted with SPSS (version 24.0; Chicago, IL, USA) and R (version 3.0.1; https://www.r-project.org/) softwares. P values < 0.05 of the two-sided tests were considered statistically significant.
Demographic baseline characteristics
The application of the inclusion and exclusion criteria listed in the Materials and Methods resulted in the identification of 2,643 patients with ES in the SEER program database. The survival period was known for all of the included patients. For nomogram construction and validation, we randomly assigned 70 and 30% of the patients to the training (n = 1,850) and validation (n = 793) cohorts, respectively. The majority of patients were ≤ 30 years old (78.4 and 79.8% in the training and validation cohorts, respectively) and male (58.9 and 62.7%), white (88.1 and 88.8%), had a marital status of single/domestic partner (76.3 and 79.8%), had been diagnosed in the 2000s or 2010s (82.4 and 82.1%), had unknown EOD (67.1 and 67.0%), and had undergone surgery (60.9 and 59.9%) and chemotherapy (91.0 and 93.9%). The clinicopathological characteristics of all patients are listed in Table 1.
Multivariate cox regression analysis results
Multivariate models were developed to identify independent prognostic variables. Sex, marital status, SES score, YOD, primary site, radiotherapy, and chemotherapy were not associated with the significant differences in survival. Thus, age at diagnosis, race, EOD, tumor size, and surgery were subjected to multivariate Cox regression analysis. The multivariate analysis demonstrated that age at diagnosis > 30 years old (adjusted hazard ratio , 2.153; 95% confidence interval [CI], 1.812 to 2.558; p < 0.001), being black (aHR, 1.497; 95% CI, 1.054 to 2.128; p < 0.05), metastasis (aHR, 4.839; 95% CI, 2.780 to 8.424; p < 0.001), unknown EOD (aHR, 2.127; 95% CI, 1.246 to 3.632; p < 0.01), tumor size > 50 and ≤ 100 mm (aHR, 1.469; 95% CI, 1.105 to 1.953; p < 0.01), tumor size > 100 mm (aHR, 2.273; 95% CI, 1.755 to 2.945; p < 0.001), and nonsurgical treatment (aHR, 1.951; 95% CI, 1.670 to 2.280; p < 0.001) were independent negative predictors of disease-specific survival (DSS). The multivariate analyses of DSS in the training set are listed in Table 2.
The results of the logistic regression model listed in Table 2 were utilized to construct a nomogram (Fig. 1). Each predictor was included in its line according to that scale. The total points on the nomogram were added and then converted into the probability of 3-, 5-, and 10-year survival with guidance from the linear parallel lines. The nomogram showed that metastasis, which had the largest absolute values, was the strongest contributor to the risk of prognosis, followed by age > 30 years old, large tumor (> 100 mm), nonsurgical treatment, local and unknown EOD, being black, intermediate tumor size (> 50 and ≤ 100 mm), and being male. Meanwhile, the protective factors included age ≤ 30 years old, being white, confined tumor, small size tumor (≤50 mm), and surgery performed.
Performance of the nomogram
Based on the C-index analysis of the SEER training cohort, the nomogram provided relatively high C-indexes for the 3-, 5-, and 10-year survivals at 0.721, 0.713, and 0.699, respectively; the corresponding values for the external validation cohort were also high at 0.721, 0.718, and 0.723. These findings indicated that the model had good discriminative ability (Fig. 2).
Validation of the nomogram
The new model for the established nomogram included the following variables that were entered into the multivariate Cox regression analysis: age at diagnosis, race, EOD, tumor size, and surgery. The new model that included therapy of surgery provided better C-indexes (0.684 and 0.704 in the training and validation cohorts, respectively) than that of the model without surgery (0.661 and 0.668). A high C-index indicates good ability to separate patients with different survival outcomes. The calibration curves in Fig. 3 depict the calibration of the new model in terms of the agreement between the predicted probabilities and observed outcomes for 3-, 5-, and 10-year survival.
The NRI values were 0.361 (95% CI, 0.241 to 0.525), 0.481 (95% CI, 0.260 to 0.562), and 0.520 (95% CI, 0.350 to 0.569) for 3, 5, and 10 years of follow-up examinations in the training cohort, respectively. In the validation cohort, the NRI values for 3, 5, and 10 years of follow-up were 0.351 (95% CI, 0.242 to 0.502), 0.458 (95% CI, 0.320 to 0.541), and 0.494 (95% CI, 0.350 to 0.670), respectively. These results showed that the new model exhibited superior predictive performance compared with the model without the therapy of surgery. Similarly, the IDI values for 3, 5, and 10 years of follow-up examinations were 0.026, 0.028, and 0.029 in the training cohort and 0.021, 0.023, and 0.025 in the validation cohort, respectively.
DCA graphically showed the large net benefits of the new model for predicting 3-, 5-, and 10-year survival (Fig. 4) to verify its clinical utilization and impact in practical decision-making.
ES is an rare and aggressive type of malignancy that normally develops in young patients from childhood to early adulthood . ES is the second most common primary malignant bone tumor in people younger than 30 years (second only to osteosarcoma) and the most common primary malignant bone tumor in those younger than 10 years. The annual incidence of ES among Caucasians is less than 3 per 1,000,000 , thereby indicating that data from single-center studies cannot provide adequate sample sizes. Therefore, this study was based on a large-sample database of the SEER program, which initially started with eight registries in 1973 and has continuously added other participating sites over time. At present, the database includes 18 geographically diverse areas representing 26% of the US population with efforts to reflect the racial, economic, and social diversity of the country as a whole [2, 6, 14]. The neoadjuvant chemoradiation treatment of ES began in the early 1990s . To obtain reliable research results, we identified 2,643 patients with ES in the SEER program database from 1990 to 2015. ES mostly occurs in young people. In our study, most of the patients were ≤ 30 years old, accounting for 78.4 and 79.8% in the training and validation cohorts, respectively. Table 1 presents that most of the patients were male, white, had a marital status of single/domestic partner, diagnosed in the 2000s or 2010s, treated with surgery, and treated with chemotherapy; these results were consistent with previous research findings [16,17,18,19]. Although ES has the highest incidence in people under the age of 30 , the prognosis is better for those with a younger age of onset and worse for those with a higher age of onset . Similarly, in our nomogram (Fig. 1), the prognosis of people older than 30 was worse than that of people younger than 30. Regarding the cause of this phenomenon, Lee et al.  and Grevener et al.  found that adult patients received few cases of chemotherapy, and older patients were more likely to have multiple comorbidities, including diabetes, high blood pressure, and secondary cancer, which complicated the situation.
The long-term survival rate of ES for nonmetastatic disease at presentation has improved from 10 to 15% to 60–70% since the early 1990s through the application of multimodality approaches, including surgery, radiotherapy, and neoadjuvant chemotherapy [12, 22, 23]. However, ES exhibits an aggressive behavior that often results in lung metastasis, which is a poor prognostic factor given that only 20% of patients with metastases can survive for a long time [1, 2, 20]. The early identification of high-risk ES patients is helpful in providing adjuvant treatment or trials. Existing clinical staging systems only consider tumor size and histological metastasis. For example, the staging system of the American Joint Committee on Cancer can only estimate the limited clinical risk of ES. Therefore, the use of Cox regression analysis and the developed nomogram provides a comprehensive predictive model that includes not only the system demographics but also the therapy of surgery and other clinical parameters.
A nomogram is a convenient graphical representation of a mathematical model. It provides an intuitive way to combine important factors and predict a specific endpoint. The nomogram is also a reliable tool for quantifying risk and widely used in applied tumor prognoses. A well-developed clinical nomogram is a popular decision-making tool that can be used to predict the outcome of an individual and benefit both clinicians and patients . Nomograms in many studies [2, 16, 17, 20, 25] indicate that being black and aged appear to be high-risk factors. However, small size tumor and surgery treatment demonstrate improved outcomes in DSS for ES. This trend is understandable given the aggressive therapies needed to treat such disease. Patients with metastatic diseases at the initial presentation have worse prognoses than those with confined diseases [2, 13, 18, 25]. Knowledge of these features will be helpful in clinical decisions.
Similar to previous studies [10, 26, 27], we applied IDI and NRI to evaluate whether the newly constructed prognostic model performed well and whether it should be used in clinical practice. Compared with radiotherapy and chemotherapy, surgery is the most effective means of treating ES [2, 18]. The new model containing the therapy of surgery showed good discrimination and calibration, in which both IDI and NRI for 3, 5, and 10 years of follow-up examinations showed improved C-index, as mentioned in the Results section.
Finally, our newly constructed nomogram model included a wide range of clinical risk factors, namely, age at diagnosis, race, EOD, tumor size, and surgery, which were easily available and routinely collected from historical records. Figure 4 shows the results of our DCA, wherein the abscissa and ordinate are the threshold probability and net benefit rate, respectively [28,29,30,31]. To the best of our knowledge, this study is the first to use IDI, NRI, and DCA in the verification of the predictive abilities of nomograms for ES. Thus, the nomogram is helpful to accurately predict the 3-, 5-, and 10-year survivals of ES patients.
First, important prognostic factors, such as tumor markers and the expression of the TP53 gene, were not available in the SEER database. Second, information was not available for some of the cases. Hence, we could only define the subclassifications as unknown, such as for EOD and tumor size. Third, similar to other malignant bone tumors, ES showed unavailable AJCC/TNM data in the SEER database that might have affected the diagnostic and predictive accuracy of our new tool . Finally, rather than representing absolutely accurate prognoses, the predicted values calculated from the nomogram were only suitable for interpretation by clinicians. Future studies can use the present findings to develop a well-accepted risk prediction tool for ES .
Nomograms are an important component of modern medical decision-making. We developed a reliable nomogram for determining the prognosis and treatment outcomes of ES patients in the US. However, external data verification is still required in future applications, especially for regions outside the US.
Availability of data and materials
Limited Use Agreement for Surveillance, Epidemiology, and End Results (SEER) Program (https://seer.cancer.gov) SEER*Stat Database: released in April 2017, based on the November 2016 submission. The data can be used publicly.
Area under the curve
Decision curve analysis
Divorced, separated and widowed
Integrated discrimination improvement
Net reclassification improvement
Surveillance, Epidemiology, and End Results
Year of diagnosis
Rodriguez-Galindo C, Navid F, Liu T, Billups CA, Rao BN, Krasin MJ. Prognostic factors for local and distant control in Ewing sarcoma family of tumors. Ann Oncol. 2008;19(4):814–20.
Arshi A, Sharim J, Park DY, Park HY, Yazdanshenas H, Bernthal NM, Shamie AN. Prognostic determinants and treatment outcomes analysis of osteosarcoma and Ewing sarcoma of the spine. Spine J. 2017;17(5):645–55.
Esiashvili N, Goodman M, Marcus RB Jr. Changes in incidence and survival of Ewing sarcoma patients over the past 3 decades: surveillance epidemiology and end results data. J Pediatr Hematol Oncol. 2008;30(6):425–30.
Lin Z, Yan S, Zhang J, Pan Q. A nomogram for distinction and potential prediction of liver metastasis in breast Cancer patients. J Cancer. 2018;9(12):2098–106.
Balachandran VP, Gonen M, Smith JJ, DeMatteo RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173–80.
Wu J, Sun H, Li J, Guo Y, Zhang K, Lang C, Zou C, Ma H. Increased survival of patients aged 0-29 years with osteosarcoma: a period analysis, 1984-2013. Cancer Med. 2018;7(8):3652–61.
Ma H, Sun H, Sun X. Survival improvement by decade of patients aged 0-14 years with acute lymphoblastic leukemia: a SEER analysis. Sci Rep. 2014;4:4227.
Wang Z, Li S, Li Y, Lin N, Huang X, Liu M, Pan W, Yan X, Sun L, Li H, et al. Prognostic factors for survival among patients with primary bone sarcomas of small bones. Cancer Manag Res. 2018;10:1191–9.
Duchman KR, Gao Y, Miller BJ. Prognostic factors for survival in patients with Ewing's sarcoma using the surveillance, epidemiology, and end results (SEER) program database. Cancer Epidemiol. 2015;39(2):189–95.
Cook NR. Comments on 'Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., statistics in medicine (DOI: 10.1002/sim.2929). Stat Med. 2008;27(2):191–5.
Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Mak. 2006;26(6):565–74.
Paulussen M, Ahrens S, Dunst J, Winkelmann W, Exner GU, Kotz R, Amann G, Dockhorn-Dworniczak B, Harms D, Muller-Weihrich S, et al. Localized Ewing tumor of bone: final results of the cooperative Ewing's sarcoma study CESS 86. J Clin Oncol. 2001;19(6):1818–29.
Krakorova DA, Kubackova K, Dusek L, Tomas T, Janicek P, Tucek S, Prausova J, Kiss I, Zambo I. Advantages in prognosis of adult patients with Ewing sarcoma: 11-years experiences and current treatment management. Pathol Oncol Res. 2018;24(3):623–30.
Mirabello L, Troisi RJ, Savage SA. Osteosarcoma incidence and survival rates from 1973 to 2004: data from the surveillance, epidemiology, and end results program. Cancer. 2009;115(7):1531–43.
Gaspar N, Hawkins DS, Dirksen U, Lewis IJ, Ferrari S, Le Deley MC, Kovar H, Grimer R, Whelan J, Claude L, et al. Ewing sarcoma: current management and future approaches through collaboration. J Clin Oncol. 2015;33(27):3036–46.
Cheung MR. Optimization of predictors of Ewing sarcoma cause-specific survival: a population study. Asian Pac J Cancer Prev. 2014;15(10):4143–5.
Campbell K, Shulman D, Janeway KA, DuBois SG. Comparison of epidemiology, clinical features, and outcomes of patients with reported Ewing sarcoma and PNET over 40 years justifies current WHO classification and treatment approaches. Sarcoma. 2018;2018:1712964.
Wan ZH, Huang ZH, Chen LB. Survival outcome among patients with Ewing’s sarcoma of bones and joints: a population-based cohort study. Sao Paulo Med J. 2018;136(2):116–22.
Lee J, Hoang BH, Ziogas A, Zell JA. Analysis of prognostic factors in Ewing sarcoma using a population-based cancer registry. Cancer. 2010;116(8):1964–73.
Chakraborty D, Rangamani S, Kulothungan V, Chaturvedi M, Stephen S, Das P, Sudarshan KL, Janani Surya R, Sathish Kumar K, John A, et al. Trends in incidence of Ewing sarcoma of bone in India - evidence from the National Cancer Registry Programme (1982-2011). J Bone Oncol. 2018;12:49–53.
Grevener K, Haveman LM, Ranft A, van den Berg H, Jung S, Ladenstein R, Klco-Brosius S, Juergens H, Merks JH, Dirksen U. Management and outcome of Ewing sarcoma of the head and neck. Pediatr Blood Cancer. 2016;63(4):604–10.
Ataergin S, Ozet A, Solchaga L, Turan M, Beyzadeoglu M, Oysul K, Arpaci F, Komurcu S, Surenkok S, Ozturk M. Long-lasting multiagent chemotherapy in adult high-risk Ewing's sarcoma of bone. Med Oncol. 2009;26(3):276–86.
Akagunduz OO, Kamer SA, Kececi B, Demirag B, Oniz H, Kantar M, Cetingul N, Sabah D, Anacak Y. The role of radiotherapy in local control of nonextremity Ewing sarcomas. Tumori. 2016;102(2):162–7.
Liu RZ, Zhao ZR, Ng CS. Statistical modelling for thoracic surgery using a nomogram based on logistic regression. J Thorac Dis. 2016;8(8):E731–6.
Fukushima T, Ogura K, Akiyama T, Takeshita K, Kawai A. Descriptive epidemiology and outcomes of bone sarcomas in adolescent and young adult patients in Japan. BMC Musculoskelet Disord. 2018;19(1):297.
Chen LD, Liang JY, Wu H, Wang Z, Li SR, Li W, Zhang XH, Chen JH, Ye JN, Li X, et al. Multiparametric radiomics improve prediction of lymph node metastasis of rectal cancer compared with conventional radiomics. Life Sci. 2018;208:55–63.
Tan X, Ma Z, Yan L, Ye W, Liu Z, Liang C. Radiomics nomogram outperforms size criteria in discriminating lymph node metastasis in resectable esophageal squamous cell carcinoma. Eur Radiol. 2019;29(1):392–400.
Asuncion Esteve-Pastor M, Miguel Rivera-Caravaca J, Roldan V, Vicente V, Valdes M, Marin F, Lip GYH. Long-term bleeding risk prediction in ‘real world’ patients with atrial fibrillation: comparison of the HAS-BLED and ABC-bleeding risk scores. Thromb Haemost. 2017;117(10):1848–58.
Esteve-Pastor MA, Rivera-Caravaca JM, Roldan V, Vicente V, Valdes M, Marin F, Lip GYH. Long-term bleeding risk prediction in 'real world' patients with atrial fibrillation: comparison of the HAS-BLED and ABC-bleeding risk scores. The Murcia atrial fibrillation project. Thromb Haemost. 2017;117(10):1848–58.
Garcia-Fernandez A, Roldan V, Rivera-Caravaca JM, Hernandez-Romero D, Valdes M, Vicente V, Lip GY, Marin F. Does von Willebrand factor improve the predictive ability of current risk stratification scores in patients with atrial fibrillation? Sci Rep. 2017;7:41565.
Rodrigues G, Gonzalez-Maldonado S, Bauman G, Senan S, Lagerwaard F. A statistical comparison of prognostic index systems for brain metastases after stereotactic radiosurgery or fractionated stereotactic radiation therapy. Clin Oncol (R Coll Radiol). 2013;25(4):227–35.
Zhang WT, Zhang WW, He ZY, Sun JY, Zhang L, Xia Q, Wu SG. Comparison of the effects of local treatment strategies in non-metastatic Ewing sarcoma of bone. Expert Rev Anticancer Ther. 2018;18(5):501–6.
Zeng X, Zhang Y, Kwong JS, Zhang C, Li S, Sun F, Niu Y, Du L. The methodological quality assessment tools for preclinical and clinical studies, systematic review and meta-analysis, and clinical practice guideline: a systematic review. J Evid Based Med. 2015;8(1):2–10.
Duchman KR, Gao Y, Miller BJ. Prognostic factors for survival in patients with high-grade osteosarcoma using the surveillance, epidemiology, and end results (SEER) program database. Cancer Epidemiol. 2015;39(4):593–9.
The authors acknowledge the efforts of the SEER program in the creation of the SEER database.
This study was funded by National Social Science Foundation of China (No.16BGL183), and the Research Fund of Health Bureau of Xi’an (No.QFO1330). None of the funding sources were involved in design of the study, data collection and analysis, interpretation of results, writing of the manuscript, or in the decision to submit the manuscript for publication.
Ethics approval and consent to participate
The SEER program database is publicly available and provides de-identified case data. Owing to the data of SEER program is anonymous and cancer is a reportable disease in every state, the requirement for informed consent was therefore waived by the s in the US .
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.