A prediction model for lymph node metastases using pathologic features in patients intraoperatively diagnosed as stage I non-small cell lung cancer

Background There is little information on which pattern should be chosen to perform lymph node dissection for stage I non-small-cell lung cancer. This study aimed to develop a model for predicting lymph node metastasis using pathologic features of patients intraoperatively diagnosed as stage I non-small-cell lung cancer. Methods We collected pathology data from 284 patients intraoperatively diagnosed as stage I non-small-cell lung cancer who underwent lobectomy with complete lymph node dissection from 2013 through 2014, assessing various factors for an association with metastasis to lymph nodes (age, gender, pathology, tumour location, tumour differentiation, tumour size, pleural invasion, bronchus invasion, multicentric invasion and angiolymphatic invasion). After analysing these variables, we developed a multivariable logistic model to estimate risk of metastasis to lymph nodes. Results Univariate logistic regression identified tumour size >2.65 cm (p < 0.001), tumour differentiation (p < 0.001), pleural invasion (p = 0.034) and bronchus invasion (p < 0.001) to be risk factors significantly associated with the presence of metastatic lymph nodes. On multivariable analysis, only tumour size >2.65 cm (p < 0.001), tumour differentiation (p = 0.006) and bronchus invasion (p = 0.017) were independent predictors for lymph node metastasis. We developed a model based on these three pathologic factors that determined that the risk of metastasis ranged from 3% to 44% for patients intraoperatively diagnosed as stage I non-small-cell lung cancer. By applying the model, we found that the values ŷ > 0.80, 0.43 < ŷ ≤ 0.80, ŷ ≤ 0.43 plus tumour size >2 cm and ŷ ≤0.43 plus tumour size ≤2 cm yielded positive lymph node metastasis predictive values of 44%, 18%, 14% and 0%, respectively. Conclusions A non-invasive prediction model including tumour size, tumour differentiation and bronchus invasion may be useful to give thoracic surgeons recommendations on lymph node dissection for patients intraoperatively diagnosed as Stage I non-small cell lung cancer. Electronic supplementary material The online version of this article (doi:10.1186/s12885-017-3273-x) contains supplementary material, which is available to authorized users.


Background
Lung cancer is the leading cause of cancer death worldwide [1] and metastasis to lymph nodes directly determines the stage and prognosis of this disease. Computed tomography (CT) remains the most widely used tool for assessment of the tumour and lymph node involvement in patients with early-stage non-small-cell lung cancer (NSCLC) [2][3][4][5]. In general, lymph nodes with short-axis diameters of >1 cm seen on CT scan are considered metastatic. Unfortunately, the accuracy of CT scan for preoperative lymph node stage is only 45%-79% [2][3][4][5][6]. In addition, studies have demonstrated that 12%-17% of patients histologically confirmed as N2 are preoperatively diagnosed as N0 because their CT scan results showed the involved lymph nodes to have short-axis diameters of <1 cm [4,5,7]. Many other methods of preoperative N-staging, e.g. positron emission tomography, mediastinoscopy and endoscopic ultrasound-guided fine-needle aspiration, are not routinely used for patients with clinical stage I disease. In addition, these methods yield a considerable number of false-negative results [8][9][10].
There is ample high-quality evidence on the advantages of lymph node dissection in lung cancer surgery, including the American College of Surgeons Oncology Group (ACOSOG) Z0030 trial [11], although the benefits of complete lymph node dissection for patients with stage I NSCLC are still controversial [12][13][14]. There is little information on which pattern should be chosen to perform lymph node dissection for patients intraoperatively diagnosed as stage I non-small-cell lung cancer. A non-invasive prediction model that is able to predict lymph node metastasis would allow surgeons to make appropriate decisions on the extent of the dissection, removing lymph nodes that are most likely to contain metastases, while avoiding unnecessary tissue damage in order to accelerate patients' postoperative recovery.
The goal of this study was to identify risk factors that would predict differences in lymph node metastasis and to develop a scoring system to predict the presence of lymph node metastasis. The aim is to determine the appropriate pattern of lymph node dissection for various patients intraoperatively diagnosed as stage I NSCLC.

Patient selection
A total of 284 consecutive patients who underwent surgical resection for primary lung cancer at our hospital from January 2013 to December 2014 were reviewed retrospectively. The records of patients intraoperatively diagnosed as stage I NSCLC who underwent lobectomy with complete lymph node dissection according to the lymph node nomenclature were selected for this study. All patients met the criteria for stage I NSCLC based on the new International Staging System for NSCLC (National Comprehensive Cancer Network (NCCN) Guidelines Version 3.2014: Staging Non-Small Cell Lung Cancer) [15]. We excluded patients from this study who met any one of the following conditions: 1) tumour size > 4 cm and lymph node > 1 cm at the largest diameter on CT imaging or evidence of distant metastasis; 2) preoperative chemotherapy or radiotherapy; 3) previous or coexistent tuberculosis or malignant disease; 4) complete lymph node dissection that did not meet the current standards (i.e. all lymph node stations, including right-hand stations 2-4 and 7-9 and lefthand stations 2-9); 5) pure ground-glass opacity on CT imaging; 6) synchronous lung cancers, 7) sublobar resection, segmentectomy or partial resection or 8) Intraoperative frozen rapid pathological results showed tumour size > 4 cm in the largest diameter.
Patients were preoperatively assessed with chest x-ray, chest and upper abdominal CT scan, brain magnetic resonance imaging and bone scintigraphy. CT scan was used for preoperative N-staging. The surgical approach for primary lung cancer resection was via video-assisted thoracic surgery.

Statistical analysis
The baseline patient characteristics were summarized in percentages for categorical variables and as mean ± SD (Standard Deviation) for continuous variables. The chisquare test and Fisher's exact tests were used to analyse differences in these percentages between the groups. Differences between the groups were analysed using the Kruskal-Wallis test. Significance of associations with the outcome of nodal metastases was first evaluated using a univariate logistic analysis. Those significant variables were analysed by multivariable analysis as independent predictors for lymph node metastasis. Odds ratios (ORs) with 95% confidence intervals (CIs) were calculated. Clinically relevant variables obtained by multivariable analysis were included in the multivariable model. The resulting model coefficients were applied to the cohort to calculate predicted values from the logistic equation: ŷ = 1/[1 + exp. (−xβ)]. All confidence intervals, significance tests and resulting P values were two-sided, with an alpha level of 0.05. Statistical analyses were performed using STATA software, release 13.
Lymph node metastases were not found in 215 patients (group I) but were present in 69 (group II) ( Table 2). The characteristics in these two groups were compared in terms of age, gender, pathology, tumour  To evaluate the predictive value of tumour size between the groups, we used Receiver Operating Characteristic (ROC) curve analysis. As shown in Fig. 1, the area under the ROC curve for tumour size between group I and group II was 0.691 (95% CI: 0.621-0.761; P < 0.001); the optimal cut-off value was 2.650 cm (sensitivity: 67%; specificity: 70%; Youden's index: 0.364).
The probabilities of lymph node metastasis were calculated using the following formula (ŷ = 1/  Model performance and selecting cut-off values to discriminate patients with lymph node metastasis As shown in Fig. 2, the area under the ROC curve of the selected model was 0.753 (95% CI 0.692-0.814, standard error 0.031) and the optimal cut-off value was 0.7997 ≈ 0.80 (sensitivity: 71%, specificity: 71%, Youden's index: 0.417). In all patients, using a score threshold of ≤0.80, 20 (12%) of 172 patients with lymph node metastasis were correctly identified, whereas 152 (88%) of 172 without lymph node metastasis were correctly identified. Using a score threshold of >0.80, 49 (44%) of 112 patients with lymph node metastasis were correctly identified, whereas 63 (56%) of 112 without lymph node metastasis were correctly identified.
When all three covariates (tumour size, tumour differentiation, bronchus invasion) were equal to zero, we found that the cut-off value was 0.42685 ≈ 0.43. In all patients, using a score threshold of ≤0.43, 2 (3%) of 71 patients with lymph node metastasis were correctly identified, whereas 69 (97%) of 71 without lymph node metastasis were correctly identified. Using a score threshold of >0.43, 67 (31%) of 213 patients with lymph node metastasis were correctly identified, whereas 146 (69%) of 213 without lymph node metastasis were correctly identified.

Discussion
A complete lymph node dissection, removing all ipsilateral lymph nodes which can be seen at operation [16], can provide more accurate pathologic staging and better clinical outcomes for some patients. It is considered a standard surgical treatment for patients diagnosed preoperatively with lymph node metastases. However, complete lymph node dissection is not regarded as a routine surgical procedure for patients intraoperatively diagnosed as stage I NSCLC, as some studies have demonstrated a lack of significant differences in outcome between selective lymph node sampling and complete lymph node dissection in patients with earlystage lung cancer [13,17].
However each patient exhibits different clinical characteristics that affect the risk of lymph node metastasis in early-stage lung cancer. In this study, we collected   pathology data from 284 patients intraoperatively diagnosed as stage I NSCLC who underwent lobectomy with complete lymph node dissection and investigated factors that might be associated with metastasis to lymph nodes (age, gender, pathology, tumour location, tumour differentiation, tumour size, pleural invasion, bronchus invasion, multicentric invasion and angiolymphatic invasion). First, we used univariate analysis to find associations between pathologic factors and lymph node metastasis. The results showed that only the tumour size (>2.65 cm), tumour differentiation, pleural invasion and bronchus invasion were significant risk factors. The other factors tested, including age, gender, pathologic type, tumour location, multicentric invasion, angiolymphatic invasion and neural invasion were excluded as risk factors associated with lymph node metastasis.
Furthermore, multivariate analysis of the four risk factors identified on univariate analysis found that only tumour size (>2.65 cm), tumour differentiation and bronchus invasion were independent predictors of lymph node metastasis. Pleural invasion was excluded as an independent predictor in this analysis.
These three independent predictors were kept in the final model. After developing the multivariable logistic regression model, we finally obtained three score thresholds, ŷ ≤0.43, 0.43 < ŷ ≤ 0.80 and ŷ > 0.80 (Table 6). As shown  Thus we demonstrated that lymph node dissection is not necessary for those patients intraoperatively diagnosed as stage I NSCLC whose ŷ value obtained from the model is less than or equal to 0.43 and whose tumour size is ≤2 cm. Complete lymph node dissection or lymph node sampling would be appropriate if the ŷ value from the model is less than or equal to 0.43 but the tumour size is >2 cm or if ŷ is more than 0.43 and less than or equal to 0.80. Complete lymph node dissection must be performed for patients whose ŷ value obtained from the model is more than 0.80.
However, our study has some limitations. This study was conducted at a single institution with retrospective methods and demonstrated the necessity of further prospective study. Further prospective study with multicenter trial should be performed to comprehensively evaluate this model for prediction of lymph node metastases in patients intraoperatively diagnosed as Stage I non-small cell lung cancer.

Conclusions
After a comprehensive analysis of our results concerning various clinical factors, we conclude that the incidence of lymph node metastasis would be lowest when we obtained a ŷ value from the model less than or equal to 0.43 along with a tumour size ≤2 cm. For other patients intraoperatively diagnosed as stage I NSCLC, the risk of lymph node lymph node metastasis was greater, so that and complete lymph node dissection or lymph node sampling is necessary.