
Table 6 Comparisons of linear regression models, with and without the outlier, of the natural logarithm of the rate of stomach cancer on the main explanatory variables. The difference between the residual sum of squares (RSS) before and after each explanatory variable was added to the regression (ΔRSS) was divided by RSS and multiplied by the error df to yield F, whose numerator df was 1 and whose denominator df was the error df.

From: Different regression equations relate age to the incidence of Lauren types 1 and 2 stomach cancer in the SEER database: these equations are unaffected by sex or race

WITHOUT OUTLIER

| Model Covariates | RSS | ΔRSS | error df | F | P |
|---|---|---|---|---|---|
| Null | 403.26 | | | | |
| five year period | 402.96 | 0.30 | 157 | 0.1 | 0.73 |
| five year period + sex | 388.32 | 14.64 | 156 | 5.9 | 0.02 |
| five year period + sex + race | 319.23 | 69.09 | 155 | 33.5 | 3.8 × 10⁻⁸ |
| five year period + sex + race + Lauren type | 255.93 | 63.30 | 154 | 38.1 | 5.7 × 10⁻⁹ |
| five year period + sex + race + Lauren type + age | 50.64 | 205.29 | 153 | 620.2 | < 1 × 10⁻¹⁰ |

WITH OUTLIER

| Model Covariates | RSS | ΔRSS | error df | F | P |
|---|---|---|---|---|---|
| Null | 437.81 | | | | |
| five year period | 436.79 | 1.02 | 158 | 0.4 | 0.54 |
| five year period + sex | 418.50 | 18.29 | 157 | 6.9 | 0.01 |
| five year period + sex + race | 342.38 | 76.12 | 156 | 34.7 | 2.3 × 10⁻⁸ |
| five year period + sex + race + Lauren type | 273.23 | 69.15 | 155 | 39.2 | 3.5 × 10⁻⁹ |
| five year period + sex + race + Lauren type + age | 58.14 | 215.10 | 154 | 569.8 | < 1 × 10⁻¹⁰ |
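
The F column can be recomputed from the RSS and error df columns using the rule stated in the caption. The following is a minimal Python sketch, not the authors' code: the function names `sequential_f` and `f_table` are hypothetical, and it assumes the RSS in the denominator is that of the model after the covariate has been added, which is the reading that matches the tabulated values. The input numbers are copied from the WITHOUT OUTLIER panel.

```python
def sequential_f(rss_before: float, rss_after: float, error_df: int) -> float:
    """F for adding one covariate: (delta RSS / RSS after adding it) * error df."""
    delta_rss = rss_before - rss_after
    return delta_rss / rss_after * error_df


# RSS of the null model and, for each step, (covariate added, RSS, error df),
# taken from the WITHOUT OUTLIER panel of the table.
NULL_RSS = 403.26
STEPS = [
    ("five year period", 402.96, 157),
    ("sex", 388.32, 156),
    ("race", 319.23, 155),
    ("Lauren type", 255.93, 154),
    ("age", 50.64, 153),
]


def f_table(null_rss, steps):
    """Return (covariate, F rounded to one decimal) for each sequential step."""
    rows = []
    previous_rss = null_rss
    for name, rss, error_df in steps:
        rows.append((name, round(sequential_f(previous_rss, rss, error_df), 1)))
        previous_rss = rss  # the next covariate is compared against this model
    return rows


for name, f_value in f_table(NULL_RSS, STEPS):
    print(f"+ {name}: F = {f_value}")
```

Swapping in the RSS and error df values from the WITH OUTLIER panel gives that panel's F column in the same way.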