Skip to main content

Table 1 The datasets analyzed in this study

From: Identifying primary site of lung-limited Cancer of unknown primary based on relative gene expression orderings

Label Dataset Platform Sample size Ref (PMID)
Training sets
Primary CRC
 Set1 GSE21510 GPL570 123 21270110
 Set2 GSE14095 GPL570 189 21680303
 Set3 GSE41258 GPL96 186 19359472
Primary lung cancer
 Set4 GSE31210 GPL570 226 22080568
 Set5 GSE14814 GPL96 133 20823422
 Set6 GSE43580 GPL570 150 23966112
Validation sets
Primary CRC
  GSE2138 GPL96 20 16247484
  GSE7208 GPL96 59 17638901
  GSE39582 GPL570 566 23700391
  GSE5364 GPL96 9 18636107
  GSE19249 GPL571 15 20522636
Primary lung cancer
  GSE19804 GPL570 60 20802022
  GSE33532 GPL570 80 Michael Meister, et al.
  GSE18842 GPL570 46 20878980
  GSE5364 GPL96 18 18636107
  GSE19249 GPL571 7 20522636
Lung metastases of CRC
 Set7 GSE41258 GPL96 20 19359472
  GSE5851 GPL571 3 17664471
  GSE28702 GPL570 1 22095227