STEP 1 | STEP 2 | STEP 3 |
---|---|---|
Collection of data: . Coordination in relation with clinical teams and clinical research units identified in the EPICURE project . Coordination of all platforms and units to avoid missing data | DATABASE PREPROCESSING: Use of efficient statistical tools for the reduction of the dimension in order to get p < 5000 at least for n = 300 patients: . Sparse Canonical Correlation analysis (in each class of variables) . Principal Component Analysis, Partial Least Squares | Mathematical development Adaptation of recent statistical methods of Data Mining to manage this high-dimension problem such as .LASSO/SLOPE methods (which select solutions with a weak number of « lighted » variables) and their variants adapted to the problem (Sparse Cox model to manage the censured data |