| Title | Robust biomarker identification for cancer diagnosis using ensemble feature selection methods |
| Publication Type | Poster presentation |
| Year of Publication | 2009 |
| Authors | Abeel, T, Helleputte T, Dupont P, Saeys Y |
| Conference | BioMagnet 2009, Gent, Belgium; MLSB 2009, Ljubljana, Slovenia; BBC 2009, Liege, Belgium |
| Date Published | 10/2009 |
| Abstract | Biomarker discovery is an important topic in biomedical applications of computational biology, including applications such as gene and SNP selection from high dimensional data. Our first contribution is a general framework for the analysis of the robustness of a biomarker selection algorithm. Secondly, we conducted a large scale analysis of the recently introduced concept of ensemble feature selection, where multiple feature selections are combined in order to increase the robustness of the final set of selected features. We focus on selection methods that are embedded in the estimation of support vector machines (SVMs). SVMs are powerful classification models that have shown state-of-the-art performance on several diagnosis and prognosis tasks on biological data. Their feature selection extensions also offered good results for gene selection tasks. We show that the robustness of SVMs for biomarker discovery can be substantially increased by using ensemble feature selection techniques, while keeping the same classification performances. The proposed methodology is evaluated on four microarray data sets showing increases of up to 27\% in robustness of the selected biomarkers. The stability gain obtained with |