A new genetic algorithm in proteomics: Feature selection for SELDI-TOF data
Mass spectrometry from clinical specimens is used in order to identify biomarkers in a diagnosis. Thus, a reliable method for both feature selection and classification is required. A novel method is proposed to find biomarkers in SELDI-TOF in order to perform robust classification.The feature selection is based on a new genetic algorithm. Concerning the classification, a method which takes into account the great variability on intensity by using decision stumps has been developed. Moreover, as the samples are often small, it is more appropriate to use the decision stumps simultaneously than building a complete tree. The thresholds of the decision stumps are determined in the same genetic algorithm. Finally, the method was generalized to more than two groups based on pairwise coupling. The obtained algorithm was applied on two data sets: a publicly available one containing two groups allowing a comparison with other methods from the literature and a new one containing three groups.
Year of publication: |
2008
|
---|---|
Authors: | Reynès, Christelle ; Sabatier, Robert ; Molinari, Nicolas ; Lehmann, Sylvain |
Published in: |
Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 52.2008, 9, p. 4380-4394
|
Publisher: |
Elsevier |
Saved in:
Saved in favorites
Similar items by person
-
Sabatier, Robert, (2008)
-
Choice of B-splines with free parameters in the flexible discriminant analysis context
Reynes, Christelle, (2006)
-
Bounded optimal knots for regression splines
Molinari, Nicolas, (2004)
- More ...