TY - GEN
T1 - Identifying genes with the concept of customization
AU - Hua, Dong
AU - Chen, Dechang
AU - Youssef, Abdou
N1 - Publisher Copyright:
© 2005 IEEE Computer Society. All rights reserved.
PY - 2005
Y1 - 2005
N2 - Gene selection with microarray data is an important task towards the study of genomics. The goal is to identify the optimal subset of genes such that maximum discrimination power across samples (e.g., tumor types) while minimum redundancy among genes are achieved. Essentially, it is NP-complete. Approximation algorithms are usually solicited including individual ranking and sequential forward selection. Typically, from source input microarray data to output selected genes, multiple steps including preprocessing, discretization, discrimination modeling, redundancy modeling, optimization formularization, classification, and evaluation are involved in the presence of a number of options (techniques) for each of them. Putting them together, we form the concept of customization for gene selection in this paper, that is, configure the entire scenario such that various maybe trivial techniques can team work with superior performance rather than focus on certain technique within a single step (e.g., discrimination power modeling). One configuration following the principle of simplicity is constructed in this paper which identifies genes effectively shown by experiments.
AB - Gene selection with microarray data is an important task towards the study of genomics. The goal is to identify the optimal subset of genes such that maximum discrimination power across samples (e.g., tumor types) while minimum redundancy among genes are achieved. Essentially, it is NP-complete. Approximation algorithms are usually solicited including individual ranking and sequential forward selection. Typically, from source input microarray data to output selected genes, multiple steps including preprocessing, discretization, discrimination modeling, redundancy modeling, optimization formularization, classification, and evaluation are involved in the presence of a number of options (techniques) for each of them. Putting them together, we form the concept of customization for gene selection in this paper, that is, configure the entire scenario such that various maybe trivial techniques can team work with superior performance rather than focus on certain technique within a single step (e.g., discrimination power modeling). One configuration following the principle of simplicity is constructed in this paper which identifies genes effectively shown by experiments.
UR - http://www.scopus.com/inward/record.url?scp=85114713735&partnerID=8YFLogxK
U2 - 10.1109/CVPR.2005.472
DO - 10.1109/CVPR.2005.472
M3 - Conference contribution
AN - SCOPUS:85114713735
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
BT - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005 - Workshops
PB - IEEE Computer Society
T2 - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005 - Workshops
Y2 - 21 September 2005 through 23 September 2005
ER -