Gene Assessment and Sample Classification for Gene Expression Data Using a Genetic Algorithm / k-nearest Neighbor Method

L. L., D. T.A., W. C.R., L. A.J., и P. L.G..
Combinatorial Chemistry & High Throughput Screening, (December 2001)
DOI: doi:10.2174/1386207013330733

Аннотация

Recent tools that analyze microarray expression data have exploited correlation-based approaches such as clustering analysis. We describe a new method for assessing the importance of genes for sample classification based on expression data. Our approach combines a genetic algorithm (GA) and the k-nearest neighbor (KNN) method to identify genes that jointly can discriminate between two types of samples (e.g. normal vs. tumor). First, many such subsets of differentially expressed genes are obtained independently using the GA. Then, the overall frequency with which genes were selected is used to deduce the relative importance of genes for sample classification. Sample heterogeneity is accommodated; that is, the method should be robust against the existence of distinct subtypes. We applied GA / KNN to expression data from normal versus tumor tissue from human colon. Two distinct clusters were observed when the 50 most frequently selected genes were used to classify all of the samples in the data sets studied and the majority of samples were classified correctly. Identification of a set of differentially expressed genes could aid in tumor diagnosis and could also serve to identify disease subtypes that may benefit from distinct clinical approaches to treatment.

ключ BibTeX: Li2001GeneAssessment
тип записи: article
год: December 2001
журнал: Combinatorial Chemistry & High Throughput Screening
страницы: 727-739(15)
том: 4
DOI: doi:10.2174/1386207013330733
url: http://www.ingentaconnect.com/content/ben/cchts/2001/00000004/00000008/art00010

тэги

imported

Пользователи данного ресурса

Комментарии и рецензиипоказать / перейти в невидимый режим

Пожалуйста, войдите в систему, чтобы принять участие в дискуссии (добавить собственные рецензию, или комментарий)

Цитировать эту публикацию

@article{Li2001GeneAssessment, abstract = {Recent tools that analyze microarray expression data have exploited correlation-based approaches such as clustering analysis. We describe a new method for assessing the importance of genes for sample classification based on expression data. Our approach combines a genetic algorithm (GA) and the k-nearest neighbor (KNN) method to identify genes that jointly can discriminate between two types of samples (e.g. normal vs. tumor). First, many such subsets of differentially expressed genes are obtained independently using the GA. Then, the overall frequency with which genes were selected is used to deduce the relative importance of genes for sample classification. Sample heterogeneity is accommodated; that is, the method should be robust against the existence of distinct subtypes. We applied GA / KNN to expression data from normal versus tumor tissue from human colon. Two distinct clusters were observed when the 50 most frequently selected genes were used to classify all of the samples in the data sets studied and the majority of samples were classified correctly. Identification of a set of differentially expressed genes could aid in tumor diagnosis and could also serve to identify disease subtypes that may benefit from distinct clinical approaches to treatment.}, added-at = {2010-01-25T16:04:55.000+0100}, author = {L., Li and T.A., Darden and C.R., Weingberg and A.J., Levine and L.G., Pedersen}, biburl = {https://www.bibsonomy.org/bibtex/2f80de985eccb78262f9cd94779fd4e83/rocioce2}, description = {IngentaConnect Gene Assessment and Sample Classification for Gene Expression Dat...}, doi = {doi:10.2174/1386207013330733}, interhash = {c294d564236fb182a487e9a1d4316874}, intrahash = {f80de985eccb78262f9cd94779fd4e83}, journal = {Combinatorial Chemistry & High Throughput Screening}, keywords = {imported}, pages = {727-739(15)}, timestamp = {2010-01-26T09:18:25.000+0100}, title = {Gene Assessment and Sample Classification for Gene Expression Data Using a Genetic Algorithm / k-nearest Neighbor Method}, url = {http://www.ingentaconnect.com/content/ben/cchts/2001/00000004/00000008/art00010}, volume = 4, year = {December 2001} }

BibSonomy