Article,

Identifying essential genes in bacterial metabolic networks with machine learning methods

K. Plaimas, R. Eils, and R. Konig.
BMC Systems Biology, 4 (1): 56+ (2010)
DOI: 10.1186/1752-0509-4-56

Abstract

BACKGROUND:Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective.RESULTS:We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75\% - 81\%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway.CONCLUSIONS:Using elaborated features characterizing network topology, sequence information and microarray data enables to predict essential genes from a bacterial reference organism to a related query organism without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism.

BibTeX key: Plaimas2010Identifying
entry type: article
year: 2010
journal: BMC Systems Biology
number: 1
pages: 56+
volume: 4
citeulike-article-id: 7118291
citeulike-linkout-2: http://view.ncbi.nlm.nih.gov/pubmed/20438628
citeulike-linkout-1: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2874528/
citeulike-linkout-3: http://www.hubmed.org/display.cgi?uids=20438628
pmid: 20438628
priority: 2
posted-at: 2010-05-03 10:40:31
issn: 1752-0509
citeulike-linkout-0: http://dx.doi.org/10.1186/1752-0509-4-56
pmcid: PMC2874528
DOI: 10.1186/1752-0509-4-56
url: http://dx.doi.org/10.1186/1752-0509-4-56

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 Plaimas2010Identifying %A Plaimas, Kitiporn %A Eils, Roland %A Konig, Rainer %D 2010 %J BMC Systems Biology %K drug-targets gene-essentiality machine-learning %N 1 %P 56+ %R 10.1186/1752-0509-4-56 %T Identifying essential genes in bacterial metabolic networks with machine learning methods %U http://dx.doi.org/10.1186/1752-0509-4-56 %V 4 %X BACKGROUND:Identifying essential genes in bacteria supports to identify potential drug targets and an understanding of minimal requirements for a synthetic cell. However, experimentally assaying the essentiality of their coding genes is resource intensive and not feasible for all bacterial organisms, in particular if they are infective.RESULTS:We developed a machine learning technique to identify essential genes using the experimental data of genome-wide knock-out screens from one bacterial organism to infer essential genes of another related bacterial organism. We used a broad variety of topological features, sequence characteristics and co-expression properties potentially associated with essentiality, such as flux deviations, centrality, codon frequencies of the sequences, co-regulation and phyletic retention. An organism-wise cross-validation on bacterial species yielded reliable results with good accuracies (area under the receiver-operator-curve of 75\% - 81\%). Finally, it was applied to drug target predictions for Salmonella typhimurium. We compared our predictions to the viability of experimental knock-outs of S. typhimurium and identified 35 enzymes, which are highly relevant to be considered as potential drug targets. Specifically, we detected promising drug targets in the non-mevalonate pathway.CONCLUSIONS:Using elaborated features characterizing network topology, sequence information and microarray data enables to predict essential genes from a bacterial reference organism to a related query organism without any knowledge about the essentiality of genes of the query organism. In general, such a method is beneficial for inferring drug targets when experimental data about genome-wide knockout screens is not available for the investigated organism.

BibSonomy

Identifying essential genes in bacterial metabolic networks with machine learning methods

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on