Article,

Exploring the optimal strategy to predict essential genes in microbes.

J. Deng, L. Tan, X. Lin, Y. Lu, and L. Lu.
Biomolecules, 2 (1): 1--22 (Dec 27, 2011)
DOI: 10.3390/biom2010001

Abstract

Accurately predicting essential genes is important in many aspects of biology, medicine and bioengineering. In previous research, we have developed a machine learning based integrative algorithm to predict essential genes in bacterial species. This algorithm lends itself to two approaches for predicting essential genes: learning the traits from known essential genes in the target organism, or transferring essential gene annotations from a closely related model organism. However, for an understudied microbe, each approach has its potential limitations. The first is constricted by the often small number of known essential genes. The second is limited by the availability of model organisms and by evolutionary distance. In this study, we aim to determine the optimal strategy for predicting essential genes by examining four microbes with well-characterized essential genes. Our results suggest that, unless the known essential genes are few, learning from the known essential genes in the target organism usually outperforms transferring essential gene annotations from a related model organism. In fact, the required number of known essential genes is surprisingly small to make accurate predictions. In prokaryotes, when the number of known essential genes is greater than 2\% of total genes, this approach already comes close to its optimal performance. In eukaryotes, achieving the same best performance requires over 4\% of total genes, reflecting the increased complexity of eukaryotic organisms. Combining the two approaches resulted in an increased performance when the known essential genes are few. Our investigation thus provides key information on accurately predicting essential genes and will greatly facilitate annotations of microbial genomes.

BibTeX key: Deng2011Exploring
entry type: article
year: 2011
month: dec
day: 27
journal: Biomolecules
number: 1
pages: 1--22
volume: 2
citeulike-article-id: 11354745
citeulike-linkout-2: http://view.ncbi.nlm.nih.gov/pubmed/24970124
citeulike-linkout-1: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4030871/
citeulike-linkout-3: http://www.hubmed.org/display.cgi?uids=24970124
pmid: 24970124
priority: 2
posted-at: 2014-07-31 10:19:22
issn: 2218-273X
citeulike-linkout-0: http://dx.doi.org/10.3390/biom2010001
pmcid: PMC4030871
DOI: 10.3390/biom2010001
url: http://dx.doi.org/10.3390/biom2010001

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 Deng2011Exploring %A Deng, Jingyuan %A Tan, Lirong %A Lin, Xiaodong %A Lu, Yao %A Lu, Long J. %D 2011 %J Biomolecules %K centrality gene-essentiality lethality %N 1 %P 1--22 %R 10.3390/biom2010001 %T Exploring the optimal strategy to predict essential genes in microbes. %U http://dx.doi.org/10.3390/biom2010001 %V 2 %X Accurately predicting essential genes is important in many aspects of biology, medicine and bioengineering. In previous research, we have developed a machine learning based integrative algorithm to predict essential genes in bacterial species. This algorithm lends itself to two approaches for predicting essential genes: learning the traits from known essential genes in the target organism, or transferring essential gene annotations from a closely related model organism. However, for an understudied microbe, each approach has its potential limitations. The first is constricted by the often small number of known essential genes. The second is limited by the availability of model organisms and by evolutionary distance. In this study, we aim to determine the optimal strategy for predicting essential genes by examining four microbes with well-characterized essential genes. Our results suggest that, unless the known essential genes are few, learning from the known essential genes in the target organism usually outperforms transferring essential gene annotations from a related model organism. In fact, the required number of known essential genes is surprisingly small to make accurate predictions. In prokaryotes, when the number of known essential genes is greater than 2\% of total genes, this approach already comes close to its optimal performance. In eukaryotes, achieving the same best performance requires over 4\% of total genes, reflecting the increased complexity of eukaryotic organisms. Combining the two approaches resulted in an increased performance when the known essential genes are few. Our investigation thus provides key information on accurately predicting essential genes and will greatly facilitate annotations of microbial genomes.

@article{Deng2011Exploring, abstract = {Accurately predicting essential genes is important in many aspects of biology, medicine and bioengineering. In previous research, we have developed a machine learning based integrative algorithm to predict essential genes in bacterial species. This algorithm lends itself to two approaches for predicting essential genes: learning the traits from known essential genes in the target organism, or transferring essential gene annotations from a closely related model organism. However, for an understudied microbe, each approach has its potential limitations. The first is constricted by the often small number of known essential genes. The second is limited by the availability of model organisms and by evolutionary distance. In this study, we aim to determine the optimal strategy for predicting essential genes by examining four microbes with well-characterized essential genes. Our results suggest that, unless the known essential genes are few, learning from the known essential genes in the target organism usually outperforms transferring essential gene annotations from a related model organism. In fact, the required number of known essential genes is surprisingly small to make accurate predictions. In prokaryotes, when the number of known essential genes is greater than 2\% of total genes, this approach already comes close to its optimal performance. In eukaryotes, achieving the same best performance requires over 4\% of total genes, reflecting the increased complexity of eukaryotic organisms. Combining the two approaches resulted in an increased performance when the known essential genes are few. Our investigation thus provides key information on accurately predicting essential genes and will greatly facilitate annotations of microbial genomes.}, added-at = {2018-12-02T16:09:07.000+0100}, author = {Deng, Jingyuan and Tan, Lirong and Lin, Xiaodong and Lu, Yao and Lu, Long J.}, biburl = {https://www.bibsonomy.org/bibtex/231010d18819a6d4918f53625cd13cfff/karthikraman}, citeulike-article-id = {11354745}, citeulike-linkout-0 = {http://dx.doi.org/10.3390/biom2010001}, citeulike-linkout-1 = {http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4030871/}, citeulike-linkout-2 = {http://view.ncbi.nlm.nih.gov/pubmed/24970124}, citeulike-linkout-3 = {http://www.hubmed.org/display.cgi?uids=24970124}, day = 27, doi = {10.3390/biom2010001}, interhash = {a07ca6970d0e24623f64384e304beb1e}, intrahash = {31010d18819a6d4918f53625cd13cfff}, issn = {2218-273X}, journal = {Biomolecules}, keywords = {centrality gene-essentiality lethality}, month = dec, number = 1, pages = {1--22}, pmcid = {PMC4030871}, pmid = {24970124}, posted-at = {2014-07-31 10:19:22}, priority = {2}, timestamp = {2018-12-02T16:09:07.000+0100}, title = {Exploring the optimal strategy to predict essential genes in microbes.}, url = {http://dx.doi.org/10.3390/biom2010001}, volume = 2, year = 2011 }

BibSonomy

Exploring the optimal strategy to predict essential genes in microbes.

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on