Abstract
Naive Bayes is one of the most efficient and effective inductive learning
algorithms for machine learning and data mining. Its competitive
performance in classification is surprising, because the conditional
independence assumption on which it is based is rarely true in real-world
applications. An open question is: what is the true reason for the
surprisingly good performance of naive Bayes in classification? In
this paper, we propose a novel explanation of the superb classification
performance of naive Bayes. We show that, essentially, the dependence
distribution plays a crucial role; that is, how the local dependence of a
node is distributed in each class, evenly or unevenly, and how the local
dependences of all nodes work together, consistently (supporting a certain
classification) or inconsistently (canceling each other out).
Therefore, no matter how strong the dependences among attributes
are, naive Bayes can still be optimal if the dependences are distributed
evenly in the classes, or if the dependences cancel each other out. We
propose and prove a sufficient and necessary condition for the optimality
of naive Bayes. Further, we investigate the optimality of naive Bayes
under the Gaussian distribution. We present and prove a sufficient
condition for the optimality of naive Bayes in which dependences
between attributes do exist. This provides evidence that dependences
among attributes may cancel each other out. In addition, we explore
when naive Bayes works well.
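To make the conditional independence assumption discussed above concrete, the following is a minimal sketch (not code from the paper) of a Gaussian naive Bayes classifier: each attribute's likelihood is modeled per class as if the attributes were independent, and the class maximizing the resulting product (here, a log-sum) is chosen. The function names and the small variance-smoothing constant are assumptions of this sketch.

```python
import numpy as np

def fit_gaussian_nb(X, y):
    """Estimate per-class priors and per-attribute Gaussian parameters."""
    params = {}
    for c in np.unique(y):
        Xc = X[y == c]
        params[c] = (len(Xc) / len(X),        # prior P(c)
                     Xc.mean(axis=0),          # per-attribute means
                     Xc.var(axis=0) + 1e-9)    # per-attribute variances (smoothed)
    return params

def predict(params, x):
    """Pick the class maximizing log P(c) + sum_i log P(x_i | c)."""
    best, best_score = None, -np.inf
    for c, (prior, mu, var) in params.items():
        # Independence assumption: the joint likelihood factorizes over attributes.
        log_lik = -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)
        score = np.log(prior) + log_lik
        if score > best_score:
            best, best_score = c, score
    return best
```

The factorized likelihood ignores correlations between attributes; the paper's argument is that such correlations need not hurt classification when they are distributed evenly across classes or cancel each other out.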