- Title
When is the Naive Bayes approximation not so naive?
- Authors
Stephens, Christopher R.; Huerta, Hugo Flores; Linares, Ana Ruíz
- Abstract
The Naive Bayes approximation (NBA) and associated classifier are widely used and offer robust performance across a large spectrum of problem domains. As it depends on a very strong assumption, independence among features, this has been somewhat puzzling. Various hypotheses have been put forward to explain its success, and many generalizations have been proposed. In this paper we propose a set of 'local' error measures, associated with the likelihood functions for subsets of attributes and for each class, and show explicitly how these local errors combine to give a 'global' error associated with the full attribute set. By so doing we formulate a framework within which the phenomenon of error cancellation, or augmentation, can be quantified and its impact on classifier performance estimated and predicted a priori. These diagnostics allow us to develop a deeper and more quantitative understanding of why the NBA is so robust and under what circumstances one expects it to break down. We show how these diagnostics can be used to select which features to combine, and we use them in a simple generalization of the NBA, applying the resulting classifier to a set of real-world data sets.
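To make the approximation concrete, the sketch below illustrates the standard Naive Bayes classifier the abstract refers to: the class-conditional joint likelihood is approximated by a product of per-feature likelihoods, P(x|c) ≈ ∏ᵢ P(xᵢ|c). The toy weather data, feature names, and smoothing choice are illustrative assumptions, not taken from the paper.

```python
from collections import Counter, defaultdict

# Hypothetical toy data (not from the paper): (feature tuple, class label).
data = [
    (("sunny", "hot"), "no"),
    (("sunny", "mild"), "no"),
    (("rain", "mild"), "yes"),
    (("rain", "cool"), "yes"),
    (("overcast", "hot"), "yes"),
]

class_counts = Counter(label for _, label in data)
n = len(data)

# Per-feature conditional counts: cond[(feature_index, label)][value].
cond = defaultdict(Counter)
for features, label in data:
    for i, value in enumerate(features):
        cond[(i, label)][value] += 1

# Distinct values seen for each feature (used for Laplace smoothing).
vocab = defaultdict(set)
for features, _ in data:
    for i, value in enumerate(features):
        vocab[i].add(value)

def nb_score(features, label):
    """Naive Bayes score: P(c) * prod_i P(x_i | c), with add-one smoothing."""
    score = class_counts[label] / n
    for i, value in enumerate(features):
        counts = cond[(i, label)]
        score *= (counts[value] + 1) / (sum(counts.values()) + len(vocab[i]))
    return score

def classify(features):
    """Pick the class maximizing the (approximate) posterior."""
    return max(class_counts, key=lambda c: nb_score(features, c))

print(classify(("rain", "mild")))  # -> "yes"
```

The independence assumption shows up in the single product inside `nb_score`; the paper's local error measures quantify how the error of each per-feature factor can cancel or compound across features when forming this product.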
- Subjects
Naive Bayes classification; approximation theory; generalizability theory; mathematical functions; performance evaluation
- Publication
Machine Learning, 2018, Vol. 107, Issue 2, p. 397
- ISSN
0885-6125
- Publication type
Article
- DOI
10.1007/s10994-017-5658-0