We augment naive Bayes models with statistical n-gram language models to address shortcomings of the standard naive Bayes text classifier. The result is a generalized naive Bayes classifier which allows for a local Markov dependence among observations; a model we refer to as the Chain Augmented Naive (CAN) Bayes classifier. CAN models have two advantages over standard naive Bayes classifiers. First, they relax some of the independence assumptions of naive Bayes (allowing a local Markov chain dependence in the observed variables) while still permitting efficient inference and learning. Second, they permit straightforward application of sophisticated smoothing techniques from statistical language modeling, which allows one to obta...
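For concreteness, the decision rule implied by this description can be written as a class-conditional n-gram model. The notation below is assumed here for illustration, not taken from the abstract: w_1, ..., w_T is the token sequence of a document and c ranges over class labels.

    c^* = \arg\max_{c} \; \Pr(c) \prod_{i=1}^{T} \Pr(w_i \mid w_{i-n+1}, \ldots, w_{i-1}, c)

Setting n = 1 recovers the standard multinomial naive Bayes classifier; n > 1 introduces the local Markov chain dependence the abstract describes, and each class-conditional factor can be smoothed with standard language-model techniques.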
Due to its simplicity, efficiency, and effectiveness, multinomial naive Bayes (MNB) has been widely ...
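As a common reference point for several of the abstracts collected here, a minimal multinomial naive Bayes text classifier can be sketched as follows. This is an illustrative sketch only: the tokenized input format, the add-one (Laplace) smoothing, and names such as train_mnb are my assumptions, not details from any of these papers.

    import math
    from collections import Counter, defaultdict

    def train_mnb(docs):
        """docs: list of (token_list, label) pairs."""
        class_counts = Counter()
        word_counts = defaultdict(Counter)
        vocab = set()
        for tokens, label in docs:
            class_counts[label] += 1
            word_counts[label].update(tokens)
            vocab.update(tokens)
        n_docs = sum(class_counts.values())
        log_prior = {c: math.log(k / n_docs) for c, k in class_counts.items()}
        v = len(vocab)
        log_lik = {}
        for c in class_counts:
            total = sum(word_counts[c].values())
            # Add-one (Laplace) smoothing over the shared vocabulary.
            log_lik[c] = {w: math.log((word_counts[c][w] + 1) / (total + v))
                          for w in vocab}
        return log_prior, log_lik, vocab

    def classify(tokens, log_prior, log_lik, vocab):
        # Score each class by log-prior plus per-occurrence log-likelihoods.
        scores = {c: log_prior[c] + sum(log_lik[c][w] for w in tokens if w in vocab)
                  for c in log_prior}
        return max(scores, key=scores.get)

For example, classify("great fun".split(), *train_mnb(docs)) returns the class with the highest posterior score; here unseen words are simply skipped, which is one of several reasonable conventions.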
Building models of language is a central task in natural language processing. Traditionally, languag...
The underlying assumption in traditional machine learning algorithms is that instances are Independe...
We augment the naive Bayes model with an n-gram language model to address two shortcomings of naive ...
There are numerous text documents available in electronic form. More and more are becoming available...
Naive Bayes is often used as a baseline in text classification because it is fast and easy to implem...
Though naïve Bayes text classifiers are widely used because of their simplicity and effectiveness, the...
Multinomial Naive Bayes with Expectation Maximization (MNB-EM) is a standard semi-supervised learnin...
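MNB-EM as described here alternates between estimating class posteriors for the unlabeled documents and re-fitting the multinomial parameters from the resulting soft counts. The sketch below shows that generic EM loop under assumptions of mine (Counter bags-of-words, add-one smoothing, the name em_mnb); it is not the specific procedure of this paper.

    import math
    from collections import Counter, defaultdict

    def em_mnb(labeled, unlabeled, classes, n_iters=10):
        """labeled: [(Counter, label)]; unlabeled: [Counter]; classes: list of labels."""
        # Responsibilities: labeled docs stay one-hot, unlabeled start uniform.
        resp = [[1.0 if c == y else 0.0 for c in classes] for _, y in labeled]
        resp += [[1.0 / len(classes)] * len(classes) for _ in unlabeled]
        docs = [d for d, _ in labeled] + list(unlabeled)
        vocab = {w for d in docs for w in d}
        for _ in range(n_iters):
            # M-step: re-estimate priors and word probabilities from soft counts.
            prior = [(sum(r[k] for r in resp) + 1.0) / (len(docs) + len(classes))
                     for k in range(len(classes))]
            word_p = []
            for k in range(len(classes)):
                counts = defaultdict(float)
                for d, r in zip(docs, resp):
                    for w, f in d.items():
                        counts[w] += r[k] * f
                total = sum(counts.values())
                word_p.append({w: (counts[w] + 1.0) / (total + len(vocab))
                               for w in vocab})
            # E-step: recompute posteriors for the unlabeled documents only.
            for i in range(len(labeled), len(docs)):
                logp = [math.log(prior[k])
                        + sum(f * math.log(word_p[k][w]) for w, f in docs[i].items())
                        for k in range(len(classes))]
                m = max(logp)
                exps = [math.exp(lp - m) for lp in logp]
                z = sum(exps)
                resp[i] = [e / z for e in exps]
        return prior, word_p

A standard caveat with such loops is that EM optimizes likelihood rather than classification accuracy, so unlabeled data can hurt as well as help.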
Recent approaches to text classification have used two different first-order probabilistic models fo...
Recent work in text classification has used two different first-order probabilistic models for class...
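For reference, the two first-order event models these two abstracts refer to differ in the document likelihood they assume. In standard notation (assumed here: V the vocabulary, b_w(d) a 0/1 indicator that word w occurs in document d, f_w(d) its frequency):

    \Pr(d \mid c) = \prod_{w \in V} \bigl[\, b_w(d)\,\Pr(w \mid c) + (1 - b_w(d))\,(1 - \Pr(w \mid c)) \,\bigr]  \qquad \text{(multivariate Bernoulli)}

    \Pr(d \mid c) \propto \prod_{w \in V} \Pr(w \mid c)^{f_w(d)}  \qquad \text{(multinomial)}

The Bernoulli model scores absent words as well as present ones, while the multinomial model counts repeated occurrences, which is one reason it is usually preferred for longer documents and larger vocabularies.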
This paper presents empirical results for several versions of the multinomial naive Bayes classifier...
Partly due to the proliferation of microblogs, short texts are becoming prominent. A huge number of sh...
Naïve Bayes classifiers, which are widely used for text classification in machine learning, ar...