The Support Vector Machine (SVM) typically outperforms other algorithms on text classification problems, but requires training time roughly quadratic in the number of training documents. In contrast, linear time algorithms like Naive Bayes have lower performance, but can easily handle huge training sets. In this paper, we describe a technique that creates a continuum of classifiers between the SVM and a Naive Bayes like algorithm. Included in that continuum is a classifier that approximates SVM performance with linear training time. Another classifier on this continuum can outperform the SVM, yielding a breakeven point that beats other published results on Reuters-21578. We give empirical and theoretical evidence that our hybrid approach su...
Discriminative training for structured outputs has found increasing applications in areas such as na...
Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP...
Text categorization is the problem of classifying text documents into a set of predefined classes. A...
The Support Vector Machine (SVM) typi-cally outperforms other algorithms on text classification prob...
Abstract. Support vector machines (SVMs) have shown su-perb performance for text classification task...
This paper proposes and analyzes an efficient and effective approach for estimating the generalizati...
This paper proposes and analyzes an approach to estimating the generalization performance of a supp...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
The Text mining and Data mining supports different kinds of algorithms for classification of large d...
We compare Naive Bayes and Support Vector Machines on the task of multiclass text classification. Us...
We present new decomposition algorithms for training multi-class support vector machines (SVMs), in ...
My first exposure to Support Vector Machines came this spring when heard Sue Dumais present impressi...
Thesis (Ph.D. (Computer Engineering))--North-West University, Potchefstroom Campus, 2012As digital c...
Training a support vector machine (SVM) requires the solution of a quadratic programming problem (QP...
Training SVM requires large memory and long cpu time when the pattern set is large. To alleviate the...
Discriminative training for structured outputs has found increasing applications in areas such as na...
Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP...
Text categorization is the problem of classifying text documents into a set of predefined classes. A...
The Support Vector Machine (SVM) typi-cally outperforms other algorithms on text classification prob...
Abstract. Support vector machines (SVMs) have shown su-perb performance for text classification task...
This paper proposes and analyzes an efficient and effective approach for estimating the generalizati...
This paper proposes and analyzes an approach to estimating the generalization performance of a supp...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
The Text mining and Data mining supports different kinds of algorithms for classification of large d...
We compare Naive Bayes and Support Vector Machines on the task of multiclass text classification. Us...
We present new decomposition algorithms for training multi-class support vector machines (SVMs), in ...
My first exposure to Support Vector Machines came this spring when heard Sue Dumais present impressi...
Thesis (Ph.D. (Computer Engineering))--North-West University, Potchefstroom Campus, 2012As digital c...
Training a support vector machine (SVM) requires the solution of a quadratic programming problem (QP...
Training SVM requires large memory and long cpu time when the pattern set is large. To alleviate the...
Discriminative training for structured outputs has found increasing applications in areas such as na...
Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP...
Text categorization is the problem of classifying text documents into a set of predefined classes. A...