This paper addresses the problem of classification in situations where the data distribution is not homoge-neous: Data instances might come from different lo-cations or times, and therefore are sampled from re-lated but different distributions. In particular, features may appear in some parts of the data that are rarely or never seen in others. In most situations with non-homogeneous data, the training data is not representa-tive of the distribution under which the classifier must operate. We propose a method, based on probabilistic graphical models, for utilizing unseen features during classification. Our method introduces, for each such unseen feature, a continuous hidden variable describ-ing its influence on the class — whether it tends ...
Most positive and unlabeled data is subject to selection biases. The labeled examples can, for examp...
What are the building blocks of our visual representations? Whatever we look at, the things we see w...
In probabilistic approaches to classification and information extraction, one typically builds a sta...
This paper addresses the problem of classification in situations where the data distribution is not ...
Traditional data-driven classifier learning approaches become limited when the training data is inad...
It is difficult to apply machine learning to new domains because often we lack labeled problem insta...
Some learning techniques for classification tasks work indirectly, by first trying to fit a full pro...
98 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2003.With the growing use of comput...
For many interesting tasks, such as medical diagnosis and web page classification, a learner only ha...
This paper presents an approach to learning from noisy data that views the problem as one of reasoni...
The goal of binary classification is to train a model that can distinguish between examples belongin...
In this paper we propose a simple yet powerful method for learning representations in supervised lea...
In the problem of learning with positive and unlabeled examples, existing research all assumes that ...
As machine learning gains significant attention in many disciplines and research communities, the v...
In many important text classification problems, acquiring class labels for training documents is cos...
Most positive and unlabeled data is subject to selection biases. The labeled examples can, for examp...
What are the building blocks of our visual representations? Whatever we look at, the things we see w...
In probabilistic approaches to classification and information extraction, one typically builds a sta...
This paper addresses the problem of classification in situations where the data distribution is not ...
Traditional data-driven classifier learning approaches become limited when the training data is inad...
It is difficult to apply machine learning to new domains because often we lack labeled problem insta...
Some learning techniques for classification tasks work indirectly, by first trying to fit a full pro...
98 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2003.With the growing use of comput...
For many interesting tasks, such as medical diagnosis and web page classification, a learner only ha...
This paper presents an approach to learning from noisy data that views the problem as one of reasoni...
The goal of binary classification is to train a model that can distinguish between examples belongin...
In this paper we propose a simple yet powerful method for learning representations in supervised lea...
In the problem of learning with positive and unlabeled examples, existing research all assumes that ...
As machine learning gains significant attention in many disciplines and research communities, the v...
In many important text classification problems, acquiring class labels for training documents is cos...
Most positive and unlabeled data is subject to selection biases. The labeled examples can, for examp...
What are the building blocks of our visual representations? Whatever we look at, the things we see w...
In probabilistic approaches to classification and information extraction, one typically builds a sta...