We extend the traditional active learning framework to include feedback on features in addition to labeling instances, and we execute a careful study of the effects of feature selection and human feedback on features in the setting of text categorization. Our experiments on a variety of categorization tasks indicate that there is significant potential in improving classifier performance by feature re-weighting, beyond that achieved via membership queries alone (traditional active learning) if we have access to an oracle that can point to the important (most predictive) features. Our experiments on human subjects indicate that human feedback on feature relevance can identify a sufficient proportion of the most relevant features (over 50 % in...
The labeling of data sets is a time-consuming task, which is, however, an important prerequisite for...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Active learning methods have been proposed to reduce the labeling effort of human experts: based on ...
We execute a careful study of the effects of feature selection and human feedback on features in act...
Methods that learn from prior information about input features such as generalized expectation (GE) ...
Supervised machine learning techniques rely on the availability of ample training data in the form o...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
Traditional supervised machine learning algorithms are expected to have access to a large corpus of ...
One of the goals of artificial intelligence is to build predictive models that can learn from exampl...
Abstract. An improved active learning method taking advantage of feature selection technique is prop...
Abstract. Automated text categorisation systems learn a generalised hypothesis from large numbers of...
Despite the availability and ease of collecting a large amount of free, unlabeled data, the expensiv...
Graduation date: 2011In text classification, labeling features is often less time consuming than lab...
In this paper we present two very popular aspects in supervised Machine Learning algorithms: feature...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
The labeling of data sets is a time-consuming task, which is, however, an important prerequisite for...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Active learning methods have been proposed to reduce the labeling effort of human experts: based on ...
We execute a careful study of the effects of feature selection and human feedback on features in act...
Methods that learn from prior information about input features such as generalized expectation (GE) ...
Supervised machine learning techniques rely on the availability of ample training data in the form o...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
Traditional supervised machine learning algorithms are expected to have access to a large corpus of ...
One of the goals of artificial intelligence is to build predictive models that can learn from exampl...
Abstract. An improved active learning method taking advantage of feature selection technique is prop...
Abstract. Automated text categorisation systems learn a generalised hypothesis from large numbers of...
Despite the availability and ease of collecting a large amount of free, unlabeled data, the expensiv...
Graduation date: 2011In text classification, labeling features is often less time consuming than lab...
In this paper we present two very popular aspects in supervised Machine Learning algorithms: feature...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
The labeling of data sets is a time-consuming task, which is, however, an important prerequisite for...
In recent decades, the availability of a large amount of data has propelled the field of machine lea...
Active learning methods have been proposed to reduce the labeling effort of human experts: based on ...