Text is becoming a central source of data for social science research. With advances in digitization and open records practices, the central challenge has in large part shifted away from availability to usability. Automated text classification methodologies are becoming increasingly important within political science because they hold the promise of substantially reducing the costs of converting text to data for a variety of tasks. In this paper, we consider a number of questions of interest to prospective users of supervised learning methods, which are appropriate to classification tasks where known categories are applied. For the right task, supervised learning methods can dramatically lower the costs associated with labeling large volume...
In this poster we describe a pilot study of searching social science literature for legacy corpora t...
We examine methods for improving models for automatically labeling social media data. In particular ...
This repository contains the manuscript of my Ph.D. dissertation. Here is the abstract of the manusc...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
The increasing availability of digitized text presents enormous opportunities for social scientists....
Comparative researchers in politics are deeply interested in the ways in which political discourse i...
ABSTRACT. Social scientists interested in mixed-methods research have traditionally turned to human ...
Text classification via supervised learning involves various steps from processing raw data, featur...
Social scientists have long hand-labeled texts to create datasets useful for studying topics from co...
In recent years, political science has witnessed an explosion of data. Political scientists have beg...
Social scientists often classify text documents to use the resulting labels as an outcome or a predi...
Natural language corpora are phenomenally rich resources for learning about people and society, and ...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
This paper reviews the logic of attempts to automate the processes involved in computer-assisted tex...
The recent advent and evolution of deep learning models and pre-trained embedding techniques have cr...
In this poster we describe a pilot study of searching social science literature for legacy corpora t...
We examine methods for improving models for automatically labeling social media data. In particular ...
This repository contains the manuscript of my Ph.D. dissertation. Here is the abstract of the manusc...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
The increasing availability of digitized text presents enormous opportunities for social scientists....
Comparative researchers in politics are deeply interested in the ways in which political discourse i...
ABSTRACT. Social scientists interested in mixed-methods research have traditionally turned to human ...
Text classification via supervised learning involves various steps from processing raw data, featur...
Social scientists have long hand-labeled texts to create datasets useful for studying topics from co...
In recent years, political science has witnessed an explosion of data. Political scientists have beg...
Social scientists often classify text documents to use the resulting labels as an outcome or a predi...
Natural language corpora are phenomenally rich resources for learning about people and society, and ...
Supervised machine learning methods are increasingly employed in political science. Such models requ...
This paper reviews the logic of attempts to automate the processes involved in computer-assisted tex...
The recent advent and evolution of deep learning models and pre-trained embedding techniques have cr...
In this poster we describe a pilot study of searching social science literature for legacy corpora t...
We examine methods for improving models for automatically labeling social media data. In particular ...
This repository contains the manuscript of my Ph.D. dissertation. Here is the abstract of the manusc...