Abstract. Political texts on the Web, documenting laws and policies and the process leading to them, are of key importance to government, industry, and every individual citizen. Yet access to such texts is difficult due to the ever-increasing volume and complexity of the content, prompting the need for indexing or annotating them with a common controlled vocabulary or ontology. In this paper, we investigate the effectiveness of different sources of evidence (such as the labeled training data, textual glosses of descriptor terms, and the thesaurus structure) for automatically indexing political texts. Our main findings are the following. First, using a learning to rank (LTR) approach integrating all features, we observe significantly better...
Political texts are pervasive on the Web covering laws and policies in national and supranational ju...
Applications of automated text analysis measuring topics, ideology, sentiment or even personality ar...
Documents indexed with controlled vocabularies enable users of libraries to discover relevant docume...
Political texts on the Web, documenting laws and policies and the process leading to them, are of ke...
ABSTRACT: The following research discusses text analysis approaches to automatically categorize news...
Politics and political conflict often occur in the written and spoken word. Scholars have long recog...
Since 1995 the techniques and capacities to store new electronic data and to make it available to ma...
The growing availability of data about online information behaviour enables new possibilities for po...
Comparative researchers in politics are deeply interested in the ways in which political discourse i...
Topic indexing is the task of identifying the main topics covered by a document. These are useful fo...
Replication Materials (Data and Code) for 'Text as Data' Abstract: Politics and political conflict o...
The inference of politically-oriented information from text data is a popular research topic in Natu...
During the past 15 years, automatic text scaling has become one of the key tools of the Text as Data...
With the evolution of technology, the way of expressing opinions has shifted to the digital wo...
Text has always been an important data source in political science. What has changed in recent years...