The amount of text data has been growing exponentially and with it the demand for improved information extraction (IE) efforts to analyze and query such data. While automatic IE systems have proven useful in controlled experiments, in practice the gap between machine learning extraction and human extraction is still quite large. In this paper, we propose a system that uses crowdsourcing techniques to help close this gap. One of the fundamental issues inherent in using a large-scale human workforce is deciding the optimal questions to pose to the crowd. We demonstrate novel solutions using mutual information and token clustering techniques in the domain of bibliographic citation extraction. Our experiments show promising results in using cro...
Social media has led to the democratisation of opinion shar-ing. A wealth of information about publi...
Analysts synthesize complex, qualitative data to uncover themes and concepts, but the process is tim...
The amount of controversial issues being discussed on the Web has been growing dramatically. In arti...
The development of solutions to scale the extraction of data from Web sources is still a challenging...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Abstract: Named entity extraction is an established research area in the field of information extrac...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
News articles, reports, blog posts and academic papers of-ten include graphical charts that serve to...
News articles, reports, blog posts and academic papers often include graphical charts that serve to ...
Abstract—Automatic information extraction (IE) enables the construction of very large knowledge base...
by Sarath Kumar KONDREDDI Ambiguity, complexity, and diversity in natural language textual expressio...
Crowdsourcing has recently been attracting increasing attention as a promising means of collecting l...
Abstract. In this paper, we introduce the CrowdTruth open-source soft-ware framework for machine-hum...
Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to...
This paper describes a crowdsourcing system that integrates machine learning techniques with hu-man ...
Social media has led to the democratisation of opinion shar-ing. A wealth of information about publi...
Analysts synthesize complex, qualitative data to uncover themes and concepts, but the process is tim...
The amount of controversial issues being discussed on the Web has been growing dramatically. In arti...
The development of solutions to scale the extraction of data from Web sources is still a challenging...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Abstract: Named entity extraction is an established research area in the field of information extrac...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
News articles, reports, blog posts and academic papers of-ten include graphical charts that serve to...
News articles, reports, blog posts and academic papers often include graphical charts that serve to ...
Abstract—Automatic information extraction (IE) enables the construction of very large knowledge base...
by Sarath Kumar KONDREDDI Ambiguity, complexity, and diversity in natural language textual expressio...
Crowdsourcing has recently been attracting increasing attention as a promising means of collecting l...
Abstract. In this paper, we introduce the CrowdTruth open-source soft-ware framework for machine-hum...
Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to...
This paper describes a crowdsourcing system that integrates machine learning techniques with hu-man ...
Social media has led to the democratisation of opinion shar-ing. A wealth of information about publi...
Analysts synthesize complex, qualitative data to uncover themes and concepts, but the process is tim...
The amount of controversial issues being discussed on the Web has been growing dramatically. In arti...