Abstract Crowdsourcing is an emerging collaborative approach that can be used for the acquisition of annotated corpora and a wide range of other linguistic resources. Although the use of this approach is intensifying in all its key genres (paid-for crowdsourcing, games with a purpose, volunteering-based approaches), the community still lacks a set of best-practice guidelines similar to the annotation best practices for traditional, expert-based corpus acquisition. In this paper we focus on the use of crowdsourcing methods for corpus acquisition and propose a set of best practice guidelines based in our own experiences in this area and an overview of related literature. We also introduce GATE Crowd, a plugin of the GATE platform that relies ...
Spoken corpora have traditionally been assembled through careful recording and transcription of disc...
International audienceWhat would be a good method to provide a large collection of semantically anno...
Building language corpora for low resource languages such as South Africa’s isiXhosa is challenging ...
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in...
Crowdsourcing is an increasingly popu-lar, collaborative approach for acquiring annotated corpora. D...
Crowdsourcing is an increasingly popular, collaborative approach for acquiring annotated corpora. ...
Crowdsourcing is an efficient approach for knowledge acquisition and data annotation that enables bu...
Crowdsourcing provides new ways of cheaply and quickly gathering large amounts of information contri...
International audienceText corpora represent the foundation on which most natural language processin...
The availability of large scale annotated corpora for coreference is essential to the development of...
This paper provides an overview of the needs for corpus annotation and exploitation, and some sugges...
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently ineffi...
Linguistic resources can be populated with data through the use of such approaches as crowdsourcing ...
Hand crafted annotated corpora are acknowledged as critical elements for the Human Language Technolo...
© 2017 Dr. Richard James FothergillWords can take on many meanings, and collecting and identifying e...
Spoken corpora have traditionally been assembled through careful recording and transcription of disc...
International audienceWhat would be a good method to provide a large collection of semantically anno...
Building language corpora for low resource languages such as South Africa’s isiXhosa is challenging ...
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in...
Crowdsourcing is an increasingly popu-lar, collaborative approach for acquiring annotated corpora. D...
Crowdsourcing is an increasingly popular, collaborative approach for acquiring annotated corpora. ...
Crowdsourcing is an efficient approach for knowledge acquisition and data annotation that enables bu...
Crowdsourcing provides new ways of cheaply and quickly gathering large amounts of information contri...
International audienceText corpora represent the foundation on which most natural language processin...
The availability of large scale annotated corpora for coreference is essential to the development of...
This paper provides an overview of the needs for corpus annotation and exploitation, and some sugges...
Crowdsourcing has revolutionised the way tasks can be completed but the process is frequently ineffi...
Linguistic resources can be populated with data through the use of such approaches as crowdsourcing ...
Hand crafted annotated corpora are acknowledged as critical elements for the Human Language Technolo...
© 2017 Dr. Richard James FothergillWords can take on many meanings, and collecting and identifying e...
Spoken corpora have traditionally been assembled through careful recording and transcription of disc...
International audienceWhat would be a good method to provide a large collection of semantically anno...
Building language corpora for low resource languages such as South Africa’s isiXhosa is challenging ...