In this paper we present a survey on natural language corpora, with particular focus on corpora of large scale and those applicable to sentiment analysis. Natural language corpora are crucial for training various Software Engineering applications, from part-of-speech taggers and dependency parsers to dialog systems or sentiment analysis software. We compare several natural language corpora created for different languages, analyze their distinctive features and the amount of additional annotations provided by the developers of those corpora.ここに掲載した著作物の利用に関する注意 本著作物の著作権は日本ソフトウェア科学会 に帰属します.本著作物は著作権者である日本ソフトウェア科学会の許可のもとに掲載す るものです.ご利用に当たっては「著作権法」に従うことをお願いいたします. Notice for the use of this material: The copyright of this material is retained by th...
The present paper involves using a spoken corpus for the construction of a written corpus which in t...
As corpus building is an activity that takes times and costs money, readers may wish to use ready-ma...
Corpus-based Machine Learning of linguistic annotations has been a key topic for all areas of Natura...
The aim of this work is to evaluate available Czech and English corpora for the field of artificial ...
Recently there have been several initiatives to create locally accessible large scale corpora based ...
Computational approaches to sentiment analysis focus on the identification, extraction, summarizatio...
Corpora are often referred to as the ‘tools’ of corpus linguistics. However, it is important to reco...
This is the dataset created for the paper, "EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Em...
Language-based emotion analysis finds itself in a paradoxical situation. In the past decades, a plet...
In computational linguistics, the increasing interest of the detection of emotional and personality ...
This paper presents our research on automaticannotation of a five-billion-word corpus ofJapanese blo...
Language-based emotion analysis finds itself in a paradoxical situation. In the past decades, a plet...
The thesis addresses the representation and automatic detection of emotions in natural speech. Most ...
Automatic sentiment analysis in texts has attracted considerable attention in recent years. Most of ...
In this paper we present the emotion annotation of 1.5 billion words Portuguese corpora, publicly av...
The present paper involves using a spoken corpus for the construction of a written corpus which in t...
As corpus building is an activity that takes times and costs money, readers may wish to use ready-ma...
Corpus-based Machine Learning of linguistic annotations has been a key topic for all areas of Natura...
The aim of this work is to evaluate available Czech and English corpora for the field of artificial ...
Recently there have been several initiatives to create locally accessible large scale corpora based ...
Computational approaches to sentiment analysis focus on the identification, extraction, summarizatio...
Corpora are often referred to as the ‘tools’ of corpus linguistics. However, it is important to reco...
This is the dataset created for the paper, "EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Em...
Language-based emotion analysis finds itself in a paradoxical situation. In the past decades, a plet...
In computational linguistics, the increasing interest of the detection of emotional and personality ...
This paper presents our research on automaticannotation of a five-billion-word corpus ofJapanese blo...
Language-based emotion analysis finds itself in a paradoxical situation. In the past decades, a plet...
The thesis addresses the representation and automatic detection of emotions in natural speech. Most ...
Automatic sentiment analysis in texts has attracted considerable attention in recent years. Most of ...
In this paper we present the emotion annotation of 1.5 billion words Portuguese corpora, publicly av...
The present paper involves using a spoken corpus for the construction of a written corpus which in t...
As corpus building is an activity that takes times and costs money, readers may wish to use ready-ma...
Corpus-based Machine Learning of linguistic annotations has been a key topic for all areas of Natura...