Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom processing and linguistic analysis by creating the largest-to-date corpus of idioms for English. Using a fixed idiom list, automatic pre-extraction, and a strictly controlled crowdsourced annotation procedure, we show that it is feasible to build a high-quality corpus comprising more than 50K instances, an order of a magnitude larger than previous resources. Crucial ingredients of crowdsourcing were the selection of crowdworkers, clear and comprehensive instructions, and an interface that breaks down the task in small, manageable steps. Analysis of the resulting corpus revealed strong effects of genre on idiom distribution, providing new evidence ...
Online resources, such as Wiktionary, provide an accurate but incomplete source of idiomatic phrases...
AbstractIn this paper we investigate the role of idioms in automated approaches to sentiment analysi...
Idiomatic expressions can be problematic for natural language processing applications as their meani...
Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom proce...
Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom proce...
In this thesis, we are concerned with idiomatic expressions and how to handle them within NLP. Idiom...
This paper presents the details of a pilot study in which we tagged portions of the American Nationa...
“Idiomatic” expressions, usually called “idioms”, such as a dime a dozen, a busman’s holiday, or to ...
As a fascinating and colorful part of English language, idioms highly affect fluency, but they are q...
This paper reports the preliminary results of an experiment carried out on a large scale for the ext...
Expressions can be ambiguous between idiomatic and literal interpretation depending on the context t...
This paper reports the preliminary results of an experiment carried out on a large scale for the ext...
Idiomatic expressions (IE) play an important role in natural language, and have long been a “pain in...
Idioms are multi-word expressions whose meaning cannot always be deduced from the literal meaning of...
The goal of this paper is to present a procedure for the automatic retrieval of idiomatic expression...
Online resources, such as Wiktionary, provide an accurate but incomplete source of idiomatic phrases...
AbstractIn this paper we investigate the role of idioms in automated approaches to sentiment analysi...
Idiomatic expressions can be problematic for natural language processing applications as their meani...
Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom proce...
Given the limited size of existing idiom corpora, we aim to enable progress in automatic idiom proce...
In this thesis, we are concerned with idiomatic expressions and how to handle them within NLP. Idiom...
This paper presents the details of a pilot study in which we tagged portions of the American Nationa...
“Idiomatic” expressions, usually called “idioms”, such as a dime a dozen, a busman’s holiday, or to ...
As a fascinating and colorful part of English language, idioms highly affect fluency, but they are q...
This paper reports the preliminary results of an experiment carried out on a large scale for the ext...
Expressions can be ambiguous between idiomatic and literal interpretation depending on the context t...
This paper reports the preliminary results of an experiment carried out on a large scale for the ext...
Idiomatic expressions (IE) play an important role in natural language, and have long been a “pain in...
Idioms are multi-word expressions whose meaning cannot always be deduced from the literal meaning of...
The goal of this paper is to present a procedure for the automatic retrieval of idiomatic expression...
Online resources, such as Wiktionary, provide an accurate but incomplete source of idiomatic phrases...
AbstractIn this paper we investigate the role of idioms in automated approaches to sentiment analysi...
Idiomatic expressions can be problematic for natural language processing applications as their meani...