This paper addresses the automatic extraction of definitions by thoroughly exploring an approach that relies solely on machine learning techniques, and by focusing on the imbalance of the relevant datasets. By extensively and systematically experimenting with different sampling techniques and their combinations, as well as a range of classifier types, we obtained a breakthrough in automatic definition extraction. Performance consistently fell in the range of 0.95–0.99 area under the receiver operating characteristic curve (AUC), a notable improvement of between 17 and 22 percentage points over the baseline of 0.73–0.77, for datasets with different rates of imbalance. Thus, the present paper ...
The field of service automation is progressing rapidly, and increasingly complex tasks are being aut...
In this paper, we explore a statistical framework for mutual bilingual terminology extraction. We pr...
This paper describes an evaluation of filtering methods for bilingual terminology extraction. Termin...
Abstract. This paper deals with the task of definition extraction with the training corpus suffering...
Paper presented at International Conference on Recent Advances in Natural Language Processing 2015 (...
The main work in bilingual lexicon extraction from comparable corpora is based...
This paper describes a novel methodology to perform bilingual terminology extraction, in which autom...
The main work in bilingual lexicon extraction from comparable corpora is based...
Although traditionally seen as a language-independent task, collocation extraction relies nowadays m...
We consider the identification, demarcation and extraction of definitions present in scholarly docum...
We present a system for collocation extraction, using both monolingual and bilingua...
An automated approach of extracting bilingual lexicon (or dictionary) from comparable, non-parallel ...
Abstract—The article discusses methods of improving the ways of applying Balanced Random Forests (BR...
This thesis investigates different statistical methods for the automatic extraction of lexical chunk...
This paper introduces some key aspects of machine translation in order to situate the role of the bi...