Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount of raw textual data available on the World Wide Web. The presence of standard search engines has made this data accessible to computational linguists as a corpus of a size that had never existed before. Althoug
The World Wide Web, being essentially an enormous database of mostly textual documents, offers great...
Parallel corpora have become an essential resource for work in multilingual natural language process...
This paper presents an overview of current approaches to the use of the web as a linguistic corpus a...
We describe two different ways of exploiting linguistic and general knowledge from the Web for impro...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
From the beginning of the twentieth century on, the use of the World Wide Web has become a current t...
The web is a potentially useful corpus for language study because it provides examples of language t...
The paper compares systematically the utility of specially-made text corpora and the textual resourc...
Lew The Web, teeming as it is with language data, of all manner of varieties and languages, in vast ...
WAC More and more people are using Web data for linguistic and NLP research. The Web as Corpusworksh...
We investigate the potential of using the web as a huge corpus for language studies. We test the hyp...
Corpus data have emerged as the raw data/benchmark for several NLP applications. Corpus is described...
The article analyzes the revival of corpus linguistics approach to natural language (NL) investigati...
International audienceThis paper presents an overview of the linguists' use of the Web as a corpus. ...
The Web is an inexhaustible reservoir of machine-readable texts in most of the world’s written langu...
The World Wide Web, being essentially an enormous database of mostly textual documents, offers great...
Parallel corpora have become an essential resource for work in multilingual natural language process...
This paper presents an overview of current approaches to the use of the web as a linguistic corpus a...
We describe two different ways of exploiting linguistic and general knowledge from the Web for impro...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
From the beginning of the twentieth century on, the use of the World Wide Web has become a current t...
The web is a potentially useful corpus for language study because it provides examples of language t...
The paper compares systematically the utility of specially-made text corpora and the textual resourc...
Lew The Web, teeming as it is with language data, of all manner of varieties and languages, in vast ...
WAC More and more people are using Web data for linguistic and NLP research. The Web as Corpusworksh...
We investigate the potential of using the web as a huge corpus for language studies. We test the hyp...
Corpus data have emerged as the raw data/benchmark for several NLP applications. Corpus is described...
The article analyzes the revival of corpus linguistics approach to natural language (NL) investigati...
International audienceThis paper presents an overview of the linguists' use of the Web as a corpus. ...
The Web is an inexhaustible reservoir of machine-readable texts in most of the world’s written langu...
The World Wide Web, being essentially an enormous database of mostly textual documents, offers great...
Parallel corpora have become an essential resource for work in multilingual natural language process...
This paper presents an overview of current approaches to the use of the web as a linguistic corpus a...