The Web is a very rich source of linguistic data, and in the last few years it has been used very intensively by linguists and lan-guage technologists for many tasks (see Kilgarriff and Grefen-stette 2003 for a review of some of the relevant work). Amon
The paper explores some of the issues raised by the notion of the web-as-corpus (Kilgarriff-Greffens...
At the beginning of the first chapter the interdisciplinary setting between linguistics, corpus ling...
Wacky! Working Papers on the Web as Corpus TABLE OF CONTENTS Front Matter (Includes author...
From the beginning of the twentieth century on, the use of the World Wide Web has become a current t...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomne...
World Wide Web has become an important knowl-edge source for many research fields, and quality of We...
We investigate the potential of using the web as a huge corpus for language studies. We test the hyp...
Lew The Web, teeming as it is with language data, of all manner of varieties and languages, in vast ...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
International audienceThis paper presents an overview of the linguists' use of the Web as a corpus. ...
and thorough introduction to the promising field of ‘Web as Corpus ’ (hereafter WaC) at a time when ...
The World Wide Web, being essentially an enormous database of mostly textual documents, offers great...
We have built a corpus containing texts in 106 languages from texts available on the Internet and on...
The quality of statistical measurements on corpora is strongly related to a strict definition of the...
The paper explores some of the issues raised by the notion of the web-as-corpus (Kilgarriff-Greffens...
At the beginning of the first chapter the interdisciplinary setting between linguistics, corpus ling...
Wacky! Working Papers on the Web as Corpus TABLE OF CONTENTS Front Matter (Includes author...
From the beginning of the twentieth century on, the use of the World Wide Web has become a current t...
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomne...
World Wide Web has become an important knowl-edge source for many research fields, and quality of We...
We investigate the potential of using the web as a huge corpus for language studies. We test the hyp...
Lew The Web, teeming as it is with language data, of all manner of varieties and languages, in vast ...
Research in Natural Language Processing (NLP) has in recent years benefited from the enormous amount...
International audienceThis paper presents an overview of the linguists' use of the Web as a corpus. ...
and thorough introduction to the promising field of ‘Web as Corpus ’ (hereafter WaC) at a time when ...
The World Wide Web, being essentially an enormous database of mostly textual documents, offers great...
We have built a corpus containing texts in 106 languages from texts available on the Internet and on...
The quality of statistical measurements on corpora is strongly related to a strict definition of the...
The paper explores some of the issues raised by the notion of the web-as-corpus (Kilgarriff-Greffens...
At the beginning of the first chapter the interdisciplinary setting between linguistics, corpus ling...
Wacky! Working Papers on the Web as Corpus TABLE OF CONTENTS Front Matter (Includes author...