The Corpus class now handles missing data (#13). Support for more corpus languages. If no statistical language model is available, Corpus tries to use a basic ("blank") model. Improved documentation around dependencies and language support. Added tests
It is convenient to use the Internet to create a corpus. Because if the written texts of a certain l...
In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. Wh...
Since the inception of corpus linguistics (CL) the issue of absence has preoccupied both its practit...
Improve the handling of edge cases when initializing the Corpus and Textnet classes, such as empty d...
Adds Corpus.ngrams method as alternative to Corpus.noun_phrases. This is useful when working in lang...
Lots of changes, some of them breaking, but overall just providing nicer abstractions over the under...
Adds Catalan, Macedonian and Russian language models. Significantly speeds up backbone extraction by...
Adds params as a container for global parameters. This makes it possible to fix the random seed and ...
Python 3.9 compatibility! Updated documentation with conda-forge installation option. Bump versions ...
This release is an attempt to fix the cross-platform build and deploy pipeline (to ensure binary whe...
We examine effects that empty categories have on machine translation. Empty categories are elements ...
Item does not contain fulltextBespreking van: E. Tognini-Bonelli,Studies in Corpus Linguistics Amste...
Item does not contain fulltext[Münster] Workshop on Open Source Linguistic Resources, 13 mei 2004Mü...
With the advent of corpus linguistics, the use of corpora has become central in linguistics. One und...
Item does not contain fulltextKU Nijmegen, 2 oktober 1991Promotor : Aarts, J.M.G.A.267 p
It is convenient to use the Internet to create a corpus. Because if the written texts of a certain l...
In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. Wh...
Since the inception of corpus linguistics (CL) the issue of absence has preoccupied both its practit...
Improve the handling of edge cases when initializing the Corpus and Textnet classes, such as empty d...
Adds Corpus.ngrams method as alternative to Corpus.noun_phrases. This is useful when working in lang...
Lots of changes, some of them breaking, but overall just providing nicer abstractions over the under...
Adds Catalan, Macedonian and Russian language models. Significantly speeds up backbone extraction by...
Adds params as a container for global parameters. This makes it possible to fix the random seed and ...
Python 3.9 compatibility! Updated documentation with conda-forge installation option. Bump versions ...
This release is an attempt to fix the cross-platform build and deploy pipeline (to ensure binary whe...
We examine effects that empty categories have on machine translation. Empty categories are elements ...
Item does not contain fulltextBespreking van: E. Tognini-Bonelli,Studies in Corpus Linguistics Amste...
Item does not contain fulltext[Münster] Workshop on Open Source Linguistic Resources, 13 mei 2004Mü...
With the advent of corpus linguistics, the use of corpora has become central in linguistics. One und...
Item does not contain fulltextKU Nijmegen, 2 oktober 1991Promotor : Aarts, J.M.G.A.267 p
It is convenient to use the Internet to create a corpus. Because if the written texts of a certain l...
In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. Wh...
Since the inception of corpus linguistics (CL) the issue of absence has preoccupied both its practit...