Part of the Lecture Notes in Computer Science book series (LNCS, volume 9924)

In this paper, we present an automatic system for the morphosyntactic annotation and lexicographical evaluation of historical Portuguese corpora. Using rule-based orthographical normalization, we were able to apply a standard parser (PALAVRAS) to historical data (Colonia corpus) and to achieve accurate annotation for both POS and syntax. By aligning original and standardized word forms, our method allows the creation of tailor-made standardization dictionaries for historical Portuguese, with optional period or author frequencies.
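The idea in the abstract above of normalizing historical spellings by rule and then aligning original and standardized forms into a frequency dictionary can be sketched roughly as follows. This is a minimal illustration only: the rewrite rules, the period labels, and the function names `normalize` and `build_dictionary` are assumptions made for the example, not the rules or code used by the authors, and the PALAVRAS parsing step is not modelled.

```python
import re
from collections import Counter, defaultdict

# Hypothetical rewrite rules, chosen only to illustrate the technique;
# they are NOT the rule set used in the paper.
RULES = [
    (re.compile(r"ph"), "f"),  # pharmacia -> farmacia
    (re.compile(r"th"), "t"),  # theatro   -> teatro
    (re.compile(r"ll"), "l"),  # elle      -> ele
    (re.compile(r"y"), "i"),   # lyrio     -> lirio
]


def normalize(word):
    """Apply every rewrite rule in order to a lower-cased word form."""
    out = word.lower()
    for pattern, replacement in RULES:
        out = pattern.sub(replacement, out)
    return out


def build_dictionary(tokens_by_period):
    """Map (original, standardized) pairs to per-period frequency counts.

    Only forms that the rules actually change are recorded.
    """
    table = defaultdict(Counter)
    for period, tokens in tokens_by_period.items():
        for token in tokens:
            original = token.lower()
            standardized = normalize(original)
            if standardized != original:
                table[(original, standardized)][period] += 1
    return table


# Hypothetical sample data: period labels and tokens are invented.
sample = {
    "17th": ["elle", "pharmacia", "casa"],
    "19th": ["theatro", "elle"],
}
dictionary = build_dictionary(sample)

for (original, standardized), counts in sorted(dictionary.items()):
    print(original, "->", standardized, dict(counts))
```

Keying the table on the (original, standardized) pair keeps the alignment explicit, and replacing the period label with an author name would give per-author frequencies in the same way.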
This article presents the theoretical and methodological road used to construct the Historical Dicti...
Based on the Penn-Helsinki Parsed Corpus of Middle English[1], the Tycho Brahe Parsed Corpus of Hist...
This paper presents a lexical comparison of pre-revolution (1954-74) and post-revolution (1974-94) parliamentar...
Abbreviated forms offer a special challenge in a historical corpus, since they...
This paper discusses the methods that historical sociolinguists can use in order to avoid anachronis...
The Historical Dictionary of Brazilian Portuguese (HDBP), the first of its kind, is based on a corpu...
The impact-es diachronic corpus of historical Spanish compiles over one hundred books —containing ap...
The aim of the present paper is to present and discuss a work in progress that involves: – the creat...
Investigating the history of a language depends on fragmentary sources, but electronic corpora offer...
This paper presents the Digital Edition of the Vocabularies of the Academy of Sciences project, whic...
This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese h...
This article describes the morphosyntactic annotation of the C-ORAL-BRASIL speech corpus, using an a...
This book contains a corpus of tagged clitic sequences which were extracted from fourteen 17th cent...
In this paper, we introduce TimeBankPT, a TimeML annotated corpus of Portuguese. It has been produce...