In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 authors represented in the collection are Cervantes, Lope, Quevedo, Tirso, Calderón or Góngora. In total, the corpus contains more than 5000 sonnets. The project is currently under development at the University of Alicante, Spain. One of the strongest aspects of this corpus is the metrical annotation of each verse. The researchers have already analysed the corpus using topic modelling, a suitable technique for the structure of the collection and the size of the texts. The weakest aspect of this collection is the metadata of the files: the majority of them are redundant and some important aspects (e.g. identifiers of texts, author, collection, sou...
A single abstract from the DHd-2018 Book of Abstracts.Sofern eine editorische Arbeit an dieser Publi...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
This paper aims at presenting the Project of Corpus Linguistics in Spanish developed by the researc...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
PREPRINT Abstract: With DISCO, the DIachronic Spanish Sonnet COrpus, we collected 4085 sonnets, from...
International audienceWith DISCO, the DIachronic Spanish Sonnet COrpus (Ruiz et al., 2018), we colle...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
International audienceEnjambment takes place when a syntactic unit is broken up across two lines of ...
En este trabajo se desarrolla un análisis de los principales tipos de endecasílabos utilizados en lo...
The Oupoco Database is a collection of 4870 French sonnets developed in the framework of the Oupoco ...
Presentamos cómo la anotación automática de rasgos formales (rima, métrica, encabalgamiento) del sub...
In this article an automatic scansion model for fixed-metre Spanish poetry is presented. It is a hyb...
Dutton lyric corpus of 15th century cancioneros, and the Severin-Maguire corpus of didactic poetry (...
Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to...
A single abstract from the DHd-2018 Book of Abstracts.Sofern eine editorische Arbeit an dieser Publi...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
This paper aims at presenting the Project of Corpus Linguistics in Spanish developed by the researc...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
PREPRINT Abstract: With DISCO, the DIachronic Spanish Sonnet COrpus, we collected 4085 sonnets, from...
International audienceWith DISCO, the DIachronic Spanish Sonnet COrpus (Ruiz et al., 2018), we colle...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
International audienceEnjambment takes place when a syntactic unit is broken up across two lines of ...
En este trabajo se desarrolla un análisis de los principales tipos de endecasílabos utilizados en lo...
The Oupoco Database is a collection of 4870 French sonnets developed in the framework of the Oupoco ...
Presentamos cómo la anotación automática de rasgos formales (rima, métrica, encabalgamiento) del sub...
In this article an automatic scansion model for fixed-metre Spanish poetry is presented. It is a hyb...
Dutton lyric corpus of 15th century cancioneros, and the Severin-Maguire corpus of didactic poetry (...
Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to...
A single abstract from the DHd-2018 Book of Abstracts.Sofern eine editorische Arbeit an dieser Publi...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
This paper aims at presenting the Project of Corpus Linguistics in Spanish developed by the researc...