This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains how the most coherent LDA-topics have been established by running several tests and automatically evaluating the coherence of the resulting LDA-topics. Results show, on one hand, that when dealing with a corpus of poetry, lemmatization is not advisable because several poetic features are lost in the process; and, on the other hand, that a standard LDA algorithm is better than a specific version of LDA for short texts (LF-LDA). The resulting LDA-topics have then been manually analyzed in order to define the relation between word topics and poems. The analysis shows that there are mainly two kinds of semantic relations: an LDA-topic could repre...
International audienceWith DISCO, the DIachronic Spanish Sonnet COrpus (Ruiz et al., 2018), we colle...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 auth...
Contains fulltext : 112943.pdf (publisher's version ) (Open Access)With the undert...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
The creation and analysis of poetry have been commonly carried out by hand; with only a few computer...
Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia ...
Topic modeling is a type of statistical modeling for discovering the abstract ``topics'' that occur ...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
We apply topic modeling to classifying the genre of Ghazal, a form common in Persian poetry. We show...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
We investigate new ways of applying LDA topic models: rather than optimizing a single model for a sp...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
International audienceWith DISCO, the DIachronic Spanish Sonnet COrpus (Ruiz et al., 2018), we colle...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 auth...
Contains fulltext : 112943.pdf (publisher's version ) (Open Access)With the undert...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
The creation and analysis of poetry have been commonly carried out by hand; with only a few computer...
Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia ...
Topic modeling is a type of statistical modeling for discovering the abstract ``topics'' that occur ...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
We apply topic modeling to classifying the genre of Ghazal, a form common in Persian poetry. We show...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
We investigate new ways of applying LDA topic models: rather than optimizing a single model for a sp...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
International audienceWith DISCO, the DIachronic Spanish Sonnet COrpus (Ruiz et al., 2018), we colle...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...