This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains how the most coherent LDA-topics have been established by running several tests and automatically evaluating the coherence of the resulting LDA-topics. Results show, on one hand, that when dealing with a corpus of poetry, lemmatization is not advisable because several poetic features are lost in the process; and, on the other hand, that a standard LDA algorithm is better than a specific version of LDA for short texts (LF-LDA). The resulting LDA-topics have then been manually analyzed in order to define the relation between word topics and poems. The analysis shows that there are mainly two kinds of semantic relations: an LDA-topic could repre...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
The study of the poetic features of text, especially their rhythmic structure when forming verses, p...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
With the undertake of various folktale digitalization initiatives, the need for computational aids t...
The creation and analysis of poetry have been commonly carried out by hand; with only a few computer...
Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia ...
Topic modeling is a type of statistical modeling for discovering the abstract ``topics'' that occur ...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 auth...
We apply topic modeling to classifying the genre of Ghazal, a form common in Persian poetry. We show...
We investigate new ways of applying LDA topic models: rather than optimizing a single model for a sp...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
The study of the poetic features of text, especially their rhythmic structure when forming verses, p...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...
This paper analyzes the application of LDA topic modeling to a corpus of poetry. First, it explains ...
With the undertake of various folktale digitalization initiatives, the need for computational aids t...
The creation and analysis of poetry have been commonly carried out by hand; with only a few computer...
Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia ...
Topic modeling is a type of statistical modeling for discovering the abstract ``topics'' that occur ...
Several computational linguistics techniques are applied to analyze a large corpus of Span-ish sonne...
In this paper a TEI corpus with sonnets from the Spanish Golden-Age is reviewed. Some of the 52 auth...
We apply topic modeling to classifying the genre of Ghazal, a form common in Persian poetry. We show...
We investigate new ways of applying LDA topic models: rather than optimizing a single model for a sp...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
In this paper we describe ongoing work in the restructuring of a tagset originally organised as a ta...
Rhyme is a relevant structural element in many poetic forms. Besides its aesthetic and musical funct...
The transmission of text in poetic form is a quasi-universal aspect in the oral tradition of every c...
The study of the poetic features of text, especially their rhythmic structure when forming verses, p...
International audienceWe present a corpus covering 4094 sonnets in Spanish by 1204 authors, from the...