The present study attempts to cluster Spanish-speaking countries into dialect regions by computational means. The frequencies of 592 lexical and grammatical features for 21 countries were obtained the from Corpus del Español-Web Dialects. Principal components analysis and hierarchical clustering analyses used the resulting data to group countries into dialect regions. A number of algorithms were used to rank features in terms of how much they aided in dialect classification, which allowed grouping based on a smaller set of features. Six dialect zones were identified: European (Spain), Southern Cone (Uruguay, Argentina), Southern Central America (Costa Rica, Panama), Caribbean (Puerto Rico, Dominican Republic), Northern Central America (Nica...
It is well known that canonical Spanish, the dialectal variant `central' of Spain, so called Castili...
As part of this study, we view dialect as a language variant used as a communicative tool by people ...
Following a dialectological approach, this project presents two multidimensional phonetic linguistic...
This paper maps the large-scale variation of the Spanish language by employing a corpus based on geo...
We perform a large-scale analysis of language diatopic variation using geotagged mi-croblogging data...
This article consider the linguistic aspect of language variability, references and is dedicated to ...
This paper studies the degree of cohesion among varieties of Spanish, proposing an analysis of Spani...
International audienceWe perform a large-scale analysis of language diatopic variation using geotagg...
Lexical variation, or the existence of multiple lexemes that can be used to denote a particular conc...
International audienceThis paper describes a pilot study in lexical encoding of multi-word expressio...
ABSTRACT: This paper is concerned with sketching future directions for corpus-based dialectology. We...
Most NLP applications assume that a particular language is homogeneous in the regions where it is sp...
Spanish is a global language, spoken in a big number of different countries with a big dialectal var...
In this paper we apply various clustering algorithms to the dialect pronuncia-tion data. At the same...
In this paper, I introduce methodologies to tap corpora for exploring aggregate linguistic distances...
It is well known that canonical Spanish, the dialectal variant `central' of Spain, so called Castili...
As part of this study, we view dialect as a language variant used as a communicative tool by people ...
Following a dialectological approach, this project presents two multidimensional phonetic linguistic...
This paper maps the large-scale variation of the Spanish language by employing a corpus based on geo...
We perform a large-scale analysis of language diatopic variation using geotagged mi-croblogging data...
This article consider the linguistic aspect of language variability, references and is dedicated to ...
This paper studies the degree of cohesion among varieties of Spanish, proposing an analysis of Spani...
International audienceWe perform a large-scale analysis of language diatopic variation using geotagg...
Lexical variation, or the existence of multiple lexemes that can be used to denote a particular conc...
International audienceThis paper describes a pilot study in lexical encoding of multi-word expressio...
ABSTRACT: This paper is concerned with sketching future directions for corpus-based dialectology. We...
Most NLP applications assume that a particular language is homogeneous in the regions where it is sp...
Spanish is a global language, spoken in a big number of different countries with a big dialectal var...
In this paper we apply various clustering algorithms to the dialect pronuncia-tion data. At the same...
In this paper, I introduce methodologies to tap corpora for exploring aggregate linguistic distances...
It is well known that canonical Spanish, the dialectal variant `central' of Spain, so called Castili...
As part of this study, we view dialect as a language variant used as a communicative tool by people ...
Following a dialectological approach, this project presents two multidimensional phonetic linguistic...