Most undeciphered lost languages exhibit two characteristics that pose significant decipherment challenges: (1) the scripts are not fully segmented into words; (2) the closest known language is not determined. We propose a decipherment model that handles both of these challenges by building on rich linguistic constraints reflecting consistent patterns in historical sound change. We capture the natural phonological geometry by learning character embeddings based on the International Phonetic Alphabet (IPA). The resulting generative framework jointly models word segmentation and cognate alignment, informed by phonological constraints. We evaluate the model on both deciphered languages (Gothic, Ugaritic) and an undeciphered one (Iberian). The ...
This paper builds upon recent work in leveraging the corpora and tools origina...
In this paper we propose a method for the automatic decipherme...
© 2019 Association for Computational Linguistics. In this paper we propose a novel neural approach fo...
Computational approaches in historical linguistics have been increasingly applied during the past de...
Can language relatedness be established without cognate words? This question has remained unresolved...
The article deals with an ancient language, Latin. A new method of phonostatistics is proposed here. It...
The presence of hidden structure in human data, including natural language but also sources like musi...
Describing the phonological history of languages has been a central topic in historical linguistics....