Language users process utterances by segmenting them into cognitive units that vary in size and linguistic level. Although people perform this unitization/segmentation effortlessly, its cognitive mechanism remains unclear. This paper proposes an unsupervised model, Less-is-Better (LiB), to simulate the human cognitive process of language unitization/segmentation. LiB follows the principle of least effort: on any given corpus, it aims to build a lexicon that simultaneously minimizes the number of unit tokens (reducing the effort of analysis) and the number of unit types (reducing the effort of storage). LiB’s workflow is inspired by empirical cognitive phenomena. The design makes the mechanism of LiB cognitively plaus...
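To make the two quantities in the LiB abstract concrete, the following is a minimal sketch of how unit tokens and unit types could be counted for a given lexicon. It is not the LiB algorithm: the greedy longest-match segmenter, the function names `segment` and `effort`, and the choice to count only the types actually used in the corpus are all assumptions made for illustration.

```python
def segment(text, lexicon):
    """Greedy longest-match segmentation (an illustrative stand-in, not LiB).

    At each position, take the longest lexicon unit that matches;
    if none matches, fall back to a single character.
    """
    max_len = max((len(unit) for unit in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + size]
            if size == 1 or chunk in lexicon:
                tokens.append(chunk)
                i += size
                break
    return tokens


def effort(corpus, lexicon):
    """Return (number of unit tokens, number of distinct unit types used)."""
    tokens = [tok for utterance in corpus for tok in segment(utterance, lexicon)]
    return len(tokens), len(set(tokens))


corpus = ["thedog", "thecat"]
print(effort(corpus, set()))                  # character-level units
print(effort(corpus, {"the", "dog", "cat"}))  # word-like units
print(effort(corpus, {"thedog", "thecat"}))   # whole-utterance units
```

On this toy corpus, larger units reduce the token count, while the number of types depends on how much the units are reused. On larger corpora, whole-utterance units would tend to drive the type count up, which is the tension the abstract describes between analysis effort and storage effort.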
From a cognitive point of view, words can be recognized based on learned data which can be obtained ...
This paper extends existing word segmentation models to take non-linguistic context into account. I...
One of the challenges that infants have to solve when learning their native language is to identif...
Words typically form the basis of psycholinguistic and computational linguistic studies about senten...
Words typically form the basis of psycholinguistic and computational linguistic studies about senten...
The informativity of a computational model of language acquisition is directly related to how closel...
This dissertation uses computational modeling to address three related questions regarding the acqui...
Documenting languages helps to prevent the extinction of endangered dialects, many of which are othe...
The informativity of a computational model of language acquisition is directly related to h...
Humans, even from infancy, are capable of unsupervised (“statistical”) learning of linguistic info...
Documenting languages helps to prevent the extinction of endangered dialects – many of which are oth...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
This study investigates the joint influences of three factors on the discovery of new word-like unit...
Word segmentation is a crucial step in children's vocabulary learning. While computational models of...
This paper presents an unsupervised and incremental model of learning segmentation that combines mu...