Studies of computational models of language acquisition depend to a large part on the input available for experiments. In this paper, we study the effect that input size has on the performance of word segmentation models embodying different kinds of linguistic assumptions. Because currently available corpora for word segmentation are not suited for addressing this question, we perform our study on a novel corpus based on the Providence Corpus (Demuth et al., 2006). We find that input size can have dramatic effects on segmentation performance and that, somewhat surprisingly, models performing well on smaller amounts of data can show a marked decrease in performance when exposed to larger amounts of data. We also present the data-set on which...
The informativity of a computational model of language acquisition is directly related to how closel...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
Lexical dependencies abound in natural language: words tend to follow particular words or word categ...
Abstract In studies of human cognition, Bayesian models are increasingly popular tools for understan...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
© The Author(s) 2011. This article is published with open access at Springerlink.com Abstract In rec...
Theoretical thesis."Thesis submitted for the degree of Doctor of Philosophy, Dr. phil. at the Depart...
International audience: There is large evidence that infants are able to exploit statistical cues to...
Since the experiments of Saffran et al. [Saffran, J., Aslin, R., & Newport, E. (1996). Statistical l...
Purely statistical models have accounted for infants ’ early ability to segment words out of f...
© The Author(s) 2011. This article is published with open access at Springerlink.com Abstract In rec...
This dissertation uses computational modeling to address three related questions regarding the acqui...
We investigate the impact of input data scale in corpus-based learning using a study style of Zipf’s...
Word segmentation is a crucial step in children’s vocabulary learning. While computational models of...
Theories of language acquisition and perceptual learning increasingly rely on statistical learning m...
The informativity of a computational model of language acquisition is directly related to how closel...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
Lexical dependencies abound in natural language: words tend to follow particular words or word categ...
Abstract In studies of human cognition, Bayesian models are increasingly popular tools for understan...
The ability to discover groupings in continuous stimuli on the basis of distributional information i...
© The Author(s) 2011. This article is published with open access at Springerlink.com Abstract In rec...
Theoretical thesis."Thesis submitted for the degree of Doctor of Philosophy, Dr. phil. at the Depart...
International audience: There is large evidence that infants are able to exploit statistical cues to...
Since the experiments of Saffran et al. [Saffran, J., Aslin, R., & Newport, E. (1996). Statistical l...
Purely statistical models have accounted for infants ’ early ability to segment words out of f...
© The Author(s) 2011. This article is published with open access at Springerlink.com Abstract In rec...
This dissertation uses computational modeling to address three related questions regarding the acqui...
We investigate the impact of input data scale in corpus-based learning using a study style of Zipf’s...
Word segmentation is a crucial step in children’s vocabulary learning. While computational models of...
Theories of language acquisition and perceptual learning increasingly rely on statistical learning m...
The informativity of a computational model of language acquisition is directly related to how closel...
Models of the acquisition of word segmen-tation are typically evaluated using phonem-ically transcri...
Lexical dependencies abound in natural language: words tend to follow particular words or word categ...