Learning words through probabilistic inferences about speakers’ communicative intentions
How do children learn the meanings of words? This chapter presents a probabilistic, communicative view of word learning that synthesizes insights from work on statistical learning and social learning. By describing the formal characteristics of models, it is possible to differentiate communicative models that make inferences about the speaker’s intentions from associative models that treat social information as a signal of salience. In addition, the probabilistic communicative framework can be integrated with models of pragmatic reasoning, leading to insights into how Gricean principles can facilitate word learning.
References
Akhtar, N., & Montague, L
(
1999)
Early lexical acquisition: The role of cross-situational learning.
First Language
, 19, 347–358.
Baker, C., Saxe, R., & Tenenbaum, J
(
2009)
Action understanding as inverse planning.
Cognition
, 113, 329–349.
Baldwin, D
(
1993)
Early referential understanding: Infants’ ability to recognize referential acts for what they are.
Developmental Psychology
, 29, 832–843.
Baldwin, D
(
1995)
Understanding the link between joint attention and language. In
C. Moore &
P. Dunham (Eds.),
Joint attention: Its origins and role in development
(pp. 131–158). Hillsdale, NJ: Lawrence Erlbaum Associates.
Bergelson, E., & Swingley, D
(
2012)
At 6–9 months, human infants know the meanings of many common nouns.
Proceedings of the National Academy of Sciences
, 109, 3253–3258.
Bloom, P
(
2002)
How children learn the meanings of words
. Cambridge, MA: The MIT Press.
Brown, P., Pietra, V., Pietra, S., & Mercer, R
(
1993)
The mathematics of statistical machine translation: Parameter estimation.
Computational Linguistics
, 19, 263–311.
Callanan, M
(
1985)
How parents label objects for young children: The role of input in the acquisition of category hierarchies.
Child Development
, 56, 508–523.
Carpenter, M., Nagell, K., & Tomasello, M
(
1998)
Social cognition, joint attention, and communicative competence from 9 to 15 months of age.
Monographs of the Society for Research in Child Development
, 63(4/Serial no. 255), 1–174.
Clark, E.V
(
1988)
On the logic of contrast.
Journal of Child Language
, 15, 317–335.
Clark, E.V
(
1990)
On the pragmatics of contrast.
Journal of Child Language
, 17, 417–431.
Clark, E.V
(
2003)
First language acquisition
. Cambridge: CUP.
Clark, H.H
(
1996)
Using language
. Cambridge: CUP.
Csibra, G., & Gergely, G
(
2009)
Natural pedagogy.
Trends in Cognitive Sciences
, 13, 148–153.
Fazly, A., Alishahi, A., & Stevenson, S
(
2010)
A probabilistic computational model of cross-situational word learning.
Cognitive Science
, 34, 1017–1063.
Fisher, C., Hall, D., Rakowitz, S., & Gleitman, L
(
1994)
When it is better to receive than to give: Syntactic and conceptual constraints on vocabulary growth.
Lingua
, 92, 333–375.
Frank, M.C., & Gibson, E
(
2011)
Overcoming memory limitations in rule learning.
Language Learning and Development
, 7, 130–148.
Frank, M.C., Goldwater, S., Griffiths, T.L., & Tenenbaum, J.B
(
2010)
Modeling human performance in statistical word segmentation.
Cognition
, 117, 107–125.
Frank, M.C., & Goodman, N.D
(
2012)
Predicting pragmatic reasoning in language games.
Science
, 336, 998.
Frank, M.C., & Goodman, N.D
Under review).
Inferring word meanings by assuming that speakers are informative.
Frank, M.C., Goodman, N.D., Lai, P., & Tenenbaum, J.B
(
2009)
Informative communication in word production and word learning. In
N. Taatgen &
H. van Rijn (Eds.),
Proceedings of the 31st Annual Conference of the Cognitive Science Society
, (pp. 1228–1233). Amsterdam: Cognitive Science Society.
Frank, M.C., Goodman, N.D., & Tenenbaum, J.B
(
2007)
A Bayesian framework for cross-situational word learning.
Advances in Neural Information Processing Systems
, 20.
Frank, M.C., Goodman, N.D., & Tenenbaum, J.B
(
2009)
Using speakers’ referential intentions to model early cross-situational word learning.
Psychological Science
, 20, 578–585.
Frank, M.C., Tenenbaum, J.B., & Fernald, A
(
2013)
Social and discourse contributions to the determination of reference in cross-situational word learning.
Language Learning and Development
, 9, 1–24.
Gallistel, C
(
1990)
The organization of learning
. Cambridge, MA: The MIT Press.
Gallistel, C., Fairhurst, S., & Balsam, P
(
2004)
The learning curve: Implications of a quantitative analysis.
Proceedings of the National Academy of Sciences
, 101, 13124–13131.
Geisler, W
(
2003)
Ideal observer analysis. In
L.M. Chalupa &
J.S. Werner (Eds.),
The visual neurosciences
(pp. 825–837). Cambridge, MA: The MIT Press.
Gergely, G., Bekkering, H., & Király, I
(
2002)
Rational imitation in preverbal infants.
Nature
, 415, 755.
Gillette, J., Gleitman, H., Gleitman, L., & Lederer, A
(
1999)
Human simulations of vocabulary learning.
Cognition
, 73, 135–176.
Gleitman, L
(
1990)
The structural sources of verb meanings.
Language Acquisition
, 1, 3–55.
Goodman, N.D., & Stuhlmüller, A
(
2013)
Knowledge and implicature: Modeling language understanding as social cognition.
Topics in Cognitive Science
, 5, 173–184.
Gopnik, A
(
2012)
Scientific thinking in young children. Theoretical advances, empirical research and policy implications.
Science
, 337, 1623–1627.
Grice, H
(
1975)
Logic and conversation.
Syntax and Semantics
, 3, 41–58.
Griffin, Z., & Bock, K
(
2000)
What the eyes say about speaking.
Psychological Science
, 11, 274–279.
Hollich, G., Hirsh-Pasek, K., & Golinkoff, R
(
2000)
Breaking the language barrier: An emergentist coalition model for the origins of word learning.
Monographs of the Society for Research in Child Development
, 65, 1–135.
James, W
(
1890)
The principles of psychology
, Vol. 1. New York, NY: Henry Holt and Company.
Johnson, M., Demuth, K., & Frank, M
(
2012)
Exploiting social information in grounded language learning via grammatical reductions. In
Proceedings of the Association for Computational Linguistics
(pp. 883–891).
Johnson, M., Griffiths, T., & Goldwater, S
(
2007)
Adaptor grammars: A framework for specifying compositional nonparametric Bayesian models.
Advances in Neural Information Processing Systems
, 19, 641–648.
Li, P., Farkas, I., & MacWhinney, B
(
2004)
Early lexical development in a self-organizing neural network.
Neural Networks
, 17, 1345–1362.
Locke, J
(
1690/1964)
An essay concerning human understanding
. Cleveland, OH: Meridian Books.
Markman, E.M
(
1991)
Categorization and naming in children: Problems of induction
. Cambridge, MA: The MIT Press.
Marr, D
(
1982)
Vision: A computational investigation into the human representation and processing of visual information
. New York, NY: Henry Holt & Co.
McMurray, B., Horst, J.S., & Samuelson, L.K
(
2012)
Word learning emerges from the interaction of online referent selection and slow associative learning.
Psychological Review
, 119, 831–877.
Medina, T., Snedeker, J., Trueswell, J., & Gleitman, L
(
2011)
How words can and cannot be learned by observation.
Proceedings of the National Academy of Sciences
, 108, 9014–9019.
Mitchell, C., & McMurray, B
(
2009)
On leveraged learning in lexical acquisition and its relationship to acceleration.
Cognitive Science
, 33, 1503–1523.
Onishi, K., & Baillargeon, R
(
2005)
Do 15-month-old infants understand false beliefs?
Science
, 308, 255–258.
Perfors, A., Tenenbaum, J., Griffiths, T., & Xu, F
(
2011)
A tutorial introduction to bayesian models of cognitive development.
Cognition
, 120, 302–321.
Pinker, S
(
1984)
Language learnability and language development
. Cambridge, MA: Harvard University Press.
Quine, W
(
1960)
Word and object
. Cambridge, MA: The MIT Press.
Regier, T
(
2005)
The emergence of words: Attentional learning in form and meaning.
Cognitive Science
, 29, 819–865.
Sanborn, A., Griffiths, T., & Navarro, D
(
2010)
Rational approximations to rational models: alternative algorithms for category learning.
Psychological Review
, 117, 1144.
Schmidt, L., Goodman, N.D., Barner, D., & Tenenbaum, J
(
2009)
How tall is tall? Compositionality, statistics, and gradable adjectives. In
N. Taatgen &
H. van Rijn (Eds.),
Proceedings of the 31st Annual Conference of the Cognitive Science Society
, (pp. 3151–3156).
Schulz, L., Kushnir, T., & Gopnik, A
(
2007)
Learning from doing: Intervention and causal inference. In
A. Gopnik &
L. Schulz (Eds.),
Causal learning: Psychology, philosophy, and computation
(pp. 67–85). Oxford: OUP.
Shafto, P., Goodman, N.D., & Frank, M
(
2012)
Learning from others the consequences of psychological reasoning for human learning.
Perspectives on Psychological Science
, 7, 341–351.
Siskind, J
(
1996)
A computational study of cross-situational techniques for learning word-to-meaning mappings.
Cognition
, 61, 39–91.
Smith, L.B., & Yu, C
(
2008)
Infants rapidly learn word-referent mappings via cross-situational statistics.
Cognition
, 106, 1558–1568.
St. Augustine
(
397/1963)
The confessions of St. Augustine
. Oxford: Clarendon Press.
Stiller, A., Goodman, N.D., & Frank, M
(
2011)
Ad-hoc scalar implicature in adults and children. In
L. Carlson,
C. Hoelscher, &
T.F. Shipley (Eds.),
Proceedings of the 33rd Annual Meeting of the Cognitive Science Society
, (pp. 2134–2139).
Tenenbaum, J., & Griffiths, T
(
2001)
Generalization, similarity, and Bayesian inference.
Behavioral and Brain Sciences
, 24, 629–640.
Tomasello, M
(
2003)
Constructing a language: A usage-based theory of language acquisition
. Cambridge, MA: Harvard University Press.
Trueswell, J.C., Medina, T.N., Hafri, A., & Gleitman, L.R
(
2013)
Propose but verify: Fast mapping meets cross-situational word learning.
Cognitive Psychology
, 66, 126–156.
Vouloumanos, A
(
2008)
Fine-grained sensitivity to statistical information in adult word learning.
Cognition
, 107, 729–742.
Vouloumanos, A., Onishi, K., & Pogue, A
(
2012)
Twelve-month-old infants recognize that speech can communicate unobservable intentions.
Proceedings of the National Academy of Sciences
, 109, 12933–12937.
Vouloumanos, A., & Werker, J
(
2009)
Infants’ learning of novel words in a stochastic environment.
Developmental Psychology
, 45, 1611–1617.
Waxman, S., & Booth, A
(
2001)
Seeing pink elephants: Fourteen-month-olds’ interpretations of novel nouns and adjectives.
Cognitive Psychology
, 43, 217–242.
Yu, C., & Ballard, D
(
2007)
A unified model of early word learning: Integrating statistical and social cues.
Neurocomputing
, 70, 2149–2165.
Yu, C., & Smith, L
(
2012)
Modeling cross-situational word–referent learning: Prior questions.
Psychological Review
, 119, 21.
Cited by
Cited by 1 other publications
Lee, Crystal & Chigusa Kurumada
2021.
Learning Maximum Absolute Meaning Through Reasoning About Speaker Intentions.
Language Learning 71:2
► pp. 326 ff.
This list is based on CrossRef data as of 12 april 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.