Edited by Scott Jarvis and Michael Daller
[Studies in Bilingualism 47] 2013
pp. 135–156
This study investigates the potential for computational models informed by automated lexical indices to simulate human ratings of word concreteness, word familiarity, and word imageability. The goal of the study is to provide word information estimates for words that lack human ratings, thereby affording greater textual coverage and permitting a better understanding of the features that underlie word properties. The study uses traditional automated word features such as word length, word frequency, hypernymy, and polysemy, along with novel automated word features such as word type attributes taken from WordNet, LSA dimensions, and inverse entropy weights, as predictor variables. The model reported for word concreteness predicted 61% of the variance in human ratings of word concreteness and demonstrated that more concrete words contain attributes related to people, animals, and food, have higher hypernymy levels, are related to two LSA dimensions, are more frequent, and are shorter. The model for word familiarity predicted 62% of the variance in the human ratings reported in the MRC database and demonstrated that more familiar words appear in a greater number of text samples and are more frequent. The model for word imageability explained 42% of the variance in the human ratings and demonstrated that more imageable words contain attributes related to artifacts, animals, and plants, are related to two LSA dimensions, are more frequent, and are shorter.
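The regression approach summarized in the abstract can be sketched in miniature as follows. This is not the authors' code: the words, feature values, and ratings below are invented toy data, and the three predictors (word length, log frequency, hypernymy depth) only loosely mimic the automated lexical indices the study actually draws on.

```python
# Minimal sketch: fitting a least-squares regression that predicts human
# concreteness ratings from automated lexical features. All data here are
# hypothetical toy values for illustration only.
import numpy as np

# Toy feature matrix, one row per word:
# [word length, log frequency, WordNet hypernymy depth]
words = ["dog", "banana", "chair", "justice", "theory", "idea"]
X = np.array([
    [3.0, 5.2, 9.0],   # dog: short, frequent, deep in the hypernym hierarchy
    [6.0, 3.8, 8.0],   # banana
    [5.0, 4.9, 7.0],   # chair
    [7.0, 4.1, 3.0],   # justice: abstract, shallow hypernymy
    [6.0, 4.5, 2.0],   # theory
    [4.0, 5.0, 2.0],   # idea
])
# Hypothetical human concreteness ratings on a 1-5 scale.
y = np.array([4.8, 4.7, 4.6, 1.9, 1.6, 1.8])

# Add an intercept column and solve the ordinary least-squares problem.
A = np.hstack([np.ones((X.shape[0], 1)), X])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)

# Variance explained (R^2), analogous to the 61% the study reports
# for its concreteness model.
pred = A @ coef
r2 = 1 - np.sum((y - pred) ** 2) / np.sum((y - y.mean()) ** 2)
print(f"coefficients: {coef}")
print(f"R^2 on toy data: {r2:.3f}")
```

On these invented values the hypernymy-depth coefficient comes out positive, echoing the study's finding that more concrete words sit deeper in the WordNet hypernym hierarchy; the actual models of course use far larger rating sets and many more indices.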