While analysing lexical data of Western Kho-Bwa languages of the Sino-Tibetan or Trans-Himalayan family with the
help of a computer-assisted approach for historical language comparison, we observed gaps in the data where one or more varieties
lacked forms for certain concepts. We employed a new workflow, combining manual and automated steps, to predict the most likely
phonetic realisations of the missing forms in our data, by making systematic use of the information on sound correspondences in
words that were potentially cognate with the missing forms. This procedure yielded a list of hypothetical reflexes of previously
identified cognate sets, which we first preregistered as an experiment on the prediction of unattested word forms and then
compared with actual word forms elicited during secondary fieldwork. In this study we first describe the workflow which we used to
predict hypothetical reflexes and the process of elicitation of actual word forms during fieldwork. We then present the results of
our reflex prediction experiment. Based on this experiment, we identify four general benefits of reflex prediction in historical
language comparison. These comprise (1) an increased transparency of linguistic research, (2) an increased efficiency of field and
source work, (3) an educational aspect which offers teachers and learners a wide plethora of linguistic phenomena, including the
regularity of sound change, and (4) the possibility of kindling speakers’ interest in their own linguistic heritage.
Bodt, Timotheus Adrianus. 2014b. Notes on the settlement
of the Gongri River valley of Western Arunachal Pradesh. In Anna Balikci Denjongpa & Jenny Bentley (eds.), The
dragon and the hidden land: Social and historical studies on Sikkim and Bhutan. Proceedings of the Bhutan-Sikkim Panel at the
13th Seminar of the International Association for Tibetan
Studies, 153–190. Ulaanbataar: International Association for Tibetan Studies.
Bodt, Timotheus Adrianus. 2019. The Duhumbi perspective
on Proto-Western Kho-Bwa rhymes. Die
Sprache 521 (2016 /
2017) 21: 141–176.
Bodt, Timotheus A. & Johann-Mattis List. 2019. Testing
the predictive strength of the comparative method: An ongoing experiment on unattested words in Western Kho- Bwa
languages. Papers in Historical
Phonology 4 (1): 22–44.
Bodt, Timotheus A. & Johann-Mattis List. 2020. The
multiple benefits of making predictions in linguistics. Babel: The Language
Magazine 311: 8–12.
Bodt, Timotheus A., Nathan W. Hill & Johann-Mattis List. 2018. Prediction
experiment for missing words in Kho-Bwa language data. Open science framework
preregistrationsOctober5. [URL]
Branner, David Prager. 2006. Some composite phonological
systems in Chinese. In David Prager Branner (ed.), The
Chinese rime tables: Linguistic philosophy and historical-comparative
phonology, 209–232. Amsterdam: Benjamins.
Driem, George van. 2001. Languages of the Himalayas: An
ethnolinguistic handbook of the Greater Himalayan
Region. 21. Leiden: Brill.
Eberhard, David M., Gary F. Simons & Charles D. Fennig (eds.). 2019. Ethnologue:
Languages of the world. Twenty-second edition. Dallas, Texas: SIL International. [URL]
Forkel, Robert, Johann-Mattis List, Simon J. Greenhill, Christoph Rzymski, Sebastian Bank, Michael Cysouw, Harald Hammarström, Martin Haspelmath, Gereon A. Kaiping & Russell D. Gray. 2018. Cross-linguistic
data formats, advancing data sharing and re-use in comparative linguistics. Scientific
Data 5 (180205): 1–10.
Genetti, Carol. 2016. The
Tibeto-Burman languages of South Asia: The languages, histories, and genetic
classification. In Hans Heinrich Hock & Elena Bashir (eds.), The
languages and linguistics of South Asia: A comprehensive
guide, 130–154. Berlin: Mouton de Gruyter.
Greenberg, Joseph H.1963. Some universals of grammar with
particular reference to the order of meaningful elements. In Joseph H. Greenberg, Universals of human
language, 73–113. Cambridge, Mass: MIT Press.
Grimm, Jacob. 1822. Deutsche
Grammatik. Erster
Theil. Göttingen: Dieterich.
Hammarström, Harald, Robert Forkel & Martin Haspelmath. 2020. Glottolog. Version
4.2.1. Jena, Max Planck Institute for the Science of Human History. [URL]
Lieberherr, Ismael & Timotheus Adrianus Bodt. 2017. Sub-grouping
Kho-Bwa based on shared core vocabulary. Himalayan
Linguistics 16 (2): 25–63.
List, Johann-Mattis. 2017. A
web-based interactive tool for creating, inspecting, editing, and publishing etymological
datasets. In Proceedings of the 15th Conference of the European
Chapter of the Association for Computational Linguistics. System
Demonstrations, 9–12.
List, Johann-Mattis. 2019. Automatic
inference of sound correspondence patterns across multiple languages. Computational
Linguistics 1 (45): 137–61.
Michael, Lev, Natalia Chousou-Polydouri, Keith Bartolomei, Erin Donnelly, Vivian Wauters, Sérgio Meira & Zachary O’Hagan. 2015. A
Bayesian phylogenetic classification of
Tupí-Guaraní. LIAMES 15 (2): 193–221.
Nosek, Brian, Emorie D. Beck, Lorne Campell, Jessica K. Flake, Tom E. Hardwicke, David T. Mellor, Anna E. van ‘t Veer & Simine Vazire. 2019. Preregistration
is hard, and worthwhile. Trends in Cognitive
Sciences 23(10): 815–818.
Post, Mark W. & Robbins Burling. 2017. The
Tibeto-Burman languages of Northeastern India. In Graham Thurgood & Randy J. LaPolla (eds.), The
Sino-Tibetan
languages, 213–233. Abingdon: Routledge.
Schweikhard, N. & J.-M. List. 2020. Developing
an annotation framework for word formation processes in comparative linguistics. SKASE Journal
of Theoretical
Linguistics 17.1: 2–26.
Sims-Williams, P.2018. Mechanising
historical phonology. Transactions of the Philological
Society. 116.3: 555–573.
Watkins, C.1962. Indo-European
origins of the Celtic verb. Volume I. The sigmatic
aorist. Dublin: Dublin Institute for Advanced Studies.
Wu, M.-S., N. Schweikhard, T. Bodt, N. Hill & J.-M. List. 2020. Computer-assisted
language comparison: State of the art. Journal of Open Humanities
Data 6.2: 1–14.
Cited by (9)
Cited by nine other publications
Blum, Frederic, Carlos Barrientos, Adriano Ingunza & Johann-Mattis List
2024. Cognate reflex prediction as hypothesis test for a genealogical relation between the Panoan and Takanan language families. Scientific Reports 14:1
2023. Open Problems in Computational Historical Linguistics. Open Research Europe 3 ► pp. 201 ff.
List, Johann-Mattis
2023. Evolutionary Aspects of Language Change. In Evolutionary Thinking Across Disciplines [Synthese Library, 478], ► pp. 103 ff.
List, Johann-Mattis
2024. Open Problems in Computational Historical Linguistics. Open Research Europe 3 ► pp. 201 ff.
List, Johann-Mattis, Robert Forkel, Simon J. Greenhill, Christoph Rzymski, Johannes Englisch & Russell D. Gray
2022. Lexibank, a public repository of standardized wordlists with computed phonological and lexical features. Scientific Data 9:1
List, Johann-Mattis & Robert Forkel
2021. Automated identification of borrowings in multilingual wordlists. Open Research Europe 1 ► pp. 79 ff.
List, Johann-Mattis & Robert Forkel
2021. Automated identification of borrowings in multilingual wordlists. Open Research Europe 1 ► pp. 79 ff.
List, Johann-Mattis & Robert Forkel
2022. Automated identification of borrowings in multilingual wordlists. Open Research Europe 1 ► pp. 79 ff.
This list is based on CrossRef data as of 3 january 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.