Erich R. Round | The University of Queensland | University of Surrey | Max Planck Institute for the Science of Human History
Phylogenetic methods have broad potential in linguistics beyond tree inference. Here, we show how a phylogenetic approach
opens the possibility of gaining historical insights from entirely new kinds of linguistic data – in this instance, statistical
phonotactics. We extract phonotactic data from 112 Pama-Nyungan vocabularies and apply tests for phylogenetic signal,
quantifying the degree to which the data reflect phylogenetic history. We test three datasets: (1) binary variables recording the presence
or absence of biphones (two-segment sequences) in a lexicon (2) frequencies of transitions between segments, and (3)
frequencies of transitions between natural sound classes. Australian languages have been characterized as having a high degree of
phonotactic homogeneity. Nevertheless, we detect phylogenetic signal in all datasets. Phylogenetic signal is greater in finer-grained
frequency data than in binary data, and greatest in natural-class-based data. These results demonstrate the viability of employing a new
source of readily extractable data in historical and comparative linguistics.
Austin, Peter. 1981. Proto-Kanyara
and Proto-Mantharta historical
phonology. Lingua 54(4). 295–333.
Baker, Brett. 2014. Word
structure in Australian languages. In Harold Koch & Rachel Nordlinger (eds.), The
languages and linguistics of Australia: A comprehensive
guide, 139–214. Berlin: De Gruyter Mouton.
Balisi, Mairin, Corinna Casey & Blaire Van Valkenburgh. 2018. Dietary
specialization is linked to reduced species durations in North American fossil canids. Royal Society Open
Science 5(4). 171861.
Birchall, Joshua. 2015. A
comparison of verbal person marking across Tupian languages. Boletim do Museu Paraense Emílio Goeldi.
Ciências
Humanas 10(2). 325–345.
Blasi, Damián E., Steven Moran, Scott R. Moisik, Paul Widmer, Dan Dediu & Balthasar Bickel. 2019. Human
sound systems are shaped by post-Neolithic changes in bite
configuration. Science 363(6432).
Blomberg, Simon P. & Theodore Garland Jr.2002. Tempo
and mode in evolution: Phylogenetic inertia, adaptation and comparative methods. Journal of Evolutionary
Biology 15(6). 899–910.
Blomberg, Simon P., Theodore Garland Jr. & Anthony R. Ives. 2003. Testing
for phylogenetic signal in comparative data: Behavioral traits are more
labile. Evolution 57(4). 717–745.
Bouckaert, Remco R., Claire Bowern & Quentin D. Atkinson. 2018. The
origin and expansion of Pama-Nyungan languages across Australia. Nature Ecology &
Evolution 2(4). 741–749.
Bowern, Claire. 2015. Pama-Nyungan
phylogenetics and beyond [plenary address]. In Lorentz center workshop on
phylogenetic methods in linguistics. Leiden University, Leiden, Netherlands.
Bowern, Claire. 2016. Chirila:
Contemporary and historical resources for the Indigenous languages of Australia. Language Documentation and
Conservation 101. [URL]
Bowern, Claire. 2017. Standard
Average Australian? In Association for Linguistic Typology
(ALT). Australian National University, Canberra, Australia.
Bowern, Claire & Quentin D. Atkinson. 2012. Computational
phylogenetics and the internal structure of
Pama-Nyungan. Language 88(4). 817–845.
Bowern, Claire, Patience Epps, Russell Gray, Jane Hill, Keith Hunley, Patrick McConvell & Jason Zentz. 2011. Does
lateral transmission obscure inheritance in hunter-gatherer languages?PLoS
ONE 6(9). e25195.
Busby, Peter A.1982. The distribution of phonemes in Australian
Aboriginal languages. Pacific Linguistics. Series
A. (60). 73–139.
Calude, Andreea S. & Annemarie Verkerk. 2016. The
typology and diachrony of higher numerals in Indo-European: A phylogenetic comparative study. Journal of
Language
Evolution 1(2). 91–108.
Campbell, Lyle. 2004. Historical
linguistics: An introduction. Cambridge, MA: MIT Press.
Capell, Arthur. 1956. A
new approach to Australian linguistics. Sydney: University of Sydney.
Chang, Will, Chundra Cathcart, David Hall & Andrew Garrett. 2015. Ancestry-constrained
phylogenetic analysis supports the Indo-European steppe
hypothesis. Language 91(1). 194–244.
Chao, Yuen-Ren. 1934. The
non-uniqueness of phonemic solutions of phonetic systems. Bulletin of the Institute of History and
Philology, Academia
Sinica 4(4). 363–397.
Coleman, John & Janet Pierrehumbert. 1997. Stochastic
phonological grammars and acceptability. In Computational phonology: ACL special
interest group in computational phonology, 49–56. Somerset, NJ: Association for Computational Linguistics. [URL] (8 March, 2018).
Crawford, Clifford J.2009. Adaptation and transmission in Japanese loanword
phonology. Cornell University thesis. [URL]
Cysouw, Michael & Jeff Good. 2007. Towards
a comprehensive languoid catalog. In Language catalogue
meeting. Leipsig, Germany. [URL] (11
December, 2019).
Delsuc, Frédéric, Henner Brinkmann & Hervé Philippe. 2005. Phylogenomics
and the reconstruction of the tree of life. Nature Reviews
Genetics 6(5). 361–375.
Dixon, R. M. W.1980. The
languages of Australia (Cambridge Language
Surveys). Cambridge: Cambridge University Press.
Dockum, Rikker. 2018. Phylogeny
in phonology: How Tai sound systems encode their past. In Annual meeting on
phonology (AMP). New York: Linguistic Society of America.
Dockum, Rikker & Claire Bowern. 2019. Swadesh
lists are not long enough: Drawing phonological generalizations from limited data. Language Documentation
and Description 161. 35–54. [URL]
Dresher, B. Elan. 2009. The contrastive hierarchy in
phonology. Cambridge: Cambridge University Press.
Dresher, B. Elan & Aditi Lahiri. 2005. Main
stress left in early Middle English. In Michael Fortescue, Jens E. Mogensen & Lene Schøsler (eds.), International
conference on historical linguistics
(ICHL), 76–85. Amsterdam: John Benjamins.
Dunn, Michael, Tonya Kim Dewey, Carlee Arnett, Thórhallur Eythórsson & Jóhanna Barðdal. 2017. Dative
sickness: A phylogenetic analysis of argument structure evolution in
Germanic. Language 93(1). e1–e22.
Dunn, Michael, Simon J. Greenhill, Stephen C. Levinson & Russell D. Gray. 2011. Evolved
structure of language shows lineage-specific trends in word-order
universals. Nature 473(7345). 79–82.
Dunn, Michael, Angela Terrill, Ger Reesink, Robert A. Foley & Stephen C. Levinson. 2005. Structural
phylogenetics and the reconstruction of ancient language
history. Science 309(5743). 2072–2075.
Durie, Mark & Malcolm Ross. 1996. The
comparative method reviewed: Regularity and irregularity in language change. New York: Oxford University Press.
Ernestus, Mirjam T. C. & R. Harald Baayen. 2003. Predicting
the unpredictable: Interpreting neutralized segments in
Dutch. Language 79(1). 5–38.
Felsenstein, Joseph. 1985. Phylogenies
and the comparative method. The American
Naturalist 125(1). 1–15.
Freckleton, Robert P., Paul H. Harvey & Mark Pagel. 2002. Phylogenetic
analysis and comparative data: A test and review of evidence. The American
Naturalist 160(6). 712–726.
Freckleton, Robert P. & Walter Jetz. 2009. Space
versus phylogeny: Disentangling phylogenetic and spatial signals in comparative data. Proceedings of the
Royal Society B: Biological
Sciences 276(1654). 21–30. .
Fritz, Susanne A. & Andy Purvis. 2010. Selectivity
in mammalian extinction risk and threat types: A new measure of phylogenetic signal strength in binary
traits. Conservation
Biology 24(4). 1042–1051.
Garland, Theodore, Jr. & Ramón Díaz-Uriarte. 1999. Polytomies
and phylogenetically independent contrasts: Examination of the bounded degrees of freedom
approach. Systematic
Biology 48(3). 547–558.
Gasser, Emily & Claire Bowern. 2014. Revisiting
phonotactic generalizations in Australian languages. In Annual meeting on
phonology (AMP). University of Massachusetts, Amherst: Linguistic Society of America.
Good, Jeff & Michael Cysouw. 2013. Languoid,
doculect, and glossonym: Formalizing the notion ‘language’. Language Documentation and
Conservation 71. 331–359. [URL]
Gordon, Matthew K.2016. Phonological
typology (Oxford Surveys in Phonology and Phonetics
1). Oxford: Oxford University Press.
Grafen, Alan. 1989. The
phylogenetic regression. Philosophical Transactions of the Royal Society of London. Series B, Biological
Sciences 326(1233). 119–157.
Greenhill, Simon J., Thomas E. Currie & Russell D. Gray. 2009. Does
horizontal transmission invalidate cultural phylogenies?Proceedings of the Royal Society of London B:
Biological
Sciences 276(1665). 2299–2306.
Greenhill, Simon J., Chieh-Hsi Wu, Xia Hua, Michael Dunn, Stephen C. Levinson & Russell D. Gray. 2017. Evolutionary
dynamics of language systems. Proceedings of the National Academy of
Sciences 114(42). E8822–E8829.
Hamilton, Philip J.1996. Phonetic constraints and markedness in the
phonotactics of Australian Aboriginal languages. Toronto: University of Toronto thesis.
Hayes, Bruce & Zsuzsa C. Londe. 2006. Stochastic
phonological knowledge: The case of Hungarian vowel
harmony. Phonology 23(1). 59–104.
Hockett, Charles F.1963. The problem of universals in
language. In Joseph Greenberg (ed.), Universals
of language, 1–29. Cambridge, MA: MIT Press.
Hutchinson, Matthew C., Marília P. Gaiarsa & Daniel B. Stouffer. 2018. Contemporary
ecological interactions improve models of past trait evolution. Systematic
Biology 67(5). 861–872.
Hyman, Larry M.1970. The role of borrowing in the justification
of phonological grammars. Studies in African
Linguistics 1(1). 1–48. [URL]
Hyman, Larry M.2008. Universals in
phonology. The Linguistic
Review 25(1). 83–137.
Kang, Yoonjung. 2011. Loanword
phonology. In Marc Van Oostendorp, Colin J. Ewen, Elizabeth Hume & Keren Rice (eds.), The
Blackwell companion to
phonology, vol. IV1, 2258–2282. Oxford: Wiley-Blackwell.
Kembel, Steven W., Peter D. Cowan, Mattew R. Helmus, William K. Cornwell, Helene Morlon, David D. Ackerly, Simon P. Blomberg & Campbell O. Webb. 2010. Picante:
R tools for integrating phylogenies and
ecology. Bioinformatics 26(11). 1463–1464.
Kiparsky, Paul. 2018. Formal
and empirical issues in phonological typology. In Larry M. Hyman & Frans Plank (eds.), Phonological
typology, 54–106. Berlin: De Gruyter Mouton.
Koch, Harold. 2014. Historical
relations among the Australian languages: Genetic classification and contact-based
diffusion. In Harold Koch & Rachel Nordlinger (eds.), The
languages and linguistics of Australia: A comprehensive
guide, 23–90. Berlin: De Gruyter Mouton.
Kolipakam, Vishnupriya, Fiona M. Jordan, Michael Dunn, Simon J. Greenhill, Remco R. Bouckaert, Russell D. Gray & Annemarie Verkerk. 2018. A
Bayesian phylogenetic study of the Dravidian language family. Royal Society Open
Science 5(3). 171504.
Lass, Roger. 1984. Vowel
system universals and typology: Prologue to theory. Phonology
Yearbook 11. 75–111.
Leff, Jonathan W., Richard D. Bardgett, Anna Wilkinson, Benjamin G. Jackson, William J. Pritchard, Jonathan R. Long, Simon Oakley, et al.2018. Predicting
the structure of soil communities from plant community taxonomy, phylogeny, and traits. The ISME
Journal 121. 1794–1805.
List, Johann-Mattis, Simon J. Greenhill & Russell D. Gray. 2017. The
potential of automatic word comparison for historical linguistics. PLoS
ONE 12(1). e0170046.
List, Johann-Mattis, Mary Walworth, Simon J. Greenhill, Tiago Tresoldi & Robert Forkel. 2018. Sequence
comparison in computational historical linguistics. Journal of Language
Evolution 3(2). 130–144.
Losos, Jonathan B.2008. Phylogenetic niche conservatism,
phylogenetic signal and the relationship between phylogenetic relatedness and ecological similarity among
species. Ecology
Letters 11(10). 995–1003.
Macklin-Cordes, Jayden L. & Erich R. Round. 2015. High-definition
phonotactics reflect linguistic pasts. In Johannes Wahle, Marisa Köllner, Harald Baayen, Gerhard Jäger & Tineke Baayen-Oudshoorn (eds.), Quantitative
investigations in theoretical linguistics
(QITL-6). Tübingen: University of Tübingen.
Marin, Julie, S. Blair Hedges & Koichiro Tamura. 2018. Undersampling
genomes has biased time and rate estimates throughout the tree of life. Molecular Biology and
Evolution 35(8). 2077–2084.
Maurits, Luke & Thomas L. Griffiths. 2014. Tracing
the roots of syntax with Bayesian phylogenetics. Proceedings of the National Academy of
Sciences 111(37). 13576–13581.
Meillet, Antoine. 1925. La
méthode comparative en linguistique historique. Paris: Honoré Champion.
Moran, Steven & Michael Cysouw. 2018. The
Unicode cookbook for linguists: Managing writing systems using orthography
profiles. Berlin: Language Science Press.
Moran, Steven, Eitan Grossman & Annemarie Verkerk. 2020. Investigating
diachronic trends in phonological inventories using BDPROTO. Language Resources and
Evaluation.
Moran, Steven & Annemarie Verkerk. 2018. Differential
rates of change in consonant and vowel systems. In C. Cuskley, M. Flaherty, H. Little, Luke McCrohon, A. Ravignani & T. Verhoef (eds.), The
evolution of language (EVOLANGXII). NCU Press. . [URL]
Münkemüller, Tamara, Sébastien Lavergne, Bruno Bzeznik, Stéphane Dray, Thibaut Jombart, Katja Schiffers & Wilfried Thuiller. 2012. How
to measure and test phylogenetic signal. Methods in Ecology and
Evolution 3(4). 743–756.
Nash, David, Patrick McConvell, Arthur Capell, Ken Hale, Peter Sutton, Deborah Bird Rose & Jim Wafer. 1988. Mudburra
wordlist. Word list. Canberra: Australian Institute of Aboriginal and Torres Strait Islander Studies, Australian Indigenous Languages Collection, ms. [URL]
Nichols, Johanna. 1997. Sprung
from two common sources: Sahul as a linguistic area. In Patrick McConvell & Nicholas Evans (eds.), Archaeology
and linguistics: Aboriginal Australia in global
perspective. Melbourne: Oxford University Press.
Nunn, Charles L.2011. The comparative approach in evolutionary
anthropology and biology. Chicago: University of Chicago Press.
O’Grady, Geoffrey N., Charles F. Voegelin & Florence M. Voegelin. 1966. Languages
of the world: Indo-Pacific fascicle six. Anthropological
Linguistics 1–197.
Orme, David, Rob Freckleton, Gavin Thomas, Thomas Petzoldt, Susanne Fritz, Nick Isaac & Will Pearse. 2013. caper:
Comparative analyses of phylogenetics and evolution in R. [URL]
Rama, Taraka, Johann-Mattis List, Johannes Wahle & Gerhard Jäger. 2018. Are
automatic methods for cognate detection good enough for phylogenetic reconstruction in historical
linguistics? In North American chapter of the Association for Computational
Linguistics (ACL): Human language technologies, volume 2 (short
papers), 393–400. New Orleans: Association for Computational Linguistics.
R Core Team. 2017. R: A language and environment
for statistical computing. Vienna: R Foundation for Statistical Computing. [URL]
Revell, Liam J., Luke J. Harmon, David C. Collar & Todd Oakley. 2008. Phylogenetic
signal, evolutionary process, and rate. Systematic
Biology 57(4). 591–601.
Rexová, Kateřina, Yvonne Bastin & Daniel Frynta. 2006. Cladistic
analysis of Bantu languages: A new tree based on combined lexical and grammatical
data. Naturwissenschaften 93(4). 189–194.
Round, Erich R.2017a. Matthew K. Gordon: Phonological typology
[book review]. Folia
Linguistica 51(3). 745–755.
Round, Erich R.2017b. The AusPhon-Lexicon project: 2 million
normalized segments across 300 Australian languages. In Poznań linguistic
meeting. Poznań, Poland. [URL]
Round, Erich R.2019a. Phonemic inventories of Australia [database
of 392 languages]. In Steven Moran & Daniel McCloy (eds.), PHOIBLE
2.0. Jena, Germany: Max Planck Institute for the Science of Human History.
Round, Erich R.2019b. Australian phonemic inventories contributed to
PHOIBLE 2.0: Essential explanatory notes. Zenodo.
Round, Erich R.2021a. Segment inventories in Australian
languages. In Claire Bowern (ed.), Oxford
handbook of Australian languages. Oxford: Oxford University Press.
Round, Erich R.2021b. Phonotactics in Australian
languages. In Claire Bowern (ed.), Oxford
handbook of Australian languages. Oxford: Oxford University Press.
Sallan, Lauren Cole & Matt Friedman. 2012. Heads
or tails: Staged diversification in vertebrate evolutionary radiations. Proceedings of the Royal Society B:
Biological
Sciences 279(1735). 2025–2032.
Schmidt, Wilhelm. 1919. Die
Gliederung australischen Sprachen: Geographische, bibliographische, linguistische Grundzüge der Erforschung der australischen
Sprachen. Vienna: Druck und Verlag der Mechitharisten-Buchdruckerei.
Silverman, Daniel. 1992. Multiple
scansions in loanword phonology: Evidence from
Cantonese. Phonology 9(2). 289–328.
Sookias, Roland B., Samuel Passmore & Quentin D. Atkinson. 2018. Deep
cultural ancestry and human development indicators across nation states. Royal Society Open
Science 5(4). 171411.
Steiner, Lydia, Michael Cysouw & Peter Stadler. 2011. A
pipeline for computational historical linguistics. Language Dynamics and
Change 1(1). 89–127.
Uyeda, Josef C., Rosana Zenil-Ferguson, Matthew W. Pennell & Nicholas Matzke. 2018. Rethinking
phylogenetic comparative methods. Systematic
Biology 67(6). 1091–1109.
Van der Hulst, Harry. 2017. Phonological
typology. In Alexandra Y. Aikhenvald & R. M. W. Dixon (eds.), The
Cambridge handbook of linguistic
typology, 39–77. Cambridge: Cambridge University Press.
Verkerk, Annemarie. 2017. Phylogenetic
comparative methods for typologists (focusing on families and regions: A plea for using phylogenetic comparative methods in linguistic
typology). In Quantitative analysis in typology: The logic of choice among
methods (workshop at the 12th conference of the Association for Linguistic Typology). Canberra, Australia: Australian National University.
Villemereuil, Pierre de & Shinichi Nakagawa. 2014. General
quantitative genetic methods for comparative biology. In László Z. Garamszegi (ed.), Modern
phylogenetic comparative methods and their application in evolutionary biology: Concepts and
practice, 287–303. Berlin: Springer.
Voegelin, Florence M., Stephen A. Wurm, Geoffrey O’Grady, Tokuichiro Matsuda & Charles F. Voegelin. 1963. Obtaining
an index of phonological differentiation from the construction of non-existent minimax
systems. International Journal of American
Linguistics 29(1). 4–28.
Walker, Robert S. & Lincoln A. Ribeiro. 2011. Bayesian
phylogeography of the Arawak expansion in lowland South America. Proceedings of the Royal Society B:
Biological
Sciences 278(1718). 2562–2567.
Webb, Campbell O., David D. Ackerly, Mark A. McPeek & Michael J. Donoghue. 2002. Phylogenies
and community ecology. Annual Review of Ecology and
Systematics 33(1). 475–505.
Weiss, Michael. 2014. The
comparative method. In Claire Bowern & Bethwyn Evans (eds.), The
Routledge handbook of historical linguistics (Routledge Handbooks in
Linguistics), 127–145. London: Routledge.
Widmer, Manuel, Sandra Auderset, Johanna Nichols, Paul Widmer & Balthasar Bickel. 2017. NP
recursion over time: Evidence from
Indo-European. Language 93(4). 799–826.
Wortley, Alexandra H., Paula J. Rudall, David J. Harris, Robert W. Scotland & Peter Linder. 2005. How
much data are needed to resolve a difficult phylogeny? Case study in lamiales. Systematic
Biology 54(5). 697–709.
Wurm, Stephen A.1963. Aboriginal languages: The present state of
knowledge. In Helen Shiels (ed.), Australian
Aboriginal studies: A symposium of papers presented at the 1961 research
conference, 127–148. Melbourne: Oxford University Press.
Wurm, Stephen A.1972. Languages of Australia and
Tasmania. The Hague: Mouton.
Zheng, Li, Anthony R. Ives, Theodore Garland Jr., Bret R. Larget, Yang Yu & Kunfang Cao. 2009. New
multivariate tests for phylogenetic signal and trait correlations applied to ecophysiological phenotypes of nine manglietia
species. Functional
Ecology 23(6). 1059–1069.
Zhou, Kevin & Claire Bowern. 2015. Quantifying
uncertainty in the phylogenetics of Australian numeral systems. Proceedings of the Royal Society
B 282(1815). 20151278.
Zuraw, Kie R.2000. Patterned exceptions in
phonology. Los Angeles: University of California dissertation.
Cited by (11)
Cited by 11 other publications
Cathcart, Chundra A.
2024. Multiple evolutionary pressures shape identical consonant avoidance in the world’s languages. Proceedings of the National Academy of Sciences 121:27
Hartmann, Frederik & George Walkden
2024. The strength of the phylogenetic signal in syntactic data. Glossa: a journal of general linguistics 9:1
Yin, Ruihua, Jeroen van de Weijer & Erich R. Round
2023. Frequent violation of the sonority sequencing principle in hundreds of languages: how often and by which sequences?. Linguistic Typology 27:2 ► pp. 381 ff.
Bowern, Claire
2022. Where have all the sound changes gone? Phonological stability and mechanisms of sound change. Linguistics Vanguard 8:s5 ► pp. 499 ff.
Hübler, Nataliia & Simon J Greenhill
2022. Modelling admixture across language levels to evaluate deep history claims. Journal of Language Evolution 7:2 ► pp. 166 ff.
Miceli, Luisa & Erich Round
2022. Where have all the sound changes gone? Examining the scarcity of evidence for regular sound change in Australian languages. Linguistics Vanguard 8:s5 ► pp. 509 ff.
Phillips, Joshua & Claire Bowern
2022. Bayesian methods for ancestral state reconstruction in morphosyntax: Exploring the history of argument marking strategies in a large language family. Journal of Language Evolution 7:1 ► pp. 1 ff.
Round, Erich, Rikker Dockum & Robin J. Ryder
2022. Evolution and Trade-Off Dynamics of Functional Load. Entropy 24:4 ► pp. 507 ff.
Macklin-Cordes, Jayden L. & Erich R. Round
2020. Re-evaluating Phoneme Frequencies. Frontiers in Psychology 11
Macklin-Cordes, Jayden L. & Erich R. Round
2022. Challenges of sampling and how phylogenetic comparative methods help: with a case study of the Pama-Nyungan laminal contrast. Linguistic Typology 26:3 ► pp. 533 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.