Notes / Notizen — Discussion / Diskussion
Sampling error in lexicostatistical measurements
A Slavic case study
Article outline
- 1.Introduction
- 2.Lexicostatistics and linguistic distances
- 3.Measurement error and sampling error
- 4.Quantifying sampling error with the McNemar test
- 5.Case study: Is Upper Sorbian closer to Czech or to Polish?
- 6.Conclusion
- Notes
-
References
References (45)
References
Bakker, Dik, André Müller, Viveka Velupillai, Søren Wichmann, Cecil H. Brown, Pamela Brown, Dmitry Egorov, Robert Mailhammer, Anthony Grant, & Eric W. Holman. 2009. Adding typology to lexicostatistics: A combined approach to language classification. Linguistic Typology 13(1). 169–181.
Black, Paul. 1976. Multidimensional scaling applied to linguistic relationships. Cahiers de l’Institut de Linguistique de Louvain 31. 43–92.
Čejka, Mirek & Arnošt Lamprecht. 1963. K otázce vzniku a diferenciace slovanských jazyků. Sborník prací Filozofické Fakulty Brněnské University, A, Řada jazykovědná 121, 5–20.
Čejka, Mirek. 1972. Lexicostatistic dating and Slavonic languages. Sborník Prací Filozofické Fakulty Brněnské University, A, Řada jazykovědná 211. 39–52.
Dolgopol’sky, Aaron B. 1986. A probabilistic hypothesis concerning the oldest relationships among the language families of Northern Eurasia. In V. Shevoroshkin & T. L. Markey (eds.), Typology, relationship and time: A collection of papers on language change and relationship by Soviet linguists, 27–50. Ann Arbor, MI: Karoma.
Dunn, Michael. 2014. Language phylogenies. In Claire Bowern & Bethwyn Evans (eds.), The Routledge handbook of historical linguistics, 190–211. London: Routledge.
Dyen, Isidore. 1962. The lexicostatistical classification of the Malayopolynesian languages. Language, 38(1). 38–46.
Dyen, Isidore, Joseph B. Kruskal & Paul Black. 1992. An Indoeuropean classification: A lexicostatistical experiment. Philadelphia: American Philosophical Society.
Embleton, Sheila M. 1986. Statistics in historical linguistics. Bochum: Brockmeyer.
Fields, Edda L. 2004. Before “Baga”: Settlement chronologies of the coastal Rio Nunez region, earliest times to C.1000 CE. The International Journal of African Historical Studies 37(2). 229–53.
Fodor, István. 1961. The validity of glottochronology on the basis of the Slavonic languages. Studia Slavica 7. 295–346. (Hungarianå version published as: A glottochronologia ervenyessege a szlav nyelvek anyaga alapjan. Nyelvtudományi Közlemények 63(2). 308–344.)
Gray, Russell D. & Quentin D. Atkinson. 2003. Language-tree divergence times support the Anatolian theory of Indo-European origin. Nature 4261. 435–439.
Grimes, Charles E., & Barbara D. Grimes. 1987. Languages of South Sulawesi. Canberra: Research School of Pacific and Asian Studies, Australia National University.
Gudschinsky, Sarah C. 1956. The ABC’S of lexicostatistics (Glottochronology). WORD, 12(2). 175–210.
Holman, Eric W., Søren Wichmann, Cecil H. Brown, Viveka Velupillai, André Müller, & Dik Bakker. 2008. Explorations in automated language classification. Folia Linguistica 42(3–4). 331–354.
Kessler, Brett. 2001. The significance of word lists. Stanford: CSLI Publications.
Kessler, Brett & Annukka Lehtonen. 2006. Multilateral comparison and significance testing of the Indo-Uralic question. In Colin Renfew, Peter Forster (eds.), Phylogenetic Methods and the Prehistory of Languages, 33–42. Cambridge: McDonald Institute.
Kroeber, Alfred L. 1958. Romance history and glottochronology. Language, 34(4). 454–457.
Lees, Robert B. 1953. The basis of glottochronology. Language, 29(2). 113–127.
Levenshtein, Vladimir I. 1965. Dvoichnye kody s ispravleniem vypadenij, vstavok i zameshhenij simvolov. Doklady Akademii Nauk SSSR 163(4), 845–848;
Levenshtein, Vladimir I. 1966. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10(8). 707–710.
Maguire, Warren & April McMahon, A. 2011. Quantifying relations between dialects. In Warren Maguire & April McMahon (eds.), Analysing variation in English. Cambridge: Cambridge University Press.
Mańczak, Witold. 2009. The original homeland of the Slavs. Studia Mythologica Slavica 121. 135–145.
Marris, Emma. 2008. Language: The language barrier. Nature 453(7194). 446–448.
Mchugh, Mary L. 2013. The Chi-square test of independence: Lessons in biostatistics. Biochemia Medica 23(2), 143–9.
McMahon, April, & Robert McMahon. 2005. Language classification by numbers. Oxford: Oxford University Press.
Mead, David & Melanie Mead. 1991. Survey of the Pamona dialects of Kecamatan Bungku Tengah. Workpapers in Indonesian Languages and Cultures 111. 121–142.
Nakhleh, Luay, Don Ringe, & Tandy Warnow. 2005. Perfect phylogenetic networks: A new methodology for reconstructing the evolutionary history of natural languages. Language 81(2). 382–420.
Nicholls, Geoff K., & Russell D. Gray. 2006. Quantifying uncertainty in a stochastic model of vocabulary evolution. In Peter Forster & Colin Renfrew (eds.), Phylogenetic methods and the prehistory of language, 161–171. Cambridge: McDonald Institute.
Novotná, Petra & Václav Blažek. 2005. Glottochronologie a její aplikace pro slovanské jazyky. Sborník prací Filozofické fakulty brněnské university, A, Řada jazykovědná 541. 51–80.
Olmsted, David. 1957. Three tests of glottochronological theory. American Anthropologist 59(5). 839–842.
Oswalt, Robert L. 1971. Towards the construction of a standard lexicostatistic list. Anthropological Linguistics 13(9). 421–434. Retrieved from [URL]
Pereltsvaig, Asya & Martin W. Lewis. 2015. The Indo-European controversy: Facts and fallacies in historical linguistics. Cambridge: Cambridge University Press.
Serva, Maurizio & Filippo Petroni. 2008. Indo-European languages tree by Levenshtein distance. EPL (Europhysics Letters) 81(6). 68005.
Sheskin, David J. 2003. Handbook of parametric and nonparametric statistical procedures (3rd ed.). Chapman & Hall Crc.
Simons, Gary. 1977. Tables of significance for lexicostatistics. In Richard Loving, Gary Simons (eds.), Language variation and survey techniques: Workpapers in Papua New Guinea languages 211. 75–106. Ukarumpa: Summer Institute of Linguistics.
Starostin, Sergei A. 1992. Methodology of Long-Range Comparison. In Vitalii Shevoroshkin (ed.), Nostratic, Dene-Caucasian, Austric and Amerind, 75–59. Bochum: Brockmeyer.
Starostin, Sergei A. 2004. Data cited from Novotná and Blažek (2005), as described in the text on page 108.
Swadesh, Morris. 1952. Lexico-statistic dating of prehistoric ethnic contacts: With special reference to North American Indians and Eskimos. Proceedings of the American Philosophical Society 96(4). 452–463.
Swadesh, Morris. 1955. Towards greater accuracy in lexicostatistics dating. International Journal of American Linguistics 21(2). 121–137. Retrieved from [URL]
Tadmor, Uri. 2009. Loanwords in the world’s languages: Findings and results. In Martin Haspelmath, Uri Tadmor (eds.), Loanwords in the world’s languages: A comparative handbook, 55–75. Berlin: Walter de Gruyter.
Tischler, Johann. 1973. Glottochronologie und Lexikostatistik. Innsbruck: Kowatch.
Yang, Zhao, Xuezheng Sun & James W. Hardin. 2010. A note on the tests for clustered matched-pair binary data. Biometrical Journal 52(5). 638–652.