Computational quantitative syntax: The case of Universal 18

Gulordava, Kristina; Merlo, Paola

doi:10.1075/rllt.16.08gul

Part of

Romance Languages and Linguistic Theory 16: Selected papers from the 47th Linguistic Symposium on Romance Languages (LSRL), Newark, Delaware
Edited by Irene Vogel
[Romance Languages and Linguistic Theory 16] 2020
pp. 109–132

Kristina Gulordava | University of Geneva

Paola Merlo | University of Geneva

Accounting for the constraints on the possible word orders of a sentence, both within a language and across the world's languages, is a core challenge for syntactic theory. In the spirit of computational quantitative syntax, this paper presents quantitative evidence about Universal 18. We show that corpus data confirm a dispreference for the word order combination in which adjectives precede but numerals follow the noun (Adj-N and N-Num). We then investigate whether this dispreference is better explained as a constraint expressed at the level of dominant orders or at the level of individual structures. Corpus counts support the latter interpretation. Finally, we propose a formal model of how this bias against Adj-N-Num orders can be integrated into the grammar.
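The distinction between a constraint on dominant orders and a constraint on individual structures can be made concrete with a small counting sketch. This is only an illustration under stated assumptions, not the procedure used in the chapter: the input format, the function names (`pairwise_counts`, `dominant`, `structure_counts`), and the toy data are all hypothetical.

```python
from collections import Counter

# Hypothetical toy data, not taken from any treebank. Each noun phrase is
# (adjective position, numeral position) relative to the noun:
# "pre" = before the noun, "post" = after it, None = not present.
toy_phrases = [
    ("pre", None), ("pre", None), ("pre", None), ("post", None),
    (None, "pre"), (None, "post"), (None, "post"), (None, "post"),
    ("pre", "pre"), ("post", "post"), ("post", "pre"),
]

def pairwise_counts(phrases):
    """Count adjective and numeral positions separately over all phrases."""
    adj, num = Counter(), Counter()
    for adj_pos, num_pos in phrases:
        if adj_pos is not None:
            adj[adj_pos] += 1
        if num_pos is not None:
            num[num_pos] += 1
    return adj, num

def dominant(counter):
    """Return the more frequent position, or None on a tie or empty counter."""
    ranked = counter.most_common(2)
    if not ranked or (len(ranked) == 2 and ranked[0][1] == ranked[1][1]):
        return None
    return ranked[0][0]

def structure_counts(phrases):
    """Count joint orders in phrases that contain both an adjective and a numeral."""
    return Counter((a, n) for a, n in phrases if a is not None and n is not None)

adj, num = pairwise_counts(toy_phrases)
print("dominant adjective order:", dominant(adj))
print("dominant numeral order:", dominant(num))
print("Adj-N-Num structures:", structure_counts(toy_phrases)[("pre", "post")])
```

In this made-up sample the dominant orders are Adj-N and N-Num, yet no single phrase combines a prenominal adjective with a postnominal numeral. The two levels of description can therefore come apart, which is why counts over individual structures are informative.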

Keywords: Universal 18, quantitative syntax, corpus counts, adjective-noun order, noun-numeral order, treebanks, Latin, Ancient Greek, modelling

Article outline

1. Introduction
2. Typological data on Universal 18
3. Accounts of language universals and Universal 18
- 3.1 Structure-level accounts
- 3.2 Grammar-level accounts
4. Our approach to Universal 18
- 4.1 Cross-linguistic corpus data
  - Corpus data for Latin and Ancient Greek
  - Preprocessing and collection of counts
  - Collected counts
- 4.2 Results and discussion
5. Towards a model explaining Universal 18
- 5.1 Comparison of models 1 and 2
6. Conclusions
Notes
References

Published online: 21 August 2020


Cited by

Merlo, Paola & Giuseppe Samo. 2022. Exploring T3 languages with quantitative computational syntax. Theoretical Linguistics 48:1-2, pp. 73 ff.
