Productivity and semantic transparency
An exploration of word formation in Mandarin Chinese
We used word embeddings to study the relation between productivity and semantic transparency. We compiled a dataset with around 2700 two-syllable compounds that shared position-specific constituents (henceforth pivots) and some 1100 suffixed words. For each pivot and suffix, we calculated measures of productivity as well as measures of semantic transparency. For compounds, productivity (P) was negatively correlated with the number of types (V) and with the semantic similarity between non-pivot constituents and their compounds. Conversely, the greater semantic similarity of the pivot with either the compound or the non-pivot constituent predicted higher degrees of productivity. Visualization with t-SNE revealed clustering of suffixed words’ embeddings, but no by-pivot clustering for compounds, except for a minority of pivots whose regions in semantic space did not contain intruding unrelated compounds. A subset of these pivots was found to realize a fixed shift in semantic space from the base word to the corresponding compound, a property that also emerged for several suffixes. For these pivots, no correlation between P and V was present. Thus, Mandarin compounds appear to realize, at one extreme, motivated but unsystematic concept formation (where other pivots could just as well have been used), and at the other extreme, systematic suffix-like semantics.
Article outline
- 1.Introduction
- 2.Productivity and semantic transparency
- 2.1Data
- 2.2Productivity
- 2.3Semantic transparency
- 3.The semantic geometry of Mandarin compounds
- 4.The semantic geometry of suffixation
- 5.General discussion
- Notes
-
References
References (20)
References
Aronoff, M. (1976). Word formation in generative grammar. MIT Press, Cambridge, Mass.
Baayen, R. H. (2001). Word frequency distributions. Kluwer Academic Publishers, Dordrecht.
Baayen, R. H. (2009). Corpus linguistics in morphology: morphological productivity. In Kytö, M. and Lüdeling, A., editors, Corpus Linguistics. An international handbook, pages 900–919. Mouton de Gruyter, Berlin.
Baayen, R. H., Janda, L. A., Nesset, T., Endresen, A., and Makarova, A. (2013). Making choices in Russian: Pros and cons of statistical methods for rival forms. Russian Linguistics, 371:253–291.
Baayen, R. H. and Lieber, R. (1991). Productivity and English derivation: a corpus-based study. Linguistics, 291:801–843.
Baayen, R. H., Piepenbrock, R., and Gulikers, L. (1995). The CELEX lexical database (CD-ROM). Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.
Bauer, L. (1983). English word formation. Cambridge University Press, Cambridge.
Bauer, L. (2001). Morphological productivity. Cambridge University Press, Cambridge.
Booij, G. E. (1977). Dutch morphology. A study of word formation in generative grammar. Foris, Dordrecht.
Ceccagno, A. and Basciano, B. (2007). Compound headedness in Chinese: An analysis of neologisms. Morphology, 17(2):207–231.
Corbin, D. (1987). Morphologie derivationelle et structuration du lexique [Derivational morphology and lexical structure]. Niemeyer, Tubingen.
Gale, W. A. and Sampson, G. (1995). Good-Turing frequency estimation without tears. Journal of Quantitative Linguistics, 21:217–237.
Good, I. J. (1953). The population frequencies of species and the estimation of population parameters. Biometrika, 401:237–264.
Levi-Strauss, C. (1962). Savage mind. University of Chicago.
Maaten, L. V. D. and Hinton, G. (2008). Visualizing data using t-sne. Journal of machine learning research, 91(Nov):2579–2605.
Marle, J. V. (1985). On the paradigmatic dimensions of morphological creativity. Foris, Dordrecht.
Riddle, E. (1985). A historical perspective on the productivity of the suffixes -ness and -ity
. In Fisiak, J., editor, Historical Semantics, Historical Word-Formation, pages 435–461. Mouton, New York.
Schultink, H. (1961). Produktiviteit als morfologisch fenomeen [Productivity as a morphological phenomenon]. Forum der Letteren, 21:110–125.
Shen, T. and Baayen, R. H. (2021). Adjective-noun compounds in mandarin: a study on productivity. Corpus Linguistics and Linguistic Theory, 18(3):543–572.
Tarasova, E. (2013). Some new insights into the semantics of English N+ N compounds. PhD thesis, Open Access Victoria University of Wellington Te Herenga Waka.
Cited by (1)
Cited by one other publication
Ke, Sihui (Echo), Rui Jin & Keiko Koda
This list is based on CrossRef data as of 25 october 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.