In our earlier work on three Asian Englishes and British English, we showed how lexico-syntactic co-occurrence preferences for three argument structure constructions revealed differences between varieties that correlated well with Schneider’s (2003, 2007) model of evolutionary stages. Here, we turn to lexical co-occurrence preferences and investigate if and to what degree n-grams distinguish between different modes and varieties in the same components of the International Corpus of English. Our approach to n-grams differs from previous work in that we neither use raw frequencies nor (problematic) MI-values but the newly proposed measure of lexical gravity (cf. Daudaravičius & Marcinkevičienė 2004), which takes type frequencies into consideration. We show how lexical gravity can be extended to handle n-grams with n ≥ 3 and apply this method to our n-gram data; in addition, we suggest a new concept for describing the tendency of a word to occur in significant n-grams: lexical stickiness.
2022. Multi-word units (and tokenization more generally): a multi-dimensional and largely information-theoretic approach. Lexis :19
Hilpert, Martin
2022. Review of Laporte, Samantha. 2021. Corpora, Constructions, New Englishes. A Constructional and Variationist Approach to Verb Patterning. Amsterdam: John Benjamins. ISBN: 978-9-027-20850-7. https://doi.org/10.1075/scl.100. Research in Corpus Linguistics 10:2 ► pp. 147 ff.
Kirk, John
2022. Irish English as a World English. Frontiers in Communication 7
Meng, Fanqi, Yujie Zheng, Songbin Bao & Jingdong Wang
2021. 2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT), ► pp. 886 ff.
Gries, Stefan Th. & Philip Durrant
2020. Analyzing Co-occurrence Data. In A Practical Handbook of Corpus Linguistics, ► pp. 141 ff.
2018. Multi-word Expressions: A Novel Computational Approach to Their Bottom-Up Statistical Extraction. In Lexical Collocation Analysis [Quantitative Methods in the Humanities and Social Sciences, ], ► pp. 85 ff.
2017. Frequency Consolidation Among Word N-Grams. In Computational and Corpus-Based Phraseology [Lecture Notes in Computer Science, 10596], ► pp. 432 ff.
DUNN, JONATHAN
2017. Computational learning of construction grammars. Language and Cognition 9:2 ► pp. 254 ff.
2017. Non-Canonical Syntax in South Asian Varieties of English: A Corpus-Based Pilot Study on Fronting. Zeitschrift für Anglistik und Amerikanistik 65:3 ► pp. 265 ff.
BERNAISCH, TOBIAS & CHRISTOPHER KOCH
2016. Attitudes towards Englishes in India. World Englishes 35:1 ► pp. 118 ff.
Edwards, Alison & Rutger-Jan Lange
2016. In case ofinnovation. International Journal of Learner Corpus Research 2:2 ► pp. 252 ff.
Pan, Fan, Randi Reppen & Douglas Biber
2016. Comparing patterns of L1 versus L2 English academic professionals: Lexical bundles in Telecommunications research journals. Journal of English for Academic Purposes 21 ► pp. 60 ff.
PARVIAINEN, HANNA
2016. The invariant tag isn't it in Asian Englishes. World Englishes 35:1 ► pp. 98 ff.
2014. New reflections on the evolutionary dynamics of world Englishes. World Englishes 33:1 ► pp. 9 ff.
Bernardini, Silvia & Adriano Ferraresi
2013. Old Needs, New Solutions: Comparable Corpora for Language Professionals. In Building and Using Comparable Corpora, ► pp. 303 ff.
Staples, Shelley, Jesse Egbert, Douglas Biber & Alyson McClair
2013. Formulaic sequences and EAP writing development: Lexical bundles in the TOEFL iBT writing section. Journal of English for Academic Purposes 12:3 ► pp. 214 ff.
Spina, Stefania & Elena Tanganelli
2012. Les collocations comme indice pour distinguer les genres textuels. Corpus :11
[no author supplied]
2013. Reference Guide for Varieties of English. In A Dictionary of Varieties of English, ► pp. 363 ff.
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.