In large reference corpora, representativeness is sought through careful sampling and sheer size. The situation is different with special language corpora, whose very nature limits their size. Their representativity is measured by reference to external selection criteria, generally following bibliographic classifications, which tend to be subjective. To overcome this subjectivity in specialised corpora, a corpus-directed system of internal selection using lexical criteria is proposed. The aim is not to create rigid boundaries but to see clearly what is actually present in the corpus. The method is demonstrated on a corpus of research articles from specialised journals and conference proceedings in the field of plant biology. Restricted collocational networks are used to isolate prototypical groupings within the corpus. It is shown that audience is an important factor in strong and weak prototypical groupings in theme- and domain-specific corpora: articles addressing domain specialists through a journal tend to be more central than those presented to a theme-specific discourse community through conference proceedings.