From real-life situated discourse to video-stream data-mining
An argument for agent-oriented modeling for multimodal corpus compilation
Gu Yueguo | The Chinese Academy of Social Sciences
This paper presents an argument for agent-oriented modeling (AOM) as a research methodology and a metalanguage for corpus linguistics. It is triggered by three closely related issues arising from compiling multimodal corpora such as the Spoken Chinese Corpora of Situated Discourse (SCCSD). Given a real-life situation, there are three types of representation: (i) the Written Word representation, (ii) audio recording, and (iii) video recording. It is shown that the three types are all data-transformative and involve data loss, and that they are intrinsically flawed. The current multiple-layered approach to data integration is also shown to be inadequate. AOM is proposed to be a potential solution to the problems. Modeling decision tree, levels of modeling, and modeling schema written in XML are demonstrated. The philosophical basis of AOM, and its theoretical implications are also discussed.
2021. Toward multimodal corpus pragmatics: Rationale, case, and agenda. Digital Scholarship in the Humanities 36:1 ► pp. 101 ff.
Gu, Yueguo
2019. Morris’ Lost Pragmatics. Chinese Semiotic Studies 15:2 ► pp. 217 ff.
Pan, Mingwei
2016. Literature Review. In Nonverbal Delivery in Speaking Assessment, ► pp. 9 ff.
Pan, Mingwei
2016. Research Design and Methods. In Nonverbal Delivery in Speaking Assessment, ► pp. 109 ff.
Xu, Jiajin
2015. Corpus-based Chinese studies. Chinese Language and Discourse. An International and Interdisciplinary Journal 6:2 ► pp. 218 ff.
This list is based on CrossRef data as of 6 january 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.