Chapter 3
Word alignment in a parallel corpus of Old English prose
From asymmetry to inter-syntactic annotation
This chapter proposes a model of syntactic annotation for the Parallel Corpus of Old English Prose, an aligned corpus of Old English and Present Day English texts. The research focuses on areas of syntactic divergence between the aligned texts. Syntactic divergence is described in terms of four types of alignment asymmetry (markedness, constituency, order, and configuration) and is represented by means of two components: a structural description and a dependency tree. The main conclusion is that these two components constitute a historical micro-grammar that identifies stability and change with respect to specific categories and constructions.
Article outline
- 1.Introduction
- 2.Background
- 3.The design of an aligned parallel corpus of Old English prose
- 4.Inter-syntax and asymmetry
- 5.The scope and components of the inter-syntax
- 6.Conclusion and further research
-
The following abbreviations are used in this section
-
References
References (48)
References
Attenborough, Frederick L. (ed. and trans.). 1922. The Laws of the Earliest English Kings. Cambridge: Cambridge University Press.
Biber, D. 2007. “Representativeness in corpus design”, in: W. Teubert and R. Krishnamurthy (eds.), Corpus linguistics: Critical concepts in linguistics (Vol. II). London: Routledge. 134–165.
Cameron, Angus, Ashley C. Amos, and Antonette diPaolo Healey (eds.). 2018. The Dictionary of Old English in Electronic Form A-I. Toronto: Dictionary of Old English Project, Centre for Medieval Studies, University of Toronto.
Denison, David. 1993. English Historical Syntax: Verbal Constructions. London: Longman.
Enrique-Arias, Andrés. 2013. “On the usefulness of using parallel texts in diachronic investigations: insights from a parallel corpus of Spanish medieval Bible translations.” In New Methods in Historical Corpora, ed. by Paul Durrell, Martin Scheible, Silke Whitt, and Richard J. Bennett, 105–116. Tübinguen: Gunter Narr.
Faaß, G. 2017. “Lexicography and corpus linguistics.” In The Routledge Handbook of Lexicography, ed. by Pedro A. Fuertes-Olivera, 123–137. Abingdon: Routledge.
Fernández Cuesta, Julia, Nieves Rodríguez Ledesma, and Gloria Alvárez Benito (eds. and trans.). 1997. Prosa anglosajona. Sevilla: Universidad de Sevilla.
García Fernández, Laura. 2018. “Preterite-present verb lemmas from a corpus of Old English.” In Verbs, Clauses and Constructions: Functional and Typological Approaches, ed. by Pilar Guerrero Medina, Roberto Torre Alonso, and Raquel Vea Escarza, 59–76. Newcastle: Cambridge Scholars Publishing.
Garmonsway, George N. (ed. and trans.). 1972. The Anglo-Saxon Chronicle. London: Dent & Sons LTD.
Godden, Malcom. 2016. The Old English History of the World. An Anglo-Saxon Rewriting of Orosius. Cambridge, Massachusetts: Dumbarton Oaks.
Godden, Malcom, Susan Irvine (eds.), with Mark Griffith, and Rohini Jayatilaka. 2009. The Old English Boethius. Volume II. Oxford: Oxford University Press.
Hanks, Patrick. 2012. “Corpus Evidence and Electronic Lexicography.” In Electronic Lexicography, ed. by Sylviane Granger, and Magali Paquot, 57–82. Oxford University Press.
Healey, A. diPaolo (ed.) with John P. Wilkin, and Xin Xiang. 2004. The Dictionary of Old English Web Corpus. Toronto: Dictionary of Old English Project, Centre for Medieval Studies, University of Toronto.
Heid, Ulrich. 2008. “Corpus linguistics and lexicography.” In Corpus Linguistics. An International Handbook (Volume 1), ed. by Anke Lüdeling and Merja Kytö, 132–153. Berlin: Mouton de Gruyter.
Hogg, Richard M. and Robert D. Fulk. 2011. A Grammar of Old English. Volume 2: Morphology. Oxford: Blackwell.
Johnson, B. 2009. Using the Levenshtein algorithm for automatic lemmatization in Old English. MA Thesis, The University of Georgia.
K Aijmer, B Altenberg, M Johansson. 1996. Languages in contrast: papers from a Symposium on Text-based Cross-linguistic Studies, Lund, 4–5 March 1994. Lund: Lund University Press.
Krause, Thomas, and Amir Zeldes. 2016. ANNIS3: “A new architecture for generic corpus query and visualization.” Literary and Linguistic Computing 31(1): 118–139.
Kübler, Sandra, and Heike Zinsmeister. 2014. Corpus Linguistics and Linguistically Annotated Corpora. London: Bloomsbury.
Leonard, Henry C. (ed. and trans.). 1881. A Translation of the Anglo-Saxon Version of St. Mark’s Gospel. London: James Clarke & Co.
Lu, Xiaofei. 2014. Computational Methods for Corpus Annotation and Analysis. Dordrecht: Springer.
Martín Arista, Javier. 2000a. “Sintaxis medieval inglesa I: complementación, caso y sintaxis verbal.” In Lingüística histórica inglesa, ed. by Isabel de la Cruz Cabanillas and Javier Martín Arista, 224–312. Barcelona: Ariel.
Martín Arista, Javier. 2000b. “Sintaxis medieval inglesa II: funciones, construcciones y orden de constituyentes.” In Lingüística histórica inglesa, ed. by Isabel de la Cruz Cabanillas and Javier Martín Arista, 313–377. Barcelona: Ariel.
Martín Arista, Javier. 2013. Nerthus. “Lexical Database of Old English: From word-formation to meaning construction.” Lecture delivered at the Research Seminar, School of English, University of Sheffield.
Martín Arista, Javier. 2017a. “Toward a parallel corpus of Old English prose. Preliminary questions and initial design.” Lecture delivered at the Departmental Colloquium Series at the Department of Language and Linguistic Science, University of York.
Martín Arista, Javier. 2017b. “The Nerthus Project at the crossroads. From lexical database to parallel corpus of Old English.” Lecture delivered at the 2017 International Conference of SELIM, held at the University of Málaga.
Martín Arista, Javier. 2018. “The design and implementation of a pilot parallel corpus of Old English.” In Aspects of Medieval English Language and Literature, ed. by Michiko Ogura and Hans Sauer, 111–134. Berlin: Peter Lang.
Martín Arista, Javier, and Ana E. Ojanguren López. 2018. “Doing Electronic Lexicography of Old English with a Knowledge-Base.” Workshop delivered at the CLASP Project (University of Oxford).
McEnery, Tony. 1996. Corpus Linguistics. Edinburgh: Edinburgh University Press.
McEnery, Tony. 2003. “Corpus linguistics.” In Oxford handbook of computational linguistics, ed. by Ruslan Mitkov, 448–463. Oxford: Oxford University Press.
McEnery, Tony & Richard Xiao. 2007a. Parallel and comparable corpora: What are they up to? Incorporating Corpora: Translation and the Linguist. Translating Europe. Clevedon: Multilingual Matters.
McEnery, Tony, and Richard Xiao. 2007b. “Parallel and Comparable Corpora-The State of Play.” In Corpus-Based Perspectives in Linguistics, ed. by Yuji Kawaguchi, Toshihiro Takagaki, Nobuo Tomimori, and Yoichiro Tsuruga, 131–146. Amsterdam: John Benjamins.
Metola Rodríguez, Darío. 2017. “Strong Verb Lemmas from a Corpus of Old English. Advances and issues.” Revista de Lingüística y Lenguas Aplicadas 12: 65–76.
Mitchell, Bruce. 1985. Old English Syntax (2 vols.). Oxford: Oxford University Press.
Novo Urraca, Carmen, and Ana E. Ojanguren López. 2018. “Lemmatising Treebanks. Corpus Annotation with Knowledge Bases.” RAEL 17: 99–120.
Pintzuk, Susan, and Leendert Plug (comp.). 2001. The York-Helsinki Parsed Corpus of Old English Poetry. Department of Language and Linguistic Science, University of York.
Rauer, Christiane. (ed. and trans.). 2013. The Old English Martyrology. Cambridge: D. S. Brewer.
Ringe, Don, and Ann Taylor. 2014. A Linguistic History of English Volume II: The Development of Old English. Oxford: Oxford University Press.
Rissanen, Matti, Merja Kytö, L. Kahlas-Tarkka, Matti Kilpiö, Saara Nevanlinna, Irma Taavitsainen, Tertu Nevalainen and Helena Raumolin-Brunberg (comp.). 1991. The Helsinki Corpus of English Texts. Department of Modern Languages, University of Helsinki.
Schierholz, Stefan J. 2015. “Methods in Lexicography and Dictionary Research.” Lexikos: 25(1): 323–352.
Scrivner, Olga. 2015. “Tools for Digital Humanities: Parallel Corpus and Visualization.” Paper presented at the Conference Corpora 2015, held at Saint-Petersburg, Russia.
Skeat, Walter W. (ed.) 1881. Ælfric’s Lives of Saints. Volume I. Oxford: Oxford University Press.
Sweet, Henry (ed.). 1881. King Alfred’s West-Saxon Version of Gregory’s Pastoral Care. London: Trübner & Co.
Taylor, Ann, Anthony Warner, Susan Pintzuk and Frank Beths (comp.) 2003. The York-Toronto-Helsinki Parsed Corpus of Old English Prose. Department of Language and Linguistic Science, University of York.
The Holy Bible Translated from the Latin Vulgate (Douay Rheims Version) 1971 (1899). Rpt. Rockford, Illinois: Tan books.
Thorpe, Benjamin. (ed. and trans). 1844. The Homilies of the Anglo-Saxon Church. Volume I. London: Red Lion Court.
Tío Sáenz, Marta. 2015. “The Regularization of Old English Weak Verbs”. Revista de lingüística y lenguas aplicadas 10: 78–89.
Visser, Ferdinand. 1963–1973. An Historical Syntax of the English Language (4 vols.). Leiden: Brill.
Cited by (1)
Cited by one other publication
Ojanguren López, Ana Elvira
This list is based on CrossRef data as of 4 july 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.