Chapter 12
Manual to the LMEMT corpus
Article outline
- 1.Introduction
- 2.Collecting and annotating the texts
- 3.Corpora, digital editions and XML
- 4.Description of LMEMT files
- 4.1Digital edition
- 4.1.1Structure of the XML files
- 4.1.2Some examples of XML annotation in LMEMT
- 4.2Unannotated version
- 5.Getting started
-
Notes
-
References
- Author query
References
Anthony, Laurence
2019 AntConc 3.5.8. Corpus software. Tokyo, Japan: Waseda University. Online:
[URL].
Baron, Alistair
2008 VARD 2. Corpus software.
[URL].
Benoit, Kenneth et al.
2019 quanteda: Quantitative Analysis of Textual Data. R package. Online:
[URL].
Bird, Steven, Loper, Edward, and Klein, Ewan
2009 Natural Language Processing with Python. Sebastopol: O’Reilly Media Inc.
Gries, Stefan
2016 Quantitative Corpus Linguistics with R. A Practical Introduction. 2nd edition. New York: Routledge.
Hardie, Andrew
2014 “
Modest XML for corpora: Not a standard, but a suggestion.”
ICAME Journal 38: 73–103. doi:
Hiltunen, Turo and Taavitsainen, Irma
this volume). “
Towards new knowledge: The corpus of Late Modern English Medical Texts.”
Hornik, Kurt
2016 openNLP: Apache OpenNLP Tools Interface. R package. Online:
[URL].
Imao, Yasu
2019 Casual Conc. Computer software. Online:
[URL]. Wikipedia. 2019. “Comparison of XML editors.” Online:
[URL].
Marttila, Ville
2014 Creating Digital Editions for Corpus Linguistics: The Case of Potage Dyvers, a Family of Six Middle English Recipe Collections. Diss. Helsinki: University of Helsinki.
McEnery, Tony, Xiao, Richard, and Tono, Yukio
2006 Corpus-based Language Studies. An Advanced Resource Book. London: Routledge.
Meurman-Solin, Anneli
2007 “
Annotating variational space over time.” In
Annotating Variation and Change [
Studies in Variation, Contacts and Change in English 1],
Anneli Meurman-Solin and
Arja Nurmi (eds). Helsinki: VARIENG. Online:
[URL].
Meurman-Solin, Anneli
2014 “
Taxonomisation of features of visual prosody.” In
Principles and Practices for the Digital Editing and Annotation of Diachronic Data [
Studies in Variation, Contacts and Change in English 12],
Anneli Meurman-Solin and
Jukka Tyrkkö (eds). Helsinki: VARIENG. Online:
[URL]
Rissanen, Matti and Tyrkkö, Jukka
2013 “
The Helsinki Corpus of English Texts (HC)
.” In
Principles and Practices for the Digital Editing and Annotation of Diachronic Data [
Studies in Variation, Contacts and Change in English 12],
Anneli Meurman-Solin and
Jukka Tyrkkö (eds). Helsinki: VARIENG. Online:
[URL]
Rühlemann, Christoph, Bagoutdinov, Andrej, and O’Donnell, Matthew Brook
2015 “
Modest XPath and XQuery for corpora: Exploiting deep XML annotation.”
ICAME Journal 39: 47–84. doi:
Syncro Soft
2019 Oxygen Editor. Computer software. Online:
[URL].
The TEI Consortium
2019 TEI P5: Guidelines for Electronic Text Encoding and Interchange. Originally edited by
C. M. Sperberg-McQueen and
Lou Burnard for the ACH-ALLC-ACL Text Encoding Initiative. Online:
[URL].
Tyrkkö, Jukka, Marttila, Ville and Suhr, Carla
2013 “
The Culpeper Project: Digital editing of title-pages.” In
Principles and Practices for the Digital Editing and Annotation of Diachronic Data [
Studies in Variation, Contacts and Change in English 12],
Anneli Meurman-Solin and
Jukka Tyrkkö (eds). Helsinki: VARIENG. Online:
[URL].
Walsh, John, Simpson, Grant and Moaddeli, Saeed
2018 TEI Boilerplate. Online:
[URL]; Github repository:
[URL].
Weisser, Martin
2019 Corpus-based Linguistics – Introduction. Online:
[URL].
XML Copy Editor
2014 Computer Software. Online:
[URL].
Cited by
Cited by 2 other publications
Hiltunen, Turo & Irma Taavitsainen
Saario, Lassi, Tanja Säily, Samuli Kaislaniemi & Terttu Nevalainen
2021.
The burden of legacy: Producing the Tagged Corpus of Early English Correspondence Extension (TCEECE).
Research in Corpus Linguistics 9:1
► pp. 104 ff.
This list is based on CrossRef data as of 22 march 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers.
Any errors therein should be reported to them.