The paper introduces a corpus of simultaneous interpretation, SIREN. SIREN is a parallel aligned bidirectional corpus of original
and simultaneously interpreted speech in Russian and English. At the moment the corpus contains 235,040 words and is enriched with
POS and shallow syntactic annotation.
After outlining the corpus design, I use scores for lexical variety, lexical density and POS proportions to make tentative claims
about the linguistic variation between originals and interpretations. Low lexical variety and density are taken as indicators of
simplification, while a higher ratio of nominal to pronominal reference is seen as an indicator of explicitation. An atypical
word-class distribution is taken to indicate the source language shining through. The results are somewhat contradictory: the Russian
subcorpus conforms to the predictions of translation theory, while the English subcorpus exhibits the opposite trend in all universals but
still shining through. They invite further investigation of the data and once again call into question unequivocal claims about
T-universals.
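The three kinds of score named above can be illustrated with a minimal sketch. It is not the procedure used for SIREN: the abstract does not give the exact operationalisation, so the sketch assumes that lexical variety is a plain type-token ratio, that lexical density is the share of content words among all tokens, and that tokens carry Universal POS tags. The function name `lexical_profile` and the sample sentence are hypothetical.

```python
from collections import Counter

# Assumption: content words are approximated by these Universal POS tags.
CONTENT_TAGS = {"NOUN", "PROPN", "VERB", "ADJ", "ADV"}


def lexical_profile(tagged_tokens):
    """Compute simple lexical scores for a list of (word, pos_tag) pairs."""
    words = [word.lower() for word, _ in tagged_tokens]
    tag_counts = Counter(tag for _, tag in tagged_tokens)
    total = len(words)
    if total == 0:
        raise ValueError("tagged_tokens must not be empty")

    nouns = tag_counts["NOUN"] + tag_counts["PROPN"]
    pronouns = tag_counts["PRON"]
    content = sum(tag_counts[tag] for tag in CONTENT_TAGS)

    return {
        # Lexical variety: distinct word forms divided by all tokens.
        "type_token_ratio": len(set(words)) / total,
        # Lexical density: content words divided by all tokens.
        "lexical_density": content / total,
        # Nominal vs. pronominal reference; infinite if there are no pronouns.
        "noun_pronoun_ratio": nouns / pronouns if pronouns else float("inf"),
        # Share of each word class, for comparing POS distributions.
        "pos_proportions": {tag: n / total for tag, n in tag_counts.items()},
    }


# Hypothetical, hand-tagged sample sentence.
sample = [
    ("The", "DET"), ("interpreter", "NOUN"), ("said", "VERB"),
    ("that", "SCONJ"), ("she", "PRON"), ("heard", "VERB"),
    ("the", "DET"), ("speaker", "NOUN"),
]

profile = lexical_profile(sample)
print(profile["type_token_ratio"])    # 0.875
print(profile["lexical_density"])     # 0.5
print(profile["noun_pronoun_ratio"])  # 2.0
```

A plain type-token ratio falls as texts get longer, so subcorpora of unequal size would normally be compared with a length-normalised variant, for example a ratio averaged over fixed-size chunks.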