Gloria Corpas Pastor

Universidad de Málaga, Translation and Interpreting, Faculty Member

Phone: +34 952133409
Address: http://www.lexytrad.es/en/research/publications/
http://www.lexytrad.es/en/members/gloria-corpas-pastor/

Papers by Gloria Corpas Pastor

Chapter 1. Now what?

John Benjamins Publishing Company eBooks, Dec 8, 2021

Corpus-based resources for translators. An Overview

De Gruyter eBooks, 2017

La creatividad fraseológica: efectos semántico-pragmáticos y estrategias de traducción

Paremia, 2001

Argument-Structure Constructions in Organization Names in the English Eurolect

Names: A Journal of Onomastics, Jun 8, 2023

Interpreting Tomorrow? How to Build a Computer-Assisted Glossary of Phraseological Units in (Almost) No Time

Lecture Notes in Computer Science, 2022

Extracción de fraseología para intérpretes a partir de corpus comparables compilados mediante reconocimiento automático del habla

Sistemas fraseológicos en contraste: enfoques computacionales y de corpus, Oct 4, 2021

This is an accepted manuscript of a chapter published by Editorial Comares in Sistemas fraseológicos en contraste: Enfoques computacionales y de corpus, available online: https://www.comares.com/libro/sistemas-fraseologicos-en-contraste_130637/ The accepted version of the publication may differ from the final published version. For re-use please see publisher's terms and conditions.
Today, automatic speech recognition is beginning to emerge strongly in the field of interpreting. Recent studies point to this technology as one of the main documentation resources for interpreters, among other possible uses. In this paper we present a novel documentation methodology that involves semi-automatic compilation of comparable corpora (transcriptions of oral speeches) and automatic corpus compilation of written documents on the same topic with a view to preparing an interpreting assignment. As background, we provide a brief overview of the use of automatic speech recognition in the context of interpreting technologies. Next, we detail the protocol for designing and compiling the comparable corpora that we exploit for analysis. In the last part of the paper, we cover phraseology extraction and study some collocational patterns in both corpora. Mastering the phraseology specific to the subject matter of the assignment is one of the main stumbling blocks that interpreters face in their daily work. Our ultimate aim is to establish whether oral corpora could be of further benefit to the interpreter in the preliminary preparation phase.

Remote Interpreting in Public Service Settings: Technology, Perceptions and Practice

Remote interpretation technology is developing extremely fast, enabling affordable and instant access to interpreting services worldwide. This paper focuses on the subjective perceptions of public service interpreters about the psychological and physical impact of using remote interpreting, and the effects on their own performance. To this end, a survey study has been conducted by means of an online questionnaire. Both structured and unstructured questions have been used to tap into interpreters' views on technology, elicit information about perceived effects, and identify pitfalls and prospects.

Las tecnologías de interpretación a distancia en los servicios públicos: uso e impacto

Interpreting in a Changing World: New Scenarios, Technologies, Training Challenges and Vulnerable Groups / La Interpretación Un Mundo Cambiante: Nuevos Escenarios, Tecnologías, Retos Formativos Y Grupos Vulnerables, Feb 27, 2020

This chapter deals with the use of distance interpreting technologies and their impact on public service interpreters. Remote (or distance) interpreting offers a wide range of solutions to satisfy the pressing need for language services in both the public and private sectors. This study focuses on telephone-mediated and video-mediated interpreting, presenting their advantages and disadvantages. We have designed a survey to gather data about the psychological and physiological impact that remote interpreting technologies have on community interpreters. Our main aim is to ascertain interpreters' general view on technology, so as to detect deficiencies and suggest ways of improvement. This study is a first contribution in the direction of optimising distance interpreting technologies. Current demand reveals the enormous potential of distance interpreting, its rapid evolution, and the widespread presence that this modality will have in the future.
Keywords: distance interpreting, remote interpreting, interpreting technologies, psychological impact, interpreting at public services, video-mediated interpreting.
Note 1: This work was carried out within the framework of the project VIP - Sistema integrado voz-texto para intérpretes (ref. no. FFI2016-75831-P, MINECO).

Technology solutions for interpreters: the VIP system

Hermēneus. Revista de traducción e interpretación, 2022

Interpreting technologies have abruptly entered the profession in recent years. However, technology still remains a relatively marginal topic of academic debate, although interest in developing tailor-made solutions for interpreters has risen sharply. This paper presents the VIP system, one of the research outputs of the homonymous project VIP - Voice-text Integrated system for interPreters, and its continuation (VIP II). More specifically, a technology-based terminology workflow for simultaneous interpretation is presented.

INTELITERM: In search of efficient terminology lookup tools for translators

Studies on Multilingual Lexicography, 2019

There is currently a pressing need to develop specific applications for translators as final users, with the purpose of fulfilling their particular professional requirements. Corpora and advanced lookup options bring benefits to translation and open up a wealth of opportunities in research. This paper presents Inteliterm, an innovative and integrated tool which combines corpus management tools with different types of searches and customisation options in order to enhance translation results and minimise translators' efforts when searching for terminology. Section 1 provides a brief glimpse of the project rationale. Section 2 delves into the comprehension assistants developed in the 1990s as a first step towards present-day intelligent dictionaries. Section 3 provides the theoretical foundation of the novel tool Inteliterm. Besides its proactive translation support functions, this web application also provides a TBX (TermBase eXchange) termbase editor, which allows users to create, edit or upload terminological databases in the standard TBX format (ISO 30042:2008) and to query their own databases using the Inteliterm tool. Users' responses and assessment of the tool are provided in Section 4. Finally, Section 5 includes some concluding remarks with suggestions for further improvements.

Corpus-based multilingual lexicographic resources for translators: an overview

Studies on Multilingual Lexicography, 2019

©De Gruyter, 2020. Published in Studies on Multilingual Lexicography, edited by María José Domínguez Vázquez, Mónica Mirazo Balsa and Carlos Valcárcel Riveiro, available online at: https://doi.org/10.1515/9783110607659 For information on re-use, please refer to the publisher’s terms and conditions.

Un viaje de ida y vuelta: la traducción al español de documentos británicos de transmisión de derechos de aprovechamiento por turno

Aber nun der eigentliche Übersetzer, der diese beiden ganz getrennten Personen, seinen Schriftsteller und seinen Leser, wirklich einander zuführen, und dem letzten, ohne ihn jedoch aus dem Kreise seiner Muttersprache heraus zu nötigen, zu einem möglichst richtigen und vollständigen Verständnis und Genuß des ersten verhelfen will, was für Wege kann er hierzu einschlagen? Meines Erachtens gibt es deren nur zwei. Entweder der Übersetzer läßt den Schriftsteller möglichst in Ruhe, und bewegt den Leser ihm entgegen; oder er läßt den Leser möglichst in Ruhe und bewegt den Schriftsteller ihm entgegen. (Schleiermacher, 2000: 49)

The Portrait of Dorian Gray: A Corpus-Based Analysis of Translated Verb + Noun (Object) Collocations in Peninsular and Colombian Spanish

Computational and Corpus-Based Phraseology, 2019

Corpus-based Translation Studies have promoted research on the features of translated language by focusing on the process and product of translation from a descriptive perspective. Some of these features have been proposed by Toury (1995/2012) under the term "laws of translation", namely the law of growing standardisation and the law of interference. The law of standardisation appears to be particularly at play in diatopy, and more specifically in the case of transnational languages (e.g. English, Spanish, French, German). In fact, some studies have revealed the tendency to standardise the diatopic varieties of Spanish in translated language (Corpas Pastor, 2015a, 2015b, 2017, 2018). This paper focuses on verb + noun (object) collocations in Spanish translations of The Portrait of Dorian Gray by Oscar Wilde. Two different varieties have been chosen (Peninsular and Colombian Spanish). Our main aim is to establish whether the Colombian Spanish translation actually matches the variety spoken in Colombia or is closer to general or standard Spanish. For this purpose, the techniques used to translate this type of collocation in both Spanish translations will be analysed. Furthermore, the diatopic distribution of these collocations will be studied by means of large corpora.

Translationese and Register Variation in English-To-Russian Professional Translation

New Perspectives on Corpus Translation Studies, 2021

This study explores the impact of register on the properties of translations. We compare sources, translations and non-translated reference texts to describe the linguistic specificity of translations common and unique between four registers. Our approach includes bottom-up identification of translationese effects that can be used to define translations in relation to contrastive properties of each register. The analysis is based on an extended set of frequency features that reflect morphological, syntactic, and text-level characteristics of translations. We also experiment with lexis-based features from n-gram language models estimated on large bodies of originally authored texts from the included registers. Our parallel corpora are built from published English-to-Russian professional translations of general domain mass-media texts, popular scientific books, fiction, and analytical texts on political and economic news. The number of observations and the data sizes for parallel and reference components are comparable within each register and range from 166 (fiction) to 525 (media) text pairs; from 300 K to 1 M tokens. Methodologically, the research relies on a series of supervised and unsupervised machine learning techniques, including those that facilitate visual data exploration. We learn a number of text classification models and study their performance to assess our hypotheses. Further on, we analyse the usefulness of the features for these classifications to detect the best translationese indicators in each register. The multivariate analysis via text classification is complemented by univariate statistical analysis which helps to explain the observed deviation of translated registers through a number of translationese effects and detect the features that contribute to them.
Our results demonstrate that each register generates a unique form of translationese that can be only partially explained by cross-linguistic factors. Translated registers differ in the amount and type of prevalent translationese. The same translationese tendencies in different registers are manifested through different features. In particular, the notorious shining-through effect is more noticeable in general media texts and news commentary and is less prominent in fiction.

La tecnología habla-texto como herramienta de documentación para intérpretes: Nuevo método para compilar un corpus ad hoc y extraer terminología a partir de discursos orales en vídeo

TRANS. Revista de Traductología, 2020

Translation Technology

The Oxford Handbook of Computational Linguistics 2nd edition, 2015

In today’s market, the use of technology by translators is no longer a luxury but a necessity if they are to meet rising market demands for the quick delivery of high-quality texts in many languages. This chapter describes a selection of computer-aided translation tools, resources, and applications, most commonly employed by translators to help them increase productivity while maintaining high quality in their work. This chapter also considers some of the ways in which translation technology has influenced the practice and the product of translation, as well as translators’ professional competence and their preferences with regard to tools and resources.

Translation Technology

The Oxford Handbook of Computational Linguistics, Second edition (ed. R. Mitkov). Oxford: Oxford University Press, 2022

Interpreting tomorrow? How to build a computer-assisted glossary of phraseological units in (almost) no time

Computational and Corpus-Based Phraseology. Fourth International Conference, Europhras 2022, Malaga, Spain, September 28-30, 2022, Proceedings (eds. G. Corpas Pastor and R. Mitkov). Springer, 2022

Chapter 8 NLP-enhanced shift analysis of named entities in an English<>Spanish intermodal corpus of European petitions

Mediated Discourse at the European Parliament. “Translation and Multilingual Natural Language Processing” Series (eds. S. Bernardini et al.). Berlin: Language Science Press (ISBN 978-3-96110-393-5), 2022

This chapter presents an NLP-enhanced corpus-based analysis of the translation and interpreting shifts observed in the named entities (NEs) of PETI-MOD, an English<>Spanish intermodal corpus of written and oral mediated texts from the Committee on Petitions of the European Parliament. Our main assumption is that shifts in institutional genres mostly occur in the transfer of NEs, and that NLP techniques such as automatic Named Entity Recognition (NER) can be applied to systematically extract and compare examples of these shifts, leading to the (possible) verification of translational and/or interpretational constraints. Results show that traits like normalisation, transformation and simplification depend not only on the language direction or the mediation mode, but also on the semantic category (person, organisation, etc.) of the NE involved. Further studies are needed in order to correlate observed shifts with different NE taxonomies.

Now what?: A fresh look at language technologies and resources for translators and interpreters

Corpora in Translation and Contrastive Research in the Digital Age: Recent advances and explorations, 2021

This chapter provides a brief outline of language technologies applied to translation and interpreting with a view to identifying challenges and research opportunities. Section 1 covers new trends within the automation of processes, the integration of language technologies in translators’ and interpreters’ workflows and industry demands. Section 2 moves on to other relevant technology-based resources for oral and written mediation, including new types of corpora for interpreting, computer-assisted interpreting tools (CAI) or cloud interpreting, among others. Section 3 delves into the concept of ‘tech-savviness’ and adoption of technology among language service providers (LSPs). Final thoughts are presented as a conclusion in Section 4.

Implantación de un cuestionario de triaje multilingüe adaptado al SARS-COV2

La calidad ante la COVID: una oportunidad de transformación. Libro de comunicaciones del I Congreso Virtual SECA (Sociedad Española de Calidad Asistencial), 2020

[Poster] iCorpora: Compiling, Managing and Exploring Multilingual Data

by Gloria Corpas Pastor, Miriam Seghiri, and Hernani Costa

[Poster] A comparative User Evaluation of Terminology Management Tools for Interpreters

by Hernani Costa and Gloria Corpas Pastor

[Poster] MiniExperts: An SVM Approach for Measuring Semantic Textual Similarity

by Hernani Costa and Gloria Corpas Pastor

9th Int. Workshop on Semantic Evaluation (SemEval'15), Jun 4, 2015

A Comparative User Evaluation of Terminology Management for Interpreters

When facing new fields, interpreters need to perform extensive searches for specialised knowledge and terminology. They require this information prior to an interpretation and need to have it accessible during the interpreting service. Fortunately, there are currently several terminology management tools capable of assisting interpreters before and during an interpretation service. Although these tools appear to be quite similar, they provide different kinds of features and, as a result, exhibit different degrees of usefulness. This paper describes current terminology management tools with a view to establishing a set of features to assess the extent to which terminology tools meet the specific needs of interpreters. Subsequently, a comparative analysis is performed to evaluate these tools based on the list of features previously identified.

MiniExperts: An SVM approach for Measuring Semantic Textual Similarity

by Gloria Corpas Pastor and Constantin Orasan

This paper describes the system submitted by the University of Wolverhampton and the University of Malaga for SemEval-2015 Task 2: Semantic Textual Similarity. The system uses a Support Vector Machine approach based on a number of linguistically motivated features. Our system performed satisfactorily for English and obtained a mean Pearson correlation of 0.7216. However, it performed less adequately for Spanish, obtaining a mean of only 0.5158.

iCompileCorpora: A Web-based Application to Semi-automatically Compile Multilingual Comparable Corpora

by Gloria Corpas Pastor and Miriam Seghiri

This article presents an ongoing project that aims to design and develop a robust and agile web-based application capable of semi-automatically compiling monolingual and multilingual comparable corpora, which we named iCompileCorpora. The dimensions that comprise iCompileCorpora can be represented in a layered model comprising a manual, a semi-automatic and a Cross-Language Information Retrieval (CLIR) layer. This design option will not only increase the flexibility of the compilation process, but also hierarchically extend the manual layer features to the semi-automatic web-based layer and then to the semi-automatic CLIR layer. The manual layer presents the option of compiling monolingual or multilingual corpora. It will allow the manual upload of documents from a local or remote directory onto the platform. The second layer will permit the exploitation of either monolingual or multilingual corpora mined from the Internet. As there is nowadays an increasing demand for systems that can cross language boundaries by retrieving information in various languages with just one query, the third layer aims to answer this demand by taking advantage of CLIR techniques to find relevant information written in a language different from the one semi-automatically retrieved by the methodology used in the previous layer.

[Presentation] Measuring the Relatedness between Documents in Comparable Corpora

by Hernani Costa and Gloria Corpas Pastor

Sistemas fraseológicos en contraste: enfoques computacionales y de corpus

Sistemas fraseológicos en contraste: enfoques computacionales y de corpus, 2021

The interweaving of phraseology with other areas, such as Natural Language Processing and Corpus Linguistics, has undoubtedly given the discipline a decisive boost. In this regard, the contributions that make up this volume illustrate the many possibilities that corpus and computational systems offer for the treatment and analysis of phraseological units (PUs). The work is structured in three blocks. First, it analyses the translation, in various language combinations, of a heterogeneous typology of PUs (collocations, idioms, multiword terms, phraseological utterances). Second, it addresses computational and corpus-based solutions to the many challenges that PUs pose for translation work: discontinuity, deautomatisation or interlingual anisomorphism, among many others. Third, the volume delves into the contrastive aspects of phraseology, dealing, from an empirical and corpus-based perspective, with issues as varied as the phraseology of minority languages, biblical phraseologisms, the phraseological features of feminist and antifeminist discourse, middle constructions, and verbal collocations in adventure tourism and in the maritime domain. In short, the careful selection of the contributions that make up this work has resulted in a volume that offers, in a balanced way, variety and richness of phraseological approaches on the one hand, and coherence, quality and scientific rigour on the other.

Computational Phraseology

John Benjamins, 2020

Whether you wish to deliver on a promise, take a walk down memory lane or even on the wild side, phraseological units (also often referred to as phrasemes or multiword expressions) are present in most communicative situations and in all the world’s languages. Phraseology, the study of phraseological units, has therefore become a rare unifying theme across linguistic theories. In recent years, an increasing number of studies have been concerned with the computational treatment of multiword expressions: these pertain among others to their automatic identification, extraction or translation, and to the role they play in various Natural Language Processing applications. Computational Phraseology is a comparatively new field where better understanding and more advances are urgently needed. This book aims to address this pressing need, by bringing together contributions focusing on different perspectives of this promising interdisciplinary field.

Computational and Corpus-Based Phraseology. Third International Conference, Europhras 2019, Malaga, Spain, September 25–27, 2019, Proceedings

Springer, 2019

This book constitutes the refereed proceedings of the Third International Conference on Computational and Corpus-Based Phraseology, Europhras 2019, held in Malaga, Spain, in September 2019.

The 31 full papers presented in this book were carefully reviewed and selected from 116 submissions. The papers in this volume cover a number of topics including general corpus-based approaches to phraseology, phraseology in translation and cross-linguistic studies, phraseology in language teaching and learning, phraseology in specialized languages, phraseology in lexicography, cognitive approaches to phraseology, the computational treatment of multiword expressions, and the development, annotation, and exploitation of corpora for phraseological studies.

Proceedings of the Second Workshop on Human-Informed Translation and Interpreting Technology (HiT-IT 2019)

hit-it/) organised in conjunction with RANLP 2017. Both workshops proved very popular attracting ...

Computational and Corpus-based Phraseology. Proceedings of the Third International Conference, Europhras 2019 (short papers and posters)

by Gloria Corpas Pastor and Maria Kunilovskaya

Computational and Corpus-based Phraseology. Proceedings of the Third International Conference, Europhras 2019 (short papers and posters), 2019

This document is downloadable from www.tradulex.com and

Multiword units in machine translation and translation technology

Multiword units in machine translation and translation technology. John Benjamins, 2018

Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives (Full papers) - Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües (Trabajos completos)

Editions Tradulex, Geneva, 2016

Proceedings of the First Workshop on Human-Informed Translation and Interpreting Technology

Proceedings of the First Workshop on Human-Informed Translation and Interpreting Technology, 2017

This paper describes an approach to translating course unit descriptions from Italian and German into English, using a phrase-based machine translation (MT) system. The genre is very prominent among those requiring translation by universities in European countries in which English is a non-native language. For each language combination, an in-domain bilingual corpus including course unit and degree program descriptions is used to train an MT engine, whose output is then compared to a baseline engine trained on the Europarl corpus. In a subsequent experiment, a bilingual terminology database is added to the training sets in both engines and its impact on the output quality is evaluated based on BLEU and post-editing score. Results suggest that the use of domain-specific corpora boosts the engines' quality for both language combinations, especially for German-English, whereas adding terminological resources does not seem to bring notable benefits.

Multiword Units in Machine Translation and Translation Technology

John Benjamins, 2018

The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully.

This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French, Romanian, German, Dutch and Croatian, among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.

Trends in E-Tools and Resources for Translators and Interpreters

by Gloria Corpas Pastor and Isabel Durán-Muñoz

Brill / Rodopi, 2018

Trends in E-Tools and Resources for Translators and Interpreters offers a collection of contributions from key players in the field of translation and interpreting that accurately outline some of the most cutting-edge technologies in this field that are available or under development at the moment in both professional and academic contexts. Particularly, this volume provides a wide picture of the state of the art, looking not only at the world of technology for translators but also at the hitherto overlooked world of technology for interpreters. This volume is accessible and comprehensive enough to be of benefit to different categories of readers: scholars, professionals and trainees. Contributors are: Pierrette Bouillon, Gloria Corpas Pastor, Hernani Costa, Isabel Durán-Muñoz, Claudio Fantinuoli, Johanna Gerlach, Joanna Gough, Asheesh Gulati, Veronique Hoste, Amélie Josselin, David Lewis, Lieve Macken, John Moran, Aurelie Picton, Emmanuel Planas, Éric Poirier, Victoria Porro, Celia Rico Pérez, Christian Saam, Pilar Sánchez-Gijón, Míriam Seghiri Domínguez, Violeta Seretan, Arda Tezcan, Olga Torres, and Anna Zaretskaya. For more information: http://www.brill.com/products/book/trends-e-tools-and-resources-translators-and-interpreters

New Horizons in Translation and Interpreting Studies (Full Papers)

by Miriam Seghiri and Gloria Corpas Pastor

New Horizons in Translation and Interpreting Studies (Abstracts)/Nuevos horizontes en los Estudios de Traducción e Interpretación (Resúmenes)/Novos horizontes dos Estudos da Tradução e Interpretação (Resumos)

by Miriam Seghiri and Gloria Corpas Pastor

Corpus-based Approaches to Translation and Interpreting: from theory to applications

by Miriam Seghiri and Gloria Corpas Pastor

Corpus-based translation studies have come a long way since they were introduced in the last decade of the 20th century. This volume offers a balanced collection of theoretical and application-orientated contributions which establish novel trends in the area of corpus-based translation and interpreting studies. Most of the theoretical contributions report on studies related to translation universals such as simplification, explicitation, normalisation, convergence or transfer. The application-orientated contributions cover areas as diverse as corpus-based applied research, training, practice and the use of computer-assisted translation tools.

Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Malaga, Spain, 1-2 July, 2015

by Gloria Corpas Pastor and Johanna Monti

This volume documents the proceedings of the 2nd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2015), held on 1-2 July 2015 as part of the EUROPHRAS 2015 conference: "Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives" (Málaga, 29 June – 1 July 2015). The workshop was sponsored by European COST Action PARSing and Multi-word Expressions (PARSEME) under the auspices of the European Society of Phraseology (EUROPHRAS), the Special Interest Group on the Lexicon of the Association for Computational Linguistics (SIGLEX), and SIGLEX's Multiword Expressions Section (SIGLEX-MWE). The workshop was co-chaired by Gloria Corpas Pastor (Universidad de Málaga), Ruslan Mitkov (University of Wolverhampton), Johanna Monti (Università degli Studi di Sassari), and Violeta Seretan (Université de Genève). It received the support of the Advisory Board, composed of Dmitrij O. Dobrovol'skij (Russian Academy of Sciences, Moscow), Kathrin Steyer (Institut für Deutsche Sprache, Mannheim), Agata Savary (Université François Rabelais Tours), Michael Rosner (University of Malta), and Carlos Ramisch (Aix-Marseille Université).

The topic of the workshop was the integration of multi-word units in machine translation and translation technology tools. In spite of the recent progress achieved in machine translation and translation technology, the identification, interpretation and translation of multi-word units still represent open challenges, both from a theoretical and from a practical point of view. The idiosyncratic morpho-syntactic, semantic and translational properties of multi-word units pose many obstacles even to human translators, mainly because of intrinsic ambiguities, structural and lexical asymmetries between languages, and, finally, cultural differences. After a successful first edition held in Nice on 3 September 2013 as part of the Machine Translation Summit XIV, the present edition provided a forum for researchers working in the fields of Linguistics, Computational Linguistics, Translation Studies and Computational Phraseology to discuss recent advances in the area of multi-word unit processing and to coordinate research efforts across disciplines.

Proceedings of the EXPERT Scientific and Technological Workshop

by Hernani Costa, Gloria Corpas Pastor, and Miriam Seghiri

EXPERT (EXPloiting Empirical appRoaches to Translation): http://expert-itn.eu

You are driving me up the wall! A corpus-based study of a special class of resultative constructions

Lexis, 2022

This paper focuses on resultative constructions from a computational and corpus-based approach. We claim that the array of expressions (traditionally classed as idioms, collocations, free word combinations, etc.) that are used to convey a person's change of mental state (typically negative) are basically instances of the same resultative construction. The first part of the study will introduce basic tenets of Construction Grammar and resultatives. Then, our corpus-based methodology will be spelled out, including a description of the two giga-token corpora used and a detailed account of our protocolised heuristic strategies and tasks. Distributional analysis of matrix slot fillers will be presented next, together with a discussion on restrictions, novel instances, and productivity. A final section will round up our study, with special attention to notions like "idiomaticity", "productivity" and "variability" of the pairings of form and meaning analysed. To the best of our knowledge, this is one of the first studies based on giga-token corpora that explores idioms as integral parts of higher-order resultative constructions.

Decálogo de características de la literatura poscolonial: propuesta de una taxonomía para la crítica literaria y los estudios de literatura comparada

by Gloria Corpas Pastor and María Remedios Fernández Ruiz, PhD

Revista de Literatura, 2021

El objetivo de este artículo es ofrecer una propuesta de clasificación de los rasgos presentes, en mayor o menor medida, en la literatura poscolonial en cualquier idioma. A pesar de que esta taxonomía toma como punto de partida definiciones teóricas previas de los conceptos clave relacionados con la literatura poscolonial (Edwards 2008, Nayar 2008 y Ramone 2011), parece ser la primera clasificación formal que se ha elaborado al respecto. De este modo, se analizan conceptos consolidados a la par que presenta la nueva noción de plasticidad de géneros literarios y explora las corrientes actuales en la investigación de la interseccionalidad. Como resultado, proporcionaremos un decálogo de características de la literatura poscolonial que favorecerá la crítica literaria y los estudios de literatura comparada.

Palabras clave: literatura poscolonial; taxonomía; teoría de la literatura; crítica poscolonial; plasticidad.

This paper aims to offer a classification proposal of the features that are typically found, to a greater or lesser extent, in postcolonial texts in any language. In spite of the fact that this taxonomy model is based on previous theoretical definitions of the key concepts related to postcolonial literature (Edwards 2008, Nayar 2008, and Ramone 2011), it seems to be the first formal classification to be devised. It examines well-established concepts, presents the new notion of plasticity of literary genres and explores the current trends regarding research on intersectionality. The decalogue of characteristics of postcolonial literature obtained as a result will facilitate literary criticism and comparative literature studies.

Keywords: Postcolonial literature; Taxonomy; Literary theory; Postcolonial criticism; Plasticity.

Technology Solutions for Interpreters: the VIP System / Soluciones tecnológicas para intérpretes: el sistema VIP

Hermēneus. Revista de Traducción e Interpretación, 2021

Interpreting technologies have abruptly entered the profession in recent years. However, technology still remains a relatively marginal topic of academic debate, although interest in developing tailor-made solutions for interpreters has risen sharply. This paper presents the VIP system, one of the research outputs of the homonymous project VIP-Voicetext Integrated system for interPreters, and its continuation (VIP II). More specifically, a technology-based terminology workflow for simultaneous interpretation is presented.

Über die Terminologie der Merkmale der übersetzten Sprache: Normen, Universalien oder Übersetzungsgesetze? / On the Terminology to Refer to Features of Translated Language: Norms, Universals or Laws of Translation?

by Gloria Corpas Pastor and Mª Ángeles Recio Ariza

Lebende Sprachen, 2021

There is a lack of agreement on the way general features of translational behaviour are termed, defined and operationalised. In addition, most research on this subject has focused mainly on pinpointing and describing these features, while apparently disregarding the factors that influence the recurrence of a specific translational behaviour. In this paper, we aim to clarify and distinguish between the terms used for discussing regular tendencies of translational behaviour. Furthermore, we will discuss the reasons why the concept of laws of translation may be more useful than others to describe and explain these features.

Herramientas y recursos electrónicos para la traducción de la manipulación fraseológica: un estudio de caso centrado en el estudiante

Clina, 2021

This article analyses a case study carried out with students of the course Traducción General «BA-AB» (II) - Inglés-Español / Español-Inglés, taught in the second semester of the second year of the Degree in Translation and Interpreting at the Universidad de Málaga. In a first phase, students were taught how to make the most of various electronic documentation tools and resources (linguistic corpora, lexicographic resources and the web, among others) in order to create textual equivalences in cases where, as a result of interlingual phraseological anisomorphism, the creative modification of phraseological units (PUs) in the source text and the absence of one-to-one correspondences pose serious difficulties for the translation process. Thus, an initial training activity on translating creative uses of phraseological units was followed by a practical session in which the students had to deal with different cases of manipulation in the source text. The analysis of the results will show the extent to which the different documentation resources help trainee translators to overcome the challenge of phraseological manipulation.

Speech-to-Text Technology as a Documentation Tool for Interpreters: a new approach to compiling an ad hoc corpus and extracting terminology from video-recorded speeches

TRANS. Revista de Traductología, 2020

Although interpreting has not yet benefited from technology as much as its sister field, translation, interest in developing tailor-made solutions for interpreters has risen sharply in recent years. In particular, Automatic Speech Recognition (ASR) is being used as a central component of Computer-Assisted Interpreting (CAI) tools, either bundled or standalone. This study pursues three main aims: (i) to establish the most suitable ASR application for building ad hoc corpora by comparing several ASR tools and assessing their performance; (ii) to use ASR in order to extract terminology from the transcriptions obtained from video-recorded speeches, in this case talks on climate change and adaptation; and (iii) to promote the adoption of ASR as a new documentation tool among interpreters. To the best of our knowledge, this is one of the first studies to explore the possibility of Speech-to-Text (S2T) technology for meeting the preparatory needs of interpreters as regards terminology and background/domain knowledge.

Aunque el ámbito de la interpretación no se ha beneficiado de los desarrollos tecnológicos en la misma medida que en traducción, actualmente asistimos al surgimiento de gran interés por desarrollar soluciones adaptadas a las necesidades de los intérpretes. En concreto, el Reconocimiento Automático de Habla (RAH) comienza a ser utilizado como parte de las herramientas de interpretación asistida, bien como componente de tales sistemas o como aplicación autónoma. El presente estudio persigue tres objetivos principales: i) determinar la herramienta de transcripción automática más apropiada para la compilación de corpus ad hoc, comparando diversos sistemas de transcripción automática y evaluando su rendimiento; ii) utilizar RAH para extraer terminología a partir de las transcripciones de discursos orales en vídeo; y iii) promover el uso de RAH como nueva herramienta documental en interpretación. Se trata de uno de los primeros estudios en los que se abordan las posibilidades que ofrece la tecnología habla-texto para cubrir las necesidades terminológicas y documentales de los intérpretes en la fase de preparación de un encargo dado.

PALABRAS CLAVE: transcripción automática, herramientas de interpretación asistida por ordenador, extracción de terminología, corpus ad hoc, tecnologías de la interpretación.

Spanish title: La tecnología habla-texto como herramienta de documentación para intérpretes: Nuevo método para compilar un corpus ad hoc y extraer terminología a partir de discursos orales en vídeo. TRANS, Miscelánea, pp. 263-281.

Remote interpreting in public service settings: technology, perceptions and practice

SKASE Journal of Translation and Interpretation 13(2):58, 2020

Constructional idioms of 'insanity' in English and Spanish: A corpus-based study

Lingua, 2021

This paper presents a corpus-based study of constructions in English and Spanish, with a special emphasis on equivalent semantic-functional counterparts, and potential mismatches. Although usage/corpus-based Construction Grammar (CxG) has attracted much attention in recent years, most studies have dealt exclusively with monolingual constructions. In this paper we will focus on two constructions that represent conventional ways to express 'insanity' in both languages. The analysis will cover grammatical, semantic and informative aspects in order to establish a multi-linguistic prototype of the constructions. To that end, data from several giga-token corpora of contemporary spoken English and Spanish (parallel and comparable) have been selected. This study advances the explanatory potential of constructional idioms for the study of idiomaticity, variability and cross-language analysis. In addition, relevant findings on the dialectal distribution of certain idiom features across both languages and their national varieties are also reported.

Crossing the border between postcolonial reality and the outer world: Translation and representation of the third space into a fourth space

by María Remedios Fernández Ruiz, PhD and Gloria Corpas Pastor

Cultura, lenguaje y representación, 2019

Stemming from poststructuralist interpretations of space and following Bhabha’s third space enunciation, in this paper we have coined the term fourth space and used this concept as a heuristic tool to address the need to establish a coherent standpoint for the analysis of postcolonial literature reception within a society with no immediate relation to the specific decolonisation process of the author’s country. We explore this concept through the case of the Spanish reception of African postcolonial literature. In Spain, this perspective has remained under-theorised in an era when representation of hybridity is at a vital point, since such representation will provide the social scaffolding for each person’s identity construction. Under these circumstances, literature can be transformative and the role of translation as a decolonising tool can help to create unbiased knowledge through an intentionally objective and unprejudiced interpretation of the original texts. We will analyse how those differentiating elements affect the translational process.

A raíz de las interpretaciones postestructuralistas del espacio y siguiendo parcialmente la articulación del tercer espacio de Bhabha, en este artículo hemos acuñado el término cuarto espacio, utilizando este concepto como una herramienta heurística que aborde la necesidad de establecer una postura coherente con el análisis de la recepción de la literatura poscolonial en una sociedad que carezca de una relación inmediata con el proceso de descolonización específico del país del autor. Exploramos el concepto a través de la recepción de la literatura poscolonial africana en España. En este país, dicha perspectiva aún tiene muchas vertientes sin desarrollar en una época en la que la representación de la hibridación está en un momento vital, ya que la representación proporciona el andamiaje social para la construcción de la identidad individual. En este contexto, la literatura se torna transformadora y el rol que asume la traducción como herramienta descolonizadora puede contribuir a crear conocimiento sin sesgos a partir de una interpretación del texto original que se pretende sea objetiva y carente de prejuicios. Se analizará cómo estos elementos diferenciadores afectan al proceso traslativo.

Nine terminology extraction tools – Are they useful for translators?

by Miriam Seghiri, Anna Zaretskaya, and Gloria Corpas Pastor

Terminology extraction tools (TETs) have become an indispensable resource in education, research and business. Today, users can find a great variety of terminology extraction tools of all kinds, and they all offer different features. Apart from many other areas, these tools are especially helpful in the professional translation setting. We do not know, however, if the existing tools have all the necessary features for this kind of work. In search for the answer, we looked at nine selected tools available on the market to find out if they provide the translators’ most favorite features....

Entornos de formación en red: Tutoría Online. E-learning y e-moderación para la enseñanza-aprendizaje de la traducción científica

by Cristina Castillo, Gloria Corpas Pastor, and Miriam Seghiri
