Tanara Zingano Kuhn

University of Coimbra, Celga-Iltec, Researcher

Universidade de Lisboa, Departamento de Linguistica Geral e Românica, Alumna

PhD in Applied Linguistics
Address: Universidade de Coimbra|University of Coimbra
Centro de Estudos de Linguística Geral e Aplicada (CELGA-ILTEC)|Centre for General and Applied Linguistics Studies (CELGA-ILTEC)
Faculdade de Letras • Largo da Porta Férrea • 3004-530 COIMBRA • PORTUGAL
E-mail: tanarazingano@uc.pt |tanarazingano@outlook.com

Papers by Tanara Zingano Kuhn

Data preparation in crowdsourcing for pedagogical purposes: the case of the CrowLL game

by Tanara Zingano Kuhn and Kristina Koppel

Slovenščina 2.0, 2022

One way to stimulate the use of corpora in language education is by making pedagogically appropriate corpora, labeled with different types of problems (sensitive content, offensive language, structural problems). However, manually labeling corpora is extremely time-consuming and a better approach should be found. We thus propose a combination of two approaches to the creation of problem-labeled pedagogical corpora of Dutch, Estonian, Slovene and Brazilian Portuguese: the use of games with a purpose and of crowdsourcing for the task. We conducted initial experiments to establish the suitability of the crowdsourcing task, and used the lessons learned to design the Crowdsourcing for Language Learning (CrowLL) game in which players identify problematic sentences, classify them, and indicate problematic excerpts. The focus of this paper is on data preparation, given the crucial role that such a stage plays in any crowdsourcing project dealing with the creation of language learning resources. We present the methodology for data preparation, offering a detailed presentation of source corpora selection, pedagogically oriented GDEX configurations, and the creation of lemma lists, with a special focus on common and language-dependent decisions. Finally, we offer a discussion of the challenges that emerged and the solutions that have been implemented so far.

O desenho de uma aplicação de MAVL em PLE destinado a aprendentes chineses

by Tanara Zingano Kuhn and Margarita Correia

Entrepalavras, Fortaleza, 2022

This paper presents the design of UVA, a Mobile-assisted Vocabulary Learning (MAVL) application for Portuguese as a Foreign Language (PLE) aimed at Chinese learners. The design content is based on research into foreign-language vocabulary teaching and learning (NATION, 1990, 2000; MA, 2006, 2009; BEATTY, 2010a; BEATTY, 2010b; JIANG, 2000) and on an adaptation of the strategies of O'Malley and Chamot (1990) and Oxford (1990a). In addition, the learning process in the application draws on several studies in technology-assisted learning (GOODFELLOW, 2006; LAUFER et al., 2000; GROOT, 2000). UVA is intended to reflect the reality of Portuguese vocabulary learning and the habits and needs of Chinese learners when using MAVL applications. To that end, a survey was administered to 133 Chinese learners, and its results provided essential information for a design better suited to the target audience. UVA's structure consists of five modules: Escolha de Vocabulário a aprender (choosing the vocabulary to learn); Aprendizagem de Vocabulário (vocabulary learning, subdivided into three stages: deduction, consolidation, and review); Dicionário (dictionary); Administração de Aprendizagem (learning management); and Campo Social (social area). It is an unprecedented resource that seeks to facilitate and make more flexible ...

Crowdsourcing pedagogical corpora for lexicographical purposes

by Tanara Zingano Kuhn and Rina Zviel-Girshin

Proceedings of EURALEX 2020 Conference, Volume II. Komotini: SynMorPhoSe Lab, Democritus University of Thrace, v.2., 2021

O Corpus de Português Escrito em Periódicos - CoPEP

DELTA: Documentação de Estudos em Lingüística Teórica e Aplicada, 2020

This study describes the challenges and solutions encountered in compiling the Corpus de Português Escrito em Periódicos (CoPEP), which contains approximately 40 million words, is balanced between the Brazilian and European Portuguese varieties in number of words, and covers six broad areas of knowledge. We first present the context in which CoPEP was created, namely the development of an online dictionary of Portuguese for university students, for which it served as the primary source of linguistic evidence. The characteristics of that lexicographic project therefore informed the corpus design criteria and the resulting decisions. We then describe the data acquisition methodology, with a special focus on the challenges faced and the solutions found. We conclude with a description of the final compilation phase, in which we applied a series of procedures to achieve balance.

Português como Língua Adicional no Brasil - perfis e contextos implicados [2020]

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

The field of Portuguese as an Additional Language (PLA) traditionally covers issues of education and language policy involving Portuguese for speakers of other languages, that is, in contexts where it is not the language of initial socialization of the student/examinee or of a given community. In this article, we introduce the field of PLA by discussing some terminological variation in the very name of the field in Brazil, and by briefly exploring the audiences and contexts in which PLA professionals may work in terms of teaching, assessment, research, technical-scientific production, and language policy.

Português como Língua Adicional: uma entrevista com Marisa Mendonça [2020]

by Gabriela Bulla and Tanara Zingano Kuhn

Revista Virtual de Estudos da Linguagem - ReVEL, 2020

Professor Marisa Mendonça opens this interview with the history of how the field of Portuguese as an Additional Language (PLA) was established in Mozambique. She then presents the socio-historical, linguistic, and cultural characteristics of the African continent in order to contextualize the specificities and challenges found there in the teaching and learning of PLA. She also reflects on the role of the IILP in the field of PLA and shares her expert opinion on what she considers essential for a curriculum of initial and continuing education for PLA teachers, highlighting the main challenges and issues facing the field in the future. Finally, she offers reading recommendations for those interested in entering this area of study.

STATE-OF-THE-ART ON MONOLINGUAL LEXICOGRAPHY FOR BRAZIL

Slovenščina 2.0, 2019

Zingano Kuhn, Tanara: State-of-the-art on monolingual lexicography for Brazil (Brazilian Portuguese). Slovenščina 2.0, 7 (1): 98-112. This paper is a minireview of the current status of monolingual lexicography in Brazil. Firstly, a brief contextualization of the origins of Brazilian Portuguese dictionary-making is provided. Then, an account of contemporary monolingual dictionaries is given and a more detailed overview of print, digital, spelling, and school dictionaries is presented. Next, research into dictionary use is reviewed. Finally, the perception among Brazilians with regard to corpora and the use of crowdsourcing in lexicography is discussed.

Identification and automatic extraction of good dictionary examples: the case(s) of GDEX

by Kristina Koppel and Tanara Zingano Kuhn

International Journal of Lexicography

Examples have always been an important part of a dictionary entry. As Rundell and Atkins (2008: 454) point out, ‘you sometimes find that an entry is almost incomprehensible without its examples.’ This argument is strengthened by the recent findings of Frankenberg-Garcia (2012, 2014) that several corpus examples can sometimes be even more useful than the definition. ... Selecting examples is a great challenge to lexicographers, not only because they need to find examples that meet criteria of a good dictionary example (criteria may differ depending on the target users) but also because the sources of examples, i.e. corpora, are getting larger and larger, nowadays containing several billion words or more, and it is inconceivable that...

THE IMAGE OF THE MONOLINGUAL DICTIONARY ACROSS EUROPE. RESULTS OF THE EUROPEAN SURVEY OF DICTIONARY USE AND CULTURE

International Journal of Lexicography, 2018

The article presents the results of a survey on dictionary use in Europe, focusing on general monolingual dictionaries. The survey is the broadest survey of dictionary use to date, covering close to 10,000 dictionary users (and non-users) in nearly thirty countries. Our survey covers varied user groups, going beyond the students and translators who have tended to dominate such studies thus far. The survey was delivered via an online survey platform, in language versions specific to each target country. It was completed by 9,562 respondents, over 300 respondents per country on average. The survey consisted of the general section, which was translated and presented to all participants, as well as country-specific sections for a subset of 11 countries, which were drafted by collaborators at the national level. The present report covers the general section.

A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students (Dissertation abstract)

Journal of Portuguese Linguistics, 2019

The objective of this PhD project was to propose the design of an online corpus-driven dictionary of Portuguese for university students (DOPU), aimed at both speakers of Portuguese as a mother tongue and as an additional language and covering Brazilian and European Portuguese varieties. For that, the highly innovative semi-automated approach to dictionary-making (Gantar, Kosem and Krek 2016) was adopted, which involves automatic extraction of data from the corpus and import into a dictionary writing system. As the method had never been applied to lexicographical projects of the Portuguese language, it was necessary to test the approach for the first time. Thus, all the prerequisites were newly developed, namely, a corpus of academic texts, a sketch grammar, a GDEX configuration, and a specially tailored procedure for automatic extraction of data. The experiment indicated that not only can this approach be successfully used as a means to provide lexical content for the design of DOPU, but it can also be beneficial to other lexicographical projects of Portuguese.

The CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)

by Tanara Zingano Kuhn and Margarita Correia

Proceedings of Euralex 2018, 2018

DEVISING A SKETCH GRAMMAR FOR ACADEMIC PORTUGUESE

by Tanara Zingano Kuhn and Iztok Kosem

Slovenščina 2.0: empirical, applied and interdisciplinary research, 2016

This paper presents the development of a new sketch grammar designed specifically for CoPEP, a newly compiled 40-million-word corpus comprising texts from academic journals, tagged with Freeling v3, the default tagger available in the Sketch Engine for corpora of Portuguese. We first provide an overview and evaluation of existing sketch grammars for Portuguese, followed by a detailed description of the development of a new sketch grammar, and the presentation of some of the problems encountered. We conclude by summarizing the main findings, highlighting important implications, and offering suggestions for further improvement of the sketch grammar. More accurate and varied word sketch results than those offered by the current default sketch grammar indicate that our sketch grammar can be used for advanced lexicographic tasks such as automatic extraction of lexical data from CoPEP, the methodology of knowledge acquisition planned for the compilation of a dictionary of Portuguese for university students. Moreover, this new sketch grammar can be used with any other corpus of Portuguese tagged with Freeling v3, which makes it an important resource for lexicographic and corpus linguistic research of the Portuguese language.

Trabalhando gêneros orais em um curso Técnico em Biotecnologia: sugestão de tarefa para estudar a organização interna de uma palestra

Revista Bem Legal, 2015

Resenha Oxford Learner's Dictionary of Academic English

BELT - Brazilian English Language Teaching Journal, 2015

VOCABULÁRIO CONTROLADO E REDAÇÃO DE DEFINIÇÕES EM DICIONÁRIOS DE PORTUGUÊS PARA ESTRANGEIROS: ENSAIOS PARA UMA LÉXICO-ESTATÍSTICA TEXTUAL

by Aline Evers, Aline Maciel Pereira, and Tanara Zingano Kuhn

Initial study in lexical-textual statistics that aims at collecting data to support the construction of a basic controlled vocabulary (CV) to be a reference for writing definitions in a Portuguese learner’s dictionary. We used vocabulary frequency data from Brazilian popular newspapers and we also analyzed three different corpora. After comparing the most frequent words of each source, we evaluated the use of CVs to prepare a set of test entries. The results demonstrate the proper use of these corpora for the composition of a CV and the relevance of statistical linguistics for its compilation.

Foregrounding the Development of an Online Dictionary for Intermediate-level Learners of Brazilian Portuguese as an Additional Language: Initial Contributions

Proceedings of Euralex 2012, 2012

On the proposal of an on-line Brazilian Portuguese dictionary for speakers of Asian languages: an ongoing experiment

Proceedings of ASIALEX 2011, 2011

Marcadores discursivos em conversa no português brasileiro: proposta de atividade didática para nível básico o ensino de português como língua adicional

Atas CONFERÊNCIA DA KALUBS, 2011

Uso de vocabulário controlado em dicionários de português como língua estrangeira em formato on-line: uma experiência em andamento para uso de aprendizes coreanos

Atas do IIISIMELP: A formação de novas gerações de falantes de português no mundo, 2011

Produção de livro didático de nível básico para ensino de português brasileiro para falantes de línguas distantes: decisões teórico-práticas

Atas do IIISIMELP: A formação de novas gerações de falantes de português no mundo, 2011

Proposta de desenvolvimento de uma Plataforma On-line de Dicionários de Colocações Acadêmicas

I Congresso de Português como Língua Estrangeira na Columbia University, 2021

Ottaiano, Adriane Orenha; Kuhn, Tanara Zingano; Valencio, Carlos Roberto; Tenório, William

The building of an Online Platform for Monolingual Dictionaries of Academic Collocations in Portuguese and English

56th Linguistics Colloquium, 2020

Pluricentrismo e sistemas de certificação de competências em língua portuguesa – o caso dos estudantes estrangeiros em Portugal

by Tanara Zingano Kuhn, Catarina Gaspar, and Margarita Correia

III Simpósio Internacional de Ensino de Português como Língua Adicional (SINEPLA) - programação e resumos, 2021

Mou, Xiao; Gaspar, Catarina; Correia, Margarita; Kuhn, Tanara Zingano

Gamifying the path to corpus-based pedagogical dictionaries

Electronic lexicography in the 21st century (eLex 2021): Post-editing lexicography. Book of abstracts, 2021

Corpus cleaning for language learning resource development

EUROCALL Conference, 2019

Desenvolvimento de uma configuração GDEX para um corpus de português acadêmico

VII Simpósio Mundial de Estudos de Língua Portuguesa – SIMELP, 2019

Corpus Filtering via Crowdsourcing for Developing a Learner’s Dictionary

Electronic lexicography in the 21st century (eLex 2019): Smart Lexicography, 2019

Ensino de português como língua de acolhimento em Portugal: Análise do material didático Caderno de Formação – propostas de atividades e exercícios

Livro de Resumos VI Jornadas Pedagógicas de Língua Portuguesa, 2019

Crowdsourcing corpus cleaning for language learning - an approach proposal

by Tanara Zingano Kuhn and Rina Zviel-Girshin

3rd enetCollect Annual Meeting, 2019

Introducing CoPEP, the Corpus de Português Escrito em Periódicos (Corpus of Portuguese from Academic Journals)

14th American Association for Corpus Linguistics (AACL) Conference, 2014

Dando corpo às diversas vozes do português: o projeto corpus CPLP

by Tanara Zingano Kuhn and Margarita Correia

II Simpósio Internacional de Ensino de Português Língua Adicional-SINEPLA, 2018

Uma experiência no Curso de Português-Espanhol para Intercâmbio (CEPI): a formação de professoras para contextos on-line e inserção de vídeos explicativos

by Tanara Zingano Kuhn and Kétina Timboni

Salão de Ensino UFRGS, 2018

Analisando pacotes lexicais em um corpus multinacional de português acadêmico

by Tanara Zingano Kuhn and Margarita Correia

IX Escola Brasileira de Linguística Computacional (EBRALC2017) e XIV Encontro de Linguística de Corpus (ELC 2017), 2017

Reporting on the development of sketch grammar for academic Portuguese

Caderno de Resumos ELC-EBRALC 2017, 2017

Extended abstract

Experimenting automatic creation of content for a dictionary of academic Portuguese

ELEX 2017. Electronic Lexicography in the 21st Century. Lexicography from Scratch, 2017

Dealing with multiple orthographic standards within a single corpus: the case of Portuguese in the CoPEP corpus

by Tanara Zingano Kuhn and Margarita Correia

9èmes Journées Internationales de la Linguistique de Corpus, 2017. Livret, 2017

Extended abstract

Princípios e parâmetros para o desenho de um dicionário on-line de português para estudantes universitários

CLUL-LINGME - Linguistic Meeting for Young Researchers, 2016

Building a corpus of written academic texts in Portuguese

12th Teaching and Language Corpora Conference (TALC12), 2016

Usando o Sketch Engine para a obtenção de evidências lexicográficas de um corpus de português

Colóquio Comemorativo dos 40 Anos do Centro de Linguística da Universidade do Porto, 2016

O uso de dicionários de português por estudantes universitários

X Fórum de Partilha Linguística, 2015

Português língua pluricêntrica: das políticas às práticas

Português língua pluricêntrica: das políticas às práticas, 2022

At a time when the growing economic value of Portuguese is being assessed and policymakers have enshrined the use of the term "Portuguese as a pluricentric language", it is necessary to discuss what pluricentrism consists of, what it means for its speakers, and what implications it has for linguistic and literary research, teacher training, teaching practices and assessment systems. The III Simpósio Internacional sobre o Ensino de Português como Língua Adicional (SINEPLA), on the theme "Português língua pluricêntrica: das políticas às práticas", held online from 16 to 18 June 2021 and organised by CELGA-ILTEC/University of Coimbra, the University of Westminster and the Instituto de Letras of UFRGS, sought to provide an opportunity to reflect on these topics. The texts published in the book "Português língua pluricêntrica: das políticas às práticas" stem from papers presented at the Symposium.
Across 17 chapters, the book offers reflections on policies and practices in Portuguese as a pluricentric language, the teaching of Portuguese as an Additional Language (PLA) for specific purposes and audiences, linguistic reflection, teacher training, the use of literary texts in PLA classes, and the Celpe-Bras exam. The work seeks to help the debate on the use of the concept "Portuguese as a pluricentric language" continue to broaden our understanding of the complex factors involved in naming the languages one works with and of the possible implications of using that concept.

Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference.

by Tanara Zingano Kuhn, Margarita Correia, Maarten Janssen, and Miloš Jakubíček

Proceedings of the eLex 2019 conference. 1-3 October 2019, Sintra, Portugal, 2019

edited by Iztok Kosem, Tanara Zingano Kuhn, Margarita Correia, José Pedro Ferreira, Maarten Janse...

Dicionário de linguística da enunciação

Equipe Roman Jakobson

Introdução aos estudos de Roman Jakobson sobre afasia

Apresentação. Português como língua pluricêntrica nas práticas de profissionais da linguagem participantes do III SINEPLA

by Tanara Zingano Kuhn and Margarita Correia

Português língua pluricêntrica: das políticas às práticas, 2022

ANÁLISE COMPARATIVA DAS EDIÇÕES PORTUGUESA E BRASILEIRA DA OBRA OS LIVROS QUE DEVORARAM O MEU PAI, DE AFONSO CRUZ

by Isabel Garcez and Tanara Zingano Kuhn

HISTÓRIA, CULTURA E POLÍTICA NO MUNDO LUSÓFONO, 2021

In recent years, editorial revision processes have benefited from insights and guidance from linguistics, as the science of language, and also from natural language processing or computational linguistics tools, which can be used for tasks such as corpus analysis, text generation and summarisation, translation and paraphrasing, among others.

Developing pedagogically appropriate language corpora through crowdsourcing and gamification

by Rina Zviel-Girshin, Tanara Zingano Kuhn, and Branislava Šandrih

CALL and professionalisation: short papers from EUROCALL 2021

Despite the unquestionable academic interest in corpus-based approaches to language education, the use of corpora by teachers in their everyday practice is still not very widespread. One way to promote the use of corpora in language teaching is by making pedagogically appropriate corpora, labelled with different types of problems (for instance, sensitive content, offensive language, structural problems), so that teachers can select authentic examples according to their needs. Because manually labelling corpora is extremely time-consuming, we propose to use crowdsourcing for this task. After a first exploratory phase, we are currently developing a multimode, multilanguage game in which players first identify problematic sentences and then classify them.

Vocabulário Ortográfico Comum da Língua Portuguesa (VOC)

by Tanara Zingano Kuhn and Gildaris Pandim

Panorama da contribuição do Brasil para a difusão do português. Fundação Alexandre Gusmão. Ministério das Relações Exteriores, 2021

Panorama da contribuição do Brasil para a difusão do português
Description:
A reference publication bringing together 33 entries written by renowned specialists in various fields of knowledge and 17 testimonials from acclaimed writers, artists and intellectuals, which reveal the importance of Brazilian culture in their development as craftspeople of the word in the Portuguese language.
Editors: Alexandre Pilati | Nelson Viana

One Book, Two Language Varieties

by Isabel Garcez, Anabela Barreiro, and Tanara Zingano Kuhn

Springer, 2020

This paper presents a comparative study of alignment pairs, either contrasting expressions or stylistic variants of the same expression in the European (EP) and the Brazilian (BP) varieties of Portuguese. The alignments were collected semi-automatically using the CLUE-Aligner tool, which records all pairs of paraphrastic units resulting from the alignment task in a database. The corpus used was the children's literature book Os Livros Que Devoraram o Meu Pai (The Books that Devoured My Father) by the Portuguese author Afonso Cruz and the Brazilian adaptation of this book. The main goal of the work presented here is to gather equivalent phrasal expressions and different syntactic constructions that convey the same meaning in EP and BP, and to contribute to the optimisation of the editorial processes that are compulsory in the adaptation of texts but suitable for any type of editorial process. This study provides a scientific basis for future work on editing, proofreading and converting text to and from any variety of Portuguese from a computational point of view, namely for use in a paraphrasing system with a variety adaptation functionality, even in the case of a literary text. We consider "challenging" cases, from a literary point of view, looking for alternatives that do not tamper with the rich imagery of the original version.

Proposta de critérios norteadores para produção de manual didático de português brasileiro língua adicional

Bulla, Gabriela S.; Uflacker, Cristina M.; Schlatter, Margarete. Práticas pedagógicas e materiais didáticos para o ensino de Português como Língua Adicional, 2019

Princípios de análise enunciativa de fatos de língua

MA dissertation, 2009

A Design Proposal of an Online Corpus-Driven Dictionary of Portuguese for University Students

PhD Thesis, 2017