University of Szeged
Department of General Linguistics
Cikkünkben bemutatjuk a laikusoknak címzett hivatalos szövegek osztályozási kísérletét felügyelt gépi tanuló algoritmusok segítségével. Vizsgálatunkhoz szakértők által, kézzel készített korpuszt használunk, amely közérthetőre fogalmazott... more
The current article briefl y presents a pilot machinelearning experiment on the classifi cation of offi cial texts addressed to lay readers with the use of support vector machine as a baseline and fastText models. For this purpose, a... more
The current article briefly presents a pilot machine-learning experiment on the classification of official texts addressed to lay readers with the use of support vector machine as a baseline and fastText models. For this purpose, a... more
In this paper, a brief study will be presented with regard to the issue of Named Entity Recognition (NER) in legal texts. To get an overall picture, we examined closely the output of two existing analysers: the "magyarlanc" linguistic... more
This paper outlines the ParlaMint project from the perspective of its goals, tasks, participants, results and applications potential. The project produced language corpora from the sessions of the national parliaments of 17 countries,... more
A dolgozatban bizonyos pragmatikai és szemantikai sajátságokat vizsgálunk magyar nyelvű, nagy méretű spontánbeszéd-korpusz (StaffTalk) alapján. A vizsgálati korpusz is hétköznapi szituációkban, külső hatásoknak is kitett munkahelyi... more
The growing number of digitally accessible text corpora and the accelerating development of NLP tools and methods (particularly the emergence of powerful large-scale language models) have allowed their widespread use in various... more
Harnessing the power of Deep Learning is becoming commonplace nowadays, and the legal field is no exception. Most solutions embrace supervised approaches that require a lot of labeled data. Active Learning is a technique that exploits the... more
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY
One crucial aspect of access to justice and access to legal information is the comprehensibility of legal text. The complexity and specialized terminology of legal language often prevents citizens from understanding legal texts and... more
Automated sentiment analysis of textual data is one of the central and most challenging tasks in political communication studies. However, the toolkits available are primarily for English texts and require contextual adaptation to produce... more