ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Belouadi, Jonas; Eger, Steffen

doi:10.18653/v1/2023.acl-long.406

Computer Science > Computation and Language

arXiv:2212.10474 (cs)

[Submitted on 20 Dec 2022 (v1), last revised 22 May 2023 (this version, v2)]

Title:ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Authors:Jonas Belouadi, Steffen Eger

View PDF

Abstract:State-of-the-art poetry generation systems are often complex. They either consist of task-specific model pipelines, incorporate prior knowledge in the form of manually created constraints, or both. In contrast, end-to-end models would not suffer from the overhead of having to model prior knowledge and could learn the nuances of poetry from data alone, reducing the degree of human supervision required. In this work, we investigate end-to-end poetry generation conditioned on styles such as rhyme, meter, and alliteration. We identify and address lack of training data and mismatching tokenization algorithms as possible limitations of past attempts. In particular, we successfully pre-train ByGPT5, a new token-free decoder-only language model, and fine-tune it on a large custom corpus of English and German quatrains annotated with our styles. We show that ByGPT5 outperforms other models such as mT5, ByT5, GPT-2 and ChatGPT, while also being more parameter efficient and performing favorably compared to humans. In addition, we analyze its runtime performance and demonstrate that it is not prone to memorization. We make our code, models, and datasets publicly available.

Comments:	Accepted at ACL 2023 (main track)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.10474 [cs.CL]
	(or arXiv:2212.10474v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10474
Related DOI:	https://doi.org/10.18653/v1/2023.acl-long.406

Submission history

From: Jonas Belouadi [view email]
[v1] Tue, 20 Dec 2022 17:49:49 UTC (9,958 KB)
[v2] Mon, 22 May 2023 21:15:06 UTC (9,959 KB)

Computer Science > Computation and Language

Title:ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators