CiteME: Can Language Models Accurately Cite Scientific Claims?

Press, Ori; Hochlehnert, Andreas; Prabhu, Ameya; Udandarao, Vishaal; Press, Ofir; Bethge, Matthias

arXiv:2407.12861 (cs)

[Submitted on 10 Jul 2024 (v1), last revised 3 Nov 2024 (this version, v2)]

Abstract: Thousands of new scientific papers are published each month. Such information overload complicates researchers' efforts to stay current with the state of the art as well as to verify and correctly attribute claims. We pose the following research question: Given a text excerpt referencing a paper, could an LM act as a research assistant to correctly identify the referenced paper? We advance efforts to answer this question by building a benchmark that evaluates the abilities of LMs in citation attribution. Our benchmark, CiteME, consists of text excerpts from recent machine learning papers, each referencing a single other paper. Evaluation on CiteME reveals a large gap between frontier LMs and human performance, with LMs achieving only 4.2-18.5% accuracy and humans 69.7%. We narrow this gap by introducing CiteAgent, an autonomous system built on the GPT-4o LM that can also search and read papers, which achieves an accuracy of 35.3% on CiteME. Overall, CiteME serves as a challenging testbed for open-ended claim attribution, driving the research community towards a future where any claim made by an LM can be automatically verified and discarded if found to be incorrect.

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as: arXiv:2407.12861 [cs.CL] (or arXiv:2407.12861v2 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.2407.12861

Submission history

From: Ori Press
[v1] Wed, 10 Jul 2024 11:31:20 UTC (1,825 KB)
[v2] Sun, 3 Nov 2024 20:58:35 UTC (3,631 KB)
