A Systematization of the Wagner Framework: Graph Theory Conjectures and Reinforcement Learning

Angileri, Flora; Lombardi, Giulia; Fois, Andrea; Faraone, Renato; Metta, Carlo; Salvi, Michele; Bianchi, Luigi Amedeo; Fantozzi, Marco; Galfrè, Silvia Giulia; Pavesi, Daniele; Parton, Maurizio; Morandin, Francesco

arXiv:2406.12667 (cs)

[Submitted on 18 Jun 2024 (v1), last revised 17 Sep 2024 (this version, v2)]

Abstract: In 2021, Adam Zsolt Wagner proposed an approach to disprove conjectures in graph theory using Reinforcement Learning (RL). Wagner's idea can be framed as follows: consider a conjecture stating that a certain quantity satisfies f(G) < 0 for every graph G; one can then play a single-player graph-building game in which, at each turn, the player decides whether or not to add an edge. The game ends when all edges have been considered, resulting in a graph G_T, and f(G_T) is the final score of the game; RL is then used to maximize this score. This brilliant idea is as simple as it is innovative, and it lends itself to systematic generalization. Several different single-player graph-building games can be employed, along with various RL algorithms. Moreover, RL maximizes the cumulative reward, which allows for step-by-step rewards instead of a single final score, provided that the final cumulative reward equals the quantity of interest f(G_T). In this paper, we discuss these and various other choices that can be significant in Wagner's framework. As a contribution to this systematization, we present four distinct single-player graph-building games. Each game employs both a step-by-step reward system and a single final score. We also propose a principled approach to selecting the most suitable neural network architecture for any given conjecture, and we introduce a new dataset of graphs labeled with their Laplacian spectra. Furthermore, we provide a counterexample to a conjecture regarding the sum of the matching number and the spectral radius, which is simpler than the example provided in Wagner's original paper.
The games have been implemented as environments in the Gymnasium framework, and along with the dataset, are available as open-source supplementary materials.
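
The game described in the abstract can be illustrated with a minimal sketch. It is not the authors' Gymnasium implementation: the class `LinearBuildGame`, the helper `play`, and the score `toy_score` (edges minus twice the number of triangles) are hypothetical names chosen for illustration, and `toy_score` merely stands in for a conjecture-specific f(G). The sketch visits every possible edge once, lets the player add or skip it, and pays the step reward f(G_t) - f(G_{t-1}), so that the cumulative reward telescopes to f(G_T) - f(G_0), which equals f(G_T) here because the toy score of the empty graph is 0.

```python
from itertools import combinations
from typing import Callable, List, Set, Tuple

Edge = Tuple[int, int]


def toy_score(n: int, edges: Set[Edge]) -> float:
    """Hypothetical stand-in for f(G): number of edges minus twice the triangles."""
    triangles = sum(
        1
        for a, b, c in combinations(range(n), 3)
        if {(a, b), (a, c), (b, c)} <= edges
    )
    return len(edges) - 2.0 * triangles


class LinearBuildGame:
    """Visit each possible edge once; action 1 adds it, action 0 skips it."""

    def __init__(self, n: int, score: Callable[[int, Set[Edge]], float]) -> None:
        self.n = n
        self.score = score
        # Fixed visiting order of all n*(n-1)/2 candidate edges (a < b).
        self.candidates: List[Edge] = list(combinations(range(n), 2))
        self.reset()

    def reset(self) -> None:
        self.edges: Set[Edge] = set()
        self.t = 0
        self.last = self.score(self.n, self.edges)

    def step(self, action: int) -> Tuple[float, bool]:
        if action == 1:
            self.edges.add(self.candidates[self.t])
        self.t += 1
        current = self.score(self.n, self.edges)
        reward = current - self.last  # step reward f(G_t) - f(G_{t-1})
        self.last = current
        done = self.t == len(self.candidates)
        return reward, done


def play(game: LinearBuildGame, actions: List[int]) -> float:
    """Play one full episode and return the cumulative reward."""
    game.reset()
    total = 0.0
    for action in actions:
        reward, _ = game.step(action)
        total += reward
    return total


if __name__ == "__main__":
    game = LinearBuildGame(4, toy_score)
    total = play(game, [1, 1, 0, 1, 0, 1])
    # The cumulative reward matches the final score of the built graph.
    print(total, toy_score(4, game.edges))
```

In an RL setting the fixed action list would be replaced by a policy's choices; replacing the step reward with 0 on every turn except the last, where f(G_T) is paid, would give the single-final-score variant of the same game.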

Comments:	Accepted at the 27th International Conference on Discovery Science
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.12667 [cs.LG]
	(or arXiv:2406.12667v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.12667

Submission history

From: Maurizio Parton
[v1] Tue, 18 Jun 2024 14:40:20 UTC (53 KB)
[v2] Tue, 17 Sep 2024 09:42:43 UTC (143 KB)
