ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

Gupta, Nilesh; Chen, Patrick H.; Yu, Hsiang-Fu; Hsieh, Cho-Jui; Dhillon, Inderjit S

Computer Science > Machine Learning

arXiv:2210.08410 (cs)

[Submitted on 16 Oct 2022 (v1), last revised 9 Jan 2023 (this version, v2)]

Title:ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

Authors:Nilesh Gupta, Patrick H. Chen, Hsiang-Fu Yu, Cho-Jui Hsieh, Inderjit S Dhillon

View PDF

Abstract:Extreme multi-label classification (XMC) is a popular framework for solving many real-world problems that require accurate prediction from a very large number of potential output choices. A popular approach for dealing with the large label space is to arrange the labels into a shallow tree-based index and then learn an ML model to efficiently search this index via beam search. Existing methods initialize the tree index by clustering the label space into a few mutually exclusive clusters based on pre-defined features and keep it fixed throughout the training procedure. This approach results in a sub-optimal indexing structure over the label space and limits the search performance to the quality of choices made during the initialization of the index. In this paper, we propose a novel method ELIAS which relaxes the tree-based index to a specialized weighted graph-based index which is learned end-to-end with the final task objective. More specifically, ELIAS models the discrete cluster-to-label assignments in the existing tree-based index as soft learnable parameters that are learned jointly with the rest of the ML model. ELIAS achieves state-of-the-art performance on several large-scale extreme classification benchmarks with millions of labels. In particular, ELIAS can be up to 2.5% better at precision@1 and up to 4% better at recall@100 than existing XMC methods. A PyTorch implementation of ELIAS along with other resources is available at this https URL.

Comments:	21 pages, 9 figures, NeurIPS 2022 camera-ready publication
Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR)
Cite as:	arXiv:2210.08410 [cs.LG]
	(or arXiv:2210.08410v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.08410

Submission history

From: Nilesh Gupta [view email]
[v1] Sun, 16 Oct 2022 01:34:17 UTC (3,782 KB)
[v2] Mon, 9 Jan 2023 19:40:35 UTC (3,782 KB)

Computer Science > Machine Learning

Title:ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators