ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

被引：0

作者：

Gupta, Nilesh ^{[1
]}

Chen, Patrick H. ^{[2
]}

Hsiang-Fu Yu ^{[3
]}

Cho-Jui Hsieh ^{[2
]}

Dhillon, Inderjit S. ^{[1
,4
]}

机构：

[1] UT Austin, Austin, TX 78712 USA

[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA

[3] Amazon, Seattle, WA USA

[4] Google, Mountain View, CA 94043 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Extreme multi-label classification (XMC) is a popular framework for solving many real-world problems that require accurate prediction from a very large number of potential output choices. A popular approach for dealing with the large label space is to arrange the labels into a shallow tree-based index and then learn an ML model to efficiently search this index via beam search. Existing methods initialize the tree index by clustering the label space into a few mutually exclusive clusters based on pre-defined features and keep it fixed throughout the training procedure. This approach results in a sub-optimal indexing structure over the label space and limits the search performance to the quality of choices made during the initialization of the index. In this paper, we propose a novel method ELIAS which relaxes the tree-based index to a specialized weighted graph-based index which is learned end-to-end with the final task objective. More specifically, ELIAS models the discrete cluster-to-label assignments in the existing tree-based index as soft learnable parameters that are learned jointly with the rest of the ML model. ELIAS achieves state-of-the-art performance on several large-scale extreme classification benchmarks with millions of labels. In particular, ELIAS can be up to 2.5% better at precision@1 and up to 4% better at recall@100 than existing XMC methods. A PyTorch implementation of ELIAS along with other resources is available at https://github.com/nilesh2797/ELIAS.

引用

页数：12

共 50 条

[41] A survey on end-to-end point cloud learning
Tang, Xikai
Huang, Fangzheng
Li, Chao
Ban, Dayan
IET IMAGE PROCESSING, 2023, 17 (05) : 1307 - 1321
[42] Machine Learning for End-to-End Congestion Control
Zhang, Ticao
Mao, Shiwen
IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (06) : 52 - 57
[43] End-to-end deep learning with neuromorphic photonics
Dabos, G.
Mourgias-Alexandris, G.
Totovic, A.
Kirtas, M.
Passalis, N.
Tefas, A.
Pleros, N.
INTEGRATED OPTICS: DEVICES, MATERIALS, AND TECHNOLOGIES XXV, 2021, 11689
[44] End-to-end Learning for Encrypted Image Retrieval
Feng, Qihua
Li, Peiya
Lu, ZhiXun
Liu, Guan
Huang, Feiran
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1839 - 1845
[45] End-To-End Learning for Action Quality Assessment
Li, Yongjun
Chai, Xiujuan
Chen, Xilin
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 125 - 134
[46] End-to-End Learning of Decision Trees and Forests
Hehn, Thomas M.
Kooij, Julian F. P.
Hamprecht, Fred A.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (04) : 997 - 1011
[47] End-to-End Ultrametric Learning for Hierarchical Segmentation
Lapertot, Raphael
Chierchia, Giovanni
Perret, Benjamin
DISCRETE GEOMETRY AND MATHEMATICAL MORPHOLOGY, DGMM 2024, 2024, 14605 : 286 - 297
[48] An End-to-End Learning Framework for Video Compression
Lu, Guo
Zhang, Xiaoyun
Ouyang, Wanli
Chen, Li
Gao, Zhiyong
Xu, Dong
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3292 - 3308
[49] End-to-End Multitask Learning With Vision Transformer
Tian, Yingjie
Bai, Kunlong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9579 - 9590
[50] SynthNet: Learning to Synthesize Music End-to-End
Schimbinschi, Florin
Walder, Christian
Erfani, Sarah M.
Bailey, James
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3367 - 3374

← 1 2 3 4 5 →