ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

被引：0

作者：

Gupta, Nilesh ^{[1
]}

Chen, Patrick H. ^{[2
]}

Hsiang-Fu Yu ^{[3
]}

Cho-Jui Hsieh ^{[2
]}

Dhillon, Inderjit S. ^{[1
,4
]}

机构：

[1] UT Austin, Austin, TX 78712 USA

[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA

[3] Amazon, Seattle, WA USA

[4] Google, Mountain View, CA 94043 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Extreme multi-label classification (XMC) is a popular framework for solving many real-world problems that require accurate prediction from a very large number of potential output choices. A popular approach for dealing with the large label space is to arrange the labels into a shallow tree-based index and then learn an ML model to efficiently search this index via beam search. Existing methods initialize the tree index by clustering the label space into a few mutually exclusive clusters based on pre-defined features and keep it fixed throughout the training procedure. This approach results in a sub-optimal indexing structure over the label space and limits the search performance to the quality of choices made during the initialization of the index. In this paper, we propose a novel method ELIAS which relaxes the tree-based index to a specialized weighted graph-based index which is learned end-to-end with the final task objective. More specifically, ELIAS models the discrete cluster-to-label assignments in the existing tree-based index as soft learnable parameters that are learned jointly with the rest of the ML model. ELIAS achieves state-of-the-art performance on several large-scale extreme classification benchmarks with millions of labels. In particular, ELIAS can be up to 2.5% better at precision@1 and up to 4% better at recall@100 than existing XMC methods. A PyTorch implementation of ELIAS along with other resources is available at https://github.com/nilesh2797/ELIAS.

引用

页数：12

共 50 条

[1] Joint discriminative representation learning for end-to-end person search
Zhang, Pengcheng
Yu, Xiaohan
Bai, Xiao
Wang, Chen
Zheng, Jin
Ning, Xin
PATTERN RECOGNITION, 2024, 147
[2] Learning Scene-Pedestrian Graph for End-to-End Person Search
Song, Zifan
Zhao, Cairong
Hu, Guosheng
Miao, Duoqian
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2979 - 2990
[3] End-to-End Latent Fingerprint Search
Cao, Kai
Dinh-Luan Nguyen
Tymoszek, Cori
Jain, Anil K.
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 (15) : 880 - 894
[4] End-to-End Incremental Learning
Castro, Francisco M.
Marin-Jimenez, Manuel J.
Guil, Nicolas
Schmid, Cordelia
Alahari, Karteek
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
[5] END-TO-END TRAINING OF A LARGE VOCABULARY END-TO-END SPEECH RECOGNITION SYSTEM
Kim, Chanwoo
Kim, Sungsoo
Kim, Kwangyoun
Kumar, Mehul
Kim, Jiyeon
Lee, Kyungmin
Han, Changwoo
Garg, Abhinav
Kim, Eunhyang
Shin, Minkyoo
Singh, Shatrughan
Heck, Larry
Gowda, Dhananjaya
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 562 - 569
[6] Learning an End-to-End Structure for Retrieval in Large-Scale Recommendations
Gao, Weihao
Fan, Xiangjun
Wang, Chong
Sun, Jiankai
Jia, Kai
Xiao, Wenzhi
Ding, Ruofan
Bin, Xingyan
Yang, Hui
Liu, Xiaobing
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 524 - 533
[7] Sequential Transformer for End-to-End Person Search
Chen, Long
Xu, Jinhua
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 226 - 238
[8] End-to-End Open Vocabulary Keyword Search
Yusuf, Bolaji
Gok, Alican
Gundogdu, Batuhan
Saraclar, Murat
INTERSPEECH 2021, 2021, : 4388 - 4392
[9] Cascade Transformers for End-to-End Person Search
Yu, Rui
Du, Dawei
LaLonde, Rodney
Davila, Daniel
Funk, Christopher
Hoogs, Anthony
Clipp, Brian
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7257 - 7266
[10] Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model
Pradeep, Ronak
Lin, Jimmy
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 78 - 86

← 1 2 3 4 5 →