ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

被引:0
|
作者
Gupta, Nilesh [1 ]
Chen, Patrick H. [2 ]
Hsiang-Fu Yu [3 ]
Cho-Jui Hsieh [2 ]
Dhillon, Inderjit S. [1 ,4 ]
机构
[1] UT Austin, Austin, TX 78712 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[3] Amazon, Seattle, WA USA
[4] Google, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme multi-label classification (XMC) is a popular framework for solving many real-world problems that require accurate prediction from a very large number of potential output choices. A popular approach for dealing with the large label space is to arrange the labels into a shallow tree-based index and then learn an ML model to efficiently search this index via beam search. Existing methods initialize the tree index by clustering the label space into a few mutually exclusive clusters based on pre-defined features and keep it fixed throughout the training procedure. This approach results in a sub-optimal indexing structure over the label space and limits the search performance to the quality of choices made during the initialization of the index. In this paper, we propose a novel method ELIAS which relaxes the tree-based index to a specialized weighted graph-based index which is learned end-to-end with the final task objective. More specifically, ELIAS models the discrete cluster-to-label assignments in the existing tree-based index as soft learnable parameters that are learned jointly with the rest of the ML model. ELIAS achieves state-of-the-art performance on several large-scale extreme classification benchmarks with millions of labels. In particular, ELIAS can be up to 2.5% better at precision@1 and up to 4% better at recall@100 than existing XMC methods. A PyTorch implementation of ELIAS along with other resources is available at https://github.com/nilesh2797/ELIAS.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Joint discriminative representation learning for end-to-end person search
    Zhang, Pengcheng
    Yu, Xiaohan
    Bai, Xiao
    Wang, Chen
    Zheng, Jin
    Ning, Xin
    PATTERN RECOGNITION, 2024, 147
  • [2] Learning Scene-Pedestrian Graph for End-to-End Person Search
    Song, Zifan
    Zhao, Cairong
    Hu, Guosheng
    Miao, Duoqian
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (02) : 2979 - 2990
  • [3] End-to-End Latent Fingerprint Search
    Cao, Kai
    Dinh-Luan Nguyen
    Tymoszek, Cori
    Jain, Anil K.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 (15) : 880 - 894
  • [4] End-to-End Incremental Learning
    Castro, Francisco M.
    Marin-Jimenez, Manuel J.
    Guil, Nicolas
    Schmid, Cordelia
    Alahari, Karteek
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
  • [5] END-TO-END TRAINING OF A LARGE VOCABULARY END-TO-END SPEECH RECOGNITION SYSTEM
    Kim, Chanwoo
    Kim, Sungsoo
    Kim, Kwangyoun
    Kumar, Mehul
    Kim, Jiyeon
    Lee, Kyungmin
    Han, Changwoo
    Garg, Abhinav
    Kim, Eunhyang
    Shin, Minkyoo
    Singh, Shatrughan
    Heck, Larry
    Gowda, Dhananjaya
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 562 - 569
  • [6] Learning an End-to-End Structure for Retrieval in Large-Scale Recommendations
    Gao, Weihao
    Fan, Xiangjun
    Wang, Chong
    Sun, Jiankai
    Jia, Kai
    Xiao, Wenzhi
    Ding, Ruofan
    Bin, Xingyan
    Yang, Hui
    Liu, Xiaobing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 524 - 533
  • [7] Sequential Transformer for End-to-End Person Search
    Chen, Long
    Xu, Jinhua
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT IV, 2024, 14450 : 226 - 238
  • [8] End-to-End Open Vocabulary Keyword Search
    Yusuf, Bolaji
    Gok, Alican
    Gundogdu, Batuhan
    Saraclar, Murat
    INTERSPEECH 2021, 2021, : 4388 - 4392
  • [9] Cascade Transformers for End-to-End Person Search
    Yu, Rui
    Du, Dawei
    LaLonde, Rodney
    Davila, Daniel
    Funk, Christopher
    Hoogs, Anthony
    Clipp, Brian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7257 - 7266
  • [10] Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model
    Pradeep, Ronak
    Lin, Jimmy
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 78 - 86