ELIAS: End-to-End Learning to Index and Search in Large Output Spaces

被引:0
|
作者
Gupta, Nilesh [1 ]
Chen, Patrick H. [2 ]
Hsiang-Fu Yu [3 ]
Cho-Jui Hsieh [2 ]
Dhillon, Inderjit S. [1 ,4 ]
机构
[1] UT Austin, Austin, TX 78712 USA
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[3] Amazon, Seattle, WA USA
[4] Google, Mountain View, CA 94043 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme multi-label classification (XMC) is a popular framework for solving many real-world problems that require accurate prediction from a very large number of potential output choices. A popular approach for dealing with the large label space is to arrange the labels into a shallow tree-based index and then learn an ML model to efficiently search this index via beam search. Existing methods initialize the tree index by clustering the label space into a few mutually exclusive clusters based on pre-defined features and keep it fixed throughout the training procedure. This approach results in a sub-optimal indexing structure over the label space and limits the search performance to the quality of choices made during the initialization of the index. In this paper, we propose a novel method ELIAS which relaxes the tree-based index to a specialized weighted graph-based index which is learned end-to-end with the final task objective. More specifically, ELIAS models the discrete cluster-to-label assignments in the existing tree-based index as soft learnable parameters that are learned jointly with the rest of the ML model. ELIAS achieves state-of-the-art performance on several large-scale extreme classification benchmarks with millions of labels. In particular, ELIAS can be up to 2.5% better at precision@1 and up to 4% better at recall@100 than existing XMC methods. A PyTorch implementation of ELIAS along with other resources is available at https://github.com/nilesh2797/ELIAS.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A survey on end-to-end point cloud learning
    Tang, Xikai
    Huang, Fangzheng
    Li, Chao
    Ban, Dayan
    IET IMAGE PROCESSING, 2023, 17 (05) : 1307 - 1321
  • [42] Machine Learning for End-to-End Congestion Control
    Zhang, Ticao
    Mao, Shiwen
    IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (06) : 52 - 57
  • [43] End-to-end deep learning with neuromorphic photonics
    Dabos, G.
    Mourgias-Alexandris, G.
    Totovic, A.
    Kirtas, M.
    Passalis, N.
    Tefas, A.
    Pleros, N.
    INTEGRATED OPTICS: DEVICES, MATERIALS, AND TECHNOLOGIES XXV, 2021, 11689
  • [44] End-to-end Learning for Encrypted Image Retrieval
    Feng, Qihua
    Li, Peiya
    Lu, ZhiXun
    Liu, Guan
    Huang, Feiran
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1839 - 1845
  • [45] End-To-End Learning for Action Quality Assessment
    Li, Yongjun
    Chai, Xiujuan
    Chen, Xilin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 125 - 134
  • [46] End-to-End Learning of Decision Trees and Forests
    Hehn, Thomas M.
    Kooij, Julian F. P.
    Hamprecht, Fred A.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (04) : 997 - 1011
  • [47] End-to-End Ultrametric Learning for Hierarchical Segmentation
    Lapertot, Raphael
    Chierchia, Giovanni
    Perret, Benjamin
    DISCRETE GEOMETRY AND MATHEMATICAL MORPHOLOGY, DGMM 2024, 2024, 14605 : 286 - 297
  • [48] An End-to-End Learning Framework for Video Compression
    Lu, Guo
    Zhang, Xiaoyun
    Ouyang, Wanli
    Chen, Li
    Gao, Zhiyong
    Xu, Dong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3292 - 3308
  • [49] End-to-End Multitask Learning With Vision Transformer
    Tian, Yingjie
    Bai, Kunlong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9579 - 9590
  • [50] SynthNet: Learning to Synthesize Music End-to-End
    Schimbinschi, Florin
    Walder, Christian
    Erfani, Sarah M.
    Bailey, James
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3367 - 3374