Transformer Memory as a Differentiable Search Index

Authors
Tay, Yi [1]
Tran, Vinh Q. [1]
Dehghani, Mostafa [1]
Ni, Jianmo [1]
Bahri, Dara [1]
Mehta, Harsh [1]
Qin, Zhen [1]
Hui, Kai [1]
Zhao, Zhe [1]
Gupta, Jai [1]
Schuster, Tal [1]
Cohen, William W. [1]
Metzler, Donald [1]
Affiliations
[1] Google Res, Mountain View, CA 94043 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we demonstrate that information retrieval can be accomplished with a single Transformer, in which all information about the corpus is encoded in the parameters of the model. To this end, we introduce the Differentiable Search Index (DSI), a new paradigm that learns a text-to-text model that maps string queries directly to relevant docids; in other words, a DSI model answers queries directly using only its parameters, dramatically simplifying the whole retrieval process. We study variations in how documents and their identifiers are represented, variations in training procedures, and the interplay between models and corpus sizes. Experiments demonstrate that given appropriate design choices, DSI significantly outperforms strong baselines such as dual encoder models. Moreover, DSI demonstrates strong generalization capabilities, outperforming a BM25 baseline in a zero-shot setup.
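The text-to-text framing described in the abstract can be illustrated with a minimal sketch of how training pairs for such a model might be assembled. This is not the authors' implementation: the function name `build_dsi_examples`, the `index:` and `query:` prefixes, the truncation length, and the toy corpus are all hypothetical, chosen only to show document text and query strings both being mapped to docid strings as sequence-to-sequence targets.

```python
from typing import Dict, List, Tuple


def build_dsi_examples(
    corpus: Dict[str, str],
    queries: List[Tuple[str, str]],
    max_doc_tokens: int = 32,
) -> List[Tuple[str, str]]:
    """Return (input_text, target_text) pairs for a text-to-text model.

    Hypothetical helper: the prefixes and truncation length are assumptions
    made for illustration, not details taken from the abstract.
    """
    examples: List[Tuple[str, str]] = []

    # Indexing pairs: document text -> docid string, so that the corpus
    # contents end up encoded in the model parameters.
    for docid, text in corpus.items():
        truncated = " ".join(text.split()[:max_doc_tokens])
        examples.append((f"index: {truncated}", docid))

    # Retrieval pairs: query string -> docid string of a relevant document.
    for query, docid in queries:
        examples.append((f"query: {query}", docid))

    return examples


# Toy data, invented for this sketch.
corpus = {
    "101": "Transformers encode sequences with self-attention.",
    "102": "BM25 ranks documents by term frequency statistics.",
}
queries = [("how does self-attention work", "101")]

examples = build_dsi_examples(corpus, queries)
for source, target in examples:
    print(f"{source!r} -> {target!r}")
```

Under these assumptions, any sequence-to-sequence Transformer could be trained on the resulting pairs; at query time the model would decode a docid string directly from the query, with no separate index structure.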
Pages: 13