Semantic-Enhanced Differentiable Search Index Inspired by Learning Strategies

被引:7
|
作者
Tang, Yubao [1 ]
Zhang, Ruqing [1 ]
Guo, Jiafeng [1 ]
Chen, Jiangui [1 ]
Zhu, Zuowei [2 ]
Wang, Shuaiqiang [2 ]
Yin, Dawei [2 ]
Cheng, Xueqi [1 ]
机构
[1] Univ Chinese Acad Sci, CAS, ICT, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
DSI; Elaboration Strategies; Rehearsal Strategies;
D O I
10.1145/3580305.3599903
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, a new paradigm called Differentiable Search Index (DSI) has been proposed for document retrieval, wherein a sequence-to-sequence model is learned to directly map queries to relevant document identifiers. The key idea behind DSI is to fully parameterize traditional "index-retrieve" pipelines within a single neural model, by encoding all documents in the corpus into the model parameters. In essence, DSI needs to resolve two major questions: (1) how to assign an identifier to each document, and (2) how to learn the associations between a document and its identifier. In this work, we propose a Semantic-Enhanced DSI model (SE-DSI) motivated by Learning Strategies in the area of Cognitive Psychology. Our approach advances original DSI in two ways: (1) For the document identifier, we take inspiration from Elaboration Strategies in human learning. Specifically, we assign each document an Elaborative Description based on the query generation technique, which is more meaningful than a string of integers in the original DSI; and (2) For the associations between a document and its identifier, we take inspiration from Rehearsal Strategies in human learning. Specifically, we select fine-grained semantic features from a document as Rehearsal Contents to improve document memorization.
引用
收藏
页码:4904 / 4913
页数:10
相关论文
共 50 条
  • [1] Semantic-enhanced information search and retrieval
    Wei, Wang
    Barnaghi, Payam M.
    Bargiela, Andrzej
    [J]. ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 218 - +
  • [2] Semantic-enhanced spatial keyword search
    Han, Jun
    Fan, Ju
    Zhou, Lizhu
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (09): : 1954 - 1964
  • [3] Machine learning in the Internet of Things: A semantic-enhanced approach
    Ruta, Michele
    Scioscia, Floriano
    Loseto, Giuseppe
    Pinto, Agnese
    Di Sciascio, Eugenio
    [J]. SEMANTIC WEB, 2019, 10 (01) : 183 - 204
  • [4] Semantic-Enhanced Image Clustering
    Cai, Shaotian
    Qiu, Liping
    Chen, Xiaojun
    Zhang, Qin
    Chen, Longteng
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6869 - 6878
  • [5] Semantic-enhanced Contrastive Learning for Session-based Recommendation
    Liu, Zhicheng
    Wang, Yulong
    Liu, Tongcun
    Zhang, Lei
    Li, Wei
    Liao, Jianxin
    He, Ding
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [6] Semantic-enhanced discriminative embedding learning for cross-modal retrieval
    Pan, Hao
    Huang, Jun
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 369 - 382
  • [7] Semantic-enhanced discriminative embedding learning for cross-modal retrieval
    Hao Pan
    Jun Huang
    [J]. International Journal of Multimedia Information Retrieval, 2022, 11 : 369 - 382
  • [8] Semantic-enhanced Recommendation of Video Lectures
    Macedo Borges, Marcos Vinicius
    dos Reis, Julio Cesar
    [J]. 2019 IEEE 19TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2019), 2019, : 42 - 46
  • [9] Semantic-enhanced distribution & adaptation networks
    Shen, B
    Xu, ZC
    Wee, S
    Apostolopoulos, J
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 2135 - 2138
  • [10] Semantic-enhanced distribution & adaptation networks
    Shen, B
    Xu, ZC
    Wee, S
    Apostolopoulos, J
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1383 - 1386