Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

被引:413
|
作者
Severyn, Aliaksei [1 ]
Moschitti, Alessandro [2 ]
机构
[1] Google Inc, Zurich, Switzerland
[2] Qatar Comp Res Inst, Doha, Qatar
关键词
Convolutional neural networks; learning to rank; Question Answering; Microblog search;
D O I
10.1145/2766462.2767738
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Learning a similarity function between pairs of objects is at the core of learning to rank approaches. In information retrieval tasks we typically deal with query-document pairs, in question answering question-answer pairs. However, before learning can take place, such pairs needs to be mapped from the original space of symbolic words into some feature space encoding various aspects of their relatedness, e.g. lexical, syntactic and semantic. Feature engineering is often a laborious task and may require external knowledge sources that are not always available or difficult to obtain. Recently, deep learning approaches have gained a lot of attention from the research community and industry for their ability to automatically learn optimal feature representation for a given task, while claiming state-of-the-art performance in many tasks in computer vision, speech recognition and natural language processing. In this paper, we present a convolutional neural network architecture for reranking pairs of short texts, where we learn the optimal representation of text pairs and a similarity function to relate them in a supervised way from the available training data. Our network takes only words in the input, thus requiring minimal preprocessing. In particular, we consider the task of reranking short text pairs where elements of the pair are sentences. We test our deep learning system on two popular retrieval tasks from TREC: Question Answering and Microblog Retrieval. Our model demonstrates strong performance on the first task beating previous state-of-the-art systems by about 3% absolute points in both MAP and MRR and shows comparable results on tweet reranking, while enjoying the benefits of no manual feature engineering and no additional syntactic parsers.
引用
收藏
页码:373 / 382
页数:10
相关论文
共 50 条
  • [41] WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks
    Durand, Thibaut
    Thome, Nicolas
    Cord, Matthieu
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4743 - 4752
  • [42] Learning Deep Graph Representations via Convolutional Neural Networks
    Ye, Wei
    Askarisichani, Omid
    Jones, Alex
    Singh, Ambuj
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2268 - 2279
  • [43] Curriculum Learning for Depth Estimation with Deep Convolutional Neural Networks
    Surendranath, Ajay
    Jayagopi, Dinesh Babu
    [J]. PROCEEDINGS OF THE 2ND MEDITERRANEAN CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (MEDPRAI-2018), 2018, : 95 - 100
  • [44] Deep Learning Convolutional Neural Networks with Dropout - a Parallel Approach
    Shen, Jingyi
    Shafiq, M. Omair
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 572 - 577
  • [45] Deep Learning With Convolutional Neural Networks for EEG Decoding and Visualization
    Schirrmeister, Robin Tibor
    Springenberg, Jost Tobias
    Fiederer, Lukas Dominique Josef
    Glasstetter, Martin
    Eggensperger, Katharina
    Tangermann, Michael
    Hutter, Frank
    Burgard, Wolfram
    Ball, Tonio
    [J]. HUMAN BRAIN MAPPING, 2017, 38 (11) : 5391 - 5420
  • [46] Deep Learning With Convolutional Neural Networks for Sleep Arousal Detection
    Jia, Dongya
    Yu, Shengfeng
    Yan, Cong
    Zhao, Wei
    Hu, Jing
    Wang, Hongmei
    You, Tianyuan
    [J]. 2018 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2018, 45
  • [47] Learning Deep Movement Primitives using Convolutional Neural Networks
    Pervez, Affan
    Mao, Yuecheng
    Lee, Dongheui
    [J]. 2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 191 - 197
  • [48] Layer Removal for Transfer Learning with Deep Convolutional Neural Networks
    Zhi, Weiming
    Chen, Zhenghao
    Yueng, Henry Wing Fung
    Lu, Zhicheng
    Zandavi, Seid Miad
    Chung, Yuk Ying
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 460 - 469
  • [49] Deep Convolutional Neural Networks
    Gonzalez, Rafael C.
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (06) : 79 - 87
  • [50] Chinese Short Text Classification with Mutual-Attention Convolutional Neural Networks
    Hao, Ming
    Xu, Bo
    Liang, Jing-Yi
    Zhang, Bo-Wen
    Yin, Xu-Cheng
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)