Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

Cited by: 413
Authors
Severyn, Aliaksei [1]
Moschitti, Alessandro [2]
Affiliations
[1] Google Inc, Zurich, Switzerland
[2] Qatar Comp Res Inst, Doha, Qatar
Keywords
Convolutional neural networks; Learning to rank; Question answering; Microblog search
DOI
10.1145/2766462.2767738
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Learning a similarity function between pairs of objects is at the core of learning to rank approaches. In information retrieval tasks we typically deal with query-document pairs, and in question answering with question-answer pairs. However, before learning can take place, such pairs need to be mapped from the original space of symbolic words into some feature space encoding various aspects of their relatedness, e.g. lexical, syntactic and semantic. Feature engineering is often a laborious task and may require external knowledge sources that are not always available or are difficult to obtain. Recently, deep learning approaches have gained a lot of attention from the research community and industry for their ability to automatically learn optimal feature representations for a given task, while claiming state-of-the-art performance in many tasks in computer vision, speech recognition and natural language processing. In this paper, we present a convolutional neural network architecture for reranking pairs of short texts, where we learn the optimal representation of text pairs and a similarity function to relate them in a supervised way from the available training data. Our network takes only words as input, thus requiring minimal preprocessing. In particular, we consider the task of reranking short text pairs where the elements of the pair are sentences. We test our deep learning system on two popular retrieval tasks from TREC: Question Answering and Microblog Retrieval. Our model demonstrates strong performance on the first task, beating previous state-of-the-art systems by about 3% absolute points in both MAP and MRR, and shows comparable results on tweet reranking, while enjoying the benefits of no manual feature engineering and no additional syntactic parsers.
Pages: 373 - 382
Page count: 10
Related papers
50 in total
  • [1] Combining Knowledge with Deep Convolutional Neural Networks for Short Text Classification
    Wang, Jin
    Wang, Zhongyuan
    Zhang, Dawei
    Yan, Jun
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017 : 2915 - 2921
  • [2] Deep Pyramid Convolutional Neural Networks for Text Categorization
    Johnson, Rie
    Zhang, Tong
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017 : 562 - 570
  • [3] Investigation on the Chinese Text Sentiment Analysis Based on Convolutional Neural Networks in Deep Learning
    Xu, Feng
    Zhang, Xuefen
    Xin, Zhanhong
    Yang, Alan
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 58 (03) : 697 - 709
  • [4] Rank-based pooling for deep convolutional neural networks
    Shi, Zenglin
    Ye, Yangdong
    Wu, Yunpeng
    [J]. NEURAL NETWORKS, 2016, 83 : 21 - 31
  • [5] Short Text Classification With A Convolutional Neural Networks Based Method
    Hu, Yibo
    Li, Yang
    Yang, Tao
    Pan, Quan
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018 : 1432 - 1435
  • [6] Squeezed Very Deep Convolutional Neural Networks for Text Classification
    Duque, Andrea B.
    Santos, Lua Lazaro J.
    Macedo, David
    Zanchettin, Cleber
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 193 - 207
  • [7] Text Classification and Transfer Learning Based on Character-Level Deep Convolutional Neural Networks
    Sato, Minato
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    [J]. AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART 2017), 2018, 10839 : 62 - 81
  • [8] Deep compression of convolutional neural networks with low-rank approximation
    Astrid, Marcella
    Lee, Seung-Ik
    [J]. ETRI JOURNAL, 2018, 40 (04) : 421 - 434
  • [9] Learning to rank influential nodes in complex networks via convolutional neural networks
    Ahmad, Waseem
    Wang, Bang
    Chen, Si
    [J]. APPLIED INTELLIGENCE, 2024, 54 (04) : 3260 - 3278