Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

Cited by: 413
Authors
Severyn, Aliaksei [1]
Moschitti, Alessandro [2]
Affiliations
[1] Google Inc, Zurich, Switzerland
[2] Qatar Comp Res Inst, Doha, Qatar
Keywords
Convolutional neural networks; learning to rank; Question Answering; Microblog search
DOI
10.1145/2766462.2767738
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Learning a similarity function between pairs of objects is at the core of learning to rank approaches. In information retrieval tasks we typically deal with query-document pairs, and in question answering with question-answer pairs. However, before learning can take place, such pairs need to be mapped from the original space of symbolic words into some feature space encoding various aspects of their relatedness, e.g. lexical, syntactic and semantic. Feature engineering is often a laborious task and may require external knowledge sources that are not always available or are difficult to obtain. Recently, deep learning approaches have gained a lot of attention from the research community and industry for their ability to automatically learn an optimal feature representation for a given task, while achieving state-of-the-art performance in many tasks in computer vision, speech recognition and natural language processing. In this paper, we present a convolutional neural network architecture for reranking pairs of short texts, where we learn the optimal representation of text pairs and a similarity function to relate them in a supervised way from the available training data. Our network takes only words as input, thus requiring minimal preprocessing. In particular, we consider the task of reranking short text pairs where the elements of the pair are sentences. We test our deep learning system on two popular retrieval tasks from TREC: Question Answering and Microblog Retrieval. Our model demonstrates strong performance on the first task, beating previous state-of-the-art systems by about 3 absolute points in both MAP and MRR, and shows comparable results on tweet reranking, while enjoying the benefits of no manual feature engineering and no additional syntactic parsers.
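The abstract reports results in MAP (Mean Average Precision) and MRR (Mean Reciprocal Rank). As a rough illustration of how these two reranking metrics are commonly computed, here is a minimal sketch. It is not the authors' evaluation code: the function names are hypothetical, relevance is assumed to be binary, and average precision is assumed to be normalized by the number of relevant candidates present in each ranked list.

```python
from typing import Callable, List


def average_precision(labels: List[int]) -> float:
    """Average precision for one query.

    `labels` holds binary relevance flags (1 = relevant, 0 = not) in the
    order produced by the ranker. Assumption: every relevant candidate
    appears somewhere in the list, so we normalize by the hits found.
    """
    hits = 0
    precision_sum = 0.0
    for rank, label in enumerate(labels, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank  # precision at this cut-off
    return precision_sum / hits if hits else 0.0


def reciprocal_rank(labels: List[int]) -> float:
    """1 / rank of the first relevant candidate, or 0.0 if there is none."""
    for rank, label in enumerate(labels, start=1):
        if label:
            return 1.0 / rank
    return 0.0


def mean_metric(metric: Callable[[List[int]], float],
                rankings: List[List[int]]) -> float:
    """Average a per-query metric over all queries."""
    return sum(metric(r) for r in rankings) / len(rankings)


# Two hypothetical queries, each with three reranked candidates.
rankings = [[1, 0, 1], [0, 1, 0]]
map_score = mean_metric(average_precision, rankings)
mrr_score = mean_metric(reciprocal_rank, rankings)
```

Under these assumptions, a gain of "about 3 absolute points" would correspond to an increase of roughly 0.03 in `map_score` or `mrr_score` when both are expressed on a 0 to 1 scale.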
Pages: 373-382
Number of pages: 10