Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

被引：413

作者：

Severyn, Aliaksei ^{[1
]}

Moschitti, Alessandro ^{[2
]}

机构：

[1] Google Inc, Zurich, Switzerland

[2] Qatar Comp Res Inst, Doha, Qatar

来源：

SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2015年

关键词：

Convolutional neural networks; learning to rank; Question Answering; Microblog search;

D O I：

10.1145/2766462.2767738

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Learning a similarity function between pairs of objects is at the core of learning to rank approaches. In information retrieval tasks we typically deal with query-document pairs, in question answering question-answer pairs. However, before learning can take place, such pairs needs to be mapped from the original space of symbolic words into some feature space encoding various aspects of their relatedness, e.g. lexical, syntactic and semantic. Feature engineering is often a laborious task and may require external knowledge sources that are not always available or difficult to obtain. Recently, deep learning approaches have gained a lot of attention from the research community and industry for their ability to automatically learn optimal feature representation for a given task, while claiming state-of-the-art performance in many tasks in computer vision, speech recognition and natural language processing. In this paper, we present a convolutional neural network architecture for reranking pairs of short texts, where we learn the optimal representation of text pairs and a similarity function to relate them in a supervised way from the available training data. Our network takes only words in the input, thus requiring minimal preprocessing. In particular, we consider the task of reranking short text pairs where elements of the pair are sentences. We test our deep learning system on two popular retrieval tasks from TREC: Question Answering and Microblog Retrieval. Our model demonstrates strong performance on the first task beating previous state-of-the-art systems by about 3% absolute points in both MAP and MRR and shows comparable results on tweet reranking, while enjoying the benefits of no manual feature engineering and no additional syntactic parsers.

引用

页码：373 / 382

页数：10

共 50 条

[41] WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks
Durand, Thibaut
Thome, Nicolas
Cord, Matthieu
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4743 - 4752
[42] Learning Deep Graph Representations via Convolutional Neural Networks
Ye, Wei
Askarisichani, Omid
Jones, Alex
Singh, Ambuj
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2268 - 2279
[43] Curriculum Learning for Depth Estimation with Deep Convolutional Neural Networks
Surendranath, Ajay
Jayagopi, Dinesh Babu
[J]. PROCEEDINGS OF THE 2ND MEDITERRANEAN CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (MEDPRAI-2018), 2018, : 95 - 100
[44] Deep Learning Convolutional Neural Networks with Dropout - a Parallel Approach
Shen, Jingyi
Shafiq, M. Omair
[J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 572 - 577
[45] Deep Learning With Convolutional Neural Networks for EEG Decoding and Visualization
Schirrmeister, Robin Tibor
Springenberg, Jost Tobias
Fiederer, Lukas Dominique Josef
Glasstetter, Martin
Eggensperger, Katharina
Tangermann, Michael
Hutter, Frank
Burgard, Wolfram
Ball, Tonio
[J]. HUMAN BRAIN MAPPING, 2017, 38 (11) : 5391 - 5420
[46] Deep Learning With Convolutional Neural Networks for Sleep Arousal Detection
Jia, Dongya
Yu, Shengfeng
Yan, Cong
Zhao, Wei
Hu, Jing
Wang, Hongmei
You, Tianyuan
[J]. 2018 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2018, 45
[47] Learning Deep Movement Primitives using Convolutional Neural Networks
Pervez, Affan
Mao, Yuecheng
Lee, Dongheui
[J]. 2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 191 - 197
[48] Layer Removal for Transfer Learning with Deep Convolutional Neural Networks
Zhi, Weiming
Chen, Zhenghao
Yueng, Henry Wing Fung
Lu, Zhicheng
Zandavi, Seid Miad
Chung, Yuk Ying
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 460 - 469
[49] Deep Convolutional Neural Networks
Gonzalez, Rafael C.
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (06) : 79 - 87
[50] Chinese Short Text Classification with Mutual-Attention Convolutional Neural Networks
Hao, Ming
Xu, Bo
Liang, Jing-Yi
Zhang, Bo-Wen
Yin, Xu-Cheng
[J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)

← 1 2 3 4 5 →