MIX: Multi-Channel Information Crossing for Text Matching

被引:33
|
作者
Chen, Haolan [1 ]
Han, Fred X. [2 ]
Niu, Di [2 ]
Liu, Dong [1 ]
Lai, Kunfeng [1 ]
Wu, Chenglin [1 ]
Xu, Yu [1 ]
机构
[1] Tencent, Mobile Internet Grp, Toronto, ON, Canada
[2] Univ Alberta, Edmonton, AB, Canada
关键词
Text matching; ad-hoc retrieval; question answering; convolutional neural networks; attention mechanism;
D O I
10.1145/3219819.3219928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short Text Matching plays an important role in many natural language processing tasks such as information retrieval, question answering, and dialogue systems. Conventional text matching methods rely on predefined templates and rules. However, for a short piece of text with a limited number of words, these rules are unable to generalize well to unobserved data. With the success of deep learning in fields like computer vision, speech recognition and recommender systems, many recent efforts have been made to apply deep neural network models to natural language processing tasks to reduce the cost of manual feature engineering. In this paper, we present the design of Multi-Channel Information Crossing (MIX), a multi-channel convolutional neural network (CNN) model for text matching in a production environment, with additional attention mechanisms on sentences and semantic features. MIX compares text snippets at varied granularities to form a series of multi-channel similarity matrices, which are then crossed with another set of carefully designed attention matrices to expose the rich structures of sentences to deep neural networks. We implemented MIX and deployed the system on Tencent's Venus distributed computation platform. Thanks to the well-engineered multi-channel information crossing, evaluation results suggest that MIX outperforms a wide range of state-of-the-art deep neural network models by at least 11.1% in terms of the normalized discounted cumulative gain (NDCG@3), on the English WikiQA dataset. Moreover, we performed online A/B tests with real users on the search service of Tencent QQ Browser. Results show that MIX raised the number of clicks on the returned results by 5.7%, due to the increased accuracy in query-document matching, which demonstrates the superior performance of MIX in a real-world production environment.
引用
收藏
页码:110 / 119
页数:10
相关论文
共 50 条
  • [1] Scene Text Detection by Leveraging Multi-channel Information and Local Context
    Wang, Runmin
    Qian, Shengyou
    Yang, Jianfeng
    Gao, Changxin
    [J]. MIPPR 2017: PATTERN RECOGNITION AND COMPUTER VISION, 2017, 10609
  • [2] Efficient template matching for multi-channel images
    Mattoccia, Stefano
    Tombari, Federico
    Di Stefano, Luigi
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (05) : 694 - 700
  • [3] Multi-channel Adaptive Information Systems
    Paolo Atzeni
    Tiziana Catarci
    Barbara Pernici
    [J]. World Wide Web, 2007, 10 : 345 - 347
  • [4] Multi-channel adaptive information systems
    Atzeni, Paolo
    Catarci, Tiziana
    Pernici, Barbara
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2007, 10 (04): : 345 - 347
  • [5] Deterministic multi-channel information exchange
    Holzer, Stephan
    Locher, Thomas
    Pignolet, Yvonne Anne
    Wattenhofer, Roger
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2017, 87 : 84 - 103
  • [6] Multi-channel curve crossing problems: analytically solvable model
    Chakraborty, Aniruddha
    [J]. MOLECULAR PHYSICS, 2009, 107 (23-24) : 2459 - 2465
  • [7] Multi-channel speed converter for navigation information
    Piechowski, Leszek
    Iwaszkiewicz, Jan
    Muc, Adam
    Lewczuk, Michal
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2023, 99 (09): : 238 - 241
  • [8] Architectural issues in multi-channel information systems
    Adorni, M
    Arcelli, F
    Raibulet, C
    Tisato, F
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 1 - 4
  • [9] Multi-Channel Information Operations on Quantum Images
    Sun, Bo
    Iliyasu, Abdullah M.
    Yan, Fei
    Sanchez, Jesus A. Garcia
    Dong, Fangyan
    Al-Asmari, Awad Kh
    Hirota, Kaoru
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2014, 18 (02) : 140 - 149
  • [10] Age of Information Optimization in Multi-Channel Network With Sided Information
    Chen, Yuchao
    Wang, Jintao
    Wang, Xiaoqing
    Song, Jian
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (03) : 1030 - 1034