Sentence Matching with Deep Self-attention and Co-attention Features

Cited by: 1
Authors
Wang, Zhipeng [1]
Yan, Danfeng [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
Keywords
Sentence matching; Natural language processing; Neural network; Attention mechanism;
DOI
10.1007/978-3-030-82147-0_45
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentence matching refers to extracting the semantic relation between two sentences. It is widely applied in natural language processing tasks such as natural language inference, paraphrase identification, and question answering. Many previous methods apply a siamese network to capture semantic features and compute cosine similarity to represent the relation between the sentences. However, such methods capture the overall, coarse-grained semantics of a sentence but not enough word-level matching information. In this paper, we propose a novel attention-based neural network that focuses on learning richer interactive features of the two sentences. The model has two complementary components: a semantic encoder and an interactive encoder. The interactive encoder compares the sentences' semantic features, which are produced by the semantic encoder. In turn, the semantic encoder takes the output of the interactive encoder as supplementary matching features. Experiments on three benchmark datasets show that the self-attention network and the cross-attention network can efficiently learn the semantic and interactive features of sentences, and that the model achieves state-of-the-art results.
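The contrast the abstract draws, between a single sentence-level cosine score and word-level interaction, can be illustrated with a minimal sketch. This is not the authors' architecture: the function names (`cosine`, `mean_pool`, `co_attend`), the use of mean pooling, the dot-product scoring, and the toy 2-dimensional token vectors are all assumptions made for illustration only.

```python
import math

def cosine(u, v):
    # Cosine similarity between two vectors of equal length.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def mean_pool(tokens):
    # Collapse a list of token vectors into one sentence vector (assumed pooling).
    n = len(tokens)
    return [sum(col) / n for col in zip(*tokens)]

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def co_attend(a_tokens, b_tokens):
    # For each token in sentence A, build a softmax-weighted average of
    # sentence B's tokens, using dot-product scores (assumed scoring function).
    aligned = []
    for a in a_tokens:
        scores = [sum(x * y for x, y in zip(a, b)) for b in b_tokens]
        weights = softmax(scores)
        aligned.append(
            [sum(w * b[k] for w, b in zip(weights, b_tokens)) for k in range(len(a))]
        )
    return aligned

# Toy token vectors (hypothetical, not taken from any dataset).
sent_a = [[1.0, 0.0], [0.0, 1.0]]
sent_b = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]

# Siamese-style comparison: one score for the whole sentence pair.
sentence_score = cosine(mean_pool(sent_a), mean_pool(sent_b))

# Co-attention: one B-aware vector per token of A, keeping word-level detail.
aligned_a = co_attend(sent_a, sent_b)
```

In this toy case the pooled vectors are parallel, so the sentence-level score is essentially 1 even though the two sentences contain different tokens, while `aligned_a` still differs from token to token. That per-token signal is the kind of word-level matching information the abstract says a pooled cosine score misses.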
Pages: 550-561
Number of pages: 12