Duplicate Question Detection based on Neural Networks and Multi-head Attention

被引:0
|
作者
Zhang, Heng [1 ]
Chen, Liangyu [1 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
关键词
deep learning; multi-head attention; ensemble learning;
D O I
10.1109/ialp48816.2019.9037671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that using only one neural network can not get a satisfied accuracy for the problem of Duplicate Question Detection. In order to break through this dilemma, different neural networks are ensembled serially to strive for better accuracy. However, many problems, such as vanishing gradient or exploding gradient, will be encountered if the depth of neural network is blindly increased. Worse, the serial integration may be poor in computational performance since it is less parallelizable and needs more time to train. To solve these problems, we use ensemble learning with treating different neural networks as individual learners, calculating in parallel, and proposing a new voting mechanism to get better detection accuracy. In addition to the classical models based on recurrent or convolutional neural network, Multi Head Attention is also integrated to reduce the correlation and the performance gap between different models. The experimental results in Quora question pairs dataset show that the accuracy of our method can reach 89.3 %.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [41] Multi-Head Attention Neural Network for Smartphone Invariant Indoor Localization
    Tiku, Saideep
    Gufran, Danish
    Pasricha, Sudeep
    [J]. 2022 IEEE 12TH INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN 2022), 2022,
  • [42] Hierarchical Gated Convolutional Networks with Multi-Head Attention for Text Classification
    Du, Haizhou
    Qian, Jingu
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2018, : 1170 - 1175
  • [43] Multi-head neural networks for simulating particle breakage dynamics
    Gupta, Abhishek
    Mishra, Barada Kanta
    [J]. THEORETICAL AND APPLIED MECHANICS LETTERS, 2024, 14 (02)
  • [44] MAGRU-IDS: A Multi-Head Attention-Based Gated Recurrent Unit for Intrusion Detection in IIoT Networks
    Ullah, Safi
    Boulila, Wadii
    Koubaa, Anis
    Ahmad, Jawad
    [J]. IEEE ACCESS, 2023, 11 : 114590 - 114601
  • [45] Multi-Head Attention with Disagreement Regularization
    Li, Jian
    Tu, Zhaopeng
    Yang, Baosong
    Lyu, Michael R.
    Zhang, Tong
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2897 - 2903
  • [46] An Ensemble of Text Convolutional Neural Networks and Multi-Head Attention Layers for Classifying Threats in Network Packets
    Kim, Hyeonmin
    Yoon, Young
    [J]. ELECTRONICS, 2023, 12 (20)
  • [47] Hybrid neural network model based on multi-head attention for English text emotion analysis
    Li, Ping
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (35)
  • [48] Multi-Head Attention-Based Hybrid Deep Neural Network for Aeroengine Risk Assessment
    Li, Jian-Hang
    Gao, Xin-Yue
    Lu, Xiang
    Liu, Guo-Dong
    [J]. IEEE ACCESS, 2023, 11 : 113376 - 113389
  • [49] Hybrid graph convolutional networks with multi-head attention for location recommendation
    Zhong, Ting
    Zhang, Shengming
    Zhou, Fan
    Zhang, Kunpeng
    Trajcevski, Goce
    Wu, Jin
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (06): : 3125 - 3151
  • [50] Hybrid graph convolutional networks with multi-head attention for location recommendation
    Ting Zhong
    Shengming Zhang
    Fan Zhou
    Kunpeng Zhang
    Goce Trajcevski
    Jin Wu
    [J]. World Wide Web, 2020, 23 : 3125 - 3151