AHNN: An Attention-Based Hybrid Neural Network for Sentence Modeling

被引:1
|
作者
Zhang, Xiaomin [1 ]
Huang, Li [1 ]
Qu, Hong [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610054, Sichuan, Peoples R China
基金
美国国家科学基金会;
关键词
Nature Language Processing (NLP); Sentence modeling; News Headline Categorization; Convolutional neural networks; Recurrent neural networks;
D O I
10.1007/978-3-319-73618-1_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNNs) are powerful models that achieved excellent performance on many fields, especially in Nature Language Processing (NLP). Convolutional neural networks (CNN) and Recurrent neural networks (RNN) are two mainstream architectures of DNNs, are wildly explored to handle NLP tasks. However, those two type models adopt totally different ways to work. CNN is supposed to be good at capturing local features while RNN is considered to be able to summarize global information. In this paper, we combine the strengths of both architectures and propose a hybird model AHNN: Attention-based hybrid Neural Network, and use it in sentence modeling study. The AHNN utilizes attention based bidirectional dynamic lstm to obtain a better representation of global sentence information, then uses a parallel convolutional layer which has three different size filters and a max pooling layer to obtain significant local information. Finally, the two results are used together to feed into an expert layer to obtain results. Experiments show that the proposed architecture AHNN is able to summarize the context of the sentence and capture significant local features of sentence which is important for sentence modeling. We evaluate the proposed architecture AHNN on NLPCC News Headline Categorization test set and achieve 0.8098 test accuracy, it is a competitive performance compare with other teams in this task.
引用
收藏
页码:731 / 740
页数:10
相关论文
共 50 条
  • [31] Attention-Based Convolutional Neural Network for Earthquake Event Classification
    Ku, Bonhwa
    Kim, Gwantae
    Ahn, Jae-Kwang
    Lee, Jimin
    Ko, Hanseok
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2057 - 2061
  • [32] Attention-based convolutional neural network for deep face recognition
    Hefei Ling
    Jiyang Wu
    Junrui Huang
    Jiazhong Chen
    Ping Li
    Multimedia Tools and Applications, 2020, 79 : 5595 - 5616
  • [33] Attention-based convolutional neural network for Bangla sentiment analysis
    Sadia Sharmin
    Danial Chakma
    AI & SOCIETY, 2021, 36 : 381 - 396
  • [34] Attention-based novel neural network for mixed frequency data
    Li, Xiangpeng
    Yu, Hong
    Xie, Yongfang
    Li, Jie
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (03) : 301 - 311
  • [35] Interpretable clinical prediction via attention-based neural network
    Chen, Peipei
    Dong, Wei
    Wang, Jinliang
    Lu, Xudong
    Kaymak, Uzay
    Huang, Zhengxing
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
  • [36] Attention-Based Graph Neural Network for Molecular Solubility Prediction
    Ahmad, Waciar
    Tayara, Hilal
    Chong, Kil To
    ACS OMEGA, 2023, 8 (03): : 3236 - 3244
  • [37] Multitask Attention-Based Neural Network for Intraoperative Hypotension Prediction
    Shi, Meng
    Zheng, Yu
    Wu, Youzhen
    Ren, Quansheng
    BIOENGINEERING-BASEL, 2023, 10 (09):
  • [38] ATTENTION-BASED NEURAL NETWORK FOR JOINT DIARIZATION AND SPEAKER EXTRACTION
    Chazan, Shlomo E.
    Gannot, Sharon
    Goldberger, Jacob
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 301 - 305
  • [39] Attention-based convolutional neural network for Bangla sentiment analysis
    Sharmin, Sadia
    Chakma, Danial
    AI & SOCIETY, 2021, 36 (01) : 381 - 396
  • [40] Deep Attention-based Neural Network for Electricity Theft Detection
    Zhang, Yufan
    Ji, Yugang
    Xiao, Ding
    PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 154 - 157