AHNN: An Attention-Based Hybrid Neural Network for Sentence Modeling

被引:1
|
作者
Zhang, Xiaomin [1 ]
Huang, Li [1 ]
Qu, Hong [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610054, Sichuan, Peoples R China
基金
美国国家科学基金会;
关键词
Nature Language Processing (NLP); Sentence modeling; News Headline Categorization; Convolutional neural networks; Recurrent neural networks;
D O I
10.1007/978-3-319-73618-1_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNNs) are powerful models that achieved excellent performance on many fields, especially in Nature Language Processing (NLP). Convolutional neural networks (CNN) and Recurrent neural networks (RNN) are two mainstream architectures of DNNs, are wildly explored to handle NLP tasks. However, those two type models adopt totally different ways to work. CNN is supposed to be good at capturing local features while RNN is considered to be able to summarize global information. In this paper, we combine the strengths of both architectures and propose a hybird model AHNN: Attention-based hybrid Neural Network, and use it in sentence modeling study. The AHNN utilizes attention based bidirectional dynamic lstm to obtain a better representation of global sentence information, then uses a parallel convolutional layer which has three different size filters and a max pooling layer to obtain significant local information. Finally, the two results are used together to feed into an expert layer to obtain results. Experiments show that the proposed architecture AHNN is able to summarize the context of the sentence and capture significant local features of sentence which is important for sentence modeling. We evaluate the proposed architecture AHNN on NLPCC News Headline Categorization test set and achieve 0.8098 test accuracy, it is a competitive performance compare with other teams in this task.
引用
收藏
页码:731 / 740
页数:10
相关论文
共 50 条
  • [21] Attention-Based Recurrent Neural Network for Sequence Labeling
    Li, Bofang
    Liu, Tao
    Zhao, Zhe
    Du, Xiaoyong
    WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 340 - 348
  • [22] Attention-based Recurrent Neural Network for Location Recommendation
    Xia, Bin
    Li, Yun
    Li, Qianmu
    Li, Tao
    2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [23] Local Pose Optimization with an Attention-based Neural Network
    Liu, Yiling
    Wang, Hesheng
    Xu, Fan
    Wang, Yong
    Chen, Weidong
    Tang, Qirong
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 3084 - 3089
  • [24] Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks
    Yuan, Yu
    Sharoff, Serge
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1858 - 1865
  • [25] Multi-Head Attention-Based Hybrid Deep Neural Network for Aeroengine Risk Assessment
    Li, Jian-Hang
    Gao, Xin-Yue
    Lu, Xiang
    Liu, Guo-Dong
    IEEE ACCESS, 2023, 11 : 113376 - 113389
  • [26] Hybrid model of a cement rotary kiln using an improved attention-based recurrent neural network
    Zheng, Jinquan
    Zhao, Liang
    Du, Wenli
    ISA TRANSACTIONS, 2022, 129 : 631 - 643
  • [27] Enhanced electroencephalogram signal classification: A hybrid convolutional neural network with attention-based feature selection
    Liu, Bao
    Wang, Yuxin
    Gao, Lei
    Cai, Zhenxin
    BRAIN RESEARCH, 2025, 1851
  • [28] Quality Prediction Modeling for Industrial Processes Using Multiscale Attention-Based Convolutional Neural Network
    Yuan, Xiaofeng
    Huang, Lingfeng
    Ye, Lingjian
    Wang, Yalin
    Wang, Kai
    Yang, Chunhua
    Gui, Weihua
    Shen, Feifan
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (05) : 2696 - 2707
  • [29] Attention-based deep neural network for driver behavior recognition
    Xiao, Weichu
    Liu, Hongli
    Ma, Ziji
    Chen, Weihong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 132 : 152 - 161
  • [30] Attention-based recurrent neural network for influenza epidemic prediction
    Zhu, Xianglei
    Fu, Bofeng
    Yang, Yaodong
    Ma, Yu
    Hao, Jianye
    Chen, Siqi
    Liu, Shuang
    Li, Tiegang
    Liu, Sen
    Guo, Weiming
    Liao, Zhenyu
    BMC BIOINFORMATICS, 2019, 20 (Suppl 18)