Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

被引:0
|
作者
Huang, Ting [1 ]
Shen, Gehui [1 ]
Zhi-Hong Deng [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept, Minist Educ, Beijing, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent Neural Networks (RNNs) are widely used in the field of natural language processing (NLP), ranging from text categorization to question answering and machine translation. However, RNNs generally read the whole text from beginning to end or vice versa sometimes, which makes it inefficient to process long texts. When reading a long document for a categorization task, such as topic categorization, large quantities of words are irrelevant and can be skipped. To this end, we propose Leap-LSTM, an LSTM-enhanced model which dynamically leaps between words while reading texts. At each step, we utilize several feature encoders to extract messages from preceding texts, following texts and the current word, and then determine whether to skip the current word. We evaluate Leap-LSTM on several text categorization tasks: sentiment analysis, news categorization, ontology classification and topic classification, with five benchmark data sets. The experimental results show that our model reads faster and predicts better than standard LSTM. Compared to previous models which can also skip words, our model achieves better trade-offs between performance and efficiency.
引用
收藏
页码:5017 / 5023
页数:7
相关论文
共 50 条
  • [31] Long Short-Term Memory Description and its Application in Text Processing
    Skovajsova, Lenka
    [J]. 2017 COMMUNICATION AND INFORMATION TECHNOLOGIES (KIT), 2017, : 136 - 139
  • [32] A method of automatic text summarisation based on long short-term memory
    Fang, Wei
    Jiang, TianXiao
    Jiang, Ke
    Zhang, Feihong
    Ding, Yewen
    Sheng, Jack
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 22 (01) : 39 - 49
  • [33] Emotion Detection in Text using Nested Long Short-Term Memory
    Haryadi, Daniel
    Kusuma, Gede Putra
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (06) : 351 - 357
  • [34] Emotion detection in text using nested Long Short-Term Memory
    Haryadi, Daniel
    Kusuma, Gede Putra
    [J]. International Journal of Advanced Computer Science and Applications, 2019, 10 (06): : 351 - 357
  • [35] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [36] MFOA-Bi-LSTM: An optimized bidirectional long short-term memory model for short-term traffic flow prediction
    Naheliya, Bharti
    Redhu, Poonam
    Kumar, Kranti
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 634
  • [37] Control of an active suspension system based on long short-term memory (LSTM) learning
    Dridi, Issam
    Hamza, Anis
    Ben Yahia, Noureddine
    [J]. ADVANCES IN MECHANICAL ENGINEERING, 2023, 15 (02)
  • [38] Well performance prediction based on Long Short-Term Memory (LSTM) neural network
    Huang, Ruijie
    Wei, Chenji
    Wang, Baohua
    Yang, Jian
    Xu, Xin
    Wu, Suwei
    Huang, Suqi
    [J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 208
  • [39] Brain tumor detection: a long short-term memory (LSTM)-based learning model
    Javaria Amin
    Muhammad Sharif
    Mudassar Raza
    Tanzila Saba
    Rafiq Sial
    Shafqat Ali Shad
    [J]. Neural Computing and Applications, 2020, 32 : 15965 - 15973