Language Modeling Using Part-of-speech and Long Short-Term Memory Networks

被引:0
|
作者
Norouzi, Sanaz Saki [1 ]
Akbari, Ahmad [1 ]
Nasersharif, Babak [2 ]
机构
[1] Iran Univ Sci & Technol, Comp Engn Dept, Tehran, Iran
[2] KN Toosi Univ Technol, Comp Engn Dept, Tehran, Iran
关键词
language modeling; part-of-speech; LSTM; bidirectional LSTM;
D O I
10.1109/iccke48569.2019.8964806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, neural networks have been widely used for language modeling in different tasks of natural language processing. Results show that long shortterm memory (LSTM) neural networks are appropriate for language modeling due to their ability to process long sequences. Furthermore, many studies are shown that extra information improve language models (LMs) performance. In this research, we propose parallel structures for incorporating part-of-speech tags into language modeling task using both the unidirectional and bidirectional type of LSTMs. Words and part-of-speech tags are given to the network as parallel inputs. In this way, to concatenate these two paths, two different structures are proposed according to the type of network used in the parallel part. We analyze the efficiency on Penn Treebank (PTB) dataset using perplexity measure. These two proposed structures show improvements in comparison to the baseline models. Not only does the bidirectional LSTM method gain the lowest perplexity, but it also has the lowest training parameters among our proposed methods. The perplexity of proposed structures has reduced 1.5% and %13 for unidirectional and bidirectional LSTMs, respectively.
引用
下载
收藏
页码:182 / 187
页数:6
相关论文
共 50 条
  • [31] Research on Long Short-Term Memory Networks Speech Separation Algorithm Based on Beamforming
    Lan Chaofeng
    Liu Yan
    Zhao Hongyun
    Liu Chundong
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (07) : 2531 - 2538
  • [32] Speech-to-Text Conversion in Indonesian Language Using a Deep Bidirectional Long Short-Term Memory Algorithm
    Dwijayanti, Suci
    Tami, Muhammad Abid
    Suprapto, Bhakti Yudho
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 225 - 230
  • [33] Long Short-Term Memory Neural Networks for Modeling Nonlinear Electronic Components
    Moradi, Mahvash A.
    Sadrossadat, Sayed Alireza
    Derhami, Vali
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2021, 11 (05): : 840 - 847
  • [34] Emotion Recognition From Speech and Text using Long Short-Term Memory
    Venkateswarlu, Sonagiri China
    Jeevakala, Siva Ramakrishna
    Kumar, Naluguru Udaya
    Munaswamy, Pidugu
    Pendyala, Dhanalaxmi
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (04) : 11166 - 11169
  • [35] SIGN LANGUAGE RECOGNITION WITH LONG SHORT-TERM MEMORY
    Liu, Tao
    Zhou, Wengang
    Li, Hougiang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2871 - 2875
  • [36] Forecast Customer Flow using Long Short-Term Memory Networks
    Yin, Zongming
    Zhu, Junzhang
    Zhang, Xiaofeng
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 61 - 66
  • [37] Modular Multitarget Tracking Using Long Short-Term Memory Networks
    Verma, Rishabh
    Rajesh, R.
    Easwaran, M. S.
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (10): : 751 - 754
  • [38] Using long short-term memory networks for river flow prediction
    Xu, Wei
    Jiang, Yanan
    Zhang, Xiaoli
    Li, Yi
    Zhang, Run
    Fu, Guangtao
    HYDROLOGY RESEARCH, 2020, 51 (06): : 1358 - 1376
  • [39] ICU Mortality Prediction Using Long Short-Term Memory Networks
    Mili, Manel
    Kerkeni, Asma
    Ben Abdallah, Asma
    Bedoui, Mohamed Hedi
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2022, 2022, 13756 : 242 - 251
  • [40] Part-of-Speech-Based Long Short-Term Memory Network for Learning Sentence Representations
    Zhu, Wenhao
    Yao, Tengjun
    Zhang, Wu
    Wei, Baogang
    IEEE ACCESS, 2019, 7 : 51810 - 51816