Automatic Pitch Accent Detection Using Long Short-Term Memory Neural Networks

被引:2
|
作者
Wu, Yizhi [1 ]
Li, Sha [1 ]
Li, Hongyan [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, 2999 Renmin Rd North, Shanghai, Peoples R China
关键词
Pitch accent detection; LSTM; lexical and syntactic features; acoustic features;
D O I
10.1145/3364908.3365291
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prosody detection is gaining increasingly popularity in the domain of prosody research because of its significance in Text to Sound, Computer-aided pronunciation training (CAPT), etc. Pitch accent is an important part of prosody and many recognition models of both static and dynamic have been investigated for automatic labeling it. Recently, artificial neural networks, especially Recurrent Neural Networks (RNNs) have been applied in pitch accent detection. However, traditional recurrent neural networks are unable to learn and remember over long sequences due to the issue of back-propagated error decay. To solve this problem, this paper investigates the use of Long Short-Term Memory (LSTM) neural networks for automatic pitch accent detection. This paper encodes lexical and syntactic features as binary variables and uses syllable-based acoustic features including syllable duration, syllable energy, features related to the fundamental frequency. Our experimental results show that LSTM-RNNs for pitch accent detection achieves an accuracy of 89.0%, which is better than the results of using classical detection methods by about 83.2%.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [1] Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks
    Abbas, Zainab
    Al-Shishtawy, Ahmad
    Girdzijauskas, Sarunas
    Vlassov, Vladimir
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 57 - 65
  • [2] Intrusion Detection Using Multilayer Perceptron and Neural Networks with Long Short-Term Memory
    Borisenko, B. B.
    Erokhin, S. D.
    Fadeev, A. S.
    Martishin, I. D.
    [J]. 2021 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SYNCHROINFO), 2021,
  • [3] Automatic Cause Inference of Construction Accident Using Long Short-Term Memory Neural Networks
    Wu, Hengqin
    Shen, Geoffrey Qiping
    Zhou, Zhenzong
    Li, Wenpeng
    Li, Xin
    [J]. CARBON PEAK AND NEUTRALITY STRATEGIES OF THE CONSTRUCTION INDUSTRY (ICCREM 2022), 2022, : 269 - 275
  • [4] Automatic temporal segment detection via bilateral long short-term memory recurrent neural networks
    Sun, Bo
    Cao, Siming
    He, Jun
    Yu, Lejun
    Li, Liandong
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (02)
  • [5] Automatic Fall Detection Using Long Short-Term Memory Network
    Magalhaes, Carlos
    Ribeiro, Joao
    Leite, Argentina
    Pires, E. J. Solteiro
    Pavao, Joao
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 359 - 371
  • [6] Deepfake Detection using Capsule Networks and Long Short-Term Memory Networks
    Mehra, Akul
    Spreeuwers, Luuk
    Strisciuglio, Nicola
    [J]. VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 407 - 414
  • [7] Long Short-Term Memory Networks for Automatic Generation of Conversations
    Fujita, Tomohiro
    Bai, Wenjun
    Quan, Changqin
    [J]. 2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 483 - 487
  • [8] Dialog State Tracking Using Long Short-term Memory Neural Networks
    Yang, Xiaohao
    Liu, Jia
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1800 - 1804
  • [9] An Incremental Learning Approach Using Long Short-Term Memory Neural Networks
    Lemos Neto, Alvaro C.
    Coelho, Rodrigo A.
    de Castro, Cristiano L.
    [J]. JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2022, 33 (05) : 1457 - 1465
  • [10] Deflated reputation using multiplicative long short-term memory neural networks
    Ma, Yixuan
    Zhang, Zhenji
    Li, Deming
    Tang, Mincong
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 118 : 198 - 207