Automatic Pitch Accent Detection Using Long Short-Term Memory Neural Networks

Cited by: 2
Authors
Wu, Yizhi [1]
Li, Sha [1]
Li, Hongyan [1]
Affiliations
[1] Donghua Univ, Coll Informat Sci & Technol, 2999 Renmin Rd North, Shanghai, Peoples R China
Keywords
Pitch accent detection; LSTM; lexical and syntactic features; acoustic features
DOI
10.1145/3364908.3365291
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Prosody detection is gaining increasing popularity in prosody research because of its significance in applications such as Text to Sound and computer-aided pronunciation training (CAPT). Pitch accent is an important part of prosody, and many recognition models, both static and dynamic, have been investigated for labeling it automatically. Recently, artificial neural networks, especially recurrent neural networks (RNNs), have been applied to pitch accent detection. However, traditional recurrent neural networks are unable to learn and remember over long sequences because the back-propagated error decays. To solve this problem, this paper investigates the use of Long Short-Term Memory (LSTM) neural networks for automatic pitch accent detection. The paper encodes lexical and syntactic features as binary variables and uses syllable-based acoustic features, including syllable duration, syllable energy, and features related to the fundamental frequency. The experimental results show that LSTM-RNNs achieve an accuracy of 89.0% for pitch accent detection, compared with about 83.2% for classical detection methods.
Pages: 41-45
Page count: 5