Automatic Pitch Accent Detection Using Long Short-Term Memory Neural Networks

被引:2
|
作者
Wu, Yizhi [1 ]
Li, Sha [1 ]
Li, Hongyan [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, 2999 Renmin Rd North, Shanghai, Peoples R China
关键词
Pitch accent detection; LSTM; lexical and syntactic features; acoustic features;
D O I
10.1145/3364908.3365291
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prosody detection is gaining increasingly popularity in the domain of prosody research because of its significance in Text to Sound, Computer-aided pronunciation training (CAPT), etc. Pitch accent is an important part of prosody and many recognition models of both static and dynamic have been investigated for automatic labeling it. Recently, artificial neural networks, especially Recurrent Neural Networks (RNNs) have been applied in pitch accent detection. However, traditional recurrent neural networks are unable to learn and remember over long sequences due to the issue of back-propagated error decay. To solve this problem, this paper investigates the use of Long Short-Term Memory (LSTM) neural networks for automatic pitch accent detection. This paper encodes lexical and syntactic features as binary variables and uses syllable-based acoustic features including syllable duration, syllable energy, features related to the fundamental frequency. Our experimental results show that LSTM-RNNs for pitch accent detection achieves an accuracy of 89.0%, which is better than the results of using classical detection methods by about 83.2%.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [21] Intelligent Islanding Detection of Microgrids Using Long Short-Term Memory Networks
    Bukhari, Syed Basit Ali
    Mehmood, Khawaja Khalid
    Wadood, Abdul
    Park, Herie
    [J]. ENERGIES, 2021, 14 (18)
  • [22] Damage Detection in a Benchmark Structure Using Long Short-term Memory Networks
    Lin, Zhiwei
    Liu, Yonggui
    Zhou, Linren
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2300 - 2305
  • [23] A Study on Automatic Detection of Sleep Spindles using a Long Short-Term Memory Network
    Yasuhara, Narumi
    Natori, Takahiro
    Hayashi, Mitsuo
    Aikawa, Naoyuki
    [J]. 2019 IEEE 62ND INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2019, : 45 - 48
  • [24] Automated Sleep Apnea Detection in Raw Respiratory Signals Using Long Short-Term Memory Neural Networks
    Van Steenkiste, Tom
    Groenendaal, Willemijn
    Deschrijver, Dirk
    Dhaene, Tom
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (06) : 2354 - 2364
  • [25] Epileptic seizure detection using 1 D-convolutional long short-term memory neural networks
    Hussain, Waqar
    Sadiq, Muhammad Tariq
    Siuly, Siuly
    Rehman, Ateeq Ur
    [J]. APPLIED ACOUSTICS, 2021, 177
  • [26] Epileptic seizure detection using 1 D-convolutional long short-term memory neural networks
    Hussain, Waqar
    Sadiq, Muhammad Tariq
    Siuly, Siuly
    Rehman, Ateeq Ur
    [J]. Applied Acoustics, 2021, 177
  • [27] Session Based Recommendations Using Recurrent Neural Networks - Long Short-Term Memory
    Dobrovolny, Michal
    Selamat, Ali
    Krejcar, Ondrej
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 53 - 65
  • [28] Classification of Antibacterial Peptides Using Long Short-Term Memory Recurrent Neural Networks
    Youmans, Michael
    Spainhour, John C. G.
    Qiu, Peng
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1134 - 1140
  • [29] Shooting sound analysis using convolutional neural networks and long short-term memory
    Kang, Se Hyeok
    Cho, Ji Woong
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2022, 41 (03): : 312 - 318
  • [30] LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS
    Bollepalli, Bajibabu
    Airaksinen, Manu
    Alku, Paavo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5505 - 5509