Automatic Speech Recognition System for Tonal Languages: State-of-the-Art Survey

Authors
Jaspreet Kaur
Amitoj Singh
Virender Kadyan
Affiliations
[1] Department of Computational Sciences
[2] Maharaja Ranjit Singh Punjab Technical University
[3] Department of Informatics
[4] School of Computer Science
[5] University of Petroleum & Energy Studies (UPES)
[6] Energy Acres
Abstract
Natural language and human–machine interaction is a heavily explored yet challenging research domain. The main objective is a system that can communicate effectively with humans regardless of the operating environment. This paper presents a systematic survey of Automatic Speech Recognition (ASR) for tonal languages spoken around the globe. Asian, Indo-European and African tonal languages are reviewed; American and Austral-Asian tonal languages are not. The core of the paper presents the work done in previous years on ASR for Asian tonal languages such as Chinese, Thai, Vietnamese, Mandarin, Mizo and Bodo; Indo-European tonal languages such as Punjabi, Lithuanian, Swedish and Croatian; and African tonal languages such as Yoruba and Hausa. Finally, a synthesis analysis based on the findings is presented, and the issues and challenges associated with tonal languages are discussed. The survey observes that a great deal of work has been done on Asian tonal languages, namely Chinese, Thai, Vietnamese and Mandarin, but little has been reported for Mizo and Bodo, for Indo-European tonal languages such as Punjabi, Latvian and Lithuanian, or for African tonal languages such as Hausa and Yoruba.
Pages: 1039-1068
Page count: 29