Automatic Speech Recognition System for Tonal Languages: State-of-the-Art Survey

Cited by: 0

Authors:

Jaspreet Kaur

Amitoj Singh

Virender Kadyan

Affiliations:

[1] Department of Computational Sciences

[2] Maharaja Ranjit Singh Punjab Technical University

[3] Department of Informatics

[4] School of Computer Science

[5] University of Petroleum & Energy Studies (UPES)

[6] Energy Acres

Source:

Archives of Computational Methods in Engineering | 2021 / Volume 28

Keywords:

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Natural language and human–machine interaction is a heavily explored yet challenging research domain. The main objective is to build a system that can communicate effectively with humans regardless of the operating environment. This paper presents a systematic survey of Automatic Speech Recognition (ASR) for tonal languages spoken around the globe. The tonal languages of the Asian, Indo-European, and African regions are reviewed, but those of the American and Austral-Asian regions are not. The core of the paper presents prior work on ASR for Asian tonal languages such as Chinese, Thai, Vietnamese, Mandarin, Mizo, and Bodo; Indo-European tonal languages such as Punjabi, Lithuanian, Swedish, and Croatian; and African tonal languages such as Yoruba and Hausa. Finally, a synthesis analysis is presented based on the findings, and many issues and challenges related to tonal languages are discussed. It is observed that a great deal of work has been done on the Asian tonal languages Chinese, Thai, Vietnamese, and Mandarin, but little work has been reported for Mizo and Bodo, for Indo-European tonal languages such as Punjabi, Latvian, and Lithuanian, or for the African tonal languages Hausa and Yoruba.

Citation

Pages: 1039-1068

Page count: 29
