A study on the challenges and opportunities of speech recognition for Bengali language

被引:10
|
作者
Mridha, M. F. [1 ]
Ohi, Abu Quwsar [1 ]
Hamid, Md Abdul [2 ]
Monowar, Muhammad Mostafa [2 ]
机构
[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah 21589, Saudi Arabia
关键词
Automatic speech recognition; Bengali; Phoneme; Speech to text; Language-dependent challenges; Language-independent challenges; MODELS;
D O I
10.1007/s10462-021-10083-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition is a fascinating process that offers the opportunity to interact and command the machine in the field of human-computer interactions. Speech recognition is a language-dependent system constructed directly based on the linguistic and textual properties of any language. Automatic speech recognition (ASR) systems are currently being used to translate speech to text flawlessly. Although ASR systems are being strongly executed in international languages, ASR systems' implementation in the Bengali language has not reached an acceptable state. In this research work, we sedulously disclose the current status of the Bengali ASR system's research endeavors. In what follows, we acquaint the challenges that are mostly encountered while constructing a Bengali ASR system. We split the challenges into language-dependent and language-independent challenges and guide how the particular complications may be overhauled. Following a rigorous investigation and highlighting the challenges, we conclude that Bengali ASR systems require specific construction of ASR architectures based on the Bengali language's grammatical and phonetic structure.
引用
收藏
页码:3431 / 3455
页数:25
相关论文
共 50 条
  • [1] A study on the challenges and opportunities of speech recognition for Bengali language
    M. F. Mridha
    Abu Quwsar Ohi
    Md Abdul Hamid
    Muhammad Mostafa Monowar
    [J]. Artificial Intelligence Review, 2022, 55 : 3431 - 3455
  • [2] An Overview of Bengali Speech Recognition: Methods, Challenges, and Future Direction
    Tasnia, Nabila
    Islam, Mahidul
    Rony, Mahi Shahriar
    Tanzim, Nishat
    Hasib, Khan Md
    Alam, Mohammad Shafiul
    [J]. 2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 873 - 878
  • [3] A Speech Recognition System for Bengali Language using Recurrent Neural Network
    Islam, Jahirul
    Mubassira, Masiath
    Islam, Md. Rakibul
    Das, Amit Kumar
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 73 - 76
  • [4] Bengali Speech Emotion Recognition
    Mohanta, Abhijit
    Sharma, Uzzal
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2812 - 2814
  • [5] Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers
    Reitmaier, Thomas
    Wallington, Electra
    Raju, Dani Kalarikalayil
    Klejch, Ondrej
    Pearson, Jennifer
    Jones, Matt
    Bell, Peter
    Robinson, Simon
    [J]. PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [6] Entity Recognition in Bengali Language
    Das, Sujit Kumar
    Dhar, Sourish
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON ADVANCED COMPUTING AND COMMUNICATION (ISACC), 2015, : 157 - 160
  • [7] How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language
    Hasan, Md. Rakibul
    Hasan, Md. Mahbub
    Hossain, Md Zakir
    [J]. JOURNAL OF ENGINEERING-JOE, 2021, 2021 (12): : 817 - 827
  • [8] Challenges and opportunities for speech and language therapists in secondary schools
    Malcolm, A
    Myers, L
    [J]. INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2001, 36 (02) : 481 - 486
  • [9] Sign language recognition for Bengali characters
    Ayshee, Tanzila Ferdous
    Raka, Sadia Afrin
    Hasib, Quazi Ridwan
    Rahman, Rashedur M.
    Hossain, Md.
    [J]. International Journal of Fuzzy System Applications, 2015, 4 (04) : 1 - 14
  • [10] Automatic Speech Recognition of Bengali Using Kaldi
    Guchhait, Subhadeep
    Hans, Arnold Sachith A.
    Augustine, Jacob
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 153 - 166