An Overview of Bengali Speech Recognition: Methods, Challenges, and Future Direction

Authors
Tasnia, Nabila [1]
Islam, Mahidul [1]
Rony, Mahi Shahriar [1]
Tanzim, Nishat [1]
Hasib, Khan Md [2]
Alam, Mohammad Shafiul [1]
Affiliations
[1] Ahsanullah Univ Sci & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
Keywords
Speech recognition; Automatic Speech Recognition; Bengali language; Language model; Bengali ASR;
DOI
10.1109/CCWC57344.2023.10099382
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the field of human-computer interaction, speech recognition is an appealing technique that lets users interact with and control a machine. Automatic speech recognition (ASR) systems are now widely used to convert speech to text. Although ASR systems are used extensively in other languages, their implementation for Bengali has not yet reached an acceptable standard. Various models have been applied to speech recognition so far, including LSTM (Long Short-Term Memory) networks; RNN (Recurrent Neural Network), CNN (Convolutional Neural Network), and Transformer-based models are also popular. The Bengali language is more grammatically and structurally diverse than English, so it is difficult for researchers to reuse a language model built for English or any other language, which makes Bengali difficult to work with. Several studies have been carried out on Bengali speech recognition. We list the models that were used for Bengali speech recognition from 2009 to 2022. In this paper, we discuss the challenges that were faced and the scope for future research in this field. This survey also lists the datasets used in the reviewed studies.
Pages: 873-878
Page count: 6