Malayalam Speech Recognition System and Its Application for visually impaired people

被引:0
|
作者
Anand, Anu V. [1 ]
Devi, P. Shobana [1 ]
Stephen, Jose [1 ]
Bhadran, V. K. [1 ]
机构
[1] Ctr Dev Adv Comp, Trivandrum, Kerala, India
关键词
LVCSR; Speech Recognition; HMM; CMU SPHINX; Voice enabled OpenOffice; Speech Recognition application;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes the development of state-of-the-art large vocabulary continuous speech recognition (LVCSR) system for the Malayalam language with an application for visually challenged. For an LVCSR, building a high accurate acoustic models and large-scale language models are the challenging task. Speech corpus for training the system is collected from 80 native speakers in room environment ensuring the speaker variance. Mel-frequency Cepstral Coefficients (MFCC) method is used as a front-end to extract acoustic features from the input signal. Acoustic model is built on 30 hours of speech data based on Hidden Markov Model (HMM). A hybrid model, integrating rule based and statistical method is used to handle pronunciation variations in the dictionary. The best configuration of the system achieved word accuracy of 75% in average. Accuracy of the system is further increased up to 80% in average, by implementing speaker adaptation technique. The developed system is integrated to OpenOffice Writer together with TTS for making it user friendly editor for visually challenged people.
引用
收藏
页码:619 / 624
页数:6
相关论文
共 50 条
  • [41] Facial emotion recognition and encoding application for the visually impaired
    Pushpalatha, M. N.
    Meherishi, Harshubh
    Vaishnav, Avani
    Pillai, R. Anurag
    Gupta, Aman
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (01): : 749 - 755
  • [42] A Systematic Review on Product Recognition for Aiding Visually Impaired People
    Machado, Andre
    Veras, Rodrigo
    Aires, Kelson
    Neto, Laurindo Britto
    IEEE LATIN AMERICA TRANSACTIONS, 2021, 19 (04) : 592 - 603
  • [43] A new Android application for blind and visually impaired people
    Kardys, Piotr
    Dabrowski, Adam
    Iwanowski, Marcin
    Huderek, Damian
    2016 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2016, : 152 - 155
  • [44] Facial emotion recognition and encoding application for the visually impaired
    M. N Pushpalatha
    Harshubh Meherishi
    Avani Vaishnav
    R. Anurag Pillai
    Aman Gupta
    Neural Computing and Applications, 2023, 35 : 749 - 755
  • [45] TSPS VISUALLY IMPAIRED OPERATOR SYSTEM - OPENING UP JOB POSSIBILITIES FOR VISUALLY IMPAIRED PEOPLE
    BAUER, TM
    BENKO, TW
    BELL LABORATORIES RECORD, 1983, 61 (01): : 25 - 31
  • [46] Web Based Programming Tool with Speech Recognition for Visually Impaired Users
    Lunuwilage, Kaveendra
    Abeysekara, Sameera
    Witharama, Lahiru
    Mendis, Shamini
    Thelijjagoda, Samantha
    2017 11TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA), 2017,
  • [47] DEEP-SEE FACE: A Mobile Face Recognition System Dedicated to Visually Impaired People
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    IEEE ACCESS, 2018, 6 : 51975 - 51985
  • [48] A Lightweight Facial Emotion Recognition System Using Partial Transfer Learning for Visually Impaired People
    Shehada, Dina
    Turky, Ayad
    Khan, Wasiq
    Khan, Bilal
    Hussain, Abir
    IEEE ACCESS, 2023, 11 : 36961 - 36969
  • [49] The performance of HSDPA (3.5 G) network for application in a navigation system for visually impaired people
    Alhajri, Khalid
    Ai-Salihi, Nawzad
    Garaj, Vanja
    Hunaiti, Ziad
    Balachandran, Wamadeva
    CNSR 2008: PROCEEDINGS OF THE 6TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2008, : 440 - 446
  • [50] An Intelligent Banknote Recognition System by using Machine Learning with Assistive Technology for Visually Impaired People
    Ng, Sin-Chun
    Kwok, Chok-Pang
    Chung, Sin-Hang
    Leung, Yuen-Yan
    Pang, Hoi-Shan
    2020 10TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2020, : 185 - 193