Detection of Attributes for Bengali Phoneme in Continuous Speech using Deep Neural Network

被引:0
|
作者
Bhowmik, Tanmay [1 ]
Mukherjee, Sankar [2 ]
Das Mandal, Shyamal Kumar [1 ]
机构
[1] IIT Kharagpur, Ctr Educ Technol, Kharagpur, W Bengal, India
[2] Aix Marseille Univ, CNRS, Lab Parole & Langage, Marseille, France
关键词
speech attribute; place of articulation; manner of articulation; deep neural network; deep belief network;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hidden Markov Model (HMM) has contributed greatly in the area of speech recognition during last two decades. However, in recent days, detection-based, bottom-up speech recognition techniques achieve high success rate. In this detection-based, bottom-up approach of speech recognition, first step is detection of speech attributes like place and manner of articulation of the phonemes. This paper describes about the detection of attributes which leads to identification of place and manner of articulation of Bengali phonemes using Deep Neural Network (DNN).
引用
收藏
页码:103 / 108
页数:6
相关论文
共 50 条
  • [21] Prosodic word boundary detection from Bengali continuous speech
    Tanmay Bhowmik
    Shyamal Kumar Das Mandal
    Language Resources and Evaluation, 2020, 54 : 747 - 765
  • [22] Sentiment Polarity Detection in Bengali Tweets Using Deep Convolutional Neural Networks
    Sarkar, Kamal
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 377 - 386
  • [23] Phoneme-based Thai speech recognition using fuzzy system and neural network
    Cheirsilp, R
    Santiprabhob, P
    IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 393 - 397
  • [24] A Hybrid Approach for Speech Enhancement Using MoG Model and Neural Network Phoneme Classifier
    Chazan, Shlomo E.
    Goldberger, Jacob
    Gannot, Sharon
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2516 - 2530
  • [25] Deep Neural Network Quantizers Outperforming Continuous Speech Recognition Systems
    Watzel, Tobias
    Li, Lujun
    Kuerzinger, Ludwig
    Rigoll, Gerhard
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 530 - 539
  • [26] Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity
    Moses, David A.
    Mesgarani, Nima
    Leonard, Matthew K.
    Chang, Edward F.
    JOURNAL OF NEURAL ENGINEERING, 2016, 13 (05)
  • [27] Optimal parameters in neural network models for speech phoneme characterization
    Esposito, A
    Aversano, G
    Quek, F
    NEURAL NETS WIRN VIETRI-01, 2002, : 178 - 186
  • [28] A PHONEME-BASED PRE-TRAINING APPROACH FOR DEEP NEURAL NETWORK WITH APPLICATION TO SPEECH ENHANCEMENT
    Chazan, Shlomo E.
    Gannot, Sharon
    Goldberger, Jacob
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [29] Phoneme Segmentation using Deep Learning for Speech Synthesis
    Lee, Young Han
    Yang, Jong-Yeol
    Cho, Choongsang
    Jung, Hyedong
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 59 - 61
  • [30] Speech Activity Detection Using Deep Neural Networks
    Shahsavari, Sajad
    Sameti, Hossein
    Hadian, Hossein
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1564 - 1568