Improved Phone Recognition Using Excitation Source Features

被引:0
|
作者
Hisham, P. M. [1 ]
Pravena, D. [1 ]
Pardhu, Y. [2 ]
Gokul, V. [2 ]
Abhitej, B. [2 ]
Govind, D. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham Univ, Ctr Excellence Computat Engn & Networking, Coimbatore 641112, Tamil Nadu, India
[2] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Kollam 690525, Kerala, India
关键词
D O I
10.1007/978-3-319-23036-8_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phone recognizers serve as the preprocessing unit for speech recognition systems and phonetic engines. Even though, most of the state of the art speech recognition achieve relatively better accuracy at the sentence level, the phone level recognition performance falls way below the sentence level performance. The increased recognition rates at the sentence levels are achieved with help of refined language models used for the language under consideration. Therefore, the objective of the present work is to improve the phoneme level accuracy of the hidden markov model(HMM) based acoustic phone models by combining excitation source features with the conventional mel frequency cepstral coefficients (MFCC) for American English. TIMIT and CMU Arctic database, is used for the experiments in the present work. The average spectral energy around the zero-frequency region of each frame is used as the excitation source feature to combine with the 13 MFCC features. The effectiveness of the phoneme recognition is confirmed by a 0.5% increase in the phone recognition accuracy against the state of the art HMM-GMM acoustic models with MFCC features.
引用
收藏
页码:147 / 152
页数:6
相关论文
共 50 条
  • [1] Source and system features for phone recognition
    Manjunath, K.
    Rao, K.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (02) : 257 - 270
  • [2] Improvement of Phone Recognition Accuracy using Source and System Features
    Manjunath, K. E.
    Rao, K. Sreenivasa
    Reddy, Gurunath M.
    2015 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION ENGINEERING SYSTEMS (SPACES), 2015, : 501 - 505
  • [3] Dialect Recognition System Using Excitation Source Features
    Choudhury, Akash Roy
    Chittaragi, Nagaratna B.
    Koolagudi, Shashidhar G.
    IEEE INDICON: 15TH IEEE INDIA COUNCIL INTERNATIONAL CONFERENCE, 2018,
  • [4] Infant Cry Recognition using Excitation Source Features
    Singh, Avinash Kumar
    Mukhopadhyay, Jayanta
    Kumar, Sunil S. B.
    Rao, K. Sreenivasa
    2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
  • [5] Recognition of Emotions from Speech using Excitation Source Features
    Koolagudi, Shashidhar G.
    Devliyal, Swati
    Chawla, Bhavna
    Barthwal, Anurag
    Rao, K. Sreenivasa
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3409 - 3417
  • [6] Emotion Recognition from Speech Signals using Excitation Source and Spectral Features
    Choudhury, Akash Roy
    Ghosh, Anik
    Pandey, Rahul
    Barman, Subhas
    PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON), 2018, : 257 - 261
  • [7] Source phone identification using sketches of features
    Kotropoulos, Constantine L.
    IET BIOMETRICS, 2014, 3 (02) : 75 - 83
  • [8] Analysis of Excitation Source Features of Speech for Emotion Recognition
    Kadiri, Sudarsana Reddy
    Gangamohan, P.
    Gangashetty, Suryakanth V.
    Yegnanarayana, B.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1324 - 1328
  • [9] Improvement of Phone Recognition Accuracy Using Articulatory Features
    K. E. Manjunath
    K. Sreenivasa Rao
    Circuits, Systems, and Signal Processing, 2018, 37 : 704 - 728
  • [10] Improvement of Phone Recognition Accuracy Using Articulatory Features
    Manjunath, K. E.
    Rao, K. Sreenivasa
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (02) : 704 - 728