Automatic phonetic segmentation of Hindi speech using hidden Markov model

Citations: 0
Authors
Balyan, Archana [1]
Agrawal, S. [2]
Dev, Amita [3]
Affiliations
[1] Guru Gobind Singh Indraprastha Univ, Maharaja Surajmal Inst Technol, C-4,Janakpuri, New Delhi 110058, India
[2] KIIT Coll Engn, Gurgaon, Haryana, India
[3] Bhai Parmanand Inst Business Studies, Delhi, India
Keywords
Automatic phonetic segmentation; Hidden Markov models; Text to speech; Corpus-based speech synthesis; Gaussian mixture models; Unit selection
DOI
10.1007/s00146-012-0386-2
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we study the performance of a baseline hidden Markov model (HMM) for the segmentation of speech signals. It is applied to a single-speaker segmentation task using a Hindi speech database. The automatic phoneme segmentation framework that was developed imitates the human phoneme segmentation process. A set of 44 Hindi phonemes was chosen for the segmentation experiment, in which we used a continuous density hidden Markov model (CDHMM) with a mixture of Gaussian distributions. The left-to-right topology with no skip states was selected because it is effective in speech recognition, being consistent with the natural way spoken words are articulated. The system accepts speech utterances along with their orthographic "transcriptions" and generates segmentation information for the speech. The corpus was used to develop context-independent hidden Markov models (HMMs) for each of the Hindi phonemes. The system was trained using numerous sentences relevant to providing information to Metro Rail passengers, and it was validated against a few manually segmented speech utterances. The evaluation of the experiments shows that the best performance is obtained with a combination of two Gaussian mixtures and five HMM states. A category-wise phoneme error analysis has been performed, and the performance of the phonetic segmentation has been reported. The HMM modeling was implemented in C++ using Microsoft Visual Studio 2005, and the system is designed to run on the Windows operating system. The goal of this study is the automatic segmentation of speech at the phonetic level.
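As an illustration of the left-to-right, no-skip topology mentioned in the abstract, the following is a minimal Python sketch, not the authors' C++ implementation. It assumes that boundaries are obtained by Viterbi forced alignment, which is the usual approach but is not spelled out in the abstract. The function names, the self-loop probability, and the toy log-likelihood values are all hypothetical; in a real system the per-frame scores would come from the trained Gaussian mixture densities.

```python
import math

NEG_INF = float("-inf")


def left_to_right_transitions(n_states, self_loop=0.6):
    """Transition matrix for a left-to-right HMM with no skip states.

    Each state can only stay where it is or move to the next state.
    The self-loop probability is an illustrative assumption.
    """
    A = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        if i + 1 < n_states:
            A[i][i] = self_loop
            A[i][i + 1] = 1.0 - self_loop
        else:
            A[i][i] = 1.0  # the last state only loops on itself
    return A


def viterbi_align(log_likes, A):
    """Align T frames to S states, starting in state 0 and ending in S-1.

    log_likes[t][s] is the log-likelihood of frame t under state s
    (assumed given here; normally computed from Gaussian mixtures).
    Returns one state index per frame.
    """
    T, S = len(log_likes), len(A)
    if T < S:
        raise ValueError("need at least one frame per state")
    log_A = [[math.log(p) if p > 0 else NEG_INF for p in row] for row in A]
    delta = [[NEG_INF] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    delta[0][0] = log_likes[0][0]  # the path must start in the first state
    for t in range(1, T):
        for s in range(S):
            # No-skip topology: the only predecessors are s and s - 1.
            best_prev = s
            best_score = delta[t - 1][s] + log_A[s][s]
            if s > 0:
                cand = delta[t - 1][s - 1] + log_A[s - 1][s]
                if cand > best_score:
                    best_prev, best_score = s - 1, cand
            delta[t][s] = best_score + log_likes[t][s]
            back[t][s] = best_prev
    path = [S - 1]  # the path must end in the last state
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


def boundaries(path):
    """Frame indices at which the aligned state changes."""
    return [t for t in range(1, len(path)) if path[t] != path[t - 1]]


# Toy data: 6 frames, 3 states; frames 0-1 favour state 0, 2-3 state 1, 4-5 state 2.
toy_log_likes = [
    [-1.0, -5.0, -5.0],
    [-1.0, -5.0, -5.0],
    [-5.0, -1.0, -5.0],
    [-5.0, -1.0, -5.0],
    [-5.0, -5.0, -1.0],
    [-5.0, -5.0, -1.0],
]
toy_A = left_to_right_transitions(3, self_loop=0.5)
toy_path = viterbi_align(toy_log_likes, toy_A)
print(toy_path)              # [0, 0, 1, 1, 2, 2]
print(boundaries(toy_path))  # [2, 4]
```

In a segmentation setting, the state chain would be the concatenation of the phoneme models implied by the transcription, and the frame indices where the path crosses from one phoneme's last state to the next phoneme's first state would be read off as phoneme boundaries.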
Pages: 543-549
Number of pages: 7
Related papers
50 in total
  • [1] Automatic phonetic segmentation of Hindi speech using hidden Markov model
    Archana Balyan
    S. S. Agrawal
    Amita Dev
    [J]. AI & SOCIETY, 2012, 27 (4) : 543 - 549
  • [2] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    [J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
  • [3] Automatic Speech Segmentation Using the Arabic Phonetic Database
    Al-Manie, Mohammed A.
    Alkanhal, Mohammed I.
    Al-Ghamdi, Mansour M.
    [J]. RECENT ADVANCES IN AUTOMATION & INFORMATION: PROCEEDINGS OF THE 10TH WSEAS INTERNATIONAL CONFERENCE ON AUTOMATION & INFORMATION (ICAI'09), 2009, : 76 - +
  • [4] Automatic Segmentation of Stabilometric Signals Using Hidden Markov Model Regression
    Safi, Khaled
    Mohammed, Samer
    Attal, Ferhat
    Amirat, Yacine
    Oukhellou, Latifa
    Khalil, Mohamad
    Gracies, Jean-Michel
    Hutin, Emilie
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (02) : 545 - 555
  • [5] MARKOV MODEL ACOUSTIC PHONETIC COMPONENT FOR AUTOMATIC SPEECH RECOGNITION
    TAPPERT, CC
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1977, 9 (03) : 363 - 373
  • [6] Continuous Density Hidden Markov Model for Context Dependent Hindi speech Recognition
    Sinha, Shweta
    Agrawal, S. S.
    Jain, Aruna
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1953 - 1958
  • [7] Automatic speech recognition using hidden Markov models
    Botros, N.M.
    Teh, C.K.
    [J]. Microcomputer Applications, 1994, 13 (01) : 6 - 12
  • [8] Automatic outlier detection using hidden Markov model for cerebellar lobule segmentation
    Zuo, Lianrui
    Carass, Aaron
    Han, Shuo
    Prince, Jerry L.
    [J]. MEDICAL IMAGING 2018: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2018, 10578
  • [9] Automatic Segmentation and Recognition in Body Sensor Networks Using a Hidden Markov Model
    Guenterberg, Eric
    Ghasemzadeh, Hassan
    Jafari, Roozbeh
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2012, 11
  • [10] Named Entity Recognition in Hindi Using Hidden Markov Model
    Chopra, Deepti
    Joshi, Nisheeth
    Mathur, Iti
    [J]. 2016 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2016, : 581 - 586