Spoken language identification for Indian languages using split and merge EM algorithm

被引:0
|
作者
Manwani, Naresh [1 ]
Mitra, Suman K. [1 ]
Joshi, M. V. [1 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol, Gandhinagar, India
来源
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS | 2007年 / 4815卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Performance of Language Identification (LID) System using Gaussian Mixture Models (GMM) is limited by the convergence of Expectation Maximization (EM) algorithm to local maxima. In this paper an LID system is described using Gaussian Mixture Models for the extracted features which are then trained using Split and Merge Expectation Maximization Algorithm that improves the global convergence of EM algorithm. It improves the learning of mixture models which in turn gives better LID performance. A maximum likelihood classifier is used for classification or identifying a language. The superiority of the proposed method is tested for four languages.
引用
收藏
页码:463 / 468
页数:6
相关论文
共 50 条
  • [41] Multi-resolution approach to Identification of spoken languages and to improve overall Language Diarization System using Whisper Model
    Vachhani, Bhavik
    Singh, Dipesh
    Lawyer, Rustom
    INTERSPEECH 2023, 2023, : 1993 - 1997
  • [42] Solving the storm split-merge problem-A combined storm identification, tracking algorithm
    Zan, Beilei
    Yu, Ye
    Li, Jianglin
    Zhao, Guo
    Zhang, Tong
    Ge, Jun
    ATMOSPHERIC RESEARCH, 2019, 218 : 335 - 346
  • [43] A Pre-classification-Based Language Identification for Northeast Indian Languages Using Prosody and Spectral Features
    Bhanja, Chuya China
    Laskar, Mohammad Azharuddin
    Laskar, Rabul Hussain
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (05) : 2266 - 2296
  • [44] A Pre-classification-Based Language Identification for Northeast Indian Languages Using Prosody and Spectral Features
    Chuya China Bhanja
    Mohammad Azharuddin Laskar
    Rabul Hussain Laskar
    Circuits, Systems, and Signal Processing, 2019, 38 : 2266 - 2296
  • [45] Automatic Spoken Language Identification by Digital Signal Processing Methods. Tatar and Russian Languages
    Latypov, Rustam
    Nigmatullin, Ruslan
    Stolov, Evgeni
    INFORMATION AND SOFTWARE TECHNOLOGIES (ICIST 2017), 2017, 756 : 539 - 549
  • [46] Identification of the marks of psychic trauma in spoken language: Definition of the "SPLIT-10" diagnostic scale
    Gayraud, Frederique
    Auxemery, Yann
    ANNALES MEDICO-PSYCHOLOGIQUES, 2022, 180 (03): : 195 - 212
  • [47] Video object tracking using region split and merge and a Kalman filter tracking algorithm
    Vigus, SA
    Bull, DR
    Canagarajah, CN
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 650 - 653
  • [48] Spoken language identification using large vocabulary speech recognition
    Bell Lab, Murray Hill, United States
    Int Conf Spoken Lang Process ICSLP Proc, 1600, (1780-1783):
  • [49] Improving Indian Spoken-Language Identification by Feature Selection in Duration Mismatch Framework
    Bakshi A.
    Kopparapu S.K.
    SN Computer Science, 2021, 2 (6)
  • [50] Improved Particle Swarm Optimization for Detection of Pancreatic Tumor using Split and Merge Algorithm
    Dhruv, Bhawna
    Mittal, Neetu
    Modi, Megha
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2022, 10 (01): : 38 - 47