Spoken language identification for Indian languages using split and merge EM algorithm

Cited by: 0

Authors:

Manwani, Naresh [1]

Mitra, Suman K. [1]

Joshi, M. V. [1]

Affiliations:

[1] Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, India

Source:

PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS | 2007, Vol. 4815

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The performance of a Language Identification (LID) system based on Gaussian Mixture Models (GMMs) is limited by the convergence of the Expectation Maximization (EM) algorithm to local maxima. This paper describes an LID system in which the extracted features are modeled with Gaussian Mixture Models trained using the Split and Merge Expectation Maximization algorithm, which improves the global convergence of EM. This improves the learning of the mixture models, which in turn gives better LID performance. A maximum likelihood classifier is used to identify the language. The superiority of the proposed method is tested on four languages.
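
The maximum likelihood classification step mentioned in the abstract can be illustrated with a minimal sketch. It assumes one diagonal-covariance GMM per language whose parameters have already been trained; the split and merge EM training itself is not shown. The function names, the toy parameters, and the diagonal-covariance choice are illustrative assumptions and are not taken from the paper.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(frames, gmm):
    """Total log-likelihood of a sequence of frames under one GMM.

    gmm is a list of (weight, mean, var) tuples. Frames are treated as
    independent, so their log-likelihoods are summed.
    """
    total = 0.0
    for x in frames:
        comp = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
        peak = max(comp)  # log-sum-exp trick for numerical stability
        total += peak + math.log(sum(math.exp(c - peak) for c in comp))
    return total


def identify_language(frames, models):
    """Return the language whose GMM gives the highest total log-likelihood."""
    return max(models, key=lambda lang: gmm_log_likelihood(frames, models[lang]))


# Hypothetical toy models: 2-D features, two components per language.
models = {
    "lang_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "lang_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}
utterance = [[5.2, 4.9], [5.8, 6.1], [5.5, 5.4]]
predicted = identify_language(utterance, models)
```

In this sketch the decision depends only on the per-language likelihoods, so any improvement in how well each mixture fits its training data (for example, from a training procedure less prone to poor local maxima) feeds directly into the classification.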


Pages: 463-468

Page count: 6
