AUTOMATIC LANGUAGE IDENTIFICATION OF THREE INDIAN LANGUAGES USING VECTOR QUANTIZATION

被引：0

作者：

Roy, Pinki ^{[1
]}

Das, Pradip K. ^{[2
]}

机构：

[1] NIT Silchar, Silchar, Assam, India

[2] IIT Guwahati, Gauhati, Assam, India

来源：

FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011) | 2011年

关键词：

Language Identification; Vector Quantization; LPC; Mean Square Error;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The main aim of this paper is to carry out automatic identification of Indian languages using vector quantization. In this particular work we have applied vector quantization classification technique on LPC derived features for identifying 3 Indian languages Assamese, Bengali and Indian English. Experimental results show that recognition accuracy for Assamese & Indian English is better compared to Bengali language. Stop words were used for reducing the overhead of testing process and it shows that Assamese gives optimal recognition rate at 67 % and Bengali, Indian English gives 100% recognition rate. It has also been observed here from mean square error that quality of speech signal used here is very good.

引用

页码：293 / +

页数：2

共 50 条

[41] Automatic language identification using multivariate analysis
Babu, VJ
Baskaran, S
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 789 - 792
[42] On using prosodic cues in automatic language identification
ThymeGobbel, AE
Hutchins, SE
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1768 - 1771
[43] Automatic Language Identification and Content Separation from Indian Multilingual Documents Using Unicode Transformation Format
Rakholia, Rajnish M.
Saini, Jatinderkumar R.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 369 - 378
[44] Speaker identification based on vector quantization
Radová, V
Svenda, Z
TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
[45] Language Identification for Austronesian Languages
Dunn, Jonathan
Nijhof, Wikke
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6530 - 6539
[46] Using MMI Criterion to Realize Language Identification of Minority Languages
Cheng, Yang
Yang, Jian
Kui, Liping
PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3651 - 3655
[47] Script identification and language detection of 12 Indian languages using DWT and template matching of Frequently Occurring Character(s)
Sarungbam, Jeelen Kumar
Kumar, Bhupendra
Choudhary, Ankur
Proceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit, 2014, : 669 - 674
[48] Automatic Carnatic Raga Identification using Octave Mapping and Note Quantization
Pillai, Rohan T.
Mahajan, Shrinivas P.
2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 645 - 649
[49] Script Identification and Language Detection of 12 Indian Languages using DWT and Template Matching of Frequently Occurring Character(s)
Sarungbam, Jeelen Kumar
Kumar, Bhupendra
Choudhary, Ankur
2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 669 - 674
[50] Identification of Indian Languages in romanized form
Yadav, Pratibha
Mishra, Girish
Saxena, P. K.
PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 112 - +

← 1 2 3 4 5 →