AUTOMATIC LANGUAGE IDENTIFICATION OF THREE INDIAN LANGUAGES USING VECTOR QUANTIZATION

被引:0
|
作者
Roy, Pinki [1 ]
Das, Pradip K. [2 ]
机构
[1] NIT Silchar, Silchar, Assam, India
[2] IIT Guwahati, Gauhati, Assam, India
来源
FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011) | 2011年
关键词
Language Identification; Vector Quantization; LPC; Mean Square Error;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The main aim of this paper is to carry out automatic identification of Indian languages using vector quantization. In this particular work we have applied vector quantization classification technique on LPC derived features for identifying 3 Indian languages Assamese, Bengali and Indian English. Experimental results show that recognition accuracy for Assamese & Indian English is better compared to Bengali language. Stop words were used for reducing the overhead of testing process and it shows that Assamese gives optimal recognition rate at 67 % and Bengali, Indian English gives 100% recognition rate. It has also been observed here from mean square error that quality of speech signal used here is very good.
引用
收藏
页码:293 / +
页数:2
相关论文
共 50 条
  • [41] Automatic language identification using multivariate analysis
    Babu, VJ
    Baskaran, S
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 789 - 792
  • [42] On using prosodic cues in automatic language identification
    ThymeGobbel, AE
    Hutchins, SE
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1768 - 1771
  • [43] Automatic Language Identification and Content Separation from Indian Multilingual Documents Using Unicode Transformation Format
    Rakholia, Rajnish M.
    Saini, Jatinderkumar R.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 369 - 378
  • [44] Speaker identification based on vector quantization
    Radová, V
    Svenda, Z
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
  • [45] Language Identification for Austronesian Languages
    Dunn, Jonathan
    Nijhof, Wikke
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6530 - 6539
  • [46] Using MMI Criterion to Realize Language Identification of Minority Languages
    Cheng, Yang
    Yang, Jian
    Kui, Liping
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3651 - 3655
  • [47] Script identification and language detection of 12 Indian languages using DWT and template matching of Frequently Occurring Character(s)
    Sarungbam, Jeelen Kumar
    Kumar, Bhupendra
    Choudhary, Ankur
    Proceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit, 2014, : 669 - 674
  • [48] Automatic Carnatic Raga Identification using Octave Mapping and Note Quantization
    Pillai, Rohan T.
    Mahajan, Shrinivas P.
    2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 645 - 649
  • [49] Script Identification and Language Detection of 12 Indian Languages using DWT and Template Matching of Frequently Occurring Character(s)
    Sarungbam, Jeelen Kumar
    Kumar, Bhupendra
    Choudhary, Ankur
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 669 - 674
  • [50] Identification of Indian Languages in romanized form
    Yadav, Pratibha
    Mishra, Girish
    Saxena, P. K.
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 112 - +