Automatic Language Identification using Machine learning Techniques

被引:0
|
作者
Venkatesan, Hariraj [1 ]
Venkatasubramanian, T. Varun [1 ]
Sangeetha, J. [1 ]
机构
[1] SASTRA Deemed Univ, Sch Comp, Thanjavur 613401, India
关键词
Automatic language identification (LID); Mel frequency cepstral coefficients; Support vector machines; Decision trees;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Investigation in the area of spoken language identification on regional languages aids to broaden the outreach of technology to regional language speakers and also gives to the preservation of regional languages. In this paper, we report our work on identifying spoken data in four local Indian languages Kannada, Hindi, Tamil and Telugu. Automatic Language Identification systems take a speech signal as input and perform computations on the speech input to classify it into one of the natural languages. Mathematical computations performed on the properties of a speech signal such as frequency or amplitude can be used to derive information about the audio and its speaker. In this paper, Mel Frequency Cepstral Coefficients (MFCC) has been used to derive features of speech signals that can be used for identifying languages. For classification purposes, Support Vector Machines and Decision Tree classifiers were used and we got accuracies of 76% and 73% respectively.
引用
收藏
页码:583 / 588
页数:6
相关论文
共 50 条
  • [1] Automatic Identification of Honeypot Server Using Machine Learning Techniques
    Huang, Cheng
    Han, Jiaxuan
    Zhang, Xing
    Liu, Jiayong
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2019, 2019
  • [2] Automatic Identification of Ontology Versions Using Machine Learning Techniques
    Allocca, Carlo
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT I, 2011, 6643 : 352 - 366
  • [3] Language identification of character images using machine learning techniques
    Liu, YH
    Lin, CC
    Chang, F
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 630 - 634
  • [4] Harnessing Twitter for Automatic Sentiment Identification Using Machine Learning Techniques
    Dash, Amiya Kumar
    Rout, Jitendra Kumar
    Jena, Sanjay Kumar
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 507 - 514
  • [5] Automatic identification of residential building features using machine learning techniques
    Pietro, Carpanese
    Marco, Dona
    Francesca, da Porto
    [J]. XIX ANIDIS CONFERENCE, SEISMIC ENGINEERING IN ITALY, 2023, 44 : 1980 - 1987
  • [6] Paraphrase Identification using Machine Learning Techniques
    Chitra, A.
    Kumar, C. S. Saravana
    [J]. RECENT ADVANCES IN NETWORKING, VLSI AND SIGNAL PROCESSING, 2010, : 245 - +
  • [7] AUTOMATIC BUILDING IDENTIFICATION USING GPS AND MACHINE LEARNING
    Woodley, Robert
    Noll, Warren
    Barker, Joseph
    Wunsch, Donald C., II
    [J]. 2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 2739 - 2742
  • [8] Identification of Spoken Language using Machine Learning Approach
    Shahriar, Md Asif
    Aziz, Iftekhar
    Banik, Shovan
    Sattar, Abdus
    [J]. 2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [9] Automatic tagging web services using machine learning techniques
    Lin, Maria
    Cheung, David W.
    [J]. 2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, : 258 - 265
  • [10] Automatic Classification of Foot Thermograms Using Machine Learning Techniques
    Filipe, Vitor
    Teixeira, Pedro
    Teixeira, Ana
    [J]. ALGORITHMS, 2022, 15 (07)