Using probabilistic characteristic vector based on both phonetic and prosodic features for language identification

被引:0
|
作者
Hosseini Amereei S.A. [1 ]
Homayounpour M.M. [1 ]
机构
[1] Laboratory for Intelligent Sound and Speech Processing, Amirkabir University of Technology, Tehran
关键词
APRLM; GPRLM; Language identification; Pitch contour polynomial approximation; Probabilistic sequence kernel; Support vector machine;
D O I
10.1109/ISTEL.2010.5734122
中图分类号
学科分类号
摘要
Language identification (LID) is an important task in indexing of audio signals. This paper introduces a LID system with a generative frontend based on both phonetic and prosodic features. The generative frontend is built upon an ensemble of Gaussian densities. Half of these Gaussian densities are trained to represent elementary speech sound units and the others are trained to represent prosodic properties that both characterize a wide variety of languages. Shifted Delta Cepstral (SDC) and Pitch Contour Polynomial Approximation (PCPA) are used as feature. The backend classifier is Support Vector Machine (SVM). Several language identification experiments were conducted and the proposed improvements were evaluated using OGI-MLTS corpus. Using SVM with (Generalized Linear Discriminant Analysis) GLDS and Probabilistic Sequence Kernel (PSK) outperforms GMM where all systems are based on PCPA, and improves LID performance about 2.1% and 5.9% respectively. Furthermore, something in the region of 4% improvement was achieved by combining both phonetic and prosodic features in our four languages identification experiments. © 2010 IEEE.
引用
收藏
页码:750 / 754
页数:4
相关论文
共 50 条
  • [21] American Dialect Identification using Phonotactic and Prosodic Features
    Etman, A.
    Beex, A. A.
    2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 963 - 970
  • [22] Analysis of Children's Prosodic Features Using Emotion Based Utterances in Urdu Language
    Khan, Sallar
    Ali, Syed Abbas
    Sallar, Jawaria
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2018, 8 (03) : 2954 - 2957
  • [23] Arabic Dialect Identification based on Probabilistic-Phonetic Modeling
    Terbeh, Naim
    Maraoui, Mohsen
    Zrigui, Mounir
    COMPUTACION Y SISTEMAS, 2018, 22 (03): : 863 - 870
  • [24] Language Classification Using Prosodic Features: Comparing Intensity and Pitch
    Zulu, Peleira Nicholas
    2013 Pan African International Conference on Information Science, Computing and Telecommunications (PACT), 2013, : 116 - 121
  • [25] Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines
    Juneja, A
    Espy-Wilson, C
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 675 - 679
  • [26] Identification of Confusion and Surprise in Spoken Dialog using Prosodic Features
    Kumar, Rohit
    Rose, Carolyn P.
    Litman, Diane J.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1842 - +
  • [27] American Midland Dialect Identification Using Prosodic Features and SVM
    Etman, A.
    Beex, A. A.
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 516 - 521
  • [28] Language identification using vector quantization
    Qu, D
    Wang, BX
    Wei, X
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 492 - 495
  • [29] A Novel Emotion Recognizer from Speech Using Both Prosodic and Linguistic Features
    Suzuki, Motoyuki
    Tsuchiya, Seiji
    Ren, Fuji
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT I: 15TH INTERNATIONAL CONFERENCE, KES 2011, 2011, 6881 : 456 - 465
  • [30] A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition
    Juneja, Amit
    Espy-Wilson, Carol
    Journal of the Acoustical Society of America, 2008, 123 (02): : 1154 - 1168