Dravidian language classification from speech signal using spectral and prosodic features

Cited by: 6
Authors
Koolagudi S.G. [1]
Bharadwaj A. [1]
Srinivasa Murthy Y.V. [1]
Reddy N. [1]
Rao P. [1]
Affiliation
[1] Department of CSE, National Institute of Technology Karnataka, Mangalore
Keywords
Artificial neural networks; Dravidian language classification; Indian languages; Language identification; Legendre polynomial; Mel-frequency cepstral coefficients; Principal component analysis; Prosody analysis; Shifted delta cepstral features
DOI
10.1007/s10772-017-9466-5
Abstract
An interesting aspect of the Dravidian languages is their commonality: a shared script, similar vocabulary, and a common root language. In this work, an attempt has been made to classify the four complex Dravidian languages using cepstral coefficients and prosodic features. Speech in the Dravidian languages has been recorded in various environments and used as the database. It is demonstrated that while cepstral coefficients alone can identify the language with a fair degree of accuracy, adding prosodic features to them improves language identification performance. Legendre polynomial fitting and principal component analysis (PCA) are applied to the feature vectors to reduce dimensionality, which in turn reduces time complexity. In the experiments conducted, using both cepstral coefficients and prosodic features gives a language identification rate of around 87%, which is about 18% above the baseline system using Mel-frequency cepstral coefficients (MFCCs). The results indicate that temporal variations and prosody are important factors to consider in language identification tasks. © 2017, Springer Science+Business Media, LLC.
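The Legendre polynomial fitting step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which features are fitted or which polynomial order is used, so the choice of a pitch (F0) contour as input, the default order of 3, and the function names `legendre_basis`, `solve_linear_system`, and `legendre_fit` are all assumptions made for illustration. The idea shown is that a contour of any length is summarized by a fixed number of least-squares Legendre coefficients, which is one way such a fit reduces dimensionality.

```python
def legendre_basis(x, order):
    """Return [P_0(x), ..., P_order(x)] using Bonnet's recurrence."""
    values = [1.0, x]
    for n in range(1, order):
        # (n + 1) P_{n+1}(x) = (2n + 1) x P_n(x) - n P_{n-1}(x)
        values.append(((2 * n + 1) * x * values[n] - n * values[n - 1]) / (n + 1))
    return values[: order + 1]


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - tail) / aug[row][row]
    return x


def legendre_fit(contour, order=3):
    """Least-squares Legendre coefficients of a contour (hypothetical helper).

    Frame indices are mapped onto [-1, 1], so contours of different lengths
    all yield order + 1 coefficients.
    """
    n_frames = len(contour)
    if n_frames < max(2, order + 1):
        raise ValueError("contour is too short for the requested order")
    xs = [-1.0 + 2.0 * i / (n_frames - 1) for i in range(n_frames)]
    basis = [legendre_basis(x, order) for x in xs]
    size = order + 1
    # Normal equations: (B^T B) c = B^T y
    gram = [[sum(row[i] * row[j] for row in basis) for j in range(size)]
            for i in range(size)]
    rhs = [sum(row[i] * y for row, y in zip(basis, contour)) for i in range(size)]
    return solve_linear_system(gram, rhs)


if __name__ == "__main__":
    # Made-up pitch values in Hz, for illustration only (not data from the paper).
    pitch = [118.0, 121.5, 126.0, 131.0, 134.5, 135.0, 132.0, 127.5, 122.0, 117.0]
    print(legendre_fit(pitch, order=3))
```

In this sketch the ten-frame contour is reduced to four coefficients, which roughly capture its mean level, slope, curvature, and asymmetry. The PCA step named in the abstract is a separate reduction and is not shown here.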
Pages: 1005 - 1016
Number of pages: 11
Related papers
50 in total
  • [1] Spectral and prosodic features-based speech pattern classification
    Sinha, Shweta
    Jain, Aruna
    Agrawal, S. S.
    [J]. INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2015, 2 (01) : 96 - 110
  • [2] HMM Based Language Identification from Speech Utterances of Popular Indic Languages Using Spectral and Prosodic Features
    Sadanandam, Manchala
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (02) : 521 - 528
  • [3] Dialect recognition from Telugu speech utterances using spectral and prosodic features
    Shivaprasad, S.
    Sadanandam, M.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 27 (2) : 515 - 515
  • [4] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    [J]. Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [5] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [6] Speech/Music Classification Using Features From Spectral Peaks
    Bhattacharjee, Mrinmoy
    Prasanna, S. R. Mahadeva
    Guha, Prithwijit
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1549 - 1559
  • [7] Language Classification Using Prosodic Features: Comparing Intensity and Pitch
    Zulu, Peleira Nicholas
    [J]. 2013 Pan African International Conference on Information Science, Computing and Telecommunications (PACT), 2013, : 116 - 121
  • [8] Improving Speech Emotion Recognition System Using Spectral and Prosodic Features
    Chakhtouna, Adil
    Sekkate, Sara
    Adib, Abdellah
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 399 - 409
  • [9] Classification of lexical stress using spectral and prosodic features for computer-assisted language learning systems
    Ferrer, Luciana
    Bratt, Harry
    Richey, Colleen
    Franco, Horacio
    Abrash, Victor
    Precoda, Kristin
    [J]. SPEECH COMMUNICATION, 2015, 69 : 31 - 45
  • [10] Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech
    Jiří Přibil
    Anna Přibilová
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2013