Spectral and prosodic features-based speech pattern classification

Authors
Sinha, Shweta [1]
Jain, Aruna [1]
Agrawal, S. S. [2]
Affiliations
[1] Birla Inst Technol, Dept Comp Sci & Engn, Ranchi, Bihar, India
[2] KIIT Grp Coll, Gurgaon, Haryana, India
Keywords
speech pattern identification; dialect classification; auto-associative neural network; AANN; feature compression; Hindi dialects; speech features; prosodic features
DOI
10.1504/IJAPR.2015.068947
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Speech patterns produced by individuals are unique. This uniqueness is due to accent, which is influenced by the individual's native dialect. Prior knowledge of the spoken dialect provides valuable information for speaker profiling, and incorporating it into the decision parameters can improve system performance. In this paper, an auto-associative neural network (AANN) model is proposed to capture the intrinsic characteristics of speech features for dialect classification. The paper shows that a few spectral and prosodic features are sufficient to identify Hindi dialects. Experimental results show that system performance is best when spectral and prosodic features are combined as input. In the presence of noise, the performance of a conventional ASR system starts to degrade. White noise from the NOISEX-92 database is added to the recorded utterances at SNRs ranging from 0 dB to 20 dB, and the paper evaluates the dialect classification system's performance over this range.
Pages: 96-110
Page count: 15
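The noise evaluation mentioned in the abstract relies on mixing white noise into clean utterances at a chosen SNR. The record does not describe how the authors did this, so the following is only a minimal sketch of one common way to do it. The function names `add_noise_at_snr` and `measured_snr_db` are hypothetical, the sine wave is a stand-in for a recorded utterance, and the synthetic Gaussian noise is a stand-in for the NOISEX-92 white noise recording, which is not bundled here.

```python
import numpy as np


def add_noise_at_snr(signal, noise, snr_db):
    """Mix `noise` into `signal` so that the mixture has the requested SNR (dB).

    Returns the noisy signal and the scaled noise that was added.
    """
    signal = np.asarray(signal, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)

    # Repeat or trim the noise so it covers the whole utterance.
    reps = int(np.ceil(len(signal) / len(noise)))
    noise = np.tile(noise, reps)[: len(signal)]

    signal_power = np.mean(signal ** 2)
    noise_power = np.mean(noise ** 2)

    # SNR_dB = 10 * log10(P_signal / (gain^2 * P_noise)), solved for gain.
    gain = np.sqrt(signal_power / (noise_power * 10.0 ** (snr_db / 10.0)))
    scaled_noise = gain * noise
    return signal + scaled_noise, scaled_noise


def measured_snr_db(signal, scaled_noise):
    """Compute the SNR in dB from the clean signal and the added noise."""
    signal = np.asarray(signal, dtype=np.float64)
    scaled_noise = np.asarray(scaled_noise, dtype=np.float64)
    return 10.0 * np.log10(np.mean(signal ** 2) / np.mean(scaled_noise ** 2))


# Assumed stand-ins: a 1 s, 16 kHz sine wave for the utterance and
# synthetic Gaussian noise in place of the NOISEX-92 white noise file.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2.0 * np.pi * 220.0 * t)
white = rng.standard_normal(8000)

# Sweep the 0 dB to 20 dB range mentioned in the abstract (step size assumed).
for snr in (0, 5, 10, 15, 20):
    noisy, scaled = add_noise_at_snr(clean, white, snr)
    print(snr, round(measured_snr_db(clean, scaled), 2))
```

Scaling the noise rather than the speech keeps the utterance level fixed across SNR conditions, so any change in classifier accuracy can be attributed to the noise level alone.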