Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing

被引:0
|
作者
Rajesh M. Hegde
Hema A. Murthy
V. R. R. Gadde
机构
[1] University of California San Diego,Department of Electrical and Computer Engineering
[2] Indian Institute of Technology Madras,Department of Computer Science and Engineering
[3] SRI International,STAR Lab
关键词
Acoustics; Speech Recognition; Group Delay; Conventional Group; Resonant Structure;
D O I
暂无
中图分类号
学科分类号
摘要
This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC. The conventional group delay function fails to capture the resonant structure and the dynamic range of the speech spectrum primarily due to pitch periodicity effects. The group delay function is modified to suppress these spikes and to restore the dynamic range of the speech spectrum. Cepstral features are derived from the modified group delay function, which are called the modified group delay feature (MODGDF). The complementarity and robustness of the MODGDF when compared to the MFCC are also analyzed using spectral reconstruction techniques. Combination of several spectral magnitude-based features and the MODGDF using feature fusion and likelihood combination is described. These features are then used for three speech processing tasks, namely, syllable, speaker, and language recognition. Results indicate that combining MODGDF with MFCC at the feature level gives significant improvements for speech recognition tasks in noise. Combining the MODGDF and the spectral magnitude-based features gives a significant increase in recognition performance of 11% at best, while combining any two features derived from the spectral magnitude does not give any significant improvement.
引用
收藏
相关论文
共 50 条
  • [1] Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
    Hegde, Rajesh M.
    Murthy, Hema A.
    Gadde, V. R. R.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [2] Speech processing using joint features derived from the modified group delay function
    Hegde, RM
    Murthy, HA
    Rao, GVR
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 541 - 544
  • [3] Significance of the modified group delay feature in speech recognition
    Hegde, Rajesh M.
    Murthy, Hema A.
    Gadde, Venkata Ramana Rao
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 190 - 202
  • [4] Applications of group delay functions in speech processing
    Yegnanarayana, B.
    Madhu Murthy, K.V.
    Murthy, Hema A.
    IETE Journal of Research, 1988, 34 (01) : 20 - 29
  • [5] SPEECH PROCESSING USING GROUP DELAY FUNCTIONS
    MURTHY, HA
    YEGNANARAYANA, B
    SIGNAL PROCESSING, 1991, 22 (03) : 259 - 267
  • [6] Significance of Group Delay based Acoustic Features in the Linguistic Search Space for Robust Speech Recognition
    Ramya, R.
    Hegde, Rajesh M.
    Murthy, Hema A.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1537 - +
  • [7] Modified Group Delay Features for Emotion Recognition
    Uthiraa, S.
    Pusuluri, Aditya
    Patil, Hemant A.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 321 - 330
  • [8] SIGNIFICANCE OF THE MUSIC-GROUP DELAY SPECTRUM IN SPEECH ACQUISITION FROM DISTANT MICROPHONES
    Shukla, Mrityunjaya
    Hegde, Rajesh M.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2738 - 2741
  • [9] Speech enhancement using source features and group delay analysis
    Prasanna, SRM
    Moorthy, PK
    Yegnanarayana, B
    INDICON 2005 PROCEEDINGS, 2005, : 19 - 23
  • [10] An alternative representation of speech using the modified group delay feature
    Hegde, RM
    Hurthy, HA
    2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM), 2004, : 280 - 284