Modulation Spectral Features for Intrusive Measurement of Reverberant Speech Quality

被引:0
|
作者
Ma, Sai [1 ]
Zhang, Hui [1 ]
Xie, Lingyun [1 ]
Xie, Xi [1 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
来源
关键词
Reverberation; Direct-to-Reverberant energy ratio; Modulation; Instantaneous Frequency; Mutual Information; INTELLIGIBILITY;
D O I
10.1007/978-981-13-8138-6_24
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Temporal fine structure includes temporal envelope and fine structure which also called carrier, and instantaneous frequency is the partial derivative of the carrier. An intrusive reverberant speech quality measurement is investigated with the representation of corresponding instantaneous frequency. A Gammatone filterbank is used to simulate auditory mechanism and a modulation filterbank is used to improve frequency resolution. The mean mutual information between reference and reverberant modulation spectral instantaneous frequency probability distribution is taken as the final measurement score. Experimental results show the proposed method outperforming two benchmark algorithms in some practical application conditions.
引用
收藏
页码:284 / 295
页数:12
相关论文
共 50 条
  • [41] Non-intrusive quality assessment of noise-suppressed speech using unsupervised deep features
    Soni, Meet H.
    Patil, Hemant A.
    SPEECH COMMUNICATION, 2021, 130 : 27 - 44
  • [42] NON-INTRUSIVE QUALITY ASSESSMENT FOR ENHANCED SPEECH SIGNALS BASED ON SPECTRO-TEMPORAL FEATURES
    Li, Qiaohong
    Fang, Yuming
    Lin, Weisi
    Thalmann, Daniel
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,
  • [43] Non-intrusive Objective Speech Quality Assessment using a Combination of MFCC, PLP and LSF Features
    Dubey, Rajesh Kumar
    Kumar, Arun
    2013 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSC), 2013, : 297 - 302
  • [44] Non-intrusive speech quality assessment: A survey
    Shen, Kailai
    Yan, Diqun
    Hu, Jing
    Ye, Zhe
    NEUROCOMPUTING, 2024, 580
  • [45] Quality Measurement of Speech Recognition Features in Context of Nearest Neighbour Classifier
    Lileikyte, R.
    Telksnys, L.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 118 (02) : 9 - 12
  • [46] Subband Temporal Modulation Spectrum Normalization for Automatic Speech Recognition in Reverberant Environments
    Lu, Xugang
    Unoki, Masashi
    Nakamura, Satoshi
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2475 - 2478
  • [47] A STUDY ON JOINT BEAMFORMING AND SPECTRAL ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN REVERBERANT ENVIRONMENTS
    Xiong, Feifei
    Meyer, Bernd T.
    Goetze, Stefan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5043 - 5047
  • [48] Reverberation Time Estimation based on a Model for the Power Spectral Density of Reverberant Speech
    Faraji, Neda
    Ahadi, Seyed Mohammad
    Sheikhzadeh, Hamid
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1453 - 1457
  • [49] Combined spectral and speech features for pig speech recognition
    Wu, Xuan
    Zhou, Silong
    Chen, Mingwei
    Zhao, Yihang
    Wang, Yifei
    Zhao, Xianmeng
    Li, Danyang
    Pu, Haibo
    PLOS ONE, 2022, 17 (12):
  • [50] PERFORMANCE COMPARISON OF INTRUSIVE AND NON-INTRUSIVE INSTRUMENTAL QUALITY MEASURES FOR ENHANCED SPEECH
    Avila, Anderson
    Cauchi, Benjamin
    Goetze, Stefan
    Doclo, Simon
    Falk, Tiago
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,