Ability of Human Auditory Perception to Distinguish Human-Imitated Speech

被引:0
|
作者
Zaman, Khalid [1 ]
Li, Kai [1 ]
Samiul, Islam J. A. M. [1 ]
Uezu, Yasufumi [1 ]
Kidani, Shunsuke [1 ]
Unoki, Masashi [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Grad Sch Adv Sci & Technol, Nomi, Ishikawa 9231292, Japan
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Accuracy; Measurement; Speech recognition; Speech enhancement; Sensitivity; Noise measurement; Noise; Focusing; Target recognition; Performance evaluation; Human-imitated speech; human auditory perception; timbral features; human listeners; DISCRIMINATION;
D O I
10.1109/ACCESS.2025.3526631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distinguishing human-imitated speech from genuine speech presents a significant challenge for listeners due to their natural resemblance. Human auditory perception (HAP) has been widely studied to understand its mechanisms, with HAP-based acoustic features and metrics applied in various applications to assess sound quality and discriminate sound events. Leveraging these insights, this study specifically aims to evaluate HAP's effectiveness in differentiating genuine from imitated speech through a systematic subject test. To this end, the study applies HAP to the task of distinguishing genuine from imitated speech, using a specially developed dataset of human-imitated speech, due to the lack of comparable publicly available datasets. A three-phase, human-centered approach was used to evaluate HAP ability, and participants achieved an average accuracy of 70.10% in distinguishing genuine from imitated speech in the final test. Additionally, a feasibility study was conducted using two feature sets for machine classification; among the timbral features, boominess and depth performed best with accuracies of 62% and 60%, respectively, while general features like Mel-spectrograms reached 51%. These results underscore the importance of auditory-related features in effectively detecting imitated speech.
引用
收藏
页码:6225 / 6236
页数:12
相关论文
共 50 条
  • [21] Human Speech Perception and Feature Extraction
    Lobdell, Bryce E.
    Hasegawa-Johnson, Mark A.
    Allen, Jont B.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1797 - 1800
  • [22] Human perception of alcoholic intoxication in speech
    Baumeister, Barbara
    Schiel, Florian
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1418 - 1422
  • [23] A Human Auditory Perception Loss Function Using Modified Bark Spectral Distortion for Speech Enhancement
    Shu, Xiaofeng
    Zhou, Yi
    Liu, Hongqing
    Truong, Trieu-Kien
    NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2945 - 2957
  • [24] Activation of human auditory cortex during speech perception: Effects of monaural, binaural, and dichotic presentation
    Stefanatos, Gerry A.
    Joe, Wilson Q.
    Aguirre, Geoffrey K.
    Detre, John A.
    Wetmore, Gabriel
    NEUROPSYCHOLOGIA, 2008, 46 (01) : 301 - 315
  • [25] A Human Auditory Perception Loss Function Using Modified Bark Spectral Distortion for Speech Enhancement
    Xiaofeng Shu
    Yi Zhou
    Hongqing Liu
    Trieu-Kien Truong
    Neural Processing Letters, 2020, 51 : 2945 - 2957
  • [26] Auditory grouping in speech perception
    Darwin, CJ
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 4693 - 4693
  • [27] AUDITORY ACUITY AND PERCEPTION OF SPEECH
    KRYTER, KD
    GREEN, DM
    WILLIAMS, C
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (09): : 1217 - &
  • [28] Pitch perception and auditory learning modify the human auditory cortex
    Pantev, C
    PSYCHOPHYSIOLOGY, 2001, 38 : S15 - S15
  • [29] RESPONSE OF PRIMARY AUDITORY NEURON TO HUMAN SPEECH
    GOLDSTEI.AJ
    MUNDIE, JR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 140 - &
  • [30] Reconstructing Speech from Human Auditory Cortex
    Pasley, Brian N.
    David, Stephen V.
    Mesgarani, Nima
    Flinker, Adeen
    Shamma, Shihab A.
    Crone, Nathan E.
    Knight, Robert T.
    Chang, Edward F.
    PLOS BIOLOGY, 2012, 10 (01)