Person Recognition using Humming, Singing and Speech

Cited by: 4
Authors
Patil, Hemant A. [1 ]
Madhavi, Maulik C. [1 ]
Chhayani, Nirav H. [1 ]
Affiliation
[1] Dhirubhai Ambani Inst Informat & Commun Technol D, Gandhinagar, India
Keywords
Biometric; Humming; Corpus development; Speaker recognition; Singer recognition
DOI
10.1109/IALP.2012.58
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Speaker recognition is the task of designing a system that recognizes a person from speech with the help of computers. In this paper, several biometric signals produced by humans, viz., speech, singing and humming, are considered for the person recognition task. A corpus has been developed from 28 subjects in real-life settings. For the person recognition task, a state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC), and a discriminatively trained polynomial classifier of 2nd-order approximation are used as the spectral feature and the classification technique, respectively. Our experimental results indicate that the person recognition system using humming outperforms those using the other biometric patterns (i.e., speech and singing) by 9 % in Equal Error Rate (EER) and 9 % in identification rate. We believe this may be because person-specific characteristics are better captured in humming sounds (which are nasalized sounds) than in speech and singing.
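The pipeline named in the abstract (frame-level features, a 2nd-order polynomial classifier, EER scoring) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration rather than the authors' implementation: the function names are hypothetical, random vectors stand in for MFCC frames (feature extraction is not shown), the classifier is fit by ridge-regularized least squares against one-hot targets, and the EER is read off a simple threshold sweep.

```python
import numpy as np

def poly2_expand(x):
    """Map feature vectors (n_frames, d) to all monomials of degree <= 2."""
    x = np.atleast_2d(x)
    n, d = x.shape
    rows, cols = np.triu_indices(d)               # index pairs i <= j
    quad = (x[:, :, None] * x[:, None, :])[:, rows, cols]
    return np.hstack([np.ones((n, 1)), x, quad])  # [1, x_i, x_i * x_j]

def train_poly_classifier(features, labels, n_classes, ridge=1e-3):
    """Least-squares fit of one weight vector per person (one-hot targets).

    The ridge term is an assumption added for numerical stability.
    """
    P = poly2_expand(features)
    T = np.eye(n_classes)[labels]
    A = P.T @ P + ridge * np.eye(P.shape[1])
    return np.linalg.solve(A, P.T @ T)

def score_utterance(W, features):
    """Average the expanded frames over the utterance; one score per person."""
    return poly2_expand(features).mean(axis=0) @ W

def equal_error_rate(genuine, impostor):
    """Approximate EER: the point where false accepts and false rejects meet."""
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)
    thresholds = np.sort(np.concatenate([genuine, impostor]))
    far = np.array([(impostor >= t).mean() for t in thresholds])
    frr = np.array([(genuine < t).mean() for t in thresholds])
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two synthetic "persons"; random 3-dimensional vectors replace MFCC frames.
    train = np.vstack([rng.normal(-3, 1, (50, 3)), rng.normal(3, 1, (50, 3))])
    labels = np.repeat([0, 1], 50)
    W = train_poly_classifier(train, labels, n_classes=2)
    test_utterance = rng.normal(3, 1, (20, 3))
    print("identified person:", int(np.argmax(score_utterance(W, test_utterance))))
```

Identification picks the person with the highest averaged score, while verification compares a claimed person's score with a threshold; sweeping that threshold over genuine and impostor trials yields the EER.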
Pages: 149-152
Number of pages: 4