Person Recognition using Humming, Singing and Speech

被引:4
|
作者
Patil, Hemant A. [1 ]
Madhavi, Maulik C. [1 ]
Chhayani, Nirav H. [1 ]
机构
[1] Dhirubhai Ambani Inst Informat & Commun Technol D, Gandhinagar, India
关键词
Biometric; Humming; Corpus development; Speaker recognition; Singer recognition; SPEAKER RECOGNITION;
D O I
10.1109/IALP.2012.58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker recognition deals with designing the system which recognizes the person by speech with the help of computers. In this paper, the various biometric signals produced by humans, viz., speech, singing and humming are considered for person recognition task. Corpus has been developed from 28 subjects in real-life settings. For person recognition task, state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC) and a discriminatively-trained polynomial classifier of 2nd order approximation are used as spectral feature and classification techniques, respectively. Our experimental results indicate that the performance of person recognition system obtained using humming outperforms other biometric patterns (i.e., speech and singing) by 9 % in EER and 9 % in Identification Rate. We believe that this may be due to the person-specific characteristics are better captured in humming sounds, (which are nasalized sounds) than speech and singing.
引用
收藏
页码:149 / 152
页数:4
相关论文
共 50 条
  • [41] Discriminative Named Entity Recognition of Speech Data using Speech Recognition Confidence
    Sudoh, Katsuhito
    Tsukada, Hajime
    Isozaki, Hideki
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 337 - 340
  • [42] Robust distributed speech recognition using speech enhancement
    Flynn, Ronan
    Jones, Edward
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) : 1267 - 1273
  • [43] Automated cleft speech evaluation using speech recognition
    Vucovich, Megan
    Hallac, Rami R.
    Kane, Alex A.
    Cook, Julie
    Van'T Slot, Cortney
    Seaward, James R.
    JOURNAL OF CRANIO-MAXILLOFACIAL SURGERY, 2017, 45 (08) : 1268 - 1271
  • [44] Estimation of Speech Intelligibility Using Speech Recognition Systems
    Takano, Yusuke
    Kondo, Kazuhiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (12): : 3368 - 3376
  • [45] Robust recognition of noisy speech using speech enhancement
    Xu, YF
    Zhang, JJ
    Yao, KS
    Cao, ZG
    Ma, ZX
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737
  • [46] Multidimensional humming transcription using a statistical approach for query by humming systems
    Shih, HH
    Narayanan, SS
    Kuo, CCJ
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 541 - 544
  • [47] Speech bandwidth extension method using speech recognition and speech synthesis
    Takashina, Masashi
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1273 - +
  • [48] Implementation of a Practical Query-by-Singing/Humming (QbSH) System and Its Commercial Applications
    Song, Chai-Jong
    Park, Hochong
    Yang, Chang-Mo
    Jang, Sei-Jin
    Lee, Seok-Phil
    2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 102 - +
  • [49] Implementation of a Practical Query-by-Singing/Humming (QbSH) System and Its Commercial Applications
    Song, Chai-Jong
    Park, Hochong
    Yang, Chang-Mo
    Jang, Sei-Jin
    Lee, Seok-Pil
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2013, 59 (02) : 407 - 414
  • [50] Recognition of environmental sounds using speech recognition techniques
    Cowling, M
    Sitte, R
    ADVANCED SIGNAL PROCESSING FOR COMMUNICATION SYSTEMS, 2002, 703 : 31 - 46