Person Recognition using Humming, Singing and Speech

Cited by: 4

Authors:

Patil, Hemant A. [1]

Madhavi, Maulik C. [1]

Chhayani, Nirav H. [1]

Affiliations:

[1] Dhirubhai Ambani Inst Informat & Commun Technol D, Gandhinagar, India

Source:

2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012) | 2012

Keywords:

Biometric; Humming; Corpus development; Speaker recognition; Singer recognition

DOI:

10.1109/IALP.2012.58

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speaker recognition is the task of designing systems that recognize a person from speech with the help of computers. In this paper, three biometric signals produced by humans, viz., speech, singing and humming, are considered for the person recognition task. A corpus has been developed from 28 subjects in real-life settings. For the person recognition task, a state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC), and a discriminatively trained polynomial classifier of 2nd-order approximation are used as the spectral features and the classification technique, respectively. Our experimental results indicate that the person recognition system using humming outperforms the other biometric patterns (i.e., speech and singing) by 9% in equal error rate (EER) and 9% in identification rate. We believe this may be because person-specific characteristics are better captured in humming sounds (which are nasalized sounds) than in speech and singing.
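The two figures of merit named in the abstract, EER and identification rate, can be illustrated with a minimal Python sketch. This is not the authors' evaluation code: the function names, the score convention (higher means a better match) and the toy scores are all assumptions made for illustration only.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER from two lists of verification scores.

    Assumption: a trial is accepted when its score is >= the threshold.
    Returns (eer, threshold) at the threshold where the false-accept rate
    (FAR) and false-reject rate (FRR) are closest to each other.
    """
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        far = sum(s >= t for s in impostor) / len(impostor)  # impostors accepted
        frr = sum(s < t for s in genuine) / len(genuine)     # genuine trials rejected
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


def identification_rate(score_rows, true_labels):
    """Fraction of test trials whose top-scoring enrolled person is correct.

    score_rows: one dict per test trial mapping person id -> score (hypothetical layout).
    """
    correct = 0
    for scores, truth in zip(score_rows, true_labels):
        predicted = max(scores, key=scores.get)  # closed-set decision: best score wins
        if predicted == truth:
            correct += 1
    return correct / len(true_labels)


# Toy scores, invented purely to exercise the functions.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)

trials = [{"a": 0.9, "b": 0.1}, {"a": 0.6, "b": 0.4}]
rate = identification_rate(trials, ["a", "b"])
```

A lower EER and a higher identification rate both indicate better recognition, which is the sense in which the abstract reports humming as outperforming speech and singing.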

Pages: 149-152

Number of pages: 4
