Person Recognition using Humming, Singing and Speech

Cited by: 4

Authors:

Patil, Hemant A. [1]

Madhavi, Maulik C. [1]

Chhayani, Nirav H. [1]

Affiliations:

[1] Dhirubhai Ambani Inst Informat & Commun Technol D, Gandhinagar, India

Source:

2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012) | 2012

Keywords:

Biometric; Humming; Corpus development; Speaker recognition; Singer recognition

DOI:

10.1109/IALP.2012.58

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speaker recognition deals with designing systems that recognize a person from his or her speech with the help of computers. In this paper, several biometric signals produced by humans, viz., speech, singing and humming, are considered for the person recognition task. A corpus has been developed from 28 subjects in real-life settings. For the person recognition task, a state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC), and a discriminatively trained polynomial classifier of 2nd-order approximation are used as the spectral feature and the classification technique, respectively. Our experimental results indicate that the person recognition system using humming outperforms the other biometric patterns (i.e., speech and singing) by 9 % in EER and 9 % in Identification Rate. We believe this may be because person-specific characteristics are better captured in humming sounds (which are nasalized sounds) than in speech and singing.
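The pipeline named in the abstract (frame-level features, a 2nd-order polynomial classifier, and EER as a metric) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the frames are synthetic 2-D vectors standing in for MFCCs, and plain least-squares fitting to one-hot targets is assumed here as one common way to train a polynomial classifier discriminatively.

```python
import numpy as np
from itertools import combinations_with_replacement


def poly2_expand(x):
    """Expand frames of shape (n_frames, d) into all monomials of degree <= 2."""
    x = np.atleast_2d(x)
    n, d = x.shape
    cols = [np.ones(n)]                                  # constant term
    cols += [x[:, i] for i in range(d)]                  # linear terms
    cols += [x[:, i] * x[:, j]                           # squares and cross terms
             for i, j in combinations_with_replacement(range(d), 2)]
    return np.stack(cols, axis=1)


def train_polynomial_models(features, labels, n_classes):
    """Least-squares fit of one weight vector per person (assumed training rule)."""
    expanded = poly2_expand(features)
    targets = np.eye(n_classes)[labels]                  # one-hot target per frame
    weights, *_ = np.linalg.lstsq(expanded, targets, rcond=None)
    return weights                                       # shape (n_terms, n_classes)


def score_utterance(weights, frames):
    """Average the per-frame model outputs over the whole utterance."""
    return (poly2_expand(frames) @ weights).mean(axis=0)


def equal_error_rate(genuine, impostor):
    """Approximate EER: the point where false accepts and false rejects balance."""
    best_gap, eer = np.inf, 1.0
    for t in np.sort(np.concatenate([genuine, impostor])):
        far = np.mean(impostor >= t)                     # impostors accepted
        frr = np.mean(genuine < t)                       # genuine trials rejected
        if abs(far - frr) < best_gap:
            best_gap, eer = abs(far - frr), (far + frr) / 2
    return eer


# Synthetic stand-in for MFCC frames of two people (assumption, not real data).
rng = np.random.default_rng(0)
train_frames = np.vstack([rng.normal(-2.0, 0.5, (100, 2)),
                          rng.normal(2.0, 0.5, (100, 2))])
train_labels = np.repeat([0, 1], 100)
weights = train_polynomial_models(train_frames, train_labels, n_classes=2)

test_frames = rng.normal(2.0, 0.5, (20, 2))              # utterance from person 1
predicted = int(np.argmax(score_utterance(weights, test_frames)))
print("identified person:", predicted)
```

Identification picks the person whose model gives the highest averaged score, while EER is computed from verification scores of genuine and impostor trials; the abstract reports both kinds of figure.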

Pages: 149-152

Page count: 4

50 items in total

[1] Development of Corpora for Person Recognition using Humming, Singing and Speech
Chhayani, Nirav H.
Patil, Hemant A.
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[2] Significance of Phase-based Features for Person Recognition Using Humming
Sailor, Hardik B.
Madhavi, Maulik C.
Patil, Hemant A.
PERCEPTION AND MACHINE INTELLIGENCE, 2015, : 99 - 103
[3] Combining evidences from magnitude and phase information using VTEO for person recognition using humming
Patil, Hemant A.
Madhavi, Maulik C.
COMPUTER SPEECH AND LANGUAGE, 2018, 52 : 225 - 256
[4] MUSIC RETRIEVAL BY SINGING AND HUMMING USING INFORMATION FUSION
Milner, John N.
Hsu, D. Frank
PROCEEDINGS OF THE 2013 12TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI CC 2013), 2013, : 332 - 338
[5] SPEECH ANALYSIS OF SUNG-SPEECH AND LYRIC RECOGNITION IN MONOPHONIC SINGING
Kawai, Dairoku
Yamamoto, Kazumasa
Nakagawa, Seiichi
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 271 - 275
[6] The singing, humming, whistling, hollering, growling, storytelling bassist
Mezzacappa, Lisa
STRAD, 2019, 130 (1547) : 48 - 53
[7] Effective Results Ranking for Mobile Query by Singing/Humming Using a Hybrid Recommendation Mechanism
Liu, Ning-Han
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (05) : 1407 - 1420
[8] Clinical and anatomic characteristics of humming and singing in partial seizures
Bartolomei, F.
McGonigal, A.
Guye, M.
Guedj, E.
Chauvel, P.
NEUROLOGY, 2007, 69 (05) : 490 - 492
[9] Static and dynamic information derived from source and system features for person recognition from humming
Patil, Hemant
Madhavi, Maulik
Parhi, Keshab
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (03) : 393 - 406
[10] Phoneme Recognition in Korean Singing Voices Using Self-Supervised English Speech Representations
Wu, Wenqin
Lee, Joonwhoan
APPLIED SCIENCES-BASEL, 2024, 14 (18):