Autonomous Framework for Person Identification by Analyzing Vocal Sounds and Speech Patterns

被引：0

作者：

Hassan, Bilal ^{[1
]}

Ahmed, Ramsha ^{[2
]}

Li, Bo ^{[3
]}

Hassan, Omar ^{[4
]}

Hassan, Taimur ^{[5
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China

[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China

[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China

[4] Sir Syed CASE Inst Technol SSCIT, Dept Elect & Comp Engn, Islamabad, Pakistan

[5] Natl Univ Sci & Technol NUST, Dept Comp & Software Engn, Islamabad, Pakistan

来源：

CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR) | 2019年

基金：

国家重点研发计划;

关键词：

speech processing; cepstrum; Support Vector Machines (SVM); SPEAKER IDENTIFICATION;

D O I：

10.1109/iccar.2019.8813463

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Speech processing has emerged as one of the important and crucial domain over the past decade. Many researchers have worked on voice recognition and verification. Some of the reported work has been done in the field of biometrics. However, this paper proposes an autonomous algorithm for the person identification by analyzing their vocal sounds and speech patterns. First, the input voice signal is introduced to our proposed system from which the low frequency contents are extracted using finite response low pass filter based on hamming window. Then the proposed system performs a cepstral analysis and extracts two distinct features from the signal spectrum i.e. the maximum pitch frequency and maximum cepstrum value. The 2D extracted feature set is passed on to the multi-level classification system constructed on supervised Support Vector Machine (SVM), which first discriminates between the person's gender and then classifies the person based on the gender. Total 120 samples were used to train the proposed classification system and the proposed system correctly identifies the speaker with the accuracy, specificity and sensitivity of 83.33% 86.67% and 80% respectively.

引用

页码：649 / 653

页数：5

共 50 条

[31] ESTIMATION OF VOCAL-TRACT SHAPES FROM ACOUSTIC ANALYSIS OF SPEECH SOUNDS
WAKITA, H
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S37 - S37
[32] Analyzing Stimulus-Stimulus Pairing Effects on Preferences for Speech Sounds
Anna Ingeborg Petursdottir
Charlotte L. Carp
Derek W. Matthies
Barbara E. Esch
The Analysis of Verbal Behavior, 2011, 27 (1) : 45 - 60
[33] Perceptive vs Productive Skills In Analyzing Speech Sounds From Words
Summers, Raymond
JOURNAL OF SPEECH AND HEARING DISORDERS, 1953, 18 (02): : 140 - 148
[34] Analyzing Stimulus-Stimulus Pairing Effects on Preferences for Speech Sounds
Petursdottir, Anna Ingeborg
Carp, Charlotte L.
Matthies, Derek W.
Esch, Barbara E.
ANALYSIS OF VERBAL BEHAVIOR, 2011, 27 (01): : 45 - 60
[35] Private speech in autistic children: vocal patterns and circumstances
Shimada, Yohko M.
Funabiki, Yasuko
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2016, 51 : 421 - 421
[36] EFFECT OF FORWARD AND BACKWARD COARTICULATION ON IDENTIFICATION OF SPEECH SOUNDS
SHARF, DJ
OSTREICHER, H
LANGUAGE AND SPEECH, 1973, 16 (JUL-S) : 196 - 206
[37] SEMIAUTOMATIC SPEECH SOUNDS AURAL IDENTIFICATION PROCEDURE WITH ITS APPLICATION TO SPEECH ANALYSIS
CHRISTOV, PD
ACUSTICA, 1973, 29 (06): : 347 - 349
[38] PATTERNS OF RESIDUAL MASKING FOR SOUNDS WITH SPEECH-LIKE CHARACTERISTICS
HEINZ, JM
LINDBLOM, BE
LINDQVIST, JC
IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1968, AU16 (01): : 107 - +
[39] Sounds Like a Funny Joke: Effects of Vocal Pitch and Speech Rate on Satire Liking
Brugman, Britta C.
Burgers, Christian
CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2021, 75 (02): : 221 - 227
[40] Analyzing acoustic patterns of vowel sounds produced by native Rangri speakers
Abbasi A.M.
Butt B.
Gopang I.B.
Khan A.
Naz K.
Shehwar D.
International Journal of Speech Technology, 2024, 27 (02) : 471 - 481

← 1 2 3 4 5 →