Autonomous Framework for Person Identification by Analyzing Vocal Sounds and Speech Patterns

Authors
Hassan, Bilal [1]
Ahmed, Ramsha [2]
Li, Bo [3]
Hassan, Omar [4]
Hassan, Taimur [5]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[4] Sir Syed CASE Inst Technol SSCIT, Dept Elect & Comp Engn, Islamabad, Pakistan
[5] Natl Univ Sci & Technol NUST, Dept Comp & Software Engn, Islamabad, Pakistan
Funding
National Key Research and Development Program of China;
Keywords
speech processing; cepstrum; Support Vector Machines (SVM); speaker identification;
DOI
10.1109/iccar.2019.8813463
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Speech processing has emerged as one of the most important research domains over the past decade. Many researchers have worked on voice recognition and verification, and some of the reported work has been done in the field of biometrics. This paper proposes an autonomous algorithm for person identification that analyzes vocal sounds and speech patterns. First, the input voice signal is passed to the proposed system, which extracts its low-frequency content using a finite impulse response (FIR) low-pass filter based on a Hamming window. The system then performs a cepstral analysis and extracts two distinct features from the signal spectrum: the maximum pitch frequency and the maximum cepstrum value. The resulting 2D feature set is passed to a multi-level classification system built on supervised Support Vector Machines (SVM), which first determines the speaker's gender and then identifies the person within that gender. A total of 120 samples were used to train the classification system, which correctly identifies the speaker with an accuracy, specificity, and sensitivity of 83.33%, 86.67%, and 80%, respectively.
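The front end described in the abstract (Hamming-window FIR low-pass filtering followed by cepstral analysis) might be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function names, the 101-tap filter length, the 1000 Hz cutoff, and the 80 to 400 Hz pitch search range are all hypothetical choices that the abstract does not specify, and the pitch is read off the largest cepstral peak as one plausible interpretation of the two features. The SVM stage is not shown.

```python
import numpy as np


def hamming_lowpass(num_taps, cutoff_hz, fs):
    """Design windowed-sinc FIR low-pass taps shaped by a Hamming window."""
    n = np.arange(num_taps) - (num_taps - 1) / 2.0
    fc = cutoff_hz / fs                      # cutoff in cycles per sample
    taps = 2.0 * fc * np.sinc(2.0 * fc * n)  # ideal low-pass impulse response
    taps *= np.hamming(num_taps)             # taper to reduce stopband ripple
    return taps / taps.sum()                 # normalize to unity gain at DC


def cepstral_features(signal, fs, fmin=80.0, fmax=400.0):
    """Return (pitch_hz, peak_value) taken from the real cepstrum.

    fmin and fmax bound the pitch search and are assumed values.
    """
    frame = signal * np.hamming(len(signal))
    log_mag = np.log(np.abs(np.fft.rfft(frame)) + 1e-12)  # avoid log(0)
    cepstrum = np.fft.irfft(log_mag)
    # Quefrency (in samples) corresponds to a pitch period, so the search
    # range fmin..fmax maps to lags fs/fmax..fs/fmin.
    qmin = int(fs / fmax)
    qmax = min(int(fs / fmin), len(cepstrum) // 2)
    peak = qmin + int(np.argmax(cepstrum[qmin:qmax + 1]))
    return fs / peak, float(cepstrum[peak])


def extract_features(signal, fs, num_taps=101, cutoff_hz=1000.0):
    """Low-pass filter the signal, then compute the 2D cepstral feature pair.

    num_taps and cutoff_hz are hypothetical defaults, not values from the paper.
    """
    taps = hamming_lowpass(num_taps, cutoff_hz, fs)
    filtered = np.convolve(signal, taps, mode="same")
    return cepstral_features(filtered, fs)
```

Calling `extract_features(samples, fs)` on a mono recording would yield one (pitch, cepstral peak) pair per utterance, and such pairs could then be fed to a two-stage classifier of the kind the abstract describes.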
Pages: 649-653
Number of pages: 5