Autonomous Framework for Person Identification by Analyzing Vocal Sounds and Speech Patterns

Authors
Hassan, Bilal [1]
Ahmed, Ramsha [2]
Li, Bo [3]
Hassan, Omar [4]
Hassan, Taimur [5]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[4] Sir Syed CASE Inst Technol SSCIT, Dept Elect & Comp Engn, Islamabad, Pakistan
[5] Natl Univ Sci & Technol NUST, Dept Comp & Software Engn, Islamabad, Pakistan
Funding
National Key Research and Development Program of China
Keywords
speech processing; cepstrum; Support Vector Machines (SVM); speaker identification
DOI
10.1109/iccar.2019.8813463
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Speech processing has emerged as one of the most important research domains of the past decade. Many researchers have worked on voice recognition and verification, and some of the reported work has been done in the field of biometrics. This paper proposes an autonomous algorithm for identifying a person by analyzing their vocal sounds and speech patterns. First, the input voice signal is fed to the proposed system, which extracts its low-frequency content using a finite impulse response (FIR) low-pass filter based on a Hamming window. The system then performs a cepstral analysis and extracts two distinct features from the signal spectrum: the maximum pitch frequency and the maximum cepstrum value. The resulting 2D feature set is passed to a multi-level classification system built on supervised Support Vector Machines (SVM), which first determines the person's gender and then identifies the person within that gender group. A total of 120 samples were used to train the proposed classification system, which correctly identifies the speaker with an accuracy, specificity, and sensitivity of 83.33%, 86.67%, and 80%, respectively.
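The filtering and cepstral stages described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the cutoff frequency, filter length, frame length, or pitch search range, so `cutoff_hz`, `num_taps`, `fmin`, and `fmax` are assumed values, and the function names `hamming_lowpass` and `cepstral_features` are hypothetical. The sketch processes a single frame and returns the pitch at the cepstral peak together with the peak value; how the paper aggregates these into a "maximum" over a recording is not stated. The two-stage SVM classifier is not covered.

```python
import numpy as np


def hamming_lowpass(cutoff_hz, fs, num_taps=101):
    """Windowed-sinc FIR low-pass filter shaped by a Hamming window.

    num_taps is an assumed filter length, not a value from the paper.
    """
    n = np.arange(num_taps) - (num_taps - 1) / 2  # tap indices centred on zero
    fc = cutoff_hz / fs                           # cutoff in cycles per sample
    h = 2 * fc * np.sinc(2 * fc * n)              # truncated ideal low-pass response
    h *= np.hamming(num_taps)                     # Hamming window reduces truncation ripple
    return h / h.sum()                            # normalise to unity gain at DC


def cepstral_features(frame, fs, cutoff_hz=1000.0, fmin=60.0, fmax=400.0):
    """Return (pitch_hz, cepstral_peak) for one frame of speech.

    cutoff_hz, fmin and fmax are illustrative assumptions.
    """
    frame = np.asarray(frame, dtype=float)

    # 1. Keep the low-frequency content with the Hamming-window FIR filter.
    filtered = np.convolve(frame, hamming_lowpass(cutoff_hz, fs), mode="same")

    # 2. Real cepstrum: inverse FFT of the log magnitude spectrum.
    windowed = filtered * np.hamming(len(filtered))
    log_mag = np.log(np.abs(np.fft.rfft(windowed)) + 1e-12)  # small offset avoids log(0)
    cepstrum = np.fft.irfft(log_mag, n=len(windowed))

    # 3. Search for the peak only at quefrencies that map to plausible pitch values.
    qmin = int(fs / fmax)
    qmax = min(int(fs / fmin), len(cepstrum) // 2)
    peak = qmin + int(np.argmax(cepstrum[qmin:qmax]))

    # Quefrency (in samples) of the peak is the pitch period.
    return fs / peak, float(cepstrum[peak])
```

Calling `cepstral_features(frame, fs)` on each voiced frame would yield one `(pitch, peak)` pair per frame; a 2D feature vector of the kind the abstract describes could then be formed from those pairs and passed to a classifier.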
Pages: 649-653
Number of pages: 5