Autonomous Framework for Person Identification by Analyzing Vocal Sounds and Speech Patterns

被引:0
|
作者
Hassan, Bilal [1 ]
Ahmed, Ramsha [2 ]
Li, Bo [3 ]
Hassan, Omar [4 ]
Hassan, Taimur [5 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[4] Sir Syed CASE Inst Technol SSCIT, Dept Elect & Comp Engn, Islamabad, Pakistan
[5] Natl Univ Sci & Technol NUST, Dept Comp & Software Engn, Islamabad, Pakistan
基金
国家重点研发计划;
关键词
speech processing; cepstrum; Support Vector Machines (SVM); SPEAKER IDENTIFICATION;
D O I
10.1109/iccar.2019.8813463
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech processing has emerged as one of the important and crucial domain over the past decade. Many researchers have worked on voice recognition and verification. Some of the reported work has been done in the field of biometrics. However, this paper proposes an autonomous algorithm for the person identification by analyzing their vocal sounds and speech patterns. First, the input voice signal is introduced to our proposed system from which the low frequency contents are extracted using finite response low pass filter based on hamming window. Then the proposed system performs a cepstral analysis and extracts two distinct features from the signal spectrum i.e. the maximum pitch frequency and maximum cepstrum value. The 2D extracted feature set is passed on to the multi-level classification system constructed on supervised Support Vector Machine (SVM), which first discriminates between the person's gender and then classifies the person based on the gender. Total 120 samples were used to train the proposed classification system and the proposed system correctly identifies the speaker with the accuracy, specificity and sensitivity of 83.33% 86.67% and 80% respectively.
引用
收藏
页码:649 / 653
页数:5
相关论文
共 50 条
  • [1] DETECTION AND IDENTIFICATION OF SPEECH SOUNDS USING CORTICAL ACTIVITY PATTERNS
    Centanni, T. M.
    Sloan, A. M.
    Reed, A. C.
    Engineer, C. T.
    Rennaker, R. L., II
    Kilgard, M. P.
    NEUROSCIENCE, 2014, 258 : 292 - 306
  • [2] EFFECTS OF VOCAL FORCE ON THE INTELLIGIBILITY OF SPEECH SOUNDS
    PICKETT, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (05): : 902 - 905
  • [3] IDENTIFICATION OF TURBULENT SPEECH SOUNDS
    WIREN, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1957, 29 (11): : 1255 - 1255
  • [4] VOCAL TRACT MODEL CREATES ACCURATE SPEECH SOUNDS
    不详
    BELL LABORATORIES RECORD, 1967, 45 (01): : 24 - &
  • [5] Intuitive identification of infant vocal sounds by parents
    Oller, DK
    Eilers, RE
    Basinger, D
    DEVELOPMENTAL SCIENCE, 2001, 4 (01) : 49 - 60
  • [6] Stethoscope-Sensed Speech and Breath-Sounds for Person Identification With Sparse Training Data
    Van-Thuan Tran
    Tsai, Wei-Ho
    IEEE SENSORS JOURNAL, 2020, 20 (02) : 848 - 859
  • [7] Representation of the vocal roughness of aperiodic speech sounds in the auditory cortex
    Yrttiaho, Santeri
    Alku, Paavo
    May, Patrick J. C.
    Tiitinen, Hannu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (05): : 3177 - 3185
  • [8] Improved vocal tract model for the analysis of nasal speech sounds
    Liu, MS
    Lacroix, A
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 801 - 804
  • [9] VOCAL AND SPEECH PATTERNS OF DEPRESSIVE PATIENTS
    DARBY, JK
    HOLLIEN, H
    FOLIA PHONIATRICA, 1977, 29 (04): : 279 - 291
  • [10] Response Advantage for the Identification of Speech Sounds
    Moskowitz, Howard S.
    Lee, Wei Wei
    Sussman, Elyse S.
    FRONTIERS IN PSYCHOLOGY, 2020, 11