Objective Gender and Age Recognition from Speech Sentences

Cited by: 4
Author
Faek, Fatima K. [1 ]
Affiliation
[1] Salahaddin Univ, Dept Elect, Engn Coll, Zanko St,Kirkuk Rd, Erbil, Kurdistan Regio, Iraq
Source
Keywords
Age classification from speech; gender classification from speech; MFCC based gender and age recognition; SVM classifier;
DOI
10.14500/aro.10072
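Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code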
07 ; 0710 ; 09 ;
Abstract
This work investigates an automatic gender and age recognizer from speech. The features relevant to gender recognition are selected from the first four formant frequencies and twelve MFCCs and fed to an SVM classifier, while the features relevant to age are used with a k-NN classifier for the age recognizer model. MATLAB is used as the simulation tool. Robust features are selected according to the frequency range that each feature represents, in order to improve the results of the gender and age classifiers. The gender and age classification algorithms are evaluated on 114 speech samples (clean and noisy) uttered in the Kurdish language. The two-class gender recognition model (adult males and adult females) reached 96% recognition accuracy, while the three-category model (adult males, adult females, and children) achieved 94% recognition accuracy. For the age recognition model, speakers are categorized into seven groups according to their ages. After selecting the features relevant to age, the model achieved a performance of 75.3%. As a further improvement, a de-noising technique is applied to the noisy speech signals, followed by selection of the features that are affected by the de-noising process, which results in 81.44% recognition accuracy.
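The k-NN stage mentioned in the abstract can be illustrated with a minimal sketch. It is not the author's MATLAB implementation: the two-dimensional feature vectors, the labels, the value of k, and the function name `knn_predict` are all hypothetical, and plain Euclidean distance with majority voting is assumed.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Return the majority label among the k training vectors nearest to query."""
    # Euclidean distance from the query to every training vector.
    dists = [(math.dist(x, query), y) for x, y in zip(train_x, train_y)]
    # Sort by distance only, so labels are never compared with each other.
    dists.sort(key=lambda pair: pair[0])
    # Majority vote over the k nearest neighbours.
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


# Hypothetical, already-normalised feature vectors (illustrative values only,
# not taken from the paper's data).
train_x = [(0.10, 0.20), (0.15, 0.25), (0.20, 0.10),
           (0.80, 0.90), (0.85, 0.80), (0.90, 0.95)]
train_y = ["child", "child", "child", "adult", "adult", "adult"]

print(knn_predict(train_x, train_y, (0.12, 0.18)))
print(knn_predict(train_x, train_y, (0.88, 0.85)))
```

In practice the vectors would hold the selected speech features (for example MFCCs or formant frequencies), and the labels would be the seven age groups described in the abstract.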
Pages: 24-29
Number of pages: 6