Hiligaynon Language 5-Word Vocabulary Speech Recognition Using Mel Frequency Cepstrum Coefficients and Genetic Algorithm

被引:0
|
作者
Billones, Robert Kerwin C. [1 ]
Dadios, Elmer P. [2 ]
机构
[1] De La Salle Univ, Gokongwei Coll Engn, Manila, Philippines
[2] De La Salle Univ, Gokongwei Coll Engn, MEM Dept, Manila, Philippines
关键词
Hiligaynon Language; Speech Processing; Speech Recognition; Mel Frequency Cepstrum Coefficients; Genetic Algorithm; Neighbourhood Selection; Elitist Survival; Adaptive Database System;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the study conducted by the Department of Health National Epidemiology Center, there is a high incidence and mortality rates of breast cancer among Western Visayan women, specifically in Bacolod city, Philippines. The development of breast self-examination (BSE) multimedia training system that can be easily used by the local female population in Western Visayas can help awareness and prevention of this dreaded disease. This system incorporates Hiligaynon speech recognition for motion control commands. Hiligaynon language, popularly known as Ilonggo, is an Austronesian language spoken in the Western Visayas region of the Philippines with approximately 11 million speakers, 7 million of which are native speakers. This study focuses on a 5-word vocabulary Hiligaynon language speech recognition for the BSE multimedia training system with feature extraction using Mel frequency cepstrum coefficients and pattern recognition using genetic algorithm. The genetic algorithm uses Euclidean distance, neighbourhood selection, two point crossover and elitist survival techniques. The system has an adaptive database system which improves the training and classification of the Hiligaynon words. The results showed that the combined Mel frequency cepstrum coefficients and genetic algorithm techniques used together with the adaptive database system can effectively recognized the different Hiligaynon words with 97.50% accuracy.
引用
收藏
页数:6
相关论文
共 23 条
  • [1] A Study of Speech, Speaker and Emotion Recognition using Mel Frequency Cepstrum Coefficients and Support Vector Machines
    Rajasekhar, Ashwini
    Hota, Malaya Kumar
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 114 - 118
  • [2] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Pawar, Manju D.
    Kokate, Rajendra D.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15563 - 15587
  • [3] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Manju D. Pawar
    Rajendra D. Kokate
    [J]. Multimedia Tools and Applications, 2021, 80 : 15563 - 15587
  • [4] Algorithm for Gunshot Detection Using Mel-Frequency Cepstrum Coefficients (MFCC)
    Suman, Preetam
    Karan, Subhdeep
    Singh, Vrijendra
    Maringanti, R.
    [J]. PROCEEDINGS OF NINTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS (WCSN 2013), 2014, 299 : 155 - 166
  • [5] RECOGNITION OF NON-SPEECH SOUNDS USING MEL-FREQUENCY CEPSTRUM COEFFICIENTS AND DYNAMIC TIME WARPING METHOD
    Disken, Gokay
    Ibrikci, Turgay
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 144 - 147
  • [6] ACOUSTIC PORNOGRAPHY RECOGNITION USING FUSED PITCH AND MEL-FREQUENCY CEPSTRUM COEFFICIENTS
    Banaeeyan, Rasoul
    Karim, Hezerul Abdul
    Lye, Haris
    Fauzi, Mohamad Faizal Ahmad
    Mansor, Sarina
    See, John
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY, 2019, 10 (07) : 1335 - 1343
  • [7] Speaker Recognition Using Mel-Frequency Cepstrum Coefficients and Sum Square Error
    Charisma, Atik
    Hidayat, M. Reza
    Zainal, Yuda Bakti
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2017, : 160 - 163
  • [8] Boosting speech/non-speech classification using averaged Mel-frequency Cepstrum Coefficients features
    Xiong, ZY
    Huang, TS
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 573 - 580
  • [9] Improved DTW Speech Recognition Algorithm Based on the MEL Frequency Cepstral Coefficients
    Wei Ming-zhe
    Li Xi
    Ren Li-mian
    [J]. 12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 235 - 238
  • [10] Recognition of Human Speech Emotion Using Variants of Mel-Frequency Cepstral Coefficients
    Palo, Hemanta Kumar
    Chandra, Mahesh
    Mohanty, Mihir Narayan
    [J]. ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 491 - 498