GIF-SP: GA-based Informative Feature for Noisy Speech Recognition

Cited by: 0
Authors
Tamura, Satoshi [1]
Tagami, Yoji [1]
Hayamizu, Satoru [1]
Affiliations
[1] Gifu Univ, Dept Informat Sci, Gifu, Japan
Keywords
DOI
Not available
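Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract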
This paper proposes a novel discriminative feature extraction method. The method consists of two stages. In the first stage, a classifier is built for each class; it decides whether or not an input vector belongs to that class. The parameters of all the classifiers together form a first transformation. In the second stage, a second transformation is obtained that generates the final feature vector, reducing the dimension and enhancing recognition ability. Both transformations are computed using a genetic algorithm (GA). To evaluate the proposed feature, speech recognition experiments were conducted. Results in the clean training condition show that GIF greatly improves recognition accuracy over conventional MFCC in noisy environments. Multi-condition results also show that the proposed scheme is robust against differences in conditions.
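The two-stage structure described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the classifier form, the fitness functions, or the GA operators, so everything below is an assumption made for illustration: linear one-vs-rest classifiers, binary accuracy as the stage-one fitness, nearest-centroid accuracy as the stage-two fitness, a small real-valued GA with uniform crossover and Gaussian mutation, and synthetic toy data in place of speech features. The names `run_ga`, `stage_one`, `stage_two`, and `extract` are hypothetical and are not taken from the paper.

```python
import numpy as np

def run_ga(fitness, dim, rng, pop_size=30, generations=40, sigma=0.3):
    """Tiny real-valued GA (assumed operators): truncation selection,
    uniform crossover, Gaussian mutation, and one elite survivor."""
    pop = rng.normal(size=(pop_size, dim))
    for _ in range(generations):
        scores = np.array([fitness(ind) for ind in pop])
        order = np.argsort(scores)[::-1]
        parents = pop[order[: pop_size // 2]]            # keep the better half
        idx_a = rng.integers(len(parents), size=pop_size - 1)
        idx_b = rng.integers(len(parents), size=pop_size - 1)
        mask = rng.random((pop_size - 1, dim)) < 0.5     # uniform crossover
        children = np.where(mask, parents[idx_a], parents[idx_b])
        children = children + rng.normal(scale=sigma, size=children.shape)
        pop = np.vstack([parents[:1], children])         # best individual survives unchanged
    scores = np.array([fitness(ind) for ind in pop])
    return pop[np.argmax(scores)]

def add_bias(X):
    """Append a constant 1 so each classifier can learn an offset."""
    return np.hstack([X, np.ones((len(X), 1))])

def stage_one(X, y, n_classes, rng):
    """Stage 1: one binary linear classifier per class (class c vs. rest).
    The stacked classifier parameters form the first transformation W1."""
    Xb = add_bias(X)
    rows = []
    for c in range(n_classes):
        target = np.where(y == c, 1.0, -1.0)
        fit = lambda w, t=target: np.mean(np.sign(Xb @ w) == t)
        rows.append(run_ga(fit, Xb.shape[1], rng))
    return np.vstack(rows)                               # shape (n_classes, D + 1)

def centroid_accuracy(Z, y, n_classes):
    """Assumed stage-two fitness: nearest-class-centroid accuracy."""
    centroids = np.stack([Z[y == c].mean(axis=0) for c in range(n_classes)])
    dists = ((Z[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return np.mean(dists.argmin(axis=1) == y)

def stage_two(Y1, y, n_classes, out_dim, rng):
    """Stage 2: a second linear map W2 that reduces the dimension."""
    fit = lambda v: centroid_accuracy(Y1 @ v.reshape(out_dim, -1).T, y, n_classes)
    return run_ga(fit, out_dim * Y1.shape[1], rng).reshape(out_dim, -1)

def extract(X, W1, W2):
    """Apply both transformations to obtain the final feature vectors."""
    return (add_bias(X) @ W1.T) @ W2.T

# Synthetic toy data (not speech): 3 classes, 4-dimensional inputs.
rng = np.random.default_rng(0)
means = np.array([[0, 0, 0, 0], [3, 3, 0, 0], [0, 3, 3, 0]], dtype=float)
y = np.repeat(np.arange(3), 40)
X = means[y] + rng.normal(scale=0.5, size=(120, 4))

W1 = stage_one(X, y, 3, rng)
W2 = stage_two(add_bias(X) @ W1.T, y, 3, 2, rng)
Z = extract(X, W1, W2)
acc = centroid_accuracy(Z, y, 3)
print(W1.shape, W2.shape, Z.shape, acc)
```

In this sketch the output of stage one has one dimension per class, and stage two maps it down to `out_dim` dimensions. The paper's actual input features, classifier type, and fitness criteria may differ.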
Pages: 4