Maximizing information content in feature extraction

Cited by: 15
Authors
Padmanabhan, M [1]
Dharanipragada, S
Affiliations
[1] Renaissance Technol, E Setauket, NY 11733 USA
[2] Citadel Investment Grp, Chicago, IL 60603 USA
Keywords
classifiers; optimal feature projections; optimum feature extraction; penalized mutual information; speech recognition;
DOI
10.1109/TSA.2005.848876
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this paper, we consider the problem of quantifying how much information a set of features contains for discriminating between classes. We explore these ideas in the context of a speech recognition system, where an important classification sub-problem is to predict the phonetic class given an observed acoustic feature vector. The connection between information content and speech recognition system performance is first explored for various feature extraction schemes used in speech recognition applications. The idea of optimizing the information content to improve recognition accuracy is then generalized to a linear projection of the underlying features. We show that several prior methods for computing linear transformations (such as linear/heteroscedastic discriminant analysis) can be interpreted within this general framework of maximizing the information content. We then extend this reasoning and propose a new objective function that maximizes a penalized mutual information (pMI) measure. This objective function is seen to be very well correlated with the word error rate of the final system. Finally, experimental results show that the proposed pMI projection consistently outperforms other methods in a variety of cases, leading to relative improvements in the word error rate of 5%-16% over earlier methods.
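As a rough illustration of the quantity the abstract refers to, the sketch below estimates the mutual information between class labels and (optionally projected) features. It is not the paper's pMI objective: it is a plain plug-in estimate that assumes one full-covariance Gaussian per class, and the function names `gaussian_logpdf` and `mutual_information`, the `projection` argument, and the small covariance regularizer are all assumptions made for this example.

```python
import numpy as np


def gaussian_logpdf(x, mean, cov):
    """Log density of each row of x under a Gaussian N(mean, cov)."""
    d = x.shape[1]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    # Mahalanobis distance via a linear solve instead of an explicit inverse.
    solved = np.linalg.solve(cov, diff.T).T
    maha = np.sum(diff * solved, axis=1)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def mutual_information(features, labels, projection=None):
    """Plug-in estimate (in nats) of I(C; y), where y = projection @ x.

    Assumption for this sketch: each class is modeled by a single
    full-covariance Gaussian fitted to the projected features.
    """
    y = features if projection is None else features @ projection.T
    classes, counts = np.unique(labels, return_counts=True)
    priors = counts / counts.sum()

    # log p(c) + log p(y | c) for every sample and every class.
    log_joint = np.empty((y.shape[0], classes.size))
    for k, c in enumerate(classes):
        yc = y[labels == c]
        mean = yc.mean(axis=0)
        cov = np.atleast_2d(np.cov(yc, rowvar=False))
        cov = cov + 1e-6 * np.eye(y.shape[1])  # illustrative regularizer
        log_joint[:, k] = np.log(priors[k]) + gaussian_logpdf(y, mean, cov)

    # Posterior log p(c | y) by normalizing over classes.
    log_marginal = np.logaddexp.reduce(log_joint, axis=1)
    log_post = log_joint - log_marginal[:, None]

    class_entropy = -np.sum(priors * np.log(priors))                      # H(C)
    cond_entropy = -np.mean(np.sum(np.exp(log_post) * log_post, axis=1))  # H(C | y)
    return class_entropy - cond_entropy
```

Passing different candidate `projection` matrices (one row per output dimension) and comparing the returned values gives a crude way to rank linear projections by how much class information they retain, which is the general idea the abstract describes.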
Pages: 512-519
Page count: 8