Facial expression recognition using facial-componentbased bag of words and PHOG descriptors

被引:8
|
作者
Li Z. [1 ]
Imai J.-I. [1 ]
Kaneko M. [1 ]
机构
[1] Department of Electronic Engineering, University of Electro Communications, Tokyo 182-8585, 1-5-1 Chofugaoka, Chofu-shi
来源
Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers | 2010年 / 64卷 / 02期
关键词
Appearance extraction; Bag of words; Facial expression recognition; PHOG; Shape extraction; SIFT;
D O I
10.3169/itej.64.230
中图分类号
学科分类号
摘要
Facial expression recognition has many potential applications in areas such as human-computer interaction (HCI), emotion analysis, and synthetic face animation. This paper proposes a novel framework of facial appearance and shape information extraction for facial expression recognition. For appearance information extraction, a facial-componentbased bag of words method is presented. We segment face images into four component regions: forehead, eye-eyebrow, nose, and mouth. We then partition them into 4 × 4 sub-regions. Dense SIFT (scale-invariant feature transform) features are calculated over the sub-regions and vector quantized into 4 × 4 sets of codeword distributions. For shape information extraction, PHOG (pyramid histogram of orientated gradient) descriptors are computed on the four facial component regions to obtain the spatial distribution of edges. Multi-class SVM classifiers are applied to classify the six basic facial expressions using the facial-component-based bag of words and PHOG descriptors respectively. Then the appearance and shape information is fused at decision level to further improve the recognition rate. Our framework provides holistic characteristics for the local texture and shape features by enhancing the structure-based spatial information, and makes it possible to use the bag of words method and the local descriptors in facial expression recognition for the first time. The recognition rate achieved by the fusion of appearance and shape features at decision level using the Cohn-Kanade database is 96.33%, which outperforms the state-of-the-art research works.
引用
收藏
页码:230 / 236
页数:6
相关论文
共 50 条
  • [1] Facial-component-based Bag of Words and PHOG Descriptor for Facial Expression Recognition
    Li, Zisheng
    Imai, Jun-ichi
    Kaneko, Masahide
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1353 - 1358
  • [2] Facial expression recognition using bag of distances
    Fu-Song Hsu
    Wei-Yang Lin
    Tzu-Wei Tsai
    Multimedia Tools and Applications, 2014, 73 : 309 - 326
  • [3] Facial expression recognition using bag of distances
    Hsu, Fu-Song
    Lin, Wei-Yang
    Tsai, Tzu-Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (01) : 309 - 326
  • [4] Facial Expression Recognition Based on PHOG Feature and Sparse Representation
    Wang Hui
    Gao Jing
    Tong Lifeng
    Yu Lijun
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3869 - 3874
  • [5] Robust dynamic facial expressions recognition using Lbp-Top descriptors and Bag-of-Words classification model
    Spizhevoy A.S.
    Spizhevoy, A.S. (Alexey.Spizhevoy@itseez.com), 1600, Izdatel'stvo Nauka (26): : 216 - 220
  • [6] Affective Computing: Using Covariance Descriptors for Facial Expression Recognition
    Naidoo, Ashaylin
    Tapamo, Jules Raymond
    Khutlang, Rethabile
    IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 435 - 443
  • [7] Exploring Bag of Words Architectures in the Facial Expression Domain
    Sikka, Karan
    Wu, Tingfan
    Susskind, Josh
    Bartlett, Marian
    COMPUTER VISION - ECCV 2012, PT II, 2012, 7584 : 250 - 259
  • [8] Facial Expression Recognition Using Facial Graph
    Mohseni, Sina
    Zarei, Niloofar
    Miandji, Ehsan
    FACE AND FACIAL EXPRESSION RECOGNITION FROM REAL WORLD VIDEOS, 2015, 8912 : 58 - 66
  • [9] Automatic facial expression recognition based on spatiotemporal descriptors
    Ji, Yi
    Idrissi, Khalid
    PATTERN RECOGNITION LETTERS, 2012, 33 (10) : 1373 - 1380
  • [10] Evaluation of Spatiotemporal Detectors and Descriptors for Facial Expression Recognition
    Hayat, M.
    Bennamoun, M.
    El-Sallam, A.
    2012 5TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI 2012), 2012, : 43 - 47