Facial expression recognition using facial-componentbased bag of words and PHOG descriptors

被引：8

作者：

Li Z. ^{[1
]}

Imai J.-I. ^{[1
]}

Kaneko M. ^{[1
]}

机构：

[1] Department of Electronic Engineering, University of Electro Communications, Tokyo 182-8585, 1-5-1 Chofugaoka, Chofu-shi

来源：

Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers | 2010年 / 64卷 / 02期

关键词：

Appearance extraction; Bag of words; Facial expression recognition; PHOG; Shape extraction; SIFT;

D O I：

10.3169/itej.64.230

中图分类号：

学科分类号：

摘要：

Facial expression recognition has many potential applications in areas such as human-computer interaction (HCI), emotion analysis, and synthetic face animation. This paper proposes a novel framework of facial appearance and shape information extraction for facial expression recognition. For appearance information extraction, a facial-componentbased bag of words method is presented. We segment face images into four component regions: forehead, eye-eyebrow, nose, and mouth. We then partition them into 4 × 4 sub-regions. Dense SIFT (scale-invariant feature transform) features are calculated over the sub-regions and vector quantized into 4 × 4 sets of codeword distributions. For shape information extraction, PHOG (pyramid histogram of orientated gradient) descriptors are computed on the four facial component regions to obtain the spatial distribution of edges. Multi-class SVM classifiers are applied to classify the six basic facial expressions using the facial-component-based bag of words and PHOG descriptors respectively. Then the appearance and shape information is fused at decision level to further improve the recognition rate. Our framework provides holistic characteristics for the local texture and shape features by enhancing the structure-based spatial information, and makes it possible to use the bag of words method and the local descriptors in facial expression recognition for the first time. The recognition rate achieved by the fusion of appearance and shape features at decision level using the Cohn-Kanade database is 96.33%, which outperforms the state-of-the-art research works.

引用

页码：230 / 236

页数：6

共 50 条

[1] Facial-component-based Bag of Words and PHOG Descriptor for Facial Expression Recognition
Li, Zisheng
Imai, Jun-ichi
Kaneko, Masahide
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1353 - 1358
[2] Facial expression recognition using bag of distances
Fu-Song Hsu
Wei-Yang Lin
Tzu-Wei Tsai
Multimedia Tools and Applications, 2014, 73 : 309 - 326
[3] Facial expression recognition using bag of distances
Hsu, Fu-Song
Lin, Wei-Yang
Tsai, Tzu-Wei
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 73 (01) : 309 - 326
[4] Facial Expression Recognition Based on PHOG Feature and Sparse Representation
Wang Hui
Gao Jing
Tong Lifeng
Yu Lijun
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3869 - 3874
[5] Robust dynamic facial expressions recognition using Lbp-Top descriptors and Bag-of-Words classification model
Spizhevoy A.S.
Spizhevoy, A.S. (Alexey.Spizhevoy@itseez.com), 1600, Izdatel'stvo Nauka (26): : 216 - 220
[6] Affective Computing: Using Covariance Descriptors for Facial Expression Recognition
Naidoo, Ashaylin
Tapamo, Jules Raymond
Khutlang, Rethabile
IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 435 - 443
[7] Exploring Bag of Words Architectures in the Facial Expression Domain
Sikka, Karan
Wu, Tingfan
Susskind, Josh
Bartlett, Marian
COMPUTER VISION - ECCV 2012, PT II, 2012, 7584 : 250 - 259
[8] Facial Expression Recognition Using Facial Graph
Mohseni, Sina
Zarei, Niloofar
Miandji, Ehsan
FACE AND FACIAL EXPRESSION RECOGNITION FROM REAL WORLD VIDEOS, 2015, 8912 : 58 - 66
[9] Automatic facial expression recognition based on spatiotemporal descriptors
Ji, Yi
Idrissi, Khalid
PATTERN RECOGNITION LETTERS, 2012, 33 (10) : 1373 - 1380
[10] Evaluation of Spatiotemporal Detectors and Descriptors for Facial Expression Recognition
Hayat, M.
Bennamoun, M.
El-Sallam, A.
2012 5TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI 2012), 2012, : 43 - 47

← 1 2 3 4 5 →