Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering

被引：41

作者：

Zhao, Kaili ^{[1
]}

Chu, Wen-Sheng ^{[2
]}

Martinez, Aleix M. ^{[3
]}

机构：

[1] Beijing Univ Posts & Telecom, Sch Comm & Info Engn, Beijing, Peoples R China

[2] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

[3] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

基金：

美国国家卫生研究院;

关键词：

D O I：

10.1109/CVPR.2018.00223

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a scalable weakly supervised clustering approach to learn facial action units (AUs) from large, freely available web images. Unlike most existing methods (e.g., CNNs) that rely on fully annotated data, our method exploits web images with inaccurate annotations. Specifically, we derive a weakly-supervised spectral algorithm that learns an embedding space to couple image appearance and semantics. The algorithm has efficient gradient update, and scales up to large quantities of images with a stochastic extension. With the learned embedding space, we adopt rank-order clustering to identify groups of visually and semantically similar images, and re-annotate these groups for training AU classifiers. Evaluation on the 1 millon EmotioNet dataset demonstrates the effectiveness of our approach: (1) our learned annotations reach on average 91.3% agreement with human annotations on 7 common AUs, (2) classifiers trained with re-annotated images perform comparably to, sometimes even better than, its supervised CNN-based counterpart, and (3) our method offers intuitive outlier/noise pruning instead of forcing one annotation to every image. Code is available.(1)

引用

页码：2090 / 2099

页数：10

共 50 条

[1] Weakly Supervised Action Recognition and Localization Using Web Images
Liu, Cuiwei
Wu, Xinxiao
Jia, Yunde
COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 642 - 657
[2] Weakly Supervised Dual Learning for Facial Action Unit Recognition
Wang, Shangfei
Peng, Guozhu
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3218 - 3230
[3] CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images
Guo, Sheng
Huang, Weilin
Zhang, Haozhi
Zhuang, Chenfan
Dong, Dengke
Scott, Matthew R.
Huang, Dinglong
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 139 - 154
[4] Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition
Yan, Jingwei
Wang, Jingjing
Li, Qiang
Wang, Chunmao
Pu, Shiliang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1760 - 1772
[5] Learning CNNs from weakly annotated facial images
Franc, Vojtech
Cech, Jan
IMAGE AND VISION COMPUTING, 2018, 77 : 10 - 20
[6] Exploiting Web Images for Weakly Supervised Object Detection
Tao, Qingyi
Yang, Hao
Cai, Jianfei
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1135 - 1146
[7] Learning realistic facial expressions from web images
Yu, Kaimin
Wang, Zhiyong
Zhuo, Li
Wang, Jiajun
Chi, Zheru
Feng, Dagan
PATTERN RECOGNITION, 2013, 46 (08) : 2144 - 2155
[8] A Weakly Supervised learning technique for classifying facial expressions
Happy, S. L.
Dantcheva, Antitza
Bremond, Francois
PATTERN RECOGNITION LETTERS, 2019, 128 : 162 - 168
[9] LIGHTWEIGHT FACIAL LANDMARK DETECTION WITH WEAKLY SUPERVISED LEARNING
Lai, Shenqi
Liu, Lei
Chai, Zhenhua
Wei, Xiaolin
2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
[10] Weakly Supervised Action Selection Learning in Video
Ma, Junwei
Gorti, Satya Krishna
Volkovs, Maksims
Yu, Guangwei
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7583 - 7592

← 1 2 3 4 5 →