Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering

被引:41
|
作者
Zhao, Kaili [1 ]
Chu, Wen-Sheng [2 ]
Martinez, Aleix M. [3 ]
机构
[1] Beijing Univ Posts & Telecom, Sch Comm & Info Engn, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[3] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1109/CVPR.2018.00223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a scalable weakly supervised clustering approach to learn facial action units (AUs) from large, freely available web images. Unlike most existing methods (e.g., CNNs) that rely on fully annotated data, our method exploits web images with inaccurate annotations. Specifically, we derive a weakly-supervised spectral algorithm that learns an embedding space to couple image appearance and semantics. The algorithm has efficient gradient update, and scales up to large quantities of images with a stochastic extension. With the learned embedding space, we adopt rank-order clustering to identify groups of visually and semantically similar images, and re-annotate these groups for training AU classifiers. Evaluation on the 1 millon EmotioNet dataset demonstrates the effectiveness of our approach: (1) our learned annotations reach on average 91.3% agreement with human annotations on 7 common AUs, (2) classifiers trained with re-annotated images perform comparably to, sometimes even better than, its supervised CNN-based counterpart, and (3) our method offers intuitive outlier/noise pruning instead of forcing one annotation to every image. Code is available.(1)
引用
收藏
页码:2090 / 2099
页数:10
相关论文
共 50 条
  • [1] Weakly Supervised Action Recognition and Localization Using Web Images
    Liu, Cuiwei
    Wu, Xinxiao
    Jia, Yunde
    COMPUTER VISION - ACCV 2014, PT V, 2015, 9007 : 642 - 657
  • [2] Weakly Supervised Dual Learning for Facial Action Unit Recognition
    Wang, Shangfei
    Peng, Guozhu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3218 - 3230
  • [3] CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images
    Guo, Sheng
    Huang, Weilin
    Zhang, Haozhi
    Zhuang, Chenfan
    Dong, Dengke
    Scott, Matthew R.
    Huang, Dinglong
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 139 - 154
  • [4] Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition
    Yan, Jingwei
    Wang, Jingjing
    Li, Qiang
    Wang, Chunmao
    Pu, Shiliang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1760 - 1772
  • [5] Learning CNNs from weakly annotated facial images
    Franc, Vojtech
    Cech, Jan
    IMAGE AND VISION COMPUTING, 2018, 77 : 10 - 20
  • [6] Exploiting Web Images for Weakly Supervised Object Detection
    Tao, Qingyi
    Yang, Hao
    Cai, Jianfei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1135 - 1146
  • [7] Learning realistic facial expressions from web images
    Yu, Kaimin
    Wang, Zhiyong
    Zhuo, Li
    Wang, Jiajun
    Chi, Zheru
    Feng, Dagan
    PATTERN RECOGNITION, 2013, 46 (08) : 2144 - 2155
  • [8] A Weakly Supervised learning technique for classifying facial expressions
    Happy, S. L.
    Dantcheva, Antitza
    Bremond, Francois
    PATTERN RECOGNITION LETTERS, 2019, 128 : 162 - 168
  • [9] LIGHTWEIGHT FACIAL LANDMARK DETECTION WITH WEAKLY SUPERVISED LEARNING
    Lai, Shenqi
    Liu, Lei
    Chai, Zhenhua
    Wei, Xiaolin
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [10] Weakly Supervised Action Selection Learning in Video
    Ma, Junwei
    Gorti, Satya Krishna
    Volkovs, Maksims
    Yu, Guangwei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7583 - 7592