Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering

Cited by: 41
Authors
Zhao, Kaili [1]
Chu, Wen-Sheng [2]
Martinez, Aleix M. [3]
Affiliations
[1] Beijing Univ Posts & Telecom, Sch Comm & Info Engn, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[3] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
Funding
US National Institutes of Health (NIH)
DOI
10.1109/CVPR.2018.00223
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a scalable weakly supervised clustering approach to learn facial action units (AUs) from large, freely available web images. Unlike most existing methods (e.g., CNNs) that rely on fully annotated data, our method exploits web images with inaccurate annotations. Specifically, we derive a weakly supervised spectral algorithm that learns an embedding space to couple image appearance and semantics. The algorithm has an efficient gradient update and scales up to large quantities of images with a stochastic extension. With the learned embedding space, we adopt rank-order clustering to identify groups of visually and semantically similar images, and re-annotate these groups for training AU classifiers. Evaluation on the 1-million-image EmotioNet dataset demonstrates the effectiveness of our approach: (1) our learned annotations reach on average 91.3% agreement with human annotations on 7 common AUs, (2) classifiers trained with re-annotated images perform comparably to, and sometimes even better than, their supervised CNN-based counterparts, and (3) our method offers intuitive outlier/noise pruning instead of forcing one annotation onto every image. Code is available.
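The re-annotation and pruning step mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not say how a group's label is chosen, so a simple majority vote is assumed here, and the function name `reannotate` and the parameters `min_size` and `min_agreement` are hypothetical. Cluster assignments are taken as given (for example, from rank-order clustering on the learned embedding), with `-1` marking images that joined no cluster.

```python
from collections import Counter, defaultdict


def reannotate(cluster_ids, noisy_labels, min_size=2, min_agreement=0.6):
    """Majority-vote re-annotation per cluster (an assumed rule, not the paper's).

    cluster_ids[i]  : cluster index of image i, or -1 if unclustered
    noisy_labels[i] : weak 0/1 AU annotation of image i
    Returns one new label per image, or None where the image is pruned.
    """
    # Group image indices by cluster, skipping unclustered images.
    members = defaultdict(list)
    for i, c in enumerate(cluster_ids):
        if c != -1:
            members[c].append(i)

    new_labels = [None] * len(cluster_ids)
    for idxs in members.values():
        if len(idxs) < min_size:
            continue  # very small clusters are treated as outliers
        votes = Counter(noisy_labels[i] for i in idxs)
        label, count = votes.most_common(1)[0]
        # Relabel the whole cluster only if the weak labels mostly agree.
        if count / len(idxs) >= min_agreement:
            for i in idxs:
                new_labels[i] = label
    return new_labels


clusters = [0, 0, 0, 1, 1, 1, 1, 2, -1]
weak = [1, 1, 0, 0, 0, 0, 1, 1, 1]
print(reannotate(clusters, weak))
# -> [1, 1, 1, 0, 0, 0, 0, None, None]
```

Images left as `None` are simply excluded from classifier training, which mirrors the idea of pruning outliers rather than forcing an annotation onto every image.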
Pages: 2090-2099
Page count: 10