Attentive Sequences Recurrent Network for Social Relation Recognition from Video

Cited by: 4
Authors
Lv, Jinna [1, 2]
Wu, Bin [1 ]
Zhang, Yunlei [1 ]
Xiao, Yunpeng [3 ]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Beijing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing 400065, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
social relation recognition; video analysis; deep learning; LSTM; attention mechanism;
DOI
10.1587/transinf.2019EDP7104
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Social relation analysis has recently received increasing attention across text and image data. Social relation analysis from video, however, is an important problem that remains underexplored in the current literature. Two challenges stand out: 1) it is hard to learn a satisfactory mapping function from low-level pixels to the high-level social relation space; 2) it is hard to efficiently select the most relevant information from noisy, unsegmented video. In this paper, we present an Attentive Sequences Recurrent Network model, called ASRN, to address these challenges. First, to exploit multiple cues, we design a Multiple Feature Attention (MFA) mechanism that fuses multiple visual features (i.e., image, motion, body, and face). In this way, we can generate an appropriate mapping function from low-level video pixels to the high-level social relation space. Second, we design a sequence recurrent network based on a Global and Local Attention (GLA) mechanism. Specifically, GLA uses an attention mechanism to integrate the global feature with local sequence features and select the sequences most relevant to the recognition task. The GLA module can therefore better handle noisy and unsegmented video. Finally, extensive experiments on the SRIV dataset demonstrate the effectiveness of our ASRN model.
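The abstract describes two attention steps only at a high level, so the following is a minimal sketch of the general idea and not the authors' implementation. It assumes plain dot-product soft attention; the helper names (`softmax`, `dot`, `attend`), the toy 3-dimensional feature vectors, the fixed query vector, and the mean-pooled global feature are all hypothetical choices made for illustration. In the paper the features would come from deep networks and the attention parameters would be learned.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def attend(query, candidates):
    """Soft attention: weight each candidate by softmax(query . candidate)
    and return (weights, weighted sum of the candidates)."""
    weights = softmax([dot(query, c) for c in candidates])
    dim = len(candidates[0])
    fused = [sum(w * c[i] for w, c in zip(weights, candidates))
             for i in range(dim)]
    return weights, fused

# MFA-like step: fuse four visual features of one video segment.
# The vectors and the query are hypothetical toy values.
features = {
    "image":  [0.2, 0.1, 0.0],
    "motion": [0.0, 0.3, 0.1],
    "body":   [0.5, 0.5, 0.2],
    "face":   [0.9, 0.8, 0.1],
}
query = [1.0, 1.0, 0.0]  # assumption: stands in for a learned parameter
mfa_weights, fused = attend(query, list(features.values()))

# GLA-like step: a global feature attends over local sequence features.
# Assumption: the global feature is the mean of the local features.
local_seq = [[0.1, 0.0, 0.0], [0.8, 0.9, 0.1], [0.2, 0.1, 0.3]]
global_feat = [sum(col) / len(local_seq) for col in zip(*local_seq)]
gla_weights, video_repr = attend(global_feat, local_seq)
```

In this toy setup the attention weights sum to one in both steps, and the candidate whose vector aligns best with the query (the face feature in the first step, the second sequence in the second step) receives the largest weight. That is the sense in which attention can down-weight less relevant features or noisy sequences.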
Pages: 2568-2576
Number of pages: 9