MLP-AIR: An effective MLP-based module for actor interaction relation learning in group activity recognition

被引:0
|
作者
Xu, Guoliang [1 ]
Yin, Jianqin [1 ]
Zhang, Shaojie [1 ]
Gong, Moonjun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Group activity recognition; Interaction relation modeling; Feature refinement; Multi-layer perceptron; NETWORK;
D O I
10.1016/j.knosys.2024.112453
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modeling actor interaction relations is crucial for group activity recognition. Previous approaches often adopt a fixed paradigm that involves calculating an affinity matrix to model these interaction relations, yielding significant performance. On the one hand, the affinity matrix introduces an inductive bias that actor interaction relations should be dynamically computed based on the input actor features. On the other hand, MLPs with static parameterization, in which parameters are fixed after training, can represent arbitrary functions. Therefore, it is an open question whether inductive bias is necessary for modeling actor interaction relations. To explore the impact of this inductive bias, we propose an affinity matrix-free paradigm that directly uses the MLP with static parameterization to model actor interaction relations. We term this approach MLP-AIR. This paradigm overcomes the limitations of the inductive bias and enhances the capture of implicit actor interaction relations. Specifically, MLP-AIR consists of two sub-modules: the MLP-based Interaction relation modeling module (MLP-I) and the MLP-based Relation refining module (MLP-R). MLP-I is used to model the spatial-temporal interaction relations by emphasizing cross-actor and cross-frame feature learning. Meanwhile, MLP-R is used to refine the relation between different channels of each relation feature, thereby enhancing the expression ability of the features. MLP-AIR is a plug-and-play module. To evaluate our module, we applied MLP-AIR to replicate three representative methods. We conducted extensive experiments on two widely used benchmarks-the Volleyball and Collective Activity datasets. The experiments demonstrate that MLP-AIR achieves favorable results. The code is available at https://github.com/Xuguoliang12/MLP-AIR.
引用
收藏
页数:9
相关论文
共 7 条
  • [1] Effective MLP and CNN based ensemble learning for speech emotion recognition
    Middya A.I.
    Nag B.
    Roy S.
    [J]. Multimedia Tools and Applications, 2024, 83 (36) : 83963 - 83990
  • [2] ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
    Dai, Chuan
    Wei, Yajuan
    Xu, Zhijie
    Chen, Minsi
    Liu, Ying
    Fan, Jiulun
    [J]. SENSORS, 2023, 23 (05)
  • [3] Learning Actor Relation Graphs for Group Activity Recognition
    Wu, Jianchao
    Wang, Limin
    Wang, Li
    Guo, Jie
    Wu, Gangshan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9956 - 9966
  • [4] Part Based Interaction Learning for Group Activity Recognition
    Tian, Meng
    Lang, Xianglong
    Xiang, Ye
    Huang, Yan
    Wu, Lifang
    Li, Yunxiang
    [J]. 2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [5] Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
    Han, Mingfei
    Zhang, David Junhao
    Wang, Yali
    Yan, Rui
    Yao, Lina
    Chang, Xiaojun
    Qiao, Yu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2980 - 2989
  • [6] Machine Learning Model for Group Activity Recognition Based on Discriminative Interaction Contextual Relationship
    Kulkarni, Smita S.
    Jadhav, Sangeeta
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [7] Group Activity Recognition Based on Interaction Contextual Information in Videos Using Machine Learning
    Kulkarni, Smita S.
    Jadhav, Sangeeta
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (02): : 141 - 150