Multi-object behaviour recognition based on object detection cascaded image classification in classroom scenes

被引:1
|
作者
Dang, Min [1 ,3 ]
Liu, Gang [1 ,2 ,3 ]
Li, Hao [1 ,3 ]
Xu, Qijie [1 ,3 ]
Wang, Xu [1 ,3 ]
Pan, Rong [1 ,3 ]
机构
[1] Xidian Univ, Sch Life Sci & Technol, 266,Xinglong Sect,Xifeng Rd, Xian 710126, Shaanxi, Peoples R China
[2] Xidian Univ, Guangzhou Inst Technol, 83 Zhiming, Guangzhou 510555, Guangdong, Peoples R China
[3] 266,Xinglong Sect,Xifeng Rd, Xian 710126, Shaanxi, Peoples R China
关键词
Classroom scene; Behaviour recognition; Object detection; Image classification; Vision transformer;
D O I
10.1007/s10489-024-05409-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For multi-object behaviour recognition in classroom scenes, crowded objects have heavy occlusion, invisible keypoints, scale variation, which directly overwhelms the recognition performance. Due to the dense student objects and similar student behaviours, multi-object behaviour recognition brings great challenges. Therefore, we proposed multi-object behaviour recognition based on object detection cascaded image classification. Specifically, object detection extracts student objects, followed by Vision Transformer (ViT) classification of student behaviour. To ensure the accuracy of behaviour recognition, it is first necessary to improve the detection performance of object detection. This paper proposes the Shallow Auxiliary Module for object detection to assist the backbone network in extracting hybrid multi-scale feature information. The multi-scale and multi-channel feature information is fused to alleviate object overlap and scale variation. We propose a Scale Assignment Fusion Mechanism that non-heuristically guides objects to learn the optimal feature layer. Furthermore, the Anchor-free Dynamic Label Assignment can suppress the prediction of low-quality bounding boxes, stabling training and improving detection performance. The proposed student object detector achieves the state-of-the-art mAP50\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<^>{50}$$\end{document} of 88.03 and APl\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_l$$\end{document} of 57.64, outperforming state-of-the-art object detection methods. Our multi-object behaviour recognition method achieves the recognition of four behaviour classes, which is significantly better than the results of other comparison methods.
引用
收藏
页码:4935 / 4951
页数:17
相关论文
共 50 条
  • [1] A Method for Object Recognition and Robot Grasping Detection in Multi-object Scenes
    Zheng, Jiajun
    Zou, Yuanyuan
    Xu, Jie
    Fang, Lingshen
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 189 - 196
  • [2] Learning behaviour recognition based on multi-object image in single viewpoint
    Su K.
    Li X.
    Zhou C.
    Chen X.
    [J]. Personal and Ubiquitous Computing, 2021, 25 (6) : 1081 - 1090
  • [3] Multi-object behavior recognition based on object detection for dense crowds
    Dang, Min
    Liu, Gang
    Xu, Qijie
    Li, Ke
    Wang, Di
    He, Lihuo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [4] Coverless Image Steganography Based on Multi-Object Recognition
    Luo, Yuanjing
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (07) : 2779 - 2791
  • [5] Multi-Object Detection in Traffic Scenes Based on Improved SSD
    Wang, Xinqing
    Hua, Xia
    Xiao, Feng
    Li, Yuyang
    Hu, Xiaodong
    Sun, Pengyu
    [J]. ELECTRONICS, 2018, 7 (11)
  • [6] Multi-Object Tracking Model Based on Detection Tracking Paradigm in Panoramic Scenes
    Shen, Jinfeng
    Yang, Hongbo
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (10):
  • [7] Keypoint-Based Robotic Grasp Detection Scheme in Multi-Object Scenes
    Li, Tong
    Wang, Fei
    Ru, Changlei
    Jiang, Yong
    Li, Jinghong
    [J]. SENSORS, 2021, 21 (06) : 1 - 15
  • [8] Multi-Object Detection of Chinese License Plate in Complex Scenes
    Liu, Dan
    Wu, Yajuan
    He, Yuxin
    Qin, Lu
    Zheng, Bochuan
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (01): : 145 - 156
  • [9] Contour Based Multi-object Classification Technology
    Nie, Qing
    Zhan, Shou-yi
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 795 - +
  • [10] Multi-object road waste detection and classification based on binocular vision
    Guo, He
    Chen, Lumin
    [J]. JOURNAL OF ENGINEERING-JOE, 2024, 2024 (05):