YOLO-AMM: A Real-Time Classroom Behavior Detection Algorithm Based on Multi-Dimensional Feature Optimization

被引:0
|
作者
Cao, Yi [1 ]
Cao, Qian [2 ]
Qian, Chengshan [1 ,2 ]
Chen, Deji [1 ,3 ]
机构
[1] Wuxi Univ, Sch Internet Things Engn, Wuxi 214105, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[3] Jiangsu Foreign Expert Lab, Wuxi 214105, Peoples R China
关键词
YOLOv8; classroom behavior detection; AEFF; MFFN;
D O I
10.3390/s25041142
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Classroom behavior detection is a key task in constructing intelligent educational environments. However, the existing models are still deficient in detail feature capture capability, multi-layer feature correlation, and multi-scale target adaptability, making it challenging to realize high-precision real-time detection in complex scenes. This paper proposes an improved classroom behavior detection algorithm, YOLO-AMM, to solve these problems. Firstly, we constructed the Adaptive Efficient Feature Fusion (AEFF) module to enhance the fusion of semantic information between different features and improve the model's ability to capture detailed features. Then, we designed a Multi-dimensional Feature Flow Network (MFFN), which fuses multi-dimensional features and enhances the correlation information between features through the multi-scale feature aggregation module and contextual information diffusion mechanism. Finally, we proposed a Multi-Scale Perception and Fusion Detection Head (MSPF-Head), which significantly improves the adaptability of the head to different scale targets by introducing multi-scale feature perception, feature interaction, and fusion mechanisms. The experimental results showed that compared with the YOLOv8n model, YOLO-AMM improved the mAP0.5 and mAP0.5-0.95 by 3.1% and 4.0%, significantly improving the detection accuracy. Meanwhile, YOLO-AMM increased the detection speed (FPS) by 12.9 frames per second to 169.1 frames per second, which meets the requirement for real-time detection of classroom behavior.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] A real-time hand detection system based on multi-feature
    Mei, Kuizhi
    Xu, Lu
    Li, Boliang
    Lin, Bin
    Wang, Fang
    NEUROCOMPUTING, 2015, 158 : 184 - 193
  • [22] CPU Based YOLO: A Real Time Object Detection Algorithm
    Ullah, Md Bahar
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 552 - 555
  • [23] Real-time detection algorithm for digital meters based on multi-scale feature fusion and GCS
    Hao, Zhaoming
    Zhang, Xiaoqiong
    Li, Hongyan
    Xu, Meng
    Zhang, Ziyang
    Wang, Zhan
    Wang, Weifeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (02)
  • [24] A Multi-dimensional Resource Allocation Algorithm Based on Real-Time Services in the Space-Ground Integrated Network
    Qu, Hua
    Wei, Feng
    Zhao, Jihong
    Ma, Nan
    Yu, Yongyue
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2022), 2022, : 161 - 166
  • [25] Real-time detection algorithm for digital meters based on multi-scale feature fusion and GCS
    Zhaoming Hao
    Xiaoqiong Zhang
    Hongyan Li
    Meng Xu
    Ziyang Zhang
    Zhan Wang
    Weifeng Wang
    Journal of Real-Time Image Processing, 2024, 21
  • [26] Real-time pedestrian detection based on improved YOLO model
    Zhao, Congcong
    Chen, Bin
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 2, 2019, : 25 - 28
  • [27] Real-time vehicle detection and counting based on YOLO and DeepSORT
    Thanh-Nghi Doan
    Minh-Tuyen Truong
    2020 12TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (IEEE KSE 2020), 2020, : 67 - 72
  • [28] Multi-Dimensional Scheduling for Real-Time Tasks on Heterogeneous Clusters
    Xiao-Min Zhu
    Pei-Zhong Lu
    Journal of Computer Science and Technology, 2009, 24 : 434 - 446
  • [29] Multi-Dimensional Scheduling for Real-Time Tasks on Heterogeneous Clusters
    朱晓敏
    陆佩忠
    Journal of Computer Science & Technology, 2009, 24 (03) : 434 - 446
  • [30] Multi-dimensional denoising of real-time OCT imaging data
    Ralston, Tyler S.
    Atkinson, Ian
    Kamalabadi, Farzad
    Boppart, Stephen A.
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 2396 - 2399