Teacher-Student Model Using Grounding DINO and You Only Look Once for Multi-Sensor-Based Object Detection

Authors
Son, Jinhwan [1]
Jung, Heechul [1]
Affiliations
[1] Kyungpook Natl Univ, Dept Artificial Intelligence, Daegu 41566, South Korea
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 06
Keywords
deep learning; computer vision; object detection; auto-labeling
DOI
10.3390/app14062232
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Object detection, the task of locating and classifying objects within images, is a central research topic in computer vision and artificial intelligence. Deep learning detectors such as YOLO (You Only Look Once), Faster R-CNN, and SSD (Single Shot MultiBox Detector) have demonstrated high performance on this task. This study uses the YOLOv8 model for real-time object detection in settings that require fast inference, specifically CCTV and automotive dashcam scenarios. Experiments were conducted on the 'Multi-Image Identical Situation and Object Identification Data' provided by AI Hub, which comprises three types of multi-image datasets capturing identical situations with CCTV, dashcams, and smartphones. Although YOLO is useful in these settings, its performance on the AI Hub dataset leaves room for improvement. To address this, the study employs Grounding DINO, a zero-shot object detector with high mAP. Grounding DINO enables efficient auto-labeling, but its inference is slower than YOLO's, which makes it unsuitable for real-time detection. The study therefore runs object detection experiments with the publicly available labels and also uses Grounding DINO as a teacher model for auto-labeling; the generated labels are used to train YOLO as a student model, and the resulting performance is compared and analyzed. The experimental results show that training on auto-generated labels does not degrade detection performance, and that combining auto-generated labels with manual labels significantly improves it. In addition, an analysis of datasets drawn from the different devices (CCTV, dashcams, and smartphones) shows how device type affects recognition accuracy across devices.
Through Grounding DINO, this study demonstrates that auto-labeling can improve both efficiency and performance in object detection, and that it is practically applicable.
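The abstract does not say how auto-generated and manual labels are combined, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes teacher detections arrive as pixel-space boxes with confidence scores, keeps those above a score threshold, drops any that overlap a manual box of the same class, and writes the result in the normalized `class x_center y_center width height` text format used for YOLO training labels. The function names, the thresholds (`score_thr`, `iou_thr`), and the merge rule are all hypothetical.

```python
from typing import List, Tuple

# Pixel-space box: (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two pixel-space boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_labels(
    manual: List[Tuple[int, Box]],
    teacher: List[Tuple[int, Box, float]],
    score_thr: float = 0.35,  # assumed value, not from the paper
    iou_thr: float = 0.5,     # assumed value, not from the paper
) -> List[Tuple[int, Box]]:
    """Keep all manual labels; add confident teacher boxes that do not
    duplicate a manual box of the same class (hypothetical merge rule)."""
    merged = list(manual)
    for cls, box, score in teacher:
        if score < score_thr:
            continue  # discard low-confidence teacher detections
        duplicate = any(
            cls == m_cls and iou(box, m_box) >= iou_thr
            for m_cls, m_box in manual
        )
        if not duplicate:
            merged.append((cls, box))
    return merged


def to_yolo_line(cls: int, box: Box, img_w: int, img_h: int) -> str:
    """Format one box as a normalized YOLO label line."""
    x0, y0, x1, y1 = box
    cx = (x0 + x1) / 2 / img_w
    cy = (y0 + y1) / 2 / img_h
    w = (x1 - x0) / img_w
    h = (y1 - y0) / img_h
    return f"{cls} {cx:.6f} {cy:.6f} {w:.6f} {h:.6f}"


if __name__ == "__main__":
    manual = [(0, (10.0, 10.0, 110.0, 110.0))]
    teacher = [
        (0, (12.0, 12.0, 108.0, 112.0), 0.90),   # overlaps the manual box
        (1, (200.0, 50.0, 300.0, 150.0), 0.80),  # new object, kept
        (1, (0.0, 0.0, 5.0, 5.0), 0.10),         # below score threshold
    ]
    for cls, box in merge_labels(manual, teacher):
        print(to_yolo_line(cls, box, img_w=640, img_h=480))
```

In this sketch the manual labels always take precedence, so the teacher only fills in objects the annotators missed; other merge policies (for example, replacing manual boxes with higher-scoring teacher boxes) would need a different rule.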
Pages: 13