HCLT-YOLO: A Hybrid CNN and Lightweight Transformer Architecture for Object Detection in Complex Traffic Scenes

被引:0
|
作者
Chen, Zhige [1 ]
Yang, Kai [1 ]
Wu, Yandong [1 ]
Yang, Hao [2 ]
Tang, Xiaolin [1 ]
机构
[1] Chongqing Univ, Coll Mech & Vehicle Engn, State Key Lab Mech Transmiss Adv Equipment, Chongqing 400030, Peoples R China
[2] Chongqing Univ Technol, Key Lab Adv Mfg Technol Automobile Parts, Minist Educ, Chongqing 400054, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous driving; deep learning; lightweight transformer; traffic sign detection; FRAMEWORK;
D O I
10.1109/TVT.2024.3496513
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The swift and accurate detection of traffic signs in traffic scenes is a pivotal aspect of environmental perception technology in autonomous driving systems. Traffic signs provide essential road information and regulatory instructions, which are critical to ensuring road safety. This paper presents the HCLTYOLO model to address the challenges of false alarms and missed detections in complex traffic environments. Specifically, we propose a novel hybrid CNN-transformer network architecture that efficiently integrates both local and global features, thereby improving traffic sign feature representation. To further enhance the model & acirc;s sensitivity to small traffic signs, we optimize the structure by introducing a dedicated small-object detection layer through upsampling and by leveraging SIoU to improve detection accuracy and computational efficiency. However, the addition of the small object detection layer and the Transformer module increases the overall computational complexity and parameter count, potentially affecting real-time performance. To address this issue, we introduce the DG-C2f module, which employs linear transformations for feature mapping, streamlining the convolution process and enhancing real-time feasibility. Experimental evaluations on the GTSDB and TT100K datasets demonstrate that the proposed model improves detection accuracy by 2.5% and 6.8%, respectively, compared to YOLOv8s models. Notably, the detection accuracy for small traffic signs improved significantly, by 6.9% and 11.7%, respectively. Additionally, processor-in-the-loop experiments on the NVIDIA Jetson AGX Orin show that the model achieves an inference speed of 46 FPS, meeting the real-time requirements for in-vehicle applications.
引用
收藏
页码:3681 / 3694
页数:14
相关论文
共 50 条
  • [41] TC-Radar: Transformer-CNN Hybrid Network for Millimeter-Wave Radar Object Detection
    Jia, Fengde
    Li, Chenyang
    Bi, Siyi
    Qian, Junhui
    Wei, Leizhe
    Sun, Guohao
    REMOTE SENSING, 2024, 16 (16)
  • [42] Vehicular Trajectory Classification and Traffic Anomaly Detection in Videos Using a Hybrid CNN-VAE Architecture
    Santhosh, Kelathodi Kumaran
    Dogra, Debi Prosad
    Roy, Partha Pratim
    Mitra, Adway
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 11891 - 11902
  • [43] A method of object detection with attention mechanism and C2f_DCNv2 for complex traffic scenes☆
    Cai, Fei
    Qu, Zhong
    Xia, Shufang
    Wang, Shiyan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [44] Object Detection in Complex Traffic Scenes Based on Environmental Perception Attention and Three-Scale Feature Fusion
    Yuan, Chunmiao
    Liu, Jinlong
    Wang, Haobo
    Yang, Qingyong
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [45] A Novel Convolutional Neural Network Based Architecture for Object Detection and Recognition with an Application to Traffic Sign Recognition from Road Scenes
    Karthika, R.
    Parameswaran, Latha
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (02) : 351 - 362
  • [46] A Novel Convolutional Neural Network Based Architecture for Object Detection and Recognition with an Application to Traffic Sign Recognition from Road Scenes
    R. Karthika
    Latha Parameswaran
    Pattern Recognition and Image Analysis, 2022, 32 : 351 - 362
  • [47] A Hybrid CNN-Transformer Network for Object Detection in Optical Remote Sensing Images: Integrating Local and Global Feature Fusion
    Huang, Youxiang
    Jiao, Donglai
    Huang, Xingru
    Tang, Tiantian
    Gui, Guan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 241 - 254
  • [48] AstroYOLO: A hybrid CNN-Transformer deep-learning object-detection model for blue horizontal-branch stars
    He, Yuchen
    Wu, Jingjing
    Wang, Wenyu
    Jiang, Bin
    Zhang, Yanxia
    PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF JAPAN, 2023, 75 (06) : 1311 - 1323
  • [49] Lightweight Real-Time Object Detection via Enhanced Global Perception and Intra-Layer Interaction for Complex Traffic Scenarios
    Liang, Ben
    Su, Jia
    Feng, Kangkang
    Zhang, Yongqiang
    Hou, Weimin
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (04) : 312 - 320
  • [50] A method of cross-layer fusion multi-object detection and recognition based on improved faster R-CNN model in complex traffic environment ?
    Li, Cui-jin
    Qu, Zhong
    Wang, Sheng-ye
    Liu, Ling
    PATTERN RECOGNITION LETTERS, 2021, 145 : 127 - 134