Improving the Robustness of Pedestrian Detection in Autonomous Driving With Generative Data Augmentation

被引:4
|
作者
Wu, Yalun [1 ]
Xiang, Yingxiao [2 ]
Tong, Endong [1 ]
Ye, Yuqi [1 ]
Cui, Zhibo [1 ]
Tian, Yunzhe [1 ]
Zhang, Lejun [3 ]
Liu, Jiqiang [1 ]
Han, Zhen [1 ]
Niu, Wenjia [1 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Transp, Beijing 100044, Peoples R China
[2] Inst Informat Engn, Chinese Acad Sci, Beijing 100085, Peoples R China
[3] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
来源
IEEE NETWORK | 2024年 / 38卷 / 03期
关键词
Pedestrians; Data augmentation; Data models; Autonomous vehicles; Feature extraction; Semantics; Image capture; pedestrian detection; diffusion model; generative data augmentation; image caption;
D O I
10.1109/MNET.2024.3366232
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian detection plays a crucial role in autonomous driving by identifying the position, size, orientation, and dynamic features of pedestrians in images or videos, assisting autonomous vehicles in making better decisions and controls. It's worth noting that the performance of pedestrian detection models largely depends on the quality and diversity of available training data. Current datasets for autonomous driving have limitations in terms of diversity, scale, and quality. In recent years, numerous studies have proposed the use of data augmentation strategies to expand the coverage of datasets, aiming to maximize the utilization of existing training data. However, these data augmentation methods often overlook the diversity of data scenarios. To overcome this challenge, in this paper, we propose a more comprehensive method for data augmentation, based on image descriptions and diffusion models. This method aims to cover a wider range of scene variations, including different weather conditions and lighting situations. We have designed a classifier to select data samples for augmentation, followed by extracting visual features based on image captions and converting them into high-level semantic information as textual descriptions for the corresponding samples. Finally, we utilize diffusion models to generate new variants. Additionally, we have designed three modification patterns to increase diversity in aspects such as weather conditions, lighting, and pedestrian poses within the data. We conducted extensive experiments on the KITTI dataset and in real-world environments, demonstrating that our proposed method significantly enhances the performance of pedestrian detection models in complex scenarios. This meticulous consideration of data augmentation will notably enhance the applicability and robustness of pedestrian detection models in actual autonomous driving scenarios.
引用
收藏
页码:63 / 69
页数:7
相关论文
共 50 条
  • [1] Advanced Pedestrian Dataset Augmentation for Autonomous Driving
    Vobecky, Antonin
    Uricar, Michal
    Hurych, David
    Skoviera, Radoslav
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2367 - 2372
  • [2] Multispectral pedestrian detection in autonomous driving: A review
    Li, Yuting
    IEIE Transactions on Smart Processing and Computing, 2021, 10 (01): : 10 - 16
  • [3] Targeted Data Augmentation for Improving Model Robustness
    Mikolajczyk-Barela, Agnieszka
    Ferlin, Maria
    Grochowski, Michal
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2025, 35 (01) : 143 - 155
  • [4] Traffic Context Aware Data Augmentation for Rare Object Detection in Autonomous Driving
    Li, Naifan
    Song, Fan
    Zhang, Ying
    Liang, Pengpeng
    Cheng, Erkang
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 4548 - 4554
  • [5] A Robust and Efficient Pedestrian Detection Approach for Autonomous Driving
    Wang, Yijing
    Zheng, Kaifu
    Zuo, Zhiqiang
    Zhang, Zhicheng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 7666 - 7671
  • [6] GPU-based pedestrian detection for autonomous driving
    Campmany, V.
    Silva, S.
    Espinosa, A.
    Moure, J. C.
    Vazquez, D.
    Lopez, A. M.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 2377 - 2381
  • [7] Real-time Pedestrian Detection for Autonomous Driving
    Yang, Zhiheng
    Li, Jun
    Li, Huiyun
    2018 INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS), 2018, : 9 - 13
  • [8] Toward Robust Pedestrian Detection With Data Augmentation
    Cygert, Sebastian
    Czyzewski, Andrzej
    IEEE ACCESS, 2020, 8 (08): : 136674 - 136683
  • [9] Improving RGB-Thermal Semantic Scene Understanding With Synthetic Data Augmentation for Autonomous Driving
    Li, Haotian
    Chu, Henry K.
    Sun, Yuxiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4452 - 4459
  • [10] About the Ambiguity of Data Augmentation for 3D Object Detection in Autonomous Driving
    Reuse, Matthias
    Simon, Martin
    Sick, Bernhard
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 979 - 987