PARTS BASED ATTENTION FOR HIGHLY OCCLUDED PEDESTRIAN DETECTION WITH TRANSFORMERS

被引:0
|
作者
Shastry, K. N. Ajay [1 ]
Chaudhari, Jayesh [1 ]
Thapar, Daksh [2 ]
Nigam, Aditya [2 ]
Arora, Chetan [1 ]
机构
[1] Indian Inst Technol, Delhi, India
[2] Indian Inst Technol, Mandi, Himachal Prades, India
关键词
D O I
10.1109/ICIP49359.2023.10222651
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the significant progress made in pedestrian detection in last decade, detecting pedestrians under heavy occlusion still remains a challenging problem. In state of the art (SOTA), convolutional neural network (CNN) based models, the reason is attributed to non-maximal-suppression (NMS), which often erroneously deletes true positives when one pedestrian is occluding other. SOTA transformer based models do not have such NMS step, yet fail to detect highly occluded pedestrians. In this paper, we study the reasons for such failures. We observe that such models first predict key-points, and then compute the attention at the specific key-points. Our analysis reveals that the key-points do not have any preference towards semantically important body parts. Under heavy occlusion, such key-points end up attending to non-discriminative regions or background, leading to false negatives. We take inspiration from the conventional wisdom of detecting objects using their parts, and bias the attention of proposed transformer architecture towards semantically important, and highly discriminative human body parts. The intervention leads to SOTA results on benchmark Citypersons and Caltech datasets, achieving 30.75%, and 32.96% miss-rate (lower is better) respectively, against 32.6%, and 38.2% by the current SOTA. Code is available at https://ajayshastry08.github.io/pa_dino
引用
收藏
页码:3085 / 3089
页数:5
相关论文
共 50 条
  • [1] Occluded Pedestrian Detection Algorithm Based on Attention Mechanism
    Zou Ziyin
    Gai Shaoyan
    Da Feipeng
    Li Yu
    ACTA OPTICA SINICA, 2021, 41 (15)
  • [2] Occluded Pedestrian Detection Through Guided Attention in CNNs
    Zhang, Shanshan
    Yang, Jian
    Schiele, Bernt
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6995 - 7003
  • [3] Attention guided neural network models for occluded pedestrian detection
    Zou, Tengtao
    Yang, Shangming
    Zhang, Yun
    Ye, Mao
    PATTERN RECOGNITION LETTERS, 2020, 131 : 91 - 97
  • [4] Mask-Guided Attention Network for Occluded Pedestrian Detection
    Pang, Yanwei
    Xie, Jin
    Khan, Muhammad Haris
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4966 - 4974
  • [5] Single-Stage Detector with Semantic Attention for Occluded Pedestrian Detection
    Wen, Fang
    Lin, Zehang
    Yang, Zhenguo
    Liu, Wenyin
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 414 - 425
  • [6] Guided Attention in CNNs for Occluded Pedestrian Detection and Re-identification
    Zhang, Shanshan
    Chen, Di
    Yang, Jian
    Schiele, Bernt
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1875 - 1892
  • [7] Guided Attention in CNNs for Occluded Pedestrian Detection and Re-identification
    Shanshan Zhang
    Di Chen
    Jian Yang
    Bernt Schiele
    International Journal of Computer Vision, 2021, 129 : 1875 - 1892
  • [8] Correlation-and-Correction Fusion Attention Network for Occluded Pedestrian Detection
    Zou, Fengmin
    Li, Xu
    Xu, Qimin
    Sun, Zhengliang
    Zhu, Jianxiao
    IEEE SENSORS JOURNAL, 2023, 23 (06) : 6061 - 6073
  • [9] Occluded Pedestrian Detection Based on Joint Attention Mechanism of Channel-wise and Spatial Information
    Chen Yong
    Liu Xi
    Liu Huanlin
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (06) : 1486 - 1493
  • [10] Swin Transformer for Pedestrian and Occluded Pedestrian Detection
    Liang, Jung-An
    Ding, Jian-Jiun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,