Swin-fisheye: Object detection for fisheye images

被引:0
|
作者
Zhang, Dawei [1 ]
Yang, Tingting [1 ]
Zhao, Bokai [1 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; object detection; object recognition;
D O I
10.1049/ipr2.13216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fisheye cameras have been widely used in autonomous navigation, visual surveillance, and automatic driving. Due to severe geometric distortion, fisheye images cannot be processed effectively by conventional methods. The existing object detection algorithms cannot better detect the small targets or the objects with large distortion in the fisheye images. The size and scene of available fisheye datasets (such as WoodScape and VOC-360) cannot satisfy the training of robust network models. Herein, the authors propose Swin-Fisheye, an end-to-end object detection algorithm based on Swin Transformer. A feature pyramid module based on deformable convolution (DFPM) is designed to obtain richer contextual information from the multi-scale feature maps. In addition, a projection transformation algorithm (PTA) is proposed, which can convert rectilinear images into fisheye images more accurately, and then create a fisheye image dataset (COCO-Fish). The results of extensive experiments conducted on VOC-360, WoodScape, and COCO-Fish demonstrate that the proposed algorithm can achieve satisfactory results compared with state-of-the-art methods. The authors propose Swin-Fisheye, an end-to-end object detection algorithm based on Swin Transformer. A feature pyramid module based on deformable convolution (DFPM) is designed to obtain richer contextual information from the multi-scale feature maps. Projective transformation algorithm (PTA) is also proposed, which can convert rectilinear images into fisheye images more accurately, and create a fisheye image dataset (COCO-Fish). image
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Datasets for face and object detection in fisheye images
    Fu, Jianglin
    Bajic, Ivan V.
    Vaughan, Rodney G.
    [J]. DATA IN BRIEF, 2019, 27
  • [2] Improved YOLOv7 models based on modulated deformable convolution and swin transformer for object detection in fisheye images
    Zhou, Jie
    Yang, Degang
    Song, Tingting
    Ye, Yichen
    Zhang, Xin
    Song, Yingze
    [J]. IMAGE AND VISION COMPUTING, 2024, 144
  • [3] SilhoNet-Fisheye: Adaptation of A ROI Based Object Pose Estimation Network to Monocular Fisheye Images
    Billings, Gideon
    Johnson-Roberson, Matthew
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) : 4241 - 4248
  • [4] Expandable Spherical Projection and Feature Fusion Methods for Object Detection from Fisheye Images
    Kim, Songeun
    Park, Soon-Yong
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [5] 3D Object Detection from a Single Fisheye Image Without a Single Fisheye Training Image
    Plaut, Elad
    Ben Yaacov, Erez
    El Shlomo, Bat
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3654 - 3662
  • [6] FisheyeSuperPoint: Keypoint Detection and Description Network for Fisheye Images
    Konrad, Anna
    Eising, Ciaran
    Sistu, Ganesh
    McDonald, John
    Villing, Rudi
    Yogamani, Senthil
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 340 - 347
  • [7] Instance segmentation in fisheye images
    Dufour, Remi
    Meurie, Cyril
    Strauss, Clement
    Lezoray, Olivier
    [J]. 2020 TENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2020,
  • [8] Semantic Segmentation of Fisheye Images
    Blott, Gregor
    Takami, Masato
    Heipke, Christian
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 181 - 196
  • [9] FACE RECOGNITION FOR FISHEYE IMAGES
    Lo, Yi-Cheng
    Huang, Chiao-Chun
    Tsai, Yueh-Feng
    Lo, I-Chan
    Andy Wu, An-Yeu
    Chen, Homer H.
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 146 - 150
  • [10] Downside Hemisphere Object Detection and Localization of MAV by Fisheye Camera
    Zhu, Jun
    Zhu, Jiangcheng
    Wan, Xudong
    Xu, Chao
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 532 - 537