Image Enhancement Guided Object Detection in Visually Degraded Scenes

Cited by: 33
Authors
Liu, Hongmin [1, 2]
Jin, Fan [1, 2]
Zeng, Hui [1, 2]
Pu, Huayan [3]
Fan, Bin [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing 100083, Peoples R China
[3] Chongqing Univ, State Key Lab Mech & Transmiss, Chongqing 400044, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image enhancement; object detection; visually degraded scenes;
DOI
10.1109/TNNLS.2023.3274926
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Object detection accuracy degrades severely in visually degraded scenes. A natural solution is to first enhance the degraded image and then perform object detection. However, this approach is suboptimal and does not necessarily improve object detection, because the image enhancement and object detection tasks are separated. To solve this problem, we propose an image enhancement guided object detection method, which refines the detection network with an additional enhancement branch in an end-to-end way. Specifically, the enhancement branch and the detection branch are organized in parallel, and a feature guided module is designed to connect the two branches; it optimizes the shallow features of the input image in the detection branch to be as consistent as possible with those of the enhanced image. Because the enhancement branch is frozen during training, this design uses the features of enhanced images to guide the learning of the detection branch, so that the learned detection branch is aware of both image quality and object detection. At test time, the enhancement branch and the feature guided module are removed, so no additional computation cost is introduced for detection. Extensive experimental results on underwater, hazy, and low-light object detection datasets demonstrate that the proposed method can significantly improve the detection performance of popular detection networks (YOLO v3, Faster R-CNN, DetectoRS) in visually degraded scenes.
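The training and test behavior described in the abstract can be illustrated with a minimal sketch. The abstract does not state the form of the feature consistency term, so the mean squared difference, the `weight` parameter, and all function names below are assumptions made for illustration, not the authors' implementation. Features are represented as plain lists of floats to keep the example self-contained.

```python
from typing import Callable, Sequence

Vector = Sequence[float]


def feature_guidance_loss(det_feat: Vector, enh_feat: Vector) -> float:
    """Mean squared difference between two shallow feature vectors.

    Assumption: the abstract only says the features should be "as
    consistent as possible"; MSE is one plausible consistency measure.
    """
    if len(det_feat) != len(enh_feat):
        raise ValueError("feature vectors must have the same length")
    return sum((d - e) ** 2 for d, e in zip(det_feat, enh_feat)) / len(det_feat)


def training_loss(det_loss: float, det_feat: Vector, enh_feat: Vector,
                  weight: float = 1.0) -> float:
    """Detection loss plus a weighted feature consistency term.

    enh_feat comes from the frozen enhancement branch, so it acts as a
    fixed target; only the detection branch would be updated.
    `weight` is a hypothetical balancing factor.
    """
    return det_loss + weight * feature_guidance_loss(det_feat, enh_feat)


def detect_at_test_time(detector: Callable[[Vector], object], image: Vector) -> object:
    """At test time only the detection branch runs on the degraded image."""
    return detector(image)


if __name__ == "__main__":
    det_feat = [1.0, 2.0]   # shallow features of the degraded input (detection branch)
    enh_feat = [1.0, 4.0]   # shallow features of the enhanced image (frozen branch)
    print(training_loss(0.5, det_feat, enh_feat, weight=0.25))
```

Because the guidance term appears only in the training loss, dropping the enhancement branch at test time leaves the detector's inference path unchanged, which matches the abstract's claim of no additional computation cost.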
Pages: 14164-14177
Number of pages: 14