Image Enhancement Guided Object Detection in Visually Degraded Scenes

Cited by: 33

Authors:

Liu, Hongmin [1, 2]

Jin, Fan [1, 2]

Zeng, Hui [1, 2]

Pu, Huayan [3]

Fan, Bin [1, 2]

Affiliations:

[1] Univ Sci & Technol Beijing, Sch Intelligence Sci & Technol, Beijing 100083, Peoples R China

[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing 100083, Peoples R China

[3] Chongqing Univ, State Key Lab Mech & Transmiss, Chongqing 400044, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024, Vol. 35, No. 10

Funding:

National Natural Science Foundation of China;

Keywords:

Image enhancement; object detection; visually degraded scenes;

DOI:

10.1109/TNNLS.2023.3274926

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object detection accuracy degrades seriously in visually degraded scenes. A natural solution is to first enhance the degraded image and then perform object detection. However, this approach is suboptimal and does not necessarily improve object detection, because the image enhancement and object detection tasks are separated. To solve this problem, we propose an image enhancement guided object detection method, which refines the detection network with an additional enhancement branch in an end-to-end way. Specifically, the enhancement branch and the detection branch are organized in parallel, and a feature guided module is designed to connect the two branches; it optimizes the shallow feature of the input image in the detection branch to be as consistent as possible with that of the enhanced image. As the enhancement branch is frozen during training, this design uses the features of enhanced images to guide the learning of the object detection branch, so that the learned detection branch is aware of both image quality and object detection. At test time, the enhancement branch and the feature guided module are removed, so no additional computation cost is introduced for detection. Extensive experimental results on underwater, hazy, and low-light object detection datasets demonstrate that the proposed method can significantly improve the detection performance of popular detection networks (YOLO v3, Faster R-CNN, DetectoRS) in visually degraded scenes.
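
The feature guidance described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state the form of the consistency term, so a mean squared difference is assumed here, and the names `feature_guided_loss`, `total_training_loss`, and `guide_weight` are hypothetical. Features are represented as flat lists of floats purely for illustration.

```python
from typing import Sequence


def feature_guided_loss(det_feat: Sequence[float], enh_feat: Sequence[float]) -> float:
    """Assumed consistency term: mean squared difference between the shallow
    feature of the degraded input (detection branch) and the shallow feature
    of the enhanced image (treated as a fixed target)."""
    if len(det_feat) != len(enh_feat):
        raise ValueError("feature sizes must match")
    if not det_feat:
        raise ValueError("features must be non-empty")
    return sum((d - e) ** 2 for d, e in zip(det_feat, enh_feat)) / len(det_feat)


def total_training_loss(
    detection_loss: float,
    det_feat: Sequence[float],
    enh_feat: Sequence[float],
    guide_weight: float = 1.0,  # hypothetical balancing weight, not from the paper
) -> float:
    """Training objective sketch: detection loss plus the weighted guidance term.
    Only the detection branch would be updated; the enhancement branch is frozen."""
    return detection_loss + guide_weight * feature_guided_loss(det_feat, enh_feat)


# Example: two-element features that differ only in the second entry.
guide = feature_guided_loss([1.0, 2.0], [1.0, 4.0])
total = total_training_loss(0.5, [1.0, 2.0], [1.0, 4.0], guide_weight=0.5)
```

Because the guidance term is used only during training, inference under this sketch would call the detection branch alone, which matches the abstract's statement that no extra computation is added at test time.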

Pages: 14164-14177

Page count: 14
