Graph fusion network for multi-oriented object detection

Cited by: 6
Authors
Zhang, Shi-Xue [1]
Zhu, Xiaobin [1]
Hou, Jie-Bo [1]
Yin, Xu-Cheng [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Graph fusion network; Non-maximum suppression; Graph convolutional; Multi-oriented object detection; Text detection
DOI
10.1007/s10489-022-03396-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In object detection, non-maximum suppression (NMS) methods are extensively adopted to remove horizontal duplicates of detected dense boxes when generating final object instances. However, because of the degraded quality of dense detection boxes and the lack of explicit exploration of context information, existing NMS methods based on simple intersection-over-union (IoU) metrics tend to underperform on multi-oriented and long-size object detection. In contrast to general NMS methods, which rely on duplicate removal, we propose a novel graph fusion network, named GFNet, for multi-oriented object detection. GFNet is extensible and adaptively fuses dense detection boxes to detect more accurate and holistic multi-oriented object instances. Specifically, we first adopt a locality-aware clustering algorithm to group dense detection boxes into different clusters. We then construct an instance sub-graph for the detection boxes belonging to each cluster. Next, we propose a graph-based fusion network built on a Graph Convolutional Network (GCN) that learns to reason over and fuse the detection boxes to generate the final instance boxes. Extensive experiments on both publicly available multi-oriented text datasets (MSRA-TD500, ICDAR2015, ICDAR2017-MLT) and a multi-oriented object dataset (DOTA) verify the effectiveness and robustness of our method against general NMS methods in multi-oriented object detection.
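For orientation, the IoU-based duplicate removal that the abstract contrasts GFNet with can be sketched as below. This is a minimal illustration of generic greedy NMS on axis-aligned boxes, not the authors' method and not code from the paper; the names `iou` and `greedy_nms`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are assumptions made for this example.

```python
from typing import List, Sequence

# Assumed box layout for this sketch: (x1, y1, x2, y2), axis-aligned.
Box = Sequence[float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0])
    inter_h = min(a[3], b[3]) - max(a[1], b[1])
    if inter_w <= 0 or inter_h <= 0:
        return 0.0  # boxes do not overlap
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


def greedy_nms(boxes: Sequence[Box], scores: Sequence[float],
               iou_threshold: float = 0.5) -> List[int]:
    """Return indices of kept boxes, highest score first.

    A box is discarded when its IoU with an already kept box
    exceeds iou_threshold (hypothetical default of 0.5).
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    demo_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    demo_scores = [0.9, 0.8, 0.7]
    print(greedy_nms(demo_boxes, demo_scores))
```

Note that this baseline only discards boxes: each surviving box is returned unchanged, whereas the abstract describes fusing the boxes of a cluster into a single instance box.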
Pages: 2280-2294
Number of pages: 15