Graph fusion network for multi-oriented object detection

被引:6
|
作者
Zhang, Shi-Xue [1 ]
Zhu, Xiaobin [1 ]
Hou, Jie-Bo [1 ]
Yin, Xu-Cheng [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph fusion network; Non-maximum suppression; Graph convolutional; Multi-oriented object detection; TEXT DETECTION;
D O I
10.1007/s10489-022-03396-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In object detection, non-maximum suppression (NMS) methods are extensively adopted to remove horizontal duplicates of detected dense boxes for generating final object instances. However, due to the degraded quality of dense detection boxes and not explicit exploration of the context information, existing NMS methods via simple intersection-over-union (IoU) metrics tend to underperform on multi-oriented and long-size objects detection. Distinguishing with general NMS methods via duplicate removal, we propose a novel graph fusion network, named GFNet, for multi-oriented object detection. Our GFNet is extensible and adaptively fuse dense detection boxes to detect more accurate and holistic multi-oriented object instances. Specifically, we first adopt a locality-aware clustering algorithm to group dense detection boxes into different clusters. We will construct an instance sub-graph for the detection boxes belonging to one cluster. Then, we propose a graph-based fusion network via Graph Convolutional Network (GCN) to learn to reason and fuse the detection boxes for generating final instance boxes. Extensive experiments both on public available multi-oriented text datasets (including MSRA-TD500, ICDAR2015, ICDAR2017-MLT) and multi-oriented object datasets (DOTA) verify the effectiveness and robustness of our method against general NMS methods in multi-oriented object detection.
引用
收藏
页码:2280 / 2294
页数:15
相关论文
共 50 条
  • [1] Graph fusion network for multi-oriented object detection
    Shi-Xue Zhang
    Xiaobin Zhu
    Jie-Bo Hou
    Xu-Cheng Yin
    Applied Intelligence, 2023, 53 : 2280 - 2294
  • [2] Multi-Oriented Rotation-Equivariant Network for Object Detection on Remote Sensing Images
    Zhu, Kun
    Zhang, Xiaodong
    Chen, Guanzhou
    Li, Xianwei
    Cai, Peihua
    Liao, Puyun
    Wang, Tong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [3] Convolutional Regression Network for Multi-Oriented Text Detection
    Gao, Junyu
    Wang, Qi
    Yuan, Yuan
    IEEE ACCESS, 2019, 7 : 96424 - 96433
  • [4] Multi-Oriented Object Detection in Aerial Images With Double Horizontal Rectangles
    Nie, Guangtao
    Huang, Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4932 - 4944
  • [5] Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection
    Xu, Yongchao
    Fu, Mingtao
    Wang, Qimeng
    Wang, Yukang
    Chen, Kai
    Xia, Gui-Song
    Bai, Xiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (04) : 1452 - 1459
  • [6] Multi-Oriented Moving Text Detection
    Khare, Vijeta
    Shivakumara, Palaiahnakote
    Raveendran, Paramesaran
    2014 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2014, : 347 - 352
  • [7] Constrained-SIoU: A Metric for Horizontal Candidates in Multi-Oriented Object Detection
    Zhang, Yanan
    Li, Haichang
    Wang, Rui
    Zhang, Mengya
    Hu, Xiaohui
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 956 - 967
  • [8] Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection
    Liu, Yuliang
    Jin, Lianwen
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3454 - 3461
  • [9] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [10] A Laplacian Approach to Multi-Oriented Text Detection in Video
    Shivakumara, Palaiahnakote
    Phan, Trung Quy
    Tan, Chew Lim
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (02) : 412 - 419