Graph fusion network for multi-oriented object detection

Cited by: 6
Authors
Zhang, Shi-Xue [1]
Zhu, Xiaobin [1]
Hou, Jie-Bo [1]
Yin, Xu-Cheng [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Graph fusion network; Non-maximum suppression; Graph convolutional; Multi-oriented object detection; Text detection
DOI
10.1007/s10489-022-03396-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In object detection, non-maximum suppression (NMS) methods are extensively adopted to remove horizontal duplicates of detected dense boxes when generating final object instances. However, because the quality of dense detection boxes is degraded and context information is not explicitly explored, existing NMS methods based on simple intersection-over-union (IoU) metrics tend to underperform on multi-oriented and long-size object detection. Unlike general NMS methods, which rely on duplicate removal, we propose a novel graph fusion network, named GFNet, for multi-oriented object detection. GFNet is extensible and adaptively fuses dense detection boxes to detect more accurate and holistic multi-oriented object instances. Specifically, we first adopt a locality-aware clustering algorithm to group dense detection boxes into different clusters. We then construct an instance sub-graph for the detection boxes belonging to each cluster. Finally, we propose a graph-based fusion network built on a Graph Convolutional Network (GCN) that learns to reason over and fuse the detection boxes to generate the final instance boxes. Extensive experiments on publicly available multi-oriented text datasets (MSRA-TD500, ICDAR2015, ICDAR2017-MLT) and a multi-oriented object dataset (DOTA) verify the effectiveness and robustness of our method against general NMS methods in multi-oriented object detection.
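To make the contrast in the abstract concrete, the following is a minimal sketch of the IoU-based duplicate-removal baseline it refers to, next to a naive score-weighted fusion of one cluster of boxes. It is not GFNet: the boxes here are axis-aligned `(x1, y1, x2, y2)` tuples rather than multi-oriented ones, and the function names `iou`, `greedy_nms`, and `fuse_cluster`, the 0.5 threshold, and the weighted average are all illustrative assumptions standing in for the learned GCN fusion described above.

```python
from typing import List, Sequence, Tuple

# Assumption: axis-aligned boxes (x1, y1, x2, y2); the paper targets rotated boxes.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes: Sequence[Box], scores: Sequence[float],
               iou_threshold: float = 0.5) -> List[int]:
    """Classic greedy NMS: keep the highest-scoring box, drop overlapping ones."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # A box survives only if it does not overlap any already-kept box too much.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


def fuse_cluster(boxes: Sequence[Box], scores: Sequence[float]) -> Box:
    """Hypothetical stand-in for learned fusion: score-weighted coordinate average."""
    total = sum(scores)
    return tuple(
        sum(s * b[k] for b, s in zip(boxes, scores)) / total for k in range(4)
    )


if __name__ == "__main__":
    boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    scores = [0.9, 0.8, 0.7]
    print(greedy_nms(boxes, scores))            # indices of the surviving boxes
    print(fuse_cluster(boxes[:2], scores[:2]))  # one merged box for the overlapping pair
```

NMS discards the second box outright, whereas fusion combines the overlapping pair into a single box that draws on both; the abstract's proposal replaces the fixed weighted average used here with a fusion learned over an instance sub-graph.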
Pages: 2280-2294
Number of pages: 15