Graph fusion network for multi-oriented object detection

被引:6
|
作者
Zhang, Shi-Xue [1 ]
Zhu, Xiaobin [1 ]
Hou, Jie-Bo [1 ]
Yin, Xu-Cheng [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Inst Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph fusion network; Non-maximum suppression; Graph convolutional; Multi-oriented object detection; TEXT DETECTION;
D O I
10.1007/s10489-022-03396-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In object detection, non-maximum suppression (NMS) methods are extensively adopted to remove horizontal duplicates of detected dense boxes for generating final object instances. However, due to the degraded quality of dense detection boxes and not explicit exploration of the context information, existing NMS methods via simple intersection-over-union (IoU) metrics tend to underperform on multi-oriented and long-size objects detection. Distinguishing with general NMS methods via duplicate removal, we propose a novel graph fusion network, named GFNet, for multi-oriented object detection. Our GFNet is extensible and adaptively fuse dense detection boxes to detect more accurate and holistic multi-oriented object instances. Specifically, we first adopt a locality-aware clustering algorithm to group dense detection boxes into different clusters. We will construct an instance sub-graph for the detection boxes belonging to one cluster. Then, we propose a graph-based fusion network via Graph Convolutional Network (GCN) to learn to reason and fuse the detection boxes for generating final instance boxes. Extensive experiments both on public available multi-oriented text datasets (including MSRA-TD500, ICDAR2015, ICDAR2017-MLT) and multi-oriented object datasets (DOTA) verify the effectiveness and robustness of our method against general NMS methods in multi-oriented object detection.
引用
收藏
页码:2280 / 2294
页数:15
相关论文
共 50 条
  • [21] MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK
    Guo, Xiaobao
    Li, Jinxing
    Chen, Bingzhi
    Lu, Guangming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 206 - 211
  • [22] Multi-Oriented Enhancement Branch and Context-Aware Module for Few-Shot Oriented Object Detection in Remote Sensing Images
    Su, Haozheng
    You, Yanan
    Liu, Sixu
    REMOTE SENSING, 2023, 15 (14)
  • [23] Fused Text Segmentation Networks for Multi-oriented Scene Text Detection
    Dai, Yuchen
    Huang, Zheng
    Gao, Yuting
    Xu, Youxuan
    Chen, Kai
    Guo, Jie
    Qiu, Weidong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3604 - 3609
  • [24] Multi-oriented text detection and verification in video frames and scene images
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Roy, Partha Pratim
    Pal, Umapada
    NEUROCOMPUTING, 2018, 275 : 1531 - 1549
  • [25] Script independent approach for multi-oriented text detection in scene image
    Dey, Sounak
    Shivakumara, Palaiahnakote
    Raghunandan, K. S.
    Pal, Umapada
    Lu, Tong
    Kumar, G. Hemantha
    Chan, Chee Seng
    NEUROCOMPUTING, 2017, 242 : 96 - 112
  • [26] A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video
    Khare, Vijeta
    Shivakumara, Palaiahnakote
    Raveendran, Paramesran
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 7627 - 7640
  • [27] A Deep Learning Approach for Robust, Multi-oriented, and Curved Text Detection
    Ranjbarzadeh, Ramin
    Jafarzadeh Ghoushchi, Saeid
    Anari, Shokofeh
    Safavi, Sadaf
    Tataei Sarshar, Nazanin
    Babaee Tirkolaee, Erfan
    Bendechache, Malika
    COGNITIVE COMPUTATION, 2024, 16 (04) : 1979 - 1991
  • [28] Multi-Oriented Object Detection in High-Resolution Remote Sensing Imagery Based on Convolutional Neural Networks with Adaptive Object Orientation Features
    Dong, Zhipeng
    Wang, Mi
    Wang, Yanli
    Liu, Yanxiong
    Feng, Yikai
    Xu, Wenxue
    REMOTE SENSING, 2022, 14 (04)
  • [29] Semantic Compensation Based Dual-Stream Feature Interaction Network for Multi-oriented Scene Text Detection
    Wang, Siyan
    Li, Sumei
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [30] Recognition of English multi-oriented characters
    Pal, U.
    Kimura, F.
    Roy, K.
    Pal, T.
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 873 - +