Efficient algorithm for directed text detection based on rotation decoupled bounding box

被引:0
|
作者
Wei, Songma [1 ]
Lu, Minrui [2 ]
Chen, Bingsan [1 ]
Zhang, Tengjian [2 ]
Zhang, Fujiang [1 ]
Peng, Xiaodong [1 ]
机构
[1] Fujian Univ Technol, Fujian Key Lab Intelligent Machining Technol & Equ, Fuzhou, Peoples R China
[2] Fujian Wuyi Leaf Tobacco Co Ltd, Shaowu, Peoples R China
关键词
Directional text; Target detection; Rotation Decoupled bounding box;
D O I
10.7717/peerj-cs.1352
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architecture, inspired by the idea of DenseNet dense connection, a parallel cross-scale feature fusion method is proposed to overcome the problem of blurring the underlying feature semantic information and deep location information caused by the sequential stacking approach and to improve the multiscale feature information extraction capability. Furthermore, a rotational decoupling border detection module, which decouples the rotational bounding box into horizontal bounding box during positive sample matching, is provided, overcoming the angular instability in the process of matching the rotational bounding box with the horizontal anchor to obtain higher-quality regression samples and improve the precision of directed text detection. The MSRA-TD500 and ICDAR2015 datasets are used to evaluate the method, and results show that the algorithm measured precision and F1-score of 89.2% and 88.1% on the MSRA-TD500 dataset, respectively, and accuracy and F1-score of 90.6% and 89.3% on the ICDAR2015 dataset, respectively. The proposed algorithm has better competitive ability than the SOTA text detection algorithm.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] SLAM algorithm based on bounding box and deep continuity in dynamic scene
    Fang B.
    Han X.
    Wang Z.
    Yuan X.
    International Journal of Wireless and Mobile Computing, 2021, 21 (04) : 349 - 364
  • [22] Solution of minimum bounding box of scattered points based on genetic algorithm
    Sun, Dianzhu
    Shi, Yang
    Liu, Huadong
    Li, Yanrui
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2013, 39 (08): : 995 - 998
  • [23] An Improved Bounding Box Localization Algorithm Based on Optimum Node Selection
    Qian, Kaiguo
    Wang, Yujian
    Li, Xiaoming
    Dai, Zucheng
    MECHANICAL COMPONENTS AND CONTROL ENGINEERING III, 2014, 668-669 : 1359 - +
  • [24] Efficient Collision Detection Based on Hybrid Bounding Volumes
    Zheng, Yan-Bin
    Guo, Ling-Yun
    Liu, Jing-Jing
    ADVANCED MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 472-475 : 2608 - 2611
  • [25] Efficient Text Bounding Box Identification Using Mask R-CNN: Case of Thai Documents
    Kiatphaisansophon, Phanthakan
    Wanvarie, Dittaya
    Cooharojananone, Nagul
    IEEE ACCESS, 2024, 12 (49306-49328) : 49306 - 49328
  • [26] Segmentation-based bounding box generation for omnidirectional pedestrian detection
    Masato Tamura
    Tomoaki Yoshinaga
    The Visual Computer, 2024, 40 : 2505 - 2516
  • [27] Remote Multi-object detection based on bounding box field
    Liu, Jin
    Li, RongHao
    Gao, YongJian
    MIPPR 2019: MULTISPECTRAL IMAGE ACQUISITION, PROCESSING, AND ANALYSIS, 2020, 11428
  • [28] BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery
    Saleh, Kaziwa
    Vamossy, Zoltan
    IMPROVE: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND VISION ENGINEERING, 2022, : 78 - 84
  • [29] Machine Learning Based Bounding Box Regression for Improved Pedestrian Detection
    Toprak, Tugce
    Gunel, Serkan
    Belenlioglu, Burak
    Aydin, Burak
    Zoral, E. Yesim
    Selver, M. Alper
    2019 INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2019,
  • [30] Segmentation-based bounding box generation for omnidirectional pedestrian detection
    Tamura, Masato
    Yoshinaga, Tomoaki
    VISUAL COMPUTER, 2024, 40 (04): : 2505 - 2516