UFNet: A Multi-scale Fusion Feature based Text Detection Method

被引:0
|
作者
Chai, Zhengpeng [1 ]
Zhu, Rui [1 ]
Wang, Wei [2 ]
机构
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan, Peoples R China
[2] Wuhan 1 Hosp, Wuhan 430205, Peoples R China
关键词
feature fusion; binarization; segmentation network; text detection;
D O I
10.1145/3655532.3655558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the field of text detection has witnessed a growing trend, with more and more segmentation-based methods incorporating feature sampling. Segmentation methods possess a natural advantage in detecting text with both regular and irregular shapes due to their ability to effectively segment diverse targets and backgrounds that exhibit significant differences.The common sampling method in networks is typically the Feature Pyramid Network (FPN), which is used to match different dimensions for detecting the scale of images. However, due to the inherent limitations of scene text, such as variations in aspect ratio, dense text, and differences in width-to-height ratio, these general sampling methods (FPN) may not effectively address these issues. To ease this problem, we have proposed a novel network architecture called Unified Feature Fusion Network (UFNet), which integrates feature sampling. Compared to the DBU network, UFNet achieves significantly better performance in terms of accuracy and recall on English text detection datasets such as ICDAR2015 and the mixed English-Chinese dataset MSRA-TD500. Text detection results indicate that this algorithm solves the problem of poor performance in handling variations in aspect ratio and width-to-height ratio in images.
引用
收藏
页码:163 / 168
页数:6
相关论文
共 50 条
  • [1] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
    She, Xiangyang
    Liu, Zhe
    Dong, Lihong
    [J]. Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
  • [2] Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection
    Liu, Chao
    Zou, Yuexian
    Guan, Wenjie
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [3] A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion
    Li, Nianfeng
    Wang, Zhenyan
    Huang, Yongyuan
    Tian, Jia
    Li, Xinyuan
    Xiao, Zhiguo
    [J]. SENSORS, 2024, 24 (12)
  • [4] Drone Detection Based on Multi-scale Feature Fusion
    Zeng, Zhenni
    Wang, Zhenning
    Qin, Lang
    Li, Hui
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 194 - 198
  • [5] Vehicle detection method based on adaptive multi-scale feature fusion network
    Shen, Xuanjing
    Li, Hanyu
    Huang, Yongping
    Wang, Yu
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [6] UAV reaction detection based on multi-scale feature fusion
    He, Jianfeng
    Liu, Ming
    Yu, Chuanjiang
    [J]. 2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 640 - 643
  • [7] Insulator Defect Detection Based on Multi-Scale Feature Fusion
    Bin, Li
    Luyao, Qu
    Xinshan, Zhu
    Zhimin, Guo
    Yangyang, Tian
    [J]. Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2023, 38 (01): : 60 - 70
  • [8] Research on Bone Stick Text Recognition Method with Multi-Scale Feature Fusion
    Du, Mengxiu
    Wang, Huiqin
    Liu, Rui
    Wang, Ke
    Wang, Zhan
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [9] Image Dehazing Method Based on Multi-scale Feature Fusion
    Yao, Minghai
    Miao, Qi
    Hao, Qiaohong
    [J]. PROCEEDINGS OF THE 2017 3RD INTERNATIONAL CONFERENCE ON ECONOMICS, SOCIAL SCIENCE, ARTS, EDUCATION AND MANAGEMENT ENGINEERING (ESSAEME 2017), 2017, 119 : 2163 - 2166
  • [10] A Lightweight Road Defect Detection Method Based on Multi-scale Hybrid Feature Fusion
    Kuang, Jin
    Liu, Dong
    Lv, Hong
    Xu, Xinyue
    Zhang, Lingrong
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083