EK-NET:REAL-TIME SCENE TEXT DETECTION WITH EXPAND KERNEL DISTANCE

被引:1
|
作者
Zhu, Boyuan [1 ]
Liu, Fagui [1 ,2 ]
Chen, Xi [1 ]
Tang, Quan [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Scene Text Detection; Arbitrary Shapes; Real-Time; Three-stages Regression; Expand Kernel Distance;
D O I
10.1109/ICASSP48485.2024.10448354
中图分类号
学科分类号
摘要
Recently, scene text detection has received significant attention due to its wide application. However, accurate detection in complex scenes of multiple scales, orientations, and curvature remains a challenge. Numerous detection methods adopt the Vatti clipping (VC) algorithm for multiple-instance training to address the issue of arbitrary-shaped text. Yet we identify several bias results from these approaches called the "shrinked kernel". Specifically, it refers to a decrease in accuracy resulting from an output that overly favors the text kernel. In this paper, we propose a new approach named Expand Kernel Network (EK-Net) with expand kernel distance to compensate for the previous deficiency, which includes three-stages regression to complete instance detection. Moreover, EK-Net not only realize the precise positioning of arbitrary-shaped text, but also achieve a trade-off between performance and speed. Evaluation results demonstrate that EK-Net achieves state-of-the-art or competitive performance compared to other advanced methods, e.g., F-measure of 85.72% at 35.42 FPS on ICDAR 2015, F-measure of 85.75% at 40.13 FPS on CTW1500.
引用
收藏
页码:6380 / 6384
页数:5
相关论文
共 50 条
  • [1] EK-Net plus plus : Real-time scene text detection with expand kernel distance and Epoch Adaptive Weight
    Zhu, Boyuan
    Chen, Xi
    Tang, Quan
    Chen, C. L. Philip
    Liu, Fagui
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [2] Text kernel expansion for real-time scene text detection
    He, Tao
    Huang, Sheng
    Tang, Wenhao
    Liu, Bo
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
  • [3] Real-Time Scene Text Detection with Differentiable Binarization
    Liao, Minghui
    Wan, Zhaoyi
    Yao, Cong
    Chen, Kai
    Bai, Xiang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11474 - 11481
  • [4] Real-time Scene Text Detection Based on Stroke Model
    Liu, Yi
    Zhang, Dongming
    Zhang, Yongdong
    Lin, Shouxun
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3116 - 3120
  • [5] Real-Time Scene Text Localization and Recognition
    Neumann, Lukas
    Matas, Jiri
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3538 - 3545
  • [6] A Real-Time Scene Text to Speech System
    Neumann, Lukas
    Matas, Jiri
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 619 - 622
  • [7] Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion
    Liao, Minghui
    Zou, Zhisheng
    Wan, Zhaoyi
    Yao, Cong
    Bai, Xiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 919 - 931
  • [8] A Real-Time Scene Uyghur Text Detection Network Based on Feature Complementation
    Ibrayim, Mayire
    Chen, Mengmeng
    Hamdulla, Askar
    Kane, Jianjun
    Zhang, Chunhu
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 59 - 75
  • [9] An Improved System For Real-Time Scene Text Recognition
    Yang, Haojin
    Wang, Cheng
    Che, Xiaoyin
    Luo, Sheng
    Meinel, Christoph
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 657 - 660
  • [10] Real-time Lexicon-free Scene Text Retrieval
    Mafla, Andres
    Tito, Ruben
    Dey, Sounak
    Gomez, Lluis
    Rusinol, Marcal
    Valveny, Ernest
    Karatzas, Dimosthenis
    PATTERN RECOGNITION, 2021, 110