Max-Pooling based Scene Text Proposal for Scene Text Detection

被引:3
|
作者
Dinh Nguyen Van [1 ]
Lu, Shijian [2 ]
Bai, Xiang [3 ]
Ouarti, Nizar [4 ]
Mokhtari, Mounir [5 ]
机构
[1] Sorbonne Univ Pierre & Marie CURIE, CNRS IPAL UMI 2955, I2R, Paris, France
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] Sorbonne Univ Pierre & Marie CURIE, CNRS IPAL UMI 2955, Paris, France
[5] Inst Mines Telecom, CNRS IPAL UMI 2955, Paris, France
关键词
D O I
10.1109/ICDAR.2017.213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic reading texts in scenes is an attracting increasing interest in recent years due to various context awareness applications. Leverage on the advantages of object proposal in generic object detection, we propose a max-pooling based scene text proposal technique aiming for automatic extraction of texts in scenes. Given a scene image, a max-pooling based grouping technique is designed to search for scene text proposals within a feature map which is computed from image edges. Searched proposals are then ranked by a scoring function that is defined based on the histogram of oriented gradient. The proposed technique has been evaluated on two publicly available scene text datasets, including the ICDAR2015 dataset and the Street View Text (SVT) dataset. Experiments show that the proposed technique obtains superior proposal performance as compared with state-of-the-arts, especially when a small number of proposals is selected. In addition, it also obtains state-of-the-art scene text spotting when integrated with a scene text recognition model.
引用
收藏
页码:1295 / 1300
页数:6
相关论文
共 50 条
  • [1] A pooling based scene text proposal technique for scene text reading in the wild
    Dinh NguyenVan
    Lu, Shijian
    Tian, Shangxuan
    Ouarti, Nizar
    Mokhtari, Mounir
    [J]. PATTERN RECOGNITION, 2019, 87 : 118 - 129
  • [2] Mining the displacement of max-pooling for text recognition
    Zheng, Yuchen
    Lwana, Brian Kenji
    Uchida, Seiichi
    [J]. PATTERN RECOGNITION, 2019, 93 : 558 - 569
  • [3] Natural scene text detection based on multiscale connectionist text proposal network
    Huang, Min
    Lan, Chaohao
    Huang, Wei
    Tao, Yang
    [J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 326 - 329
  • [4] Scene Text Detection with Inception Text Proposal Generation Module
    Zhang, Hang
    Liu, Jiahang
    Chen, Tieqiao
    [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 456 - 460
  • [5] FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network
    Liu, Fagui
    Chen, Cheng
    Gu, Dian
    Zheng, Jingzhong
    [J]. IEEE ACCESS, 2019, 7 : 44219 - 44228
  • [6] Scene Text Detection Based on Text Probability and Pruning Algorithm
    Zhou, Gang
    Liu, Yajun
    Shi, Fei
    Hu, Ying
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 726 - 735
  • [7] Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion
    Wei, Zheliang
    Li, Yueyang
    Luo, Haichi
    [J]. Computer Engineering and Applications, 2024, 60 (02) : 154 - 161
  • [8] Holistic Vertical Regional Proposal Network for Scene Text Detection
    Ehen, Xu
    Guo, Qiang
    Li, Shuohao
    Zhang, Jun
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 72 - 77
  • [9] Scene Text Detection based on Structural Features
    Nguyen, Khanh
    Ngo Duc Thanh
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS, AND ITS APPLICATIONS (IC3INA) - RECENT PROGRESS IN COMPUTER, CONTROL, AND INFORMATICS FOR DATA SCIENCE, 2016, : 48 - 53
  • [10] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)