MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK

Cited by: 4
Authors
Guo, Xiaobao [1]
Li, Jinxing [2]
Chen, Bingzhi [1]
Lu, Guangming [1]
Affiliations
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong Shenzhen, Sch Sci & Engn, Shenzhen, Guangdong, Peoples R China
Keywords
text detection; mask; contextual module; regression; text instance
DOI
10.1109/ICME.2019.00044
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper presents a novel multi-task cascade framework for scene text detection that jointly takes detection and segmentation into account. To address multi-oriented scene text detection, we propose an instance-level mask approximation method based on an auxiliary regression task on center and corner points. Specifically, the text instances in the image are first coarsely detected, followed by a contextual module that captures more accurate instances. To cope with the scale variation among the detected instances, a combination of high-level semantic features and low-level features is further exploited, yielding more robust and better performance. Experiments on several benchmark datasets demonstrate the effectiveness of the proposed method.
Pages: 206-211
Page count: 6
Related papers
50 in total
  • [31] A Laplacian Approach to Multi-Oriented Text Detection in Video
    Shivakumara, Palaiahnakote
    Phan, Trung Quy
    Tan, Chew Lim
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (02) : 412 - 419
  • [32] Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector
    Liu, Yuliang
    Jin, Lianwen
    Fang, Chuanming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2918 - 2930
  • [33] Multi-Oriented Text Detection with Fully Convolutional Networks
    Zhang, Zheng
    Zhang, Chengquan
    Shen, Wei
    Yao, Cong
    Liu, Wenyu
    Bai, Xiang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4159 - 4167
  • [34] IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection
    Yang, Qiangpeng
    Cheng, Mengli
    Zhou, Wenmeng
    Chen, Yan
    Qiu, Minghui
    Lin, Wei
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1071 - 1077
  • [35] A Multi-Oriented Scene Text Detector with Position-Sensitive Segmentation
    Cheng, Peirui
    Wang, Weiqiang
    ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 152 - 159
  • [36] A Method for Multi-Oriented Thai Text Localization in Natural Scene Images using Convolutional Neural Network
    Kobchaisawat, Thananop
    Chalidabhongse, Thanarat H.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2015, : 220 - 225
  • [37] Multi-Oriented Real-time Arabic Scene Text Detection with Deep Fully Convolutional Networks
    Sassi, M. Saifeddine Hadj
    Beltaief, Ines
    Zekri, Manel
    Ben Yahia, Sadok
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [38] OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection
    Zhang, Sheng
    Liu, Yuliang
    Jin, Lianwen
    Wei, Zhongrong
    Shen, Chunhua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 454 - 467
  • [39] Multi-oriented text detection from natural scene images based on a CNN and pruning non-adjacent graph edges
    Wei, Yuanwang
    Shen, Wei
    Zeng, Dan
    Ye, Lihua
    Zhang, Zhijiang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 64 : 89 - 98
  • [40] Graph fusion network for multi-oriented object detection
    Zhang, Shi-Xue
    Zhu, Xiaobin
    Hou, Jie-Bo
    Yin, Xu-Cheng
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2280 - 2294