MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK

被引:4
|
作者
Guo, Xiaobao [1 ]
Li, Jinxing [2 ]
Chen, Bingzhi [1 ]
Lu, Guangming [1 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong Shenzhen, Sch Sci & Engn, Shenzhen, Guangdong, Peoples R China
关键词
text detection; mask; contextual module; regression; text instance;
D O I
10.1109/ICME.2019.00044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, a novel multi-task cascade framework, which jointly takes the detection and the segmentation into account, is presented for the scene text detection. To address the issue of multi-oriented scene text detection, we propose an instance-level mask approximation method through the auxiliary regression task on center and comer points. Specifically, the text instance in the image is first coarsely detected, followed by a contextual module which can capture more accurate instances. To cope with the scale variation existing in these detected instances, a combination of high-level semantic and low-level features is further exploited, achieving more robust and better performance. A series of experiments conducted on different benchmark datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页码:206 / 211
页数:6
相关论文
共 50 条
  • [41] Graph fusion network for multi-oriented object detection
    Shi-Xue Zhang
    Xiaobin Zhu
    Jie-Bo Hou
    Xu-Cheng Yin
    Applied Intelligence, 2023, 53 : 2280 - 2294
  • [42] A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images
    Yegnaraman, Aparna
    Valli, S.
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3696 - 3717
  • [43] A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images
    Aparna Yegnaraman
    S. Valli
    Applied Intelligence, 2021, 51 : 3696 - 3717
  • [44] CM-Net: Concentric Mask Based Arbitrary-Shaped Text Detection
    Yang, Chuang
    Chen, Mulin
    Xiong, Zhitong
    Yuan, Yuan
    Wang, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2864 - 2877
  • [45] Fractals based multi-oriented text detection system for recognition in mobile video images
    Shivakumara, Palaiahnakote
    Wu, Liang
    Lu, Tong
    Tan, Chew Lim
    Blumenstein, Michael
    Anami, Basavaraj S.
    PATTERN RECOGNITION, 2017, 68 : 158 - 174
  • [46] A Deep Learning Approach for Robust, Multi-oriented, and Curved Text Detection
    Ranjbarzadeh, Ramin
    Jafarzadeh Ghoushchi, Saeid
    Anari, Shokofeh
    Safavi, Sadaf
    Tataei Sarshar, Nazanin
    Babaee Tirkolaee, Erfan
    Bendechache, Malika
    COGNITIVE COMPUTATION, 2024, 16 (04) : 1979 - 1991
  • [47] Mask-CDNet: A mask based pixel change detection network
    Bu, Shuhui
    Li, Qing
    Han, Pengcheng
    Leng, Pengyu
    Li, Ke
    NEUROCOMPUTING, 2020, 378 : 166 - 178
  • [48] Single shot multi-oriented text detection based on local and non-local features
    Li, XiaoQian
    Liu, Jie
    Zhang, ShuWu
    Zhang, GuiXuan
    Zheng, Yang
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2020, 23 (04) : 241 - 252
  • [49] Single shot multi-oriented text detection based on local and non-local features
    XiaoQian Li
    Jie Liu
    ShuWu Zhang
    GuiXuan Zhang
    Yang Zheng
    International Journal on Document Analysis and Recognition (IJDAR), 2020, 23 : 241 - 252
  • [50] A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video
    Khare, Vijeta
    Shivakumara, Palaiahnakote
    Raveendran, Paramesran
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 7627 - 7640