Fused Text Segmentation Networks for Multi-oriented Scene Text Detection

被引:0
|
作者
Dai, Yuchen [1 ]
Huang, Zheng [1 ,2 ]
Gao, Yuting [1 ]
Xu, Youxuan [3 ]
Chen, Kai [1 ]
Guo, Jie [1 ]
Qiu, Weidong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect, Shanghai, Peoples R China
[2] Westone Cryptol Res Ctr, Beijing, Peoples R China
[3] Xiamen 1 High Sch, Xiamen, Fujian, Peoples R China
关键词
READING TEXT; COMPETITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a novel end-end framework for multi-oriented scene text detection from an instance-aware semantic segmentation perspective. We present Fused Text Segmentation Networks, which combine multi-level features during the feature extracting as text instance may rely on finer feature expression compared to general objects. It detects and segments the text instance jointly and simultaneously, leveraging merits from both semantic segmentation task and region proposal based object detection task. Not involving any extra pipelines, our approach surpasses the current state of the art on multi-oriented scene text detection benchmarks: ICDAR2015 Incidental Scene Text and MSRA-TD500 reaching Hmean 84.1% and 82.0% respectively. Morever, we report a baseline on total-text containing curved text which suggests effectiveness of the proposed approach.
引用
收藏
页码:3604 / 3609
页数:6
相关论文
共 50 条
  • [31] MASK-MOST NET: MASK APPROXIMATION BASED MULTI-ORIENTED SCENE TEXT DETECTION NETWORK
    Guo, Xiaobao
    Li, Jinxing
    Chen, Bingzhi
    Lu, Guangming
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 206 - 211
  • [32] Multi-oriented English text line identification
    Pal, U
    Sinha, S
    Chaudhuri, BB
    IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 : 1146 - 1153
  • [33] Recognition of Indian multi-oriented and curved text
    Pal, U
    Tripathy, N
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 141 - 145
  • [34] Multi-oriented Bangla and Devnagari text recognition
    Pal, Umapada
    Roy, Partha Pratim
    Tripathy, Nilamadhaba
    Llados, Josep
    PATTERN RECOGNITION, 2010, 43 (12) : 4124 - 4136
  • [35] Multi-Oriented Text Extraction in Stylistic Documents
    Singh, Brij Mohan
    Sharma, Rahul
    Ghosh, Debashis
    Mittal, Ankush
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2015, 15 (01)
  • [36] A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images
    Yegnaraman, Aparna
    Valli, S.
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3696 - 3717
  • [37] A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images
    Aparna Yegnaraman
    S. Valli
    Applied Intelligence, 2021, 51 : 3696 - 3717
  • [38] A Deep Learning Approach for Robust, Multi-oriented, and Curved Text Detection
    Ranjbarzadeh, Ramin
    Jafarzadeh Ghoushchi, Saeid
    Anari, Shokofeh
    Safavi, Sadaf
    Tataei Sarshar, Nazanin
    Babaee Tirkolaee, Erfan
    Bendechache, Malika
    COGNITIVE COMPUTATION, 2024, 16 (04) : 1979 - 1991
  • [39] Semantic Compensation Based Dual-Stream Feature Interaction Network for Multi-oriented Scene Text Detection
    Wang, Siyan
    Li, Sumei
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [40] Multi-oriented touching text character segmentation in graphical documents using dynamic programming
    Pratim Roy, Partha
    Pal, Umapada
    Llados, Josep
    Delalandre, Mathieu
    PATTERN RECOGNITION, 2012, 45 (05) : 1972 - 1983