Fused Text Segmentation Networks for Multi-oriented Scene Text Detection

被引:0
|
作者
Dai, Yuchen [1 ]
Huang, Zheng [1 ,2 ]
Gao, Yuting [1 ]
Xu, Youxuan [3 ]
Chen, Kai [1 ]
Guo, Jie [1 ]
Qiu, Weidong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect, Shanghai, Peoples R China
[2] Westone Cryptol Res Ctr, Beijing, Peoples R China
[3] Xiamen 1 High Sch, Xiamen, Fujian, Peoples R China
关键词
READING TEXT; COMPETITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a novel end-end framework for multi-oriented scene text detection from an instance-aware semantic segmentation perspective. We present Fused Text Segmentation Networks, which combine multi-level features during the feature extracting as text instance may rely on finer feature expression compared to general objects. It detects and segments the text instance jointly and simultaneously, leveraging merits from both semantic segmentation task and region proposal based object detection task. Not involving any extra pipelines, our approach surpasses the current state of the art on multi-oriented scene text detection benchmarks: ICDAR2015 Incidental Scene Text and MSRA-TD500 reaching Hmean 84.1% and 82.0% respectively. Morever, we report a baseline on total-text containing curved text which suggests effectiveness of the proposed approach.
引用
收藏
页码:3604 / 3609
页数:6
相关论文
共 50 条
  • [41] A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video
    Khare, Vijeta
    Shivakumara, Palaiahnakote
    Raveendran, Paramesran
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 7627 - 7640
  • [42] Recognition of Multi-Oriented, Multi-Sized, and Curved Text
    Chiang, Yao-Yi
    Knoblock, Craig A.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1399 - 1403
  • [43] Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection
    Liu, Yuliang
    Jin, Lianwen
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3454 - 3461
  • [44] A new method for multi-oriented graphics-scene-3D text classification in video
    Xu, Jiamin
    Shivakumara, Palaiahnakote
    Lu, Tong
    Tan, Chew Lim
    Uchida, Seiichi
    PATTERN RECOGNITION, 2016, 49 : 19 - 42
  • [45] FC2RN: A FULLY CONVOLUTIONAL CORNER REFINEMENT NETWORK FOR ACCURATE MULTI-ORIENTED SCENE TEXT DETECTION
    Qin, Xugong
    Zhou, Yu
    Guo, Youhui
    Wu, Dayan
    Wang, Weiping
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4350 - 4354
  • [46] Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images
    Yao, Cong
    Zhang, Xin
    Bai, Xiang
    Liu, Wenyu
    Ma, Yi
    Tu, Zhuowen
    PLOS ONE, 2013, 8 (08):
  • [47] Enhancing Scene Text Detection via Fused Semantic Segmentation Network with Attention
    Liu, Chao
    Zou, Yuexian
    Yang, Dongming
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 531 - 542
  • [48] TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation
    Song, Xiaoge
    Wu, Yirui
    Wang, Wenhai
    Lu, Tong
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 201 - 213
  • [49] Extraction and Recognition of Multi-oriented Text from Trademark Images
    Tripathi, Priyanka
    Indoria, Ajay Kumar
    2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015,
  • [50] Bayesian classifier for multi-oriented video text recognition system
    Roy, Sangheeta
    Shivakumara, Palaiahnakote
    Roy, Partha Pratim
    Pal, Umapada
    Tan, Chew Lim
    Lu, Tong
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5554 - 5566